Human Annotated Reasoning Problems (HARP) is a math reasoning dataset of 4,780 short-answer questions from US national math competitions, spanning 1950 to September 2024. The paper can be found at HARP.pdf.
We release a few splits of our dataset:

- HARP.jsonl.zip contains the 4,780 short-answer questions and is the default split for running the evaluation.
- HARP_raw.jsonl.zip contains all 5,409 problems we scraped and processed.
We provide all jsonl files as zip archives to avoid exposing the data in plain text (and thus hopefully mitigate contamination).
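For reference, here is a minimal loading sketch in Python. It assumes the archive contains a single file named HARP.jsonl; adjust the inner filename if your copy differs.

```python
import json
import zipfile

# Read the zipped jsonl directly, without extracting to disk.
# Assumption: the archive holds one file named "HARP.jsonl".
with zipfile.ZipFile("HARP.jsonl.zip") as zf:
    with zf.open("HARP.jsonl") as f:
        problems = [json.loads(line) for line in f]

print(len(problems))  # expected: 4780 for the default split
```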
In addition to the dataset and paper, this repo also includes all of our code for constructing the dataset, running evaluations, checking answers, and the various other analyses from the paper.
## Fields in the main dataset
- `problem`: the problem text
- `answer`: the ground-truth answer
- `solution_{i}`: human-written solution(s) (at least one per problem)
- `year`/`contest`/`number`: three fields that jointly and uniquely identify the source problem
- `level`: difficulty level
- `subject`: subject label for the problem, one of `['prealgebra', 'algebra', 'number_theory', 'geometry', 'counting_and_probability', 'precalculus']`
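As a quick illustration of these fields, the sketch below filters the loaded records by subject and prints a few of them. It assumes `problems` was loaded as in the earlier snippet, and that the first human-written solution is stored under the key `solution_0` (an assumption based on the `solution_{i}` naming scheme above).

```python
# Hypothetical usage example; field access follows the schema listed above.
algebra = [p for p in problems if p["subject"] == "algebra"]

for p in algebra[:3]:
    # year/contest/number jointly identify the source problem.
    print(p["year"], p["contest"], p["number"], "| level:", p["level"])
    print("Q:", p["problem"][:80], "...")
    print("A:", p["answer"])
    if "solution_0" in p:  # assumed key for the first solution
        print("Solution:", p["solution_0"][:80], "...")
```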
## Citation
```bibtex
@misc{yue2024harp,
  title={{HARP}: A challenging human-annotated math reasoning benchmark},
  author={Albert S. Yue and Lovish Madaan and Ted Moskovitz and DJ Strouse and Aaditya K. Singh},
  year={2024},
  url={https://github.com/aadityasingh/HARP}
}
```