Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Datasets filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Reset Other
benchmark:official
art
Synthetic
medical
code
biology
finance
legal
chemistry
music
climate
agent
Apply filters
Datasets
8
Full-text search
Edit filters
Sort: Trending
Active filters:
official
Clear all
HuggingFaceH4/MATH-500
Benchmark
•
Updated
27 days ago
•
500
•
97k
•
277
openai/gsm8k
Benchmark
•
Updated
21 days ago
•
17.6k
•
425k
•
1.1k
yentinglin/aime_2025
Benchmark
•
Updated
21 days ago
•
60
•
17.3k
•
11
estrogen/trans_rights
Updated
18 days ago
•
18
•
1
OpenEvals/SimpleQA
Benchmark
•
Updated
30 days ago
•
4.33k
•
944
•
3
OpenEvals/aime_24
Benchmark
•
Updated
30 days ago
•
30
•
362
•
1
OpenEvals/MuSR
Benchmark
•
Updated
30 days ago
•
756
•
132
FlameF0X/TinyTask-BM
Viewer
•
Updated
16 days ago
•
1.48k
•
32
•
1