Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
alessandrobondielli
's Collections
LLMs-to-test
Datasets-ScaleLLM
MechInterp-Papers
Reading List - TextToImage
Datasets-ScaleLLM
updated
Jul 1, 2025
Upvote
-
truthfulqa/truthful_qa
Viewer
•
Updated
Jan 4, 2024
•
1.63k
•
57.4k
•
270
allenai/qasc
Viewer
•
Updated
Jan 4, 2024
•
9.98k
•
8.26k
•
23
Anthropic/model-written-evals
Viewer
•
Updated
Dec 21, 2022
•
3.25k
•
910
•
56
yesilhealth/Health_Benchmarks
Viewer
•
Updated
Apr 20, 2025
•
7.54k
•
735
•
8
maveriq/bigbenchhard
Viewer
•
Updated
Sep 29, 2023
•
6.51k
•
724
•
38
Note
Filtrare i subset che non hanno campo choice
tau/commonsense_qa
Viewer
•
Updated
Jan 4, 2024
•
12.1k
•
50.9k
•
125
allenai/sciq
Viewer
•
Updated
Jan 4, 2024
•
13.7k
•
33.4k
•
130
allenai/openbookqa
Viewer
•
Updated
Jan 4, 2024
•
11.9k
•
95.5k
•
120
allenai/ai2_arc
Viewer
•
Updated
Dec 21, 2023
•
7.79k
•
229k
•
245
TIGER-Lab/MMLU-Pro
Viewer
•
Updated
Oct 25, 2025
•
12.1k
•
67.1k
•
404
Upvote
-
Share collection
View history
Collection guide
Browse collections