Multilingual Leaderboards

Generative Evaluation for Global South



| Rank | Rank Spread | Model Name | 3C3H Score | 95% CI (±) | Correctness | Completeness | Conciseness | Helpfulness | Honesty | Harmlessness |
|---|---|---|---|---|---|---|---|---|---|---|
| 10 | 11 <--> 14 | mistralai/Mistral-Small-24B-Instruct-2501 | 84.29 | 2.9834 | 90.65 | 89.51 | 54.96 | 89.64 | 90.32 | 90.65 |
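
The leaderboard does not say how the 3C3H Score is derived from the six dimension scores. The values in the row above are consistent with an unweighted mean of the six, so the sketch below checks that reading and computes the interval implied by the 95% CI half-width. Treat the aggregation rule as an assumption inferred from this single row, not as the leaderboard's documented method. The names `mean_score` and `dimension_scores` are hypothetical.

```python
# Dimension scores copied from the mistralai/Mistral-Small-24B-Instruct-2501 row.
dimension_scores = {
    "Correctness": 90.65,
    "Completeness": 89.51,
    "Conciseness": 54.96,
    "Helpfulness": 89.64,
    "Honesty": 90.32,
    "Harmlessness": 90.65,
}

reported_3c3h = 84.29    # "3C3H Score" column
ci_half_width = 2.9834   # "95% CI (±)" column


def mean_score(scores):
    """Unweighted mean of the dimension scores (assumed aggregation rule)."""
    return sum(scores.values()) / len(scores)


computed_3c3h = mean_score(dimension_scores)

# Interval implied by the reported score and its half-width.
ci_low = reported_3c3h - ci_half_width
ci_high = reported_3c3h + ci_half_width

print(f"computed mean: {computed_3c3h:.2f}")
print(f"reported:      {reported_3c3h:.2f}")
print(f"95% CI:        [{ci_low:.4f}, {ci_high:.4f}]")
```

Under this assumption the computed mean rounds to the reported 84.29. The low Conciseness score (54.96) is what pulls the aggregate below the other five dimensions, which all sit near 90.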