MILU: A Multi-task Indic Language Understanding Benchmark Paper • 2411.02538 • Published Nov 4, 2024 • 2
BhasaAnuvaad: A Speech Translation Dataset for 14 Indian Languages Paper • 2411.04699 • Published Nov 7, 2024
Pralekha: An Indic Document Alignment Evaluation Benchmark Paper • 2411.19096 • Published Nov 28, 2024
FairI Tales: Evaluation of Fairness in Indian Contexts with a Focus on Bias and Stereotypes Paper • 2506.23111 • Published Jun 29, 2025
IndicIFEval: A Benchmark for Verifiable Instruction-Following Evaluation in 14 Indic Languages Paper • 2602.22125 • Published Feb 25
Seeing Isn't Believing: Uncovering Blind Spots in Evaluator Vision-Language Models Paper • 2604.21523 • Published 13 days ago • 2
Preferences of a Voice-First Nation: Large-Scale Pairwise Evaluation and Preference Analysis for TTS in Indian Languages Paper • 2604.21481 • Published 13 days ago • 3
Preferences of a Voice-First Nation: Large-Scale Pairwise Evaluation and Preference Analysis for TTS in Indian Languages Paper • 2604.21481 • Published 13 days ago • 3
Seeing Isn't Believing: Uncovering Blind Spots in Evaluator Vision-Language Models Paper • 2604.21523 • Published 13 days ago • 2
IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages Paper • 2403.06350 • Published Mar 11, 2024 • 1
Finding Blind Spots in Evaluator LLMs with Interpretable Checklists Paper • 2406.13439 • Published Jun 19, 2024 • 2
Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs Paper • 2410.13394 • Published Oct 17, 2024 • 1