IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages MSUR Khan, P Mehta, A Sankar, U Kumaravelan, S Doddapaneni, S Jain, ... arXiv preprint arXiv:2403.06350, 2024 | 10* | 2024 |
Airavata: Introducing Hindi Instruction-tuned LLM J Gala, T Jayakumar, JA Husain, MSUR Khan, D Kanojia, R Puduppully, ... arXiv preprint arXiv:2401.15006, 2024 | 8 | 2024 |
Finding Blind Spots in Evaluator LLMs with Interpretable Checklists S Doddapaneni, MSUR Khan, S Verma, MM Khapra arXiv preprint arXiv:2406.13439, 2024 | 6 | 2024 |
MILU: A Multi-task Indic Language Understanding Benchmark S Verma, MSUR Khan, V Kumar, R Murthy, J Sen arXiv preprint arXiv:2411.02538, 2024 | 1 | 2024 |
Pralekha: An Indic Document Alignment Evaluation Benchmark S Suryanarayanan, H Song, MSUR Khan, A Kunchukuttan, MM Khapra, ... arXiv preprint arXiv:2411.19096, 2024 | | 2024 |
BhasaAnuvaad: A Speech Translation Dataset for 13 Indian Languages S Jain, A Sankar, D Choudhary, D Suman, N Narasimhan, MSUR Khan, ... arXiv preprint arXiv:2411.04699, 2024 | | 2024 |
Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs S Doddapaneni, MSUR Khan, D Venkatesh, R Dabre, A Kunchukuttan, ... arXiv preprint arXiv:2410.13394, 2024 | | 2024 |