Abstract
The rapid advancement of Large Language Models (LLMs) has intensified the need for evaluation frameworks that go beyond English-centric benchmarks and address the requirements of linguistically diverse regions such as India. We introduce EKA-EVAL, a unified evaluation framework that integrates more than 35 benchmarks (including 10 Indic benchmarks) across nine major evaluation categories. The framework offers broader coverage than existing Indian-language evaluation tools, providing 11 core capabilities through a modular architecture, seamless integration with Hugging Face and proprietary models, and plug-and-play usability. As the first end-to-end suite for scalable, multilingual LLM benchmarking, EKA-EVAL combines extensive benchmarks, modular workflows, and dedicated support for low-resource Indian languages to enable inclusive assessment of LLM capabilities across diverse domains. In extensive comparisons against five existing baselines, EKA-EVAL achieved the highest participant ratings in four out of five categories. The framework is open-source and publicly available at https://github.com/lingo-iitgn/eka-eval.