IndicIFEval: A Benchmark for Verifiable Instruction-Following Evaluation in 14 Indic Languages

Jayakumar, Thanmay; Khan, Mohammed Safi Ur Rahman; Dabre, Raj; Puduppully, Ratish; Kunchukuttan, Anoop

Computer Science > Computation and Language

arXiv:2602.22125 (cs)

[Submitted on 25 Feb 2026]

Title:IndicIFEval: A Benchmark for Verifiable Instruction-Following Evaluation in 14 Indic Languages

Authors:Thanmay Jayakumar, Mohammed Safi Ur Rahman Khan, Raj Dabre, Ratish Puduppully, Anoop Kunchukuttan

View PDF HTML (experimental)

Abstract:Instruction-following benchmarks remain predominantly English-centric, leaving a critical evaluation gap for the hundreds of millions of Indic language speakers. We introduce IndicIFEval, a benchmark evaluating constrained generation of LLMs across 14 Indic languages using automatically verifiable, rule-based instructions. It comprises around 800 human-verified examples per language spread across two complementary subsets: IndicIFEval-Ground, translated prompts from IFEval (Zhou et al., 2023) carefully localized for Indic contexts, and IndicIFEval-Ground, synthetically generated instructions grounded in native Indic content. We conduct a comprehensive evaluation of major open-weight and proprietary models spanning both reasoning and non-reasoning models. While models maintain strong adherence to formatting constraints, they struggle significantly with lexical and cross-lingual tasks -- and despite progress in high-resource languages, instruction-following across the broader Indic family lags significantly behind English. We release IndicIFEval and its evaluation scripts to support progress on multilingual constrained generation (this http URL).

Comments:	8 pages + Appendix
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2602.22125 [cs.CL]
	(or arXiv:2602.22125v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2602.22125

Submission history

From: Thanmay Jayakumar [view email]
[v1] Wed, 25 Feb 2026 17:12:37 UTC (4,529 KB)

Computer Science > Computation and Language

Title:IndicIFEval: A Benchmark for Verifiable Instruction-Following Evaluation in 14 Indic Languages

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:IndicIFEval: A Benchmark for Verifiable Instruction-Following Evaluation in 14 Indic Languages

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators