On Tuesday, OpenAI unveiled IndQA, a benchmark designed to evaluate how well artificial intelligence (AI) models understand and reason about questions rooted in Indian languages and cultural contexts.
IndQA marks the AI giant's first focused effort to create a region-specific benchmark, and OpenAI aims to build similar benchmarks for other languages and regions going forward.
Srinivas Narayan, CTO of B2B applications at OpenAI, noted that India was selected "as an obvious starting point given its market size, linguistic diversity with approximately one billion people who don't use English as their primary language, and cultural richness."
Meanwhile, India represents OpenAI's second-largest market for ChatGPT, which counts roughly 800 million weekly active users globally.

IndQA employs a rubric-based approach in which each response is graded against criteria written by domain experts for that specific question. These criteria outline what an ideal answer should include or avoid, and each criterion carries a weighted point value based on its importance. A model-based grader then checks whether each criterion is met, with the final score calculated as the sum of points earned out of the total possible, as sketched below.

The benchmark currently comprises 2,278 questions spanning 11 Indian languages (Hindi, Hinglish, Gujarati, Punjabi, Kannada, Odia, Marathi, Malayalam, Tamil, Bengali, and Telugu) and 10 cultural domains (Law and ethics, Architecture and design, Food and cuisine, Everyday life, Religion and spirituality, Sports and recreation, Literature and linguistics, Media and entertainment, Arts and culture, and History), developed in collaboration with 261 domain experts including journalists, linguists, scholars, artists, and industry practitioners.
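OpenAI has not published grading code alongside the announcement, but the scoring scheme described above reduces to a weighted checklist. A minimal sketch in Python, with hypothetical criterion descriptions and weights, might look like this:

```python
from dataclasses import dataclass

@dataclass
class Criterion:
    description: str  # what an ideal answer should include or avoid
    weight: float     # point value assigned by the domain expert
    met: bool         # the model-based grader's judgment on this criterion

def rubric_score(criteria: list[Criterion]) -> float:
    """Score a response as points earned out of total possible points."""
    total = sum(c.weight for c in criteria)
    earned = sum(c.weight for c in criteria if c.met)
    return earned / total if total else 0.0

# Hypothetical rubric for a single question, applied to one model response.
rubric = [
    Criterion("Identifies the correct literary tradition", weight=3.0, met=True),
    Criterion("Explains the regional and historical context", weight=2.0, met=True),
    Criterion("Avoids conflating it with a similar tradition", weight=1.0, met=False),
]
print(f"{rubric_score(rubric):.1%}")  # 83.3%
```

On this made-up rubric, the response earns 5 of 6 weighted points, or about 83 percent; IndQA's reported scores aggregate such per-question results across the benchmark.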
Each question underwent adversarial filtering: it was tested against OpenAI's strongest models at the time of its creation (GPT-4o, OpenAI o3, and GPT-4.5, with GPT-5 added for questions written after its public launch), and only questions those models struggled to answer well were retained.
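The exact retention rule is not spelled out here, so the cutoff and grader below are illustrative assumptions; a rough sketch of the filtering idea, assuming a question is kept only when no frontier model clears the bar, could look like this:

```python
import random

# The model list follows the article; the pass/fail cutoff and the grader
# below are assumptions for illustration, not OpenAI's published pipeline.
FRONTIER_MODELS = ["gpt-4o", "o3", "gpt-4.5"]  # GPT-5 joined after its launch
KEEP_THRESHOLD = 0.5  # assumed rubric-score cutoff

def grade_response(model: str, question: str) -> float:
    """Stand-in for the rubric-based grader; returns a score in [0, 1]."""
    return random.random()

def is_hard_enough(question: str) -> bool:
    """Retain a question only if no frontier model clears the cutoff."""
    return all(grade_response(m, question) < KEEP_THRESHOLD
               for m in FRONTIER_MODELS)

candidates = ["question 1 ...", "question 2 ...", "question 3 ..."]
benchmark = [q for q in candidates if is_hard_enough(q)]
print(f"kept {len(benchmark)} of {len(candidates)} candidate questions")
```

Filtering of this kind explains why even top models score low on IndQA: questions that frontier models could already answer well were removed by design.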
According to the benchmark results, OpenAI’s GPT-5 Thinking High model scored highest with 34.9%, followed by Google’s Gemini 2.5 Pro Thinking at 34.3%, Gemini 2.5 Flash Thinking (29.7%), Grok 4 (28.5%), OpenAI o3 High (28.1%), Gemini 2.5 Flash No Thinking (26.1%), OpenAI o3 (23.3%), GPT-4o (20.3%), and GPT-4 Turbo (12.1%).
A breakdown by language showed that GPT-5 Thinking High generally outperformed Gemini 2.5 Pro and Grok 4 across most of the benchmark's languages. All models scored highest in Hindi and Hinglish, with GPT-5 reaching approximately 45% and 44% respectively, while performance was lowest across the board in Bengali and Telugu.
However, OpenAI cautions that IndQA should not be interpreted as a language leaderboard. Since questions are not identical across languages, cross-language scores cannot be directly compared. Instead, the company plans to use IndQA to measure improvement over time within a model family or configuration.
In the domain-specific analysis, GPT-5 Thinking High showed its strongest performance in ‘Law and ethics’ (42%), while Google’s Gemini 2.5 Pro notably scored highest of all models in ‘Literature and linguistics’ (41%). All models demonstrated their lowest performance in the ‘History’ domain.
The company said it is committed to improving performance on these metrics and will share IndQA results for future model releases.
