Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries

Jan 1, 2024·
Yiqiao Jin
Yiqiao Jin
,
Mohit Chandra
,
Gaurav Verma
,
Yibo Hu
,
Munmun De Choudhury
,
Srijan Kumar
· 1 min read
Figure showing the main model architecture and workflow Model architecture and key components
Abstract
We present a framework and benchmark to evaluate LLMs’ multilingual capabilities in healthcare queries, revealing significant performance gaps across languages and providing insights for improving healthcare accessibility globally.
Type
Publication
The Web Conference (WWW) 2024

Abstract

We present a framework and benchmark to evaluate LLMs’ multilingual capabilities in healthcare queries, revealing significant performance gaps across languages and providing insights for improving healthcare accessibility globally.

Keywords

Large Language Models, Healthcare, Cross-lingual Evaluation, Multilingual NLP

Yiqiao Jin
Authors
Ph.D. Candidate in Computer Science
My research focuses on adaptive and efficient AI systems, with emphasis on LLM agents, agent memory, self-distillation, multimodal LLMs, and structured multi-agent intelligence.