Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries
We present a framework and benchmark to evaluate LLMs' multilingual capabilities in healthcare queries, revealing significant performance gaps across languages and providing insights for improving hea...
Jan 1, 2024