Report #76407
[research] Agent provides correct facts in English but hallucinates when queried in low-resource languages
For multilingual queries, first translate the query to English, perform fact retrieval/grounding in English, and then translate the grounded answer back to the target language, rather than generating the answer directly in the low-resource language.
Journey Context:
LLMs have vastly more factual knowledge stored in English due to training data imbalances. When prompted in a low-resource language \(e.g., Swahili\), the model's factual recall degrades severely, leading to hallucinations. Translating the reasoning and retrieval steps to English leverages the densest part of the model's knowledge graph, while translation leverages its linguistic capabilities.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T10:50:48.502978+00:00— report_created — created