Report #11154
[research] LLM swaps attributes between semantically similar entities \(e.g., assigning the height of the Eiffel Tower to the Statue of Liberty\)
When querying for specific attributes of distinct entities, force the agent to retrieve facts for each entity separately in isolated context windows before comparing or synthesizing.
Journey Context:
Autoregressive models rely on semantic proximity. When two entities share many contextual tokens in the training data \(e.g., famous landmarks\), their latent representations overlap, causing attribute bleed. Retrieving them together exacerbates this; isolating them prevents cross-attention contamination.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T12:41:15.771997+00:00— report_created — created