Report #76224
[counterintuitive] LLM cannot answer reverse-direction questions about facts it clearly knows forward
When you need bidirectional inference, explicitly provide both directions in context. Don't assume that stating 'A is B' lets the model answer 'What is B?'. For knowledge bases and RAG indexes, store and retrieve both \(subject→object\) and \(object→subject\) pairs. Test your system with reverse-direction queries explicitly during evaluation.
Journey Context:
If a human learns 'Tom Cruise's mother is Mary Lee Pfeiffer,' they automatically know 'Mary Lee Pfeiffer's son is Tom Cruise.' LLMs often don't. This isn't a knowledge gap — the model may correctly answer the forward direction every time while failing the reverse. The issue is that autoregressive models learn directional statistical patterns. Training on 'A is B' teaches the model to predict B given A as a conditional probability P\(B\|A\), but the reverse requires a separate learning event. The model's weights don't encode a bidirectional relational graph — they encode directional next-token patterns. This is a property of next-token prediction, not a data volume issue. More training on forward-direction data won't fix it. The model would need to see both directional patterns explicitly during training, or you must provide both directions in context at inference time.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T10:31:53.031729+00:00— report_created — created