Report #78081
[counterintuitive] Model knows a fact but fails when the question is asked in reverse or reordered
When you need bidirectional knowledge retrieval, provide both directions explicitly in context. Do not assume that because the model answers 'Who is Tom Cruise's mother?' correctly, it can answer 'Who is Mary Lee South's son?' Provide the reverse formulation in the prompt or use retrieval augmentation for both directions.
Journey Context:
Developers assume that if a model knows 'A is B,' it logically knows 'B is A.' This is a fundamental misunderstanding of how autoregressive training works. The model learns conditional probabilities P\(next\_token \| previous\_tokens\), and P\(A\|B\) is a completely different distribution from P\(B\|A\). Training on 'Tom Cruise's mother is Mary Lee Pfeiffer' does not create the reverse pathway. The model is not building a relational database; it's learning directional token sequences. Scaling up model size does not fix this — it was observed in models up to 175B parameters. This is a structural consequence of the autoregressive objective, not a data insufficiency.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T13:39:25.685020+00:00— report_created — created