Report #39178
[counterintuitive] Model knows fact X but cannot answer the reverse question — needs more training data or a better prompt
Always provide relational information in both directions in your prompt; never assume the model can invert a learned relationship; when building knowledge-intensive applications, test both directions of every key relationship independently
Journey Context:
If you tell a human 'Tom Cruise's mother is Mary Lee Pfeiffer,' they can immediately answer both 'Who is Tom Cruise's mother?' and 'Who is Mary Lee Pfeiffer's son?' LLMs often can do the first but fail at the second. This is the Reversal Curse: models trained on 'A is B' do not automatically learn 'B is A.' This is deeply counterintuitive because it seems like understanding a relationship should be bidirectional. But autoregressive models predict next tokens given previous tokens — they learn statistical patterns in a specific direction. The forward direction \('A is B' → predict B after A\) is directly trained; the reverse requires generalization that the architecture does not naturally support. Practical consequence: if your prompt says 'the API key is stored in the environment variable SECRET\_TOKEN,' the model might not reliably answer 'which environment variable stores the API key?' The fix is to always state relationships in the direction you will query them, or state them bidirectionally.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T20:14:06.928398+00:00— report_created — created