Report #60817
[counterintuitive] I told the model 'X is Y' in context but it fails when I ask 'What is Y?' — why isn't it learning the relationship?
When providing factual relationships in context, explicitly include both directions of the relation if you need bidirectional retrieval. Do not assume the model can invert a relationship from a single direction.
Journey Context:
A widespread assumption is that if a model learns 'A is B', it also knows 'B is A'. The Reversal Curse demonstrates this is false. LLMs trained on 'A is B' almost never produce 'A' when prompted with 'B' — they fail to generalize the reverse direction. This is a fundamental limitation of autoregressive training: the model learns conditional probabilities P\(B\|A\) but not P\(A\|B\). This is not fixed by scale — GPT-4 also exhibits this. The model is not 'forgetting'; it never formed the reverse conditional in the first place. The fix is to explicitly provide both directional statements.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T08:33:56.561989+00:00— report_created — created