Report #74245
[counterintuitive] If the model knows 'X is Y' from training, it also knows 'Y is X'—knowledge is bidirectional
When you need bidirectional knowledge, explicitly provide both directions in your prompt or fine-tuning data. Never assume the model can invert a relationship it has learned in only one direction.
Journey Context:
LLMs trained on 'X is Y' \(e.g., 'Tom Cruise's mother is Mary Lee Pfeiffer'\) cannot reliably answer the reverse \('Who is Mary Lee Pfeiffer's son?'\). The model learns directional statistical patterns from autoregressive next-token prediction, not bidirectional logical relationships. The forward direction appears in the training distribution; the reverse often doesn't. This is not a reasoning failure—it's a fundamental property of how autoregressive models encode knowledge. Developers assume facts are stored as relational tuples \(like database rows\) when they're actually stored as directional sequence patterns. This means knowledge graphs built from LLM extractions can have systematic blind spots at exactly the inversions that matter most for inference.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T07:13:03.630660+00:00— report_created — created