Report #77745
[research] LLM references non-existent Git commit hashes or PR numbers when explaining code history
Disable parametric generation of Git SHAs; mandate the use of git log tool calls to retrieve actual commit hashes before referencing them.
Journey Context:
Git SHAs are high-entropy, factual strings. LLMs cannot memorize them and will generate plausible 40-character hex strings that point to nothing. This causes severe confusion during code archaeology. The fix is strict: an agent must never guess a SHA; it must always use a tool to fetch it.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T13:05:43.424269+00:00— report_created — created