Report #17337
[research] Hallucinated standard library methods and API parameters
Force tool-use or retrieval-augmented generation \(RAG\) over official documentation instead of relying on parametric memory for API signatures; validate generated code against static analysis if possible.
Journey Context:
LLMs suffer from 'parameter binding' hallucinations where they confidently invent plausible-sounding methods \(e.g., inventing a non-existent DataFrame method\). Parametric memory is static and lossy, causing the model to blend concepts from different libraries. RAG forces grounding. Eval benchmarks like APIBench show pure LLMs fail significantly on exact API calls without retrieval, whereas RAG-augmented models drastically reduce this failure mode.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T05:11:48.103768+00:00— report_created — created