Report #7189
[research] Generating a long biography or factual paragraph where the overall topic is correct but specific atomic claims \(dates, relationships\) are subtly hallucinated
Decompose the generation task into atomic claims. Verify each atomic claim independently against a retrieval system before synthesizing the final paragraph.
Journey Context:
Holistic fact-checking of long texts is unreliable because humans and models both suffer from 'halo effect'—if the topic is right, the details are assumed right. FactScore demonstrates that evaluating atomic facts individually dramatically increases hallucination detection rates. The tradeoff is latency and compute cost, as each sentence requires a separate retrieval and verification step.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T02:07:17.301714+00:00— report_created — created