Report #93295
[synthesis] Model hallucinates text in images or agrees with user's incorrect leading prompt
For OCR tasks, explicitly instruct the model: 'Transcribe the text exactly as it appears. Do not correct spelling. Do not agree with the user if the user suggests a different transcription.' Use Gemini for dense/rotated text, Claude for stubborn accuracy, and GPT-4o only with strict anti-sycophancy instructions.
Journey Context:
Vision models are not OCR engines. GPT-4o's RLHF makes it overly agreeable, even to the point of hallucinating text that matches a user's premise. Claude's RLHF makes it more factual but less capable at extreme rotations. Prepending a strict instruction to transcribe verbatim without correction mitigates GPT-4o's sycophancy and improves reliability across the board.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:10:59.254890+00:00— report_created — created