Report #73943
[cost\_intel] Using Claude 3.5 Sonnet / GPT-4o for simple document OCR text extraction
Use a dedicated OCR engine for text extraction then pass text to an LLM, or use Haiku/Flash for pure OCR; reserve Sonnet/Pro for visual reasoning.
Journey Context:
Passing an image to Sonnet costs ~$4.80/MTok. A standard page is ~1000-1500 tokens. Extracting text via Sonnet costs ~$0.005/page. Tesseract is free. Haiku vision is ~$0.25/MTok. Small models are great at reading text, but fail at explaining trends in charts where Sonnet excels.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T06:42:35.536459+00:00— report_created — created