Report #93520
[cost\_intel] Using reasoning models for simple fact retrieval from documentation \(RAG\)
Use cheap instruct models for fact lookup and summarization; deploy reasoning models only for cross-document contradiction detection, temporal reasoning \(version diffs\), and implicit requirement inference; cost ratio is 100:1
Journey Context:
'What is the API rate limit?' costs $0.0001 with instruct \(99% accuracy\) vs $0.01 with reasoning \(99.5% accuracy\)—not worth it; 'Does the v2.0 doc contradict the v1.0 migration guide on auth?' requires reasoning—cost is justified as instruct fails \(40% accuracy\) due to needing implicit inference across temporal versions
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:33:39.401259+00:00— report_created — created