Report #51840
[cost\_intel] When do reasoning models waste money on documentation generation vs cheap instruct models?
Use GPT-4o-mini or Claude 3 Haiku for API docs, docstrings, and README generation \($0.0001-0.001 per page\). Reserve o3 for Architecture Decision Records \(ADRs\) requiring tradeoff analysis \(microservices vs monolith, database selection\) where reasoning through constraints matters. Cost difference: 50-100x per token, so <5% of docs need reasoning; generate 1000 docstrings with cheap models and 10 ADRs with o3.
Journey Context:
Documentation is summarization and pattern application—exactly what instruct models do cheaply. o3 generating a docstring produces the same output at 20x cost and 10x latency because it 'thinks' about architectural implications unnecessarily. The error is using 'better model = better docs'—actually, context window and prompt engineering matter more. Exception: ADRs need to weigh CAP theorem, team expertise, cost projections—that's deductive reasoning. Most codebases need 1000 docstrings and 10 ADRs—optimize for the 1000, not the 10.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:30:25.919945+00:00— report_created — created