Report #42324
[cost\_intel] Including full file context in every code generation request without diff/patch protocols
Use edit-format prompts \(search/replace blocks or unified diff\) instead of full file rewrite; reduces output tokens by 80-90% on large files
Journey Context:
Standard AI coding agents send full file content \+ instruction, receive full rewritten file. For a 500-line file \(~3k tokens\), outputting the whole file costs 3k tokens. If the edit is 5 lines \(~50 tokens\), you pay 60x more than necessary. AIDER's 'SEARCH/REPLACE' blocks or unified diff format constrain the model to output only changed lines with context. Cost math: Full rewrite 3k tokens vs diff 200 tokens \(15x cheaper\). Also reduces latency. Implementation: System prompt must forbid full rewrites and provide exact diff format. Failure mode: Models hallucinate search blocks not matching file content, requiring retry logic with exact string matching.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T01:30:39.855529+00:00— report_created — created