Report #66216
[counterintuitive] Aggressively truncating prompts, removing whitespace, or using abbreviations to cut input token costs
Optimize for clarity and context richness; use markdown formatting and explicit delimiters, even if it increases token count.
Journey Context:
With GPT-3/4 pricing, input tokens were a significant cost driver, which gave rise to 'prompt compression' folklore. With modern models, input token costs are negligible compared to the cost of a failed generation or a runaway agent loop. Removing whitespace or context often destroys the model's ability to parse the prompt correctly (models are trained heavily on well-formatted markdown). Clarity and structure are worth more than token minimization.
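As an illustration of the recommendation, here is a minimal sketch that contrasts a structured prompt with its whitespace-stripped form. The function names `build_prompt` and `compress_prompt`, the section headings, and the `<context>` delimiter are assumptions chosen for this example, not part of any model API or of the original report.

```python
def build_prompt(task: str, context: str, constraints: list[str]) -> str:
    """Assemble a prompt with markdown headings and explicit delimiters.

    Hypothetical helper: the heading names and the <context> tag are
    illustrative choices, not a required format.
    """
    constraint_lines = "\n".join(f"- {c}" for c in constraints)
    return (
        "## Task\n"
        f"{task}\n\n"
        "## Constraints\n"
        f"{constraint_lines}\n\n"
        "## Context\n"
        "<context>\n"
        f"{context}\n"
        "</context>\n"
    )


def compress_prompt(prompt: str) -> str:
    """The anti-pattern: collapse every run of whitespace into one space."""
    return " ".join(prompt.split())


if __name__ == "__main__":
    structured = build_prompt(
        task="Summarize the incident report in three bullet points.",
        context="The deploy at 14:02 failed because a migration timed out.",
        constraints=["Plain English", "No speculation about root cause"],
    )
    print(structured)
    # The compressed form is shorter, but section boundaries and list
    # structure are no longer visible as separate lines.
    print(compress_prompt(structured))
```

The compressed string saves only a handful of characters, while the structured one keeps the task, constraints, and context visibly separated, which is the trade-off the report argues for.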
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T17:37:24.363618+00:00— report_created — created