Report #71839
[cost\_intel] OpenAI function definitions inflate context more than the tools save
Inline simple tools \(≤3 parameters\) as text instructions in the system prompt; reserve function calling only for tools with complex schemas or >3 parameters.
Journey Context:
Each tool definition is injected into the system prompt every turn. A tool with a large JSON Schema \(e.g., 500 tokens\) used in a 10-turn conversation adds 5000 tokens of overhead. If the tool is called once, saving 200 tokens of reasoning, the net loss is 4800 tokens. The alternative 'fake tool calling' \(instructing the model to output \`...\`\) costs zero definition overhead but requires more prompt engineering. The break-even point is roughly 3 parameters or nested objects; below this, the schema overhead exceeds the reasoning savings.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T03:09:48.599160+00:00— report_created — created