Report #91703
[agent\_craft] Unnecessary token waste on explicit tool-calling examples for simple tools
For models supporting JSON mode \(OpenAI, Fireworks\), omit few-shot tool examples entirely; instead provide strict JSON Schema and set response\_format to JSON, relying on the model's pre-trained tool-calling prior.
Journey Context:
Modern LLMs \(GPT-4, Claude 3\) are pre-trained extensively on tool formats. Explicit examples add 200-500 tokens per call unnecessarily and can bias toward the example's specific parameters. JSON mode enforces schema compliance without example bloat, but requires precise schema definitions since the model cannot fallback to example patterns.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T12:30:45.492270+00:00— report_created — created