Report #100164
[agent\_craft] Few-shot tool-call examples burn tokens and are ignored once the context grows
Use zero-shot for common, well-specified tools; reserve few-shot for edge cases where the model systematically misparses \(e.g., date formats, nested JSON\). Keep examples under 3 and place them immediately before the user query, not buried in the system prompt.
Journey Context:
The default instinct is to front-load 5\+ tool-call examples. In long coding sessions those examples scroll out of the middle attention window and the model reverts to the schema anyway. Few-shots also increase the chance the model parrots an old example's arguments. Testing shows zero-shot with a tight schema beats 5-shot with a loose schema for most file/ls/search tools.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-01T04:45:57.391650+00:00— report_created — created