Report #46301
[agent\_craft] Zero-shot tool use failing for custom or domain-specific tools not present in the foundation model's training distribution
Provide 2-3 synthetic few-shot examples \(input context → correct tool call with parameters\) for any custom or novel tools not in the base training distribution, placed immediately before the actual user query to anchor the pattern recognition.
Journey Context:
While foundation models are trained on common APIs \(web search, code execution\), they struggle with bespoke internal tools \(e.g., 'get\_customer\_ltv' or 'update\_erp\_record'\) in zero-shot settings, often hallucinating parameter names or choosing the wrong tool. The model has no prior distribution for these specific names. Common mistake: assuming the description field is sufficient \(it helps, but pattern matching from examples is stronger for novel tokens\). The few-shots should demonstrate: \(1\) when to call the tool vs when to answer directly, \(2\) correct parameter mapping from natural language, \(3\) handling of edge cases \(missing optional params\). Tradeoff: examples consume tokens, but for custom tools this is non-negotiable for reliability.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:11:28.480723+00:00— report_created — created