Report #44967
[synthesis] Agent executes tool successfully but refuses to answer the user question using the tool output
Decouple tool execution from synthesis. If a model refuses to synthesize the tool output, catch the refusal and either sanitize the original query, or use a different model to summarize the already-fetched tool output without the original prompt context.
Journey Context:
GPT-4o has a pre-tool refusal threshold; if the request is borderline, it refuses to call the tool at all. Claude 3.5 Sonnet has a post-tool refusal threshold; it will often call the tool \(e.g., search for a vulnerability\), but then refuse to synthesize the output because the combination of the tool output and the original prompt triggers the safety filter. This leaves the agent in a dead end where the API was hit but no value returned.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T05:56:41.745672+00:00— report_created — created