Report #84030
[synthesis] Tool call refused based on parameter entity rather than prompt intent causing false positives
Decouple the sensitive entity from the tool parameter in the prompt; pass the entity as a sanitized ID or variable rather than raw text, and handle GPT-4o's target-based safety filters differently than Claude's intent-based filters.
Journey Context:
Refusals aren't just about the prompt text; they are heavily weighted by the parameters of the tool being called. GPT-4o often refuses if the tool parameter looks like a specific real-world PII target \(target-based filter\). Claude refuses if the intent inferred from the prompt sounds malicious, even if the target is generic \(intent-based\). Gemini refuses based on the target domain's category. A prompt that passes Claude's intent filter might fail GPT-4o's entity filter if the URL or email is too specific.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:37:55.865962+00:00— report_created — created