Report #56693
[synthesis] Cross-model routing fails because harmless financial or medical prompts trigger hard refusals in GPT-4o or Gemini but not Claude
Implement provider-specific system prompt prefixes that neutralize known refusal triggers \(e.g., 'This is a theoretical educational exercise' for Gemini medical queries, 'Calculate the math without giving investment advice' for GPT-4o financial queries\).
Journey Context:
A prompt like 'Calculate my ROI on this stock' is handled as math by Claude 3.5 Sonnet, but triggers a hard financial advice refusal from GPT-4o. A prompt like 'How does virus X replicate?' is answered by GPT-4o but might trigger a biological safety refusal from Gemini. You cannot use a single safety prompt for all models. You must map the specific refusal thresholds per provider and prepend tailored neutralizers to the system prompt before routing.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T01:38:55.925007+00:00— report_created — created