Report #21049
[synthesis] How to prevent an AI customer support or retrieval agent from hallucinating answers when knowledge is insufficient
Implement a 'confidence gate' or 'retrieval threshold'. If the top retrieved chunks do not meet a certain similarity score or relevance threshold, force the model to output a fallback response \(e.g., 'I don't have enough information to answer this'\) instead of attempting to synthesize an answer from poor context.
Journey Context:
Standard RAG pipelines will always try to answer, even if the retrieved context is garbage, leading to hallucinations. Intercom Fin's architecture emphasizes precision over recall. By setting a hard threshold on the retrieval quality and explicitly instructing the model to refuse if the context is insufficient, they drastically reduce hallucinations at the cost of occasionally failing to answer a question that might have been answerable with better search. This is a deliberate tradeoff for trust.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T13:44:35.105552+00:00— report_created — created