Report #44810
[gotcha] User retries a question with slight rephrasing but keeps getting the same wrong cached answer
Exclude user-initiated retries from semantic cache hits. Add a cache-busting flag or parameter to regenerate/retry actions. If you use semantic caching, set a TTL and provide a force-refresh mechanism. Monitor the cache hit rate on retry paths specifically: if it is high, the cache is too aggressive for interactive use.
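The advice above could be sketched roughly as follows. This is a minimal in-memory illustration, not any particular library's API: `SemanticCache`, `answer`, the `force_refresh` and `is_retry` parameters, and the default threshold and TTL values are all hypothetical names and numbers chosen for the example. The `embed` and `generate` callables stand in for whatever embedding model and LLM call you actually use.

```python
import math
import time
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence


@dataclass
class Entry:
    embedding: Sequence[float]
    answer: str
    stored_at: float


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


class SemanticCache:
    """Hypothetical similarity cache with a TTL and a force-refresh bypass."""

    def __init__(
        self,
        embed: Callable[[str], Sequence[float]],
        threshold: float = 0.95,      # assumed value; tune for your embeddings
        ttl_seconds: float = 300.0,   # assumed value
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.embed = embed
        self.threshold = threshold
        self.ttl = ttl_seconds
        self.clock = clock
        self.entries: List[Entry] = []

    def lookup(self, query: str, force_refresh: bool = False) -> Optional[str]:
        # Retry/regenerate paths skip the cache entirely.
        if force_refresh:
            return None
        now = self.clock()
        # Drop expired entries before searching.
        self.entries = [e for e in self.entries if now - e.stored_at < self.ttl]
        qv = self.embed(query)
        best = max(self.entries, key=lambda e: cosine(qv, e.embedding), default=None)
        if best is not None and cosine(qv, best.embedding) >= self.threshold:
            return best.answer
        return None

    def store(self, query: str, answer: str) -> None:
        qv = self.embed(query)
        # Replace near-duplicates so a regenerated answer supersedes the old one.
        self.entries = [
            e for e in self.entries if cosine(qv, e.embedding) < self.threshold
        ]
        self.entries.append(Entry(qv, answer, self.clock()))


def answer(
    cache: SemanticCache,
    generate: Callable[[str], str],
    query: str,
    is_retry: bool = False,
) -> str:
    """Serve from cache unless the request came from a retry/regenerate action."""
    cached = cache.lookup(query, force_refresh=is_retry)
    if cached is not None:
        return cached
    fresh = generate(query)
    cache.store(query, fresh)
    return fresh
```

In this sketch the UI's regenerate button would call `answer(..., is_retry=True)`. Replacing near-duplicate entries on `store` is a design choice made here so the answer the user rejected does not resurface on the next ordinary lookup.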
Journey Context:
Semantic caching (caching by embedding similarity rather than exact string match) is popular for reducing AI costs and latency. The gotcha: when a user gets a bad answer and rephrases their question slightly ('explain it simpler' vs 'explain it more simply'), the semantic cache returns the same answer because the embeddings are nearly identical. The user thinks the AI is being stubborn or broken. They rephrase again and get the same cached answer. This is completely invisible: no error, no log, just a frustrated user stuck in a loop. The counter-intuitive part: your cache hit rate looks great on dashboards, but the cache is failing on the most important path, retries from dissatisfied users. On that path, the metric you're optimizing (cache hit rate) is inversely correlated with user satisfaction.
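A rephrased question does not arrive with a retry flag, so one possible way to catch the loop described above is to notice when the same session is about to be served the same cached entry twice in a short window. The sketch below assumes you have a session identifier and a stable ID per cache entry; `RepeatServeGuard`, `allow_hit`, and the 120-second window are hypothetical choices for illustration, and the blocked-hit counter doubles as a rough retry-path metric.

```python
import time
from typing import Dict, Optional, Tuple


class RepeatServeGuard:
    """Blocks serving the same cache entry to the same session twice in a row.

    Heuristic: if a session gets matched to the entry it was just served,
    the user is probably rephrasing because the first answer was unsatisfying.
    """

    def __init__(self, window_seconds: float = 120.0) -> None:  # assumed window
        self.window = window_seconds
        # session_id -> (entry_id last served, time it was served)
        self._last: Dict[str, Tuple[str, float]] = {}
        # Count of cache hits suppressed as likely retries; worth exporting
        # to a dashboard next to the overall hit rate.
        self.repeat_hits_blocked = 0

    def allow_hit(
        self, session_id: str, entry_id: str, now: Optional[float] = None
    ) -> bool:
        """Return False when this hit looks like a rephrased retry."""
        now = time.monotonic() if now is None else now
        last = self._last.get(session_id)
        if last is not None and last[0] == entry_id and now - last[1] <= self.window:
            self.repeat_hits_blocked += 1
            return False
        self._last[session_id] = (entry_id, now)
        return True
```

When `allow_hit` returns `False`, the caller would treat the lookup as a miss and generate a fresh answer. A high `repeat_hits_blocked` count relative to total hits would suggest the similarity threshold is too loose for interactive use.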
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T05:40:52.704190+00:00— report_created — created