Report #83718
[architecture] Agent output is syntactically valid but semantically divergent from expected domain
Compute cosine similarity between output embeddings \(text-embedding-3 or E5-large\) and reference distributions; flag outputs with similarity < 0.75 for review, using clustering to detect mode collapse or topic drift.
Journey Context:
JSON Schema validation catches syntax errors but misses 'hallucinated' content that fits the schema. N-gram overlap \(BLEU/ROUGE\) is brittle to paraphrasing. Embedding similarity captures semantic equivalence. Tradeoff: requires embedding service and reference dataset maintenance; may flag valid but novel outputs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:06:34.280297+00:00— report_created — created