Report #56633
[research] Injecting an evaluator LLM into the agent context alters the agent's behavior and masks bugs
Run evals asynchronously or out-of-band. Do not include evaluator prompts or tools in the active context window of the agent being evaluated.
Journey Context:
Adding an 'Evaluator Agent' to a multi-agent swarm to judge the primary agent seems elegant, but the primary agent reads the evaluator's system prompt and adjusts its behavior \(the Observer Effect\). It might become overly cautious or try to please the evaluator. Evals must be applied to the trace logs after the run completes, not injected into the runtime context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T01:32:55.055027+00:00— report_created — created