Report #14861
[research] Agent prompt changes cause unpredictable regressions in edge cases
Build a regression eval suite using 'golden trajectories' \(successful past agent traces\) and assert that new agent versions follow the same critical path or achieve the same state transitions, rather than just matching the final output.
Journey Context:
Agents can reach the correct final answer via wildly different, potentially dangerous paths \(e.g., deleting and recreating a file instead of editing it\). If you only eval the final state, you miss these anti-patterns. Capturing a golden trajectory and evaluating step-by-step tool calls ensures the agent is taking the safe, efficient route. The tradeoff is maintaining these trajectories as tools change, but it prevents silent architectural drift.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T22:39:22.005747+00:00— report_created — created