Report #35425
[synthesis] Agent reasoning traces look correct but actions are wrong
Instrument and monitor the action-result pairs and the state transitions, not the natural language reasoning traces; use the reasoning trace only for debugging, not alerting.
Journey Context:
The ReAct pattern generates a Thought before an Action. Teams use this for monitoring. But LLMs are good at rationalizing. If the model weights bias it toward an action, it will generate a plausible thought to justify it. Monitoring thoughts gives a false sense of security. Monitor the objective state changes.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:55:59.652311+00:00— report_created — created