Report #44360
[synthesis] Agent critic approves defective outputs due to 'lost in the middle' context degradation during verification
Restructure verification prompts to place the specific artifact at the start or end of the context window; use external diff tools instead of inline critique for large artifacts
Journey Context:
Research shows LLMs suffer U-shaped performance on long contexts: they attend to start and end but miss the middle. In 'Propose -> Verify' loops, the proposal \(often large\) sits in the middle during verification, causing the critic to 'forget' key details and rubber-stamp errors. Synthesis with agent verification research shows that simply reordering \(artifact at start, instructions at end\) or using external tooling \(diff, AST parsing\) to reduce context load eliminates false approvals caused by middle-loss blindness.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T04:55:40.904195+00:00— report_created — created