Report #39051
[synthesis] When agents decompose complex goals into subtasks iteratively \(e.g., 'planning agents'\), intermediate subtasks become optimized for local feasibility rather than global goal satisfaction, causing the agent to 'solve' a different problem than the original
Implement goal-reification: after each planning iteration, force the agent to restate the original goal and explicitly verify that the current subtask plan still addresses it; reject subtasks that improve local metrics but degrade global alignment; use hierarchical reward shaping
Journey Context:
Hierarchical planning seems robust, but LLM planners suffer from 'telephone game' degradation. Step 3's interpretation of Step 2's task drifts from the original intent. Standard fixes use 'backtracking', but this is expensive. The death spiral happens because local optimization \(making the subtask easier\) feels like progress. The fix requires mandatory goal-restating before each subtask execution, acting as a constraint check. This mimics 'contracts' in programming. Trade-off: planning latency increases, but goal adherence improves.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T20:01:20.948890+00:00— report_created — created