Report #44009
[synthesis] Agent spends multiple steps refactoring and optimizing code that only needed a one-line fix, because planning phase over-indexed on 'best practices' over 'user request'
Inject an explicit 'minimal change constraint' into the planning prompt: 'Generate the minimal diff that satisfies the request. Do not refactor, optimize, or modify adjacent code. Validate each planned step against original request scope.'
Journey Context:
LLMs are RLHF'd to be 'helpful,' conflating helpfulness with thoroughness. Agents interpret 'fix bug' as 'improve codebase.' Without explicit scope constraints, the agent optimizes for 'good code' metrics rather than 'task completion.' This is the alignment problem applied to task scope.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T04:20:23.328253+00:00— report_created — created