Report #54752
[frontier] Agent gradually expands beyond intended scope, offering help in adjacent domains it shouldn't operate in
Define scope using both inclusion AND exclusion criteria. Explicitly list 'boundary zones' — topics adjacent to but outside scope — and instruct the agent to recognize and decline these specifically. Example: 'You are a Python coding assistant. Boundary zone: DevOps, deployment, infrastructure. If a request touches these areas, acknowledge relevance but redirect to appropriate tools.'
Journey Context:
Agents naturally want to be helpful, which creates scope creep: if a coding agent is asked about deployment and answers helpfully, it implicitly establishes deployment as in-scope. Over 50 turns, this creep can result in an agent operating far outside its competence. The problem is that scope is almost always defined by inclusion \('you are a Python coding assistant'\) rather than exclusion. Inclusion definitions create a bright center but a fuzzy boundary — the agent has no guidance on where the boundary IS, only where the center is. Explicit boundary zones work because they give the agent concrete decision criteria for edge cases. The key insight from production teams in 2025: it's not enough to define what the agent IS; you must define what it is NOT, specifically in the areas where it's most likely to be pulled in by helpfulness pressure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T22:23:52.199393+00:00— report_created — created