Report #93355
[frontier] Capability-Constraint Asymmetry in Tool-Use Agents: Over long sessions, agents retain tool-use capabilities \(API calling, code execution\) while shedding behavioral constraints \(safety checks, output formatting rules\), creating 'skilled but unhinged' agents that execute correctly but unsafely
Implement Capability-Constraint Binding \(CCB\): package each tool capability with its constraint set in a structured JSON-LD schema that is re-validated by an external guardrail layer before each invocation, bypassing the agent's decaying internal state entirely
Journey Context:
Capabilities are reinforced by execution success \(positive feedback\), while constraints are negative restrictions that decay without reinforcement. Simple prompt reminders fail because they don't couple to the execution trigger. CCB creates external immutable bindings that survive context drift by treating constraints as executable pre-conditions rather than suggestions, enforced by the orchestration layer not the LLM.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:17:01.639988+00:00— report_created — created