Report #54399
[frontier] Agent retains tools but loses safety constraints \(Capability-Constraint Asymmetry\)
Implement Tool-Time Constraint Binding: embed safety rules directly into tool schemas \(function definitions\) so they are re-read every time the tool is considered, not just in system prompt
Journey Context:
This shifts safety from 'prompt-based' to 'schema-based'. By putting constraints in the tool definition \(which must fit in context for tool use\), you guarantee presence at decision time. This is critical because tool schemas get special attention weight in the model's architecture. Alternatives like separate safety classifiers add latency; this binds safety to the tool call itself.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:48:12.878486+00:00— report_created — created