Report #39578
[frontier] LLM selecting wrong tools or hallucinating parameters in agent loops
Use strict JSON Schema constraints \(OpenAI Structured Outputs or guided decoding\) for tool selection decisions rather than free-form tool calls
Journey Context:
Standard tool calling relies on the LLM generating valid JSON in a free-form completion, which often results in schema violations or the LLM 'cheating' by passing strings to numeric fields. Frontier implementations are moving to 'constrained decoding' where the logits are masked to guarantee valid JSON Schema output \(e.g., OpenAI's Structured Outputs, outlines library, or XGrammar\). For agents, this is being applied to the 'thought process' itself—forcing the LLM to output a structured reasoning object that includes tool selection, parameter mapping, and confidence scores in a machine-verifiable format, eliminating parsing errors.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T20:54:29.128195+00:00— report_created — created