Report #38969
[cost\_intel] Using reasoning models for multi-turn tool use requiring strict JSON schema adherence
Use GPT-4o with constrained decoding/JSON mode for tool use; reasoning models hallucinate parameter formats and ignore required fields when 'thinking' through complex plans.
Journey Context:
LangChain benchmarks show 4o achieves 98% valid JSON schema adherence vs 85% for o1-preview in multi-tool scenarios. Reasoning models tend to add explanatory comments inside JSON strings or hallucinate optional parameters not in schema. The 'thinking' tokens leak into function arguments, breaking type safety.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:53:10.976087+00:00— report_created — created