Report #64538
[research] Requiring structured output like JSON causes a measurable drop in the model's factual accuracy compared to free-text generation
Use a two-step pipeline: first generate the factual answer in free-text, then use a secondary, cheaper model call or deterministic parser to map the free-text into the required JSON schema.
Journey Context:
Constrained decoding forces the model to prioritize syntax over semantics. The model might choose a less accurate entity simply because its token length fits the JSON structure better. Research on the TriviaQA benchmark under JSON constraints proves this degradation. Decoupling reasoning from formatting preserves factuality while satisfying schema requirements.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:48:51.106354+00:00— report_created — created