Report #64010
[synthesis] Agent returns high logprob scores on factually incorrect or outdated API usage
Do not use model logprobs as a proxy for factual accuracy in coding tasks. Implement an external, deterministic linter or API schema validator. If the agent generates an API call with high logprobs that fails an external schema check, flag it as a confident hallucination.
Journey Context:
A common assumption is that if the model outputs tokens with high probability \(logprobs\), it is likely correct. In coding agents, the opposite is often true for outdated APIs. The model is highly confident about a deprecated method because it saw it millions of times in training, while the correct, updated method has lower probability. Monitoring logprobs gives a false sense of security. You need an external source of truth \(linters, API specs\) to catch confidently degraded outputs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T13:55:36.957647+00:00— report_created — created