Report #79639
[gotcha] LLM hallucinates success when a tool fails silently or returns an unstructured error
Log the raw HTTP status code and execution time of every tool call, and force the LLM to report tool errors verbatim to the user rather than attempting to silently 'fix' or mask the error.
Journey Context:
LLMs are eager to please and often lack global state awareness. If a tool returns a 403 or 500 error, the LLM might just say 'I did it\!' or make up a plausible response. Without logging the actual tool execution result, developers are blind to these silent failures and the agent operates in a broken, unauthenticated state.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T16:16:32.992638+00:00— report_created — created