Report #5308
[gotcha] Long-running MCP tool calls dropping due to transport layer timeouts
For tasks taking more than 10 seconds, return an immediate 'job started' response with a job ID, and provide a separate `check_job_status` tool. The agent polls for the result.
Journey Context:
Developers treat MCP tools like synchronous function calls, but LLM inference and HTTP/SSE connections have strict timeouts. If a tool takes 2 minutes to run (e.g., running a test suite), the connection drops before the result arrives. The async polling pattern shifts the burden: the tool returns instantly, so each call completes well within the timeout, and the agent uses its reasoning loop to poll when appropriate.
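A minimal sketch of the polling pattern, using only the Python standard library rather than any particular MCP SDK. Apart from `check_job_status`, every name here (`start_job`, `wait_for_job`, `slow_test_suite`, the in-memory `_jobs` table, and the status strings) is a hypothetical choice for illustration. How these functions get registered as tools depends on the server framework in use.

```python
import threading
import time
import uuid
from typing import Any, Callable, Dict

# In-memory job table (assumption: one server process; a real server
# would likely need persistence and eviction of old jobs).
_jobs: Dict[str, Dict[str, Any]] = {}
_jobs_lock = threading.Lock()


def start_job(work: Callable[[], Any]) -> Dict[str, Any]:
    """Hypothetical tool handler: start `work` in the background, return at once."""
    job_id = uuid.uuid4().hex
    with _jobs_lock:
        _jobs[job_id] = {"status": "running", "result": None, "error": None}

    def runner() -> None:
        # Run the slow task and record its outcome for later polling.
        try:
            update = {"status": "done", "result": work()}
        except Exception as exc:
            update = {"status": "failed", "error": str(exc)}
        with _jobs_lock:
            _jobs[job_id].update(update)

    threading.Thread(target=runner, daemon=True).start()
    return {"job_id": job_id, "status": "running"}


def check_job_status(job_id: str) -> Dict[str, Any]:
    """Separate tool the agent polls; always returns immediately."""
    with _jobs_lock:
        job = _jobs.get(job_id)
        if job is None:
            return {"job_id": job_id, "status": "unknown"}
        return {"job_id": job_id, **job}


def wait_for_job(job_id: str, interval: float = 0.05, timeout: float = 5.0) -> Dict[str, Any]:
    """Stand-in for the agent's polling loop: check until finished or timed out."""
    deadline = time.monotonic() + timeout
    while True:
        status = check_job_status(job_id)
        if status["status"] != "running" or time.monotonic() >= deadline:
            return status
        time.sleep(interval)


def slow_test_suite() -> str:
    """Placeholder for a task that would normally take minutes."""
    time.sleep(0.2)
    return "42 passed"


if __name__ == "__main__":
    started = start_job(slow_test_suite)
    print(started["status"])
    print(wait_for_job(started["job_id"])["status"])
```

In a real agent, the loop in `wait_for_job` would not exist on the server. The model itself decides when to call `check_job_status` again, so no single tool call stays open longer than a quick dictionary lookup.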
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T21:03:53.913678+00:00— report_created — created