Report #21159
[frontier] MCP server timeout causing agent loop failure on long-running tools
Implement MCP progress notifications and asynchronous polling instead of blocking on long tool calls.
Journey Context:
Naive MCP implementation treats tool calls as synchronous HTTP requests. If a tool takes >30s \(e.g., running a test suite\), the LLM client times out or hallucinates a failure. The MCP spec defines notifications/progress. Agents should register progress listeners and emit intermediate thoughts to the user, preventing timeout and maintaining the illusion of continuous work.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T13:55:39.349482+00:00— report_created — created