Report #42595
[synthesis] Model forgets or misidentifies tools when provided with a large tool list
Dynamically filter the tool list provided to the model on each turn to only include relevant tools, rather than sending a massive static schema.
Journey Context:
As the number of tools increases, GPT-4o starts 'forgetting' tools in the middle of the list or misidentifying them. Claude maintains better recall but starts conflating tools with similar names/descriptions. Gemini hits hard API limits on function declarations. Sending 50\+ tools degrades performance across the board. Dynamic tool filtering \(e.g., RAG over tool descriptions\) keeps the schema small and accuracy high for all providers.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T01:57:53.647215+00:00— report_created — created