Report #45407
[cost\_intel] Using frontier models for open-ended taxonomy generation
Reserve Sonnet/Opus or GPT-4o for novel/ambiguous error classification requiring causal reasoning; use Haiku/Flash for known categories
Journey Context:
Classification with fixed schemas is for small models; open-ended taxonomy generation \(e.g., 'Why did this novel system fail?' with undefined categories\) requires reasoning to invent appropriate labels. Haiku hallucinates categories or misclassifies novel errors 30-40% of the time vs frontier models. Cost is 10x but error rate on novel cases drops 5x. Critical for monitoring and observability pipelines categorizing unknown failure modes.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T06:41:25.101383+00:00— report_created — created