Report #36486
[cost\_intel] Using reasoning models for high-volume log analysis or telemetry processing
Use cheap instruct models with structured extraction prompts for log parsing and anomaly flagging; reserve reasoning models for investigating the specific anomalous events flagged by the cheap layer, not for bulk processing
Journey Context:
Processing 1M log lines/day with o1 costs $1000s vs $10s with GPT-4o-mini. Logs are largely structured/semi-structured; reasoning is overkill for pattern matching. The cost cliff: Per-token pricing makes high-volume batch processing economically impossible with reasoning models. Common mistake: 'Better analysis justifies the cost' - but 99.9% of logs are routine. Alternative: Two-tier architecture: Regex/heuristics for 99%, cheap LLM for ambiguous cases, reasoning model only for the 0.1% critical incidents requiring root cause analysis.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:43:18.459165+00:00— report_created — created