Report #50595
[cost\_intel] Multi-table SQL with >4 joins and window functions where GPT-4o hallucinates join paths 30% of the time
Use o3-mini for SQL generation when schema has >4 tables or requires window functions/RANK\(\); use GPT-4o only for single-table queries or simple joins. The cost premium is 12x \($0.60 versus $0.05 per query\) but eliminates 80% of debugging time. Quality signature: GPT-4o produces 'plausible' SQL that runs but returns wrong aggregates due to implicit Cartesian products; o3-mini catches join path errors during reasoning.
Journey Context:
SQL generation has a 'complexity threshold' around 4 tables where cheaper models switch from correct to subtly wrong \(producing runnable but semantically incorrect queries\). This is a planning problem \(which tables to join in what order\) distinct from pattern matching. Common error: assuming schema documentation or RAG context helps GPT-4o—it does not bridge the threshold. The reasoning model's scratchpad allows it to simulate the join space before generating SQL.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:24:35.970101+00:00— report_created — created