Report #93533
[cost_intel] Gemini 1.5 Flash 8B quality degradation on long-form video analysis
Use Flash 8B only for short video clips (<2 minutes) with explicit spatial tasks (object detection, action recognition). Upgrade to Pro for long-form video (>5 minutes) requiring temporal reasoning, plot analysis, or cause-effect relationships across scenes.
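The recommendation above can be sketched as a small routing rule. This is a minimal illustration, not a tested production policy: the function name, the task labels, and the tier strings are hypothetical, and the fallback to Pro for cases the report does not cover (for example clips between 2 and 5 minutes) is an assumption.

```python
# Thresholds come from the recommendation; all names here are hypothetical.
SHORT_CLIP_MAX_SECONDS = 2 * 60   # "short" means under 2 minutes
LONG_FORM_MIN_SECONDS = 5 * 60    # "long-form" means over 5 minutes

SPATIAL_TASKS = {"object_detection", "action_recognition"}
TEMPORAL_TASKS = {"temporal_reasoning", "plot_analysis", "cause_effect"}


def choose_tier(duration_seconds: float, task: str) -> str:
    """Return an illustrative tier label ("flash-8b" or "pro").

    The labels are placeholders, not real API model identifiers.
    """
    if duration_seconds < SHORT_CLIP_MAX_SECONDS and task in SPATIAL_TASKS:
        return "flash-8b"
    if duration_seconds > LONG_FORM_MIN_SECONDS and task in TEMPORAL_TASKS:
        return "pro"
    # Assumption: the report is silent on everything else (2-5 minute clips,
    # temporal tasks on short clips, ...), so default to the stronger tier.
    return "pro"
```

For example, `choose_tier(30, "object_detection")` selects the Flash 8B tier, while `choose_tier(600, "plot_analysis")` selects Pro.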
Journey Context:
Flash 8B costs $0.0005 per image-equivalent (video frame) vs. $0.0075 for Pro, making Flash 8B 15x cheaper. On 30-second clips with the prompt 'count the red cars,' Flash 8B achieves 94% accuracy vs. Pro's 96%. On 10-minute videos with the prompt 'explain why the protagonist changed their mind,' Flash 8B drops to 58% accuracy vs. Pro's 89%. The failure mode is Flash 8B's limited context window management: it cannot maintain attention across more than 10k tokens of video (roughly 2-3 minutes), so it misses long-range dependencies.
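The per-frame prices above translate into per-video cost as follows. This is a rough sketch: the sampling rate of one frame per second is an assumption not stated in the report, and the helper name `video_cost` is hypothetical.

```python
import math

# Per-frame prices quoted in the report (USD per image-equivalent).
FLASH_8B_COST_PER_FRAME = 0.0005
PRO_COST_PER_FRAME = 0.0075


def video_cost(duration_seconds: float, cost_per_frame: float,
               frames_per_second: float = 1.0) -> float:
    """Estimate the cost of analysing one video.

    frames_per_second defaults to 1.0, which is an assumption; adjust it
    to match how frames are actually sampled in your pipeline.
    """
    frames = duration_seconds * frames_per_second
    return frames * cost_per_frame


# Price ratio between the two tiers (the "15x" in the report).
price_ratio = PRO_COST_PER_FRAME / FLASH_8B_COST_PER_FRAME

# Estimated cost of one 10-minute video on each tier.
flash_cost = video_cost(10 * 60, FLASH_8B_COST_PER_FRAME)
pro_cost = video_cost(10 * 60, PRO_COST_PER_FRAME)

print(f"ratio: {price_ratio:.0f}x, flash: ${flash_cost:.2f}, pro: ${pro_cost:.2f}")
```

Under the one-frame-per-second assumption, a 10-minute video is 600 frames, so the gap is about $0.30 against $4.50 per video. That saving has to be weighed against the accuracy drop from 89% to 58% on long-form reasoning tasks.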
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:34:59.238481+00:00— report_created — created