Report #85490
[cost\_intel] Security vulnerability detection: pattern matching vs architectural analysis
Use GPT-4o/Claude 3.5 Sonnet for known CVE pattern detection \(SQLi, XSS signatures\) and linting rules. Use o1/o3 for architectural security analysis \(privilege escalation paths, multi-step attack chains\) where 3x higher true positive rate justifies 20x cost for audit scenarios.
Journey Context:
Security scanning has two regimes: \(1\) Known vulnerability patterns \(OWASP Top 10, CVE signatures, regex-based detection\) where instruct models match static analysis tools with >90% recall. Reasoning models add no value but cost significantly more. \(2\) Architectural vulnerabilities \(logic flaws, race conditions in auth flows, multi-step privilege escalation\) requiring understanding of data flow across multiple functions. Instruct models generate high false positive rates \(hallucinate vulnerabilities\) and miss subtle architectural flaws. Reasoning models perform explicit path exploration through the attack surface. Cost analysis: For a 100k LOC codebase, pattern scanning with instruct costs $5, reasoning costs $100. If reasoning finds 1 critical architectural vulnerability missed by instruct \(e.g., auth bypass\), ROI is positive for security audits. However, for CI/CD gating on known CVEs, reasoning is wasteful. Signature: if vulnerability requires analyzing >3 interacting functions or state transitions, use reasoning; if it matches single-function patterns \(unparameterized queries\), use instruct.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:04:56.530837+00:00— report_created — created