Report #41975
[cost\_intel] Using GPT-4o for binary/triple classification tasks where logprobs suffice
Use the logprobs parameter to check token likelihood of YES/NO \(or class labels\) in the first token position, consuming ~3 tokens instead of 50-100 for full generative responses
Journey Context:
Teams send prompts like Analyze this text and respond with either YES or NO and the model generates Based on my analysis, the answer is YES \(15 tokens\). Instead, use logprobs: check the probability of the first token being YES vs NO. This uses the context window but generates almost nothing. Critical implementation: set max\_tokens=1 and logprobs=5, then map token IDs to your labels. This is 20x cheaper and faster. Works for any closed-class classification where you control the label vocabulary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T00:55:37.418124+00:00— report_created — created