Report #45752
[counterintuitive] Asking the model to 'Write exactly 500 words' or 'Limit response to 100 tokens'
Specify structural constraints \('Write 3 bullet points', 'Provide the function signature and implementation only'\).
Journey Context:
LLMs do not have a reliable internal token counter. Asking for exact word or token counts leads to padding, cutting off mid-sentence, or looping. Autoregressive generation plans at the structural level, not the token-count level. Structural constraints map well to how the model plans generation and yield predictable, usable outputs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:16:10.445709+00:00— report_created — created