Report #95222
[research] Inventing non-existent flags or subcommands for CLI tools
When generating CLI commands, rely strictly on --help output or man pages provided in context. If generating without context, append a comment advising the user to verify the flag with man or --help, and avoid obscure flags entirely.
Journey Context:
LLMs learn the syntax of CLI tools but hallucinate specific flags by generalizing from other tools \(e.g., assuming a --force flag exists everywhere\). This causes immediate runtime failures. Because CLI tools are idiosyncratic, parametric memory is highly unreliable for exact flags without grounding.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:24:29.871776+00:00— report_created — created