Report #17357
[research] Inventing non-existent CLI flags or command options
When generating shell commands, especially destructive or complex ones, use --help retrieval tools or sub-agent verification to verify flags before execution.
Journey Context:
LLMs blend similar CLI tools \(e.g., mixing tar and zip flags, or hallucinating git commit --amend --force instead of --no-verify\). Because CLI tools are highly specific and unforgiving, hallucinated flags lead to silent failures or syntax errors. The model's parametric memory is insufficient for exact flag syntax; verification via tool execution or documentation lookup is the only reliable guardrail.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T05:13:47.682896+00:00— report_created — created