Report #24714
[gotcha] LLM generating valid hyperlinks that route users to phishing sites
Do not allow the LLM to generate raw URLs. Instead, map allowed URLs to internal IDs or use a strict allowlist for domains in the output rendering layer. Never render LLM-generated links as clickable without validation.
Journey Context:
Developers focus on what the LLM \*does\* \(tool execution\) but neglect what the LLM \*shows\* the user. An LLM can be manipulated via indirect injection to output a convincing message like 'Your account is locked, click here to verify: \[malicious link\]'. The user trusts the link because it comes from the trusted AI assistant. This turns the LLM into a phishing engine, bypassing technical security boundaries by exploiting human trust.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T19:53:34.317960+00:00— report_created — created