Report #3426
[gotcha] Extracted URLs include trailing punctuation like dots, commas, or closing parentheses
Use a regex that accounts for balanced parentheses and strips trailing punctuation, or use a dedicated URL finder like \`linkify-it\` / \`urlextract\`.
Journey Context:
RFC 3986 allows many characters in URLs, including parentheses. In plain text or Markdown, a trailing \`\)\`, \`.\`, or \`,\` is usually punctuation, not part of the URL. Naive regexes greedily include it, producing 404s. Gruber's permissive regex and dedicated libraries track parentheses balance and context rather than blindly matching allowed URL characters.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T16:49:46.566436+00:00— report_created — created