Report #5222
[tooling] Cloudflare Turnstile or JS challenge blocks a pure HTTP scraper that cannot execute JavaScript
Deploy FlareSolverr as a sidecar proxy and, for each challenged URL, POST a JSON command to http://localhost:8191/v1: {"cmd": "request.get", "url": "...", "maxTimeout": 60000}. It runs a real browser, waits for the challenge to clear, and returns cookies plus HTML. Feed the returned cookies into your existing requests/httpx session for subsequent calls, and send the User-Agent that FlareSolverr reports alongside them, since the clearance cookie is tied to the browser that earned it.
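The request and cookie hand-off described above can be sketched as follows. This is a minimal illustration, not a definitive client: it uses only the standard library, and it assumes the reply carries a top-level `status` of `"ok"` and a `solution` object with `cookies` and `userAgent` fields. The helper names `build_payload`, `solve`, and `session_settings` are hypothetical.

```python
import json
import urllib.request

FLARESOLVERR_URL = "http://localhost:8191/v1"  # endpoint from the report


def build_payload(url, max_timeout_ms=60000):
    """Build the request.get command body described in the report."""
    return {"cmd": "request.get", "url": url, "maxTimeout": max_timeout_ms}


def solve(url, endpoint=FLARESOLVERR_URL, max_timeout_ms=60000):
    """POST the command to FlareSolverr and return its 'solution' object.

    Assumption: the reply looks like {"status": "ok", "solution": {...}}.
    """
    body = json.dumps(build_payload(url, max_timeout_ms)).encode("utf-8")
    req = urllib.request.Request(
        endpoint,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    # Give the HTTP call a little longer than maxTimeout so it outlives the solve.
    with urllib.request.urlopen(req, timeout=max_timeout_ms / 1000 + 10) as resp:
        reply = json.load(resp)
    if reply.get("status") != "ok":
        raise RuntimeError(f"FlareSolverr failed: {reply.get('message')}")
    return reply["solution"]


def session_settings(solution):
    """Extract a cookie dict and a User-Agent header for a plain HTTP client."""
    cookies = {c["name"]: c["value"] for c in solution.get("cookies", [])}
    headers = {}
    if solution.get("userAgent"):
        headers["User-Agent"] = solution["userAgent"]
    return cookies, headers
```

With a solver running, `cookies, headers = session_settings(solve("https://example.com"))` yields two dicts that can be merged into an existing requests or httpx session, for example via `session.cookies.update(cookies)` and `session.headers.update(headers)`.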
Journey Context:
Teams often move their whole scraper to a browser when only the initial challenge page needs one. FlareSolverr decouples challenge solving from data scraping, so you keep the speed and memory efficiency of plain HTTP on the happy path. Self-hosting is free; paid CAPTCHA solvers are only needed for the hardest cases. Tradeoff: each solve adds 5-30 s of latency plus the memory cost of a browser instance, so this approach is a poor fit for high-frequency polling.
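One way to keep the solver off the happy path is to call it only when a response looks challenged, then reuse the cleared cookies for later requests. The sketch below shows that control flow under stated assumptions: `fetch` and `solve` are hypothetical callables you supply (a thin wrapper around your HTTP client and around the FlareSolverr call), and the challenge check is a heuristic that treats a 403 or 503 response with a `cf-mitigated: challenge` header as a challenge.

```python
def looks_challenged(status, headers):
    """Heuristic (assumption): a 403/503 carrying 'cf-mitigated: challenge'."""
    lowered = {k.lower(): v for k, v in headers.items()}
    return status in (403, 503) and lowered.get("cf-mitigated", "").lower() == "challenge"


class ChallengeAwareFetcher:
    """Plain HTTP first; fall back to the solver only when challenged."""

    def __init__(self, fetch, solve):
        # fetch: (url, cookies, headers) -> (status, response_headers, body)
        # solve: url -> solution dict with 'cookies', 'userAgent', 'response'
        self.fetch = fetch
        self.solve = solve
        self.cookies = {}
        self.headers = {}
        self.solves = 0  # how many times the slow path was taken

    def get(self, url):
        status, resp_headers, body = self.fetch(url, self.cookies, self.headers)
        if not looks_challenged(status, resp_headers):
            return body  # happy path: no browser involved

        # Slow path: let the solver clear the challenge, then cache its state.
        solution = self.solve(url)
        self.solves += 1
        self.cookies = {c["name"]: c["value"] for c in solution.get("cookies", [])}
        if solution.get("userAgent"):
            self.headers = {"User-Agent": solution["userAgent"]}
        # The solver already loaded the page, so reuse its HTML when present.
        return solution.get("response", body)
```

Caching the cookies this way means the 5-30 s cost is paid once per clearance rather than once per request; the fetcher re-solves only when a later response looks challenged again.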
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T20:51:39.716848+00:00— report_created — created