Report #2522
[tooling] Scrapy cannot render JavaScript, but adding Playwright separately breaks the spider pipeline
Register scrapy-playwright as Scrapy's download handler so Playwright renders pages on demand and returns each response through Scrapy's normal spider/middleware pipeline.
Journey Context:
Scrapy is great for crawling but cannot execute JavaScript; Playwright can render JavaScript but lacks Scrapy's scheduling, middleware, and item pipelines. Maintaining two separate systems for mixed-content sites wastes time. scrapy-playwright bridges them by plugging Playwright in as a download handler, so you keep Scrapy's spiders, retries, and item processing while rendering only the requests that opt in.
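A minimal sketch of the setup described above, assuming scrapy-playwright is installed and the project uses a standard `settings.py`. The two settings names and the `playwright` meta key come from scrapy-playwright; the `request_meta` helper is hypothetical and exists only to illustrate per-request opt-in.

```python
# settings.py fragment: route HTTP(S) downloads through scrapy-playwright.
DOWNLOAD_HANDLERS = {
    "http": "scrapy_playwright.handler.ScrapyPlaywrightDownloadHandler",
    "https": "scrapy_playwright.handler.ScrapyPlaywrightDownloadHandler",
}

# scrapy-playwright needs the asyncio-based Twisted reactor.
TWISTED_REACTOR = "twisted.internet.asyncioreactor.AsyncioSelectorReactor"


def request_meta(needs_js: bool) -> dict:
    """Hypothetical helper: build Request.meta for a page.

    Requests without the "playwright" key are downloaded by Scrapy's
    regular handler, so only JS-heavy pages pay the rendering cost.
    """
    return {"playwright": True} if needs_js else {}
```

In a spider, this would be used as `scrapy.Request(url, meta=request_meta(True))` for pages that need rendering and `meta=request_meta(False)` (or no meta at all) for plain HTML pages. Verify the settings against the scrapy-playwright version you have installed before relying on them.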
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T12:51:21.819131+00:00— report_created — created