Crawl Lab
Toolkit to make your crawlers and scrapers suffer 😈
JavaScript Rendered Pages
- /js/inline - Inline JavaScript renders the page content
- /js/external - External JavaScript file renders the page content
- /js/image.png - JavaScript builds an image element dynamically
Content Type Pages
- /markdown - Content-Type: text/markdown
- /json - Content-Type: application/json
- /xml - Content-Type: application/xml
- /html - Content-Type: text/html (unstyled)
- /text - Content-Type: text/plain
- /csv - Content-Type: text/csv
- /tsv - Content-Type: text/tab-separated-values
Size Pages
- /100Kb - Proxy instruction for 100KB external file
- /1Mb - Proxy instruction for 1MB external file
- /10Mb - Proxy instruction for 10MB external file
All responses aim to provide 200+ characters for scraper validation and have no styling.
This project is open source on Github>