| apify/crawlee |
11,229 |
|
0 |
42 |
about 2 years ago |
747 |
December 10, 2023 |
96 |
apache-2.0 |
TypeScript |
| Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation. |
| projectdiscovery/katana |
7,995 |
|
0 |
1 |
about 2 years ago |
8 |
September 14, 2023 |
67 |
mit |
Go |
| A next-generation crawling and spidering framework. |
| yujiosaka/headless-chrome-crawler |
5,051 |
|
10 |
12 |
over 4 years ago |
21 |
June 11, 2018 |
28 |
mit |
JavaScript |
| Distributed crawler powered by Headless Chrome |
| go-rod/rod |
4,505 |
|
0 |
140 |
about 2 years ago |
406 |
November 06, 2023 |
106 |
mit |
Go |
| A Devtools driver for web automation and scraping |
| hardkoded/puppeteer-sharp |
3,022 |
|
27 |
120 |
about 2 years ago |
78 |
December 05, 2023 |
120 |
mit |
C# |
| Headless Chrome .NET API |
| Qianlitp/crawlergo |
3,016 |
|
0 |
0 |
about 1 year ago |
2 |
December 06, 2022 |
32 |
gpl-3.0 |
Go |
| A powerful browser crawler for web vulnerability scanners |
| transitive-bullshit/awesome-puppeteer |
2,245 |
|
0 |
0 |
over 2 years ago |
0 |
|
19 |
|
|
| A curated list of awesome puppeteer resources. |
| rendora/rendora |
1,950 |
|
0 |
0 |
about 3 years ago |
1 |
January 04, 2019 |
28 |
apache-2.0 |
Go |
| dynamic server-side rendering using headless Chrome to effortlessly solve the SEO problem for modern javascript websites |
| vifreefly/kimuraframework |
874 |
|
4 |
2 |
about 4 years ago |
10 |
January 30, 2019 |
34 |
mit |
Ruby |
| Kimurai is a modern web scraping framework written in Ruby which works out of box with Headless Chromium/Firefox, PhantomJS, or simple HTTP requests and allows to scrape and interact with JavaScript rendered websites |
| slotix/dataflowkit |
710 |
|
0 |
0 |
about 3 years ago |
0 |
|
0 |
bsd-3-clause |
Go |
| Extract structured data from web sites. Web sites scraping. |