| puppeteer/puppeteer |
85,859 |
|
12,128 |
15,686 |
about 2 years ago |
851 |
December 06, 2023 |
311 |
apache-2.0 |
TypeScript |
| Node.js API for Chrome |
| apify/crawlee |
11,229 |
|
0 |
42 |
about 2 years ago |
747 |
December 10, 2023 |
96 |
apache-2.0 |
TypeScript |
| Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation. |
| alvarcarto/url-to-pdf-api |
6,946 |
|
0 |
0 |
about 2 years ago |
0 |
|
57 |
mit |
HTML |
| Web page PDF/PNG rendering done right. Self-hosted service for rendering receipts, invoices, or any content. |
| yujiosaka/headless-chrome-crawler |
5,051 |
|
10 |
12 |
over 4 years ago |
21 |
June 11, 2018 |
28 |
mit |
JavaScript |
| Distributed crawler powered by Headless Chrome |
| FlareSolverr/FlareSolverr |
4,623 |
|
0 |
0 |
about 2 years ago |
0 |
|
49 |
mit |
Python |
| Proxy server to bypass Cloudflare protection |
| miyakogi/pyppeteer |
3,240 |
|
0 |
0 |
almost 6 years ago |
0 |
|
154 |
other |
Python |
| Headless chrome/chromium automation library (unofficial port of puppeteer) |
| pyppeteer/pyppeteer |
3,226 |
|
185 |
199 |
over 2 years ago |
31 |
January 11, 2022 |
198 |
other |
Python |
| Headless chrome/chromium automation library (unofficial port of puppeteer) |
| hardkoded/puppeteer-sharp |
3,022 |
|
27 |
120 |
about 2 years ago |
78 |
December 05, 2023 |
120 |
mit |
C# |
| Headless Chrome .NET API |
| puppeteer/examples |
2,299 |
|
0 |
0 |
over 2 years ago |
0 |
|
37 |
apache-2.0 |
JavaScript |
| Use case-driven examples for using Puppeteer and headless chrome |
| emadehsan/thal |
2,268 |
|
0 |
0 |
over 5 years ago |
0 |
|
0 |
mit |
JavaScript |
| Getting started with Puppeteer and Chrome Headless for Web Scraping |