| psf/requests-html |
13,100 |
|
0 |
0 |
almost 3 years ago |
6 |
July 26, 2022 |
198 |
mit |
Python |
| Pythonic HTML Parsing for Humans™ |
| monperrus/crawler-user-agents |
1,045 |
|
5 |
8 |
over 2 years ago |
118 |
November 20, 2023 |
7 |
mit |
Python |
| Syntactic patterns of HTTP user-agents used by bots / robots / crawlers / scrapers / spiders. pull-request welcome :star: |
| scrapinghub/scrapyrt |
793 |
|
5 |
0 |
over 2 years ago |
7 |
September 20, 2023 |
31 |
bsd-3-clause |
Python |
| HTTP API for Scrapy spiders |
| benibela/xidel |
611 |
|
0 |
0 |
over 2 years ago |
0 |
|
18 |
gpl-3.0 |
Pascal |
| Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents. |
| f-prime/HTTPLang |
508 |
|
0 |
0 |
over 8 years ago |
0 |
|
2 |
mit |
Python |
| A scripting langauge to do HTTP routines. |
| daijro/hrequests |
327 |
|
0 |
0 |
over 2 years ago |
14 |
September 10, 2023 |
3 |
apache-2.0 |
Python |
| 🚀 Web scraping for humans |
| DarkSand/Sasila |
264 |
|
0 |
0 |
over 6 years ago |
17 |
November 02, 2017 |
1 |
apache-2.0 |
Python |
| 一个灵活、友好的爬虫框架 |
| iw4p/proxy-scraper |
260 |
|
0 |
0 |
about 3 years ago |
0 |
|
12 |
|
Python |
| scrape proxies from more than 5 different sources and check which ones are still alive |
| jamesturk/scrapelib |
195 |
|
126 |
12 |
over 2 years ago |
44 |
December 15, 2023 |
4 |
bsd-2-clause |
Python |
| ⛏ a library for scraping unreliable pages |
| egoist/tokio |
144 |
|
1 |
2 |
about 4 years ago |
3 |
May 14, 2018 |
3 |
mit |
JavaScript |
| Web scraping made simple. |