| scrapinghub/splash |
3,860 |
|
9 |
1 |
almost 3 years ago |
30 |
June 16, 2020 |
397 |
bsd-3-clause |
Python |
| Lightweight, scriptable browser as a service with an HTTP API |
| scrapy/scrapyd |
2,766 |
|
187 |
15 |
about 2 years ago |
11 |
September 25, 2023 |
31 |
bsd-3-clause |
Python |
| A service daemon to run Scrapy spiders |
| scrapinghub/scrapyrt |
793 |
|
5 |
0 |
over 2 years ago |
7 |
September 20, 2023 |
31 |
bsd-3-clause |
Python |
| HTTP API for Scrapy spiders |
| TeamHG-Memex/arachnado |
148 |
|
0 |
0 |
about 4 years ago |
1 |
August 07, 2015 |
21 |
|
Python |
| Web Crawling UI and HTTP API, based on Scrapy and Tornado |
| TeamHG-Memex/autologin |
106 |
|
0 |
0 |
about 4 years ago |
5 |
May 24, 2017 |
12 |
apache-2.0 |
Python |
| A project to attempt to automatically login to a website given a single seed |
| zhangslob/docs |
102 |
|
0 |
0 |
almost 7 years ago |
0 |
|
3 |
|
|
| 《数据采集从入门到放弃》源码。内容简介:爬虫介绍、就业情况、爬虫工程师面试题 ;HTTP协议介绍; Requests使用 ;解析器Xpath介绍; MongoDB与MySQL; 多线程爬虫; Scrapy介绍 ;Scrapy-redis介绍; 使用docker部署; 使用nomad管理docker集群; 使用EFK查询docker日志 |
| monkey-soft/Scrapy_IPProxyPool |
86 |
|
0 |
0 |
over 7 years ago |
0 |
|
4 |
|
Python |
| 免费 IP 代理池。Scrapy 爬虫框架插件 |
| zkqiang/awesome-python-primer |
78 |
|
0 |
0 |
over 3 years ago |
0 |
|
0 |
mit |
Python |
| 自学入门 Python 优质中文资源索引,包含 书籍 / 文档 / 视频,适用于 爬虫 / Web / 数据分析 / 机器学习 方向 |
| kuaidaili/python-sdk |
54 |
|
0 |
2 |
almost 3 years ago |
13 |
October 10, 2022 |
0 |
bsd-2-clause |
Python |
| 快代理API SDK Python和官方代码样例 |
| cdrx/scrapyd-authenticated |
39 |
|
0 |
0 |
about 4 years ago |
0 |
|
4 |
mit |
Dockerfile |
| Docker container running scrapyd with HTTP authentication |