| CrawlScript/WebCollector |
2,974 |
|
123 |
1 |
almost 3 years ago |
22 |
June 03, 2023 |
61 |
gpl-3.0 |
Java |
| WebCollector is an open source web crawler framework based on Java. It provides simple interfaces for crawling the Web; you can set up a multi-threaded web crawler in less than 5 minutes. |
| markdalgleish/static-site-generator-webpack-plugin |
1,538 |
|
3,586 |
214 |
over 7 years ago |
21 |
November 19, 2018 |
45 |
mit |
JavaScript |
| Minimal, unopinionated static site generator powered by webpack |
| scrapy-plugins/scrapy-zyte-smartproxy |
343 |
|
29 |
0 |
over 2 years ago |
16 |
December 01, 2020 |
7 |
bsd-3-clause |
Python |
| Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy |
| Kharacternyk/dotcommon |
327 |
|
0 |
0 |
almost 6 years ago |
0 |
|
1 |
gpl-3.0 |
Python |
| What do people have in their dotfiles? |
| ADmad/CakePHP-HybridAuth |
81 |
|
13 |
1 |
over 7 years ago |
14 |
July 11, 2018 |
9 |
mit |
PHP |
| CakePHP plugin for HybridAuth |
| jae-jae/QueryList-PhantomJS |
45 |
|
7 |
4 |
about 7 years ago |
2 |
September 30, 2017 |
7 |
|
PHP |
| QueryList Plugin: Use PhantomJS (headless WebKit) to crawl pages rendered dynamically by JavaScript. |
| HoussemCharf/FunUtils |
42 |
|
0 |
0 |
over 2 years ago |
0 |
|
9 |
mit |
Python |
| Some code I wrote to help me with my daily errands ;) |
| ijanos/ebedke |
33 |
|
0 |
0 |
about 5 years ago |
0 |
|
0 |
other |
Python |
| crawl pages to check what is for lunch today |
| scrapy-plugins/scrapy-zyte-api |
30 |
|
0 |
1 |
about 2 years ago |
23 |
October 19, 2023 |
21 |
bsd-3-clause |
Python |
| Zyte API integration for Scrapy |
| momer/nutch-selenium |
27 |
|
0 |
0 |
almost 10 years ago |
0 |
|
2 |
apache-2.0 |
Java |