| Repository | Stars | Last Commit | Releases | Latest Release | Open Issues | License | Language | Description |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| crawlab-team/crawlab | 10,521 | over 2 years ago | 1 | March 03, 2019 | 58 | BSD-3-Clause | Go | Distributed web crawler admin platform for spider management, regardless of language or framework. |
| gnemoug/distribute_crawler | 3,176 | almost 9 years ago | 0 | | 26 | | Python | A distributed web crawler built with Scrapy, Redis, MongoDB, and Graphite: a MongoDB cluster for underlying storage, Redis for distribution, and Graphite for displaying crawler status. |
| ramsayleung/jd_spider | 728 | about 7 years ago | 0 | | 2 | | Python | Two dumb distributed crawlers. |
| lb2281075105/Python-Spider | 680 | over 3 years ago | 0 | | 0 | Apache-2.0 | Python | Crawler demos: Douban movie top 250; Douyu JSON data and image scraping; Taobao; Youyuan; CrawlSpider scraping of matchmaker profiles from Hongniang, plus distributed crawling of Hongniang with Redis storage; small crawler demos; Selenium; Duodian scraping; a Django API; scraping Youyuan profile data; simulated logins for Zhihu, GitHub, and Tuchong; full-site scraping of the Duodian mall; scraping WeChat official-account article history; scraping articles shared in WeChat groups or by WeChat friends; itchat monitoring of articles shared by a specified WeChat official account. |
| lixiang0/WEB_KG | 435 | over 5 years ago | 0 | | 9 | | Python | Crawls Chinese Baidu Baike pages, extracts triples, and builds a Chinese knowledge graph. |
| MaLei666/Spider | 356 | almost 7 years ago | 0 | | 8 | | Python | Crawler examples: Weibo, Bilibili, CSDN, Taobao, Toutiao, Zhihu, Douban, the Zhihu app, and Dianping. |
| sebdah/scrapy-mongodb | 327 | almost 8 years ago | 22 | January 08, 2018 | 6 | other | Python | MongoDB pipeline for Scrapy. Supports both standalone MongoDB setups and replica sets; scrapy-mongodb inserts items into MongoDB as soon as your spider finds data to extract. |
| fankcoder/findtrip | 324 | about 10 years ago | 0 | | 1 | | Python | Flight-ticket web spider for Qunar and Ctrip (Scrapy + Selenium + PhantomJS + MongoDB). |
| alanchn31/Data-Engineering-Projects | 322 | about 3 years ago | 0 | | 5 | | Jupyter Notebook | Personal data engineering projects. |
| teamssix/pigat | 187 | almost 4 years ago | 0 | | 1 | | Python | pigat (Passive Intelligence Gathering Aggregation Tool). |
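The scrapy-mongodb entry above describes an item pipeline that writes each item to MongoDB the moment the spider extracts it. As a rough illustration of that pattern (not scrapy-mongodb's actual API), here is a minimal Scrapy-style pipeline sketch; the collection object is injected, and a hypothetical `FakeCollection` stands in for a pymongo collection so the example runs without Scrapy or pymongo installed:

```python
class MongoDBPipeline:
    """Sketch of a MongoDB item pipeline: store each item as it arrives."""

    def __init__(self, collection):
        # Any object exposing insert_one(); with pymongo this would be
        # e.g. MongoClient()["mydb"]["items"] (assumed names, for illustration).
        self.collection = collection

    def process_item(self, item, spider=None):
        # Insert the item as soon as the spider yields it, then pass it
        # along so later pipelines can still process it.
        self.collection.insert_one(dict(item))
        return item


class FakeCollection:
    """Stand-in for a pymongo collection, for demonstration only."""

    def __init__(self):
        self.docs = []

    def insert_one(self, doc):
        self.docs.append(doc)


collection = FakeCollection()
pipeline = MongoDBPipeline(collection)
pipeline.process_item({"title": "example", "price": 9.99})
print(len(collection.docs))  # 1 document stored
```

In a real Scrapy project the pipeline class would be registered under `ITEM_PIPELINES` in `settings.py`, and the fake collection replaced by a pymongo handle.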