Nutch Alternatives

Apache Nutch is an extensible and scalable web crawler
Suggest Alternative
Alternatives To apache/nutch
Project Name Stars Downloads Repos Using This Packages Using This Most Recent Commit Total Releases Latest Release Open Issues License Language
apache/nutch 2,742 82 1 about 2 years ago 26 August 22, 2022 14 apache-2.0 Java
Apache Nutch is an extensible and scalable web crawler
DigitalPebble/storm-crawler 834 7 10 about 2 years ago 36 October 25, 2023 34 apache-2.0 HTML
A scalable, mature and versatile web crawler based on Apache Storm
USCDataScience/sparkler 401 0 0 about 3 years ago 0 55 apache-2.0 Java
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
liinnux/awesome-crawler-cn 243 0 0 over 3 years ago 0 0 mit
互联网爬虫,蜘蛛,数据采集器,网页解析器的汇总,因新技术不断发展,新框架层出不穷,此文会不断更新...
xautlx/nutch-htmlunit 122 0 0 almost 11 years ago 0 1 apache-2.0 Java
基于Apache Nutch和Htmlunit的扩展实现AJAX页面爬虫抓取解析插件
nasa-jpl-memex/memex-explorer 106 0 0 about 10 years ago 0 67 bsd-2-clause Python
Viewers for statistics and dashboarding of Domain Search Engine data
abola/CrawlerPack 99 51 0 over 9 years ago 9 December 10, 2016 0 apache-2.0 Java
Java 網路資料爬蟲包
heyZeus/clj-web-crawler 38 0 0 almost 15 years ago 0 0 mit Clojure
A wrapper around Apache commons-client for the Clojure programming language.
tpickett/mongo-elasticsearch-nutch 15 0 0 over 10 years ago 0 2 Shell
Docker image for creating a single Apache Nutch server, with mongodb as crawl storage and Elasticsearch for indexing
yegor256/nutch-in-java 14 0 0 over 3 years ago 0 1 mit Java
How to use Apache Nutch without command line
Alternatives To apache/nutch
Select To Compare


Alternative Project Comparisons
Popular Crawler Projects
Popular Apache Projects
Popular Data Processing Categories
Related Searches
Get A Weekly Email With Trending Projects
No Spam. Unsubscribe easily at any time.
Privacy | About | Terms | Follow Us On Twitter

Downloads, Dependent Repos, Dependent Packages, Total Releases, Latest Releases data powered by Libraries.io.

Copyright 2018-2026 Awesome Open Source.  All rights reserved.