| Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
|---|---|---|---|---|---|---|---|---|---|---|
| donnemartin/data-science-ipython-notebooks | 25,668 | 0 | 0 | over 2 years ago | 0 | 34 | other | Python | ||
| Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines. | ||||||||||
| heibaiying/BigData-Notes | 14,872 | 0 | 0 | over 2 years ago | 0 | 39 | Java | |||
| 大数据入门指南 :star: | ||||||||||
| andkret/Cookbook | 12,557 | 0 | 0 | over 2 years ago | 0 | 111 | apache-2.0 | |||
| The Data Engineering Cookbook | ||||||||||
| apache/hive | 5,222 | 0 | 0 | about 2 years ago | 0 | 89 | apache-2.0 | Java | ||
| Apache Hive | ||||||||||
| twitter/scalding | 3,433 | 37 | 40 | almost 3 years ago | 43 | September 14, 2016 | 319 | apache-2.0 | Scala | |
| A Scala API for Cascading | ||||||||||
| Yelp/mrjob | 2,584 | 112 | 2 | over 3 years ago | 62 | December 15, 2021 | 211 | other | Python | |
| Run MapReduce jobs on Hadoop or Amazon Web Services | ||||||||||
| Qihoo360/poseidon | 1,543 | 0 | 0 | almost 9 years ago | 0 | 9 | bsd-3-clause | Go | ||
| A search engine which can hold 100 trillion lines of log data. | ||||||||||
| mongodb/mongo-hadoop | 1,511 | 78 | 10 | about 4 years ago | 14 | January 27, 2017 | 16 | Java | ||
| MongoDB Connector for Hadoop | ||||||||||
| will-che/BigData-Interview | 1,397 | 0 | 0 | over 4 years ago | 0 | |||||
| :dart: :star2:[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结 | ||||||||||
| collabH/bigdata-growth | 1,162 | 0 | 0 | about 2 years ago | 0 | 1 | mit | Shell | ||
| 大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。 | ||||||||||