Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
The Top 10 Spark Open Source Projects
Open source projects categorized as Spark
Categories
>
Data Processing
>
Spark
Edit Category
apache/spark
⭐
37,661
Apache Spark - A unified analytics engine for large-scale data processing
dependent packages
0
total releases
0
most recent commit
about 2 years ago
donnemartin/data-science-ipython-notebooks
⭐
25,668
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
dependent packages
0
total releases
0
most recent commit
over 2 years ago
getredash/redash
⭐
24,479
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
dependent packages
0
total releases
0
most recent commit
about 2 years ago
yeasy/docker_practice
⭐
23,279
Learn and understand Docker&Container technologies, with real DevOps practice!
dependent packages
0
total releases
0
most recent commit
over 2 years ago
DataTalksClub/data-engineering-zoomcamp
⭐
19,461
Free Data Engineering course!
dependent packages
0
total releases
0
most recent commit
about 2 years ago
heibaiying/BigData-Notes
⭐
14,872
大数据入门指南 :star:
dependent packages
0
total releases
0
most recent commit
over 2 years ago
zhisheng17/flink-learning
⭐
13,801
flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
dependent packages
0
total releases
0
most recent commit
over 2 years ago
horovod/horovod
⭐
13,755
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
dependent packages
0
total releases
0
most recent commit
about 2 years ago
aalansehaiyang/technology-talk
⭐
13,579
【大厂面试专栏】一份Java程序员需要的技术指南,这里有面试题、系统架构、职场锦囊、主流中间件等,让你成为更牛的自己!
dependent packages
0
total releases
0
most recent commit
over 2 years ago
deeplearning4j/deeplearning4j
⭐
13,290
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learning using automatic differentiation.
dependent packages
0
total releases
0
most recent commit
over 2 years ago
Get A Weekly Email With Trending Spark Projects
No Spam. Unsubscribe easily at any time.
Spark
Subscribe
Javascript must be enabled to subscribe.
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2026 Awesome Open Source. All rights reserved.