| delta-io/delta | 6,656 | | 0 | 45 | about 2 years ago | 24 | May 24, 2023 | 601 | apache-2.0 | HTML | An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs |
| Alluxio/alluxio | 6,544 | | 31 | 53 | about 2 years ago | 73 | November 29, 2023 | 969 | apache-2.0 | Java | Alluxio, data orchestration for analytics and machine learning in the cloud |
| databricks/learning-spark | 3,804 | | 0 | 0 | over 3 years ago | 0 | | 30 | mit | Java | Example code from Learning Spark book |
| holdenk/spark-testing-base | 1,475 | | 25 | 52 | about 2 years ago | 295 | October 10, 2023 | 101 | apache-2.0 | Scala | Base classes to use when writing tests with Spark |
| deanwampler/spark-scala-tutorial | 922 | | 0 | 0 | over 5 years ago | 0 | | 3 | other | Jupyter Notebook | A free tutorial for Apache Spark. |
| typelevel/frameless | 867 | | 0 | 21 | about 2 years ago | 9 | September 27, 2023 | 40 | apache-2.0 | Scala | Expressive types for Spark. |
| mongodb/mongo-spark | 692 | | 58 | 5 | about 2 years ago | 42 | April 05, 2022 | 1 | apache-2.0 | Java | The MongoDB Spark Connector |
| TalkingData/Fregata | 674 | | 0 | 0 | about 8 years ago | 3 | August 01, 2017 | 8 | other | Scala | A lightweight, super fast, large-scale machine learning library on Spark. |
| databricks/spark-training | 365 | | 0 | 0 | over 10 years ago | 0 | | 4 | | Scala | Apache Spark training material |
| amplab/graphx | 353 | | 0 | 0 | over 3 years ago | 0 | | 31 | apache-2.0 | Scala | Former GraphX development repository. GraphX has been merged into Apache Spark; please submit pull requests there. |