| databricks/koalas | 3,291 | | 1 | 16 | over 2 years ago | 47 | October 19, 2021 | 112 | apache-2.0 | Python | Koalas: pandas API on Apache Spark |
| ballista-compute/ballista | 2,244 | | 0 | 13 | almost 5 years ago | 4 | May 10, 2020 | 0 | apache-2.0 | | Distributed compute platform implemented in Rust, and powered by Apache Arrow. |
| graphframes/graphframes | 944 | | 2 | 8 | over 2 years ago | 1 | December 05, 2018 | 165 | apache-2.0 | Scala | |
| microsoft/Mobius | 940 | | 6 | 0 | over 3 years ago | 22 | January 29, 2017 | 88 | mit | C# | C# and F# language binding and extensions to Apache Spark |
| RedisLabs/spark-redis | 926 | | 0 | 3 | over 2 years ago | 5 | June 14, 2022 | 133 | bsd-3-clause | Scala | A connector for Spark that allows reading and writing to/from Redis cluster |
| MrPowers/spark-daria | 738 | | 0 | 1 | about 4 years ago | 7 | February 09, 2022 | 11 | mit | Scala | Essential Spark extensions and helper methods ✨😲 |
| andygrove/datafusion | 626 | | 0 | 0 | about 7 years ago | 0 | | 0 | apache-2.0 | Rust | DataFusion has now been donated to the Apache Arrow project |
| YotpoLtd/metorikku | 536 | | 0 | 0 | about 3 years ago | 126 | February 27, 2023 | 65 | mit | Scala | A simplified, lightweight ETL Framework based on Apache Spark |
| databricks/spark-avro | 535 | | 47 | 39 | over 7 years ago | 8 | October 30, 2017 | 77 | apache-2.0 | Scala | Avro Data Source for Apache Spark |
| polyaxon/traceml | 488 | | 45 | 12 | about 2 years ago | 10 | November 25, 2021 | 6 | apache-2.0 | Python | Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon. |