| svenkreiss/pysparkling |
253 |
|
7 |
1 |
over 3 years ago |
69 |
November 13, 2022 |
9 |
other |
Python |
| A pure Python implementation of Apache Spark's RDD and DStream interfaces. |
| apache/incubator-wayang |
162 |
|
1 |
18 |
about 2 years ago |
4 |
June 24, 2025 |
84 |
apache-2.0 |
Java |
| Apache Wayang(incubating) is the first cross-platform data processing system. |
| syoummer/SpatialSpark |
141 |
|
0 |
0 |
about 9 years ago |
1 |
March 07, 2017 |
6 |
apache-2.0 |
Scala |
| Big Spatial Data Processing using Spark |
| utdemir/distributed-dataset |
107 |
|
0 |
0 |
almost 6 years ago |
0 |
|
19 |
bsd-3-clause |
Haskell |
| A distributed data processing framework in Haskell. |
| streamnative/pulsar-spark |
103 |
|
0 |
2 |
over 2 years ago |
10 |
November 06, 2023 |
9 |
apache-2.0 |
Scala |
| Spark Connector to read and write with Pulsar |
| luisbelloch/data_processing_course |
53 |
|
0 |
0 |
over 3 years ago |
0 |
|
5 |
other |
Python |
| Some class materials for a data processing course using PySpark |
| asavinov/prosto |
53 |
|
0 |
0 |
over 4 years ago |
5 |
November 21, 2021 |
5 |
mit |
Python |
| Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby |
| csimplestring/delta-go |
26 |
|
0 |
0 |
over 2 years ago |
0 |
|
4 |
|
Go |
| Native Delta Lake Implementation in Go |
| daxnet/abacuza |
19 |
|
0 |
0 |
about 4 years ago |
0 |
|
12 |
apache-2.0 |
JavaScript |
| Easing your on-premise Data Processing |
| streamnative/pulsar-hub |
17 |
|
0 |
0 |
over 2 years ago |
0 |
|
8 |
apache-2.0 |
JavaScript |
| The canonical source of StreamNative Hub. |