| GoogleCloudPlatform/DataflowJavaSDK | 853 | | 249 | 14 | over 5 years ago | 38 | June 26, 2018 | 54 | | |
| Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines. |
| lithops-cloud/lithops | 295 | | 0 | 3 | about 2 years ago | 45 | December 05, 2023 | 7 | apache-2.0 | Python |
| A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀 |
| apache/incubator-wayang | 162 | | 1 | 18 | about 2 years ago | 4 | June 24, 2025 | 84 | apache-2.0 | Java |
| Apache Wayang (incubating) is the first cross-platform data processing system. |
| luisbelloch/data_processing_course | 53 | | 0 | 0 | over 3 years ago | 0 | | 5 | other | Python |
| Some class materials for a data processing course using PySpark |
| csimplestring/delta-go | 26 | | 0 | 0 | over 2 years ago | 0 | | 4 | | Go |
| Native Delta Lake Implementation in Go |
| asavinov/machine-learning-and-data-processing | 15 | | 0 | 0 | about 3 years ago | 0 | | 0 | | |
| A collection of resources on machine learning, data processing and related areas |
| elgeish/Computing-with-Data | 15 | | 0 | 0 | over 2 years ago | 0 | | 1 | | Java |
| Code samples for my book "Computing with Data: An Introduction to the Data Industry" |
| marcelmittelstaedt/BigData | 12 | | 0 | 0 | over 2 years ago | 0 | | 1 | | HTML |
| Lecture: Big Data |
| brunocampos01/data-paths | 11 | | 0 | 0 | almost 3 years ago | 0 | | 0 | mit | Python |
| eBay/ExpertmakerAccelerator | 7 | | 0 | 0 | almost 8 years ago | 0 | | 0 | | |