| GoogleCloudPlatform/DataflowJavaSDK |
853 |
|
249 |
14 |
over 5 years ago |
38 |
June 26, 2018 |
54 |
|
|
| Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines. |
| infoslack/awesome-kafka |
549 |
|
0 |
0 |
over 2 years ago |
0 |
|
1 |
|
|
| A list about Apache Kafka |
| apache/incubator-wayang |
162 |
|
1 |
18 |
about 2 years ago |
4 |
June 24, 2025 |
84 |
apache-2.0 |
Java |
| Apache Wayang(incubating) is the first cross-platform data processing system. |
| utdemir/distributed-dataset |
107 |
|
0 |
0 |
almost 6 years ago |
0 |
|
19 |
bsd-3-clause |
Haskell |
| A distributed data processing framework in Haskell. |
| luisbelloch/data_processing_course |
53 |
|
0 |
0 |
over 3 years ago |
0 |
|
5 |
other |
Python |
| Some class materials for a data processing course using PySpark |
| 31z4/storm-docker |
52 |
|
0 |
0 |
about 2 years ago |
0 |
|
0 |
mit |
Dockerfile |
| Docker image packaging for Apache Storm |
| nancyyanyu/kafka_stock |
34 |
|
0 |
0 |
over 4 years ago |
0 |
|
1 |
apache-2.0 |
Python |
| A financial data processing and visualization platform using Apache Kafka, Apache Cassandra, and Bokeh. |
| IBM/ibm-cloud-functions-data-processing-message-hub |
19 |
|
0 |
0 |
almost 7 years ago |
0 |
|
5 |
apache-2.0 |
Shell |
| Create a serverless, event-driven application with Apache OpenWhisk on IBM Cloud Functions that executes code in response to messages or to handle streams of data records from Apache Kafka or IBM Message Hub. |
| ksbg/sparklanes |
16 |
|
1 |
0 |
about 6 years ago |
5 |
January 31, 2019 |
2 |
mit |
Python |
| A lightweight data processing framework for Apache Spark |