| ICT-BDA/EasyML |
1,966 |
|
0 |
0 |
over 2 years ago |
0 |
|
47 |
apache-2.0 |
Java |
| Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real world tasks. |
| DTStack/Taier |
1,220 |
|
0 |
0 |
about 2 years ago |
2 |
February 25, 2022 |
61 |
apache-2.0 |
Java |
| Taier is a big data development platform for submission, scheduling, operation and maintenance, and indicator information display |
| san089/goodreads_etl_pipeline |
593 |
|
0 |
0 |
about 6 years ago |
0 |
|
0 |
mit |
Python |
| An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform. |
| MeetYouDevs/big-whale |
290 |
|
0 |
0 |
over 2 years ago |
0 |
|
11 |
apache-2.0 |
Java |
| Spark、Flink等离线任务的调度以及实时任务的监控 |
| cordon-thiago/airflow-spark |
64 |
|
0 |
0 |
about 4 years ago |
0 |
|
6 |
|
Python |
| Docker with Airflow and Spark standalone cluster |
| isaacmg/fb_scraper |
52 |
|
0 |
0 |
about 7 years ago |
0 |
|
10 |
apache-2.0 |
Jupyter Notebook |
| FBLYZE is a Facebook scraping system and analysis system. |
| vivek-bombatkar/Spark-with-Python---My-learning-notes- |
39 |
|
0 |
0 |
about 6 years ago |
0 |
|
0 |
|
CSS |
| ETL pipeline using pyspark (Spark - Python) |
| hortonworks/spark-native-yarn |
32 |
|
0 |
0 |
about 9 years ago |
0 |
|
6 |
apache-2.0 |
Scala |
| Tez port for Spark API |
| KenSuenobu/scattersphere |
30 |
|
0 |
0 |
over 7 years ago |
0 |
|
12 |
apache-2.0 |
Scala |
| Job Coordination API for Tasks |
| panovvv/airflow-livy-operators |
17 |
|
0 |
0 |
over 3 years ago |
8 |
August 27, 2021 |
3 |
mit |
Python |
| Lets Airflow DAGs run Spark jobs via Livy: sessions and/or batches. |