| apache/doris |
10,666 |
|
0 |
0 |
about 2 years ago |
8 |
September 27, 2023 |
2,332 |
apache-2.0 |
Java |
| Apache Doris is an easy-to-use, high performance and unified analytics database. |
| dagster-io/dagster |
9,467 |
|
2 |
133 |
about 2 years ago |
585 |
December 07, 2023 |
2,343 |
apache-2.0 |
Python |
| An orchestration platform for the development, production, and observation of data assets. |
| mage-ai/mage-ai |
6,324 |
|
0 |
0 |
about 2 years ago |
314 |
December 06, 2023 |
189 |
apache-2.0 |
Python |
| 🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data. |
| aws-samples/aws-glue-samples |
1,334 |
|
0 |
0 |
over 2 years ago |
0 |
|
37 |
mit-0 |
Python |
| AWS Glue code samples |
| AlexIoannides/pyspark-example-project |
1,034 |
|
0 |
0 |
over 3 years ago |
0 |
|
11 |
|
Python |
| Example project implementing best practices for PySpark ETL jobs and applications. |
| zinggAI/zingg |
828 |
|
0 |
0 |
about 2 years ago |
1 |
June 01, 2022 |
76 |
agpl-3.0 |
Java |
| Scalable identity resolution, entity resolution, data mastering and deduplication using ML |
| san089/goodreads_etl_pipeline |
593 |
|
0 |
0 |
about 6 years ago |
0 |
|
0 |
mit |
Python |
| An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform. |
| awslabs/aws-glue-libs |
568 |
|
0 |
0 |
over 2 years ago |
0 |
|
96 |
other |
Python |
| AWS Glue Libraries are additions and enhancements to Spark for ETL operations. |
| YotpoLtd/metorikku |
536 |
|
0 |
0 |
about 3 years ago |
126 |
February 27, 2023 |
65 |
mit |
Scala |
| A simplified, lightweight ETL Framework based on Apache Spark |
| zhaoyachao/zdh_web |
379 |
|
0 |
0 |
over 2 years ago |
0 |
|
19 |
apache-2.0 |
Java |
| 大数据采集,抽取平台,zdh_web是zdh系列服务的可视化管理平台,包含数据采集,调度,权限,审批流,私域营销等模块 |