| elastic/eland |
588 |
|
0 |
3 |
about 2 years ago |
30 |
November 22, 2023 |
88 |
apache-2.0 |
Python |
| Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch |
| YotpoLtd/metorikku |
536 |
|
0 |
0 |
about 3 years ago |
126 |
February 27, 2023 |
65 |
mit |
Scala |
| A simplified, lightweight ETL Framework based on Apache Spark |
| grailbio/bigslice |
525 |
|
0 |
0 |
almost 3 years ago |
13 |
April 05, 2021 |
23 |
apache-2.0 |
Go |
| A serverless cluster computing system for the Go programming language |
| zhaoyachao/zdh_web |
379 |
|
0 |
0 |
over 2 years ago |
0 |
|
19 |
apache-2.0 |
Java |
| 大数据采集,抽取平台,zdh_web是zdh系列服务的可视化管理平台,包含数据采集,调度,权限,审批流,私域营销等模块 |
| smooks/smooks |
377 |
|
0 |
14 |
about 2 years ago |
5 |
June 19, 2023 |
19 |
other |
Java |
| Extensible data integration Java framework for building XML and non-XML fragment-based applications |
| houshanren/big_data_architect_skills |
353 |
|
0 |
0 |
over 6 years ago |
0 |
|
1 |
|
|
| 一个大数据架构师应该掌握的技能 |
| aws-samples/aws-etl-orchestrator |
185 |
|
0 |
0 |
over 6 years ago |
0 |
|
1 |
other |
Python |
| A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda. |
| SETL-Framework/setl |
172 |
|
0 |
0 |
over 2 years ago |
4 |
August 21, 2020 |
5 |
apache-2.0 |
Scala |
| A simple Spark-powered ETL framework that just works 🍺 |
| alibaba/GraphAr |
145 |
|
0 |
0 |
about 2 years ago |
0 |
|
56 |
apache-2.0 |
C++ |
| An open source, standard data file format for graph data storage and retrieval |
| 51zero/eel-sdk |
140 |
|
1 |
17 |
over 5 years ago |
103 |
February 11, 2019 |
25 |
apache-2.0 |
Scala |
| Big Data Toolkit for the JVM |