| dagster-io/dagster |
9,467 |
|
2 |
133 |
about 2 years ago |
585 |
December 07, 2023 |
2,343 |
apache-2.0 |
Python |
| An orchestration platform for the development, production, and observation of data assets. |
| mage-ai/mage-ai |
6,324 |
|
0 |
0 |
about 2 years ago |
314 |
December 06, 2023 |
189 |
apache-2.0 |
Python |
| 🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data. |
| salesforce/TransmogrifAI |
2,099 |
|
0 |
3 |
about 4 years ago |
9 |
June 11, 2020 |
44 |
bsd-3-clause |
Scala |
| TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning |
| tencentmusic/cube-studio |
1,710 |
|
0 |
0 |
about 2 years ago |
1 |
October 13, 2022 |
74 |
other |
Jupyter Notebook |
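| cube studio is an open-source, cloud-native, one-stop machine learning/deep learning AI platform. It supports SSO login, multi-tenancy/multiple project groups, data asset integration, online notebook development, drag-and-drop task-flow pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving with vGPU, multi-cluster scheduling, edge computing, serverless, an annotation platform, automated annotation, dataset management, one-click fine-tuning of large models, LLMOps, a private knowledge base, an AI app store, one-click model development/inference/fine-tuning, private deployment, Chinese domestic CPU/GPU/NPU chips, RDMA, and distributed pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano. |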
| combust/mleap |
1,479 |
|
15 |
12 |
over 2 years ago |
26 |
May 07, 2021 |
109 |
apache-2.0 |
Scala |
| MLeap: Deploy ML Pipelines to Production |
| ColZer/DigAndBuried |
645 |
|
0 |
0 |
over 9 years ago |
0 |
|
4 |
|
GCC Machine Description |
| Digging holes and filling them in |
| san089/goodreads_etl_pipeline |
593 |
|
0 |
0 |
about 6 years ago |
0 |
|
0 |
mit |
Python |
| An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform. |
| amplab/keystone |
472 |
|
0 |
0 |
almost 9 years ago |
5 |
March 03, 2017 |
39 |
apache-2.0 |
Scala |
| Simplifying robust end-to-end machine learning on Apache Spark. |
| jamesward/koober |
301 |
|
0 |
0 |
about 8 years ago |
0 |
|
3 |
|
Scala |
| lifeomic/sparkflow |
301 |
|
0 |
0 |
over 2 years ago |
13 |
May 18, 2019 |
9 |
mit |
Python |
| Easy to use library to bring Tensorflow on Apache Spark |