| airbytehq/airbyte |
12,918 |
|
0 |
11 |
about 2 years ago |
311 |
December 08, 2023 |
5,111 |
other |
Python |
| The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted. |
| dirkjanm/ROADtools |
1,540 |
|
0 |
2 |
about 2 years ago |
22 |
December 05, 2023 |
10 |
mit |
Python |
| A collection of Azure AD tools for offensive and defensive security purposes |
| moby/datakit |
1,044 |
|
0 |
0 |
about 4 years ago |
0 |
|
34 |
apache-2.0 |
OCaml |
| Connect processes into powerful data pipelines with a simple git-like filesystem interface |
| NeumTry/NeumAI |
693 |
|
0 |
0 |
about 2 years ago |
0 |
|
7 |
apache-2.0 |
Python |
| Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale. |
| frotms/PaddleOCR2Pytorch |
553 |
|
0 |
0 |
almost 3 years ago |
0 |
|
45 |
apache-2.0 |
Python |
| PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR) |
| vmware/versatile-data-kit |
389 |
|
0 |
25 |
about 2 years ago |
181 |
November 28, 2023 |
220 |
apache-2.0 |
Python |
| One framework to develop, deploy and operate data workflows with Python and SQL. |
| geophile/marcel |
326 |
|
0 |
0 |
about 2 years ago |
131 |
November 15, 2023 |
5 |
gpl-3.0 |
Python |
| A modern shell |
| PacktPublishing/Data-Engineering-with-Python |
302 |
|
0 |
0 |
about 3 years ago |
0 |
|
1 |
mit |
Python |
| Data Engineering with Python, published by Packt |
| rdagumampan/yuniql |
292 |
|
1 |
7 |
almost 4 years ago |
25 |
May 25, 2022 |
65 |
apache-2.0 |
C# |
| Free and open source schema versioning and database migration made natively with .NET/6. NEW THIS MAY 2022! v1.3.15 released! |
| logrange/logrange |
192 |
|
0 |
0 |
about 3 years ago |
12 |
February 05, 2021 |
15 |
apache-2.0 |
Go |
| High performance data aggregating storage |