| airbytehq/airbyte | 12,918 | | 0 | 11 | about 2 years ago | 311 | December 08, 2023 | 5,111 | other | Python |
| The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted. |
| allegroai/clearml | 5,009 | | 0 | 16 | about 2 years ago | 143 | November 08, 2023 | 426 | apache-2.0 | Python |
| ClearML - Auto-Magical CI/CD to streamline your ML workflow. Experiment Manager, MLOps and Data-Management |
| san089/goodreads_etl_pipeline | 593 | | 0 | 0 | about 6 years ago | 0 | | 0 | mit | Python |
| An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform. |
| datajoint/datajoint-python | 158 | | 23 | 34 | over 2 years ago | 71 | July 31, 2025 | 143 | lgpl-2.1 | Python |
| Relational data pipelines for the science lab |
| aeksco/aws-pdf-textract-pipeline | 148 | | 0 | 0 | over 2 years ago | 0 | | 5 | mit | TypeScript |
| :mag: Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS textract. Built with AWS CDK + TypeScript |
| aws-samples/rds-snapshot-export-to-s3-pipeline | 102 | | 0 | 0 | almost 3 years ago | 0 | | 5 | mit-0 | TypeScript |
| RDS Snapshot Export to S3 Pipeline |
| orangain/scrapy-s3pipeline | 66 | | 1 | 0 | about 4 years ago | 8 | January 31, 2021 | 1 | mit | Python |
| Scrapy pipeline to store chunked items into Amazon S3 or Google Cloud Storage bucket. |
| aws-samples/automating-livestream-video-monitoring | 34 | | 0 | 0 | over 2 years ago | 0 | | 2 | mit-0 | Jupyter Notebook |
| This repo presents a demo application for realtime livestream video quality monitoring using AWS serverless and AI/ML services. |
| chadgeary/nifi | 32 | | 0 | 0 | over 3 years ago | 0 | | 0 | | HCL |
| Deploy a secured, clustered, auto-scaling NiFi service in AWS. |
| shirosaidev/saisoku | 21 | | 0 | 0 | over 5 years ago | 0 | | 0 | apache-2.0 | Python |
| Saisoku is a Python module that helps you build complex pipelines of batch file/directory transfer/sync jobs. |