| orchest/orchest | 3,876 | | 0 | 0 | almost 3 years ago | 19 | December 13, 2022 | 125 | apache-2.0 | TypeScript |
| Build data pipelines, the easy way 🛠️ |
| opensemanticsearch/open-semantic-search | 741 | | 0 | 0 | about 3 years ago | 0 | | 187 | gpl-3.0 | Shell |
| Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph) |
| insitro/redun | 464 | | 0 | 1 | over 2 years ago | 18 | November 12, 2023 | 28 | apache-2.0 | Python |
| Yet another redundant workflow engine |
| appbaseio/abc | 455 | | 0 | 0 | over 2 years ago | 0 | | 29 | apache-2.0 | Go |
| Power of appbase.io via CLI, with nifty imports from your favorite data sources |
| nucleuscloud/neosync | 413 | | 0 | 0 | about 2 years ago | 0 | | 38 | mit | TypeScript |
| A developer-first way to create high-fidelity synthetic data or anonymize sensitive data and sync it across all environments for testing, fine-tuning or model training. |
| smooks/smooks | 377 | | 0 | 14 | about 2 years ago | 5 | June 19, 2023 | 19 | other | Java |
| Extensible data integration Java framework for building XML and non-XML fragment-based applications |
| josephmachado/beginner_de_project | 276 | | 0 | 0 | about 3 years ago | 0 | | 1 | mit | HCL |
| Beginner data engineering project - batch edition |
| fedspendingtransparency/usaspending-api | 273 | | 0 | 0 | over 2 years ago | 0 | | 59 | cc0-1.0 | Python |
| Server application to serve U.S. federal spending data via a RESTful API |
| linkedpipes/etl | 135 | | 0 | 0 | over 2 years ago | 0 | | 188 | other | Java |
| LinkedPipes ETL is an RDF based, lightweight ETL tool |
| nicor88/aws-ecs-airflow | 110 | | 0 | 0 | almost 5 years ago | 0 | | 6 | mit | HCL |
| Run Airflow in AWS ECS (Elastic Container Service) using Fargate tasks |