Airflow Alternatives

Name: mpavanetti/airflow
Brand: mpavanetti/airflow
SKU: project/mpavanetti/airflow
Rating: 4.42 (8 reviews)

This set of code and instructions has the porpouse to instanciate a compiled environment with set of docker images like airflow webserver, airflow scheduler, postgresql, pyspark, Data Pipeline consuming data from weather api , processing with pyspark and storing in postgresql

Categories > Data Processing > Postgresql

Suggest Alternative

Stars

Alternatives

License

No license specified

Open Issues

Most Recent Commit

over 2 years ago

Programming Language

PHP

Dependent Repos

Dependent Packages

Total Releases

Categories

Programming Languages > Php

Data Storage > Postgresql

Data Processing > Pipeline

Data Processing > Spark

Data Processing > Data Engineering

Control Flow > Airflow

Data Processing > Pyspark

Repo

Alternatives To mpavanetti/airflow

Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
combust/mleap	1,479	15	12	over 2 years ago	26	May 07, 2021	109	apache-2.0	Scala
MLeap: Deploy ML Pipelines to Production
quintoandar/butterfree	269	0	1	over 2 years ago	35	November 14, 2023	6	apache-2.0	Python
A tool for building feature stores.
Morphl-AI/MorphL-Community-Edition	233	0	0	almost 7 years ago	0		7	apache-2.0	Python
MorphL Community Edition uses big data and machine learning to predict user behaviors in digital products and services with the end goal of increasing KPIs (click-through rates, conversion rates, etc.) through personalization
awesome-spark/learn-by-examples	72	0	0	over 8 years ago	0		2		Scala
Real-world Spark pipelines examples
src-d/jgit-spark-connector	67	1	1	over 7 years ago	40	October 10, 2018	12	apache-2.0	Scala
jgit-spark-connector is a library for running scalable data retrieval pipelines that process any number of Git repositories for source code analysis.
crawles/spark-nba-analytics	41	0	0	over 9 years ago	0		0	mit	HTML
Analyzing NBA data using Spark 2.1
basin-etl/basin	29	0	0	over 3 years ago	0		42	other	TypeScript
Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser
cerndb/SparkDLTrigger	28	0	0	about 3 years ago	0		0	apache-2.0	Jupyter Notebook
Repo for the article "Machine Learning Pipelines with Modern Big DataTools for High Energy Physics"
FavioVazquez/ODSC_India_2018	26	0	0	almost 8 years ago	0		0		Jupyter Notebook
My presentation at ODSC India 2018 about Deep Learning with Apache Spark
guidok91/spark-movies-etl	21	0	0	almost 3 years ago	0		2		Python
Spark data pipeline that ingests and transforms movie ratings data.

Alternatives To mpavanetti/airflow

Select To Compare

combust/mleap ⭐ 1,479

MLeap: Deploy ML Pipelines to Production

dependent packages 12 total releases 26 most recent commit over 2 years ago

quintoandar/butterfree ⭐ 269

A tool for building feature stores.

dependent packages 1 total releases 35 most recent commit over 2 years ago downloads badge

Morphl-AI/MorphL-Community-Edition ⭐ 233

MorphL Community Edition uses big data and machine learning to predict user behaviors in digital products and services with the end goal of increasing KPIs (click-through rates, conversion rates, etc.) through personalization

dependent packages 0 total releases 0 most recent commit almost 7 years ago

awesome-spark/learn-by-examples ⭐ 72

Real-world Spark pipelines examples

dependent packages 0 total releases 0 most recent commit over 8 years ago

src-d/jgit-spark-connector ⭐ 67

jgit-spark-connector is a library for running scalable data retrieval pipelines that process any number of Git repositories for source code analysis.

dependent packages 1 total releases 40 most recent commit over 7 years ago downloads badge

crawles/spark-nba-analytics ⭐ 41

Analyzing NBA data using Spark 2.1

dependent packages 0 total releases 0 most recent commit over 9 years ago

basin-etl/basin ⭐ 29

Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser

dependent packages 0 total releases 0 most recent commit over 3 years ago

cerndb/SparkDLTrigger ⭐ 28

Repo for the article "Machine Learning Pipelines with Modern Big DataTools for High Energy Physics"

dependent packages 0 total releases 0 most recent commit about 3 years ago

FavioVazquez/ODSC_India_2018 ⭐ 26

My presentation at ODSC India 2018 about Deep Learning with Apache Spark

dependent packages 0 total releases 0 most recent commit almost 8 years ago

guidok91/spark-movies-etl ⭐ 21

Spark data pipeline that ingests and transforms movie ratings data.

dependent packages 0 total releases 0 most recent commit almost 3 years ago

Suggest An Alternative To airflow

Alternative Project Comparisons

mpavanetti/airflow vs Mleap

mpavanetti/airflow vs Butterfree

mpavanetti/airflow vs Morphl Community Edition

mpavanetti/airflow vs Learn By Examples

mpavanetti/airflow vs Jgit Spark Connector

mpavanetti/airflow vs Spark Nba Analytics

mpavanetti/airflow vs Basin

mpavanetti/airflow vs Sparkdltrigger

mpavanetti/airflow vs Odsc_india_2018

mpavanetti/airflow vs Spark Movies Etl

Popular Pipeline Projects

apache/airflow⭐ 33,219

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

nushell/nushell⭐ 28,304

A new type of shell

vectordotdev/vector⭐ 21,215

A high-performance observability data pipeline.

jina-ai/jina⭐ 19,573

☁️ Build multimodal AI applications with cloud-native stack

spotify/luigi⭐ 17,046

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

Popular Pyspark Projects

kailashahirwar/cheatsheets-ai⭐ 13,281

Essential Cheat Sheets for deep learning and machine learning researchers https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5

microsoft/SynapseML⭐ 5,228

Simple and Distributed Machine Learning

JohnSnowLabs/spark-nlp⭐ 3,578

State of the Art Natural Language Processing

apache/linkis⭐ 3,408

Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.

ibis-project/ibis⭐ 3,404

The flexibility of Python with the scale and performance of modern SQL.

Popular Data Processing Categories

Jupyter Notebook

Dataset

Sql

Validation

Pipeline

Translation

Data Science

Classification

Transaction

Scraper