| combust/mleap | 1,479 | | 15 | 12 | over 2 years ago | 26 | May 07, 2021 | 109 | apache-2.0 | Scala | MLeap: Deploy ML Pipelines to Production |
| quintoandar/butterfree | 269 | | 0 | 1 | over 2 years ago | 35 | November 14, 2023 | 6 | apache-2.0 | Python | A tool for building feature stores. |
| Morphl-AI/MorphL-Community-Edition | 233 | | 0 | 0 | over 6 years ago | 0 | | 7 | apache-2.0 | Python | MorphL Community Edition uses big data and machine learning to predict user behaviors in digital products and services with the end goal of increasing KPIs (click-through rates, conversion rates, etc.) through personalization |
| awesome-spark/learn-by-examples | 72 | | 0 | 0 | about 8 years ago | 0 | | 2 | | Scala | Real-world Spark pipelines examples |
| src-d/jgit-spark-connector | 67 | | 1 | 1 | about 7 years ago | 40 | October 10, 2018 | 12 | apache-2.0 | Scala | jgit-spark-connector is a library for running scalable data retrieval pipelines that process any number of Git repositories for source code analysis. |
| crawles/spark-nba-analytics | 41 | | 0 | 0 | about 9 years ago | 0 | | 0 | mit | HTML | Analyzing NBA data using Spark 2.1 |
| basin-etl/basin | 29 | | 0 | 0 | over 3 years ago | 0 | | 42 | other | TypeScript | Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser |
| cerndb/SparkDLTrigger | 28 | | 0 | 0 | almost 3 years ago | 0 | | 0 | apache-2.0 | Jupyter Notebook | Repo for the article "Machine Learning Pipelines with Modern Big Data Tools for High Energy Physics" |
| FavioVazquez/ODSC_India_2018 | 26 | | 0 | 0 | over 7 years ago | 0 | | 0 | | Jupyter Notebook | My presentation at ODSC India 2018 about Deep Learning with Apache Spark |
| guidok91/spark-movies-etl | 21 | | 0 | 0 | over 2 years ago | 0 | | 2 | | Python | Spark data pipeline that ingests and transforms movie ratings data. |