| spotify/scio |
2,505 |
|
0 |
37 |
about 2 years ago |
96 |
November 21, 2023 |
142 |
apache-2.0 |
Scala |
| A Scala API for Apache Beam and Google Cloud Dataflow. |
| SeldonIO/seldon-server |
1,420 |
|
0 |
0 |
about 6 years ago |
44 |
June 28, 2017 |
26 |
apache-2.0 |
Java |
| Machine Learning Platform and Recommendation Engine built on Kubernetes |
| HariSekhon/DevOps-Python-tools |
709 |
|
0 |
0 |
over 2 years ago |
0 |
|
37 |
mit |
Python |
| 80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc. |
| elasticluster/elasticluster |
334 |
|
3 |
0 |
over 2 years ago |
12 |
October 22, 2014 |
182 |
gpl-3.0 |
Python |
| Create clusters of VMs on the cloud and configure them with Ansible. |
| GoogleCloudDataproc/spark-bigquery-connector |
332 |
|
0 |
12 |
about 2 years ago |
24 |
October 31, 2023 |
42 |
apache-2.0 |
Java |
| BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables. |
| lynnlangit/learning-hadoop-and-spark |
160 |
|
0 |
0 |
over 2 years ago |
0 |
|
0 |
apache-2.0 |
HTML |
| Companion to Learning Hadoop and Learning Spark courses on Linked In Learning |
| Hamagistral/de-zoomcamp-ui |
107 |
|
0 |
0 |
about 2 years ago |
0 |
|
0 |
|
Python |
| 🎨 UI for the Free Data Engineering Zoomcamp 2023 Course provided by DataTalksClub |
| ankurchavda/streamify |
97 |
|
0 |
0 |
almost 4 years ago |
0 |
|
0 |
|
Python |
| A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more! |
| sigmoidanalytics/spark_gce |
45 |
|
0 |
0 |
almost 11 years ago |
0 |
|
1 |
apache-2.0 |
Python |
| Spark GCE Script Helps you deploy Spark cluster on Google Cloud. |
| tharwaninitin/etlflow |
43 |
|
0 |
11 |
over 2 years ago |
37 |
July 19, 2023 |
0 |
apache-2.0 |
Scala |
| EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for running complex Auditable workflows which can interact with Google Cloud Platform, AWS, Kubernetes, Databases, SFTP servers, On-Prem Systems and more. |