| HariSekhon/DevOps-Python-tools | 709 | | 0 | 0 | over 2 years ago | 0 | | 37 | mit | Python |
| 80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc. |
| huseinzol05/Gather-Deployment | 347 | | 0 | 0 | over 2 years ago | 0 | | 0 | mit | Jupyter Notebook |
| Gathers Python deployment, infrastructure and practices. |
| cluster-apps-on-docker/spark-standalone-cluster-on-docker | 311 | | 0 | 0 | over 3 years ago | 0 | | 16 | mit | Jupyter Notebook |
| Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. :zap: |
| ThreatHuntingProject/hunter | 170 | | 0 | 0 | over 4 years ago | 0 | | 0 | mit | Jupyter Notebook |
| A threat hunting / data analysis environment based on Python, Pandas, PySpark and Jupyter Notebook. |
| PacktPublishing/Mastering-Big-Data-Analytics-with-PySpark | 118 | | 0 | 0 | about 3 years ago | 0 | | 6 | mit | Jupyter Notebook |
| Mastering Big Data Analytics with PySpark, Published by Packt |
| alanchn31/Movalytics-Data-Warehouse | 103 | | 0 | 0 | almost 6 years ago | 0 | | 0 | | Python |
| Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow |
| sabman/PySparkGeoAnalysis | 60 | | 0 | 0 | about 9 years ago | 0 | | 3 | | Jupyter Notebook |
| :globe_with_meridians: Interactive Workshop on GeoAnalysis using PySpark |
| groda/big_data | 55 | | 0 | 0 | over 2 years ago | 0 | | 0 | mit | Jupyter Notebook |
| Tutorials on Big Data essentials: Hadoop, MapReduce, Spark. |
| arverma/TowardsDataEngineering | 52 | | 0 | 0 | about 3 years ago | 0 | | 7 | | Python |
| This repo contains commands that data engineers use in day to day work. |
| TresAmigosSD/SMV | 41 | | 0 | 0 | almost 6 years ago | 10 | September 19, 2019 | 73 | apache-2.0 | Python |
| Spark Modularized View |