| logicalclocks/hopsworks |
1,041 |
|
0 |
0 |
about 2 years ago |
1 |
September 11, 2019 |
12 |
agpl-3.0 |
Java |
| Hopsworks - Data-Intensive AI platform with a Feature Store |
| HariSekhon/DevOps-Python-tools |
709 |
|
0 |
0 |
over 2 years ago |
0 |
|
37 |
mit |
Python |
| 80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc. |
| aws/sagemaker-spark |
285 |
|
2 |
0 |
over 2 years ago |
36 |
August 26, 2022 |
34 |
apache-2.0 |
Scala |
| A Spark library for Amazon SageMaker. |
| commoncrawl/cc-pyspark |
280 |
|
0 |
0 |
about 3 years ago |
0 |
|
4 |
mit |
Python |
| Process Common Crawl data with Python and Spark |
| PiercingDan/spark-Jupyter-AWS |
255 |
|
0 |
0 |
over 8 years ago |
0 |
|
2 |
|
Jupyter Notebook |
| A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support |
| RubensZimbres/Repo-2019 |
135 |
|
0 |
0 |
over 4 years ago |
0 |
|
1 |
|
Jupyter Notebook |
| BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google Assistant + Raspberry Pi, Infrared, Google Cloud Platform Natural Language, Anomaly detection, Tensorflow, Mathematics |
| adornes/spark_python_ml_examples |
81 |
|
0 |
0 |
over 6 years ago |
0 |
|
0 |
|
Python |
| Spark 2.0 Python Machine Learning examples |
| arverma/TowardsDataEngineering |
52 |
|
0 |
0 |
about 3 years ago |
0 |
|
7 |
|
Python |
| This repo contains commands that data engineers use in day to day work. |
| idealo/terraform-emr-pyspark |
46 |
|
0 |
0 |
over 2 years ago |
0 |
|
2 |
apache-2.0 |
HCL |
| Quickstart PySpark with Anaconda on AWS/EMR using Terraform |
| datitran/emr-bootstrap-pyspark |
43 |
|
0 |
0 |
over 9 years ago |
0 |
|
0 |
mit |
Python |
| Quickstart PySpark with Anaconda on AWS/EMR |