| donnemartin/data-science-ipython-notebooks |
25,668 |
|
0 |
0 |
over 2 years ago |
0 |
|
34 |
other |
Python |
| Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines. |
| donnemartin/dev-setup |
5,802 |
|
0 |
0 |
over 3 years ago |
0 |
|
34 |
other |
Python |
| macOS development environment setup: Easy-to-understand instructions with automated setup scripts for developer tools like Vim, Sublime Text, Bash, iTerm, Python data analysis, Spark, Hadoop MapReduce, AWS, Heroku, JavaScript web development, Android development, common data stores, and dev-based OS X defaults. |
| SeldonIO/seldon-server |
1,420 |
|
0 |
0 |
about 6 years ago |
44 |
June 28, 2017 |
26 |
apache-2.0 |
Java |
| Machine Learning Platform and Recommendation Engine built on Kubernetes |
| aws-samples/aws-glue-samples |
1,334 |
|
0 |
0 |
over 2 years ago |
0 |
|
37 |
mit-0 |
Python |
| AWS Glue code samples |
| HariSekhon/DevOps-Python-tools |
709 |
|
0 |
0 |
over 2 years ago |
0 |
|
37 |
mit |
Python |
| 80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc. |
| nchammas/flintrock |
627 |
|
4 |
0 |
over 2 years ago |
14 |
November 27, 2023 |
36 |
apache-2.0 |
Python |
| A command-line tool for launching Apache Spark clusters. |
| awslabs/aws-glue-libs |
568 |
|
0 |
0 |
over 2 years ago |
0 |
|
96 |
other |
Python |
| AWS Glue Libraries are additions and enhancements to Spark for ETL operations. |
| OBenner/data-engineering-interview-questions |
554 |
|
0 |
0 |
over 2 years ago |
0 |
|
0 |
|
|
| More than 2000+ Data engineer interview questions. |
| databricks/spark-redshift |
514 |
|
4 |
1 |
over 6 years ago |
10 |
November 01, 2016 |
134 |
apache-2.0 |
Scala |
| Redshift data source for Apache Spark |
| rjurney/Agile_Data_Code_2 |
435 |
|
0 |
0 |
about 3 years ago |
0 |
|
7 |
mit |
Jupyter Notebook |
| Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition |