| Yelp/mrjob | 2,584 | | 112 | 2 | over 3 years ago | 62 | December 15, 2021 | 211 | other | Python |
| Run MapReduce jobs on Hadoop or Amazon Web Services |
| databricks/spark-redshift | 514 | | 4 | 1 | over 6 years ago | 10 | November 01, 2016 | 134 | apache-2.0 | Scala |
| Redshift data source for Apache Spark |
| datawrangling/trendingtopics | 351 | | 0 | 0 | over 14 years ago | 0 | | 10 | | Ruby |
| Rails app for tracking trends in server logs - powered by the Cloudera Hadoop Distribution on EC2 |
| aws/sagemaker-spark | 285 | | 2 | 0 | over 2 years ago | 36 | August 26, 2022 | 34 | apache-2.0 | Scala |
| A Spark library for Amazon SageMaker. |
| PiercingDan/spark-Jupyter-AWS | 255 | | 0 | 0 | over 8 years ago | 0 | | 2 | | Jupyter Notebook |
| A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support |
| awslabs/emr-dynamodb-connector | 210 | | 0 | 0 | about 2 years ago | 15 | September 28, 2021 | 64 | apache-2.0 | Java |
| Implementations of open source Apache Hadoop/Hive interfaces which allow for ingesting data from Amazon DynamoDB |
| tc/elastic-mapreduce-ruby | 86 | | 0 | 0 | over 11 years ago | 0 | | 8 | apache-2.0 | Ruby |
| Amazon's elastic mapreduce ruby client. Ruby 1.9.X compatible |
| snowplow/scalding-example-project | 85 | | 0 | 0 | over 11 years ago | 0 | | 3 | apache-2.0 | Scala |
| The Scalding WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR |
| ATLANTBH/emr-s3-io | 29 | | 0 | 0 | almost 13 years ago | 0 | | 5 | | Java |
| Hadoop IO for Amazon S3 |
| eleflow/nutch-aws | 23 | | 0 | 0 | about 11 years ago | 0 | | 1 | | Makefile |