| awslabs/lambda-refarch-mapreduce | 355 | | 0 | 0 | over 6 years ago | 0 | | 7 | other | JavaScript | This repo presents a reference architecture for running serverless MapReduce jobs. This has been implemented using AWS Lambda and Amazon S3. |
| petewarden/common_crawl_types | 28 | | 0 | 0 | about 14 years ago | 0 | | 0 | | Ruby | A simple Ruby example of how to process Common Crawl files using Elastic MapReduce |
| ly16/GooglePlay-Web-Crawler | 15 | | 0 | 0 | about 9 years ago | 0 | | 0 | | Java | Mapreduce project by Hadoop, Nutch, AWS EMR, Pig, Tez, Hive |
| pvnick/easy-s3-kafka-consumer | 6 | | 0 | 0 | about 12 years ago | 0 | | 0 | | Python | Pain-free system for sinking data from kafka to s3. Written in Go for high concurrency. Great for streaming large amounts of web data into s3 for use with mapreduce. |
| nathants/py-aws | 5 | | 0 | 0 | over 2 years ago | 0 | | 0 | mit | Python | succinct ec2 cli, ec2 python api, and mapreduce library. |