| HariSekhon/DevOps-Python-tools | 709 | | 0 | 0 | over 2 years ago | 0 | | 37 | mit | Python |
| 80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc. |
| OBenner/data-engineering-interview-questions | 554 | | 0 | 0 | over 2 years ago | 0 | | 0 | | |
| More than 2,000 data engineering interview questions. |
| uber/marmaray | 444 | | 0 | 0 | about 4 years ago | 0 | | 14 | other | Java |
| Generic Data Ingestion & Dispersal Library for Hadoop |
| Netflix/iceberg | 409 | | 0 | 0 | over 4 years ago | 0 | | 27 | apache-2.0 | Java |
| Iceberg is a table format for large, slow-moving tabular data |
| Chabane/bigdata-playground | 154 | | 0 | 0 | about 7 years ago | 0 | | 4 | apache-2.0 | TypeScript |
| A complete example of a big data application using: Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLlib, Apache Flink, Scala, Python, Apache Kafka, Apache HBase, Apache Parquet, Apache Avro, Apache Storm, Twitter API, MongoDB, Node.js, Angular, GraphQL |
| 51zero/eel-sdk | 140 | | 1 | 17 | over 5 years ago | 103 | February 11, 2019 | 25 | apache-2.0 | Scala |
| Big Data Toolkit for the JVM |
| miguno/avro-hadoop-starter | 111 | | 0 | 0 | over 10 years ago | 0 | | 0 | other | Java |
| Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data. |
| confluentinc/camus | 87 | | 0 | 0 | almost 3 years ago | 0 | | 6 | apache-2.0 | Java |
| Mirror of LinkedIn's Camus |
| spotify/hdfs2cass | 75 | | 0 | 0 | about 4 years ago | 0 | | 6 | apache-2.0 | Java |
| Hadoop MapReduce job to bulk load data into Cassandra |
| phunt/avro-maven-plugin | 34 | | 0 | 0 | over 15 years ago | 0 | | 2 | apache-2.0 | Java |
| Maven 2 Plugin for processing Apache Avro files. Avro is a subproject of Apache Hadoop. |