| databricks/spark-csv |
1,009 |
|
201 |
24 |
over 7 years ago |
13 |
September 05, 2016 |
206 |
apache-2.0 |
Scala |
| CSV Data Source for Apache Spark 1.x |
| uber/marmaray |
444 |
|
0 |
0 |
about 4 years ago |
0 |
|
14 |
other |
Java |
| Generic Data Ingestion & Dispersal Library for Hadoop |
| bluenote10/NimData |
276 |
|
0 |
0 |
almost 5 years ago |
0 |
December 12, 2023 |
27 |
mit |
Nim |
| DataFrame API written in Nim, enabling fast out-of-core data processing |
| linkedin/spark-tfrecord |
255 |
|
0 |
0 |
almost 3 years ago |
10 |
June 30, 2023 |
16 |
bsd-2-clause |
Scala |
| Read and write Tensorflow TFRecord data from Apache Spark. |
| AbsaOSS/ABRiS |
215 |
|
0 |
5 |
over 2 years ago |
17 |
October 06, 2020 |
14 |
apache-2.0 |
Scala |
| Avro SerDe for Apache Spark structured APIs. |
| streamnative/pulsar-spark |
103 |
|
0 |
2 |
over 2 years ago |
10 |
November 06, 2023 |
9 |
apache-2.0 |
Scala |
| Spark Connector to read and write with Pulsar |
| traviscrawford/spark-dynamodb |
90 |
|
0 |
0 |
over 4 years ago |
12 |
March 21, 2018 |
17 |
apache-2.0 |
Scala |
| DynamoDB data source for Apache Spark |
| indix/schemer |
89 |
|
0 |
0 |
about 6 years ago |
15 |
March 02, 2018 |
0 |
apache-2.0 |
Scala |
| Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API. |
| nikgraf/graphiql-spark |
79 |
|
0 |
0 |
about 6 years ago |
0 |
|
1 |
mit |
TypeScript |
| Demo a GraphQL schema without a GraphQL endpoint |
| hhbyyh/DataFrameCheatSheet |
74 |
|
0 |
0 |
over 6 years ago |
0 |
|
0 |
|
|
| Cheatsheet for Spark DataFrame |