| capitalone/DataProfiler | 1,310 | | 0 | 3 | about 2 years ago | 53 | November 14, 2023 | 56 | apache-2.0 | Python | What's in your data? Extract schema, statistics and entities from datasets |
| Cinchoo/ChoETL | 693 | | 1 | 9 | over 2 years ago | 177 | September 21, 2023 | 62 | mit | C# | ETL framework for .NET (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files) |
| RandomFractals/vscode-data-preview | 447 | | 0 | 0 | almost 3 years ago | 0 | | 54 | apache-2.0 | TypeScript | Data Preview 🈸 extension for importing 📤 viewing 🔎 slicing 🔪 dicing 🎲 charting 📊 & exporting 📥 large JSON array/config, YAML, Apache Arrow, Avro, Parquet & Excel data files |
| streamthoughts/kafka-connect-file-pulse | 289 | | 0 | 5 | over 2 years ago | 5 | July 05, 2023 | 30 | apache-2.0 | Java | 🔗 A multipurpose Kafka Connect connector that makes it easy to parse, transform and stream any file, in any format, into Apache Kafka |
| RumbleDB/rumble | 194 | | 0 | 0 | almost 3 years ago | 4 | December 03, 2019 | 134 | other | Java | ⛈️ RumbleDB 1.21.0 "Hawthorn blossom" 🌳 for Apache Spark \| Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) \| No install required (just a jar to download) \| Declarative Machine Learning and more |
| andygrove/bdt | 125 | | 0 | 0 | over 2 years ago | 21 | November 22, 2023 | 6 | apache-2.0 | Rust | Boring Data Tool |
| indix/schemer | 89 | | 0 | 0 | about 6 years ago | 15 | March 02, 2018 | 0 | apache-2.0 | Scala | Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API. |
| sparsecode/DaFlow | 24 | | 0 | 0 | almost 6 years ago | 0 | | 8 | other | Scala | Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules. |
| saint1991/serialization-benchmark | 10 | | 0 | 0 | almost 8 years ago | 0 | | 0 | mit | Scala | benchmark for modern serialization systems: Apache Avro, Protocol Buffers, Apache Thrift and MessagePack written in Scala |
| nezihyigitbasi/FlinkParquet | 10 | | 0 | 0 | over 10 years ago | 0 | | 1 | | Java | Using the Parquet file format (with Avro) to process data with Apache Flink |