| lk-geimfari/mimesis |
4,182 |
|
42 |
32 |
about 2 years ago |
49 |
August 19, 2023 |
7 |
mit |
Python |
| Mimesis is a powerful Python library that empowers developers to generate massive amounts of synthetic data efficiently. |
| unionai-oss/pandera |
2,807 |
|
0 |
97 |
about 2 years ago |
79 |
December 08, 2023 |
321 |
mit |
Python |
| A light-weight, flexible, and expressive statistical data testing library |
| bluenote10/NimData |
276 |
|
0 |
0 |
almost 5 years ago |
0 |
December 12, 2023 |
27 |
mit |
Nim |
| DataFrame API written in Nim, enabling fast out-of-core data processing |
| AbsaOSS/ABRiS |
215 |
|
0 |
5 |
over 2 years ago |
17 |
October 06, 2020 |
14 |
apache-2.0 |
Scala |
| Avro SerDe for Apache Spark structured APIs. |
| google/tensorflow-recorder |
158 |
|
0 |
0 |
about 4 years ago |
0 |
|
14 |
apache-2.0 |
Python |
| TFRecorder makes it easy to create TensorFlow records (TFRecords) from Pandas DataFrames and CSVs files containing images or structured data. |
| ynqa/pandavro |
127 |
|
17 |
22 |
over 2 years ago |
16 |
August 27, 2025 |
8 |
mit |
Python |
| Apache Avro <-> pandas DataFrame |
| streamnative/pulsar-spark |
103 |
|
0 |
2 |
over 2 years ago |
10 |
November 06, 2023 |
9 |
apache-2.0 |
Scala |
| Spark Connector to read and write with Pulsar |
| areshytko/typedframe |
78 |
|
0 |
2 |
over 2 years ago |
22 |
September 07, 2023 |
3 |
mit |
Python |
| Typed wrappers over pandas DataFrames with schema validation |
| hhbyyh/DataFrameCheatSheet |
74 |
|
0 |
0 |
over 6 years ago |
0 |
|
0 |
|
|
| Cheatsheet for Spark DataFrame |
| CybercentreCanada/jupyterlab-sql-editor |
72 |
|
0 |
0 |
over 2 years ago |
79 |
November 14, 2023 |
9 |
mit |
Jupyter Notebook |
| A JupyterLab extension providing, SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trino |