| pymupdf/PyMuPDF | 3,526 | | 34 | 341 | about 2 years ago | 124 | November 30, 2023 | 13 | agpl-3.0 | Python | PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents. |
| turicas/rows | 845 | | 26 | 3 | almost 3 years ago | 10 | December 15, 2021 | 170 | lgpl-3.0 | Python | A common, beautiful interface to tabular data, no matter the format |
| ome/ngff | 100 | | 0 | 0 | about 2 years ago | 0 | | 104 | other | Bikeshed | Next-generation file format (NGFF) specifications for storing bioimaging data in the cloud. |
| CertifaiAI/classifai | 96 | | 0 | 0 | over 4 years ago | 0 | | 16 | apache-2.0 | Java | :fire: One of the most comprehensive open-source data annotation platform. |
| finos/greenkey-asrtoolkit | 31 | | 0 | 0 | over 3 years ago | 24 | May 19, 2021 | 5 | apache-2.0 | Python | A collection of useful tools for handling speech recognition data |
| bradlindblad/schrutepy | 20 | | 0 | 0 | about 4 years ago | 4 | January 23, 2022 | 4 | mit | Python | The Entire Transcript from the Office in Tidy Format |
| maximveksler/awesome-serialization | 15 | | 0 | 0 | about 3 years ago | 0 | | 0 | unlicense | | Data formats useful for API, Big Data, ML, Graph & co |
| lungben/TableIO.jl | 12 | | 0 | 0 | over 3 years ago | 0 | | 1 | mit | Julia | A glue package for reading and writing tabular data. It aims to provide a uniform api for reading and writing tabular data from and to multiple sources. |