| kba/awesome-ocr | 1,951 | | 0 | 0 | over 4 years ago | 0 | | 44 | other | |
| Links to awesome OCR projects |
| harvard-lil/capstone | 168 | | 0 | 0 | about 2 years ago | 0 | | 20 | mit | HTML |
| CAP database scripts. |
| Mararsh/MyBox | 120 | | 0 | 0 | about 2 years ago | 0 | | 16 | apache-2.0 | Java |
| Easy tools of document, image, file, network, data, color, and media. |
| WZBSocialScienceCenter/pdf2xml-viewer | 82 | | 0 | 0 | about 4 years ago | 0 | | 0 | apache-2.0 | HTML |
| A simple viewer and inspection tool for text boxes in PDF documents |
| impactcentre/ocrevalUAtion | 52 | | 0 | 0 | over 4 years ago | 0 | | 3 | apache-2.0 | HTML |
| OCR evaluation brought to you by University of Alicante |
| altoxml/schema | 45 | | 0 | 0 | about 3 years ago | 0 | | 23 | | |
| ALTO XML schema - latest and all former versions |
| filak/hOCR-to-ALTO | 44 | | 0 | 0 | over 2 years ago | 0 | | 0 | mit | XSLT |
| Convert between Tesseract hOCR and ALTO XML using XSL stylesheets |
| pyxploiter/deep-splerge | 39 | | 0 | 0 | over 3 years ago | 0 | | 1 | | Python |
| Implementation of research paper "Deep Splitting and Merging for Table Structure Decomposition" |
| mauvilsa/tesseract-recognize | 35 | | 0 | 0 | over 3 years ago | 0 | | 0 | mit | C++ |
| Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format |
| Anyline/anyline-ocr-examples-android | 22 | | 0 | 0 | about 2 years ago | 0 | | 0 | other | Kotlin |
| Example configurations of the Anyline OCR SDK. |