| ocrmypdf/OCRmyPDF |
32,432 |
|
6 |
11 |
2 months ago |
227 |
November 29, 2023 |
87 |
mpl-2.0 |
Python |
| OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched |
| CCExtractor/ccextractor |
642 |
|
0 |
0 |
about 2 years ago |
0 |
|
114 |
gpl-2.0 |
C |
| CCExtractor - Official version maintained by the core team |
| aryaminus/memento |
64 |
|
0 |
0 |
almost 5 years ago |
5 |
May 04, 2018 |
3 |
mit |
Python |
| Organize your meme image cluster in a better format using OCR from the meme to sort them using tesseract along with editing memes by segmenting them using OpenCV within a directory |
| hertzg/tesseract-server |
59 |
|
0 |
0 |
about 2 years ago |
0 |
|
0 |
mit |
TypeScript |
| A small lightweight HTTP server that converts photos, images and scanned documents to text using optical character recognition by utilizing the power of Google Tesseract. |
| devforth/imagetotext.app |
48 |
|
0 |
0 |
over 3 years ago |
0 |
|
2 |
|
HTML |
| Copy text from the raster images online |
| vitali84/pdf-to-csv-table-extactor |
45 |
|
0 |
0 |
about 7 years ago |
0 |
|
5 |
wtfpl |
Python |
| Extract tables from scanned documents pdf into csv file using ocr and image processing |
| bigchao8/Opencv-ImageBase |
41 |
|
0 |
0 |
about 7 years ago |
0 |
|
0 |
|
C++ |
| 对任何文字图片来源进行预处理结合tesseract-ocr进行识别,主要模块有纸张边缘查找,四角定位,仿射变换,二值化,模糊处理,摩尔纹处理,噪点过滤,图片exif,jfif信息处理,表格线删除,图片阴影处理,傅里叶图片矫正处理等等。。本程序依赖于与图片exif,jfif信息进行分类处理,传入时需带有信息 |
| Lucs1590/Nkocr |
31 |
|
0 |
0 |
over 2 years ago |
14 |
October 15, 2021 |
0 |
apache-2.0 |
Python |
| 🔎📝 This is a module to make specifics OCRs at food products and nutritional tables. |
| farhanchoudhary/PAN_Card_OCR_Project |
26 |
|
0 |
0 |
about 6 years ago |
0 |
|
0 |
|
Python |
| To extract details from Indian National Identification Cards such as PAN (completed) & Aadhar, Passport, Driving License (WIP) in a structured format |
| breezedave/android-anpr |
23 |
|
0 |
0 |
almost 12 years ago |
0 |
|
2 |
|
Java |
| Android number plate recognition |