| tesseract-ocr/tesseract |
56,096 |
|
0 |
7 |
about 2 years ago |
1 |
February 27, 2018 |
415 |
apache-2.0 |
C++ |
| Tesseract Open Source OCR Engine (main repository) |
| hiroi-sora/Umi-OCR |
43,032 |
|
0 |
0 |
5 months ago |
0 |
|
85 |
mit |
Python |
| OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。 |
| PaddlePaddle/PaddleOCR |
36,076 |
|
0 |
30 |
about 2 years ago |
40 |
September 15, 2023 |
1,027 |
apache-2.0 |
Python |
| Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices) |
| ShareX/ShareX |
35,447 |
|
0 |
0 |
2 months ago |
0 |
|
528 |
gpl-3.0 |
C# |
| ShareX is a free and open-source application that enables users to capture or record any area of their screen with a single keystroke. It also supports uploading images, text, and various file types to a wide range of destinations. |
| naptha/tesseract.js |
32,523 |
|
221 |
224 |
about 2 years ago |
66 |
October 30, 2023 |
19 |
apache-2.0 |
JavaScript |
| Pure Javascript OCR for more than 100 Languages 📖🎉🖥 |
| ocrmypdf/OCRmyPDF |
32,432 |
|
6 |
11 |
2 months ago |
227 |
November 29, 2023 |
87 |
mpl-2.0 |
Python |
| OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched |
| JaidedAI/EasyOCR |
20,438 |
|
0 |
69 |
over 2 years ago |
32 |
September 04, 2023 |
340 |
apache-2.0 |
Python |
| Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc. |
| siyuan-note/siyuan |
14,236 |
|
0 |
0 |
about 2 years ago |
1 |
July 07, 2022 |
70 |
agpl-3.0 |
TypeScript |
| A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang. |
| PaddlePaddle/PaddleHub |
12,193 |
|
0 |
6 |
over 2 years ago |
50 |
September 20, 2023 |
575 |
apache-2.0 |
Python |
| Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving) |
| DayBreak-u/chineseocr_lite |
11,223 |
|
0 |
0 |
over 2 years ago |
2 |
January 25, 2022 |
241 |
gpl-2.0 |
C++ |
| 超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M |