| WorksApplications/Sudachi |
684 |
|
0 |
3 |
over 2 years ago |
22 |
June 23, 2023 |
20 |
apache-2.0 |
Java |
| A Japanese Tokenizer for Business |
| rivo/uniseg |
500 |
|
31 |
7,628 |
about 2 years ago |
17 |
February 21, 2023 |
2 |
mit |
Go |
| Unicode Text Segmentation, Word Wrapping, and String Width Calculation in Go |
| guokr/gkseg |
242 |
|
0 |
0 |
about 13 years ago |
0 |
|
3 |
other |
C |
| Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm |
| HusseinYoussef/Arabic-OCR |
192 |
|
0 |
0 |
over 2 years ago |
0 |
|
9 |
mit |
Python |
| OCR system for Arabic language that converts images of typed text to machine-encoded text. |
| lumaku/ctc-segmentation |
192 |
|
0 |
2 |
over 3 years ago |
21 |
October 11, 2022 |
4 |
apache-2.0 |
Python |
| Segment an audio file and obtain utterance alignments. (Python package) |
| tonton-pixel/unicopedia-plus |
144 |
|
0 |
0 |
over 2 years ago |
0 |
|
0 |
mit |
JavaScript |
| Developer-oriented set of Unicode, Unihan & emoji utilities wrapped into one single app, built with Electron. |
| mountain/nseg |
93 |
|
4 |
4 |
about 14 years ago |
9 |
January 28, 2012 |
0 |
mit |
JavaScript |
| Node.js Version of MMSG for Chinese Word Segmentation |
| skotz/cbl-js |
85 |
|
0 |
0 |
about 5 years ago |
0 |
|
27 |
mit |
JavaScript |
| JavaScript CAPTCHA solving library |
| louismullie/scalpel |
52 |
|
39 |
2 |
over 10 years ago |
2 |
December 21, 2012 |
2 |
other |
Ruby |
| A fast and accurate rule-based sentence segmentation tool for Ruby. |
| Th1nkK1D/gocr |
50 |
|
0 |
0 |
over 3 years ago |
0 |
|
3 |
|
Go |
| OCR implementation with Golang |