| go-ego/gse |
2,352 |
|
14 |
21 |
over 2 years ago |
82 |
January 16, 2023 |
12 |
apache-2.0 |
Go |
| Go efficient multilingual NLP and text segmentation; support English, Chinese, Japanese and others. |
| ikawaha/kagome |
769 |
|
0 |
27 |
about 2 years ago |
74 |
September 27, 2023 |
4 |
mit |
Go |
| Self-contained Japanese Morphological Analyzer written in pure Go |
| taishi-i/nagisa |
365 |
|
1 |
7 |
about 2 years ago |
22 |
July 30, 2023 |
4 |
mit |
Python |
| A Japanese tokenizer based on recurrent neural networks |
| WorksApplications/SudachiPy |
318 |
|
0 |
0 |
over 3 years ago |
0 |
|
18 |
apache-2.0 |
Python |
| Python version of Sudachi, a Japanese tokenizer. |
| daac-tools/vibrato |
275 |
|
0 |
1 |
over 2 years ago |
11 |
May 12, 2023 |
3 |
apache-2.0 |
Rust |
| 🎤 vibrato: Viterbi-based accelerated tokenizer |
| daac-tools/vaporetto |
206 |
|
0 |
3 |
over 2 years ago |
16 |
April 01, 2023 |
0 |
apache-2.0 |
Rust |
| 🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer |
| tonton-pixel/unicopedia-plus |
144 |
|
0 |
0 |
over 2 years ago |
0 |
|
0 |
mit |
JavaScript |
| Developer-oriented set of Unicode, Unihan & emoji utilities wrapped into one single app, built with Electron. |
| julius-speech/segmentation-kit |
53 |
|
0 |
0 |
almost 6 years ago |
0 |
|
2 |
mit |
Perl |
| Speech Segmentation Toolkit using Julius |
| wwwcojp/ja_sentence_segmenter |
46 |
|
0 |
0 |
about 3 years ago |
0 |
|
0 |
mit |
Python |
| japanese sentence segmentation library for python |
| timmahrt/pyJuliusAlign |
39 |
|
0 |
0 |
almost 3 years ago |
13 |
September 03, 2021 |
3 |
other |
Python |
| One-button-press forced aligner for Japanese, using Julius. |