| ikawaha/kagome | 769 | | 0 | 27 | about 2 years ago | 74 | September 27, 2023 | 4 | mit | Go |
| Self-contained Japanese Morphological Analyzer written in pure Go |
| ku-nlp/jumanpp | 334 | | 0 | 0 | about 3 years ago | 0 | | 30 | apache-2.0 | C++ |
| Juman++ (a Morphological Analyzer Toolkit) |
| WorksApplications/SudachiPy | 318 | | 0 | 0 | over 3 years ago | 0 | | 18 | apache-2.0 | Python |
| Python version of Sudachi, a Japanese tokenizer. |
| daac-tools/vibrato | 275 | | 0 | 1 | over 2 years ago | 11 | May 12, 2023 | 3 | apache-2.0 | Rust |
| 🎤 vibrato: Viterbi-based accelerated tokenizer |
| daac-tools/vaporetto | 206 | | 0 | 3 | over 2 years ago | 16 | April 01, 2023 | 0 | apache-2.0 | Rust |
| 🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer |
| vaaaaanquish/cloudia | 108 | | 0 | 0 | over 5 years ago | 9 | May 09, 2020 | 10 | mit | Python |
| Tools to easily create a word cloud |
| ku-nlp/KWDLC | 71 | | 0 | 0 | over 2 years ago | 0 | | 12 | | Python |
| Kyoto University Web Document Leads Corpus |
| Leko/goya | 55 | | 0 | 0 | over 4 years ago | 0 | | 2 | mit | Rust |
| Japanese Morphological Analysis written in Rust |
| c-bata/pysearch | 48 | | 0 | 0 | almost 10 years ago | 0 | | 0 | | Python |
| Web crawler and Search engine in Python. |
| ku-nlp/KyotoCorpus | 47 | | 0 | 0 | almost 3 years ago | 0 | | 0 | | Perl |
| Kyoto University Text Corpus |