| ikawaha/kagome |
769 |
|
0 |
27 |
about 2 years ago |
74 |
September 27, 2023 |
4 |
mit |
Go |
| Self-contained Japanese Morphological Analyzer written in pure Go |
| ku-nlp/jumanpp |
334 |
|
0 |
0 |
about 3 years ago |
0 |
|
30 |
apache-2.0 |
C++ |
| Juman++ (a Morphological Analyzer Toolkit) |
| lindera-morphology/lindera |
326 |
|
0 |
27 |
about 2 years ago |
50 |
August 25, 2023 |
13 |
mit |
Rust |
| A morphological analysis library. |
| WorksApplications/SudachiPy |
318 |
|
0 |
0 |
over 3 years ago |
0 |
|
18 |
apache-2.0 |
Python |
| Python version of Sudachi, a Japanese tokenizer. |
| daac-tools/vibrato |
275 |
|
0 |
1 |
over 2 years ago |
11 |
May 12, 2023 |
3 |
apache-2.0 |
Rust |
| 🎤 vibrato: Viterbi-based accelerated tokenizer |
| daac-tools/vaporetto |
206 |
|
0 |
3 |
over 2 years ago |
16 |
April 01, 2023 |
0 |
apache-2.0 |
Rust |
| 🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer |
| adbar/simplemma |
100 |
|
0 |
9 |
over 2 years ago |
14 |
January 20, 2023 |
17 |
mit |
Python |
| Simple multilingual lemmatizer for Python, especially useful for speed and efficiency |
| yoshoku/suika |
35 |
|
0 |
0 |
over 2 years ago |
8 |
July 03, 2021 |
0 |
bsd-3-clause |
Ruby |
| Suika 🍉 is a Japanese morphological analyzer written in pure Ruby |
| daac-tools/python-vibrato |
25 |
|
0 |
0 |
over 2 years ago |
0 |
|
0 |
apache-2.0 |
Rust |
| Viterbi-based accelerated tokenizer (Python wrapper) |
| daac-tools/python-vaporetto |
17 |
|
0 |
0 |
over 2 years ago |
2 |
June 11, 2022 |
0 |
apache-2.0 |
Rust |
| 🛥 Vaporetto is a fast and lightweight pointwise prediction based tokenizer. This is a Python wrapper for Vaporetto. |