| mocobeta/janome |
776 |
|
61 |
34 |
almost 3 years ago |
28 |
July 01, 2023 |
12 |
apache-2.0 |
Python |
| Japanese morphological analysis engine written in pure Python |
| ikawaha/kagome |
769 |
|
0 |
27 |
about 2 years ago |
74 |
September 27, 2023 |
4 |
mit |
Go |
| Self-contained Japanese Morphological Analyzer written in pure Go |
| atilika/kuromoji |
688 |
|
5 |
6 |
over 6 years ago |
1 |
September 09, 2015 |
20 |
apache-2.0 |
Java |
| Kuromoji is a self-contained and very easy to use Japanese morphological analyzer designed for search |
| taishi-i/awesome-japanese-nlp-resources |
522 |
|
0 |
0 |
about 2 years ago |
0 |
|
0 |
cc0-1.0 |
|
| A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese |
| taishi-i/nagisa |
365 |
|
1 |
7 |
about 2 years ago |
22 |
July 30, 2023 |
4 |
mit |
Python |
| A Japanese tokenizer based on recurrent neural networks |
| WorksApplications/SudachiPy |
318 |
|
0 |
0 |
over 3 years ago |
0 |
|
18 |
apache-2.0 |
Python |
| Python version of Sudachi, a Japanese tokenizer. |
| taishi-i/toiro |
110 |
|
0 |
0 |
over 2 years ago |
8 |
July 31, 2023 |
1 |
apache-2.0 |
Python |
| A comparison tool of Japanese tokenizers |
| StarCC0/starcc-py |
25 |
|
0 |
0 |
over 3 years ago |
0 |
|
0 |
cc0-1.0 |
Python |
| 简繁转换 簡繁轉換 Python implementation of StarCC, the next generation of Simplified-Traditional Chinese conversion framework |