| shangjingbo1226/AutoPhrase |
978 |
|
0 |
0 |
about 4 years ago |
3 |
November 19, 2020 |
6 |
apache-2.0 |
C++ |
| AutoPhrase: Automated Phrase Mining from Massive Text Corpora |
| jiaeyan/Jiayan |
232 |
|
0 |
0 |
over 4 years ago |
3 |
September 16, 2019 |
7 |
mit |
Python |
| 甲言,专注于古代汉语(古汉语/古文/文言文/文言)处理的NLP工具包,支持文言词库构建、分词、词性标注、断句和标点。Jiayan, the 1st NLP toolkit designed for Classical Chinese, supports lexicon construction, tokenizing, POS tagging, sentence segmentation and punctuation. |
| WorksApplications/SudachiDict |
212 |
|
0 |
26 |
about 2 years ago |
24 |
December 14, 2023 |
9 |
apache-2.0 |
Python |
| A lexicon for Sudachi |
| sunpinyin/open-gram |
59 |
|
0 |
0 |
about 10 years ago |
0 |
|
2 |
|
Python |
| an open solution for collecting n-gram Chinese lexicon and n-gram statistics |
| kodexlab/eleve |
12 |
|
0 |
0 |
over 5 years ago |
12 |
October 25, 2020 |
1 |
lgpl-3.0 |
Python |
| Extraction de LExique par Variation d'Entropie - Lexicon extraction based on the variation of entropy |
| tchaikov/open-gram |
9 |
|
0 |
0 |
over 15 years ago |
0 |
|
0 |
|
Python |
| collect lexicon and build n-gram dataset for NLP in Chinese |
| minthanthtoo/myanmar-collation-stats |
7 |
|
0 |
0 |
over 4 years ago |
0 |
|
0 |
|
Java |
| Myanmar lexicon analyzer - Sorting and Segmentation |
| rnd2110/MorphAGram |
6 |
|
0 |
0 |
over 4 years ago |
0 |
|
1 |
|
Python |
| A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars |
| alvations/mini-segmenter |
6 |
|
0 |
0 |
about 11 years ago |
0 |
|
0 |
|
Python |
| Lightweight lexicon/dictionary based Chinese text segmenter |