Awesome Open Source
The Top 10 Chinese Text Segmentation Open Source Projects
Open source projects categorized as Chinese Text Segmentation
wolfgarbe/SymSpell
⭐ 2,970
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Dependent packages: 0
Total releases: 0
Most recent commit: over 2 years ago
koth/kcws
⭐ 2,044
Deep learning Chinese word segmentation
Dependent packages: 0
Total releases: 0
Most recent commit: almost 8 years ago
fukuball/jieba-php
⭐ 1,193
"Jieba" (Chinese for "to stutter") Chinese text segmentation: built to be the best PHP Chinese word segmentation module.
Dependent packages: 0
Total releases: 0
Most recent commit: over 3 years ago
lionsoul2014/jcseg
⭐ 886
Jcseg is a lightweight NLP framework written in Java. It provides CJK and English segmentation based on the MMSEG algorithm, along with keyword extraction, key sentence extraction, and summary extraction based on the TextRank algorithm. Jcseg has a built-in HTTP server and search modules for Lucene, Solr, Elasticsearch, and OpenSearch.
Dependent packages: 0
Total releases: 0
Most recent commit: over 2 years ago
mammothb/symspellpy
⭐ 693
Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Dependent packages: 0
Total releases: 0
Most recent commit: almost 3 years ago
amutu/zhparser
⭐ 627
zhparser is a PostgreSQL extension for full-text search of Chinese-language text.
Dependent packages: 0
Total releases: 0
Most recent commit: about 2 years ago
hankcs/hanlp-lucene-plugin
⭐ 284
HanLP Chinese word segmentation plugin for Lucene, supporting Lucene-based systems including Solr.
Dependent packages: 0
Total releases: 0
Most recent commit: over 5 years ago
qinwf/jiebaR
⭐ 277
Chinese text segmentation with R (documentation updated: https://qinwenfeng.com/jiebaR/).
Dependent packages: 0
Total releases: 0
Most recent commit: over 6 years ago
yongzhuo/Pytorch-NLU
⭐ 226
Pytorch-NLU is a Chinese text classification and sequence labeling toolkit. It supports multi-class and multi-label classification of long and short Chinese texts, and sequence labeling tasks such as Chinese named entity recognition, part-of-speech tagging, and word segmentation.
Dependent packages: 0
Total releases: 0
Most recent commit: over 2 years ago
blueshen/ik-analyzer
⭐ 191
Tokenizer supporting Lucene 5/6/7/8/9+ versions; long-term support (LTS).
Dependent packages: 0
Total releases: 0
Most recent commit: over 2 years ago
Copyright 2018-2026 Awesome Open Source. All rights reserved.