| brightmart/nlp_chinese_corpus |
8,344 |
|
0 |
0 |
almost 3 years ago |
0 |
|
20 |
mit |
|
| Large Scale Chinese Corpus for NLP |
| nl8590687/ASRT_SpeechRecognition |
7,253 |
|
0 |
0 |
about 2 years ago |
1 |
October 23, 2020 |
101 |
gpl-3.0 |
Python |
| A Deep-Learning-Based Chinese Speech Recognition System |
| shibing624/pycorrector |
4,928 |
|
0 |
1 |
about 2 years ago |
30 |
November 07, 2023 |
27 |
apache-2.0 |
Python |
| pycorrector is a toolkit for text error correction. It applies models such as Kenlm, T5, MacBERT, ChatGLM3, and LLaMA to error-correction scenarios and works out of the box. |
| wainshine/Chinese-Names-Corpus |
3,719 |
|
0 |
0 |
over 2 years ago |
0 |
|
7 |
apache-2.0 |
|
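| Chinese names corpus and name generator. Covers Chinese full names, surnames, given names, forms of address, Japanese names, transliterated names, and English names. Useful for Chinese word segmentation and person-name entity recognition. |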
| CLUEbenchmark/CLUE |
3,345 |
|
0 |
0 |
almost 3 years ago |
0 |
|
73 |
|
Python |
| Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard |
| dbiir/UER-py |
2,802 |
|
0 |
0 |
over 2 years ago |
0 |
|
132 |
apache-2.0 |
Python |
| Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo |
| CLUEbenchmark/CLUEDatasetSearch |
2,778 |
|
0 |
0 |
over 3 years ago |
0 |
|
6 |
|
Python |
| Search all Chinese NLP datasets, with commonly used English NLP datasets included |
| jinfagang/weibo_terminater |
2,265 |
|
0 |
0 |
over 6 years ago |
0 |
|
9 |
|
Python |
| Final Weibo Crawler: Scrape Anything From Weibo: comments, weibo contents, followers, anything. The Terminator |
| imcaspar/gpt2-ml |
1,674 |
|
0 |
0 |
almost 3 years ago |
0 |
|
22 |
apache-2.0 |
Python |
| GPT2 for Multiple Languages, including pretrained models. Multilingual GPT2 support, with a 1.5-billion-parameter Chinese pretrained model |
| crownpku/Rasa_NLU_Chi |
1,466 |
|
0 |
0 |
over 2 years ago |
0 |
|
79 |
apache-2.0 |
Python |
| Turn Chinese natural language into structured data (Chinese natural language understanding) |