| wainshine/Chinese-Names-Corpus |
3,719 |
|
0 |
0 |
over 2 years ago |
0 |
|
7 |
apache-2.0 |
|
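| Chinese personal names corpus. Name generator. Chinese full names, surnames, given names, forms of address, Japanese names, transliterated names, English names. Useful for Chinese word segmentation and person-name entity recognition. |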
| dbiir/UER-py |
2,802 |
|
0 |
0 |
over 2 years ago |
0 |
|
132 |
apache-2.0 |
Python |
| Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo |
| CLUEbenchmark/CLUEDatasetSearch |
2,778 |
|
0 |
0 |
over 3 years ago |
0 |
|
6 |
|
Python |
| Search all Chinese NLP datasets, plus commonly used English NLP datasets |
| juand-r/entity-recognition-datasets |
1,386 |
|
0 |
0 |
over 2 years ago |
0 |
|
7 |
mit |
Python |
| A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types. |
| wainshine/Company-Names-Corpus |
1,106 |
|
0 |
0 |
over 3 years ago |
0 |
|
3 |
apache-2.0 |
|
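| Company names corpus. Organization names corpus. Company short names, abbreviations, brand terms, enterprise names. Useful for Chinese word segmentation and organization-name entity recognition. |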
| VinAIResearch/BERTweet |
542 |
|
0 |
0 |
over 2 years ago |
0 |
|
0 |
mit |
Python |
| BERTweet: A pre-trained language model for English Tweets (EMNLP-2020) |
| monikkinom/ner-lstm |
528 |
|
0 |
0 |
about 7 years ago |
0 |
|
12 |
|
Python |
| Named Entity Recognition using multilayered bidirectional LSTM |
| ko-nlp/Korpora |
500 |
|
0 |
3 |
over 3 years ago |
7 |
January 11, 2021 |
28 |
cc-by-4.0 |
Python |
| Korean corpus repository |
| OYE93/Chinese-NLP-Corpus |
378 |
|
0 |
0 |
over 5 years ago |
0 |
|
1 |
|
Python |
| Collections of Chinese NLP corpus |
| stefan-it/turkish-bert |
364 |
|
0 |
0 |
about 3 years ago |
0 |
|
11 |
|
Python |
| Turkish BERT/DistilBERT, ELECTRA and ConvBERT models |