| brightmart/nlp_chinese_corpus |
8,344 |
|
0 |
0 |
almost 3 years ago |
0 |
|
20 |
mit |
|
| 大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP |
| didi/ChineseNLP |
1,329 |
|
0 |
0 |
over 4 years ago |
0 |
|
3 |
|
HTML |
| Datasets, SOTA results of every fields of Chinese NLP |
| chatopera/insuranceqa-corpus-zh |
983 |
|
0 |
0 |
over 2 years ago |
11 |
November 15, 2023 |
9 |
other |
Python |
| :helicopter: 保险行业语料库,聊天机器人 |
| OpenBioLink/ThoughtSource |
680 |
|
0 |
0 |
over 2 years ago |
0 |
|
12 |
mit |
Jupyter Notebook |
| A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/ |
| stanfordnlp/mac-network |
445 |
|
0 |
0 |
about 5 years ago |
0 |
|
9 |
apache-2.0 |
Python |
| Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018) |
| salesforce/DialogStudio |
356 |
|
0 |
0 |
over 2 years ago |
0 |
|
0 |
apache-2.0 |
Python |
| DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI |
| ymcui/cmrc2018 |
313 |
|
0 |
0 |
almost 4 years ago |
0 |
|
4 |
cc-by-sa-4.0 |
Python |
| A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018) |
| abachaa/MedQuAD |
275 |
|
0 |
0 |
over 2 years ago |
0 |
|
4 |
other |
|
| Medical Question Answering Dataset of 47,457 QA pairs created from 12 NIH websites |
| mandarjoshi90/triviaqa |
227 |
|
0 |
0 |
over 2 years ago |
0 |
|
2 |
apache-2.0 |
Python |
| Code for the TriviaQA reading comprehension dataset |
| wenhuchen/OTT-QA |
141 |
|
0 |
0 |
over 2 years ago |
0 |
|
3 |
mit |
Python |
| Code and Data for ICLR2021 Paper "Open Question Answering over Tables and Text" |