| RaRe-Technologies/gensim-data | 492 | | 0 | 0 | about 8 years ago | 0 | | 14 | lgpl-2.1 | Python |
| Data repository for pretrained NLP models and NLP corpora. |
| deepmind/narrativeqa | 362 | | 0 | 0 | about 6 years ago | 0 | | 0 | apache-2.0 | Shell |
| This repository contains the NarrativeQA dataset. It includes the list of documents with Wikipedia summaries, links to full stories, and questions and answers. |
| chakki-works/chakin | 313 | | 1 | 0 | about 7 years ago | 7 | March 27, 2019 | 8 | mit | Python |
| Simple downloader for pre-trained word vectors |
| bcicen/wikitables | 279 | | 3 | 1 | over 4 years ago | 14 | August 26, 2021 | 4 | mit | Python |
| Import tables from any Wikipedia article as a dataset in Python |
| algolia/datasets | 192 | | 0 | 0 | over 2 years ago | 0 | | 1 | | CSS |
| Interesting datasets you could use with Algolia |
| Pinafore/qb | 160 | | 0 | 0 | about 4 years ago | 0 | | 7 | mit | Python |
| QANTA Quiz Bowl AI |
| saschagobel/legislatoR | 90 | | 0 | 0 | about 2 years ago | 1 | April 24, 2020 | 1 | | R |
| Interface to the Comparative Legislators Database |
| shmsw25/AmbigQA | 86 | | 0 | 0 | over 4 years ago | 0 | | 0 | | Python |
| An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions" |
| emijrp/awesome-wikipedia | 76 | | 0 | 0 | over 2 years ago | 0 | | 2 | cc0-1.0 | |
| A curated list of awesome Wikipedia-related frameworks, libraries, software, datasets and references. |
| koomri/text-segmentation | 73 | | 0 | 0 | over 6 years ago | 0 | | 3 | | Python |
| Implementation of the paper: Text Segmentation as a Supervised Learning Task |