| juand-r/entity-recognition-datasets |
1,386 |
|
0 |
0 |
over 2 years ago |
0 |
|
7 |
mit |
Python |
| A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types. |
| propbank/propbank-release |
112 |
|
0 |
0 |
over 3 years ago |
0 |
|
11 |
cc-by-sa-4.0 |
|
| The official released annotations, both in .prop pointer format and as conll files. Does not contain the source texts |
| Yale-LILY/TutorialBank |
85 |
|
0 |
0 |
about 3 years ago |
0 |
|
0 |
|
HTML |
| UniversalDependencies/UD_Russian-SynTagRus |
77 |
|
0 |
0 |
over 2 years ago |
0 |
|
16 |
other |
Perl |
| Russian data from the SynTagRus corpus. |
| amir-zeldes/gum |
76 |
|
0 |
0 |
over 2 years ago |
0 |
|
6 |
other |
Python |
| Repository for the Georgetown University Multilayer Corpus (GUM) |
| ku-nlp/KWDLC |
71 |
|
0 |
0 |
over 2 years ago |
0 |
|
12 |
|
Python |
| Kyoto University Web Document Leads Corpus |
| korpling/ANNIS |
67 |
|
4 |
4 |
about 2 years ago |
45 |
February 03, 2023 |
44 |
apache-2.0 |
Java |
| ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with diverse types of annotation. |
| bdhingra/quasar |
64 |
|
0 |
0 |
about 8 years ago |
0 |
|
1 |
bsd-2-clause |
Python |
| Datasets for Question Answering by Search and Reading |
| nickyringland/nested_named_entities |
60 |
|
0 |
0 |
over 2 years ago |
0 |
|
0 |
|
Python |
| proycon/folia |
60 |
|
2 |
2 |
over 2 years ago |
93 |
October 08, 2021 |
21 |
gpl-3.0 |
Python |
| FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for processing FoLiA is implemented as part of PyNLPl, this contains higher-level tools that use the library as well as the full documentation, validation schemas, and set definitions |