| gambolputty/german-nouns | 107 | | 0 | 0 | almost 3 years ago | 6 | July 17, 2022 | 5 | cc-by-sa-4.0 | Python | A list of ~100,000 German nouns and their grammatical properties, compiled from WiktionaryDE as a CSV file, plus a module to look up the data and parse compound words. |
| sfu-discourse-lab/SOCC | 79 | | 0 | 0 | almost 5 years ago | 0 | | 0 | other | Python | SFU Opinion and Comments Corpus |
| notnews/nytimes-corpus-extractor | 24 | | 0 | 0 | over 4 years ago | 0 | | 0 | | Python | Extract all the fields from the NY Times Corpus to a CSV file |
| AndrewSB/TwitterPMI | 17 | | 0 | 0 | over 11 years ago | 0 | | 0 | | Python | Python script that computes pointwise mutual information on a Twitter corpus |
| eea/eea.corpus | 13 | | 0 | 0 | over 3 years ago | 0 | | 24 | gpl-3.0 | Python | Machine learning and natural language processing of the EEA Corpus using spaCy, Textacy, pyLDAvis, and other NLP algorithms. |
| ThomasK81/TEItoCEX | 9 | | 0 | 0 | almost 5 years ago | 12 | August 30, 2020 | 0 | mit | Go | Turn CTS TEI corpora into CEX collection files |
| magnusnissel/birdbody | 7 | | 0 | 0 | over 9 years ago | 9 | February 26, 2016 | 0 | gpl-3.0 | Python | A tool for creating Twitter corpora |
| Quantyca/deepitalian |
7 |
|
0 |
0 |
almost 7 years ago |
0 |
|
1 |
|
Jupyter Notebook |