| stanfordnlp/GloVe | 6,480 | | 0 | 0 | over 2 years ago | 0 | | 80 | apache-2.0 | C |
| Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings |
| bhoov/exbert | 541 | | 0 | 0 | over 2 years ago | 0 | | 18 | apache-2.0 | Python |
| A Visual Analysis Tool to Explore Learned Representations in Transformers Models |
| CLUEbenchmark/CLUECorpus2020 | 517 | | 0 | 0 | over 3 years ago | 0 | | 8 | mit | |
| Large-scale Pre-training Corpus for Chinese 100G (Chinese pre-training corpus) |
| barrucadu/markov | 441 | | 0 | 0 | about 10 years ago | 0 | | 2 | wtfpl | Python |
| Markov chain text generator, as used for KingJamesProgramming |
| kmkurn/id-nlp-resource | 211 | | 0 | 0 | about 4 years ago | 0 | | 1 | | |
| A list of Indonesian NLP resources. |
| icoxfog417/fastTextJapaneseTutorial | 174 | | 0 | 0 | over 9 years ago | 0 | | 0 | mit | Python |
| Tutorial to train fastText with Japanese corpus |
| fozziethebeat/TopicModelComparison | 78 | | 0 | 0 | over 13 years ago | 0 | | 1 | | Scala |
| Scripts and code for replicating the experiments published in "Exploring Topic Coherence over Many Models and Many Topics" |
| famrashel/idn-tagged-corpus | 76 | | 0 | 0 | almost 4 years ago | 0 | | 1 | | |
| Indonesian Manually Tagged Corpus |
| languageMIT/naturalstories | 31 | | 0 | 0 | over 4 years ago | 0 | | 1 | other | Python |
| Corpus of naturalistic stories with annotation and psycholinguistic measures |
| proiel/proiel-treebank | 31 | | 0 | 0 | almost 3 years ago | 0 | | 2 | | |
| Official releases of the PROIEL treebank of ancient Indo-European languages |