Amalgum Alternatives

English web corpus with 4M tokens and several annotation types
Alternatives To gucorpling/amalgum
Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language
stanfordnlp/GloVe | 6,480 | - | 0 | 0 | over 2 years ago | 0 | - | 80 | apache-2.0 | C
  Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings
bhoov/exbert | 541 | - | 0 | 0 | over 2 years ago | 0 | - | 18 | apache-2.0 | Python
  A visual analysis tool to explore learned representations in Transformer models
CLUEbenchmark/CLUECorpus2020 | 517 | - | 0 | 0 | over 3 years ago | 0 | - | 8 | mit | -
  Large-scale pre-training corpus for Chinese (100G)
barrucadu/markov | 441 | - | 0 | 0 | about 10 years ago | 0 | - | 2 | wtfpl | Python
  Markov chain text generator, as used for KingJamesProgramming
kmkurn/id-nlp-resource | 211 | - | 0 | 0 | about 4 years ago | 0 | - | 1 | - | -
  A list of Indonesian NLP resources
icoxfog417/fastTextJapaneseTutorial | 174 | - | 0 | 0 | over 9 years ago | 0 | - | 0 | mit | Python
  Tutorial on training fastText with a Japanese corpus
fozziethebeat/TopicModelComparison | 78 | - | 0 | 0 | over 13 years ago | 0 | - | 1 | - | Scala
  Scripts and code for replicating the experiments published in "Exploring Topic Coherence over many models and many topics"
famrashel/idn-tagged-corpus | 76 | - | 0 | 0 | almost 4 years ago | 0 | - | 1 | - | -
  Indonesian manually tagged corpus
languageMIT/naturalstories | 31 | - | 0 | 0 | over 4 years ago | 0 | - | 1 | other | Python
  Corpus of naturalistic stories with annotation and psycholinguistic measures
proiel/proiel-treebank | 31 | - | 0 | 0 | almost 3 years ago | 0 | - | 2 | - | -
  Official releases of the PROIEL treebank of ancient Indo-European languages

Downloads, Dependent Repos, Dependent Packages, Total Releases, Latest Releases data powered by Libraries.io.

Copyright 2018-2026 Awesome Open Source.  All rights reserved.