Polyglot Alternatives

Multilingual text (NLP) processing toolkit

Stars: 2,212
License: other
Open Issues: 166
Most Recent Commit: over 2 years ago
Programming Language: Python
Dependent Repos: 65
Dependent Packages: 28
Total Releases: 9
Latest Release: December 15, 2021

Categories:
Programming Languages > Python
Machine Learning > Natural Language Processing
Text Processing > Multilingual

Alternatives To aboSamoor/polyglot

Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
aboSamoor/polyglot	2,212	65	28	over 2 years ago	9	December 15, 2021	166	other	Python
Multilingual text (NLP) processing toolkit
HIT-SCIR/ELMoForManyLangs	1,325	1	1	over 5 years ago	4	October 15, 2020		mit	Python
Pre-trained ELMo Representations for Many Languages
MilaNLProc/contextualized-topic-models	1,141	0	4	over 2 years ago	30	November 03, 2022	10	mit	Python
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.
bheinzerling/bpemb	1,068	15	86	almost 4 years ago	13	September 23, 2022	4	mit	Python
Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
google-research-datasets/wit	896	0	0	over 2 years ago	0		3	other
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
unitaryai/detoxify	774	0	10	over 2 years ago	11	December 19, 2022	41	apache-2.0	Python
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.
nlp-uoregon/trankit	693	0	2	over 2 years ago	20	March 26, 2022	24	apache-2.0	Python
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
dccuchile/beto	462	0	0	almost 3 years ago	0		6	cc-by-4.0
BETO - Spanish version of the BERT model
filyp/autocorrect	376	18	30	almost 3 years ago	27	December 04, 2021	7	lgpl-3.0	Python
Spelling corrector in python
artitw/text2text	268	0	0	over 2 years ago	134	October 21, 2023	27	other	Python
Text2Text: Crosslingual NLP/G toolkit

Popular Multilingual Projects

PaddlePaddle/PaddleOCR ⭐ 36,076

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

adityatelange/hugo-PaperMod ⭐ 7,897

A fast, clean, responsive Hugo theme.

facebookresearch/LASER ⭐ 3,460

Language-Agnostic SEntence Representations

fluentmigrator/fluentmigrator ⭐ 3,076

Fluent migrations framework for .NET

facebookresearch/MUSE ⭐ 2,844

A library for Multilingual Unsupervised or Supervised word Embeddings

Popular Natural Language Processing Projects

huggingface/transformers ⭐ 119,240

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

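d2l-ai/d2l-zh ⭐ 53,401

Dive into Deep Learning: written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.

apachecn/ailearning ⭐ 37,352

AiLearning: data analysis + machine learning in practice + linear algebra + PyTorch + NLTK + TF2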

hankcs/HanLP ⭐ 36,433

Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification

google-research/bert ⭐ 36,099

TensorFlow code and pre-trained models for BERT
