Wit Alternatives

WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
Suggest Alternative
Alternatives To google-research-datasets/wit
Project Name Stars Downloads Repos Using This Packages Using This Most Recent Commit Total Releases Latest Release Open Issues License Language
aboSamoor/polyglot 2,212 65 28 over 2 years ago 9 December 15, 2021 166 other Python
Multilingual text (NLP) processing toolkit
zelon88/HRConvert2 746 0 0 over 2 years ago 0 8 gpl-3.0 PHP
A self-hosted, drag-and-drop & nosql file conversion server & share tool that supports 86 file formats in 13 languages.
ttop32/ImageScanOCR 16 0 0 over 3 years ago 0 3 mit C#
Convert image and pdf to text using Window OCR
penn-nlp/mmid 10 0 0 about 7 years ago 0 1
Words and their images in 98 languages
MX-Futhark/text-position-detector 9 0 0 almost 8 years ago 0 0 mit Java
Detects rectangular regions containing multilingual text in an image.
revollat/sulu-docker 7 0 0 about 9 years ago 0 1
Dockerized Sulu CMS (http://sulu.io/) (Multisite, multilingual CMS based on Symfony full stack and CMF (http://cmf.symfony.com/)
turbulent/docker-polyglot-base 5 0 0 over 8 years ago 0 0 gpl-3.0
Alpinx-Linux-based image with Polyglot installed. It is a natural language pipeline that supports massive multilingual applications.
Alternatives To google-research-datasets/wit
Select To Compare


Alternative Project Comparisons
Popular Image Projects
Popular Multilingual Projects
Popular Media Categories
Related Searches
Get A Weekly Email With Trending Projects
No Spam. Unsubscribe easily at any time.
Privacy | About | Terms | Follow Us On Twitter

Downloads, Dependent Repos, Dependent Packages, Total Releases, Latest Releases data powered by Libraries.io.

Copyright 2018-2026 Awesome Open Source.  All rights reserved.