| Repository | Stars | Last commit | Releases | Latest release | Open issues | License | Language | Description |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| OpenGVLab/InternGPT | 2,976 | over 2 years ago | 0 | | 18 | apache-2.0 | Python | InternGPT (iGPT) is an open-source demo platform where you can easily showcase your AI models. It currently supports DragGAN, ChatGPT, ImageBind, GPT-4-style multimodal chat, SAM, interactive image editing, and more. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM). |
| NVlabs/prismer | 1,245 | about 2 years ago | 0 | | 0 | other | Python | The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts". |
| microsoft/Oscar | 995 | over 2 years ago | 0 | | 137 | mit | Python | Oscar and VinVL. |
| peteanderson80/bottom-up-attention | 979 | about 5 years ago | 0 | | 56 | mit | Jupyter Notebook | Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome. |
| subho406/OmniNet | 426 | over 5 years ago | 0 | | 1 | apache-2.0 | Python | Official PyTorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" (authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain). |
| TheoCoombes/ClipCap | 64 | about 3 years ago | 1 | May 29, 2022 | 4 | | Python | Uses pretrained encoder and language models to generate captions from multimedia inputs. |
| X-PLUG/mPLUG | 15 | almost 3 years ago | 0 | | 0 | | Python | mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections (EMNLP 2022). |
| YangLiu9208/CausalVLR | 11 | over 2 years ago | 0 | | 0 | | | CausalVLR: a toolbox and benchmark for visual-linguistic causal reasoning. |
| anujanegi/VQA | 6 | over 6 years ago | 0 | | 0 | mit | Python | Visual Question Answering system. |