| facebookresearch/mmf |
5,357 |
|
0 |
0 |
about 2 years ago |
0 |
|
145 |
other |
Python |
| A modular framework for vision & language multimodal research from Facebook AI Research (FAIR) |
| jayleicn/ClipBERT |
649 |
|
0 |
0 |
over 2 years ago |
0 |
|
12 |
mit |
Python |
| [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks. |
| hengyuan-hu/bottom-up-attention-vqa |
606 |
|
0 |
0 |
over 6 years ago |
0 |
|
15 |
gpl-3.0 |
Python |
| An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge. |
| Cadene/vqa.pytorch |
536 |
|
0 |
0 |
over 6 years ago |
0 |
|
19 |
|
Python |
| Visual Question Answering in Pytorch |
| davidmascharka/tbd-nets |
335 |
|
0 |
0 |
over 7 years ago |
0 |
|
2 |
mit |
Jupyter Notebook |
| PyTorch implementation of "Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning" |
| MILVLG/openvqa |
225 |
|
0 |
0 |
over 4 years ago |
0 |
|
1 |
apache-2.0 |
Python |
| A lightweight, scalable, and general framework for visual question answering research |
| Cyanogenoid/pytorch-vqa |
213 |
|
0 |
0 |
about 3 years ago |
0 |
|
5 |
|
Python |
| Strong baseline for visual question answering |
| vacancy/NSCL-PyTorch-Release |
209 |
|
0 |
0 |
almost 7 years ago |
0 |
|
7 |
mit |
Python |
| PyTorch implementation for the Neuro-Symbolic Concept Learner (NS-CL). |
| MILVLG/mcan-vqa |
181 |
|
0 |
0 |
almost 6 years ago |
0 |
|
2 |
apache-2.0 |
Python |
| Deep Modular Co-Attention Networks for Visual Question Answering |
| markdtw/vqa-winner-cvprw-2017 |
160 |
|
0 |
0 |
about 7 years ago |
0 |
|
3 |
|
Python |
| Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17 |