| sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning |
2,084 |
|
0 |
0 |
over 3 years ago |
0 |
|
97 |
mit |
Python |
| Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning |
| ruotianluo/self-critical.pytorch |
964 |
|
0 |
0 |
over 2 years ago |
0 |
|
84 |
mit |
Python |
| Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others. |
| njustkmg/OMML |
528 |
|
0 |
0 |
almost 3 years ago |
0 |
|
0 |
apache-2.0 |
Python |
| Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption. |
| kuanghuei/SCAN |
442 |
|
0 |
0 |
about 3 years ago |
0 |
|
18 |
apache-2.0 |
Python |
| PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018) |
| aimagelab/meshed-memory-transformer |
420 |
|
0 |
0 |
over 3 years ago |
0 |
|
48 |
bsd-3-clause |
Python |
| Meshed-Memory Transformer for Image Captioning. CVPR 2020 |
| aimagelab/show-control-and-tell |
273 |
|
0 |
0 |
over 3 years ago |
0 |
|
13 |
bsd-3-clause |
Python |
| Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019 |
| krasserm/fairseq-image-captioning |
134 |
|
0 |
0 |
over 5 years ago |
0 |
|
8 |
apache-2.0 |
Python |
| Transformer-based image captioning extension for pytorch/fairseq |
| snrazavi/Deep_Learning_in_Python_2018 |
114 |
|
0 |
0 |
about 3 years ago |
0 |
|
1 |
|
Jupyter Notebook |
| Deep Learning workshop including image classification, face recognition, Object detection, language modelling, image captioning and neural machine translation. |
| zhiqwang/sightseq |
109 |
|
0 |
0 |
over 6 years ago |
0 |
|
2 |
mit |
Python |
| 🔭 Computer vision tools for fairseq, containing PyTorch implementation of text recognition and object detection |
| AlexMa011/pytorch-polygon-rnn |
101 |
|
0 |
0 |
about 4 years ago |
0 |
|
4 |
gpl-3.0 |
Python |
| Pytorch implementation of Polygon-RNN(http://www.cs.toronto.edu/polyrnn/poly_cvpr17/) |