| lucidrains/vit-pytorch | 16,298 | | 0 | 6 | over 2 years ago | 184 | November 15, 2023 | 114 | mit | Python | Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch |
| google-research/scenic | 2,733 | | 0 | 0 | about 2 years ago | 0 | | 213 | apache-2.0 | Python | Scenic: A Jax Library for Computer Vision Research and Beyond |
| sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning | 2,084 | | 0 | 0 | over 3 years ago | 0 | | 97 | mit | Python | Show, Attend, and Tell \| a PyTorch Tutorial to Image Captioning |
| lucidrains/lambda-networks | 1,110 | | 0 | 0 | over 5 years ago | 11 | November 18, 2020 | 8 | mit | Python | Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute |
| pprp/SimpleCVReproduction | 1,021 | | 0 | 0 | over 3 years ago | 0 | | 3 | apache-2.0 | Jupyter Notebook | Replication of simple CV Projects including attention, classification, detection, keypoint detection, etc. |
| Ha0Tang/AttentionGAN | 564 | | 0 | 0 | almost 3 years ago | 0 | | 16 | other | Python | AttentionGAN for Unpaired Image-to-Image Translation & Multi-Domain Image-to-Image Translation |
| kuanghuei/SCAN | 442 | | 0 | 0 | about 3 years ago | 0 | | 18 | apache-2.0 | Python | PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018) |
| rayleizhu/BiFormer | 288 | | 0 | 0 | almost 3 years ago | 0 | | 1 | mit | Python | [CVPR 2023] Official code release of our paper "BiFormer: Vision Transformer with Bi-Level Routing Attention" |
| faustomorales/vit-keras | 276 | | 0 | 2 | almost 3 years ago | 19 | July 22, 2021 | 3 | apache-2.0 | Python | Keras implementation of ViT (Vision Transformer) |
| KaiyuYue/cgnl-network.pytorch | 253 | | 0 | 0 | about 5 years ago | 0 | | 1 | mit | Python | Compact Generalized Non-local Network (NIPS 2018) |