| salesforce/LAVIS |
11,166 |
|
0 |
5 |
over 1 year ago |
12 |
March 06, 2023 |
366 |
bsd-3-clause |
Jupyter Notebook |
| LAVIS - A One-stop Library for Language-Vision Intelligence |
| salesforce/BLIP |
3,558 |
|
0 |
0 |
over 2 years ago |
0 |
|
98 |
bsd-3-clause |
Jupyter Notebook |
| PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation |
| OpenGVLab/InternGPT |
2,976 |
|
0 |
0 |
over 2 years ago |
0 |
|
18 |
apache-2.0 |
Python |
| InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统) |
| OFA-Sys/OFA |
2,142 |
|
0 |
0 |
over 2 years ago |
0 |
|
90 |
apache-2.0 |
Python |
| Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework |
| sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning |
2,084 |
|
0 |
0 |
over 3 years ago |
0 |
|
97 |
mit |
Python |
| Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning |
| ttengwang/Caption-Anything |
1,374 |
|
0 |
0 |
over 2 years ago |
0 |
|
9 |
bsd-3-clause |
Python |
| Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything |
| imaginary-cloud/CameraManager |
1,328 |
|
38 |
0 |
over 2 years ago |
49 |
April 20, 2020 |
50 |
mit |
Swift |
| Simple Swift class to provide all the configurations you need to create custom camera view in your app |
| NVlabs/prismer |
1,245 |
|
0 |
0 |
about 2 years ago |
0 |
|
0 |
other |
Python |
| The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts". |
| microsoft/Oscar |
995 |
|
0 |
0 |
over 2 years ago |
0 |
|
137 |
mit |
Python |
| Oscar and VinVL |
| peteanderson80/bottom-up-attention |
979 |
|
0 |
0 |
about 5 years ago |
0 |
|
56 |
mit |
Jupyter Notebook |
| Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome |