Caption Anything Alternatives

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
Suggest Alternative
Alternatives To ttengwang/Caption-Anything
Project Name Stars Downloads Repos Using This Packages Using This Most Recent Commit Total Releases Latest Release Open Issues License Language
salesforce/LAVIS 11,166 0 5 over 1 year ago 12 March 06, 2023 366 bsd-3-clause Jupyter Notebook
LAVIS - A One-stop Library for Language-Vision Intelligence
salesforce/BLIP 3,558 0 0 over 2 years ago 0 98 bsd-3-clause Jupyter Notebook
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
OpenGVLab/InternGPT 2,976 0 0 over 2 years ago 0 18 apache-2.0 Python
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)
OFA-Sys/OFA 2,142 0 0 over 2 years ago 0 90 apache-2.0 Python
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning 2,084 0 0 over 3 years ago 0 97 mit Python
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
ttengwang/Caption-Anything 1,374 0 0 over 2 years ago 0 9 bsd-3-clause Python
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
imaginary-cloud/CameraManager 1,328 38 0 over 2 years ago 49 April 20, 2020 50 mit Swift
Simple Swift class to provide all the configurations you need to create custom camera view in your app
NVlabs/prismer 1,245 0 0 about 2 years ago 0 0 other Python
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
microsoft/Oscar 995 0 0 over 2 years ago 0 137 mit Python
Oscar and VinVL
peteanderson80/bottom-up-attention 979 0 0 about 5 years ago 0 56 mit Jupyter Notebook
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Alternatives To ttengwang/Caption-Anything
Select To Compare


Alternative Project Comparisons
Popular Image Captioning Projects
Popular Projects Projects
Popular Machine Learning Categories
Related Searches
Get A Weekly Email With Trending Projects
No Spam. Unsubscribe easily at any time.
Privacy | About | Terms | Follow Us On Twitter

Downloads, Dependent Repos, Dependent Packages, Total Releases, Latest Releases data powered by Libraries.io.

Copyright 2018-2026 Awesome Open Source.  All rights reserved.