1,169 skills found · Page 1 of 39
vladmandic / SdnextSD.Next: All-in-one WebUI for AI generative image and video creation, captioning and processing
karpathy / Neuraltalk2Efficient Image Captioning code in Torch, runs on GPU
ashnkumar / Sketch CodeKeras model to generate HTML code from hand-drawn website mockups. Implements an image captioning architecture to drawn source images.
sgrvinod / A PyTorch Tutorial To Image CaptioningShow, Attend, and Tell | a PyTorch Tutorial to Image Captioning
stephengpope / No Code Architects ToolkitThe NCA Toolkit API eliminates monthly subscription fees by consolidating common API functionalities into a single FREE API. Designed for businesses, creators, and developers, it streamlines advanced media processing, including video editing and captioning, image transformations, cloud storage, and Python code execution.
ttengwang / Caption AnythingCaption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
yawiii / ComfyUI Prompt Assistant提示词小助手可以一键调用智谱、硅基流动、gemini、本地ollama、百度等大语言模型服务,实现提示词翻译、润色扩写、图片反推。支持提示词预设实现一键插入、历史提示词查找等功能。是一个全能型提示词插件。The Prompt Assistant enables one-click access to LLMs/VLMs for prompt translation, expansion, and image captioning. It also supports one-click preset insertion and historical prompt search.
jcjohnson / DensecapDense image captioning in Torch
ruotianluo / ImageCaptioning.pytorchI decide to sync up this repo and self-critical.pytorch. (The old master is in old master branch for archive)
NVlabs / Describe Anything[ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning
peteanderson80 / Bottom Up AttentionBottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
rmokady / CLIP Prefix CaptionSimple image captioning model
brh55 / React Native Masonry:raised_hands: A pure JS react-native component to render a masonry~ish layout for images with support for dynamic columns, progressive image loading, device rotation, on-press handlers, and headers/captions.
jhc13 / TagguiTag manager and captioner for image datasets
lucidrains / CoCa PytorchImplementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
W2GenAI-Lab / LucidFluxLucidFlux: Caption-Free Photo-Realistic Image Restoration via a Large-Scale Diffusion Transformer, ICLR 2026
fpgaminer / JoycaptionJoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.
zhjohnchan / Awesome Image CaptioningA curated list of image captioning and related area resources. :-)
ruotianluo / Self Critical.pytorchUnofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.
YehLi / XmodalerX-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).