Results for "image-caption"

1,169 skills found · Page 1 of 39

vladmandic / Sdnext

7.0k

SD.Next: All-in-one WebUI for AI generative image and video creation, captioning and processing

universal

ai-art, caption, diffusers, +7

Updated 2h ago

karpathy / Neuraltalk2

5.6k

Efficient Image Captioning code in Torch, runs on GPU

universal

Updated 2d ago

ashnkumar / Sketch Code

5.2k

Keras model to generate HTML code from hand-drawn website mockups. Applies an image captioning architecture to hand-drawn source images.

universal

augmentation, deep-learning, image-processing, +2

Updated 1d ago

sgrvinod / A PyTorch Tutorial To Image Captioning

2.9k

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

universal

attention-mechanism, computer-vision, encoder-decoder, +5

Updated 2h ago

stephengpope / No Code Architects Toolkit

2.3k

The NCA Toolkit API eliminates monthly subscription fees by consolidating common API functionalities into a single FREE API. Designed for businesses, creators, and developers, it streamlines advanced media processing, including video editing and captioning, image transformations, cloud storage, and Python code execution.

universal

Updated 2h ago

ttengwang / Caption Anything

1.8k

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything

universal

chatgpt, controllable-generation, controllable-image-captioning, +2

Updated 7d ago

yawiii / ComfyUI Prompt Assistant

1.7k

The Prompt Assistant gives one-click access to LLM/VLM services such as Zhipu, SiliconFlow, Gemini, local Ollama, and Baidu for prompt translation, polishing and expansion, and image captioning (deriving a prompt from an image). It also supports one-click insertion of prompt presets and searching of prompt history. An all-in-one prompt plugin.

gemini cli

comfyui, expand, prompt, +2

Updated 5h ago

jcjohnson / Densecap

1.6k

Dense image captioning in Torch

universal

Updated 6d ago

ruotianluo / ImageCaptioning.pytorch

1.5k

I decided to sync up this repo and self-critical.pytorch. (The old master is kept in the old master branch for archival.)

universal

Updated 2d ago

NVlabs / Describe Anything

1.5k

[ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning

zed

describe-anything, detailed-localized-captioning, large-multimodal-models, +1

Updated 8h ago

peteanderson80 / Bottom Up Attention

1.5k

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

universal

caffe, captioning-images, faster-rcnn, +5

Updated 1mo ago

rmokady / CLIP Prefix Caption

1.4k

Simple image captioning model

universal

Updated 13h ago

brh55 / React Native Masonry

1.4k

A pure JS react-native component to render a masonry~ish layout for images with support for dynamic columns, progressive image loading, device rotation, on-press handlers, and headers/captions.

universal

masonry, masonry-grid, masonry-layout, +4

Updated 3d ago

jhc13 / Taggui

1.3k

Tag manager and captioner for image datasets

universal

cogvlm, florence-2, image-captioning, +5

Updated 1h ago

lucidrains / CoCa Pytorch

1.2k

Implementation of CoCa (Contrastive Captioners are Image-Text Foundation Models) in PyTorch

universal

artificial-intelligence, attention-mechanism, contrastive-learning, +4

Updated 23h ago

W2GenAI-Lab / LucidFlux

1.2k

LucidFlux: Caption-Free Photo-Realistic Image Restoration via a Large-Scale Diffusion Transformer, ICLR 2026

universal

Updated 5h ago

fpgaminer / Joycaption

1.1k

JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.

universal

captioning, joycaption, vlm

Updated 12h ago

zhjohnchan / Awesome Image Captioning

1.1k

A curated list of image captioning and related area resources. :-)

universal

Updated 4d ago

ruotianluo / Self Critical.pytorch

1.0k

Unofficial PyTorch implementation of Self-critical Sequence Training for Image Captioning, and others.

universal

image-captioning

Updated 6d ago

YehLi / Xmodaler

968

X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

universal

cross-modal-retrieval, image-captioning, pretraining, +4

Updated 3d ago