Results for "video-grounding"

Claude Code Claude Desktop GitHub Copilot Cursor Windsurf Cline Zed JetBrains

📄SKILL.md 🤖CLAUDE.md ⚡Claude Commands 📐.cursorrules 📐Cursor Rules 🕹️AGENTS.md 🧬codex.md 🏄.windsurfrules 🔧.clinerules 🧑‍✈️Copilot Instructions

All Development Operations Data Product Marketing Customer Design Sales

128 skills found · Page 1 of 5

IDEA-Research / Grounded SAM 2

3.4k

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

universal

Updated 4h ago

TheShadow29 / Awesome Grounding

1.1k

awesome grounding: A curated list of research papers in visual grounding

universal

arxivawesome-listcaptioning-images+15

Updated 4d ago

showlab / UniVTG

375

[ICCV 2023] UniVTG: Towards Unified Video-Language Temporal Grounding

universal

highlight-detectionmoment-retrievalpretraining+3

Updated 5d ago

ttengwang / Awesome Long Form Video Understanding

364

Awesome papers & datasets specifically focused on long-term videos.

universal

audio-visual-event-localizationdense-video-captioninglong-term-video+8

Updated 1d ago

facebookresearch / Grounded Video Description

332

Video Grounding and Captioning

universal

Updated 4mo ago

mbzuai-oryx / Video LLaVA

263

PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models

universal

groundingllmlmm+4

Updated 2d ago

antoyang / TubeDETR

194

[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers

universal

hc-stvgmultimodal-learningspatio-temporal-video-grounding+5

Updated 20d ago

Soldelli / MAD

174

MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions

universal

Updated 8h ago

sutdcv / Animal Kingdom

155

[CVPR2022] Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding

universal

action-recognitionanimal-behavioranimal-behavioral-understanding+8

Updated 4d ago

wjun0830 / CGDETR

153

Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grounding"

universal

computer-visiondetection-transformerdetr+9

Updated 26d ago

gyxxyg / TRACE

150

[ICLR 2025] TRACE: Temporal Grounding Video LLM via Casual Event Modeling

universal

dense-video-captioningmultimodal-large-language-modelsvideo-highlight-detection+2

Updated 1mo ago

yongliang-wu / NumPro

146

[CVPR2025] Number it: Temporal Grounding Videos like Flipping Manga

universal

Updated 5d ago

WHB139426 / Grounded Video LLM

142

[EMNLP 2025 Findings] Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models

universal

Updated 4d ago

iSEE-Laboratory / ReferDINO

136

(ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations

universal

Updated 7d ago

www-Ye / Time R1

135

R1-like Video-LLM for Temporal Grounding

universal

Updated 1mo ago

jayleicn / TVQAplus

133

[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering

universal

datasetpytorchtvqa+1

Updated 1mo ago

JonghwanMun / LGI4temporalgrounding

131

Repository for the CVPR-20 paper "Local-Global Video-Text Interactions for Temporal Grounding"

universal

Updated 1y ago

fletcherjiang / LLMEPET

130

[MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval

universal

pytorchvideo-grounding

Updated 7mo ago

zhongyingji / Guidedvd 3dgs

127

Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs (CVPR2025 Highlight)

universal

Updated 23d ago

gyxxyg / VTG LLM

126

[AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding

universal

dense-video-captioningmoment-retrievalmulti-modal-large-language-model+2

Updated 1mo ago