Results for "videoqa"

Claude Code Claude Desktop GitHub Copilot Cursor Windsurf Cline Zed JetBrains

📄SKILL.md 🤖CLAUDE.md ⚡Claude Commands 📐.cursorrules 📐Cursor Rules 🕹️AGENTS.md 🧬codex.md 🏄.windsurfrules 🔧.clinerules 🧑‍✈️Copilot Instructions

All Development Operations Data Product Marketing Customer Design Sales

45 skills found · Page 1 of 2

doc-doc / NExT QA

189

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)

universal

causal-temporal-action-reasoningmulti-object-interactionvideo-question-answering+3

Updated 4d ago

jayleicn / TVQA

182

[EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering

zed

datasetpytorchtvqa+1

Updated 27d ago

antoyang / FrozenBiLM

158

[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models

universal

large-language-modelsmultimodal-learningpre-training+7

Updated 2mo ago

thaolmk54 / Hcrn Videoqa

135

Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)

universal

question-answeringtgif-qavideoqa+1

Updated 18d ago

antoyang / Just Ask

127

[ICCV 2021 Oral + TPAMI] Just Ask: Learning to Answer Questions from Millions of Narrated Videos

universal

multimodal-learningpre-trainingquestion-generation+7

Updated 1mo ago

VRU-NExT / VideoQA

103

No description available

universal

Updated 9d ago

MILVLG / Activitynet Qa

An VideoQA dataset based on the videos from ActivityNet

universal

activitynetdatasetvideoqa+1

Updated 1mo ago

doc-doc / NExT GQA

Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)

universal

trustworthy-vqavideo-groundingvideo-language-understanding+3

Updated 17d ago

fanchenyou / HME VideoQA

Heterogeneous Memory Enhanced Multimodal Attention Model for VideoQA

universal

Updated 2y ago

sail-sg / VGT

Video Graph Transformer for Video Question Answering (ECCV'22)

universal

graph-transformertemporal-dynamicsvideo-language-understanding+2

Updated 4mo ago

zhousheng97 / EgoTextVQA

[CVPR'25] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering

universal

egocentric-qa-assistancemllm-evaluationscene-text-videoqa+2

Updated 1mo ago

doc-doc / HQGA

Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)

universal

conditional-graph-hierarchyvideo-question-answeringvideoqa+1

Updated 8d ago

hyounghk / VideoQADenseCapFrameGate ACL2020

Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng Tang, Mohit Bansal.

universal

Updated 2y ago

qirui-chen / MultiHop EgoQA

[AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos

universal

Updated 8d ago

TraffiX-VideoQA / TraffiX Qwen

[ICML 2025] Official implementation of TraffiX-Qwen model introduced in TUMTraf VideoQA benchmark for roadside traffic video understanding.

universal

Updated 8d ago

yandex-datasphere / VideoQABot

Мастер-класс по созданию диалогового чат-бота на основе Retrieval-Augmented Generation

universal

Updated 5mo ago

doc-doc / NExT OE

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)

universal

causal-temporal-action-reasoningmulti-object-interactionvideo-comprehension+2

Updated 4mo ago

yl3800 / IGV

This repo contains code for Invariant Grounding for Video Question Answering

universal

cvpr-2022cvpr-oral-2022generalization+5

Updated 9mo ago

ZJULearning / Videoqa

Unifying the Video and Question Attentions for Open-Ended Video Question Answering

universal

video-qa

Updated 4mo ago

noagarcia / ROLL VideoQA

PyTorch code for ROLL, a knowledge-based video story question answering model.

universal

knowledge-based-reasoningvideo-question-answeringvideo-understanding+1

Updated 11mo ago