8 skills found
ttengwang / PDVCEnd-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)
jayleicn / Recurrent Transformer[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
MikeWangWZHL / VidILPytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
MichiganCOG / Video Grounding From TextSource code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"
LuoweiZhou / YouCook2 LeaderboardA one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.
LuoweiZhou / ProcNets YouCook2Source code for paper "Towards Automatic Learning of Procedures from Web Instructional Videos"
awkrail / SvpcOfficial implementation of state-aware video procedural captioning (ACM MM 2021)
Maddy12 / VideoLanguageModelRobustnessA large scale robustness analysis for video and text, multimodal models on the YouCook2 and MSRVTT datasets.