11 skills found
v-iashin / BMTSource code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
ttengwang / PDVCEnd-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)
jayleicn / Recurrent Transformer[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
v-iashin / MDVCPyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)
jssprz / Video Captioning DatasetsSummary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
WuJie1010 / Awesome Temporally Language GroundingA curated list of “Temporally Language Grounding” and related area
WuJie1010 / Temporally Language GroundingA Pytorch implemention for some state-of-the-art models for" Temporally Language Grounding in Untrimmed Videos"
ttengwang / Dense Video Captioning PytorchSecond-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)
jssprz / Video Features ExtractorPython implementation of extraction of several visual features representations from videos
skasai5296 / ActnetchallengeRepository for the International Challenge on Activity Recognition (ActivityNet) Dense Captioning
ksharsha / ActivityNetCaptionsCaption generation for videos