Sst
SST: Single-Stream Temporal Action Proposals (Official Repo)
Install / Use
/learn @shyamal-b/SstREADME
SST: Single-Stream Temporal Action Proposals
Welcome to the official repo for SST: Single-Stream Temporal Action Proposals!
SST is an efficient model for generating temporal action proposals in untrimmed videos. Analogous to object proposals for images, temporal action proposals provide the temporal bounds in videos where potential actions of interest may lie.
<div class="centered"> <a href="http://vision.stanford.edu/pdf/buch2017cvpr.pdf" target="_blank"_> <img src="https://dl.dropboxusercontent.com/s/pv2mrc0ps09zqu3/sst_modelfig.png" width="590" alt="SST model overview" /> </div> <br/> </a>Resources
Quick links: [cvpr paper] [poster] [supplementary] [code]
<!-- [[video](https://drive.google.com/file/d/0B_-dKvCH2VL7dGV1ankxWnJVQmM/view?usp=sharing)] -->Update: if you find this work useful, you may also find our newer work of interest: link to SS-TAD
Please use the following bibtex to cite our work:
@inproceedings{sst_buch_cvpr17,
author = {Shyamal Buch and Victor Escorcia and Chuanqi Shen and Bernard Ghanem and Juan Carlos Niebles},
title = {{SST}: Single-Stream Temporal Action Proposals},
year = {2017},
booktitle = {CVPR}
}
As part of this repo, we also include evaluation notebooks, SST proposals for THUMOS'14, and pre-trained model parameters. Please see the code/ and data/ folders for more.
Dependencies
We include a requirements.txt file that lists all the dependencies you need. Once you have created a virtual environment, simply run pip install -r requirements.txt from within the environment to install all the dependencies. Note that the original code was executed using Python 2.7.
Related Skills
node-connect
342.5kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
85.3kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
342.5kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
342.5kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
