STFT
Spatial-Temporal Feature Transformation for Video Object Detection, MICCAI2021
Install / Use
/learn @lingyunwu14/STFTREADME
STFT: Spatial-Temporal Feature Transformation
By Lingyun Wu, Zhiqiang Hu, Yuanfeng Ji, Ping Luo, Shaoting Zhang.
This repo is an PyTorch implementation of "Multi-frame Collaboration for Effective Endoscopic Video Polyp Detection via Spatial-Temporal Feature Transformation", accepted by MICCAI 2021.

This repository contains the implementation of our approach STFT and several other video object detection algorithms like FGFA, RDN, and MEGA based on mega.pytorch, as well as training and testing scripts to reproduce the results on Endoscopic Video Datasets reported in our paper.
News
- [2021/11/12] For the implementation on the ImageNet VID dataset, please refer to here.
- [2021/09/21] Implementation for other video-based methods on Endoscopic Video Datasets released.
- [2021/09/21] Release training/testing scripts and the pretrained model for STFT.
- [2021/06/29] Create repository.
Model Zoo
Supported backbones:
- [x] ResNet
Supported image-based methods:
Supported video-based methods:
Installation
Please follow INSTALL.md for installation instructions.
Usage
Please follow GetStarted.md for usage instructions.
Citing STFT
Any new methods are welcomed. We also hope this repository would help further research in the field of video object detection and beyond. Please cite our paper in your publications if it helps your research:
@inproceedings{wu2021multi,
title={Multi-frame collaboration for effective endoscopic video polyp detection via spatial-temporal feature transformation},
author={Wu, Lingyun and Hu, Zhiqiang and Ji, Yuanfeng and Luo, Ping and Zhang, Shaoting},
booktitle={International Conference on Medical Image Computing and Computer-Assisted Intervention},
pages={302--312},
year={2021},
organization={Springer}
}
Contributing to the project
Any pull requests or issues are welcomed.
Related Skills
qqbot-channel
348.5kQQ 频道管理技能。查询频道列表、子频道、成员、发帖、公告、日程等操作。使用 qqbot_channel_api 工具代理 QQ 开放平台 HTTP 接口,自动处理 Token 鉴权。当用户需要查看频道、管理子频道、查询成员、发布帖子/公告/日程时使用。
docs-writer
100.3k`docs-writer` skill instructions As an expert technical writer and editor for the Gemini CLI project, you produce accurate, clear, and consistent documentation. When asked to write, edit, or revie
model-usage
348.5kUse CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.
Design
Campus Second-Hand Trading Platform \- General Design Document (v5.0 \- React Architecture \- Complete Final Version)1\. System Overall Design 1.1. Project Overview This project aims t
