TSAM

[CVPR2021] Official Implementation for Progressive Temporal Feature Alignment Network for Video Inpainting

Progressive Temporal Feature Alignment Network for Video Inpainting

This work was accepted to CVPR 2021 as a poster. It proposes a new video inpainting approach that combines temporal convolution with optical flow.

Note: This code is currently a beta version and is not guaranteed to be fully correct.

Update

Optical Flow DAVIS | Optical Flow FVI | Mask DAVIS | Mask FVI | Checkpoint

Installation

torch==1.7.0
torchvision==0.8.1
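
One way to confirm that an environment matches these pins is a small check built on the standard library's `importlib.metadata`. This is only a sketch and not part of the repository; the function name `check_pins` is hypothetical, and treating a local build tag such as `+cu110` as a match is an assumption.

```python
from importlib import metadata

# Version pins copied from the list above.
PINS = {"torch": "1.7.0", "torchvision": "0.8.1"}


def check_pins(pins=PINS):
    """Return {package: (wanted, installed_or_None, matches)} for each pin."""
    report = {}
    for name, wanted in pins.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None
        # Assumption: a local build tag such as "1.7.0+cu110" still counts as a match.
        matches = installed is not None and installed.split("+")[0] == wanted
        report[name] = (wanted, installed, matches)
    return report


if __name__ == "__main__":
    for name, (wanted, installed, ok) in check_pins().items():
        status = "OK" if ok else "MISMATCH"
        print(f"{name}: wanted {wanted}, found {installed}, {status}")
```

A package that is not installed is reported with `None` as its installed version rather than raising an error.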

Dataset

For FVI dataset, please refer to https://github.com/amjltc295/Free-Form-Video-Inpainting. For DAVIS dataset, please refer to https://davischallenge.org/.

File Structure

TSAM
├── data
│   ├── checkpoints
│   ├── model_weights
│   ├── results
│   ├── FVI
│   ├── DAVIS
│   └── runs
└── code
    └── master
        └── TSAM
            └── ...
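
The empty directory skeleton from the tree above could be created with a few lines of `pathlib`. This is a minimal sketch; `make_layout` is a hypothetical helper, not a script shipped with the repository, and it only creates folders (the datasets and the code still have to be placed there by hand).

```python
from pathlib import Path

# Sub-directories of TSAM/data, as listed in the tree above.
DATA_SUBDIRS = ["checkpoints", "model_weights", "results", "FVI", "DAVIS", "runs"]


def make_layout(root):
    """Create the data/ and code/ skeleton under `root` and return the paths."""
    root = Path(root)
    created = [root / "data" / name for name in DATA_SUBDIRS]
    # The tree places the repository checkout under code/master/TSAM.
    created.append(root / "code" / "master" / "TSAM")
    for path in created:
        # exist_ok=True makes the helper safe to call more than once.
        path.mkdir(parents=True, exist_ok=True)
    return created
```

Calling `make_layout("TSAM")` from the intended parent directory would create the seven folders and leave any existing content untouched.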

Prepare pretrained weights for training

Pretrained weights: download all the pretrained weights and put them under TSAM/data/model_weights.

| Model Name | |
|----------------------------------|------------|
| TSM_imagenet_resent50_gated.pth | weight |
| TSM_imagenet_resent50.pth | weight |
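
Before starting a training run, it may help to confirm that both files are in place. The sketch below assumes the file names exactly as listed above; `missing_weights` is a hypothetical helper and only checks for existence, not for file integrity.

```python
from pathlib import Path

# File names copied verbatim from the table above.
WEIGHT_FILES = ["TSM_imagenet_resent50_gated.pth", "TSM_imagenet_resent50.pth"]


def missing_weights(weights_dir):
    """Return the expected weight files that are not present in `weights_dir`."""
    weights_dir = Path(weights_dir)
    return [name for name in WEIGHT_FILES if not (weights_dir / name).is_file()]
```

`missing_weights("TSAM/data/model_weights")` returns an empty list once both downloads are in the folder.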

Training

FVI TSM moving object/curve masks:

CUDA_VISIBLE_DEVICES=0,1,2,3 python3 train.py --config config/config_pretrain.json --dataset_config dataset_configs/FVI_all_masks.json
CUDA_VISIBLE_DEVICES=0,1,2,3 python3 train.py --config config/config_finetune.json --dataset_config dataset_configs/FVI_all_masks.json
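
The two commands above can also be driven from a small Python wrapper, which is convenient when they are run back to back. This is a sketch under assumptions: the helper names `train_command` and `run_stages` are hypothetical, and running the finetune config after the pretrain config is inferred from the config file names rather than stated in this README.

```python
import os
import subprocess

DATASET_CONFIG = "dataset_configs/FVI_all_masks.json"
# Config files in the order the commands above list them.
STAGE_CONFIGS = ["config/config_pretrain.json", "config/config_finetune.json"]


def train_command(config, dataset_config=DATASET_CONFIG):
    """Build the argument list for one train.py invocation."""
    return ["python3", "train.py", "--config", config,
            "--dataset_config", dataset_config]


def run_stages(gpus="0,1,2,3", dry_run=True):
    """Return the training commands; execute them in order unless dry_run."""
    # Same effect as prefixing the shell command with CUDA_VISIBLE_DEVICES=...
    env = dict(os.environ, CUDA_VISIBLE_DEVICES=gpus)
    commands = [train_command(cfg) for cfg in STAGE_CONFIGS]
    if not dry_run:
        for cmd in commands:
            # check=True stops the sequence if a stage exits with an error.
            subprocess.run(cmd, env=env, check=True)
    return commands
```

With the default `dry_run=True` nothing is executed, so the returned lists can be inspected first; `run_stages(dry_run=False)` is meant to be called from the directory that contains `train.py`.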

Testing

To test, replace train.py with test.py in the training commands and append -p /pth/to/ckpt, where /pth/to/ckpt is the path to the checkpoint.

DAVIS TSAM object removal:

CUDA_VISIBLE_DEVICES=0 python3 test.py --config config/config_finetune_davis.json --dataset_config dataset_configs/DAVIS_removal.json -p /pth/to/ckpt
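
The rewrite rule described in this section (swap `train.py` for `test.py`, then append `-p` and a checkpoint path) can be expressed as a short string transformation. This is an illustrative sketch; `to_test_command` is a hypothetical helper, and `/pth/to/ckpt` remains a placeholder for a real checkpoint path.

```python
import shlex


def to_test_command(train_cmd, checkpoint):
    """Turn a training command line into the matching test command line."""
    tokens = shlex.split(train_cmd)
    # Swap the script name; environment assignments and flags stay untouched.
    tokens = ["test.py" if tok == "train.py" else tok for tok in tokens]
    tokens += ["-p", checkpoint]
    # Re-quote each token so paths with spaces survive a copy-paste into a shell.
    return " ".join(shlex.quote(tok) for tok in tokens)


if __name__ == "__main__":
    train = ("CUDA_VISIBLE_DEVICES=0 python3 train.py "
             "--config config/config_finetune_davis.json "
             "--dataset_config dataset_configs/DAVIS_removal.json")
    print(to_test_command(train, "/pth/to/ckpt"))
```

Applied to the training form of the DAVIS command, this reproduces the test command shown above.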

Citation

@inproceedings{zou2020progressive,
  title={Progressive Temporal Feature Alignment Network for Video Inpainting},
  author={Xueyan Zou and Linjie Yang and Ding Liu and Yong Jae Lee},
  booktitle={CVPR},
  year={2021}
}

Acknowledgement

Part of the code is borrowed from https://github.com/amjltc295/Free-Form-Video-Inpainting and https://github.com/researchmm/STTN. Thanks for their great work!
