READMem
[BMVC 2023] READMem: Robust Embedding Association for a Diverse Memory in Unconstrained Video Object Segmentation
READMem: Robust Embedding Association for a Diverse Memory in Unconstrained Video Object Segmentation
by Stéphane Vujasinović, Sebastian Bullinger, Stefan Becker, Norbert Scherer-Negenborn, Michael Arens and Rainer Stiefelhagen
TL;DR: We manage the memory of STM-like sVOS methods to better handle long videos. To attain long-term performance, we estimate the inter-frame diversity of the base memory and integrate the embeddings of an incoming frame only if they increase that diversity. This lets us limit the number of memory slots and handle unconstrained video sequences without hindering performance on short sequences, and it alleviates the need for a sampling interval.
[arXiv] - [BMVC Proceeding Paper]/[SUPP.] - [Video] - [Poster] - [BMVC Page]
<p align="center"> <img src="./docs/img/Qualitative_Results.png" width="95%"> </p>

News:
- Our poster was among the honorable mentions! 😄
📊 Some Quantitative Results
The following plots illustrate performance variations among sVOS baselines with and without our READMem extension on the LV1 dataset. The first plot showcases changes when varying the sampling interval $s_r$, while the second depicts variations when adjusting the memory size $N$.
<p align="center"> <img src="./docs/img/Quantitative_Results_LV1.png" width="95%"> </p>
<p align="center"> <img src="./docs/img/Quantitative_Results_LV1_bis.png" width="95%"> </p>

Check out our paper and supplementary material for more qualitative and quantitative results!
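To make the idea from the TL;DR concrete, here is a minimal sketch of a diversity-gated memory update with a fixed number of slots $N$. It is not the code from this repository. The diversity score (determinant of a cosine-similarity Gram matrix), the slot-replacement strategy, and all function names (`diversity`, `update_memory`, ...) are assumptions made for illustration; refer to the paper for the actual formulation.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def determinant(matrix):
    """Determinant via Gaussian elimination with partial pivoting."""
    a = [row[:] for row in matrix]  # work on a copy
    n = len(a)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            return 0.0  # (numerically) singular matrix
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det  # a row swap flips the sign
        det *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return det


def diversity(memory):
    """ASSUMED score: determinant of the pairwise-similarity (Gram) matrix.

    Near-duplicate embeddings push the determinant towards 0,
    while mutually dissimilar embeddings push it towards 1.
    """
    gram = [[cosine(u, v) for v in memory] for u in memory]
    return determinant(gram)


def update_memory(memory, candidate, max_slots):
    """Return the memory after considering `candidate` (hypothetical helper).

    While slots are free, the candidate is appended. Once the memory is
    full, the candidate replaces the slot that yields the largest
    diversity gain, or is discarded if no replacement increases diversity.
    """
    if len(memory) < max_slots:
        return memory + [candidate]
    best, best_score = memory, diversity(memory)
    for i in range(len(memory)):
        trial = memory[:i] + [candidate] + memory[i + 1:]
        score = diversity(trial)
        if score > best_score:
            best, best_score = trial, score
    return best
```

In this sketch every incoming frame can be considered without growing the memory, because redundant embeddings are rejected by the diversity check instead of being skipped by a fixed sampling interval. Real embeddings would be feature maps rather than short lists, so the similarity computation would need to be adapted accordingly.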
:books: Getting Started
The documentation is split into the following separate markdown files:
:blue_book: Installation
:closed_book: Inference
:green_book: Evaluation
:black_nib: Citation
@inproceedings{Vujasinovic_2023_BMVC,
author = {Stephane Vujasinovic and Sebastian Bullinger and Stefan Becker and Norbert Scherer-Negenborn and Michael Arens and Rainer Stiefelhagen},
title = {READMem: Robust Embedding Association for a Diverse Memory in Unconstrained Video Object Segmentation},
booktitle = {34th British Machine Vision Conference 2023, {BMVC} 2023, Aberdeen, UK, November 20-24, 2023},
publisher = {BMVA},
year = {2023},
url = {https://papers.bmvc2023.org/0603.pdf}
}
:+1: Credits (Big thanks to those projects):
