VICT

[CVPR 2025] Test-Time Visual In-Context Tuning


<div align="center"> <h1>Test-Time Visual In-Context Tuning</h1> <div> <a href='https://jiahao000.github.io/' target='_blank'>Jiahao Xie</a><sup>1,2</sup>&emsp; <a href='https://alessiotonioni.github.io/' target='_blank'>Alessio Tonioni</a><sup>3</sup>&emsp; <a href='https://scholar.google.com/citations?user=OglqhoUAAAAJ&hl=en' target='_blank'>Nathalie Rauschmayr</a><sup>3</sup>&emsp; <a href='https://federicotombari.github.io/' target='_blank'>Federico Tombari</a><sup>3</sup>&emsp; <a href='https://scholar.google.com/citations?user=z76PBfYAAAAJ&hl=en' target='_blank'>Bernt Schiele</a><sup>1,2</sup> </div> <div> <sup>1</sup>Max Planck Institute for Informatics&emsp; <sup>2</sup>VIA Research Center&emsp; <sup>3</sup>Google </div> <div> <strong>CVPR 2025</strong> </div> <div> <h4 align="center"> <a href="https://arxiv.org/abs/2503.21777" target='_blank'> <img src="https://img.shields.io/badge/arXiv-2503.21777-b31b1b.svg"> </a> <a href="https://github.com/Jiahao000/VICT" target='_blank'> <img src="https://img.shields.io/badge/Project-Page-green"> </a> <a href="https://github.com/Jiahao000/VICT#-citation" target='_blank'> <img src="https://img.shields.io/badge/Cite-BibTeX-blue"> </a> </h4> </div>

<strong>We present VICT, a test-time visual in-context tuning method that can adapt visual in-context learning models on the fly with a single test sample. VICT can be applied to a wide range of unseen domains and tasks at test time.</strong>

:open_book: For more results, please refer to our <a href="https://arxiv.org/abs/2503.21777" target="_blank">paper</a>.

</div>

📣 News

[03/2025] 🔥 VICT is released on arXiv.

🌟 Method

VICT is a simple yet effective test-time training approach to adapt visual in-context learning (VICL) models on the fly. The motivation is that each test input offers a hint about the test distribution. Thus, we modify a VICL model at test time to make full use of this hint by setting up a <i>one-sample learning problem</i>.

Specifically, we swap the roles of the task prompts and the test sample, and use a cycle-consistency self-supervised loss to reconstruct the original task prompt output. Our key insight is that a model should be aware of a new test distribution if it can successfully recover the original task prompts.
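The role swap and cycle-consistency loss described above can be illustrated with a toy example. This is a minimal sketch under strong assumptions, not the released VICT code: the "model" is a hypothetical one-parameter function `predict`, the names `cycle_loss` and `tune_on_single_sample` are made up for illustration, gradients are taken by finite differences, and the gradient is assumed to flow through both passes.

```python
# Toy sketch only: a one-parameter stand-in for a VICL model.
# All names here are hypothetical and do not come from the VICT repository.

def predict(w, prompt_in, prompt_out, query_in):
    """Toy in-context model: shift the query by w times the prompt's input-to-output change."""
    return [q + w * (po - pi) for q, pi, po in zip(query_in, prompt_in, prompt_out)]


def cycle_loss(w, prompt_in, prompt_out, test_in):
    """Self-supervised loss: no ground truth for the test sample is used."""
    # Forward pass: the task prompt conditions the prediction for the test input.
    test_pred = predict(w, prompt_in, prompt_out, test_in)
    # Flipped pass: (test input, predicted output) now acts as the prompt,
    # and the model has to reconstruct the original task prompt output.
    recon = predict(w, test_in, test_pred, prompt_in)
    return sum((r - po) ** 2 for r, po in zip(recon, prompt_out)) / len(prompt_out)


def tune_on_single_sample(w, prompt_in, prompt_out, test_in, lr=0.05, steps=200, eps=1e-5):
    """Gradient descent on the cycle loss, using a central finite-difference gradient."""
    for _ in range(steps):
        grad = (
            cycle_loss(w + eps, prompt_in, prompt_out, test_in)
            - cycle_loss(w - eps, prompt_in, prompt_out, test_in)
        ) / (2 * eps)
        w -= lr * grad
    return w


# One task prompt pair (the task is "add 1") and a single test input.
prompt_in = [0.0, 1.0, 2.0]
prompt_out = [1.0, 2.0, 3.0]
test_in = [5.0, -1.0, 0.5]

w_init = 0.5  # a mis-calibrated model that under-applies the task
w_tuned = tune_on_single_sample(w_init, prompt_in, prompt_out, test_in)

loss_before = cycle_loss(w_init, prompt_in, prompt_out, test_in)
loss_after = cycle_loss(w_tuned, prompt_in, prompt_out, test_in)
prediction = predict(w_tuned, prompt_in, prompt_out, test_in)
print(w_tuned, loss_before, loss_after, prediction)
```

In this toy setting the cycle loss reaches zero only when the model applies the prompt's transformation consistently in both directions, which is why tuning on a single unlabeled test sample pulls `w` toward the value that solves the task. A real VICL model would replace `predict` with a network and the finite-difference step with backpropagation.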

🤗 Qualitative Examples

Unseen Domains

Middle-/High-Level Tasks with Corruptions

Low-Level Tasks with Corruptions

Unseen Tasks

🛠️ Usage

👨‍💻 Todo

[x] Release the arXiv version.
[x] Release the code.

📘 Citation

If you find this work useful for your research, please consider citing our paper:

@inproceedings{xie2025test,
  title = {Test-Time Visual In-Context Tuning},
  author = {Xie, Jiahao and Tonioni, Alessio and Rauschmayr, Nathalie and Tombari, Federico and Schiele, Bernt},
  booktitle = {CVPR},
  year = {2025}
}

❤️ Acknowledgement

We acknowledge the use of the following public code in this project: Painter, MAE, BEiT, detectron2, Mask2Former, bts, mmcv, mmdetection, mmpose, MIRNet, MPRNet, and Uformer.
