UCAN

[CVPR 2026] Unified Convolutional Attention Network for Expansive Receptive Fields in Lightweight Super-Resolution

Generate Convert Improve

Install / Use

/learn @hokiyoshi/UCAN

About this skill

Quality Score

0/100

README

<h1 align="center"> <img width="200" src="logo.png"> UCAN: Unified Convolutional Attention Network for Expansive Receptive Fields in Lightweight Super-Resolution</h1> Cao Thien Tan1,2,3 · Phan Thi Thu Trang 3,5 · Do Nghiem Duc</a>6 · Ho Ngoc Anh</a>5 · Hanyang Zhuang</a>4 . Nguyen Duc Dung</a>2 1Ho Chi Minh City Open University     2AI Tech Lab, Ho Chi Minh City University of Technology 3Code Mely AI Research Team     4Global College, Shanghai Jiao Tong University 5Ha Noi University of Science and Technology     6University of Manitoba <h3 align="center"> CVPR 2026 - Poster </h3> <h3 align="center"><a href="https://arxiv.org/abs/2603.11680">[Paper]</a></h3>

Abstract: Hybrid CNN-Transformer architectures achieve strong results in image super-resolution, but scaling attention windows or convolution kernels significantly increases computational cost, limiting deployment on resource-constrained devices. We present UCAN, a lightweight network that unifies convolution and attention to expand the effective receptive field efficiently. UCAN combines window-based spatial attention with a Hedgehog Attention mechanism to model both local texture and long-range dependencies, and introduces a distillation-based large-kernel module to preserve high-frequency structure without heavy computation. In addition, we employ cross-layer parameter sharing to further reduce complexity. On Manga109 (4×), UCAN-L achieves 31.63 dB PSNR with only 48.4G MACs, surpassing recent lightweight models. On BSDS100, UCAN attains 27.79 dB, outperforming methods with significantly larger models. Extensive experiments show that UCAN achieves a superior trade-off between accuracy, efficiency, and scalability, making it well-suited for practical high-resolution image restoration.

Paper will be updated soon.

🔥 News

2026-02: 🎉UCAN is accepted by CVPR 2026! This repo is released.

🛠️ Setup

🗃️ Datasets

The datasets used in our training and testing are orgnized as follows:

Training and testing sets can be downloaded as follows:

| Training Set | Testing Set | Visual Results | | :-----------------------------------------------------------: | :----------------------------------------------------------: | :----------------------------------------------------------: | | DIV2K (800 training images, 100 validation images) [organized training dataset DIV2K: One Drive] | Set5 + Set14 + BSD100 + Urban100 + Manga109 [complete testing dataset: One Drive] | Updating |

💡 Models

🚀 Training

🧪 Testing

📚 Results

🙇 Citation

🥂 Acknowledgements

This work is based on BasicSR, HiT-SR, ESC and MambaIRv2. We thank them for their great work and for sharing the code.

Related Skills

node-connect

352.2k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

111.1k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

352.2k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

352.2k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。