UCAN
[CVPR 2026] Unified Convolutional Attention Network for Expansive Receptive Fields in Lightweight Super-Resolution
Install / Use
/learn @hokiyoshi/UCANREADME
Abstract: Hybrid CNN-Transformer architectures achieve strong results in image super-resolution, but scaling attention windows or convolution kernels significantly increases computational cost, limiting deployment on resource-constrained devices. We present UCAN, a lightweight network that unifies convolution and attention to expand the effective receptive field efficiently. UCAN combines window-based spatial attention with a Hedgehog Attention mechanism to model both local texture and long-range dependencies, and introduces a distillation-based large-kernel module to preserve high-frequency structure without heavy computation. In addition, we employ cross-layer parameter sharing to further reduce complexity. On Manga109 (4×), UCAN-L achieves 31.63 dB PSNR with only 48.4G MACs, surpassing recent lightweight models. On BSDS100, UCAN attains 27.79 dB, outperforming methods with significantly larger models. Extensive experiments show that UCAN achieves a superior trade-off between accuracy, efficiency, and scalability, making it well-suited for practical high-resolution image restoration.
Paper will be updated soon.
🔥 News
- 2026-02: 🎉UCAN is accepted by CVPR 2026! This repo is released.
🛠️ Setup
🗃️ Datasets
The datasets used in our training and testing are orgnized as follows:
Training and testing sets can be downloaded as follows:
| Training Set | Testing Set | Visual Results | | :-----------------------------------------------------------: | :----------------------------------------------------------: | :----------------------------------------------------------: | | DIV2K (800 training images, 100 validation images) [organized training dataset DIV2K: One Drive] | Set5 + Set14 + BSD100 + Urban100 + Manga109 [complete testing dataset: One Drive] | Updating |
💡 Models
🚀 Training
🧪 Testing
📚 Results
🙇 Citation
🥂 Acknowledgements
This work is based on BasicSR, HiT-SR, ESC and MambaIRv2. We thank them for their great work and for sharing the code.
Related Skills
node-connect
352.2kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
111.1kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
352.2kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
352.2kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
Security Score
Audited on Apr 7, 2026
