SPAN
Swift Parameter-free Attention Network for Efficient Super-Resolution
Install / Use
/learn @hongyuanyu/SPANREADME
SPAN - ESR (CVPR-W NTIRE Winner Award @2024)
<div align="center"> <h2><a href="https://arxiv.org/abs/2311.12770">Swift Parameter-free Attention Network for Efficient Super-Resolution</a></h2>Cheng Wan, Hongyuan Yu, Zhiqi Li, Yihang Chen, Yajun Zou, Yuqing Liu, Xuanwu Yin and Kunlong Zuo
</div> <div align="center"> <img src="assets/demo.gif" width="800px" /> <!-- <p>cell.</p> --> </div>🚀 Updates
- [2025.06.06] 🎉 SPAN-F is selected for an Oral Presentation at CVPR 2025, NTIRE Workshop!
- [2025.03.27] 😃 Our lighter version SPAN-F won the 2nd place in CVPR 2025 NTIRE's ESR Challenge, glad to see the top few teams were all inspired by SPAN. -> Report Paper
- [2024.06.11] 🎉 SPAN is selected for an Oral Presentation at CVPR 2024, NTIRE Workshop!
- [2024.04.09] 🎉 SPAN is accepted to CVPR 2024 Workshop!
- [2024.03.21] 🏆 SPAN won the 1st place in CVPR 2024 NTIRE's Efficient Super-Resolution Challenge(ESR) -> Report Paper
- [2023.03.10] ✅ Release checkpoints for our different pretrained models -> Google Drive.
- [2023.11.23] ✅ Upload SPAN codes here.
📖 Introduction:
The official pytorch implementation of the paper Swift Parameter-free Attention Network for Efficient Super-Resolution
<div align="center"> <img src="assets/model.png" width="800px" /> <!-- <p>cell.</p> --> </div>Single Image Super-Resolution (SISR) is a crucial task in low-level computer vision, aiming to reconstruct high-resolution images from low-resolution counterparts. Conventional attention mechanisms have significantly improved SISR performance but often result in complex network structures and large number of parameters, leading to slow inference speed and large model size. To address this issue, we propose the Swift Parameter-free Attention Network (SPAN), a highly efficient SISR model that balances parameter count, inference speed, and image quality. SPAN employs a novel parameter-free attention mechanism, which leverages symmetric activation functions and residual connections to enhance high-contribution information and suppress redundant information. Our theoretical analysis demonstrates the effectiveness of this design in achieving the attention mechanism's purpose. We evaluate SPAN on multiple benchmarks, showing that it outperforms existing efficient super-resolution models in terms of both image quality and inference speed, achieving a significant quality-speed trade-off. This makes SPAN highly suitable for real-world applications, particularly in resource-constrained scenarios. Notably, our model attains the best PSNR of 27.09 dB, and the test runtime of our team is reduced by 7.08ms in the NTIRE 2023 efficient super-resolution challenge.
🔧 Installation:
This implementation based on BasicSR. Please refer to BasicSR for training and testing. You can obtain all the checkpoints and results from [Google Drive].
python 3.9.5
pytorch 1.11.0
cuda 11.3
git clone this repo
cd SPAN
pip install -r requirements.txt
python setup.py develop --no_cuda_ext
📈 Results
<div align="center"> <img src="assets/results.png" width="400px" /> <!-- <p>cell.</p> --> </div> <div align="center"> <img src="assets/vis.png" width="800px" /> <!-- <p>cell.</p> --> </div>📍 Citation
If our work is useful to you, please use the following BibTeX for citation.
@inproceedings{wan2024swift,
title={Swift Parameter-free Attention Network for Efficient Super-Resolution},
author={Wan, Cheng and Yu, Hongyuan and Li, Zhiqi and Chen, Yihang and Zou, Yajun and Liu, Yuqing and Yin, Xuanwu and Zuo, Kunlong},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={6246--6256},
year={2024}
}
📜 License and Acknowledgement
This project is released under the Apache 2.0 license.<br> More details about license and acknowledgement are in LICENSE.
Related Skills
node-connect
341.2kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
84.5kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
341.2kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
commit-push-pr
84.5kCommit, push, and open a PR
