SROOE
Perception-Oriented Single Image Super-Resolution using Optimal Objective Estimation (CVPR 2023) <a href="https://openaccess.thecvf.com/content/CVPR2023/html/Park_Perception-Oriented_Single_Image_Super-Resolution_Using_Optimal_Objective_Estimation_CVPR_2023_paper.html">Link</a>
Seung Ho Park, Young Su Moon, Nam Ik Cho
SROOE Architecture
<p align="center"><img src="figures/network-architecture_v05.png" width="800"></p>SROT (SR model trained with an Objective Trajectory) Training <a href="https://github.com/seungho-snu/SROT">Link</a>
<p align="center"><img src="figures/conditional model training - sort.png" width="800"></p>OOE (Optimal Objective Estimation) Training
<p align="center"><img src="figures/OOE_training.png" width="800"></p>Visual and quantitative comparison.
The proposed SROOE achieves higher PSNR and LR-PSNR and lower LPIPS than other state-of-the-art methods, i.e., lower distortion and higher perceptual quality.
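If you want to reproduce such measurements yourself, the following minimal sketch computes all three metrics. It assumes the lpips PyTorch package and OpenCV, plus the common definition of LR-PSNR as the PSNR between the bicubic-downsampled SR output and the LR input; the file names are placeholders.

```python
import cv2
import numpy as np
import torch
import lpips  # pip install lpips

def psnr(a, b):
    # Peak signal-to-noise ratio for 8-bit images; higher is better.
    mse = np.mean((a.astype(np.float64) - b.astype(np.float64)) ** 2)
    return 10.0 * np.log10(255.0 ** 2 / mse)

hr = cv2.imread("hr.png")  # ground truth
sr = cv2.imread("sr.png")  # super-resolved output, same size as hr
lr = cv2.imread("lr.png")  # LR input (4x smaller)

print("PSNR   :", psnr(sr, hr))
# LR-PSNR: consistency of the SR output with the LR input (assumed definition).
sr_down = cv2.resize(sr, (lr.shape[1], lr.shape[0]), interpolation=cv2.INTER_CUBIC)
print("LR-PSNR:", psnr(sr_down, lr))

# LPIPS expects RGB tensors in [-1, 1], shape (N, 3, H, W); lower is better.
def to_tensor(img):
    rgb = img[:, :, ::-1].copy()  # BGR (OpenCV) -> RGB
    return torch.from_numpy(rgb).permute(2, 0, 1).float()[None] / 127.5 - 1.0

loss_fn = lpips.LPIPS(net="alex")
print("LPIPS  :", loss_fn(to_tensor(sr), to_tensor(hr)).item())
```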
<p align="center"><img src="figures/Fig1.PNG" width="500"></p>Usage:
Environments
- PyTorch 1.10.0
- CUDA 11.3
- Python 3.8
Test
To test the pre-trained ESRGAN-SROT model:
python test.py -opt options/test/test.yml
- Before running the test code, download the pre-trained 4x SR model (SROT) <a href="https://www.dropbox.com/s/v7lx9qoji1ndonx/SR.pth?dl=0">Link</a> and the pre-trained OOE model <a href="https://www.dropbox.com/s/hoykbrpadzozlab/OOE.pth?dl=0">Link</a>.
Training
Before running the training code, you need to prepare the training pairs of LR images and corresponding T_OOS_Maps using the SROT codes <a href="https://github.com/seungho-snu/SROT">Link</a>.
For this, you first need to train the SROT model <a href="https://github.com/seungho-snu/SROT">Link</a>, or you can use the pre-trained SROT model <a href="https://www.dropbox.com/s/v7lx9qoji1ndonx/SR.pth?dl=0">Link</a>.
After finishing the SROT model training,
(1) Modify the test.yml file of the SROT codes as follows:
datasets:
  test_100:
    name: DIV2K_train_HR
    mode: LQ
    dataroot_LQ: path_to_LR\DIV2K_train_LRx4
Then, generate SROT results with t values from 0 to 1 in steps of 0.05 using the SROT codes, as follows (a loop version is sketched after the commands):
python test.py -opt options/test/test.yml -t 0.00
python test.py -opt options/test/test.yml -t 0.05
python test.py -opt options/test/test.yml -t 0.10
...
python test.py -opt options/test/test.yml -t 0.95
python test.py -opt options/test/test.yml -t 1.00
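Rather than typing all 21 commands by hand, you can loop over the t values with a small Python helper like the sketch below (run from the SROT codes directory; it assumes test.py accepts -t exactly as in the commands above):

```python
import subprocess

# Run SROT inference for t = 0.00, 0.05, ..., 1.00 (21 values).
for i in range(21):
    t = f"{i * 0.05:.2f}"
    subprocess.run(
        ["python", "test.py", "-opt", "options/test/test.yml", "-t", t],
        check=True,  # stop on the first failing run
    )
```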
After running the command lines above, you will get the following folder structure:
SROT
├── LPIPS-Map-Gen
├── codes
├── figures
├── pretrained
└── results
├──> ESRGAN-SROT-M1234-v2-4x_t000
└──> DIV2K_train_HR
├──> ESRGAN-SROT-M1234-v2-4x_t005
└──> DIV2K_train_HR
├──> ...
├──> ESRGAN-SROT-M1234-v2-4x_t095
└──> DIV2K_train_HR
└──> ESRGAN-SROT-M1234-v2-4x_t100
└──> DIV2K_train_HR
(2) Generate LPIPS maps for the SROT results with different t values.
To generate LPIPS maps, use the following command line. lpips_measure.py is in the LPIPS-Map-Gen folder.
python lpips_measure.py HR_image_folder_path SR_image_folder_path
For example (a loop version is sketched after these commands):
python lpips_measure.py path_to_GT\DIV2K_train_HR path_to_SROT\SROT-main\results\ESRGAN-SROT-M1234-v2-4x_t000\DIV2K_train_HR
python lpips_measure.py path_to_GT\DIV2K_train_HR path_to_SROT\SROT-main\results\ESRGAN-SROT-M1234-v2-4x_t005\DIV2K_train_HR
...
python lpips_measure.py path_to_GT\DIV2K_train_HR path_to_SROT\SROT-main\results\ESRGAN-SROT-M1234-v2-4x_t095\DIV2K_train_HR
python lpips_measure.py path_to_GT\DIV2K_train_HR path_to_SROT\SROT-main\results\ESRGAN-SROT-M1234-v2-4x_t100\DIV2K_train_HR
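The same loop idea works here; a minimal sketch, using the placeholder paths from the commands above (adjust them to your setup):

```python
import subprocess

GT_DIR = r"path_to_GT\DIV2K_train_HR"
for i in range(0, 101, 5):  # t * 100: 000, 005, ..., 100
    sr_dir = rf"path_to_SROT\SROT-main\results\ESRGAN-SROT-M1234-v2-4x_t{i:03d}\DIV2K_train_HR"
    subprocess.run(["python", "lpips_measure.py", GT_DIR, sr_dir], check=True)
```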
After running the command lines above, you will get the following folder structure:
SROT
├── LPIPS-Map-Gen
├── codes
├── figures
├── pretrained
└── results
├──> ESRGAN-SROT-M1234-v2-4x_t000
├──> DIV2K_train_HR
└──> DIV2K_train_HR_LPIPS
├──> ESRGAN-SROT-M1234-v2-4x_t005
├──> DIV2K_train_HR
└──> DIV2K_train_HR_LPIPS
├──> ...
├──> ESRGAN-SROT-M1234-v2-4x_t095
├──> DIV2K_train_HR
└──> DIV2K_train_HR_LPIPS
└──> ESRGAN-SROT-M1234-v2-4x_t100
├──> DIV2K_train_HR
└──> DIV2K_train_HR_LPIPS
(3) Generate T_OOS_Maps for each image. To generate T_OOS_Maps, use the following command line; generate_T_OOS_Map.py is in the LPIPS-Map-Gen folder. A conceptual sketch of what such a map encodes follows the folder structure below.
python generate_T_OOS_Map.py -gt path_to_GT\DIV2K_train_HR -sr path_to_SROT\SROT-main\results\ESRGAN-SROT-M1234-v2-4x
After running the command line above, you will get the T_OOS_Maps in the T-OOS-MAP folder, and the folder structure will be as follows:
SROT
├── LPIPS-Map-Gen
└──> T-OOS-MAP
└──> ESRGAN-SROT-M1234-v2-4x
├── codes
├── figures
├── pretrained
└── results
├──> ESRGAN-SROT-M1234-v2-4x_t000
├──> DIV2K_train_HR
└──> DIV2K_train_HR_LPIPS
├──> ESRGAN-SROT-M1234-v2-4x_t005
├──> DIV2K_train_HR
└──> DIV2K_train_HR_LPIPS
├──> ...
├──> ESRGAN-SROT-M1234-v2-4x_t095
├──> DIV2K_train_HR
└──> DIV2K_train_HR_LPIPS
└──> ESRGAN-SROT-M1234-v2-4x_t100
├──> DIV2K_train_HR
└──> DIV2K_train_HR_LPIPS
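Conceptually, a T_OOS_Map records, for each spatial location, the t value whose SROT result has the lowest LPIPS error at that location. The sketch below illustrates only this idea; it is not the repository's generate_T_OOS_Map.py, and the .npy file layout is an assumption:

```python
import glob
import numpy as np

# Hypothetical layout: one LPIPS error map (H x W) per t value, saved as
# .npy files and sorted in ascending t order (t000 ... t100).
map_files = sorted(glob.glob("lpips_maps/t*.npy"))
lpips_stack = np.stack([np.load(f) for f in map_files])  # (21, H, W)

t_values = np.arange(len(map_files)) * 0.05              # 0.00, 0.05, ..., 1.00
t_oos_map = t_values[np.argmin(lpips_stack, axis=0)]     # (H, W): best t per pixel
```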
(4) Train an SROOE model using the SROOE codes in this repository:
python train.py -opt options/train/train.yml
- Before running the command line above, set dataroot_T_OOS_map in the yml file to path_to_SROT/LPIPS-Map-Gen/T-OOS-MAP/ESRGAN-SROT-M1234-v2-4x.
Experimental Results
Quantitative Evaluation
<p align="center"><img src="figures/table2.PNG" width="800"></p>Visual Evaluation
Visual comparison with state-of-the-art perception-driven SR methods
<p align="center"><img src="figures/figure-01.PNG" width="800"></p> <p align="center"><img src="figures/figure-02.PNG" width="800"></p> <p align="center"><img src="figures/figure-03.PNG" width="800"></p> <p align="center"><img src="figures/figure-04.PNG" width="800"></p> <p align="center"><img src="figures/figure-05.PNG" width="800"></p> <p align="center"><img src="figures/figure-06.PNG" width="800"></p> <p align="center"><img src="figures/figure-07.PNG" width="800"></p>Citation
@InProceedings{Park_2023_CVPR,
    author    = {Park, Seung Ho and Moon, Young Su and Cho, Nam Ik},
    title     = {Perception-Oriented Single Image Super-Resolution Using Optimal Objective Estimation},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2023},
    pages     = {1725-1735}
}
@misc{https://doi.org/10.48550/arxiv.2211.13676,
    doi       = {10.48550/ARXIV.2211.13676},
    url       = {https://arxiv.org/abs/2211.13676},
    author    = {Park, Seung Ho and Moon, Young Su and Cho, Nam Ik},
    title     = {Perception-Oriented Single Image Super-Resolution using Optimal Objective Estimation},
    publisher = {arXiv},
    year      = {2022},
    copyright = {arXiv.org perpetual, non-exclusive license}
}
Acknowledgement
Our work and implementations are inspired by and based on BasicSR <a href="https://github.com/xinntao/BasicSR">[site]</a>