ScanGAN360

Code and model for the paper "ScanGAN360: A Generative Model of Realistic Scanpaths for 360º Images"

Generate Convert Improve

Install / Use

/learn @DaniMS-ZGZ/ScanGAN360

About this skill

Quality Score

0/100

README

ScanGAN360

Code and model for the paper "ScanGAN360: A Generative Model of Realistic Scanpaths for 360º Images".

Teaser

Requirements

This work was developed using:

* python 3.7.4
* pytorch 1.2.0
* cudatoolkit 10.0.30
* opencv 4.1.2

You can install an environment with all required dependencies using scangan360.yml file in Anaconda.

Inference

The current version of the repository includes a basic, yet functional version to generate scanpaths from a 360º image using the ScanGAN360 model.

Usage

python main.py --mode inference

This will read an image image_path = "data/test.jpg" and generate a set of scanpaths that will be saved in path_to_save = "test/". You can modify both those paths, and the number of generated scanpaths n_generated. Each of the images will contain 25 different scanpaths.

Training the model

Training is now available. [Updated June 15th]

python main.py --mode train

Make sure you have correctly updated utils.py, including all the directories required. Also, check the data folder to download the required images and processed gaze data.

Checkpoints and models are saved periodically in the assigned folder.

Related Skills

node-connect

345.9k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

106.4k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

345.9k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

345.9k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。