ScanGAN360
Code and model for the paper "ScanGAN360: A Generative Model of Realistic Scanpaths for 360º Images"
Install / Use
/learn @DaniMS-ZGZ/ScanGAN360README
ScanGAN360
Code and model for the paper "ScanGAN360: A Generative Model of Realistic Scanpaths for 360º Images".

Requirements
This work was developed using:
* python 3.7.4
* pytorch 1.2.0
* cudatoolkit 10.0.30
* opencv 4.1.2
You can install an environment with all required dependencies using scangan360.yml file in Anaconda.
Inference
The current version of the repository includes a basic, yet functional version to generate scanpaths from a 360º image using the ScanGAN360 model.
Usage
python main.py --mode inference
This will read an image image_path = "data/test.jpg" and generate a set of scanpaths that will be saved in path_to_save = "test/". You can modify both those paths, and the number of generated scanpaths n_generated. Each of the images will contain 25 different scanpaths.
Training the model
Training is now available. [Updated June 15th]
python main.py --mode train
Make sure you have correctly updated utils.py, including all the directories required. Also, check the data folder to download the required images and processed gaze data.
Checkpoints and models are saved periodically in the assigned folder.
Related Skills
node-connect
345.9kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
106.4kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
345.9kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
345.9kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
