S2p

"S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning" (NeurIPS 2022)

Generate Convert Improve

Install / Use

/learn @dsshim0125/S2p

About this skill

Quality Score

0/100

README

S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning

This repo provides an official PyTorch implementation of "S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning" (NeurIPS 2022). [paper]

Setup

conda create -n s2p python=3.8.5
conda activate s2p
conda install pytorch torchvision cudatoolkit=11.3 -c pytorch
pip install -r requirements.txt

Our experiments have been done with PyTorch 1.10.1, CUDA 11.4, Python 3.8.5 and Ubuntu 18.04. We use a single NVIDIA RTX A6000 for training, but you can still run our code with GPUs which have smaller memory by reducing the batchSize. A simpel visualziation of the generation results can be done by GPUs with 4GB of memory use.

Download pre-trained models

We provide pre-trained weights of S2P in some environments for simple test of the generation performance. Create a folder ./checkpoints and download the model weights into it. Here are model weights of S2P trained on cheetah and walker environment of DeepMind Controp Suite.

| Env_type | model | |----------|:--:| |cheetah|cheetah_30.pth| |walker|walker_30.pth|

Simple generation

We provide pre-trained models of S2P and some tiny dataset for simple visualization of S2P. Reviewers can easily visualize N-step generation results with --seq_len.

python simple_test.py --env_type=cheetah --dataroot=./datasets --netG=s2p --start_idx=0 --seq_len=5 --gpu_ids=0

Reference

https://github.com/NVlabs/SPADE
https://github.com/yenchenlin/nerf-pytorch
https://github.com/huangzh13/StyleGAN.pytorch

Related Skills

qqbot-channel

343.1k

QQ 频道管理技能。查询频道列表、子频道、成员、发帖、公告、日程等操作。使用 qqbot_channel_api 工具代理 QQ 开放平台 HTTP 接口，自动处理 Token 鉴权。当用户需要查看频道、管理子频道、查询成员、发布帖子/公告/日程时使用。

docs-writer

99.7k

`docs-writer` skill instructions As an expert technical writer and editor for the Gemini CLI project, you produce accurate, clear, and consistent documentation. When asked to write, edit, or revie

model-usage

343.1k

Use CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.

ddd

Guía de Principios DDD para el Proyecto > 📚 Documento Complementario : Este documento define los principios y reglas de DDD. Para ver templates de código, ejemplos detallados y guías paso

dsshim0125

View profile

View on GitHub

GitHub Stars4

CategoryContent

Updated9mo ago

Forks1

dsshim0125/s2p

Languages

Python

Security Score

82/100

Audited on Jun 9, 2025

No findings