PLAS

Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]

Generate Convert Improve

Install / Use

/learn @Wenxuan-Zhou/PLAS

About this skill

Quality Score

0/100

README

PLAS: Latent Action Space for Offline Reinforcement Learning

This is the repository for our paper "PLAS: Latent Action Space for Offline Reinforcement Learning" in CoRL 2020. Please visit our website for more information.

This repository is built on top of BCQ. The logger is from BEAR.

Requirements

Python 3.7.4
PyTorch (v1.2.0)
mujoco_py (v2.0)
gym (v0.13)
d4rl (commit: 87d13f1)

You may install the above packages following the instructions in their repositories, or run the following command:

pip3 install -r requirements.txt

Note that the latest d4rl repository has some problem loading the mujoco dataset. We recommend the users to install this commit version.

Instructions

To train the Latent Policy for the d4rl datasets:

python main.py --env_name walker2d-medium-expert-v0 --algo_name Latent --max_latent_action 2

To train the Latent Policy with the perturbation layer:

python main.py --env_name walker2d-medium-expert-v0 --algo_name LatentPerturbation --max_latent_action 2 --phi 0.05

By default, the algorithm trains a VAE before the policy to model the behavior policy of the dataset. You may also load a pre-trained vae and then train policy.

python main.py --env_name walker2d-medium-expert-v0 --algo_name Latent --vae_mode v6

This command will load the vae models under the "models/vae_v6" folder according to the name of the dataset and the random seed automatically.

The results will be saved under the "results" folder. You may use viskit to visualize the curves.

Citation

@inproceedings{PLAS_corl2020,
 title={PLAS: Latent Action Space for Offline Reinforcement Learning},
 author={Zhou, Wenxuan and Bajracharya, Sujay and Held, David},
 booktitle={Conference on Robot Learning},
 year={2020}
}

Related Skills

YC-Killer

2.7k

A library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.

flutter-tutor

Flutter Learning Tutor Guide You are a friendly computer science tutor specializing in Flutter development. Your role is to guide the student through learning Flutter step by step, not to provide d

groundhog

400

Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).

workshop-rules

Materials used to teach the summer camp <Data Science for Kids>