BRAC+: Improved Behavior Regularized Offline Reinforcement Learning

This repository is the official implementation of BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement Learning.

Requirements

We highly recommend creating a new Python environment to test our code.

Conda Environment

conda create -n bracp python=3.8

To install requirements:

Python package

pip install -r requirements.txt

D4RL library

pip install git+https://github.com/rail-berkeley/d4rl@master#egg=d4rl

rlutils library

pip install rlutils-python==0.0.3

Training

python d4rl_bracp.py train --env_name halfcheetah-medium-v0 --seed 110
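To repeat the run over several seeds, the command above can be generated programmatically. The following is a minimal sketch, not part of the repository: the helper names `build_train_commands` and `run_all` are hypothetical, and the extra seeds 111 and 112 are illustrative only. It uses just the `train` subcommand and the `--env_name` and `--seed` flags shown above.

```python
import subprocess


def build_train_commands(env_names, seeds, script="d4rl_bracp.py"):
    """Build one training command per (environment, seed) pair.

    Hypothetical helper: only the flags shown in this README are used.
    """
    commands = []
    for env_name in env_names:
        for seed in seeds:
            commands.append(
                ["python", script, "train", "--env_name", env_name, "--seed", str(seed)]
            )
    return commands


def run_all(commands):
    """Run the commands one after another, stopping at the first failure."""
    for cmd in commands:
        subprocess.run(cmd, check=True)


# Dry run: print the commands instead of launching them.
# Seeds other than 110 are illustrative assumptions.
commands = build_train_commands(["halfcheetah-medium-v0"], [110, 111, 112])
for cmd in commands:
    print(" ".join(cmd))
```

Calling `run_all(commands)` from the repository root would then launch the runs sequentially.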

The script first pretrains the behavior policy and an initial policy that minimizes the KL divergence to it.
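To make the KL divergence concrete, here is a small sketch of its closed form for two diagonal Gaussian distributions. This README does not state how the policies are parameterized or which direction of the divergence the script uses, so the Gaussian form and the function name `gaussian_kl` are assumptions for illustration only.

```python
import math


def gaussian_kl(mean_p, std_p, mean_q, std_q):
    """KL(p || q) for diagonal Gaussians, summed over action dimensions.

    Assumption: both distributions are diagonal Gaussians given by
    per-dimension means and standard deviations.
    """
    total = 0.0
    for mp, sp, mq, sq in zip(mean_p, std_p, mean_q, std_q):
        total += math.log(sq / sp) + (sp ** 2 + (mp - mq) ** 2) / (2.0 * sq ** 2) - 0.5
    return total


# Identical distributions have zero divergence.
same = gaussian_kl([0.0, 0.0], [1.0, 1.0], [0.0, 0.0], [1.0, 1.0])

# Shifting the mean by 1 with unit variance gives 0.5 in one dimension.
shifted = gaussian_kl([0.0], [1.0], [1.0], [1.0])

print(same, shifted)
```

A policy that matches the behavior policy exactly drives this quantity to zero, which is why it can serve as a pretraining objective.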

Logging

The logs are written to data/d4rl_results/.
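After a run, the files under that directory can be listed with a short script. This is a sketch only: the helper name `list_log_files` is hypothetical, and the file names and formats the training script produces are not described in this README, so the code simply lists whatever files exist.

```python
from pathlib import Path


def list_log_files(root="data/d4rl_results"):
    """Return all files under the log directory, sorted by path.

    Returns an empty list if the directory does not exist yet.
    """
    root = Path(root)
    if not root.is_dir():
        return []
    return sorted(p for p in root.rglob("*") if p.is_file())


for path in list_log_files():
    print(path)
```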
