PLDM
No description available
Install / Use
/learn @vladisai/PLDMREADME
Overview
This is a repository for the paper "Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models".
<img src="assets/main_idea.png" width="100%" style="max-width: 640px"><br/>
In this paper, we focus on methods that can learn from offline trajectories without reward annotations. We test methods ranging from RL to control, and find that planning with a learned latent dynamics model (PLDM) is a promising approach for this setting when the data is imperfect.
Setting up
Repo Setup
git clone git@github.com:vladisai/PLDM.git
cd PLDM
pip install -r requirements.txt
pip install -e .
Run Experiments
- Go to
pldm_envs/, follow instructions to set up dataset for the environment of your hoice - Go to
pldm/, follow instruction to run training or evaluation
Datasets
To see the datasets we used to train our models, see folders inside pldm_envs/.
The readmes there will guide you on how to download and set up the datasets.
Related Skills
node-connect
343.3kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
92.1kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
343.3kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
343.3kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
