Minigrid
No description available
Install / Use
/learn @kebaek/MinigridREADME
Computational Benefits of Intermediate Rewards for Hierarchical Planning
We provide the code used to run the MiniGrid experiments provided in the paper.
Features
All our custom MiniGrid environments are available in gym-minigrid/gym_minigrid/envs/custom.py
For asynchronous Q-learning:
- Script to train:
scripts/qlearn.py - Script to evaluate:
scripts/qlearn_evaluate.py
For Deep RL algorithms (A2C, PPO, DQN):
- Script to train:
scripts/train.py - Script to evaluate,
scripts/evaluate.py
See experiments/ folder to run all experiments conducted in the paper.
We provide a sample parser file Log_Parser.ipynb to gather results presented in paper (average steps, rewards, win rate) for all seeds.
Installation
-
Clone this repository.
-
Install
gym-minigridenvironments andtorch-acRL algorithms:
pip3 install -r requirements.txt
cd torch-ac
pip3 install -e .
Related Skills
node-connect
342.5kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
85.3kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
342.5kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
342.5kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
