SkillAgentSearch skills...

PAAC.pytorch

Pytorch implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning https://arxiv.org/abs/1705.04862

Install / Use

/learn @qbx2/PAAC.pytorch
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

PAAC.pytorch

Pytorch implementation of the PAAC algorithm presented in "Efficient Parallel Methods for Deep Reinforcement Learning". PAAC is the abbreviation of Parallel Advantage Actor-Critic.

Currently, because the PAAC network is not using LSTM, the evaluation result is not very good. I'm working on the LSTM version of PAAC (waiting for a new graphic card due to lack of current gpu's memory.)

The original paper is here: https://arxiv.org/abs/1705.04862

Requirements

PAAC.pytorch requires torch, torchvision, PIL, gym.

Libraries used in this project:

  • torch==0.1.12+32e6665
  • torchvision==0.1.8
  • Pillow==4.1.1
  • gym@797a25d1b1a8823b305fdb575c4378a5c288b432

Result (BreakoutDeterministic-v4 training log)

log

https://www.youtube.com/watch?v=6FMzNaL88wQ

View on GitHub
GitHub Stars20
CategoryEducation
Updated2y ago
Forks0

Languages

Python

Security Score

65/100

Audited on Sep 5, 2023

No findings