PacmanDQN
Deep Reinforcement Learning in Pac-man
Install / Use
/learn @tychovdo/PacmanDQNREADME
PacmanDQN
Deep Reinforcement Learning in Pac-man
Demo
Example usage
Run a model on smallGrid layout for 6000 episodes, of which 5000 episodes
are used for training.
$ python3 pacman.py -p PacmanDQN -n 6000 -x 5000 -l smallGrid
Layouts
Different layouts can be found and created in the layouts directory
Parameters
Parameters can be found in the params dictionary in pacmanDQN_Agents.py. <br />
<br />
Models are saved as "checkpoint" files in the /saves directory. <br />
Load and save filenames can be set using the load_file and save_file parameters. <br />
<br />
Episodes before training starts: train_start <br />
Size of replay memory batch size: batch_size <br />
Amount of experience tuples in replay memory: mem_size <br />
Discount rate (gamma value): discount <br />
Learning rate: lr <br />
<br />
Exploration/Exploitation (ε-greedy): <br />
Epsilon start value: eps <br />
Epsilon final value: eps_final <br />
Number of steps between start and final epsilon value (linear): eps_step <br />
Citation
Please cite this repository if it was useful for your research:
@article{van2016deep,
title={Deep Reinforcement Learning in Pac-man},
subtitle={Bachelor Thesis},
author={van der Ouderaa, Tycho},
year={2016},
school={University of Amsterdam},
type={Bachelor Thesis},
}
Requirements
python==3.5.1tensorflow==0.8rc
Acknowledgements
DQN Framework by (made for ATARI / Arcade Learning Environment)
Pac-man implementation by UC Berkeley:
Related Skills
proje
Interactive vocabulary learning platform with smart flashcards and spaced repetition for effective language acquisition.
best-practices-researcher
The most comprehensive Claude Code skills registry | Web Search: https://skills-registry-web.vercel.app
fullstack-developer
Full-Stack Developer Role Role Definition CONCEPT: Full-stack developer expertise ARCHITECTURE: Covers both frontend and backend development BEST_PRACTICE: Comprehensive web applicat
last30days-skill
18.5kAI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary

