4 skills found
cfoh / Multi Armed Bandit Example: Learning Multi-Armed Bandits by Examples. Currently covering MAB, UCB, Boltzmann Exploration, Thompson Sampling, Contextual MAB, LinUCB, Deep MAB.
negarhonarvar / DeepReinforcementLearning: A complete collection of well-known deep RL algorithms implemented in the most popular Gymnasium environments.
lkwbr / Grid Qlearn: See a program learn the best actions in a grid world to reach the target cell, and even run through the grid in real time! This is a Q-learning implementation for a 2-D grid world using both epsilon-greedy and Boltzmann exploration policies.
lucadivit / Reinforcement Learning Maze Solver: This GitHub repository contains a simple OpenAI Gym maze environment and (for now) one RL algorithm to solve it.