Results for "boltzmann-exploration"

Claude Code Claude Desktop GitHub Copilot Cursor Windsurf Cline Zed JetBrains

📄SKILL.md 🤖CLAUDE.md ⚡Claude Commands 📐.cursorrules 📐Cursor Rules 🕹️AGENTS.md 🧬codex.md 🏄.windsurfrules 🔧.clinerules 🧑‍✈️Copilot Instructions

All Development Operations Data Product Marketing Customer Design Sales

4 skills found

cfoh / Multi Armed Bandit Example

Learning Multi-Armed Bandits by Examples. Currently covering MAB, UCB, Boltzmann Exploration, Thompson Sampling, Contextual MAB, LinUCB, Deep MAB.

universal

machine-learningmulti-armed-banditsrecommendation-system+1

Updated 26d ago

negarhonarvar / DeepReinforcementLearning

A Complete Collection of Deep RL Famous Algorithms implemented in Gymnasium most Popular environments

universal

boltzmann-explorationcartpole-v1d3qn+8

Updated 11mo ago

lkwbr / Grid Qlearn

See a program learn the best actions in a grid-world to get to the target cell, and even run through the grid in real-time! This is a Q-Learning implementation for 2-D grid world using both epsilon-greedy and Boltzmann exploration policies.

universal

boltzmann-explorationepsilon-greedygrid-world+3

Updated 2y ago

lucadivit / Reinforcement Learning Maze Solver

This github contains a simple OpenAi Gym Maze Enviroment and (at now) a RL Algorithm to solve it.

universal

boltzmann-explorationepsilon-decayepsilon-greedy+15

Updated 1y ago