johnmyleswhite / BanditsBook: Code for my book on Multi-Armed Bandit Algorithms
bgalbraith / Bandits: Python library for Multi-Armed Bandits
lilianweng / Multi Armed Bandit: Play with the solutions to the multi-armed-bandit problem.
fidelity / Mabwiser: [IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library
fidelity / Mab2rec: [AAAI 2024] Mab2Rec: Multi-Armed Bandits Recommender
jkomiyama / Banditlib: Multi-armed bandit simulation library
bayesianbandits / Bayesianbandits: A Pythonic microframework for multi-armed bandit problems
alison-carrera / Mabalgs: 👤 Multi-Armed Bandit Algorithms Library (MAB) 👮
bmuller / Bandit: A multi-armed bandit optimization framework for Rails
PierreGe / RL Movie Recommender: The purpose of our research is to study reinforcement learning approaches to building a movie recommender system. We formulate the problem of interactive recommendation as a contextual multi-armed bandit.
Unity-Technologies / BanditDungeon: Demo project using multi-armed bandit algorithm
antonismand / Personalized News Recommendation: Multi Armed Bandits implementation using the Yahoo! Front Page Today Module User Click Log Dataset
akhadangi / Multi Armed Bandits: This notebook implements several classes of multi-armed bandits, including epsilon-greedy, UCB, Linear UCB (contextual bandits), and Kernel UCB. Some well-cited papers in this area are also implemented.
Nth-iteration-labs / Streamingbandit: Python application to set up and run streaming (contextual) bandit experiments.
roycoding / Slots: A multi-armed bandit library for Python
Nth-iteration-labs / Contextual: Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
DeaconDesperado / Flask Mab: An implementation of the multi-armed bandit optimization pattern as a Flask extension
raffg / Multi Armed Bandit: Monte Carlo simulations of several different multi-armed bandit algorithms and a comparison with classical statistical A/B testing
stitchfix / Mab: Library for multi-armed bandit selection strategies, including efficient deterministic implementations of Thompson sampling and epsilon-greedy.
pingles / Bandit: Multi-armed bandit algorithms in Clojure