150 skills found · Page 1 of 5
johnmyleswhite / BanditsBookCode for my book on Multi-Armed Bandit Algorithms
david-cortes / ContextualbanditsPython implementations of contextual bandits algorithms
st-tech / Zr ObpOpen Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation
SMPyBandits / SMPyBandits🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-player (MusicalChair, MEGA, rhoRand, MCTop/RandTopM etc).. Available on PyPI: https://pypi.org/project/SMPyBandits/ and documentation on
HCDM / BanditLibLibrary of contextual bandits algorithms
c-bata / GoptunaA hyperparameter optimization framework, inspired by Optuna.
alison-carrera / OnnOnline Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)
alison-carrera / Mabalgs:bust_in_silhouette: Multi-Armed Bandit Algorithms Library (MAB) :cop:
WilliamLwj / PyXABPyXAB - A Python Library for X-Armed Bandit and Online Blackbox Optimization Algorithms
KKeishiro / Yahoo RecommendationYahoo! news article recommendation system by linUCB
Unity-Technologies / BanditDungeonDemo project using multi-armed bandit algorithm
kfoofw / Bandit SimulationsBandit algorithms simulations for online learning
raffg / Multi Armed BanditMonte Carlo simulations of several different multi-armed bandit algorithms and a comparison with classical statistical A/B testing
pingles / BanditMulti-armed bandit algorithms in Clojure
tor / LibbanditLibrary for Multi-Armed Bandit Algorithms
jldbc / BanditsMulti-Armed Bandit algorithms applied to the MovieLens 20M dataset
rayshi14 / HybridLinUCB PythonHybrid Linear UCB bandit learning algorithm L Li(2010) python code
sshkhr / Practical RLMy solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow
j2kun / Exp3Python code for the post "Adversarial Bandits and the Exp3 Algorithm"
timnugent / Bandit AlgorithmsEpsilon-greedy, softmax and LinUCB contextual bandit implementations [recommender systems]