facebookresearch / ReAgent: A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
tensorflow / Agents: TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
cair / TsetlinMachine: Code and datasets for the Tsetlin Machine
alison-carrera / Onn: Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)
amazon-science / Auction Gym: AuctionGym is a simulation environment that enables reproducible evaluation of bandit and reinforcement learning methods for online advertising auctions.
cair / PyTsetlinMachine: Implements the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, Weighted Tsetlin Machine, and Embedding Tsetlin Machine, with support for continuous features, multigranularity, clause indexing, and literal budget
PierreGe / RL Movie Recommender: The purpose of our research is to study reinforcement learning approaches to building a movie recommender system. We formulate the problem of interactive recommendation as a contextual multi-armed bandit.
yfletberliac / Rlss 2019: Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch).
kfoofw / Bandit Simulations: Bandit algorithm simulations for online learning
MLWave / Hodor AutoML: Hodor AutoML: Brute-Bandit fast good-enough solutions to a wide range of machine learning problems.
Nth-iteration-labs / Contextual: Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
banditml / Banditml: A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
SamRagusa / Checkers Reinforcement Learning: A checkers reinforcement learning AI, and all the tools needed to train it.
rayshi14 / HybridLinUCB Python: Python code for the Hybrid Linear UCB bandit learning algorithm (L. Li, 2010)
cair / Convolutional Tsetlin Machine Tutorial: Tutorial on the Convolutional Tsetlin Machine
weisong-ucr / MAB Malware: MAB-Malware is an open-source reinforcement learning framework to generate adversarial examples (AEs) for PE malware. We model this problem as a classic multi-armed bandit (MAB) problem by treating each action-content pair as an independent slot machine.
cfoh / Multi Armed Bandit Example: Learning Multi-Armed Bandits by Examples. Currently covering MAB, UCB, Boltzmann Exploration, Thompson Sampling, Contextual MAB, LinUCB, Deep MAB.
cair / PyTsetlinMachineParallel: Multi-threaded implementation of the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, and Weighted Tsetlin Machine, with support for continuous features and multigranularity.
apexrl / RL Exploration Paper Lists: A paper collection on reinforcement learning exploration, covering exploration in multi-armed bandits, reinforcement learning, and multi-agent reinforcement learning.
shirishjain / Music Recommender Engine: The purpose of this Personalized Music Recommendation Engine is to use a reinforcement learning approach to build a music recommender system. It formulates interactive recommendation as a contextual multi-armed bandit: learning user preferences, recommending new songs, and receiving their ratings.