Results for "bandit-learning"

Claude Code Claude Desktop GitHub Copilot Cursor Windsurf Cline Zed JetBrains

📄SKILL.md 🤖CLAUDE.md ⚡Claude Commands 📐.cursorrules 📐Cursor Rules 🕹️AGENTS.md 🧬codex.md 🏄.windsurfrules 🔧.clinerules 🧑‍✈️Copilot Instructions

All Development Operations Data Product Marketing Customer Design Sales

99 skills found · Page 1 of 4

facebookresearch / ReAgent

3.7k

A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)

universal

Updated 4h ago

tensorflow / Agents

3.0k

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

universal

banditscontextual-banditsdqn+5

Updated 1d ago

cair / TsetlinMachine

499

Code and datasets for the Tsetlin Machine

universal

bandit-learningfrequent-pattern-mininggame-theory+5

Updated 7h ago

alison-carrera / Onn

191

Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)

universal

contextual-banditsmabmachine-learning-library+10

Updated 1mo ago

amazon-science / Auction Gym

187

AuctionGym is a simulation environment that enables reproducible evaluation of bandit and reinforcement learning methods for online advertising auctions.

universal

advertisingmachine-learningreal-time-bidding+1

Updated 1mo ago

cair / PyTsetlinMachine

150

Implements the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, Weighted Tsetlin Machine, and Embedding Tsetlin Machine, with support for continuous features, multigranularity, clause indexing, and literal budget

universal

bandit-learningclassificationconvolution+8

Updated 12d ago

PierreGe / RL Movie Recommender

120

The purpose of our research is to study reinforcement learning approaches to building a movie recommender system. We formulate the problem of interactive recommendation as a contextual multi-armed bandit.

universal

movie-recommendationreinforcement-learning

Updated 2mo ago

yfletberliac / Rlss 2019

Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch).

universal

banditseducationgoogle-colab+6

Updated 18d ago

kfoofw / Bandit Simulations

Bandit algorithms simulations for online learning

universal

Updated 1mo ago

MLWave / Hodor AutoML

Hodor AutoML: Brute-Bandit fast good-enough solutions to a wide range of machine learning problems.

universal

Updated 7mo ago

Nth-iteration-labs / Contextual

Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies

universal

banditbandit-experimentsbandit-learning+17

Updated 5d ago

banditml / Banditml

A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.

universal

banditscontextual-banditsneural-networks+3

Updated 2mo ago

SamRagusa / Checkers Reinforcement Learning

A checkers reinforcement learning AI, and all the tools needed to train it.

universal

adversarialadversarial-learningai+14

Updated 4mo ago

rayshi14 / HybridLinUCB Python

Hybrid Linear UCB bandit learning algorithm L Li(2010) python code

universal

Updated 1y ago

cair / Convolutional Tsetlin Machine Tutorial

Tutorial on the Convolutional Tsetlin Machine

universal

bandit-learningconvolutionfrequent-pattern-mining+5

Updated 1mo ago

weisong-ucr / MAB Malware

MAB-Malware an open-source reinforcement learning framework to generate AEs for PE malware. We model this problem as a classic multi-armed bandit (MAB) problem, by treating each action-content pair as an independent slot machine.

universal

Updated 15h ago

cfoh / Multi Armed Bandit Example

Learning Multi-Armed Bandits by Examples. Currently covering MAB, UCB, Boltzmann Exploration, Thompson Sampling, Contextual MAB, LinUCB, Deep MAB.

universal

machine-learningmulti-armed-banditsrecommendation-system+1

Updated 26d ago

cair / PyTsetlinMachineParallel

Multi-threaded implementation of the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, and Weighted Tsetlin Machine, with support for continuous features and multigranularity.

universal

bandit-learningclassificationconvolution+7

Updated 9mo ago

apexrl / RL Exploration Paper Lists

Paper Collection of Reinforcement Learning Exploration covers Exploration of Muti-Arm-Bandit, Reinforcement Learning and Multi-agent Reinforcement Learning.

universal

Updated 5d ago

shirishjain / Music Recommender Engine

The purpose of this Personalized Music Recommendation Engine is to use reinforcement learning approach to build a music recommender system and to formulate the problem of interactive recommendation as a contextual multi-armed bandit, learning user preferences recommending new songs and receiving their ratings.

zed

Updated 1mo ago