Aurelien-Pelissier / AlphaFUSE - Bandit-based reinforcement learning applied to feature selection: a Monte Carlo tree search algorithm is trained to find the best feature subset.
singhsidhukuldeep / Contextual Bandits - A comprehensive Python library implementing a variety of contextual and non-contextual multi-armed bandit algorithms, including LinUCB, Epsilon-Greedy, Upper Confidence Bound (UCB), Thompson Sampling, KernelUCB, NeuralLinearBandit, and DecisionTreeBandit, designed for reinforcement learning applications.
everpeace / Banditsbook Scala - Scala implementations of standard algorithms for the multi-armed bandit problem.
kurttheviking / Softmax Js - A softmax multi-armed bandit algorithm.
GNGN1111 / Github Off Ts - Code for the paper "Dynamic pricing algorithm for edge computing task offloading based on Contextual Multi-Armed Bandit".
flaviotruzzi / AdBandits - AdBandit: A New Algorithm For Multi-Armed Bandits
ardaegeunlu / X Armed Bandits - Implementation of the X-armed bandits algorithm, as detailed in the paper "X-armed Bandits" (Bubeck et al., 2011).
rssalessio / Reading List - A collection of interesting papers that I have read so far or want to read. Note that the list is not up to date. Topics: reinforcement learning, deep learning, mathematics, statistics, bandit algorithms, optimization.
viswanath57 / Bandit Algorithms - No description available
sgiguere / RobinHood NeurIPS 2019 - Implementation of safe offline bandit algorithms.
annieyan / Bandits Using UCB Algorithm - Thompson Sampling for Bandits using UCB policy
alextanhongpin / Node Bandit - A Node.js implementation of a multi-armed bandit algorithm.
clreda / NORDic - Network-Oriented Repurposing of Drugs Python Package
etiennekintzler / Visualize Bandit Algorithms - Some visualizations of bandit algorithm outputs.
Nath-B / Reinforcement Learning - Multi-armed bandit algorithms.
KaleabTessera / Multi Armed Bandit - Implementation of the greedy, ε-greedy, and Upper Confidence Bound (UCB) algorithms for the multi-armed bandit problem.
alaiacano / Sifter Lib - An implementation of a few multi-armed bandit algorithms in Scala.
sangitatukai / SAGIN Framework For Multi Layer Caching - A simulation framework for intelligent, multi-tier content caching in Space-Air-Ground Integrated Networks (SAGIN), using contextual multi-armed bandit algorithms to optimize data availability and minimize retrieval delays.
dourouc05 / CombinatorialBandits.jl - Implementation of several algorithms for combinatorial bandits, a kind of reinforcement learning.
viswanath57 / Bandit Algorithms Demo - No description available