# Contextual: Multi-Armed Bandits in R
<!-- * [AppVeyor: passing](https://ci.appveyor.com/project/robinvanemden/contextual) * [Travis CI: passing](https://travis-ci.org/Nth-iteration-labs/contextual) * [Codecov: 96% coverage](https://codecov.io/gh/Nth-iteration-labs/contextual) -->

## Overview
R package facilitating the simulation and evaluation of context-free and contextual Multi-Armed Bandit policies.
The package has been developed to:
- Ease the implementation, evaluation and dissemination of both existing and new contextual Multi-Armed Bandit policies.
- Introduce a wider audience to the advanced sequential decision strategies offered by contextual bandit policies.
Package links:
## Installation
To install contextual from CRAN:

```r
install.packages('contextual')
```

To install the development version (requires the devtools package):

```r
install.packages("devtools")
devtools::install_github('Nth-iteration-labs/contextual')
```
When working on or extending the package, clone its GitHub repository, then do:

```r
install.packages("devtools")
devtools::install_deps(dependencies = TRUE)
devtools::build()
devtools::reload()
```

followed by a clean and rebuild.
## Overview of core classes

Contextual consists of six core classes. Of these, the Bandit and Policy classes are subclassed and extended when implementing custom (synthetic or offline) bandits and policies. The other four classes (Agent, Simulator, History, and Plot) are the workhorses of the package, and generally need not be adapted or subclassed.
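As a sketch of how these classes fit together, the snippet below (adapted from the package's demos; exact constructor arguments may differ between versions) runs an epsilon-greedy policy against a three-armed Bernoulli bandit and plots the result:

```r
library(contextual)

# A context-free simulation: an epsilon-greedy policy played against a
# three-armed Bernoulli bandit, repeated over independent simulations.
horizon     <- 100
simulations <- 100
weights     <- c(0.9, 0.1, 0.1)   # per-arm success probabilities

policy  <- EpsilonGreedyPolicy$new(epsilon = 0.1)
bandit  <- BasicBernoulliBandit$new(weights = weights)
agent   <- Agent$new(policy, bandit)

# Simulator runs the agent(s) and collects results in a History object,
# which Plot (via plot()) and summary() can then analyze.
history <- Simulator$new(agent, horizon, simulations)$run()

plot(history, type = "cumulative")
summary(history)
```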
## Documentation
See the demo directory for practical examples and replications of both synthetic and offline (contextual) bandit policy evaluations.
When seeking to extend contextual, it may also be useful to review "Extending Contextual: Frequently Asked Questions" before diving into the source code.
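To give a flavor of what extending the package involves, a custom context-free policy might look roughly as follows. This is a sketch, not a drop-in implementation: the `set_parameters` / `get_action` / `set_reward` method names and the `theta_to_arms` mechanism follow contextual's Policy API as described in its documentation, but check the FAQ and source for the exact contracts. The class name `MyEpsilonGreedyPolicy` is hypothetical.

```r
library(R6)
library(contextual)

# A minimal epsilon-greedy policy sketch: with probability epsilon pick a
# random arm, otherwise exploit the arm with the highest empirical mean.
MyEpsilonGreedyPolicy <- R6Class("MyEpsilonGreedyPolicy",
  inherit = Policy,
  public = list(
    epsilon = NULL,
    initialize = function(epsilon = 0.1) {
      super$initialize()
      self$epsilon <- epsilon
    },
    set_parameters = function(context_params) {
      # theta_to_arms replicates these values once per arm into self$theta
      self$theta_to_arms <- list(n = 0, mean = 0)
    },
    get_action = function(t, context) {
      if (runif(1) > self$epsilon) {
        # exploit: arm with the highest running mean so far
        self$action$choice <- which.max(unlist(self$theta$mean))
      } else {
        # explore: uniformly random arm out of context$k arms
        self$action$choice <- sample.int(context$k, 1)
      }
      self$action
    },
    set_reward = function(t, context, action, reward) {
      arm <- action$choice
      r   <- reward$reward
      # incremental update of the chosen arm's count and running mean
      self$theta$n[[arm]]    <- self$theta$n[[arm]] + 1
      self$theta$mean[[arm]] <- self$theta$mean[[arm]] +
                                (r - self$theta$mean[[arm]]) / self$theta$n[[arm]]
      self$theta
    }
  )
)
```

Custom bandits follow the same pattern, subclassing Bandit instead of Policy.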
How to replicate figures from two introductory context-free Multi-Armed Bandits texts:
- Replication of figures from Sutton and Barto, "Reinforcement Learning: An Introduction", Chapter 2
- Replication of figures from "Bandit Algorithms for Website Optimization" by John Myles White
Basic, context-free multi-armed bandit examples:
- Basic MAB Epsilon Greedy evaluation
- Synthetic MAB policy comparison
- Replication of Eckles & Kaptein (Bootstrap Thompson Sampling)
Examples of both synthetic and offline contextual multi-armed bandit evaluations:
- Offline Evaluation Data Set - Bootstrapped Replay Bandit: CARSKit DePaul Movies
- Offline Evaluation Data Set - Lookup Table Replay Bandit: MovieLens 10M
An example of how to use the optional theta log to create interactive context-free bandit animations:
Some more extensive vignettes to get you started with the package:
- Getting started: running simulations
- Offline evaluation: replication of Li et al. (2010)
- Class reference
Paper offering a general overview of the package's structure & API:
<!--- * [Blog at Pavlov](https://pavlov.tech/category/contextual/) -->

## Policies and Bandits
Overview of contextual's growing library of contextual and context-free bandit policies:
| General | Context-free | Contextual | Other |
|---------|--------------|------------|-------|
| Random<br>Oracle<br>Fixed | Epsilon-Greedy<br>Epsilon-First<br>UCB1, UCB2<br>Thompson Sampling<br>BootstrapTS<br>Softmax<br>Gradient<br>Gittins | CMAB Naive Epsilon-Greedy<br>Epoch-Greedy<br>LinUCB (General, Disjoint, Hybrid)<br>Linear Thompson Sampling<br>ProbitTS<br>LogitBTS<br>GLMUCB | Lock-in Feedback (LiF) |
Overview of contextual's bandit library:
| Basic Synthetic | Contextual Synthetic | Offline | Continuous |
|-----------------|----------------------|---------|------------|
| Basic Bernoulli Bandit<br>Basic Gaussian Bandit | Contextual Bernoulli<br>Contextual Logit<br>Contextual Hybrid<br>Contextual Linear<br>Contextual Wheel | Replay Evaluator<br>Bootstrap Replay<br>Propensity Weighting<br>Direct Method<br>Doubly Robust | Continuum |
## Alternative parallel backends
By default, contextual uses R's built-in parallel package to facilitate the parallel evaluation of multiple agents over repeated simulations. See the demo/alternative_parallel_backends directory for several alternative parallel backends:
- Microsoft Azure VMs using doAzureParallel.
- Redis using doRedis.
- MPI (Message-Passing Interface) using Rmpi and doMPI.
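The alternative-backend demos generally register a foreach-compatible backend before running the simulation. As a rough sketch of the MPI variant (assumptions: the registration pattern below mirrors the demo scripts, and the demos handle the wiring between the registered backend and the Simulator; see demo/alternative_parallel_backends for the exact setup):

```r
# Register an MPI cluster as the parallel backend via Rmpi/doMPI,
# then run the simulation, then shut the cluster down cleanly.
library(doMPI)

cl <- startMPIcluster()   # one worker per available MPI rank
registerDoMPI(cl)

# ... construct Agent(s) and run Simulator$new(...)$run() here ...

closeCluster(cl)
mpi.quit()
```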
## Maintainers
- Robin van Emden: author, maintainer*
- Maurits Kaptein: supervisor*

\* Tilburg University / Jheronimus Academy of Data Science.
If you encounter a clear bug, please file a minimal reproducible example on GitHub.