Madupite

a High-Performance Distributed Solver for Large-Scale Markov Decision Processes (MDP) relying on Inexact Policy Iteration; for Python and C++

Generate Convert Improve

Install / Use

/learn @madupite/Madupite

About this skill

Quality Score

0/100

README

<h1 align="center">madupite</h1> <div align="center"> <i>a High-Performance Distributed Solver for Large-Scale Markov Decision Processes</i> <br/><br/> <a href="https://github.com/madupite/madupite/actions/workflows/ci-pipeline.yml"><img src="https://github.com/madupite/madupite/actions/workflows/ci-pipeline.yml/badge.svg" alt="CI Pipeline"></a> <a href="https://madupite.github.io/"><img src="https://img.shields.io/badge/docs-latest-blue.svg" alt="Documentation"></a> <a href="https://joss.theoj.org/papers/b7796be0782d9feb0bf5fb3bc37bc7e4"><img src="https://joss.theoj.org/papers/b7796be0782d9feb0bf5fb3bc37bc7e4/status.svg" alt="JOSS"></a> <br/><br/> <img src="docs/_static/madupite_logo.png" width="250" height="250"/> </div> <br/>

madupite is a high-performance C++ library with a Python interface designed for solving large-scale Markov Decision Processes (MDPs) using Distributed Inexact Policy Iteration (iPI)[^1]. Leveraging distributed sparse linear solvers from PETSc, madupite efficiently handles the computational complexity associated with large-scale MDPs.

Key Features

Scalable MDP Solver: Efficiently solve MDPs with large state and action spaces using distributed computation.
Python and C++ APIs: Access the power of madupite through both Python and C++, depending on your performance needs.
Distributed Computing: Integrates with PETSc and MPI for distributed computing on multi-core and cluster environments.

Installation

After cloning the repository, you can install the package using the following commands.

conda env create -f environment.yml
conda activate madupiteenv
pip install .

Note that the conda environment file is limited to Linux for now. After installation, run python examples/install/main.py or mpirun -n <number of ranks> python examples/install/main.py to check if the installation was successful.

We recommend using the provided conda environment. If you prefer to rely on your own environment, make sure to install the required dependencies listed in environment.yml. We refer to the documentation for more information on the installation process.

Examples

Solve any stochastic optimal control problem formulated as a MDP using madupite! For example, the fastest way out of a maze or the optimal control input to balance an inverted pendulum:

Examples on how to use madupite can be found in the documentation (tutorials and examples) as well as in the examples directory.

In short: The transition probabilities and stage costs can either be loaded from files or generated using functions in the code (or a combination of both):

See examples/tutorial/ex1.py for an example on how to generate the required data using function in the code.
See examples/tutorial/ex2.py for an example on how to load the required data from files as well as examples/tutorial/ex2_data_generation.py to see how the required files can be generated from NumPy or SciPy arrays.

The tutorials also provides more detail on the data layout and how to preallocate sparse matrices for improved performance.

Contributing

Contributions to madupite are welcome! Please see the guidelines file CONTRIBUTING.md for more information.

License

madupite is distributed under the MIT License. See the LICENSE file for more information.

Acknowledgements

madupite was developed at the Automatic Control Laboratory at ETH Zurich. This work was supported by the European Research Council under the Horizon 2020 Advanced under Grant 787845 (OCAL) and by the SNSF through NCCR Automation (Grant Number 180545).

If you use madupite in your research, please cite it.

@article{Gargiani2025,
    doi = {10.21105/joss.07411},
    url = {https://doi.org/10.21105/joss.07411},
    year = {2025},
    publisher = {The Open Journal},
    volume = {10},
    number = {108},
    pages = {7411},
    author = {Matilde Gargiani and Philip Pawlowsky and Robin Sieber and Václav Hapla and John Lygeros},
    title = {madupite: A High-Performance Distributed Solver for Large-Scale Markov Decision Processes},
    journal = {Journal of Open Source Software}
}

[^1]: Gargiani, M.; Sieber. R.; Balta, E.; Liao-McPherson, D.; Lygeros, J. Inexact Policy Iteration Methods for Large-Scale Markov Decision Processes. https://arxiv.org/abs/2404.06136.

Related Skills

node-connect

352.5k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

claude-opus-4-5-migration

111.3k

Migrate prompts and code from Claude Sonnet 4.0, Sonnet 4.5, or Opus 4.1 to Opus 4.5

frontend-design

111.3k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

model-usage

352.5k

Use CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.

madupite

View profile

View on GitHub

GitHub Stars25

CategoryDevelopment

Updated29d ago

Forks1

madupite/madupite

Languages

C++

Security Score

95/100

Audited on Mar 10, 2026

No findings