Hyperoptax
Parallel hyperparameter tuning with JAX
[!NOTE] Hyperoptax is still a WIP and the API is subject to change. There are many rough edges to smooth out. If you want to use it in a large-scale project, it is recommended to pin a specific version or install from source.
⛰️ Introduction
Hyperoptax is a lightweight toolbox for parallel hyperparameter optimization of pure JAX functions. It provides a concise API that lets you wrap any JAX-compatible loss or evaluation function and search across spaces in parallel – all while staying in pure JAX.
🏗️ Installation
pip install hyperoptax
If you want to use the notebooks:
pip install hyperoptax[notebooks]
If you do not yet have JAX installed, pick the right wheel for your accelerator:
# CPU-only
pip install --upgrade "jax[cpu]"
# or GPU/TPU – see the official JAX installation guide
🥜 In a nutshell
Hyperoptax offers a simple API for wrapping pure JAX functions in a hyperparameter search and evaluating candidates in parallel (currently via vmap only). See the notebooks for more examples.
import jax

from hyperoptax.bayesian import BayesianOptimizer
from hyperoptax.spaces import LogSpace, LinearSpace

@jax.jit
def train_nn(learning_rate, final_lr_pct):
    ...
    return val_loss

search_space = {
    "learning_rate": LogSpace(1e-5, 1e-1, 100),
    "final_lr_pct": LinearSpace(0.01, 0.5, 100),
}

search = BayesianOptimizer(search_space, train_nn)
best_params = search.optimize(
    n_iterations=100,
    n_parallel=10,
    maximize=False,
)
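Under the hood, vmap-style parallelism means a whole batch of candidate configurations is evaluated in one vectorized call. Here is a minimal sketch in plain JAX (not the Hyperoptax API; the quadratic objective is an invented stand-in for a real training run):

```python
import jax
import jax.numpy as jnp

# Toy objective standing in for a training run: lower is better.
def objective(learning_rate, final_lr_pct):
    return (jnp.log10(learning_rate) + 3.0) ** 2 + (final_lr_pct - 0.1) ** 2

# A batch of candidate configurations, evaluated in a single vectorized call.
lrs = jnp.logspace(-5, -1, 10)
pcts = jnp.linspace(0.01, 0.5, 10)
losses = jax.vmap(objective)(lrs, pcts)

best = jnp.argmin(losses)
```

Each candidate configuration occupies one slot of the batched arrays, so the whole sweep compiles to a single XLA computation.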
💪 Hyperoptax in action
<img src="./assets/gp_animation.gif" alt="BayesOpt animation" style="width:80%;"/>
🔪 The Sharp Bits
Since we are working in pure JAX, the same sharp bits apply. Some consequences of this for Hyperoptax:
- Parameters that change the length of an evaluation (e.g. epochs, generations, ...) can't be optimized in parallel.
- Neural network structures can't be optimized in parallel either.
- Strings can't be used as hyperparameters.
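The first restriction follows from JAX requiring static loop lengths at trace time. A minimal sketch (toy gradient-descent step, invented for illustration): vmap can batch over learning rates, but the epoch count must stay a concrete Python int.

```python
import jax
import jax.numpy as jnp

def train(learning_rate, n_epochs):
    # n_epochs sets the scan length, which JAX requires to be a static int.
    def step(w, _):
        return w - learning_rate * 2.0 * w, None  # gradient step on f(w) = w**2
    w_final, _ = jax.lax.scan(step, 1.0, None, length=n_epochs)
    return w_final

# Fine: batch over learning rates while the epoch count stays static.
finals = jax.vmap(train, in_axes=(0, None))(jnp.array([0.1, 0.2, 0.3]), 50)

# Not possible: making n_epochs a batched array would turn the scan length
# into a tracer, which JAX rejects, so epoch counts can't be searched via vmap.
```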
🫂 Contributing
We welcome pull requests! To get started:
- Open an issue describing the bug or feature.
- Fork the repository and create a feature branch (git checkout -b my-feature).
- Install dependencies: pip install -e ".[all]"
- Run the test suite: XLA_FLAGS=--xla_force_host_platform_device_count=4 pytest # Fake GPUs for pmap tests
- Ensure the notebooks still work.
- Format your code with ruff.
- Submit a pull request.
Roadmap
I'm developing this both as a passion project and for my PhD work. I have a few ideas on where to take this library:
- Sample hyperparameter configurations on the fly rather than generate a huge grid at initialisation.
- Switch domain type from a list of arrays to a PyTree.
- Callbacks!
- Inspired by wandb's sweeps, use a linear grid for all parameters and apply transformations at sample time.
- We are currently redoing the kernel calculation at each iteration when only the last row/column is actually needed. JAX requires sizes to be constant, so we need to do something clever...
- Need to find a way to share the GP across workers on pmap for Bayesian.
- Tune the kernel's length scale during optimization (as done in other implementations).
- Some clumpiness in the acquisition function; there is literature that can help us.
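The sample-time transformation idea from the roadmap (a linear unit grid for every parameter, warped only when a point is drawn) can be sketched like this; log_transform is a hypothetical helper, not current Hyperoptax API:

```python
import jax
import jax.numpy as jnp

def log_transform(u, low, high):
    # Warp a unit-interval sample onto a log-uniform range [low, high].
    return low * (high / low) ** u

key = jax.random.PRNGKey(0)
u = jax.random.uniform(key, (5,))   # every parameter lives on [0, 1]
lrs = log_transform(u, 1e-5, 1e-1)  # warped only at sample time
```

Keeping all parameters on a common unit interval means the optimizer never needs per-parameter grids, and new transformations (log, integer rounding, ...) become one-line functions.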
📝 Citation
If you use Hyperoptax in academic work, please cite:
@misc{hyperoptax,
  author = {Theo Wolf},
  title  = {{Hyperoptax}: Parallel hyperparameter tuning with JAX},
  year   = {2025},
  url    = {https://github.com/TheodoreWolf/hyperoptax}
}
