Evogp

A GPU-accelerated library for Tree-based Genetic Programming, leveraging PyTorch and custom CUDA kernels for high-performance evolutionary computation. It supports symbolic regression, classification, and policy optimization with advanced features like multi-output trees and benchmark tools.

Generate Convert Improve

Install / Use

/learn @EMI-Group/Evogp

About this skill

Quality Score

0/100

README

<h1 align="center"> <a href="https://github.com/EMI-Group/evox"> <picture> <source media="(prefers-color-scheme: dark)" srcset="./imgs/evox_brand_light.svg"> <source media="(prefers-color-scheme: light)" srcset="./imgs/evox_brand_dark.svg"> <img alt="EvoX Logo" height="128" width="500px" src="./imgs/evox_brand_dark.svg"> </picture> </a> <br> </h1> <p align="center"> <picture> <img src="imgs/SymbolicRegression.svg" alt="" height="180"> </picture> <picture> <img src="imgs/Classification.svg" alt="" height="180"> </picture> <picture> <img src="imgs/Halfcheetah.svg" alt="HalfCheetah" height="180"> </picture> </p> <p align="center"> 🌟 EvoGP: A GPU-accelerated Framework for Tree-based Genetic Programming 🌟 </p> <p align="center"> <a href="https://arxiv.org/abs/2501.17168"> <img src="https://img.shields.io/badge/paper-arxiv-red?style=for-the-badge" alt="EvoGP Paper on arXiv"> </a> </p>

Installation
Basic API Usage
Advanced Genetic Operations
Supported Benchmarks

Introduction

EvoGP is a fully GPU-accelerated Tree-based Genetic Programming (TGP) framework built on PyTorch, leveraging custom CUDA kernels for core evolutionary operations like tree generation, mutation, crossover, and fitness evaluation. It supports multi-output trees and includes built-in tools for symbolic regression, policy optimization, and classification, along with standardized benchmarks for evaluation and tuning. EvoGP combines the flexibility of Python with the computational power of GPUs, making it an ideal platform for TGP research and applications. EvoGP is a sister project of <a href="https://github.com/EMI-Group/evox">EvoX</a>.

Key Features

CUDA-based parallel approach for TGP:
- Leverage specialized CUDA kernels to optimize critical TGP operations.
- Enhance computational efficiency, especially for large populations, enabling faster execution compared to traditional TGP methods.
GPU-accelerated framework in Python:
- Integrates CUDA kernels into Python via custom operators of PyTorch, ensuring compatibility with modern computational ecosystems.
- Achieve up to a 100x speedup compared to existing TGP implementations while maintaining or improving solution quality.
Rich in extended content:
- Offers a range of genetic operation variants, allowing users to tailor configurations for specific tasks.
- Supports multi-output trees, making it suitable for complex problems like classification and policy optimization.
- Supports Symbolic Regression, Classification, and Policy Optimization (Brax) benchmarks.

Installation

To install EvoGP, please follow the steps below:

1. Install NVIDIA CUDA Toolkit

Ensure you have the NVIDIA CUDA Toolkit installed, including nvcc. You can download it from NVIDIA's official website.

Check your CUDA version:
```
nvcc --version
```

2. Install a C++ Compiler

Ensure you have a compatible C++ compiler installed:

Linux/macOS: Install GCC (9.x or later is recommended).

sudo apt install build-essential  # On Ubuntu
gcc --version

Windows: Install the Visual C++ Build Tools. You can download it from this. During installation, ensure that the C++ workload is selected.

3. Install PyTorch

Install the version of PyTorch that matches your installed CUDA Toolkit version.
For example, if you are using CUDA 11.8:

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118

Important: Make sure to select the PyTorch version compatible with the CUDA Toolkit version (nvcc -V), not the NVIDIA driver version.

You can find more details on the PyTorch installation page.

4. Install EvoGP

Finally, install EvoGP:

pip install git+https://github.com/EMI-Group/evogp.git --no-build-isolation

Note: This process might take a significant amount of time, as it includes the compilation of CUDA kernels.

5. Validate Installation

python -m evogp.sr_test

Basic API Usage

Start your journey with EvoGP in a few simple steps:

Import necessary modules:

import torch
from evogp.tree import Forest, GenerateDescriptor
from evogp.algorithm import (
    GeneticProgramming,
    DefaultSelection,
    DefaultMutation,
    DefaultCrossover,
)
from evogp.problem import SymbolicRegression
from evogp.pipeline import StandardPipeline

Define a problem (Here is Symbolic Regression with XOR-3d):

XOR_INPUTS = torch.tensor(
    [
        [0, 0, 0],
        [0, 0, 1],
        [0, 1, 0],
        [0, 1, 1],
        [1, 0, 0],
        [1, 0, 1],
        [1, 1, 0],
        [1, 1, 1],
    ],
    dtype=torch.float,
    device="cuda",
)

XOR_OUTPUTS = torch.tensor(
    [[0], [1], [1], [0], [1], [0], [0], [1]],
    dtype=torch.float,
    device="cuda",
)

problem = SymbolicRegression(datapoints=XOR_INPUTS, labels=XOR_OUTPUTS)

Configure the algorithm:

# create decriptor for generating new trees
descriptor = GenerateDescriptor(
    max_tree_len=32,
    input_len=problem.problem_dim,
    output_len=problem.solution_dim,
    using_funcs=["+", "-", "*", "/"],
    max_layer_cnt=4,
    const_samples=[-1, 0, 1],
)

# create the algorithm
algorithm = GeneticProgramming(
    initial_forest=Forest.random_generate(pop_size=5000, descriptor=descriptor),
    crossover=DefaultCrossover(),
    mutation=DefaultMutation(
        mutation_rate=0.2, descriptor=descriptor.update(max_layer_cnt=3)
    ),
    selection=DefaultSelection(survival_rate=0.3, elite_rate=0.01),
)

Run!:

pipeline = StandardPipeline(
    algorithm,
    problem,
    generation_limit=100,
)

best = pipeline.run()

Check the details for the best tree:

Predict results check:

pred_res = best.forward(XOR_INPUTS)
print(pred_res)

Obtain output like this:

tensor([[ 1.0000e-09],
        [ 1.0000e+00],
        [ 1.0000e+00],
        [-1.0000e-09],
        [ 1.0000e+00],
        [ 1.0000e-09],
        [ 1.0000e-09],
        [ 1.0000e+00]], device='cuda:0')

Mathmatics Formula (Sympy expression):

sympy_expression = best.to_sympy_expr()
print(sympy_expression)

Obtain output like this:

(-x2*(x0 + x1) + 1.0)*(1.0*x2*(-x2*(x0 + x1) + 1.0) + (x0 - x1)**2)

Visualize:

best.to_png("./imgs/xor_tree.png")

Obtain:

The complete code is available in code.

Advanced Genetic Operations

EvoGP includes multiple genetic operators, allowing users to freely assemble them to build customized TGP algorithms.

| Type | Name | |------------|---------------------------------------| | Selection | DefaultSelection | | Selection | RouletteSelection | | Selection | TruncationSelection | | Selection | RankSelection | | Selection | TournamentSelection | | Crossover | DefaultCrossover | | Crossover | DiversityCrossover | | Crossover | LeafBiasedCrossover | | Mutation | DefaultMutation | | Mutation | HoistMutation | | Mutation | SinglePointMutation | | Mutation | MultiPointMutation | | Mutation | InsertMutation | | Mutation | DeleteMutation | | Mutation | SingleConstMutation | | Mutation | MultiConstMutation | | Mutation | CombinedMutation |

Supported Benchmarks

Symbolic Regression

EvoGP supports symbolic regression tasks. You can construct a Problem with your custom dataset:

from evogp.problem import SymbolicRegression

problem = SymbolicRegression(datapoints=YOUR_DATA, labels=YOUR_LABELS)

Or use a predefined function to generate data:

def func(x):
    val = x[0] ** 4 / (x[0] ** 4 + 1) + x[1] ** 4 / (x[1] ** 4 + 1)
    return val.reshape(-1)

problem = SymbolicRegression(
    func=func, 
    num_inputs=2, 
    num_data=20000, 
    lower_bounds=-5, 
    upper_bounds=5
)

Classification

EvoGP supports classification tasks. You can construct a Problem with your custom dataset:

from evogp.problem import Classification
problem = Classification(datapoints=YOUR_DATA, labels=YOUR_LABELS)

Or use the provided datasets:

dataset_name = ["iris", "wine", "breast_cancer", "digits"]
problem = Classification(dataset="iris")

Transformation

EvoGP supports feature transformation tasks, allowing the generation of new features from raw data to improve model performance. You can create a Problem with your custom dataset:

from evogp.problem import Transformation

problem = Transformation(datapoints=YOUR_DATA, labels=YOUR

Related Skills

openhue

337.1k

Control Philips Hue lights and scenes via the OpenHue CLI.

sag

337.1k

ElevenLabs text-to-speech with mac-style say UX.

weather

337.1k

Get current weather and forecasts via wttr.in or Open-Meteo

tweakcc

1.4k

Customize Claude Code's system prompts, create custom toolsets, input pattern highlighters, themes/thinking verbs/spinners, customize input box & user message styling, support AGENTS.md, unlock private/unreleased features, and much more. Supports both native/npm installs on all platforms.

EMI-Group

View profile

View on GitHub

GitHub Stars259

CategoryCustomer

Updated18h ago

Forks43

EMI-Group/evogp

Languages

Python

Security Score

100/100

Audited on Mar 25, 2026

No findings

Evogp

Install / Use

README

Table of Contents

Introduction

Key Features

Installation

1. Install NVIDIA CUDA Toolkit

2. Install a C++ Compiler

3. Install PyTorch

4. Install EvoGP

5. Validate Installation

Basic API Usage

Advanced Genetic Operations

Supported Benchmarks

Symbolic Regression

Classification

Transformation

Related Skills