TPSR

[NeurIPS 2023] This is the official code for the paper "TPSR: Transformer-based Planning for Symbolic Regression"

Generate Convert Improve

Install / Use

/learn @deep-symbolic-mathematics/TPSR

About this skill

Quality Score

0/100

README

Deep Symbolic Regression with Transformers & Lookahead Planning

Official Implementation of Transformer-based Planning for Symbolic Regression (NeurIPS 2023).

Paper | SRBench Results | Code

Overview

In this paper, we introduce TPSR, a novel transformer-based planning framework for symbolic regression by leveraging priors of large-scale pretrained models and incorporating lookahead planning. TPSR incorporates Monte Carlo Tree Search (MCTS) into the transformer decoding process of symbolic regression models. Unlike conventional decoding strategies, TPSR enables the integration of non-differentiable feedback, such as fitting accuracy and complexity, as external sources of knowledge into the transformer-based equation generation process.

<img src="./images/Media13_Final.gif" width="100%" /> TPSR uncovering the governing symbolic mathematics of data, providing enhanced extrapolation capabilities.

Preperation: Data and Pre-trained Backbone Models

Download Pre-trained Models:
- End-to-End (E2E) SR Transformer model is available here.
- NeSymReS model is available here.

After downloading, extract both models to this directory. They should be located under the symbolicregression/weights/ and nesymres/weights/ sub-folders, respectively.

Download Benchmark Datasets:
- Feynman equations are here
- PMLB datasets are also here. Data points of PMLB datasets are used in the SRBench (A Living Benchmark for Symbolic Regression), containing three data groups: Feynman, Strogatz, and Black-box.

Extract the datasets to this directory, Feynman datasets should be in datasets/feynman/, and PMLB datasets should be in datasets/pmlb/.

Installation

To run the code with deafult E2E backbone model, create a conda environment and install the dependencies by running the following command.

conda create --name tpsr
conda activate tpsr
pip install -r requirements.txt

If you're interested to run experiments with NeSymReS backbone, install its additional dependencies from here. You can follow these steps:

conda create --name tpsr
conda activate tpsr
cd nesymres
pip install -e src/
pip install -r requirements.txt
pip install lightning==1.9

Run

We have created run.sh script to execute Transformer-based Planning for Automated Equation Discovery based on the reward defined in reward.py with the combination of equation's fitting accuracy and complexity. To run the script for different datasets, configure the following parameters:

| Parameters | |:-----------------:|:------- | backbone_model | eval_in_domain | eval_mcts_in_domain | eval_on_pmlb | eval_mcts_on_pmlb | horizon | rollout | num_beams | width | no_seq_cache | no_prefix_cache | ucb_constant | uct_alg | max_input_points | max_number_bags | pmlb_data_type | target_noise | beam_type | beam_size | n_trees_to_refine | prediction_sigmas | eval_input_length_modulo Description | Example Values | -------------------------------------------------------------------------------------------------:|:------------------------------:| | Backbone Pre-trained Model Type (e2e/nesymres) | e2e | | Evaluate backbone pre-trained model on In-Domain dataset (Yes/No) | True/False | | Evaluate TPSR on In-Domain dataset (Yes/No) | True/False | | Evaluate backbone pre-trained model on PMLB (Yes/No) | True/False | | Evaluate TPSR on PMLB (Yes/No) | True/False | | Horizon of lookahead planning (maxlen of equation tokens) | 200 | | Number of rollouts ($r$) in TPSR | 3 | | Beam size ($b$) in TPSR's evaluation step to simulate completed equations | 1 | | Top-k ($k$) in TPSR's expansion step to expand tree width | 3 | | Use sequence caching (Yes/No) | False | | Use top-k caching (Yes/No) | False | | Exploration weight in UCB | 1.0 | | UCT algorithm $\in$ {uct, p_uct, var_p_uct} | uct | | Maximum input points observed by pre-trained model ($N$) | 200 | | Maximum number of bags for input points ($B$) | 10 | | PMLB data group $\in$ {feynman, strogatz, black-box} | feynman | | Target noise added to y_to_fit in PMLB | 0.0 | | Decoding type for pre-trained models $\in$ {search, sampling} | sampling | | Decoding size ($s$) for pre-trained models (beam size, or sampling size) | 10 | | Number of refinements in decodings $\in$ {1,..., $s$ } | 10 | | Sigmas of extrapolation eval data sampling (In-domain) | 1,2,4,8,16 | | Number of eval points (In-domain). Set to 50 yields $N_{test}=[50,100,150,200]$ per extrapolation range. | 50 |

Run - PMLB Datasets (Feynman/ Strogatz/ Blackbox)

Pre-trained E2E Model (Sampling / Beam Search):

python run.py --eval_on_pmlb True \
                   --pmlb_data_type feynman \
                   --target_noise 0.0 \
                   --beam_type sampling \ # or search
                   --beam_size 10 \
                   --n_trees_to_refine 10 \
                   --max_input_points 200 \
                   --max_number_bags 10 \
                   --save_results True

Transformer-based Planning with E2E Backbone:

python run.py --eval_mcts_on_pmlb True \
                   --pmlb_data_type feynman \
                   --target_noise 0.0 \
                   --lam 0.1 \
                   --horizon 200 \
                   --width 3 \
                   --num_beams 1 \
                   --rollout 3 \
                   --no_seq_cache False \
                   --no_prefix_cache True \
                   --max_input_points 200 \
                   --max_number_bags 10 \
                   --save_results True

For running the code on Strogatz or Black-box datasets, simply adjust the pmlb_data_type parameter to either strogatz or blackbox. The commands provided above are set for the Feynman datasets. You can also modify the target_noise and other parameters to suit your experiments. Running each command saves the results for all datasets and metrics in a .csv file.

Run - In-Domain Datasets

In-Domain datasets are generated, following the validation data gneration protocol suggested in E2E. For details, refer to the generate_datapoints function here. You can also modify data generation parameters here. For example, you can adjust parameters like prediction_sigmas to control extrapolation. A sigma 1 aligns with the training data range, while >1 is for extrapolation ranges. The In-domain validation datasets are generated on-the-fly. For consistent evaluations across models, consider setting a fixed seed.

Pre-trained E2E Model (Sampling / Beam Search):

python run.py --eval_in_domain True \
                   --beam_type sampling \ # or search
                   --beam_size 10 \
                   --n_trees_to_refine 10 \
                   --max_input_points 200 \
                   --eval_input_length_modulo 50 \
                   --prediction_sigmas 1,2,4,8,16 \
                   --save_results True

Transformer-based Planning with E2E Backbone:

python run.py --eval_mcts_in_domain True \
                   --lam 0.1 \
                   --horizon 200 \
                   --width 3 \
                   --num_beams 1 \
                   --rollout 3 \
                   --no_seq_cache False \
                   --no_prefix_cache True \
                   --max_input_points 200 \
                   --eval_input_length_modulo 50 \
                   --prediction_sigmas 1,2,4,8,16 \
                   --save_results True \
                   --debug

Demo

We have also included a small demo that runs TPSR with both E2E and NesymReS backbones on your dataset. You can play with it here

E2E+TPSR:

python tpsr_demo.py --backbone_model e2e --no_seq_cache True --no_prefix_cache True

NeSymReS+TPSR:

python tpsr_demo.py --backbone_model nesymres --no_seq_cache True --no_prefix_cache True

Final Results on SRBench

Our experimental results of E2E+TPSR on SRBench datasets are provided in the srbench_results/ directory.

Related Skills

YC-Killer

2.7k

A library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.

best-practices-researcher

The most comprehensive Claude Code skills registry | Web Search: https://skills-registry-web.vercel.app

groundhog

398

Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).

isf-agent

a repo for an agent that helps researchers apply for isf funding

deep-symbolic-mathematics

View profile

View on GitHub

GitHub Stars80

CategoryEducation

Updated9d ago

Forks17

deep-symbolic-mathematics/TPSR

Languages

Python

Security Score

100/100

Audited on Mar 21, 2026

No findings