SciMLBenchmarks.jl
Scientific machine learning (SciML) benchmarks, AI for science, and (differential) equation solvers. Covers Julia, Python (PyTorch, Jax), MATLAB, R
Install / Use
/learn @SciML/SciMLBenchmarks.jlREADME
SciMLBenchmarks.jl: Benchmarks for Scientific Machine Learning (SciML) and Equation Solvers
SciMLBenchmarks.jl holds webpages, pdfs, and notebooks showing the benchmarks for the SciML Scientific Machine Learning Software ecosystem, including:
- Benchmarks of equation solver implementations
- Speed and robustness comparisons of methods for parameter estimation / inverse problems
- Training universal differential equations (and subsets like neural ODEs)
- Training of physics-informed neural networks (PINNs)
- Surrogate comparisons, including radial basis functions, neural operators (DeepONets, Fourier Neural Operators), and more
The SciML Bench suite is made to be a comprehensive open source benchmark from the ground up, covering the methods of computational science and scientific computing all the way to AI for science.
Rules: Optimal, Fair, and Reproducible
These benchmarks are meant to represent good optimized coding style. Benchmarks are preferred to be run on the provided open benchmarking hardware for full reproducibility (though in some cases, such as with language barriers, this can be difficult). Each benchmark is documented with the compute devices used along with package versions for necessary reproduction. These benchmarks attempt to measure in terms of work-precision efficiency, either timing with approximately matching error or building work-precision diagrams for direct comparison of speed at given error tolerances.
If any of the code from any of the languages can be improved, please open a pull request.
For critiques of benchmarks, please open a pull request that changes the code in the desired manner. Issues with recommended changes are generally vague and not actionable, while pull requests with code changes are exact. Thus if there is something you think should be changed in the code, please make the recommended change in the code!
Results
To view the results of the SciML Benchmarks, go to docs.sciml.ai/SciMLBenchmarksOutput. By default, this will lead to the latest tagged version of the benchmarks. To see the in-development version of the benchmarks, go to https://docs.sciml.ai/SciMLBenchmarksOutput/dev/.
Static outputs in pdf, markdown, and html reside in SciMLBenchmarksOutput.
Citing
To cite the SciML Benchmarks, please cite the following:
@article{rackauckas2019confederated,
title={Confederated modular differential equation APIs for accelerated algorithm development and benchmarking},
author={Rackauckas, Christopher and Nie, Qing},
journal={Advances in Engineering Software},
volume={132},
pages={1--6},
year={2019},
publisher={Elsevier}
}
@article{DifferentialEquations.jl-2017,
author = {Rackauckas, Christopher and Nie, Qing},
doi = {10.5334/jors.151},
journal = {The Journal of Open Research Software},
keywords = {Applied Mathematics},
note = {Exported from https://app.dimensions.ai on 2019/05/05},
number = {1},
pages = {},
title = {DifferentialEquations.jl – A Performant and Feature-Rich Ecosystem for Solving Differential Equations in Julia},
url = {https://app.dimensions.ai/details/publication/pub.1085583166 and http://openresearchsoftware.metajnl.com/articles/10.5334/jors.151/galley/245/download/},
volume = {5},
year = {2017}
}
Current Summary
The following is a quick summary of the benchmarks. These paint broad strokes over the set of tested equations and some specific examples may differ.
Non-Stiff ODEs
- OrdinaryDiffEq.jl's methods are the most efficient by a good amount
- The
Vernmethods tend to do the best in every benchmark of this category - At lower tolerances,
Tsit5does well consistently. - ARKODE and Hairer's
dopri5/dop853perform very similarly, but are both far less efficient than theVernmethods. - The multistep methods,
CVODE_Adamsandlsoda, tend to not do very well. - The ODEInterface multistep method
ddeabmdoes not do as well as the other multistep methods. - ODE.jl's methods are not able to consistently solve the problems.
- Fixed time step methods are less efficient than the adaptive methods.
Stiff ODEs
- In this category, the best methods are much more problem dependent.
- For smaller problems:
Rosenbrock23,lsoda, andTRBDF2tend to be the most efficient at high tolerances.Rodas4PandRodas5Ptend to be the most efficient at low tolerances.
- For larger problems (Filament PDE):
FBDFandQNDFdo the best at all normal tolerances.- The ESDIRK methods like
TRBDF2andKenCarp4can come close.
radauis always the most efficient when tolerances go to the low extreme (1e-13)- Fixed time step methods tend to diverge on every tested problem because the high stiffness results in divergence of the Newton solvers.
- ARKODE is very inconsistent and requires a lot of tweaking in order to not
diverge on many of the tested problems. When it doesn't diverge, the similar
algorithms in OrdinaryDiffEq.jl (
KenCarp4) are much more efficient in most cases. - GeometricIntegrators.jl fails to converge on any of the tested problems.
Dynamical ODEs
- Higher order (generally order >=6) symplectic integrators are much more efficient than the lower order counterparts.
- For high accuracy, using a symplectic integrator is not preferred. Their extra cost is not necessary since the other integrators are able to not drift simply due to having low enough error.
- In this class, the
DPRKNmethods are by far the most efficient. TheVernmethods do well for not being specific to the domain.
Non-Stiff SDEs
- For simple 1-dimensional SDEs at low accuracy, the
EMandRKMilmethods can do well. Beyond that, they are simply outclassed. - The
SRAandSRImethods both are very similar within-class on the simple SDEs. SRA3is the most efficient when applicable and the tolerances are low.- Generally, only low accuracy is necessary to get to sampling error of the mean.
- The adaptive method is very conservative with error estimates.
Stiff SDEs
- The high order adaptive methods (
SRIW1) generally do well on stiff problems. - The "standard" low-order implicit methods,
ImplicitEMandImplicitRK, do not do well on all stiff problems. Some exceptions apply to well-behaved problems like the Stochastic Heat Equation.
Non-Stiff DDEs
- The efficiency ranking tends to match the ODE Tests, but the cutoff from low to high tolerance is lower.
Tsit5does well in a large class of problems here.- The
Vernmethods do well in low tolerance cases.
Stiff DDEs
- The Rosenbrock methods, specifically
Rodas5P, perform well.
Parameter Estimation
- Broadly two different approaches have been used, Bayesian Inference and Optimisation algorithms.
- In general it seems that the optimisation algorithms perform more accurately but that can be attributed to the larger number of data points being used in the optimisation cases, Bayesian approach tends to be slower of the two and hence lesser data points are used, accuracy can increase if proper data is used.
- Within the different available optimisation algorithms, BBO from the BlackBoxOptim package and GN_CRS2_LM for the global case while LD_SLSQP,LN_BOBYQA and LN_NELDERMEAD for the local case from the NLopt package perform the best.
- Another algorithm being used is the QuadDIRECT algorithm, it gives very good results in the shorter problem case but doesn't do very well in the case of the longer problems.
- The choice of global versus local optimization make a huge difference in the timings. BBO tends to find the correct solution for a global optimization setup. For local optimization, most methods in NLopt, like :LN_BOBYQA, solve the problem very fast but require a good initial condition.
- The different backends options available for Bayesian method offer some tradeoffs between time, accuracy and control. It is observed that sufficiently high accuracy can be observed with any of the backends with the fine tuning of stepsize, constraints on the parameters, tightness of the priors and number of iterations being passed.
Interactive Notebooks
To generate the interactive notebooks, first install the SciMLBenchmarks, instantiate the
environment, and then run SciMLBenchmarks.open_notebooks(). This looks as follows:
]add SciMLBenchmarks#master
]add IJulia
]activate SciMLBenchmarks
]instantiate
using SciMLBenchmarks, IJulia
SciMLBenchmarks.open_notebooks()
The benchmarks will be generated at your pwd() in a folder called generated_notebooks.
Note that when running the benchmarks, the packages are not automatically added. Thus you will need to add the packages manually or use the internal Project/Manifest tomls to instantiate the correct packages. This can be done by activating the folder of the benchmarks. For example,
using Pkg
Pkg.activate(join
Related Skills
claude-opus-4-5-migration
81.4kMigrate prompts and code from Claude Sonnet 4.0, Sonnet 4.5, or Opus 4.1 to Opus 4.5
model-usage
330.7kUse CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.
TrendRadar
49.6k⭐AI-driven public opinion & trend monitor with multi-platform aggregation, RSS, and smart alerts.🎯 告别信息过载,你的 AI 舆情监控助手与热点筛选工具!聚合多平台热点 + RSS 订阅,支持关键词精准筛选。AI 智能筛选新闻 + AI 翻译 + AI 分析简报直推手机,也支持接入 MCP 架构,赋能 AI 自然语言对话分析、情感洞察与趋势预测等。支持 Docker ,数据本地/云端自持。集成微信/飞书/钉钉/Telegram/邮件/ntfy/bark/slack 等渠道智能推送。
mcp-for-beginners
15.5kThis open-source curriculum introduces the fundamentals of Model Context Protocol (MCP) through real-world, cross-language examples in .NET, Java, TypeScript, JavaScript, Rust and Python. Designed for developers, it focuses on practical techniques for building modular, scalable, and secure AI workflows from session setup to service orchestration.
