NeuralSAT: A DPLL(T) Framework for Verifying Deep Neural Networks

NeuralSAT is a high-performance verification tool for deep neural networks (DNNs). It integrates the DPLL(T) approach commonly used in SMT solving with a theory solver specialized for DNN reasoning. NeuralSAT exploits multicores and GPU for efficiency and can scale to networks with millions of parameters. It supports a wide range of neural networks and activation functions.

NEWS

NeuralSAT is ranked 2nd overall at VNN-COMP'25
NeuralSAT is ranked 2nd overall at VNN-COMP'24
- Initially VNN-COMP’s script incorrectly parsed NeuralSAT’s (and another tool’s) results, and ranked it last. After fixing the issue, NeuralSAT’s results were correctly parsed and NeuralSAT was officially placed 2nd, behind ABCrown and above PyRAT. The updated VNN-COMP’24 report mentions the issue, presents the corrected rankings (see Table B.1), and includes detailed results and graphs in Appendix B.</li>
NeuralSAT is given the New Participation Award at VNN-COMP'23
NeuralSAT is ranked 4th in VNN-COMP'23. This was our first participation and we look forward to next time.
- Note: The current version of NeuralSAT adds significant improvements and fixed the implementation bugs we had during VNN-COMP'23 that produce unsound results (hence 4th place ranking).

PUBLICATIONS

CVPR'26 research paper on verifying AI-based computer vision systems.
FSE’26 paper on formalizing and verifying structural robustness properties of AI systems.
NeurIPS’25 paper on generating and checking DNN verification proofs.
NeurIPS’25 paper on compositional DNN verification (Spotlight).
SAIV'25 competition paper on NeuralSAT.
CAV’25 paper on NeuralSAT verification tool.
FSE'24 paper on new optimizations developed for the NeuralSAT.
Arxiv'24 technical report on NeuralSAT design and methodology.

INSTALLATION & USAGE

see INSTALL.md

FEATURES

fully automatic, ease of use and requires no tuning (i.e., no expert knowledge required)
- NeuralSAT requires no parameter tuning (a huge engineering effort that researchers often don't pay attention to)! In fact, you can just apply NeuralSAT as is to check your networks and desired properties. The user does not have to do any configuration or tweaking. It just works!
  - But if you're an expert (or want to break the tool), you are welcome to tweak its internal settings.
- This is what makes NeuralSAT different from other DNN verifiers (e.g., AB-Crown), which require lots of tuning to work well.
standard input and output formats
- input: onnx for neural networks and vnnlib for specifications
- output: unsat for proved property, sat for disproved property (accompanied with a counterexample), and unknown or timeout for property that cannot be proved.
versatile: support multiple types of neural types of networks and activation functions
- layers (can be mixture of different types): fully connected (fc), convolutional (cnn), residual networks (resnet), batch normalization (bn)
- activation functions: ReLU, sigmoid, tanh, power
well-tested
- NeuralSAT has been tested on a wide-range of benchmarks (e.g., ACAS XU, MNIST, CIFAR).
fast and among the most scalable verification tools currently
- NeuralSAT exploits and uses multhreads (i.e., multicore processing/CPUS) and GPUs available on your system to improve its performance.
active development and frequent updates
- If NeuralSAT does not support your problem, feel free to contact us (e.g., by openning a new Github issue). We will do our best to help.
- We will release new, stable versions about 3-4 times a year

<details> <summary><kbd>details</kbd></summary>

sound and complete algorithm: will give both correct unsat and sat results
combine ideas from conflict-clause learning (CDCL), abstractions (e.g., polytopes), LP solving
employ multiple adversarial attack techniques for fast counterexamples (i.e., sat) discovery

</details>

OVERVIEW

NeuralSAT takes as input the formula $\alpha$ representing the DNN N (with non-linear ReLU activation) and the formulae $\phi_{in}\Rightarrow \phi_{out}$ representing the property $\phi$ to be proved. Internally, it checks the satisfiability of the formula: $\alpha \land \phi_{in} \land \overline{\phi_{out}}$. NeuralSAT returns UNSAT if the formula is unsatisfiable, indicating N satisfies $\phi$, and SAT if the formula is satisfiable, indicating the N does not satisfy $\phi$.

NeuralSAT uses a DPLL(T)-based algorithm to check unsatisfiability. It applies DPLL/CDCL to assign values to boolean variables and checks for conflicts the assignment has with the real-valued constraints of the DNN and the property of interest. If conflicts arise, NeuralSAT determines the assignment decisions causing the conflicts and learns clauses to avoid those decisions in the future. NeuralSAT repeats these decisions and checking steps until it finds a full assignment for all boolean variables, in which it returns SAT, or until it no longer can decide, in which it returns UNSAT.

<p align="center"> <img src="./doc/figure/overview.png" width='50%'/> </p> <!-- ## ALGORITHM *NeuralSAT* constructs a propositional formula representing neuron activation status (`Boolean Abstraction`) and searches for satisfying truth assignments while employing a DNN-specific theory solver to check feasibility with respect to DNN constraints and properties. The process integrates standard DPLL components, which include deciding (`Decide`) variable assignments, and performing Boolean constraint propagation (`BCP`), with DNN-specific theory solving (`Deduce`), which uses LP solving and the polytope abstraction to check the satisfiability of assignments with the property of interest. If satisfiability is confirmed, it continues with new assignments; otherwise, it analyzes and learns conflict clauses (`Analyze Conflict`) to backtrack. *NeuralSAT* continues it search until it either proves the property (`UNSAT`) or finds a total assignment (`SAT`). <p align="center"> <img src="./doc/figure/algorithm.png" /> </p> ### Boolean Representation `Boolean Abstraction` encodes the DNN verification problem into a Boolean constraint to be solved. This step creates Boolean variables to represent the activation status of hidden neurons in the DNN. *NeuralSAT* also forms a set of initial clauses ensuring that each status variable is either `T` (active) or `F` (inactive). ### DPLL search *NeuralSAT* iteratively searches for an assignment satisfying the clauses. Throughout it maintains several state variables including: clauses, a set of clauses consisting of the initial activation clauses and learned conflict clauses; $\alpha$, a truth assignment mapping status variables to truth values which encodes a partial activation pattern; and $igraph$, an implication graph used for analyzing conflicts. ### Decide `Decide` chooses an unassigned variable and assigns it a random truth value. Assignments from `Decide` are essentially guesses that can be wrong which degrades performance. The purpose of `BCP`, `Deduce`, and `Stabilize` – which are discussed below – is to eliminate unassigned variables so that Decide has fewer choices. ### BooleanConstraintPropagation (BCP) `BCP` detects unit clauses from constraints representing the current assignment and clauses and infers values for variables in these clauses. For example, after the decision $a\mapsto F$, `BCP` determines that the clause $a \vee b$ becomes unit, and infers that $b \mapsto T$. Internally, *NeuralSAT* uses an implication graph to represent the current assignment and the reason for each BCP implication. ### AnalyzeConflict `AnalyzeConflict` processes an implication graph with a conflict to learn a new clause that explains the conflict. The algorithm traverses the implication graph backward, starting from the conflicting node, while constructing a new clause through a series of resolution steps. `AnalyzeConflict` aims to obtain an asserting clause, which is a clause that will result a BCP implication. These are added to clauses so that they can block further searches from encountering an instance of the conflict. ### Theory Solver (T-solver) `T-solver` consists of two parts: `Stabilize` and `Deduce`: - `Deduce` checks the feasibility of the DNN constraints represented by the current propositional variable assignment. This component is shared with *NeuralSAT* and it leverages specific information from the DNN problem, including input and output properties, for efficient feasibility checking. Specifically, it obtains neuron bounds using the polytope abstraction and performs infeasibility checking to detect conflicts. - `Stabilize` has a similar effect as `BCP` – reducing mistaken assignments by `Decide` – but it operates at the theory level not the propositional Boolean level. The key idea in using neuron stability is that if we can determine that a neuron is stable, we can assign the exact truth value for the corresponding Boolean variable instead of having to guess. Stabilization involves the solution of a mixed integer linear program (MILP) system. First, a MILP problem is created from the current assignment, the DNN,

Neuralsat

Install / Use

README