0% found this document useful (0 votes)

28 views9 pages

Provably-Safe Neural Network Training Using Hybrid Zonotope Reachability Analysis

This paper presents a method for training neural networks that ensures safety by enforcing constraints on their outputs, particularly in non-convex input and unsafe regions. The proposed approach utilizes hybrid zonotope reachability analysis to derive a differentiable loss function that encourages the network to avoid unsafe areas, while maintaining computational efficiency. The method significantly improves upon previous techniques by allowing for the exact representation of non-convex sets and demonstrating scalability with respect to various parameters.

Uploaded by

maryht1706

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views9 pages

Provably-Safe Neural Network Training Using Hybrid Zonotope Reachability Analysis

Uploaded by

maryht1706

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Provably-Safe Neural Network Training Using

Hybrid Zonotope Reachability Analysis

Long Kiu Chung and Shreyas Kousik

Abstract— Even though neural networks are being increas- robust to adversarial attacks, and more. An overview of our
ingly deployed in safety-critical applications, it remains difficult method is shown in Fig. 1.
to enforce constraints on their output, meaning that it is hard to
guarantee safety in such settings. Towards addressing this, many
existing methods seek to verify a neural network’s satisfaction A. Related Work
of safety constraints, but do not address how to correct an
arXiv:2501.13023v1 [cs.LG] 22 Jan 2025

“unsafe” network. On the other hand, the few works that We now review three key approaches to enforce constraints
extract a training signal from verification cannot handle non- on neural network: sampling-based approaches that do not
convex sets, and are either conservative or slow. To address have formal guarantees, verification approaches that only
these challenges, this work proposes a neural network training check constraint satisfaction, and approaches that combine
method that can encourage the exact reachable set of a non- verification with training, which our method belongs to.
convex input set through a neural network with rectified linear
unit (ReLU) nonlinearities to avoid a non-convex unsafe region, Finally, we review relevant literature on hybrid zonotopes,
using recent results in non-convex set representation with which is the set representation used in our method.
hybrid zonotopes and extracting gradient information from 1) Training with Soft Constraints: Many existing work
mixed-integer linear programs (MILPs). The proposed method capture safety in neural networks by penalizing constraint
is fast, with the computational complexity of each training
violations on sampled points during training [9]–[13]. How-
iteration comparable to that of solving a linear program (LP)
with number of dimensions and constraints linear to the number ever, these soft approaches, while often fast and easy to
of neurons and complexity of input and unsafe sets. For a neural implement, do not provide any safety guarantees beyond the
network with three hidden layers of width 30, the method was training samples. While there are works that are capable of
able to drive the reachable set of a non-convex input set with enforcing hard constraints in neural networks by modifying
55 generators and 26 constraints out of a non-convex unsafe
the training process [14], [15], they can only handle simple
region with 21 generators and 11 constraints in 490 seconds.
affine constraints.
I. I NTRODUCTION 2) Neural Network Verification: A different approach is to
certify safety with respect to a set of inputs. Methods in this
Neural networks are universal approximators [1] that have category tend to analyze the reachable set (i.e. image) of the
seen success in many domains. However, they are also input set through the neural network, either exactly [16]–[20]
well-known as “black-box” models, where the relationship or as an over-approximation [16]–[18], [21]–[23] depending
between their inputs and outputs is not easily interpretable or on the choice of set representation. That said, most of these
directly analyzable due to non-linearity and high-dimensional works only focus on neural network verification. That is,
parameterizations. As such, it is very difficult to certify their these methods only answer the yes-no question of “safe” or
safety (e.g. satisfaction of constraints). This limitation im- “unsafe”, with the aftermath of fixing an “unsafe” network
poses many significant drawbacks. For example, robots crash left largely unexplored. As a result, engineers can only train
frequently when training their neural network controllers via trial-and-error until the desired safety properties have
with deep reinforcement learning (RL) algorithms, limiting been achieved, which can be slow and ineffective.
deep RL’s success in robots where hardware failures are 3) Training with Verification: To the best of our knowl-
costly, simulations are not readily available, or the sim-to- edge, there are only two works that attempted to extract
real gap is too large for reliable performance [2]. In addition, learning signals from the safety verification results using
neural networks can also be susceptible to adversarial attacks, reachability analysis.
where minor perturbations in the input can lead to drastically First, in [24], given an input set as an H-polytope (i.e.
different results in the output [3], [4]. This makes deploying polytope represented by intersection of halfplanes) and a
neural networks in safety-critical tasks a questionable choice, neural network controller embedded in a nonlinear dynamical
even though it has already been widely done [5]–[7], leading system, the polytope is expressed as the projection of a high-
to many injuries and accidents [8]. In this paper, we present a dimensional hyperrectangle, enabling the use of the CROWN
method to enforce safety in neural networks by encouraging verifier [21] for interval reachability. Then, using a loss
their satisfaction of a collision-free constraint, which has po- function that encourages the vector field of the reachable set
tential application in making deep RL safe, neural networks to point inwards, the authors were able to train the neural net-
work until the system is forward invariant. With this method,
All authors are with the Department of Mechanical Engineering,
Georgia Institute of Technology, Atlanta, GA. Corresponding author: the input set is limited to being a convex polytope. Moreover,
[email protected]. since [21] and the interval reachability techniques used are

1
Hybrid Zonotope Input Set Hybrid Zonotope Reachable Set Safe Hybrid Zonotope Reachable Set
2
1
Neural 2

Network 1.5 1.5

0.5
1 1

0 0.5 MILP 0.5

Safe
Verification
0 0
-0.5
-0.5 -0.5

-1 -1 Unsafe -1
-1 -0.5 0 0.5 1 -1 0 1 2 -1 0 1 2

Loss from
Update
Relaxed LP

Fig. 1. A flowchart of our method, using the example from Sec. VI. Our method takes in a non-convex input set (green), then computes its exact
reachable set (blue) through the neural network. Then, we formulate the reachable set’s collision with the unsafe set (red) as a loss function using a linear
program (LP), which enables us to update the neural network’s parameters via backpropagation. Every several iterations, we check if the reachable set
collides with the unsafe set using a mixed-integer linear program (MILP). If it does not, then the training is complete and our method is successful.

over-approximations, its space of discoverable solutions may ReLU neural networks training based on exact reacha-
be limited. bility analysis with hybrid zonotope. This loss function
Second, given an input set and an unsafe region expressed encourages the reachable set of the input set to avoid the
as constrained zonotopes (a convex polytopic representation unsafe region, the satisfaction of which can be checked
[25]), our prior work [26] computed the exact reachable set using a mixed-integer linear program (MILP).
of a neural network as a union of constrained zonotopes. 2) We show that this method is fast and scales fairly well
Then, by using a loss function to quantify the “emptiness” with respect to input dimensions, output dimensions,
of the intersection between the reachable set and the unsafe network size, complexity of the input set, and com-
region, we were able to train the neural network such that plexity of the unsafe region. The results significantly
the reachable set no longer collides with the unsafe region. outperform our prior method for exact reachability
Similarly, the input set and the obstacle in this method is analysis in training [26].
limited to being a convex polytope. Moreover, the number The remainder of the paper is organized as follows:
of sets needed to represent the reachable set grows exponen- we provide preliminary information in Sec. II, formalize
tially with the size of the neural network, making the method our problem statement in Sec. III, detail our proposed
numerically intractable even for very small neural networks. method in Sec. IV, provide experimental analysis in Sec. V,
4) Hybrid Zonotopes: Recently, a non-convex polytopic demonstrate the utility of our method in Sec. VI, then give
set representation called the hybrid zonotope [27] was pro- concluding remarks and limitations in Sec. VII.
posed. Hybrid zonotopes are closed under affine mapping,
Minkowski sum, generalized intersection, intersection [27], II. P RELIMINARIES
union, and complement [28], with extensive toolbox support We now introduce our notation conventions, define hy-
in MATLAB [29] and Python [30]. They can also exactly brid zonotopes and ReLU neural networks, and summarize
represent the forward reachable set (image) [19] and back- existing work [19], [31] on representing image of a hybrid
ward reachable set (preimage) [31] of a neural network with zonotope through a ReLU neural network exactly as a hybrid
rectified linear units (ReLU) using basic matrix operations, zonotope.
with complexity scaling only linearly with the size of the
network. However, existing methods for hybrid zonotopes A. Notation
enforce safety on robots either by formulating a model In this paper, we denote the set of real numbers as R, non-
predictive control (MPC) [28] or a nonlinear optimization negative real numbers as R+ , natural numbers as N, scalars in
problem [32] without neural networks in the loop, whereas lowercase italic, sets in uppercase italic, vectors in lowercase
those with neural networks only use hybrid zonotope for bold, and matrices in uppercase bold. We also denote a
verification but not training [19], [31], [33], [34]. In this matrix of zeros as 0, a matrix of ones as 1, and an identity
paper, our contribution is extracting and using learning matrix as I, with their dimensions either implicitly defined
signals from neural network reachability analysis with hybrid from context or explicitly using subscripts, e.g. 0n1 ×n2 ⊂
zonotopes. Rn1 ×n2 , In ⊂ Rn×n . An empty array is [ ]. Finally, inequalities
≤, ≥ between vectors are compared element-wise.
B. Contributions
Our contributions are twofold: B. Hybrid Zonotope
1) Given a non-convex input set and a non-convex unsafe A hybrid zonotope HZ(Gc , Gb , c, Ac , Ab , b) ⊂ Rn is a set
region, we propose a differentiable loss function for parameterized by a continuous generator matrix Gc ∈ Rn×ng ,

2
a binary generator matrix Gb ∈ Rn×nb , a center c ∈ Rn , a hybrid zonotope [19], [31]:
continuous linear constraint matrix Ac ∈ Rnc ×ng , a binary
{max (Wi xi−1 + wi , 0) | xi−1 ∈ Pi−1 }
linear constraint matrix Ab ∈ Rnc ×nb , and a constraint vec-
(5)
tor b ∈ Rnb on continous coefficients zc ∈ Rng and binary

= 0 Ini Hni ∩h i (Wi Pi−1 + wi ) ,
coefficients zb ∈ {−1, 1}nb as follows [27, Definition 3]: I 0
where Hni ⊂ R2ni is the graph of an ni -dimensional ReLU
HZ(Gc , Gb , c, Ac , Ab , b)
activation function over a hypercube domain {x | −a1 ≤ x ≤
={Gc zc + Gb zb + c | Ac zc + Ab zb = b, ∥zc ∥∞ ≤ 1, (1) a1} for some a > 0, which can be represented exactly by a
nb
zb ∈ {−1, 1} }. hybrid zonotope as in [31]:

We denote ng as the number of continuous generators, nb as x
Hni = | −a1 ≤ x ≤ a1 ,
the number of binary generators, and nc as the number of max(x, 0)
a a
a
constraints in a hybrid zonotope. I⊗ − 2 −a 2 0 0 , − 2 I , a 1, (6)
= HZ
Consider a pair of hybrid zonotopes P1 = I⊗ 0 −2 0 0 0 2
HZ(Gc1 , Gb1 , c1 , Ac1 , Ab1 , b1 ) ⊂ Rn1 and P2 =

1
HZ(Gc2 , Gb2 , c2 , Ac2 , Ab2 , b2 ) ⊂ Rn2 . In this paper, we I ⊗ I2 I , I ⊗ ,1 ,
−1
make use of their closed form expressions in generalized
intersection under some R ⊂ Rn2 ×n1 , denoted as ∩R [27, where ⊗ is the Kronecker product. Note that (5) holds as long
Proposition 7]: as a is large enough [19]. As such, the reachable set Pd ⊂ Rnd
of a hybrid zonotope Z = P0 ⊂ Rn0 through a ReLU neural
P1 ∩R P2 ={x ∈ P1 | Rx ∈ P2 }, network can be obtained by applying (5) d − 1 times, before

Ac1 0
 applying an affine transformation parameterized by Wd and

=HZ Gc1

0 , Gb1

0 , c1 ,  0 Ac2  , wd . This way, if Z has ng,Z continuous generators, nb,Z binary
RGc1 −Gc2 generators, and nc,Z constraints, then Pd will have ng,Z + n0 +
   ! 4nn continuous generators, nb,Z + nn binary generators, and
Ab1 0 b1
 0 nc,Z + n0 + 3nn constraints [19], where nn := n1 + · · · + nd−1
Ab2  ,  b2  .
denotes the number of neurons.
RGb1 −Gb2 c2 − Rc1
(2) III. P ROBLEM S TATEMENT
Our goal in this paper is to design a ReLU neural network
Note that their “regular” intersection {x ∈ P1 | x ∈ P2 }, which
training method such that the reachable set of a given input
we denote as P1 ∩ P2 , is a particular case of the generalized
set through the network avoids some unsafe regions. As per
intersection with R = I.
most other training methods, we assume that the structure
Finally, a hybrid zonotope P = HZ(Gc , Gb , c, Ac , Ab , b) ⊂
(i.e. depth and widths) of the ReLU neural network is fixed
Rn1 is also closed under affine transformation with any
as a user choice, and we focus only on updating its weights
matrix W ⊂ Rn2 ×n1 and vector w ⊂ Rn2 as [27, Proposition
and biases (a.k.a. trainable parameters). Mathematically, we
7]:
want to tackle the following problem:
WP + w = {Wx + w | x ∈ P}, Problem 1 (Training the Reachable Set of a Neural Net-
(3)
= HZ (WGc , WGb , Wc + w, Ac , Ab , b) . work to Avoid Unsafe Regions). Given an input set Z =
HZ(Gc,Z , Gb,Z , cZ , Ac,Z , Ab,Z , bZ ) ⊂ Rn0 with ng,Z continuous
C. ReLU Neural Network generators, nb,Z binary generators, and nc,Z constraints,
In this work, we consider a fully-connected, ReLU acti- an unsafe region U = HZ(Gc,U , Gb,U , cU , Ac,U , Ab,U , bU ) ⊂
vated feedforward neural network ξ : Rn0 → Rnd , with output Rnd with ng,U continuous generators, nb,U binary genera-
xd = ξ (x0 ) ∈ Rnd given an input x0 ∈ Rn0 . We denote by tors, and nc,U constraints, and a ReLU neural network ξ
d ∈ N the depth of the network and by ni the width of the with fixed depth d and widths n0 , · · · , nd , we want to find
ith layer. Mathematically, W1 , · · · , Wd , w1 , · · · , wd such that
Q := {ξ (x) | x ∈ Z} ∩U = 0.
/ (7)
xi = max (Wi xi−1 + wi , 0) , (4a)
xd = Wd xd−1 + wd , (4b) Of course, a trivial solution would be to set Wd = 0 and
wd ∈/ U, but this kind of solution is not useful. Instead, we
where Wi ∈ Rni ×ni−1 , wi ∈ Rni , i = 1, · · · , d − 1, Wd ∈ aim to design a differentiable loss function such that (7) can
Rnd ×nd−1 , wd ∈ Rnd , and max is taken elementwise. We be achieved by following a gradient and updating the train-
denote W1 , · · · , Wd as weights and w1 , · · · , wd as biases able parameters via backpropagation [35]. Doing so allows
of the network. The function max(·, 0) is known as an ni - our method to integrate with other loss functions to achieve
dimensional ReLU activation function for 0 ⊂ Rni . additional objectives, as well as makes the training applicable
Consider a hybrid zonotope Pi−1 ⊂ Rni−1 . By applying the to ReLU networks with other structural constraints, such as
operations in (2) and (3), its image through (4a) is exactly a when they are embedded in a dynamical system [5]–[7].

3
IV. M ETHODS B. Loss Function to Encourage Emptiness
We now construct a loss function which, when minimized,
In this section, we first formulate a MILP to check whether
makes Q empty. Naı̈vely, since Q = 0/ iff r∗ > 1, where r∗ is
a hybrid zonotope is empty. Then, we explain how to obtain
the optimal value of (9) with P = Q, we can construct the
useful gradient information from this MILP to train the ReLU
loss function ℓ ∈ R as:
network such that the reachable set is out of the unsafe
region. ℓ = 1 − r∗ , (10)
such that when ℓ is decreased to a negative value, we must
A. Hybrid Zonotope Emptiness Check have Q = 0. / To minimize ℓ using backpropagation, from
∗ ∗ ∗ ∂A
Before constructing a loss function for training, we first chain rule, we must compute ∂∂rℓ∗ , ∂ ∂Ar , ∂ ∂Ar , ∂∂br , ∂ Wc,Q ,
c,Q b,Q Q 1
need a way to check whether (7) is true. From (2) and (5), the ∂b ∂A ∂b
· · · , ∂ WQ , and ∂ wc,Q , · · · , ∂ wQ . Since expressing Ac,Q , Ab,Q ,
d 1 d
left-hand side of (7), Q, can be straightforwardly computed and bQ in terms of W1 , · · · , Wd , and w1 , · · · , wd involves
as a hybrid zonotope HZ(Gc,Q , Gb,Q , cQ , Ac,Q , Ab,Q , bQ ) ⊂ only basic matrix operations à la (2) and (5), ∂∂rℓ∗ , ∂ Wc,Q ,
∂A
Rnd with ng,Q = ng,Z + n0 + 4nn + ng,U continuous generators, ∂b ∂A ∂b
1

nb,Q = nb,Z + nn + nb,U binary generators, and nc,Q = nc,Z + · · · , ∂ WQ , and ∂ wc,Q , · · · , ∂ wQ can be straightforwardly ob-
d 1 d
n0 + 3nn + nd constraints. Then, the image of the input set tained from automatic differentiation [37]. However, obtain-
∗ ∗ ∗
is not in collision with the unsafe region iff Q is empty. To ing ∂ ∂Ar , ∂ ∂Ar , and ∂∂br involves differentiation through an
c,Q b,Q Q
check whether a hybrid zonotope is empty, existing methods MILP. Since the optima of an MILP can remain unchanged
formulate a feasibility MILP with ng,Q continuous variables under small differences in its parameters, its gradient can
and nb,Q binary variables [27]: be 0 or non-existent, which are uninformative [38]. Instead,
consider the following convex relaxation of (9):
find zc , zb , min r̃ − µ(1ln(z̃c1 ) + 1ln(z̃c2 ) + 1ln(z̃b ) + ln(r̃) + 1ln(s)),
s.t. Ac,Q zc + Ab,Q zb = bQ , s.t. Ac (z̃c1 − z̃c2 ) + Ab (2z̃b − 1) = b,
(8)
∥zc ∥∞ ≤ 1, 
z̃c1 − z̃c2 − r̃1
 
0nc ×1

zb ∈ {−1, 1}nb , z̃c2 − z̃c1 − r̃1 + s = 0nc ×1  ,
z̃b 1
which is infeasible iff Q = 0.
/ Note that (8) is NP-complete (11)
[36]. However, not only is it not always feasible, it is also
unclear how to derive a loss function from the optimizers to where r̃ ∈ R, z̃c1 ∈ Rng , z̃c2 ∈ Rng , z̃b ∈ Rnb , s ∈ Rng +ng +nb ,
drive Q to be empty. Instead, consider the following MILP µ ∈ R+ is the cut-off multiplier from the solver [39], and
with one more continuous variable than (8): ln(·) is applied elementwise. (11) is the standard linear
program (LP) form of (9) with log-barrier regularization and
Proposition 2 (Hybrid Zonotope Emptiness Check). Given without the integrality constraints, and can be obtained by
a hybrid zonotope P = HZ(Gc , Gb , c, Ac , Ab , b) ⊂ Rn , where replacing r with r̃, zc with z̃c1 − z̃c2 , and zb with 2z̃b −1 (such
Ac ∈ Rnc ×ng and Ab ∈ Rnc ×nb . Consider the following MILP: that all constraints are non-negative), and introducing slack
variable s (such that inequality constraints become equality
min r, constraints) [40].
s.t. Ac zc + Ab zb = b, The optimization problem (11) can be solved quickly
(9)
∥zc ∥∞ ≤ r, using solvers such as IntOpt [39]. Moreover, if r̃∗ is the
∗ ∗ ∗
optimal value of (11), ∂∂Ar̃ c , ∂∂Ar̃ , and ∂∂r̃b can be obtained by
zb ∈ {−1, 1}nb , b
differentiating the Karush-Kuhn-Tucker (KKT) conditions of
where r ∈ R. Then, if r∗ is the optimal value of (9), then (11), which we refer the readers to [38, Appendix B] for
P = 0/ iff r∗ > 1. the mathematical details. Not only are these gradients well-
defined, easily computable, and informative, but also, they
Proof. This follows from the definition of hybrid zonotope have been shown to outperform other forms of convex re-
in (1). laxation in computation speed and minimizing loss functions
derived from MILPs [38, Appendix E].
By construction, (9) is feasible as long as ∃ zc ∈ Rng , zb ∈ Therefore, instead of the loss function ℓ, we propose to
{−1, 1}nb such that Ac zc + Ab zb = b. If this condition is not backpropagate with respect to a surrogate loss function ℓ̃ ∈ R:
met for Q, then we have Q = 0/ anyway and no training
ℓ̃ = 1 − r̃∗ , (12)
is needed. Importantly, it has been shown in [26] that
the minimum upper bound of the norm of the continuous where r̃∗ is the optimal value of (11) with Ac = Ac,Q and
coefficients is useful for gauging the extent of collision Ab = Ab,Q .
between two constrained zonotopes, which are subsets of Unfortunately, since ℓ̃ does not necessarily equal ℓ, we
a hybrid zonotope. As such, (9) gives a good foundation for cannot use (12) to simultaneously verify and train the neural
constructing a loss function for encouraging Q to be empty. network. In practice, we solve (8) in between some iterations

4
of training with (12) to check whether (7) has been achieved. f : Rn0 → Rnd defined as:
If it has, then the training is complete and Problem 1 has been 2
xodd + sin(xeven )
solved. f (x) = 10.5nd ×1 ⊗ 2 , (15a)
xeven + sin(xodd )
n0
1
V. E XPERIMENTS xodd = ∑ xi 1odd (i),
⌈0.5n0 ⌉ i=1
(15b)
n0
We now assess the scalability of our method by observing 1
the results under different problem parameters. We also wish xeven = ∑ xi (1 − 1odd (i)), (15c)
⌊0.5n0 ⌋ i=1
to compare our results with [26] to assess our contribution
to the state of the art. All experiments were performed on a where ⌈·⌉ is the ceiling function, ⌊·⌋ is the floor function,
desktop computer with a 24-core i9 CPU, 32 GB RAM, and x = [x1 , · · · , xn0 ]⊺ , and 1odd : R+ → {0, 1} is the indicator
an NVIDIA RTX 4090 GPU on Python1 . function for odd numbers, such that 1odd (i) = 1 if i is odd
and 1odd (i) = 0 if i is even.
Given the pretrained network, we begin training to obey
A. Experiment Setup and Method the safety constraint. In each training iteration, we use IntOpt
We test our method’s performance under different condi- [39] to compute the loss function (12) and PyTorch [37]
tions by varying the width of the first layer n1 ∈ {10, 20, 30}, with optim.SGD as the optimizer to update the trainable
the depth of the network d ∈ {2, 3, 4}, the input dimen- parameters in the network. Every 10 iterations, we use
sion n0 ∈ {2, 4, 6}, the output dimension nd ∈ {2, 4, 6}, Gurobi [41] to solve the MILP in (8) to check the emptiness
the complexity of the input set nb,Z ∈ {0, 10, 20}, and the of Q. We are successful in solving Problem 1 if Q = 0, / at
complexity of the unsafe region nb,U ∈ {0, 10, 20}. We opted which point we terminate the training instead of updating
not to show results from higher dimensions, set complexities, the parameters. Note that each training iteration is done on
and larger networks here as we do not wish to introduce CPU instead of GPU. Furthermore, we chose not to solve
large confounding variables from the increased difficulties the MILP in every iteration because solving (8) can be many
in training with standard supervised learning. times slower than solving (11).
We define input and unsafe sets as follows. The input set We also compare against a constrained zonotope safe
is given by: training method [26]. We tested the method with n1 = 10,
d = 2, n0 = 2, nd = 2, nb,Z = 0, and nb,U = 0, which are
the parameters used in the example in [26]. To compare the

1 1
Z = HZ I, 1 I, 0, [ ], [ ], [ ] , (13a)
mZ mZ 1×(mZ −1) scalability of both methods, we also tested [26] on n1 = 20
nb,Z and n1 = 30. To ensure fairness, we do not include the
mZ = + 1, (13b) objective loss and only add the constraint loss when it is
n0
positive (see [26] for details). We terminate the training once
which is a hypercube with length 2 centered at the origin the constraint loss has reached zero (i.e. the reachable set is
n
formed from a union of mZ0 smaller hypercubes (repre- out of collision with the unsafe set).
sented as 2mZ −1 overlapping hypercubes). We want its image
through the neural network to avoid the unsafe region: B. Hypotheses
Since the most complex operations in our method are
0.5 0.5 solving the relaxed LP (11) and the MILP (8), we expect
U = HZ I, 1 I, 1.51, [ ], [ ], [ ] , (14a)
mU mU 1×(mU −1) our performance to be dependent on the solvers’ (i.e. IntOpt
nb,U and Gurobi) ability to scale with the number of variables and
mU = + 1, (14b)
nd constraints, which in turn scale linearly with the dimensions,
network size, and set complexity (see Sec. IV-A). As such,
which is a hypercube with length 1 centered at 1.51nd ×1 we expect the computation time for each iteration of our
n
formed from a union of mUd smaller hypercubes (represented method to be significantly faster than that of [26], which
as 2mU −1 overlapping hypercubes). We choose these particu- scales exponentially with the number of neurons. That said,
lar parameters such that the reachable set of the input set and since [26] verifies (7) in every iteration (whereas our method
the unsafe region all have shapes similar to those shown in only checks it every 10 iterations), it is also possible for [26]
Fig. 2a before we apply our method in IV. Also, when n0 = 2, to terminate the training earlier than our method does.
nd = 2, nb,Z = 0, and nb,U = 0, we recover the problem setup
in [26], which we will compare our method against. C. Results and Discussion
We then ensure our ReLU neural network represents a We report the results of our experiments in Table I. All
nonlinear function that intersects the unsafe set. In partic- reachable sets have been successfully driven out of the unsafe
ular, we use standard supervised learning (implemented in regions, except for [26] with n1 of 30, which failed to even
PyTorch [37]) to train the network to approximate a function compute the reachable set. We show the training progression
of one of the experiments in Fig. 2, which clearly shows the
1 We are preparing our code for open-source release loss function driving the reachable set out of collision.

5
0 Iterations 10 Iterations 20 Iterations 30 Iterations 40 Iterations
2 2 2 2 2

1 1 1 1 1

0 0 0 0 0

-1 -1 -1 -1 -1

-2 -2 -2 -2 -2

-1 0 1 2 -1 0 1 2 -1 0 1 2 -1 0 1 2 -1 0 1 2

0.000 s 0.180 s 0.388 s 0.581 s 0.814 s

(a) (b) (c) (d) (e)
Fig. 2. Training an input set’s reachable set (blue) through a neural network to avoid the unsafe region (red) after (a) 0, (b) 10, (c) 20, (d) 30, and (e)
40 iterations with our method, with network size (2, 10, 2), nb,Z = 0, and nb,Q = 10. The elapsed times are denoted below the figures. Our algorithm treats
the unsafe region as a union of 210 overlapping convex sets.

As expected, the computation time of our method is largely A. Demonstration Setup

dictated by the complexity of the LP and MILP problem. In In this experiment, we first train a ReLU neural network
theory, since IntOpt [39] is a primal-dual interior point solver, with d = 4 and n1 = n2 = n3 = 30 to approximate (15),
√
it has a complexity of O(k ng,Q + nb,Q ) [42], where k is the where n0 = nd = 2. We choose the input set as the union
bit length of the input data. On the other hand, since (8) of 7SV-polytopes (i.e. polytopes represented by vertices)
is NP-complete, it has a worst-case complexity of O(2nb,Q ) Z = 7i=1 Zi , where
[36]. In the experiments in this section, solving (8) was on
average 2 to 4 times longer than solving (11). To circumvent Z1 = conv([0, 1]⊺ , [0.2, 0.2]⊺ , [−0.2, 0.2]⊺ ),
this for more complex problems, we can either lower the Z2 = conv([−0.2, 0.2]⊺ , [−0.2, −0.2]⊺ , [−1, 0]⊺ ),
frequency of calling (8) (i.e. increase the number of iterations
Z3 = conv([0.2, −0.2]⊺ , [0, −0.1]⊺ , [−0.2, −0.2]⊺ ),
between calling the MILP), or replace (8) with faster but
over-approximative neural network verification methods such Z4 = conv([0.2, 0.2]⊺ , [1, 0]⊺ , [0.2, −0.2]⊺ ),
as [21]. In other words, our method can modular to other Z5 = conv([−1, 1]⊺ , [−0.1, 1]⊺ , [−0.28, 0.28]⊺ , [−1, 0.1]⊺ ),
neural network verification techniques. Z6 = conv([0.1, 1]⊺ , [1, 1]⊺ , [1, 0.1]⊺ ), and
We believe the power of our method comes from the low Z7 = conv([1, −0.1]⊺ , [1, −1]⊺ , [0.1, −1]⊺ , [0.28, −0.28]⊺ ),
complexity of hybrid zonotope representation (specifically,
Q), which scales only linearly with the number of neurons, where conv(·) is the convex hull operation. Similarly, we
input and output dimensions, and complexity of the input choose the unsafe set as the union of 3 V-polytopes U =
S3
set and the unsafe region. For example, when increasing i=1 Ui , where
nb,Z from 10 to 20, the maximum number of convex sets
U1 = conv([1, 2]⊺ , [2, 2]⊺ , [2, 1]⊺ ),
representable as a union by Z increases from 210 to 220 , even
though only 10 more binary (continuous in the LP’s case) U2 = conv([1.5, 0]⊺ , [2, −0.5]⊺ , [1.5, −1]⊺ ), and
variables are needed, adding only 0.056 s of computation U3 = conv([−0.5, 2]⊺ , [0, 1.5]⊺ , [−1, 1.5]⊺ ).
time to every 10 iterations. In contrast, since the complexity
of the constrained zonotope representation in [26] increases See Fig. 1 for a visualization of the sets. Note that a
exponentially with the number of neurons in the method, our union of nN V-polytopes with a total of nv vertices can be
method easily outperforms it for larger network sizes. For a exactly represented as a hybrid zonotope with 2nv continuous
more detailed discussion on the scalability and representation generators, nN binary generators, and nv + 2 constraints [43],
power of hybrid zonotopes, we refer the reader to [19] and and can be further simplified using reduction algorithms [34].
[27]. We use the same solvers and computer with Sec. V. As
before, we verify the satisfaction of (7) every 10 iterations
using (8).
VI. D EMONSTRATION
B. Results and Discussion
We now demonstrate our method’s ability to handle deep The results of the experiment are shown in Table I
neural networks and disjoint, non-convex input and unsafe and visualized in Fig. 1. Since the complexity of the set
sets. To the best of our knowledge, no existing method can representations and the network size are significantly larger
solve this problem with formal guarantees. than those in Sec. V, the number of constraints and variables

6
TABLE I
Summary of duration required to drive a neural network’s reachable set (i.e. image of a given input set) out of an unsafe region under different network
sizes, input and output dimensions, and complexity of the input set and the unsafe region. The dimensions of the hybrid zonotope intersection of the
reachable set and the unsafe region ng,Q , nc,Q , and nb,Q represents the complexity of the LPs and MILPs that must be solved during the training
iterations, which took up a majority of the computation time.

Network Size Time Time per 10 Iterations

Method nb,Z nb,U Iterations nc,Q ng,Q nb,Q
(n0 , · · · , nd ) (s) (s)
Sec. V
Increasing Network Width Increases Training Time in Each Iteration
(2, 10, 2) 0 0 1.318 70 0.188 32 44 10
Ours (2, 20, 2) 0 0 12.960 130 0.997 62 84 20
(2, 30, 2) 0 0 17.721 60 2.954 92 124 30
[26] Scales Very Poorly with Increasing Network Width
(2, 10, 2) 0 0 0.755 2 3.775 N/A N/A N/A
[26] (2, 20, 2) 0 0 946.677 3 3,155.590 N/A N/A N/A
(2, 30, 2) 0 0 Timeout N/A Timeout N/A N/A N/A
Increasing Network Depth Increases Training Time in Each Iteration
(2, 10, 10, 2) 0 0 0.879 10 0.879 62 84 20
Ours
(2, 10, 10, 10, 2) 0 0 5.709 20 2.855 92 124 30
Increasing Input Dimension Increases Training Time in Each Iteration
(4, 10, 2) 0 0 0.167 10 0.167 32 46 10
Ours
(6, 10, 2) 0 0 0.450 20 0.225 32 48 10
Increasing Output Dimension Increases Training Time in Each Iteration
(2, 10, 4) 0 0 5.756 270 0.195 34 46 10
Ours
(2, 10, 6) 0 0 0.191 10 0.191 36 48 10
Increasing Input Set Complexity Increases Training Time in Each Iteration
(2, 10, 2) 10 0 0.859 40 0.215 32 44 20
Ours
(2, 10, 2) 20 0 2.165 80 0.271 32 44 30
Increasing Unsafe Set Complexity Increases Training Time in Each Iteration
(2, 10, 2) 0 10 0.814 40 0.204 32 44 20
Ours
(2, 10, 2) 0 20 2.451 90 0.272 32 44 30
Sec. VI
Ours (2, 30, 30, 30, 2) 7 3 490.290 20 245.145 309 426 100

in the LPs and MILPs solved are also a magnitude larger. VII. C ONCLUSION
Despite this, our method is still able to drive the reachable
This work proposes a new training method for enforcing
set out of collision with the unsafe set in 20 iterations after
constraint satisfaction by extracting learning signals from
490.290 s. A majority of the computation time was spent
neural network reachability analysis using hybrid zonotopes.
solving the MILPs, which took 54.740 s and 194.987 s at
This method is exact and can handle non-convex input sets
the 10th and 20th iteration. In contrast, solving (11) took less
and unsafe regions, and has been shown to be fast and scale
than 0.1 s in each iteration.
fairly well with respect to network sizes, dimensions, and set
This demo presents preliminary results on how to train complexities, significantly outperforming our pervious work
a neural network to obey non-convex constraints with for- in [26].
mal guarantees for the first time. However, it also reveals Limitations: Our current implementation has several draw-
the method’s computational bottleneck of solving the NP- backs to be addressed in future work. Firstly, while the
complete problem in (8), which limits its utility in appli- training step remains fast and efficient with an increase in
cations that require larger networks and more complex sets. network sizes and set complexities, the verification step does
We plan to address this in future work by experimenting with not, since the MILP in (8) is NP-complete. Secondly, the
other neural network verification techniques, or by develop- method is limited to fully-connected networks with ReLU
ing over-approximation methods using hybrid zonotopes with activation functions, which prevents it from being applied to
simpler representations. more interesting problems such as those with convolutional

7
neural networks (CNNs) or those with neural networks [13] K.-C. Hsu, D. P. Nguyen, and J. F. Fisac, “Isaacs: Iterative soft
embedded in dynamical systems. Finally, as with other neural adversarial actor-critic for safety,” in Learning for Dynamics and
Control Conference, PMLR, 2023, pp. 90–103.
network training methods, backpropagation through the loss [14] R. Balestriero and Y. LeCun, “POLICE: Provably optimal linear
function does not guarantee convergence towards the global constraint enforcement for deep neural networks,” in ICASSP 2023-
minimum. Thus, our method cannot guarantee the discovery 2023 IEEE International Conference on Acoustics, Speech and
Signal Processing (ICASSP), IEEE, 2023, pp. 1–5.
of a solution, even if it exists. [15] J.-B. Bouvier, K. Nagpal, and N. Mehr, “POLICEd RL: Learning
Future Work: Going forward, we hope to explore our Closed-Loop Robot Control Policies with Provable Satisfaction of
method’s compatibility with other verification methods to Hard Constraints,” arXiv preprint arXiv:2403.13297, 2024.
[16] H.-D. Tran, X. Yang, D. Manzanas Lopez, et al., “NNV: the neural
overcome the NP-complete problem in solving the MILP, network verification tool for deep neural networks and learning-
at the cost of potentially losing exactness. We also plan enabled cyber-physical systems,” in International Conference on
to apply techniques from [31], [33], [34] to train ReLU Computer Aided Verification, Springer, 2020, pp. 3–17.
[17] H.-D. Tran, D. Manzanas Lopez, P. Musau, et al., “Star-based reach-
networks embedded in dynamical systems, and leverage ability analysis of deep neural networks,” in Formal Methods–The
tricks from [18] to apply hybrid zonotope techniques on Next 30 Years: Third World Congress, FM 2019, Porto, Portugal,
CNNs. If successful, they could advance safety in camera- October 7–11, 2019, Proceedings 3, Springer, 2019, pp. 670–686.
[18] H.-D. Tran, S. Bak, W. Xiang, and T. T. Johnson, “Verification of
based control for autonomous driving [16] or aircraft landing deep convolutional neural networks using imagestars,” in Interna-
[44]–[46]. tional conference on computer aided verification, Springer, 2020,
Another particularly exciting possibility this method can pp. 18–42.
[19] J. Ortiz, A. Vellucci, J. Koeln, and J. Ruths, “Hybrid zonotopes
enable is a form of set-based training, where instead of exactly represent ReLU neural networks,” in 2023 62nd IEEE
training a neural network with features and labels as points, Conference on Decision and Control (CDC), IEEE, 2023, pp. 5351–
we can represent them as sets around the points, which 5357.
[20] Y. Zhang and X. Xu, “Safety verification of neural feedback systems
can make the network provably robust against attacks and based on constrained zonotopes,” in 2022 IEEE 61st Conference on
disturbances for seen examples. This could be enabled by Decision and Control (CDC), IEEE, 2022, pp. 2737–2744.
solving the optimization problems (8) and (11) in parallel [21] H. Zhang, T.-W. Weng, P.-Y. Chen, C.-J. Hsieh, and L. Daniel, “Effi-
cient neural network robustness certification with general activation
on GPU using methods similar to [47]. functions,” Advances in neural information processing systems,
vol. 31, 2018.
R EFERENCES [22] N. Kochdumper, C. Schilling, M. Althoff, and S. Bak, “Open-
[1] M. Leshno, V. Y. Lin, A. Pinkus, and S. Schocken, “Multilayer and closed-loop neural network verification using polynomial zono-
feedforward networks with a nonpolynomial activation function can topes,” in NASA Formal Methods Symposium, Springer, 2023,
approximate any function,” Neural networks, vol. 6, no. 6, pp. 861– pp. 16–36.
867, 1993. [23] T. Ladner and M. Althoff, “Automatic abstraction refinement in
[2] G. Dulac-Arnold, N. Levine, D. J. Mankowitz, et al., “Challenges neural network verification using sensitivity analysis,” in Proceed-
of real-world reinforcement learning: definitions, benchmarks and ings of the 26th ACM International Conference on Hybrid Systems:
analysis,” Machine Learning, vol. 110, no. 9, pp. 2419–2468, 2021. Computation and Control, 2023, pp. 1–13.
[3] K. Eykholt, I. Evtimov, E. Fernandes, et al., “Robust physical-world [24] A. Harapanahalli and S. Coogan, “Certified Robust Invariant
attacks on deep learning visual classification,” in Proceedings of the Polytope Training in Neural Controlled ODEs,” arXiv preprint
IEEE conference on computer vision and pattern recognition, 2018, arXiv:2408.01273, 2024.
pp. 1625–1634. [25] J. K. Scott, D. M. Raimondo, G. R. Marseglia, and R. D. Braatz,
[4] C. Szegedy, “Intriguing properties of neural networks,” arXiv “Constrained zonotopes: A new tool for set-based estimation and
preprint arXiv:1312.6199, 2013. fault detection,” Automatica, vol. 69, pp. 126–136, 2016.
[5] B. Ko, H.-J. Choi, C. Hong, J.-H. Kim, O. C. Kwon, and C. D. [26] L. K. Chung, A. Dai, D. Knowles, S. Kousik, and G. X. Gao,
Yoo, “Neural network-based autonomous navigation for a homecare “Constrained feedforward neural network training via reachability
mobile robot,” in 2017 IEEE International Conference on Big Data analysis,” arXiv preprint arXiv:2107.07696, 2021.
and Smart Computing (BigComp), IEEE, 2017, pp. 403–406. [27] T. J. Bird, H. C. Pangborn, N. Jain, and J. P. Koeln, “Hybrid
[6] E. N. Johnson, A. J. Calise, and J. E. Corban, “Adaptive guid- zonotopes: A new set representation for reachability analysis of
ance and control for autonomous launch vehicles,” in 2001 IEEE mixed logical dynamical systems,” Automatica, vol. 154, p. 111 107,
Aerospace Conference Proceedings (Cat. No. 01TH8542), IEEE, 2023.
vol. 6, 2001, pp. 2669–2682. [28] T. J. Bird and N. Jain, “Unions and complements of hybrid
[7] J. Ni, Y. Chen, Y. Chen, J. Zhu, D. Ali, and W. Cao, “A survey zonotopes,” IEEE Control Systems Letters, vol. 6, pp. 1778–1783,
on theories and applications for self-driving cars based on deep 2021.
learning methods,” Applied Sciences, vol. 10, no. 8, p. 2749, 2020. [29] J. Koeln, T. J. Bird, J. Siefert, J. Ruths, H. C. Pangborn, and N. Jain,
[8] N. H. T. S. Administration et al., “Summary report: standing general “zonoLAB: A MATLAB toolbox for set-based control systems anal-
order on crash reporting for level 2 advanced driver assistance ysis using hybrid zonotopes,” in 2024 American Control Conference
systems,” US Department of Transport, 2022. (ACC), IEEE, 2024, pp. 2513–2520.
[9] L. Brunke, M. Greeff, A. W. Hall, et al., “Safe learning in robotics: [30] L. Hadjiloizou, F. J. Jiang, A. Alanwar, and K. H. Johansson,
From learning-based control to safe reinforcement learning,” Annual “Formal Verification of Linear Temporal Logic Specifications Us-
Review of Control, Robotics, and Autonomous Systems, vol. 5, no. 1, ing Hybrid Zonotope-Based Reachability Analysis,” arXiv preprint
pp. 411–444, 2022. arXiv:2404.03308, 2024.
[10] S. Gu, L. Yang, Y. Du, et al., “A review of safe reinforce- [31] Y. Zhang, H. Zhang, and X. Xu, “Backward reachability analysis
ment learning: Methods, theory and applications,” arXiv preprint of neural feedback systems using hybrid zonotopes,” IEEE Control
arXiv:2205.10330, 2022. Systems Letters, vol. 7, pp. 2779–2784, 2023.
[11] Z. Liu, Z. Guo, Y. Yao, et al., “Constrained decision transformer [32] T. J. Bird, J. A. Siefert, H. C. Pangborn, and N. Jain, “A set-based
for offline safe reinforcement learning,” in International Conference approach for robust control co-design,” in 2024 American Control
on Machine Learning, PMLR, 2023, pp. 21 611–21 630. Conference (ACC), IEEE, 2024, pp. 2564–2571.
[12] K. Chakraborty, A. Gupta, and S. Bansal, “Enhancing Safety and [33] H. Zhang, Y. Zhang, and X. Xu, “Hybrid Zonotope-Based Backward
Robustness of Vision-Based Controllers via Reachability Analysis,” Reachability Analysis for Neural Feedback Systems With Nonlinear
arXiv preprint arXiv:2410.21736, 2024. Plant Models,” in 2024 American Control Conference (ACC), IEEE,
2024, pp. 4155–4161.

8
[34] Y. Zhang and X. Xu, “Reachability analysis and safety verification
of neural feedback systems via hybrid zonotopes,” in 2023 American
Control Conference (ACC), IEEE, 2023, pp. 1915–1921.
[35] D. E. Rumelhart, G. E. Hinton, and R. J. Williams, “Learning repre-
sentations by back-propagating errors,” nature, vol. 323, no. 6088,
pp. 533–536, 1986.
[36] T. Achterberg, R. E. Bixby, Z. Gu, E. Rothberg, and D. Weninger,
“Presolve reductions in mixed integer programming,” INFORMS
Journal on Computing, vol. 32, no. 2, pp. 473–506, 2020.
[37] A. Paszke, S. Gross, F. Massa, et al., “Pytorch: An imperative
style, high-performance deep learning library,” Advances in neural
information processing systems, vol. 32, 2019.
[38] X. Hu, J. Lee, and J. Lee, “Two-Stage Predict+ Optimize for MILPs
with Unknown Parameters in Constraints,” Advances in Neural
Information Processing Systems, vol. 36, 2024.
[39] J. Mandi and T. Guns, “Interior point solving for lp-based pre-
diction+ optimisation,” Advances in Neural Information Processing
Systems, vol. 33, pp. 7272–7282, 2020.
[40] S. Boyd and L. Vandenberghe, Convex optimization. Cambridge
university press, 2004.
[41] L. Gurobi Optimization, Gurobi optimizer reference manual, 2021.
[42] S. J. Wright, Primal-dual interior-point methods. SIAM, 1997.
[43] J. A. Siefert, T. J. Bird, A. F. Thompson, et al., “Reachability
analysis using hybrid zonotopes and functional decomposition,”
IEEE Transactions on Automatic Control, 2025.
[44] M. J. Kochenderfer and J. Chryssanthacopoulos, “Robust airborne
collision avoidance through dynamic programming,” Massachusetts
Institute of Technology, Lincoln Laboratory, Project Report ATC-
371, vol. 130, 2011.
[45] M. J. Kochenderfer, J. E. Holland, and J. P. Chryssanthacopoulos,
“Next generation airborne collision avoidance system,” Lincoln
Laboratory Journal, vol. 19, no. 1, pp. 17–33, 2012.
[46] M. J. Kochenderfer, C. Amato, G. Chowdhary, et al., “Optimized
airborne collision avoidance,” 2015.
[47] B. Amos and J. Z. Kolter, “Optnet: Differentiable optimization as a
layer in neural networks,” in International conference on machine
learning, PMLR, 2017, pp. 136–145.

Neural Network Synthesis
No ratings yet
Neural Network Synthesis
8 pages
Scalable Synthesis of Formally Verified Neural Value Function For Hamilton-Jacobi Reachability Analysis
No ratings yet
Scalable Synthesis of Formally Verified Neural Value Function For Hamilton-Jacobi Reachability Analysis
36 pages
Run-Time Safety Monitoring of Neural-Network-Enabled Dynamical Systems
No ratings yet
Run-Time Safety Monitoring of Neural-Network-Enabled Dynamical Systems
10 pages
Towards Optimal Branching of Linear and Semidefinite Relaxations For Neural Network Robustness Certification
No ratings yet
Towards Optimal Branching of Linear and Semidefinite Relaxations For Neural Network Robustness Certification
59 pages
A N N V H S A I: Dvancing Eural Etwork Erification Through Ierarchical Afety Bstract Nterpretation
No ratings yet
A N N V H S A I: Dvancing Eural Etwork Erification Through Ierarchical Afety Bstract Nterpretation
17 pages
Fault Tolerant
No ratings yet
Fault Tolerant
10 pages
Linearly Constrained Neural Networks
No ratings yet
Linearly Constrained Neural Networks
31 pages
Survey of FNN
No ratings yet
Survey of FNN
25 pages
Deepsplit: Scalable Verification of Deep Neural Networks Via Operator Splitting
No ratings yet
Deepsplit: Scalable Verification of Deep Neural Networks Via Operator Splitting
26 pages
Soft Computing Practical Teacher Manual
No ratings yet
Soft Computing Practical Teacher Manual
87 pages
A Review of Safe Reinforcement Learning Methods For Modern Power Systems
No ratings yet
A Review of Safe Reinforcement Learning Methods For Modern Power Systems
43 pages
Bag of Tricks For Image Classification With Convolutional Neural Networks
No ratings yet
Bag of Tricks For Image Classification With Convolutional Neural Networks
10 pages
Static Security Assessment of Power System Using Kohonen Neural Network A. El-Sharkawi
No ratings yet
Static Security Assessment of Power System Using Kohonen Neural Network A. El-Sharkawi
5 pages
Imposing Hard Constraints On Deep Networks
No ratings yet
Imposing Hard Constraints On Deep Networks
9 pages
He Bag of Tricks For Image Classification With Convolutional Neural Networks CVPR 2019 Paper
No ratings yet
He Bag of Tricks For Image Classification With Convolutional Neural Networks CVPR 2019 Paper
10 pages
Chap 2 Training Feed Forward Neural Networks
No ratings yet
Chap 2 Training Feed Forward Neural Networks
22 pages
Ai - W7L13
No ratings yet
Ai - W7L13
46 pages
Near Zero Knowledge Existence of Undesired Behaviors
No ratings yet
Near Zero Knowledge Existence of Undesired Behaviors
4 pages
Min WCNN
No ratings yet
Min WCNN
11 pages
Neural Networks & Deep Learning Guide
No ratings yet
Neural Networks & Deep Learning Guide
16 pages
Artificial Neural NetworkIV
No ratings yet
Artificial Neural NetworkIV
6 pages
DL UNIT 3 - Part2
No ratings yet
DL UNIT 3 - Part2
34 pages
Artificial Neural Networks
No ratings yet
Artificial Neural Networks
100 pages
A Novel Neural Network For Nonlinear Convex Programming: Xing-Bao Gao
No ratings yet
A Novel Neural Network For Nonlinear Convex Programming: Xing-Bao Gao
9 pages
NIPS 2015 Backpropagation For Energy Efficient Neuromorphic Computing Paper
No ratings yet
NIPS 2015 Backpropagation For Energy Efficient Neuromorphic Computing Paper
9 pages
Chapters 1-4
No ratings yet
Chapters 1-4
6 pages
Dense Neural Nets
No ratings yet
Dense Neural Nets
68 pages
Neural Network Activation Functions
No ratings yet
Neural Network Activation Functions
15 pages
Wind Farm Presentation
No ratings yet
Wind Farm Presentation
55 pages
Lecture 06
No ratings yet
Lecture 06
22 pages
Multi-PDE Solutions via Neural Networks
No ratings yet
Multi-PDE Solutions via Neural Networks
4 pages
AI Lec24-25
No ratings yet
AI Lec24-25
63 pages
14 - Học sâu (3) - Improve DNN - v3
No ratings yet
14 - Học sâu (3) - Improve DNN - v3
129 pages
Neural Network for Ionosphere Analysis
No ratings yet
Neural Network for Ionosphere Analysis
7 pages
Incorporating System-Level Safety Requirements in Perception Models Via Reinforcement Learning
No ratings yet
Incorporating System-Level Safety Requirements in Perception Models Via Reinforcement Learning
9 pages
Lec 8
No ratings yet
Lec 8
43 pages
Dat 300
No ratings yet
Dat 300
12 pages
Xingbaogao 2010
No ratings yet
Xingbaogao 2010
12 pages
General Observation
No ratings yet
General Observation
93 pages
Deep Learning With Tensorflow
No ratings yet
Deep Learning With Tensorflow
50 pages
PDC Lecture 12
No ratings yet
PDC Lecture 12
42 pages
Uncertainty in Neural Networks
No ratings yet
Uncertainty in Neural Networks
41 pages
Assignment 13 Modern AI
No ratings yet
Assignment 13 Modern AI
3 pages
4 - DNN Tip
No ratings yet
4 - DNN Tip
52 pages
Lect 5
No ratings yet
Lect 5
26 pages
Vericompress: A Tool To Streamline The Synthesis of Verified Robust Compressed Neural Networks From Scratch
No ratings yet
Vericompress: A Tool To Streamline The Synthesis of Verified Robust Compressed Neural Networks From Scratch
10 pages
A Gentle Introduction To Neural Networks With Python
No ratings yet
A Gentle Introduction To Neural Networks With Python
85 pages
A Gentle Introduction To Neural Networks With Python
100% (1)
A Gentle Introduction To Neural Networks With Python
85 pages
Control System Term Paper
No ratings yet
Control System Term Paper
12 pages
Lec 35
No ratings yet
Lec 35
12 pages
Jurnal
No ratings yet
Jurnal
18 pages
2022 - Neural Optimization Machine-A Neural Network Approach For Optimization
No ratings yet
2022 - Neural Optimization Machine-A Neural Network Approach For Optimization
22 pages
IBest DeepLearning
No ratings yet
IBest DeepLearning
123 pages
Unit 5
No ratings yet
Unit 5
36 pages
Lec 105
No ratings yet
Lec 105
19 pages
Neural Network Presentation
No ratings yet
Neural Network Presentation
33 pages
Haykin, Xue-Neural Networks and Learning Machines 3ed Soln
53% (19)
Haykin, Xue-Neural Networks and Learning Machines 3ed Soln
103 pages
Haykin Xue Neural Networks and Learning Machines 3ed Soln PDF
50% (2)
Haykin Xue Neural Networks and Learning Machines 3ed Soln PDF
103 pages
Matrix Factorization For Inferring Associations and Missing Links
No ratings yet
Matrix Factorization For Inferring Associations and Missing Links
35 pages
2501 12599v1
No ratings yet
2501 12599v1
25 pages
Measuring AI Ability To Complete Long Tasks: Model Evaluation & Threat Research (METR)
No ratings yet
Measuring AI Ability To Complete Long Tasks: Model Evaluation & Threat Research (METR)
45 pages
Activation Space Interventions Can Be Transferred Between Large Language Models
No ratings yet
Activation Space Interventions Can Be Transferred Between Large Language Models
68 pages
Dynamic Pricing For On-Demand DNN Inference in The Edge-AI Market
No ratings yet
Dynamic Pricing For On-Demand DNN Inference in The Edge-AI Market
18 pages
GBFRS: Robust Fuzzy Rough Sets Via Granular-Ball Computing: Shuyin Xia, Xiaoyu Lian, Binbin Sang, Guoyin Wang, Xinbo Gao
No ratings yet
GBFRS: Robust Fuzzy Rough Sets Via Granular-Ball Computing: Shuyin Xia, Xiaoyu Lian, Binbin Sang, Guoyin Wang, Xinbo Gao
12 pages
Gravity-Bench-v1: A Benchmark On Gravitational Physics Discovery For Agents
No ratings yet
Gravity-Bench-v1: A Benchmark On Gravitational Physics Discovery For Agents
20 pages
Precisecam: Precise Camera Control For Text-To-Image Generation
No ratings yet
Precisecam: Precise Camera Control For Text-To-Image Generation
14 pages
Eliza: A Web3 Friendly AI Agent Operating System
No ratings yet
Eliza: A Web3 Friendly AI Agent Operating System
20 pages
Lifelong Learning of Large Language Model Based Agents: A Roadmap
No ratings yet
Lifelong Learning of Large Language Model Based Agents: A Roadmap
46 pages
Evolution and The Knightian Blindspot of Machine Learning
No ratings yet
Evolution and The Knightian Blindspot of Machine Learning
35 pages
Remembering, Reflecting and Dynamic Decision Making For Web Agents
No ratings yet
Remembering, Reflecting and Dynamic Decision Making For Web Agents
12 pages
Mutation-Guided LLM-based Test Generation at Meta: Christopher Foster Abhishek Gulati
No ratings yet
Mutation-Guided LLM-based Test Generation at Meta: Christopher Foster Abhishek Gulati
12 pages
The Goofus & Gallant Story Corpus For Practical Value Alignment
No ratings yet
The Goofus & Gallant Story Corpus For Practical Value Alignment
8 pages
Learning Graph Node Embeddings by Smooth Pair Sampling: Konstantin Kutzkov
No ratings yet
Learning Graph Node Embeddings by Smooth Pair Sampling: Konstantin Kutzkov
37 pages
Let'S Verify and Reinforce Image Generation Step by Step: Can We Generate Images With Cot?
No ratings yet
Let'S Verify and Reinforce Image Generation Step by Step: Can We Generate Images With Cot?
26 pages
LLMPC: Large Language Model Predictive Control: Gabriel Maher January 7, 2025
No ratings yet
LLMPC: Large Language Model Predictive Control: Gabriel Maher January 7, 2025
27 pages
ELIZA Reanimated: The World's First Chatbot Restored On The World's First Time Sharing System
No ratings yet
ELIZA Reanimated: The World's First Chatbot Restored On The World's First Time Sharing System
21 pages
AIO L: A H F E AIA E A C: PS AB Olistic Ramework To Valuate Gents For Nabling Utonomous Louds
No ratings yet
AIO L: A H F E AIA E A C: PS AB Olistic Ramework To Valuate Gents For Nabling Utonomous Louds
14 pages
Generative AI in Education: From Foundational Insights To The Socratic Playground For Learning
No ratings yet
Generative AI in Education: From Foundational Insights To The Socratic Playground For Learning
49 pages
Unifying Two Types of Scaling Laws From The Perspective of Conditional Kolmogorov Complexity
No ratings yet
Unifying Two Types of Scaling Laws From The Perspective of Conditional Kolmogorov Complexity
10 pages
M Rag: T E S R - A G: INI Owards Xtremely Imple Etrieval Ugmented Eneration
No ratings yet
M Rag: T E S R - A G: INI Owards Xtremely Imple Etrieval Ugmented Eneration
16 pages
Why Are We Living The Age of AI Applications Right Now? The Long Innovation Path From AI's Birth To A Child's Bedtime Magic
No ratings yet
Why Are We Living The Age of AI Applications Right Now? The Long Innovation Path From AI's Birth To A Child's Bedtime Magic
14 pages
The Essentials of AI For Life and Society: An AI Literacy Course For The University Community
No ratings yet
The Essentials of AI For Life and Society: An AI Literacy Course For The University Community
6 pages
H GPT L L L: OW Earns Ayer by Ayer
No ratings yet
H GPT L L L: OW Earns Ayer by Ayer
14 pages
On The Complexity of Global Necessary Reasons To Explain Classification
No ratings yet
On The Complexity of Global Necessary Reasons To Explain Classification
26 pages
Test-Time Computing: From System-1 Thinking To System-2 Thinking
No ratings yet
Test-Time Computing: From System-1 Thinking To System-2 Thinking
22 pages
Artificial Intelligence in Creative Industries: Advances Prior To 2025
No ratings yet
Artificial Intelligence in Creative Industries: Advances Prior To 2025
68 pages
Anonymization of Documents For Law Enforcement With Machine Learning
No ratings yet
Anonymization of Documents For Law Enforcement With Machine Learning
7 pages
A Study On Educational Data Analysis and Personalized Feedback Report Generation Based On Tags and Chatgpt
No ratings yet
A Study On Educational Data Analysis and Personalized Feedback Report Generation Based On Tags and Chatgpt
8 pages
Wavelet Theory and Application in Communication An
No ratings yet
Wavelet Theory and Application in Communication An
18 pages
18.085 Computational Science and Engineering I: Mit Opencourseware
No ratings yet
18.085 Computational Science and Engineering I: Mit Opencourseware
13 pages
Short-Run Cost Output Relationship
No ratings yet
Short-Run Cost Output Relationship
5 pages
Intro to Statistics for Students
No ratings yet
Intro to Statistics for Students
28 pages
The Joys of Compounding
100% (1)
The Joys of Compounding
20 pages
Mtap G4S1 Student
No ratings yet
Mtap G4S1 Student
2 pages
Linear Differential Equation
No ratings yet
Linear Differential Equation
35 pages
cs5300 Day06 Adversarial Search
No ratings yet
cs5300 Day06 Adversarial Search
5 pages
LiDAR Full Notes
No ratings yet
LiDAR Full Notes
32 pages
Lec 1
No ratings yet
Lec 1
54 pages
Linear Equations and Inequalities Lesson Plan
100% (1)
Linear Equations and Inequalities Lesson Plan
7 pages
G.T.N. Arts College (Autonomous)
No ratings yet
G.T.N. Arts College (Autonomous)
20 pages
Introduction To Jflap - Jar and Finite State Automata: Theory of Computation (Cs 333) Spring Term, 2011 (Prof. Mckelvey)
No ratings yet
Introduction To Jflap - Jar and Finite State Automata: Theory of Computation (Cs 333) Spring Term, 2011 (Prof. Mckelvey)
7 pages
Math Problem Set with Solutions
No ratings yet
Math Problem Set with Solutions
5 pages
All The Math You Missed - But Need To Know For Graduate School
100% (36)
All The Math You Missed - But Need To Know For Graduate School
417 pages
Residual Offset in Silicon Hall-Effect Sensor Analytical Formula Stress Effects and Implications For Octagonal Hall Plate Geometry
No ratings yet
Residual Offset in Silicon Hall-Effect Sensor Analytical Formula Stress Effects and Implications For Octagonal Hall Plate Geometry
9 pages
Unit 5
No ratings yet
Unit 5
25 pages
Statistical Quality Control
100% (1)
Statistical Quality Control
3 pages
General Format Sip
No ratings yet
General Format Sip
3 pages
Sorting Search New
No ratings yet
Sorting Search New
15 pages
Grade 2 Class Prog
No ratings yet
Grade 2 Class Prog
1 page
Grade 4 DLL Quarter 3 Week 1 (Sir Bien Cruz)
No ratings yet
Grade 4 DLL Quarter 3 Week 1 (Sir Bien Cruz)
47 pages
PMS KPK
No ratings yet
PMS KPK
2 pages
Geography IA Guide
100% (1)
Geography IA Guide
13 pages
Math Patterns for Grade 9 Students
No ratings yet
Math Patterns for Grade 9 Students
4 pages
Optimal Coordination of Directional Over Current Relays For Distribution Systems Using Hybrid GWO-CSA
No ratings yet
Optimal Coordination of Directional Over Current Relays For Distribution Systems Using Hybrid GWO-CSA
14 pages
Applied Mathematics Msbte Board Paper PDF
No ratings yet
Applied Mathematics Msbte Board Paper PDF
3 pages
Pamantasan NG Lungsod NG Muntinlupa University Road Poblacion, Muntinlupa College of Teacher Education
No ratings yet
Pamantasan NG Lungsod NG Muntinlupa University Road Poblacion, Muntinlupa College of Teacher Education
8 pages
Ss 2 Economics 1st Term E-Note
No ratings yet
Ss 2 Economics 1st Term E-Note
77 pages
Csa FSP 22001
No ratings yet
Csa FSP 22001
12 pages

Provably-Safe Neural Network Training Using Hybrid Zonotope Reachability Analysis

Uploaded by

Provably-Safe Neural Network Training Using Hybrid Zonotope Reachability Analysis

Uploaded by

Provably-Safe Neural Network Training Using

Hybrid Zonotope Reachability Analysis

Network 1.5 1.5

0 0.5 MILP 0.5

0.000 s 0.180 s 0.388 s 0.581 s 0.814 s

As expected, the computation time of our method is largely A. Demonstration Setup

Network Size Time Time per 10 Iterations

You might also like