Case Study: Quadratic Function Optimization
Let’s consider optimizing the quadratic function:
$$f(x) = \frac{1}{2} x^T Q x - b^T x$$
where
$$Q = \begin{bmatrix} 4 & 1 \\ 1 & 3 \end{bmatrix}, \qquad b = \begin{bmatrix} 1 \\ 2 \end{bmatrix}$$
This is a strictly convex function (since $Q$ is symmetric positive definite) with a unique minimizer at $x^* = Q^{-1} b$.
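A minimal NumPy sketch, assuming only `numpy`, computes this minimizer directly for later reference:

```python
import numpy as np

Q = np.array([[4.0, 1.0],
              [1.0, 3.0]])
b = np.array([1.0, 2.0])

# The unique minimizer of f(x) = 0.5 x^T Q x - b^T x solves Q x = b
x_star = np.linalg.solve(Q, b)
print(x_star)  # [0.09090909 0.63636364], i.e. (1/11) * [1, 7]
```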
Step 1: Initialization
We choose an initial guess:
$$x_0 = \begin{bmatrix} 2 \\ 1 \end{bmatrix}$$
The initial inverse Hessian approximation is the identity matrix:
$$H_0 = I = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}$$
Now, your turn! Compute the initial gradient $g_0 = \nabla f(x_0)$. What do you get?
(RESPONSE): We compute the initial gradient:
$$g_0 = \nabla f(x_0) = Q x_0 - b$$
Substituting the given values:
$$g_0 = \begin{bmatrix} 4 & 1 \\ 1 & 3 \end{bmatrix} \begin{bmatrix} 2 \\ 1 \end{bmatrix} - \begin{bmatrix} 1 \\ 2 \end{bmatrix}$$
Perform the matrix-vector multiplication:
$$\begin{bmatrix} 4(2) + 1(1) \\ 1(2) + 3(1) \end{bmatrix} = \begin{bmatrix} 8 + 1 \\ 2 + 3 \end{bmatrix} = \begin{bmatrix} 9 \\ 5 \end{bmatrix}$$
Subtracting $b$:
$$g_0 = \begin{bmatrix} 9 \\ 5 \end{bmatrix} - \begin{bmatrix} 1 \\ 2 \end{bmatrix} = \begin{bmatrix} 8 \\ 3 \end{bmatrix}$$
Thus, the initial gradient is:
$$g_0 = \begin{bmatrix} 8 \\ 3 \end{bmatrix}$$
Now, compute the search direction using the initial inverse Hessian approximation:
$$p_0 = -H_0 g_0$$
What do you get?
We compute the search direction:
$$p_0 = -H_0 g_0$$
Since $H_0$ is the identity matrix:
$$p_0 = -I g_0 = -\begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix} \begin{bmatrix} 8 \\ 3 \end{bmatrix}$$
Multiplying by $I$ (which leaves the vector unchanged):
$$p_0 = -\begin{bmatrix} 8 \\ 3 \end{bmatrix} = \begin{bmatrix} -8 \\ -3 \end{bmatrix}$$
Thus, the search direction is:
$$p_0 = \begin{bmatrix} -8 \\ -3 \end{bmatrix}$$
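These two steps are easy to verify numerically; a short NumPy sketch mirroring the hand computation:

```python
import numpy as np

Q = np.array([[4.0, 1.0],
              [1.0, 3.0]])
b = np.array([1.0, 2.0])
x0 = np.array([2.0, 1.0])
H0 = np.eye(2)  # initial inverse Hessian approximation

g0 = Q @ x0 - b  # gradient at x0: [8, 3]
p0 = -H0 @ g0    # search direction: [-8, -3]
print(g0, p0)
```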
Now, perform a line search to determine the optimal step size $\alpha_0$. We seek $\alpha_0 > 0$ minimizing:
$$\phi(\alpha) = f(x_0 + \alpha p_0)$$
Substituting $f(x) = \frac{1}{2} x^T Q x - b^T x$, derive the optimal $\alpha_0$.
(RESPONSE):
(I ASKED IF THAT WAS ALL, AND IT SAID:)
No, there’s much more! Let’s go step by step through the line search process
in detail.
We aim to minimize:
$$\phi(\alpha) = f(x_0 + \alpha p_0)$$
Step 1: Express $x(\alpha)$ in Terms of $\alpha$
Since $x_0 = \begin{bmatrix} 2 \\ 1 \end{bmatrix}$ and $p_0 = \begin{bmatrix} -8 \\ -3 \end{bmatrix}$, we write:
$$x(\alpha) = x_0 + \alpha p_0 = \begin{bmatrix} 2 \\ 1 \end{bmatrix} + \alpha \begin{bmatrix} -8 \\ -3 \end{bmatrix} = \begin{bmatrix} 2 - 8\alpha \\ 1 - 3\alpha \end{bmatrix}$$
Step 2: Compute $\phi(\alpha) = f(x(\alpha))$
Since $f(x) = \frac{1}{2} x^T Q x - b^T x$, we expand:
$$\phi(\alpha) = \frac{1}{2} x(\alpha)^T Q x(\alpha) - b^T x(\alpha)$$
First Term: Quadratic Form Expansion
Expanding $x(\alpha)^T Q x(\alpha)$:
$$x(\alpha)^T Q x(\alpha) = \begin{bmatrix} 2 - 8\alpha & 1 - 3\alpha \end{bmatrix} \begin{bmatrix} 4 & 1 \\ 1 & 3 \end{bmatrix} \begin{bmatrix} 2 - 8\alpha \\ 1 - 3\alpha \end{bmatrix}$$
Performing the matrix-vector multiplication:
$$Q x(\alpha) = \begin{bmatrix} 4(2 - 8\alpha) + 1(1 - 3\alpha) \\ 1(2 - 8\alpha) + 3(1 - 3\alpha) \end{bmatrix} = \begin{bmatrix} 8 - 32\alpha + 1 - 3\alpha \\ 2 - 8\alpha + 3 - 9\alpha \end{bmatrix} = \begin{bmatrix} 9 - 35\alpha \\ 5 - 17\alpha \end{bmatrix}$$
Now computing $x(\alpha)^T \left( Q x(\alpha) \right)$:
$$\begin{bmatrix} 2 - 8\alpha & 1 - 3\alpha \end{bmatrix} \begin{bmatrix} 9 - 35\alpha \\ 5 - 17\alpha \end{bmatrix} = (2 - 8\alpha)(9 - 35\alpha) + (1 - 3\alpha)(5 - 17\alpha)$$
Expanding:
$$= 18 - 70\alpha - 72\alpha + 280\alpha^2 + 5 - 17\alpha - 15\alpha + 51\alpha^2$$
$$= 23 - 174\alpha + 331\alpha^2$$
Thus:
$$\frac{1}{2} x(\alpha)^T Q x(\alpha) = \frac{1}{2}\left( 23 - 174\alpha + 331\alpha^2 \right) = \frac{23}{2} - 87\alpha + \frac{331}{2}\alpha^2$$
Second Term: Linear Term Expansion
$$b^T x(\alpha) = \begin{bmatrix} 1 & 2 \end{bmatrix} \begin{bmatrix} 2 - 8\alpha \\ 1 - 3\alpha \end{bmatrix} = 1(2 - 8\alpha) + 2(1 - 3\alpha) = 2 - 8\alpha + 2 - 6\alpha = 4 - 14\alpha$$
Step 3: Derivative to Find $\alpha_0$
$$\phi(\alpha) = \frac{23}{2} - 87\alpha + \frac{331}{2}\alpha^2 - (4 - 14\alpha)$$
$$= \frac{23}{2} - 87\alpha + \frac{331}{2}\alpha^2 - 4 + 14\alpha$$
$$= \frac{15}{2} - 73\alpha + \frac{331}{2}\alpha^2$$
Taking the derivative:
$$\phi'(\alpha) = -73 + \frac{331}{2}(2\alpha) = -73 + 331\alpha$$
Setting $\phi'(\alpha) = 0$:
$$331\alpha = 73$$
$$\alpha_0 = \frac{73}{331} \approx 0.2205$$
Final Answer for Step Size:
$$\alpha_0 \approx 0.2205$$
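For a quadratic objective, the exact minimizing step along any direction $p$ has the closed form $\alpha = -\frac{p^T g}{p^T Q p}$, obtained by setting $\phi'(\alpha) = 0$ directly. A sketch that reproduces $\alpha_0 = 73/331$ and previews the iterate update requested next:

```python
import numpy as np

Q = np.array([[4.0, 1.0],
              [1.0, 3.0]])
b = np.array([1.0, 2.0])
x0 = np.array([2.0, 1.0])

g0 = Q @ x0 - b
p0 = -g0  # H0 = I

# Exact line search for a quadratic: phi'(alpha) = p^T g + alpha * p^T Q p = 0
alpha0 = -(p0 @ g0) / (p0 @ Q @ p0)
print(alpha0)            # 0.2205438... = 73/331
print(x0 + alpha0 * p0)  # x1 = [0.2356..., 0.3383...]
```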
Your Turn
Now update the iterate:
$$x_1 = x_0 + \alpha_0 p_0$$
Substituting the values, compute $x_1$.
(RESPONSE):
We update the iterate:
$$x_1 = x_0 + \alpha_0 p_0$$
Substituting the known values:
$$x_0 = \begin{bmatrix} 2 \\ 1 \end{bmatrix}, \quad p_0 = \begin{bmatrix} -8 \\ -3 \end{bmatrix}, \quad \alpha_0 = \frac{73}{331} \approx 0.2205$$
Performing the element-wise update:
$$x_1 = \begin{bmatrix} 2 \\ 1 \end{bmatrix} + 0.2205 \begin{bmatrix} -8 \\ -3 \end{bmatrix} = \begin{bmatrix} 2 - 0.2205 \times 8 \\ 1 - 0.2205 \times 3 \end{bmatrix} = \begin{bmatrix} 2 - 1.764 \\ 1 - 0.6615 \end{bmatrix} = \begin{bmatrix} 0.236 \\ 0.3385 \end{bmatrix}$$
Thus, the updated iterate is:
$$x_1 = \begin{bmatrix} 0.236 \\ 0.3385 \end{bmatrix}$$
Next Step: Compute Differences for the BFGS Update
Now, compute:
$$s_0 = x_1 - x_0, \qquad y_0 = \nabla f(x_1) - \nabla f(x_0)$$
Since we already know $x_0$ and $x_1$, compute $s_0$ first. Then find $\nabla f(x_1)$ using the gradient formula:
$$\nabla f(x) = Qx - b$$
Use this to compute $y_0$. Let’s see your result!
We now compute the differences required for the BFGS update.
Step 1: Compute $s_0$
$$s_0 = x_1 - x_0$$
Substituting the values:
$$s_0 = \begin{bmatrix} 0.236 \\ 0.3385 \end{bmatrix} - \begin{bmatrix} 2 \\ 1 \end{bmatrix} = \begin{bmatrix} 0.236 - 2 \\ 0.3385 - 1 \end{bmatrix} = \begin{bmatrix} -1.764 \\ -0.6615 \end{bmatrix}$$
Step 2: Compute $y_0 = \nabla f(x_1) - \nabla f(x_0)$
We use the gradient formula:
$$\nabla f(x) = Qx - b$$
We already computed $\nabla f(x_0) = \begin{bmatrix} 8 \\ 3 \end{bmatrix}$. Now compute $\nabla f(x_1)$:
$$\nabla f(x_1) = Q x_1 - b = \begin{bmatrix} 4 & 1 \\ 1 & 3 \end{bmatrix} \begin{bmatrix} 0.236 \\ 0.3385 \end{bmatrix} - \begin{bmatrix} 1 \\ 2 \end{bmatrix}$$
First, perform the matrix-vector multiplication:
$$Q x_1 = \begin{bmatrix} 4(0.236) + 1(0.3385) \\ 1(0.236) + 3(0.3385) \end{bmatrix} = \begin{bmatrix} 0.944 + 0.3385 \\ 0.236 + 1.0155 \end{bmatrix} = \begin{bmatrix} 1.2825 \\ 1.2515 \end{bmatrix}$$
Now subtract $b$:
$$\nabla f(x_1) = \begin{bmatrix} 1.2825 \\ 1.2515 \end{bmatrix} - \begin{bmatrix} 1 \\ 2 \end{bmatrix} = \begin{bmatrix} 1.2825 - 1 \\ 1.2515 - 2 \end{bmatrix} = \begin{bmatrix} 0.2825 \\ -0.7485 \end{bmatrix}$$
Now, compute $y_0$:
$$y_0 = \nabla f(x_1) - \nabla f(x_0) = \begin{bmatrix} 0.2825 \\ -0.7485 \end{bmatrix} - \begin{bmatrix} 8 \\ 3 \end{bmatrix} = \begin{bmatrix} 0.2825 - 8 \\ -0.7485 - 3 \end{bmatrix} = \begin{bmatrix} -7.7175 \\ -3.7485 \end{bmatrix}$$
Final Results
$$s_0 = \begin{bmatrix} -1.764 \\ -0.6615 \end{bmatrix}, \qquad y_0 = \begin{bmatrix} -7.7175 \\ -3.7485 \end{bmatrix}$$
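In NumPy, the same differences (a sketch using the rounded $x_1$ from above; exact fractions give $s_0 = \frac{73}{331}(-8, -3)^T$):

```python
import numpy as np

Q = np.array([[4.0, 1.0],
              [1.0, 3.0]])
b = np.array([1.0, 2.0])
x0 = np.array([2.0, 1.0])
x1 = np.array([0.236, 0.3385])  # rounded iterate from the line search step

s0 = x1 - x0                      # [-1.764, -0.6615]
y0 = (Q @ x1 - b) - (Q @ x0 - b)  # gradient difference: [-7.7175, -3.7485]
print(s0, y0)
```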
(RESPONSE):
Next Step: Compute $\rho_0$ and Update $H_1$
Now, compute:
$$\rho_0 = \frac{1}{y_0^T s_0}$$
$$H_1 = \left( I - \rho_0 s_0 y_0^T \right) H_0 \left( I - \rho_0 y_0 s_0^T \right) + \rho_0 s_0 s_0^T$$
Let’s see your results!
You’re right! Let’s wrap it up properly.
Final Step: Compute $\rho_0$ and Update $H_1$
Step 1: Compute $\rho_0$
$$\rho_0 = \frac{1}{y_0^T s_0}$$
First, compute $y_0^T s_0$:
$$y_0^T s_0 = (-7.7175)(-1.764) + (-3.7485)(-0.6615) = 13.6137 + 2.4796 = 16.0933$$
Thus:
$$\rho_0 = \frac{1}{16.0933} \approx 0.0621$$
Step 2: Compute $H_1$ using the BFGS update
$$H_1 = \left( I - \rho_0 s_0 y_0^T \right) H_0 \left( I - \rho_0 y_0 s_0^T \right) + \rho_0 s_0 s_0^T$$
Since $H_0 = I$, this reduces to:
$$H_1 = \left( I - \rho_0 s_0 y_0^T \right) \left( I - \rho_0 y_0 s_0^T \right) + \rho_0 s_0 s_0^T$$
Note that the product cannot be expanded term by term and the cross term dropped: multiplying out the factors produces $\rho_0^2 \left( y_0^T y_0 \right) s_0 s_0^T$, so we carry out the full matrix product.
Computing each term:
1. $\rho_0 s_0 y_0^T$ (its transpose gives $\rho_0 y_0 s_0^T$):
$$\rho_0 s_0 y_0^T = 0.0621 \begin{bmatrix} -1.764 \\ -0.6615 \end{bmatrix} \begin{bmatrix} -7.7175 & -3.7485 \end{bmatrix} = 0.0621 \begin{bmatrix} 13.6137 & 6.6124 \\ 5.1052 & 2.4796 \end{bmatrix} = \begin{bmatrix} 0.8454 & 0.4106 \\ 0.3170 & 0.1540 \end{bmatrix}$$
Since $\rho_0 y_0 s_0^T = \left( \rho_0 s_0 y_0^T \right)^T$, the two factors are:
$$I - \rho_0 s_0 y_0^T = \begin{bmatrix} 0.1546 & -0.4106 \\ -0.3170 & 0.8460 \end{bmatrix}, \qquad I - \rho_0 y_0 s_0^T = \begin{bmatrix} 0.1546 & -0.3170 \\ -0.4106 & 0.8460 \end{bmatrix}$$
Their product is:
$$\left( I - \rho_0 s_0 y_0^T \right) \left( I - \rho_0 y_0 s_0^T \right) = \begin{bmatrix} 0.1925 & -0.3964 \\ -0.3964 & 0.8162 \end{bmatrix}$$
2. $\rho_0 s_0 s_0^T$:
$$\rho_0 s_0 s_0^T = 0.0621 \begin{bmatrix} -1.764 \\ -0.6615 \end{bmatrix} \begin{bmatrix} -1.764 & -0.6615 \end{bmatrix} = 0.0621 \begin{bmatrix} 3.1117 & 1.1669 \\ 1.1669 & 0.4376 \end{bmatrix} = \begin{bmatrix} 0.1932 & 0.0725 \\ 0.0725 & 0.0272 \end{bmatrix}$$
Adding $\rho_0 s_0 s_0^T$ to the previous result:
$$H_1 = \begin{bmatrix} 0.1925 & -0.3964 \\ -0.3964 & 0.8162 \end{bmatrix} + \begin{bmatrix} 0.1932 & 0.0725 \\ 0.0725 & 0.0272 \end{bmatrix} = \begin{bmatrix} 0.3857 & -0.3239 \\ -0.3239 & 0.8434 \end{bmatrix}$$
Final Result:
The updated inverse Hessian approximation:
$$H_1 \approx \begin{bmatrix} 0.386 & -0.324 \\ -0.324 & 0.843 \end{bmatrix}$$
As expected, $H_1$ is symmetric and positive definite, which the BFGS update guarantees whenever $y_0^T s_0 > 0$.
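The update is compact to verify in NumPy; a sketch with the rounded $s_0$, $y_0$ from above, matching the hand computation to three decimals:

```python
import numpy as np

s0 = np.array([-1.764, -0.6615])
y0 = np.array([-7.7175, -3.7485])
H0 = np.eye(2)

rho0 = 1.0 / (y0 @ s0)
V = np.eye(2) - rho0 * np.outer(s0, y0)  # I - rho0 * s0 y0^T
H1 = V @ H0 @ V.T + rho0 * np.outer(s0, s0)
print(H1)                     # approx [[ 0.386 -0.324]
                              #         [-0.324  0.843]]
print(np.allclose(H1, H1.T))  # True: the update preserves symmetry
```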
Final Thoughts
This completes the first iteration of the BFGS algorithm. In subsequent iterations, we would:
- Check for convergence, e.g. whether $\|\nabla f(x_k)\|$ is sufficiently small.
- Compute a new search direction $p_k = -H_k \nabla f(x_k)$.
- Perform a line search for $\alpha_k$.
- Update $x_{k+1} = x_k + \alpha_k p_k$ and $H_{k+1}$, and iterate again.
A minimal sketch of this loop is given below.
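To tie the case study together, here is a minimal sketch of the full loop for this quadratic. It uses the exact quadratic line search derived above rather than a general-purpose Wolfe line search, so it applies to quadratics only:

```python
import numpy as np

Q = np.array([[4.0, 1.0],
              [1.0, 3.0]])
b = np.array([1.0, 2.0])

x = np.array([2.0, 1.0])  # x0
H = np.eye(2)             # H0
I = np.eye(2)

for k in range(20):
    g = Q @ x - b
    if np.linalg.norm(g) < 1e-10:       # convergence check
        break
    p = -H @ g                          # search direction
    alpha = -(p @ g) / (p @ Q @ p)      # exact line search (quadratic only)
    x_new = x + alpha * p
    s = x_new - x
    y = Q @ (x_new - x)                 # gradient difference; b cancels
    rho = 1.0 / (y @ s)
    V = I - rho * np.outer(s, y)
    H = V @ H @ V.T + rho * np.outer(s, s)  # BFGS inverse Hessian update
    x = x_new

print(x)                      # converges to Q^{-1} b = [1/11, 7/11]
print(np.linalg.solve(Q, b))  # [0.0909..., 0.6363...] for comparison
```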