Optimization 2
Lecture outline
Unconstrained continuous optimization:
• Convexity
• Iterative optimization algorithms
• Gradient descent
• Newton’s method
• Gauss-Newton method
New topics:
• Axial iteration
• Levenberg-Marquardt algorithm
• Application
Introduction: Problem specification
Suppose we have a cost function (or objective function)
f(x) : R^n → R
Our aim is to find the value of the parameters x that minimizes this function:

    x* = arg min_x f(x)
subject to the following constraints:
• equality: c_i(x) = 0,  i = 1, ..., m_e
• inequality: c_i(x) ≥ 0,  i = m_e + 1, ..., m
We will start by focussing on unconstrained problems
Unconstrained optimization
[Figure: a function of one variable f(x) and min_x f(x), with a local minimum and the global minimum marked.]
• down-hill search (gradient descent) algorithms can find local minima
• which of the minima is found depends on the starting point
• such minima often occur in real applications
Reminder: convexity
Class of functions
[Figure: an example of a convex function and of a non-convex function.]
• Convexity provides a test for a single extremum
• A non-negative sum of convex functions is convex
Class of functions continued
[Figure: single extremum – convex; single extremum – non-convex; multiple extrema – non-convex; noisy; horrible.]
Optimization algorithm – key ideas
! "#$% δx &'() * ) + * f , x - δx . < f , x .
! /)#& 0 12+%&0* 3 0 +$0#*24+*#520'6%+*20 x n ! " # 7 0 x n - δx
! 82%'(20*)20643912:0* 3 0 +0&24#2&03;0< = 0 1#$20&2+4()2&0δx 7 α p
Optimization algorithm – Random direction
Choosing the direction 1: axial iteration
Alternate minimization over x and y
Optimization algorithm – Axial directions
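Below is a minimal sketch of axial iteration in Python/NumPy, assuming a generic objective f and using scipy.optimize.minimize_scalar for each 1D minimization; the function names and the test quadratic are illustrative, not from the lecture.

import numpy as np
from scipy.optimize import minimize_scalar

def axial_iteration(f, x0, n_sweeps=50, tol=1e-8):
    # Alternate 1D minimizations along each coordinate axis until the point stops moving.
    x = np.asarray(x0, dtype=float).copy()
    for _ in range(n_sweeps):
        x_old = x.copy()
        for i in range(x.size):
            def f_along_axis(t, i=i):
                x_trial = x.copy()
                x_trial[i] = t          # vary only coordinate i
                return f(x_trial)
            x[i] = minimize_scalar(f_along_axis).x
        if np.linalg.norm(x - x_old) < tol:
            break
    return x

# Illustrative quadratic with minimum at (1, -0.5)
f = lambda x: (x[0] - 1.0)**2 + 2.0 * (x[1] + 0.5)**2
print(axial_iteration(f, [4.0, 3.0]))   # approx [ 1.  -0.5]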
Gradient and Partial Derivatives
A function of several variables can be written as f(x1, x2), etc. Often we abbreviate multiple arguments in a single vector as f(x).

Let f : R^n → R. The gradient of f is the column vector of partial derivatives

    ∇f(x) := ( ∂f(x)/∂x1, ..., ∂f(x)/∂xn )^T

Suppose now a function g(x, y) with signature g : R^n × R^m → R. Its derivative with respect to just x is written ∇_x g(x, y).

Gradient and tangent plane (1st degree Taylor expansion):

    τ¹_x(y) = f(x) + (y − x)^T ∇f(x)
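As a quick numerical illustration of the gradient definition above (a Python/NumPy sketch, not part of the lecture; the helper name numerical_gradient is illustrative), the partial derivatives can be checked with central finite differences:

import numpy as np

def numerical_gradient(f, x, h=1e-6):
    # Column vector of partial derivatives df/dx_i, estimated by central differences.
    x = np.asarray(x, dtype=float)
    grad = np.zeros_like(x)
    for i in range(x.size):
        e = np.zeros_like(x)
        e[i] = h
        grad[i] = (f(x + e) - f(x - e)) / (2.0 * h)
    return grad

# Example: f(x) = x1^2 + 3*x1*x2 has gradient (2*x1 + 3*x2, 3*x1)
f = lambda x: x[0]**2 + 3.0 * x[0] * x[1]
x = np.array([1.0, 2.0])
print(numerical_gradient(f, x))              # approx [8. 3.]
print(np.array([2*x[0] + 3*x[1], 3*x[0]]))   # analytic gradient for comparison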
Choosing the direction 2: steepest descent
Move in the direction of the negative gradient −∇f(x_n)
Optimization algorithm – Steepest descent
Steepest descent
$ % & ' # ()*+,'-.#,/#'0')12&')'#3')3'-+,456*)#. 7 # .&'#47-.75) 6,-'/8
$ 9:.')#'*4,-'#;,-,;,<*.,7-#.&'#-'2#()*+,'-.#,/#*62*1/ orthogonal
. 7 .&' 3)'0,75/ /.'3 +,)'4.,7- =.)5' 7: *-1 6,-' ;,-,;,<*.,7-8>
$ ?7-/'@5'-.61A#.&'#,.')*.'/#.'-+#. 7 # <,(B<*(#+72-#.&'#0*66'1#,-#*#0')1##
,-'C4,'-. ;*--')
Gradient Descent
• Iterative method starting at an initial point x(0)
• Step to the next point x(k+1) in the direction of the
negative gradient
    x(k+1) = x(k) − ∇f(x(k))

• Repeat until ‖∇f(x(k))‖ < ε for a chosen tolerance ε
• But: no convergence is guaranteed. For convergence, an additional line search is required.

[Figure: gradient descent iterates for f(x) = ½ x1² + 5 x2².]

Line Search
• Take the descent step direction d = −∇f(x)
• Select the step length α as min_{α≥0} f(x + αd)
• In practice, α is selected with heuristics
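To make the update and line-search bullets concrete, here is a hedged Python/NumPy sketch of gradient descent with a backtracking (Armijo) line search, one possible heuristic for choosing α; the constants and function names are illustrative, not the lecture's implementation. It is applied to the quadratic f(x) = ½ x1² + 5 x2² from the figure.

import numpy as np

def backtracking_line_search(f, x, d, g, alpha=1.0, rho=0.5, c=1e-4):
    # Shrink alpha until the Armijo sufficient-decrease condition holds.
    while f(x + alpha * d) > f(x) + c * alpha * (g @ d):
        alpha *= rho
    return alpha

def gradient_descent(f, grad_f, x0, tol=1e-3, max_iter=1000):
    x = np.asarray(x0, dtype=float)
    for k in range(max_iter):
        g = grad_f(x)
        if np.linalg.norm(g) < tol:       # stop when the gradient is small
            break
        d = -g                            # descent direction d = -grad f(x)
        alpha = backtracking_line_search(f, x, d, g)
        x = x + alpha * d                 # x(k+1) = x(k) + alpha * d
    return x, k

f = lambda x: 0.5 * x[0]**2 + 5.0 * x[1]**2
grad_f = lambda x: np.array([x[0], 10.0 * x[1]])
x_min, iters = gradient_descent(f, grad_f, [1.0, 0.75])
print(x_min, iters)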
A harder case: Rosenbrock’s function
    f(x, y) = 100 (y − x²)² + (1 − x)²
[Figure: contour plot of the Rosenbrock function.]
" # $ # % & % ' #(') * ' +,, ,-
Steepest descent on Rosenbrock function
[Figure: steepest descent on the Rosenbrock function – full view and zoomed view of the iterates.]
• The zig-zag behaviour is clear in the zoomed view (100 iterations)
• The algorithm crawls down the valley
Optimization algorithm – Steepest descent 2
Optimization algorithm – Steepest descent for matrices
Conjugate Gradients – sketch only
! " # $ # % " & ' &( c o n j u g a t e g r a d i e n t s )"&&*#* *+))#**,-# '#*)#.% ',/#)0
%,&.* p n *+)" % " 1 % , % ,* 2+1/1.%##' % & /#1)" %"# $ , . , $ + $ ,. 1 3. ,% #
.+$4#/ &( *%#5*6
7 81)"9 p n ,*9)"&*#.9% & 9 4#9)&.:+21%#9% & 9 1;;95/#-,&+*9*#1/)"9',/#)%,&.*99
< , % " 9 /#*5#)%9% & 9 %"#9=#**,1.9 H>
p!nHp j ? @, @?< j < n
7 ! " # 9 /#*+;%,.29*#1/)"9',/#)%,&.*91/#9$+%+1;;C9;,.#1/;C ,.'#5#.'#.%6
7 RemarkablyD p n )1. 4# )"&*#. +*,.2 &.;C E.&<;#'2# &( p n " # , A f F x n " # G 9 9
1.'9A f F x n G 9 F*##9H+$#/,)1; I#),5#*G
Afn!Afn p
pn ? A f n B n" #
A f n!" # A f n " #
Choosing the direction 3: conjugate gradients
Again, uses first derivatives only, but avoids “undoing” previous
work
• An N-dimensional quadratic form can be minimized in at most N conjugate descent steps.
• Example (figure): 3 different starting points; the minimum is reached in exactly 2 steps.
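A sketch of a nonlinear conjugate-gradient iteration using the Fletcher-Reeves form of the update above (Python/NumPy; the 1D minimization via scipy.optimize.minimize_scalar and the test quadratic are assumptions for illustration, not the lecture's implementation):

import numpy as np
from scipy.optimize import minimize_scalar

def conjugate_gradient(f, grad_f, x0, tol=1e-6, max_iter=200):
    # Nonlinear CG: a line minimization along p_n, then the Fletcher-Reeves direction update.
    x = np.asarray(x0, dtype=float)
    g = grad_f(x)
    p = -g                                    # first direction: steepest descent
    for _ in range(max_iter):
        if np.linalg.norm(g) < tol:
            break
        alpha = minimize_scalar(lambda a: f(x + a * p)).x
        x = x + alpha * p                     # x_{n+1} = x_n + alpha * p_n
        g_new = grad_f(x)
        beta = (g_new @ g_new) / (g @ g)      # uses gradients only
        p = -g_new + beta * p                 # conjugate to the previous directions
        g = g_new
    return x

# A 2D quadratic form is minimized in (essentially) 2 conjugate steps
Q = np.array([[3.0, 1.0], [1.0, 2.0]])
f = lambda x: 0.5 * x @ Q @ x
grad_f = lambda x: Q @ x
print(conjugate_gradient(f, grad_f, [4.0, -3.0]))   # approx [0. 0.]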
The Hessian Matrix
Let f : R^n → R be twice differentiable. Its second partial derivatives make up the Hessian matrix ∇²f(x):

    ∇²f(x) := [ ∂²f(x)/∂x1∂x1  ···  ∂²f(x)/∂x1∂xn ]
              [       ⋮          ⋱         ⋮      ]
              [ ∂²f(x)/∂xn∂x1  ···  ∂²f(x)/∂xn∂xn ]

• The order of differentiation does not matter if the function has continuous second-order partial derivatives (Schwarz's theorem).
• Then the Hessian is symmetric:

    ∇²f(x) = [∇²f(x)]^T

2nd degree Taylor expansion:

    τ²_x(y) = f(x) + (y − x)^T ∇f(x) + ½ (y − x)^T ∇²f(x) (y − x)
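As a small numerical aside (a Python sketch, not from the lecture; names are illustrative), the Hessian can be approximated by finite differences of the gradient, and its symmetry checked:

import numpy as np

def numerical_hessian(grad_f, x, h=1e-5):
    # Matrix of second partial derivatives, built column by column from the gradient.
    x = np.asarray(x, dtype=float)
    n = x.size
    H = np.zeros((n, n))
    for j in range(n):
        e = np.zeros(n)
        e[j] = h
        H[:, j] = (grad_f(x + e) - grad_f(x - e)) / (2.0 * h)
    return H

# Example: f(x) = 0.5*x1^2 + 5*x2^2 has constant Hessian diag(1, 10)
grad_f = lambda x: np.array([x[0], 10.0 * x[1]])
H = numerical_hessian(grad_f, np.array([0.3, -0.2]))
print(H)
print(np.allclose(H, H.T))   # symmetric, as Schwarz's theorem predicts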
Choosing the direction 4: Newton’s method
Start from Taylor expansion in 2D
A function may be approximated locally by its Taylor series expansion about a point x:

    f(x + δx) ≈ f(x) + [∂f/∂x  ∂f/∂y] (δx, δy)^T
                     + ½ (δx, δy) [ ∂²f/∂x²   ∂²f/∂x∂y ; ∂²f/∂x∂y   ∂²f/∂y² ] (δx, δy)^T

The expansion to second order is a quadratic function:

    f(x + δx) ≈ a + g^T δx + ½ δx^T H δx

Now minimize this expansion over δx:

    min_δx f(x + δx) ≈ a + g^T δx + ½ δx^T H δx

For a minimum we require that ∇f(x + δx) = 0, and so

    ∇f(x + δx) = g + H δx = 0

with solution δx = −H⁻¹ g. (Matlab: δx = −H\g.)
This gives the iterative update

    x_{n+1} = x_n − H_n⁻¹ g_n
• If f(x) is quadratic, then the solution is found in one step.
• The method has quadratic convergence (as in the 1D case).
• The solution δx = −H_n⁻¹ g_n is guaranteed to be a downhill direction provided that H is positive definite.
• Rather than jumping straight to the predicted solution at x_n − H_n⁻¹ g_n, it is better to perform a line search:

    x_{n+1} = x_n − α_n H_n⁻¹ g_n

• If H = I then this reduces to steepest descent.
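A hedged Python sketch of the Newton update with a very simple backtracking step-length choice (an illustrative stand-in for a full line search); positive definiteness of H is assumed rather than enforced, and the Rosenbrock Hessian below is derived analytically from the formula given earlier.

import numpy as np

def newton_with_line_search(f, grad_f, hess_f, x0, tol=1e-3, max_iter=100):
    x = np.asarray(x0, dtype=float)
    for k in range(max_iter):
        g = grad_f(x)
        if np.linalg.norm(g) < tol:
            break
        dx = np.linalg.solve(hess_f(x), -g)   # Newton step: solve H dx = -g
        alpha = 1.0
        while f(x + alpha * dx) > f(x) and alpha > 1e-8:
            alpha *= 0.5                      # crude backtracking in place of a full line search
        x = x + alpha * dx                    # x_{n+1} = x_n + alpha_n * dx
    return x, k

def rosenbrock_hess(p):
    # Analytic Hessian of the Rosenbrock function from the earlier sketch.
    x, y = p
    return np.array([[1200.0 * x**2 - 400.0 * y + 2.0, -400.0 * x],
                     [-400.0 * x,                       200.0]])

# With rosenbrock and rosenbrock_grad from the earlier sketch:
# print(newton_with_line_search(rosenbrock, rosenbrock_grad, rosenbrock_hess, [-1.0, 1.0]))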
Newton’s method - example
[Figure: Newton's method with line search on the Rosenbrock function – full view and zoomed view; gradient < 1e-3 after 15 iterations; ellipses show successive quadratic approximations.]
• The algorithm converges in only 15 iterations – far superior to steepest descent.
• However, the method requires computing the Hessian matrix at each iteration – this is not always feasible.
Optimization algorithm – Newton method
Optimization algorithm – Newton2 method
Performance issues for optimization algorithms
1. Number of iterations required
2. Cost per iteration
3. Memory footprint
4. Region of convergence
Non-linear least squares
    f(x) = Σ_{i=1}^{M} r_i(x)²
Gradient
    ∇f(x) = 2 Σ_{i=1}^{M} r_i(x) ∇r_i(x)
Hessian
    H = ∇∇^T f(x) = 2 Σ_{i=1}^{M} ∇( r_i(x) ∇^T r_i(x) )
                  = 2 Σ_{i=1}^{M} ( ∇r_i(x) ∇^T r_i(x) + r_i(x) ∇∇^T r_i(x) )

which is approximated as

    H_GN = 2 Σ_{i=1}^{M} ∇r_i(x) ∇^T r_i(x)
! " , * 9 ,*9%"#9G a u s s - N e w t o n 155/&K,$1%,&.
x n ! " " x n # αnH#n"gn $ % & ' Hn ( x ) " H$% ( x n )
[Figure: Gauss-Newton method with line search on the Rosenbrock function – full view and zoomed view; gradient < 1e-3 after 14 iterations.]
• Minimization with the Gauss-Newton approximation and line search takes only 14 iterations.
Comparison
Newton  |  Gauss-Newton
[Figure: Newton method with line search, gradient < 1e-3 after 15 iterations (left); Gauss-Newton method with line search, gradient < 1e-3 after 14 iterations (right).]
Newton:
• requires computing the Hessian
• exact solution if f is quadratic

Gauss-Newton:
• approximates the Hessian by products of the residual gradients
• requires only first derivatives
Summary of minimization methods
&'()*+ x n ! " , x n ! δx
"- %+.*/0-
H δx , # g
1- $)2334%+.*/0-
HVD#δx , # g
5-6$7)(8+0* (+39+0*-
λ δx , # g
Levenberg-Marquardt algorithm
• Away from the minimum, in regions of negative curvature, the Gauss-Newton approximation is not very good.
• In such regions, a simple steepest-descent step is probably the best plan.
• The Levenberg-Marquardt method is a mechanism for varying between steepest-descent and Gauss-Newton steps depending on how good the H_GN approximation is locally.
[Figure: a 1D function with the Newton step and the gradient-descent step indicated.]
$ % & ' # ; ' . & 7 + # 5/'/#.& ' #; 7 + ,R ' + X'//,*-
H= x , λ > M H$% ! λ I
$ T & ' - #λ ,/#/;*66A# H*33)7J,;*.'/#.& ' #V*5//BD'2.7- X'//,*-8
$ T & ' - #λ ,/#6*)('A# H,/#467/'#. 7 # .& ' #,+'-.,.1A#4*5/,-(#/.''3'/.B+'/4'-.##
/.'3/#. 7 # E' .*Y'-8
LM Algorithm
    H(x, λ) = H_GN(x) + λ I

1. Set λ = 0.001 (say).
2. Solve δx = −H(x, λ)⁻¹ g.
3. If f(x_n + δx) > f(x_n), increase λ (×10 say) and go to 2.
4. Otherwise, decrease λ (×0.1 say), let x_{n+1} = x_n + δx, and go to 2.

Note: this algorithm does not require explicit line searches.
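A direct Python transcription of steps 1 to 4 (a sketch, assuming the least-squares quantities g = 2 J^T r and H_GN = 2 J^T J from the Gauss-Newton section; the constants 0.001, ×10 and ×0.1 are the suggested values above):

import numpy as np

def levenberg_marquardt(residuals, jacobian, x0, lam=1e-3, tol=1e-3, max_iter=200):
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        r = residuals(x)
        J = jacobian(x)
        g = 2.0 * J.T @ r
        if np.linalg.norm(g) < tol:
            break
        H = 2.0 * J.T @ J + lam * np.eye(x.size)   # modified Hessian H(x, lambda)
        dx = np.linalg.solve(H, -g)                # step 2: solve for dx
        if np.sum(residuals(x + dx)**2) > np.sum(r**2):
            lam *= 10.0                            # step 3: worse, behave more like gradient descent
        else:
            lam *= 0.1                             # step 4: better, behave more like Gauss-Newton
            x = x + dx
    return x

# With the Rosenbrock residuals/jacobian from the Gauss-Newton sketch:
# print(levenberg_marquardt(residuals, jacobian, [-1.0, 1.0]))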
Example
[Figure: Levenberg-Marquardt method on the Rosenbrock function – full view and zoomed view; gradient < 1e-3 after 31 iterations.]
! "#$#%#&'(#)$*+,#$-*./0/$1/2-3"'24+'25(*6 $ ) *7#$/*,/'289:*(';/,*<=**
#(/2'(#)$,>
Matlab: lsqnonlin
Comparison
Gauss-Newton  |  Levenberg-Marquardt
[Figure: Gauss-Newton method with line search, gradient < 1e-3 after 14 iterations (left); Levenberg-Marquardt method, gradient < 1e-3 after 31 iterations (right).]
• More iterations than Gauss-Newton, but:
• no line search is required,
• and it converges more frequently.