Prof. L. Guzzella
Prof. R. D’Andrea

Midterm Examination – November 12th, 2008
Dynamic Programming & Optimal Control (151-0563-00)
Solutions

Exam Duration: 150 minutes
Number of Problems: 4 (25% each)
Permitted aids: Textbook Dynamic Programming and Optimal Control by Dimitri P. Bertsekas, Vol. I, 3rd edition, 2005, 558 pages.
Your written notes.
No calculators.
Important: Use only these prepared sheets for your solutions.
Problem 1 (25%)
[Figure 1: a directed graph with start node S, terminal node T, intermediate nodes 1–10, and the arc lengths labeled on the edges.]
Find the shortest path from node S to node T for the graph given in Figure 1. Apply the label
correcting method. Use best-first search to determine at each iteration which node to remove
from OPEN; that is, remove the node i with
$$d_i = \min_{j \in \text{OPEN}} d_j,$$
where the variable $d_i$ denotes the length of the shortest path from node S to node i that has been found so far.
Solve the problem by populating a table of the following form:
| Iteration | Node exiting OPEN | OPEN | $d_S$ | $d_1$ | $d_2$ | $d_3$ | $d_4$ | $d_5$ | $d_6$ | $d_7$ | $d_8$ | $d_9$ | $d_{10}$ | $d_T$ = UPPER |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ... | | | | | | | | | | | | | | |
Solution 1
| Iteration | Node exiting OPEN | OPEN | $d_S$ | $d_1$ | $d_2$ | $d_3$ | $d_4$ | $d_5$ | $d_6$ | $d_7$ | $d_8$ | $d_9$ | $d_{10}$ | $d_T$ = UPPER |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | – | S | 0 | ∞ | ∞ | ∞ | ∞ | ∞ | ∞ | ∞ | ∞ | ∞ | ∞ | ∞ |
| 1 | S | 1, 2, 3 | 0 | 3 | 1 | 3 | ∞ | ∞ | ∞ | ∞ | ∞ | ∞ | ∞ | ∞ |
| 2 | 2 | 1, 3, 4 | 0 | 2 | 1 | 3 | 5 | ∞ | ∞ | ∞ | ∞ | ∞ | ∞ | ∞ |
| 3 | 1 | 3, 4 | 0 | 2 | 1 | 3 | 4 | ∞ | ∞ | ∞ | ∞ | ∞ | ∞ | ∞ |
| 4 | 3 | 4, 5 | 0 | 2 | 1 | 3 | 4 | 8 | ∞ | ∞ | ∞ | ∞ | ∞ | ∞ |
| 5 | 4 | 5, 6 | 0 | 2 | 1 | 3 | 4 | 7 | 5 | ∞ | ∞ | ∞ | ∞ | ∞ |
| 6 | 6 | 5, 9 | 0 | 2 | 1 | 3 | 4 | 7 | 5 | ∞ | ∞ | 6 | ∞ | 9 |
| 7 | 9 | 5 | 0 | 2 | 1 | 3 | 4 | 7 | 5 | ∞ | ∞ | 6 | ∞ | 7 |
| 8 | 5 | – | 0 | 2 | 1 | 3 | 4 | 7 | 5 | ∞ | ∞ | 6 | ∞ | 7 |
The shortest path is S → 2 → 1 → 4 → 6 → 9 → T with a total length of 7.
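For reference, the best-first label correcting iteration can be reproduced in a few lines of Python. This is a sketch under an assumption: the edge list below is reconstructed from the iteration table above, and arcs of Figure 1 that never improve a label (e.g. those reaching nodes 7, 8, and 10) are omitted.

```python
import heapq

# Arc lengths reconstructed from the iteration table above; arcs that
# never improve a label (e.g. those reaching nodes 7, 8, and 10) are
# omitted, so this edge list is an assumption, not the complete graph.
edges = {
    'S': [('1', 3), ('2', 1), ('3', 3)],
    '1': [('4', 2)],
    '2': [('1', 1), ('4', 4)],
    '3': [('5', 5)],
    '4': [('5', 3), ('6', 1)],
    '6': [('9', 1), ('T', 4)],
    '9': [('T', 1)],
}

def label_correcting(start, target):
    d = {start: 0}            # d_i: shortest path length found so far
    upper = float('inf')      # UPPER: best start-to-target length found
    open_heap = [(0, start)]  # best-first: always pop the smallest d_i
    while open_heap:
        di, i = heapq.heappop(open_heap)
        if di > d.get(i, float('inf')):
            continue          # stale OPEN entry; node was re-labeled
        for j, cij in edges.get(i, []):
            dj = di + cij
            if dj < d.get(j, float('inf')) and dj < upper:
                if j == target:
                    upper = dj        # the target never enters OPEN
                else:
                    d[j] = dj
                    heapq.heappush(open_heap, (dj, j))
    return upper

print(label_correcting('S', 'T'))  # prints 7, matching the table
```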
Problem 2 (25%)
Consider the dynamic system
$$x_{k+1} = (1-a)\, w_k + a\, u_k, \qquad 0 \le a \le 1, \quad k = 0, 1,$$
with initial state $x_0 = -1$. The cost function, to be minimized, is given by
$$\mathop{\mathbb{E}}_{w_0, w_1}\left[\, x_2^2 + \sum_{k=0}^{1} \left( x_k^2 + u_k^2 + w_k^2 \right) \right].$$
The disturbance $w_k$ takes the values 0 and 1. If $x_k \ge 0$, both values have equal probability. If $x_k < 0$, the disturbance $w_k$ is 0 with probability 1. The control $u_k$ is constrained by
$$0 \le u_k \le 1, \qquad k = 0, 1.$$
Apply the Dynamic Programming algorithm to find the optimal control policy and the optimal final cost $J_0(-1)$.
Solution 2
The optimal control problem is considered over a time horizon $N = 2$, and the cost, to be minimized, is defined by
$$g_2(x_2) = x_2^2 \qquad \text{and} \qquad g_k(x_k, u_k, w_k) = x_k^2 + u_k^2 + w_k^2, \quad k = 0, 1.$$
The DP algorithm proceeds as follows:

2nd stage:
$$J_2(x_2) = x_2^2$$

1st stage:
$$\begin{aligned}
J_1(x_1) &= \min_{0 \le u_1 \le 1} \mathbb{E}\left[\, x_1^2 + u_1^2 + w_1^2 + J_2(x_2) \,\right] \\
&= \min_{0 \le u_1 \le 1} \mathop{\mathbb{E}}_{w_1}\left[\, x_1^2 + u_1^2 + w_1^2 + J_2\big((1-a)\, w_1 + a\, u_1\big) \,\right] \\
&= \min_{0 \le u_1 \le 1} \mathop{\mathbb{E}}_{w_1}\left[\, x_1^2 + u_1^2 + w_1^2 + \big((1-a)\, w_1 + a\, u_1\big)^2 \,\right]
\end{aligned}$$

Distinguish two cases: $x_1 \ge 0$ and $x_1 < 0$.
I) $x_1 \ge 0$:
$$J_1(x_1) = \min_{0 \le u_1 \le 1} \underbrace{\left\{ x_1^2 + u_1^2 + \frac{1}{2}\Big(1 + \big((1-a) + a u_1\big)^2\Big) + \frac{1}{2}\Big(0 + \big((1-a) \cdot 0 + a u_1\big)^2\Big) \right\}}_{L(x_1, u_1)}$$
Find the minimizing $\bar u_1$ from
$$\left.\frac{\partial L}{\partial u_1}\right|_{\bar u_1} = (1-a)\, a + 2\left(1 + a^2\right) \bar u_1 \overset{!}{=} 0 \quad \Leftrightarrow \quad \bar u_1 = \frac{-a\,(1-a)}{2\left(1 + a^2\right)} \le 0.$$
Recall that the feasible set of inputs $u_1$ is given by $0 \le u_1 \le 1$. Since $L(x_1, u_1)$ is convex in $u_1$; that is,
$$\frac{\partial^2 L}{\partial u_1^2} = 2\left(1 + a^2\right) > 0,$$
and its unconstrained minimizer $\bar u_1$ lies at or below zero, $L$ is nondecreasing on the feasible set, and the feasible optimal control $u_1^*$ is given by
$$u_1^* = \mu_1^*(x_1) = 0 \qquad \forall\, x_1 \ge 0.$$
II) $x_1 < 0$:
$$J_1(x_1) = \min_{0 \le u_1 \le 1} \underbrace{\left\{ x_1^2 + \left(1 + a^2\right) u_1^2 \right\}}_{L(x_1, u_1)}$$
Find the minimizing $\bar u_1$ from
$$\left.\frac{\partial L}{\partial u_1}\right|_{\bar u_1} = 2\left(1 + a^2\right) \bar u_1 \overset{!}{=} 0 \quad \Leftrightarrow \quad \bar u_1 = 0.$$
Since the sufficient condition for a local minimum, $\left.\frac{\partial^2 L}{\partial u_1^2}\right|_{\bar u_1} > 0$, holds, the optimal control is
$$u_1^* = \mu_1^*(x_1) = 0 \qquad \forall\, x_1 < 0.$$
0th stage:
$$J_0(-1) = \min_{0 \le u_0 \le 1} \mathop{\mathbb{E}}_{w_0}\left[\, (-1)^2 + u_0^2 + w_0^2 + J_1\big((1-a)\, w_0 + a\, u_0\big) \,\right]$$
Since $x_0 < 0$, the disturbance is $w_0 = 0$ with probability 1, and we get
$$J_0(-1) = \min_{0 \le u_0 \le 1} \underbrace{\left\{ 1 + u_0^2 + J_1(a u_0) \right\}}_{L(x_0, u_0)},$$
where $a u_0 \ge 0$. From the results above (evaluating case I at $u_1^* = 0$), the optimal cost-to-go function for $x_1 \ge 0$ is
$$J_1(x_1) = \frac{1}{2} + \frac{1}{2}\,(1-a)^2 + x_1^2.$$
Finally, the minimizing $\bar u_0$ results from
$$\left.\frac{\partial L}{\partial u_0}\right|_{\bar u_0} = 2 \bar u_0 + 2 a^2 \bar u_0 \overset{!}{=} 0 \quad \Leftrightarrow \quad \bar u_0 = 0.$$
Since $\left.\frac{\partial^2 L}{\partial u_0^2}\right|_{\bar u_0} > 0$, the optimal control $u_0^*$ is
$$u_0^* = \mu_0^*(-1) = 0.$$
With this, the optimal final cost reads as
$$J_0(-1) = \frac{3}{2} + \frac{1}{2}\,(1-a)^2.$$
In brief, the optimal control policy is to always set the input to zero, which can also be verified by inspecting the equations given in the problem statement.
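As a numerical sanity check, the DP recursion can be evaluated by brute force over a gridded control set. This is a minimal sketch; the function name optimal_cost and the grid resolution are arbitrary choices, not part of the solution.

```python
def optimal_cost(a, grid=101):
    """Brute-force DP for the two-stage problem, gridding u over [0, 1]."""
    controls = [k / (grid - 1) for k in range(grid)]

    def disturbances(x):
        # w is 0 or 1 with equal probability if x >= 0, else surely 0.
        return [(0.0, 0.5), (1.0, 0.5)] if x >= 0 else [(0.0, 1.0)]

    def J(k, x):
        if k == 2:
            return x ** 2                      # terminal cost g2(x2)
        best = float('inf')
        for u in controls:
            cost = 0.0
            for w, prob in disturbances(x):
                x_next = (1 - a) * w + a * u   # system dynamics
                cost += prob * (x ** 2 + u ** 2 + w ** 2 + J(k + 1, x_next))
            best = min(best, cost)
        return best

    return J(0, -1.0)

for a in (0.0, 0.25, 0.5, 1.0):
    # The two columns agree, confirming J0(-1) = 3/2 + (1 - a)^2 / 2.
    print(a, optimal_cost(a), 1.5 + 0.5 * (1 - a) ** 2)
```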
Problem 3 (25%)
[Figure 2: a unit mass on a frictionless surface, driven by a force u from position z = 0 at time t = 0 to position z = 1 at time t = 1.]
At time t = 0, a unit mass is at rest at location z = 0. The mass is on a frictionless surface and
it is desired to apply a force u(t), 0 ≤ t ≤ 1, such that at time t = 1, the mass is at location
z = 1 and again at rest. In particular,
$$\ddot z(t) = u(t), \qquad 0 \le t \le 1, \tag{1}$$
with initial and terminal conditions:
$$z(0) = 0, \quad \dot z(0) = 0, \qquad z(1) = 1, \quad \dot z(1) = 0.$$
Of all the functions $u(t)$ that achieve the above objective, find the one that minimizes
$$\frac{1}{2} \int_0^1 u^2(t)\, dt.$$
Hint: The state for this system is $x(t) = [\, x_1(t),\ x_2(t) \,]^T$, where $x_1(t) = z(t)$ and $x_2(t) = \dot z(t)$.
Solution 3
Introduce the state vector
$$x = \begin{bmatrix} x_1 \\ x_2 \end{bmatrix} = \begin{bmatrix} z \\ \dot z \end{bmatrix}.$$
Using this notation, the dynamics read as
$$\begin{bmatrix} \dot x_1 \\ \dot x_2 \end{bmatrix} = \begin{bmatrix} x_2 \\ u \end{bmatrix}$$
with initial and terminal conditions
$$x_1(0) = 0, \quad x_2(0) = 0, \qquad x_1(1) = 1, \quad x_2(1) = 0.$$
Apply the Minimum Principle.
• The Hamiltonian is given by
$$H(x, u, p) = g(x, u) + p^T f(x, u) = \frac{1}{2} u^2 + p_1 x_2 + p_2 u.$$
• The optimal input $u^*(t)$ is obtained by minimizing the Hamiltonian along the optimal trajectory. Differentiating the Hamiltonian with respect to $u$ yields
$$u^*(t) + p_2(t) = 0 \quad \Leftrightarrow \quad u^*(t) = -p_2(t).$$
Since the second derivative of $H$ with respect to $u$ is 1, $u^*(t)$ is indeed a minimum.
• The adjoint equations,
$$\dot p_1(t) = 0, \qquad \dot p_2(t) = -p_1(t),$$
are integrated and result in the following equations:
$$p_1(t) = c_1, \qquad p_2(t) = -c_1 t - c_2, \qquad c_1, c_2 \text{ constants}.$$
Using this result, the optimal input is given by
$$u^*(t) = c_1 t + c_2.$$
• Recalling the initial and terminal conditions on $x$, we can solve for $c_1$ and $c_2$.
With the results above, the optimal state trajectory $x_2^*(t)$ follows from
$$\dot x_2^*(t) = c_1 t + c_2 \quad \Rightarrow \quad x_2^*(t) = \frac{1}{2} c_1 t^2 + c_2 t + c_3, \qquad c_3 \text{ constant},$$
and, therefore,
$$x_2^*(0) = 0 \ \Rightarrow\ c_3 = 0, \qquad x_2^*(1) = 0 \ \Rightarrow\ \frac{1}{2} c_1 + c_2 = 0 \ \Rightarrow\ c_1 = -2 c_2,$$
yielding
$$x_2^*(t) = -c_2 t^2 + c_2 t.$$
The optimal state $x_1^*(t)$ is given by
$$\dot x_1^*(t) = x_2^*(t) = -c_2 t^2 + c_2 t \quad \Rightarrow \quad x_1^*(t) = -\frac{1}{3} c_2 t^3 + \frac{1}{2} c_2 t^2 + c_4, \qquad c_4 \text{ constant}.$$
With the conditions on $x_1$, we get
$$x_1^*(0) = 0 \ \Rightarrow\ c_4 = 0, \qquad x_1^*(1) = 1 \ \Rightarrow\ -\frac{1}{3} c_2 + \frac{1}{2} c_2 = 1 \ \Rightarrow\ c_2 = 6 \ \text{ and } \ c_1 = -12.$$
• Finally, we obtain the optimal control
$$u^*(t) = -12 t + 6,$$
and the optimal state trajectory
$$x_1^*(t) = z^*(t) = -2 t^3 + 3 t^2, \qquad x_2^*(t) = \dot z^*(t) = -6 t^2 + 6 t.$$
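As a quick numerical check (a sketch; the step count is an arbitrary choice), forward-Euler integration of the double integrator under $u^*(t) = -12t + 6$ should reproduce the boundary conditions and the total cost $\int_0^1 \frac{1}{2} u^{*2}(t)\, dt = 6$:

```python
# Forward-Euler check of u*(t) = -12t + 6 on the double integrator.
N = 100_000                 # number of integration steps (arbitrary)
dt = 1.0 / N
z, zdot, cost = 0.0, 0.0, 0.0
for k in range(N):
    t = k * dt
    u = -12.0 * t + 6.0     # the optimal control derived above
    cost += 0.5 * u ** 2 * dt
    z += zdot * dt          # z'(t)  = zdot
    zdot += u * dt          # z''(t) = u
print(z, zdot, cost)        # approaches z(1) = 1, zdot(1) = 0, cost = 6
```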
Problem 4 (25%)
Recall the Minimum Principle.
Under suitable technical assumptions, the following Proposition holds:
Given the dynamic system
$$\dot x = f\left(x(t), u(t)\right), \qquad x(0) = x_0, \qquad 0 \le t \le T,$$
and the cost function, to be minimized,
$$h\left(x(T)\right) + \int_0^T g\left(x(t), u(t)\right) dt,$$
define the Hamiltonian function
$$H(x, u, p) = g(x, u) + p^T f(x, u).$$
Let $u^*(t)$, $t \in [0, T]$, be an optimal control trajectory and $x^*(t)$ the resulting state trajectory. Then,
1. $\dot p(t) = -\frac{\partial H}{\partial x}\left(x^*(t), u^*(t), p(t)\right), \qquad p(T) = \frac{\partial h}{\partial x}\left(x^*(T)\right)$,
2. $u^*(t) = \arg\min_{u \in U} H\left(x^*(t), u, p(t)\right)$,
3. $H\left(x^*(t), u^*(t), p(t)\right)$ is constant.
Show that if the dynamics and the cost are time varying – that is, $f(x, u)$ is replaced by $f(x, u, t)$ and $g(x, u)$ is replaced by $g(x, u, t)$ – the Minimum Principle becomes:
1. $\dot p(t) = -\frac{\partial H}{\partial x}\left(x^*(t), u^*(t), p(t), t\right), \qquad p(T) = \frac{\partial h}{\partial x}\left(x^*(T)\right)$,
2. $u^*(t) = \arg\min_{u \in U} H\left(x^*(t), u, p(t), t\right)$,
3. $H\left(x^*(t), u^*(t), p(t), t\right)$ is not necessarily constant,
where the Hamiltonian function is now given by
$$H(x, u, p, t) = g(x, u, t) + p^T f(x, u, t).$$
Solution 4
General idea:
Convert the problem to a time-independent one, apply the standard Minimum Principle presented in class, and simplify the resulting equations.
Follow the subsequent steps:
• Introduce an extra state variable $y(t)$ representing the time:
$$y(t) = t, \qquad \text{with} \quad \dot y(t) = 1 \quad \text{and} \quad y(0) = 0.$$
• Convert the problem into standard form by introducing the extended state $\xi = [\, x,\ y \,]^T$. The dynamics now read as
$$\dot \xi(t) = \tilde f(\xi, u) = \left[\, f(x, u, y),\ 1 \,\right]^T$$
and the cost is defined by
$$\tilde h\left(\xi(T)\right) + \int_0^T \tilde g(\xi, u)\, dt,$$
where $\tilde g(\xi, u) = g(x, u, y)$ and $\tilde h(\xi) = h(x)$.
The Hamiltonian follows from the definitions above:
$$\tilde H(\xi, u, \tilde p) = \tilde g(\xi, u) + \tilde p^T \tilde f(\xi, u), \qquad \text{with} \quad \tilde p = [\, p,\ p_y \,]^T.$$
• Apply the Minimum Principle:
Denoting the optimal control by $u^*(t)$ and the corresponding optimal state by $\xi^*(t)$, we get the following:
1. The adjoint equation is given by
$$\dot{\tilde p}(t) = -\frac{\partial \tilde H}{\partial \xi}\left(\xi^*(t), u^*(t), \tilde p(t)\right), \qquad \tilde p(T) = \frac{\partial \tilde h}{\partial \xi}\left(\xi^*(T)\right). \tag{2}$$
However,
$$\tilde H(\xi, u, \tilde p) = g(x, u, y) + p^T f(x, u, y) + p_y = H(x, u, p, y) + p_y;$$
that is,
$$\frac{\partial \tilde H}{\partial x} = \frac{\partial H}{\partial x}, \qquad \frac{\partial \tilde H}{\partial y} = \frac{\partial H}{\partial y}.$$
Moreover,
$$\frac{\partial \tilde h}{\partial x} = \frac{\partial h}{\partial x} \qquad \text{and} \qquad \frac{\partial \tilde h}{\partial y} = 0.$$
From (2), we recover the first equation,
$$\dot p(t) = -\frac{\partial H}{\partial x}\left(x^*(t), u^*(t), p(t), t\right), \qquad p(T) = \frac{\partial h}{\partial x}\left(x^*(T)\right).$$
In addition, replacing $y(t)$ by $t$ again, we get
$$\dot p_y(t) = -\frac{\partial H}{\partial t}\left(x^*(t), u^*(t), p(t), t\right), \qquad p_y(T) = 0.$$
2. The optimal input $u^*(t)$ is obtained by
$$u^*(t) = \arg\min_{u \in U} \left\{ H\left(x^*(t), u, p(t), t\right) + p_y(t) \right\} = \arg\min_{u \in U} H\left(x^*(t), u, p(t), t\right),$$
since $p_y(t)$ does not depend on $u$.
3. Finally, the standard formulation gives us that
$$H\left(x^*(t), u^*(t), p(t), t\right) + p_y(t) \quad \text{is constant.}$$
However, $p_y(t)$ is constant only if $\frac{\partial H}{\partial t} = 0$, which, in general, holds only if $f$ and $g$ do not depend on time. Hence $H\left(x^*(t), u^*(t), p(t), t\right)$ itself is not necessarily constant.
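To make claim 3 concrete, consider a small hypothetical example (chosen for illustration; it is not part of the exam): dynamics $\dot x = t\, u$, running cost $\frac{1}{2} u^2$, and terminal cost $h(x(1)) = x(1)$. The sympy sketch below confirms that $H$ varies along the optimum while $H + p_y$ stays constant, exactly as the derivation predicts.

```python
import sympy as sp

t, u, C = sp.symbols('t u C')

# Hypothetical example: xdot = t*u, g = u**2/2, h(x(1)) = x(1).
# Adjoint: pdot = -dH/dx = 0 with p(1) = dh/dx = 1, hence p(t) = 1.
p = sp.Integer(1)
H = u**2 / 2 + p * t * u                 # H(x, u, p, t); no x-dependence here
u_star = sp.solve(sp.diff(H, u), u)[0]   # minimizer of H over u: u* = -t
H_star = sp.simplify(H.subs(u, u_star))  # H along the optimum: -t**2/2

# p_y solves pydot = -dH/dt along the optimum, with p_y(1) = 0.
pydot = -sp.diff(H, t).subs(u, u_star)   # = t
py = sp.integrate(pydot, t) + C
py = py.subs(C, sp.solve(py.subs(t, 1), C)[0])

print(H_star)                    # -t**2/2 : not constant in t
print(sp.simplify(H_star + py))  # -1/2    : constant, as claimed
```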