CSC 576: Mathematical Foundations I
Ji Liu
Department of Computer Sciences, University of Rochester
September 20, 2016
1 Notations and Assumptions
In most cases (unless defined otherwise locally), we use
• Greek letters such as α, β, and γ to denote real numbers;
• Lowercase letters such as x, y, and z to denote vectors;
• Capital letters such as A, B, and C to denote matrices.
Other notations:
• R is the one dimensional Euclidean space;
• Rn is the n dimensional vector Euclidean space;
• Rm×n is the m × n dimensional matrix Euclidean space;
• R+ denotes the range [0, +∞);
• 1n ∈ Rn denotes a vector with 1 in all entries;
• For any vector $x \in \mathbb{R}^n$, we use $|x|$ to denote the entrywise absolute value vector, that is, $|x|_i = |x_i|$ for all $i = 1, \cdots, n$;
• $\odot$ denotes the component-wise product, that is, for any vectors $x$ and $y$, $(x \odot y)_i = x_i y_i$.
Some assumptions:
• Unless explicitly defined otherwise (locally), we always assume that all vectors are column vectors.
2 Vector norms, Inner product
A function $f : \mathbb{R}^n \to \mathbb{R}_+$ is called a "norm" if the following three conditions are satisfied:
• (Zero element) f (x) ≥ 0 and f (x) = 0 if and only if x = 0;
• (Homogeneous) For any α ∈ R and x ∈ Rn , f (αx) = |α|f (x);
• (Triangle inequality) Any x, y ∈ Rn satisfy f (x) + f (y) ≥ f (x + y).
1
The $\ell_2$ norm "$\|\cdot\|_2$" (a special "$f(\cdot)$") in $\mathbb{R}^n$ is defined as
$$\|x\|_2 = \left(|x_1|^2 + |x_2|^2 + \cdots + |x_n|^2\right)^{1/2}.$$
Because the $\ell_2$ norm is the most commonly used norm (also known as the Euclidean norm), we sometimes denote it by $\|\cdot\|$ for short. (Think about it: how about $f([x_1, x_2]) = 2x_1^2 + x_2^2$?)
A general $\ell_p$ norm ($p \ge 1$) is defined as
$$\|x\|_p = \left(|x_1|^p + |x_2|^p + \cdots + |x_n|^p\right)^{1/p}.$$
Note that for $p < 1$, this is not a "norm" since the triangle inequality is violated. The $\ell_\infty$ norm is defined as
$$\|x\|_\infty = \max\{|x_1|, |x_2|, \cdots, |x_n|\}.$$
One may notice that the $\ell_\infty$ norm is the limit of the $\ell_p$ norm, that is, for any $x \in \mathbb{R}^n$, $\|x\|_\infty = \lim_{p \to +\infty} \|x\|_p$. In addition, people use $\|x\|_0$ to denote the $\ell_0$ "norm", which counts the number of nonzero entries of $x$ (it is not a true norm since homogeneity fails).
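As a quick numerical sanity check, the following short sketch (assuming NumPy is available) computes a few $\ell_p$ norms of the same vector and illustrates that $\|x\|_p$ approaches $\|x\|_\infty$ as $p$ grows; the vector is an arbitrary illustrative example.
\begin{verbatim}
import numpy as np

x = np.array([3.0, -4.0, 1.0, 0.0])

# l_p norms for several p; np.linalg.norm computes (sum |x_i|^p)^(1/p)
for p in [1, 2, 4, 10, 100]:
    print(f"||x||_{p} = {np.linalg.norm(x, ord=p):.6f}")

# l_infinity norm: the largest absolute entry
print("||x||_inf =", np.linalg.norm(x, ord=np.inf))   # 4.0

# l_0 "norm": the number of nonzero entries (not a true norm)
print("||x||_0   =", np.count_nonzero(x))             # 3
\end{verbatim}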
The inner product $\langle \cdot, \cdot \rangle$ in $\mathbb{R}^n$ is defined as
$$\langle x, y \rangle = \sum_i x_i y_i.$$
One can show that $\langle x, x \rangle = \|x\|^2$. Two vectors $x$ and $y$ are orthogonal if $\langle x, y \rangle = 0$. That is one reason why the $\ell_2$ norm is so special.
If $p \ge q$, then for any $x \in \mathbb{R}^n$ we have $\|x\|_p \le \|x\|_q$. In particular, we have
$$\|x\|_1 \ge \|x\|_2 \ge \|x\|_\infty.$$
To bound from the other side, we have
$$\|x\|_1 \le \sqrt{n}\,\|x\|_2, \qquad \|x\|_2 \le \sqrt{n}\,\|x\|_\infty.$$
Proof. To see the first one, we have
$$\|x\|_1 = \langle 1_n, |x| \rangle \le \|1_n\|_2 \,\| |x| \|_2 = \sqrt{n}\,\|x\|_2,$$
where the inequality uses the Cauchy-Schwarz inequality. The proof of the second inequality is left as homework.
Given a norm "$\|\cdot\|_A$", its dual norm is defined as
$$\|x\|_{A^*} = \max_{\|y\|_A \le 1} \langle x, y \rangle = \max_{\|y\|_A = 1} \langle x, y \rangle = \max_{z \ne 0} \frac{\langle x, z \rangle}{\|z\|_A}.$$
Several important properties of the dual norm:
• The dual norm's dual norm is itself, that is, $\|x\|_{(A^*)^*} = \|x\|_A$;
• The $\ell_2$ norm is self-dual, that is, the dual norm of the $\ell_2$ norm is still the $\ell_2$ norm;
• The dual norm of the $\ell_p$ norm ($p \ge 1$) is the $\ell_q$ norm, where $p$ and $q$ satisfy $1/p + 1/q = 1$. In particular, the $\ell_1$ norm and the $\ell_\infty$ norm are dual to each other;
• (Hölder inequality) $\langle x, y \rangle \le \|x\|_A \|y\|_{A^*}$.
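As an illustrative sketch (assuming NumPy is available), the code below estimates the dual norm of the $\ell_1$ norm by maximizing $\langle x, y\rangle$ over the extreme points $\pm e_i$ of the $\ell_1$ ball, recovering $\|x\|_\infty$, and checks the Hölder inequality $\langle x, y\rangle \le \|x\|_1 \|y\|_\infty$ on random pairs; all data here are illustrative.
\begin{verbatim}
import numpy as np

rng = np.random.default_rng(0)
n = 5
x = rng.standard_normal(n)

# Dual norm of l_1: maximize <x, y> over ||y||_1 <= 1.
# The maximum is attained at an extreme point y = +/- e_i,
# so it equals max_i |x_i| = ||x||_inf.
extreme_points = np.vstack([np.eye(n), -np.eye(n)])    # the vectors +/- e_i
dual_l1 = (extreme_points @ x).max()
print(dual_l1, np.linalg.norm(x, np.inf))              # the two values agree

# Hölder inequality <x, y> <= ||x||_1 ||y||_inf on random pairs
for _ in range(1000):
    y = rng.standard_normal(n)
    assert x @ y <= np.linalg.norm(x, 1) * np.linalg.norm(y, np.inf) + 1e-12
print("Hölder inequality verified on random samples")
\end{verbatim}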
3 Linear space, subspace, linear transformation
A set S is a linear space if
• 0 ∈ S;
• given any two points $x, y \in S$ and any two scalars $\alpha \in \mathbb{R}$ and $\beta \in \mathbb{R}$, we have $\alpha x + \beta y \in S$.
Note that ∅ is not a linear space. Examples: vector space Rn , matrix space Rm×n . How about the
following things:
• 0; (no)
• {0}; (yes)
• {x | Ax = b} where A is a matrix and b is a vector. (b = 0 yes; otherwise, no)
Let S be a linear space. A set S 0 is a subspace if S 0 is a linear space and also a subset of S.
Actually, “subspace” is equivalent to “linear space”, because any subspace is a linear space and
any linear space is a subspace. They are indeed talking about the same thing.
Let S be a linear space. A function L(·) is a linear transformation if given any two points
x, y ∈ S and two scalars α ∈ R and β ∈ R, one has
L(αx + βy) = αL(x) + βL(y).
For vector spaces, there exists a 1-1 correspondence between linear transformations and matrices. Therefore, we can simply say "a matrix is a linear transformation".
• Prove that $\{L(x) \mid x \in S\}$ is a linear space if $S$ is a linear space and $L$ is a linear transformation.
• Prove that $\{x \mid L(x) \in S\}$ is a linear space, assuming $S$ is a linear space and $L$ is a linear transformation.
How do we express a subspace? The most intuitive way is to use a bunch of vectors. A subspace can be expressed by
$$\mathrm{span}\{x_1, x_2, \cdots, x_n\} = \left\{\sum_{i=1}^n \alpha_i x_i \;\middle|\; \alpha_i \in \mathbb{R}\right\} = \{X\alpha \mid \alpha \in \mathbb{R}^n\},$$
where $X = [x_1, x_2, \cdots, x_n]$; this is called the range space of the matrix $X$. A subspace can also be represented by the null space of $X$:
$$\{\alpha \mid X\alpha = 0\}.$$
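As a minimal sketch (assuming NumPy is available), one can compute an orthonormal basis of the null space $\{\alpha \mid X\alpha = 0\}$ from the SVD of $X$, and check whether a given vector lies in the range space of $X$ via a least-squares residual; the matrix and vector below are illustrative.
\begin{verbatim}
import numpy as np

# Columns x1, x2, x3 with x3 = x1 + x2, so the range space has dimension 2.
X = np.array([[1.0, 0.0, 1.0],
              [0.0, 1.0, 1.0],
              [1.0, 1.0, 2.0]])

# Null space {alpha | X alpha = 0}: right singular vectors whose singular value is ~0
U, s, Vt = np.linalg.svd(X)
null_basis = Vt[s < 1e-10].T           # works here because X is square
print("null space basis:\n", null_basis)   # spanned by (1, 1, -1)/sqrt(3), up to sign

# Is b in the range space of X?  Check the least-squares residual.
b = np.array([1.0, 2.0, 3.0])          # = x1 + 2*x2, so yes
alpha, res, rank, _ = np.linalg.lstsq(X, b, rcond=None)
print("residual:", np.linalg.norm(X @ alpha - b))   # ~0, so b is in the range
\end{verbatim}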
4 Eigenvalues / eigenvectors, rank, SVD, inverse
The transpose of a matrix $A \in \mathbb{R}^{m\times n}$ is defined as $A^T \in \mathbb{R}^{n\times m}$ with
$$(A^T)_{ij} = A_{ji}.$$
One can verify that
$$(AB)^T = B^T A^T.$$
A matrix B ∈ Rn×n is the inverse of an invertible matrix A ∈ Rn×n if
AB = I and BA = I.
$B$ is denoted by $A^{-1}$. $A$ has an inverse if and only if $A$ has full rank (the definition of "rank" will be given very soon). Note that the inverse of a matrix is unique. One can also verify
that if both A and B are invertible, then
(AB)−1 = B −1 A−1 .
The “transpose” and the “inverse” are exchangeable:
(AT )−1 = (A−1 )T .
When we write A−1 , we have to make sure that A is invertible.
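These identities are easy to verify numerically; here is a minimal sketch (assuming NumPy is available) with random matrices shifted along the diagonal so that they are safely invertible.
\begin{verbatim}
import numpy as np

rng = np.random.default_rng(1)
n = 4
A = rng.standard_normal((n, n)) + n * np.eye(n)   # shifted to be safely invertible
B = rng.standard_normal((n, n)) + n * np.eye(n)

inv = np.linalg.inv
# (AB)^{-1} = B^{-1} A^{-1}
print(np.allclose(inv(A @ B), inv(B) @ inv(A)))    # True
# (A^T)^{-1} = (A^{-1})^T
print(np.allclose(inv(A.T), inv(A).T))             # True
\end{verbatim}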
Given a square matrix $A \in \mathbb{R}^{n\times n}$, $x \in \mathbb{R}^n$ ($x \ne 0$) is called its eigenvector and $\lambda \in \mathbb{R}$ is called its eigenvalue if the following relationship is satisfied:
$$Ax = \lambda x.$$
(The effect of applying the linear transformation $A$ on $x$ is nothing but scaling it.)
Note that
• If $\{\lambda, x\}$ is an eigenvalue-eigenvector pair, then so is $\{\lambda, \alpha x\}$ for any $\alpha \ne 0$.
• One eigenvalue may correspond to multiple different eigenvectors. "Different" means the eigenvectors are still different after normalization.
If the matrix $A$ is symmetric, then any two eigenvectors corresponding to different eigenvalues are orthogonal, that is, if $A^T = A$, $Ax_1 = \lambda_1 x_1$, $Ax_2 = \lambda_2 x_2$, and $\lambda_1 \ne \lambda_2$, then
$$x_1^T x_2 = 0.$$
Proof. Consider $x_1^T A x_2$. We have
$$x_1^T A x_2 = x_1^T (A x_2) = x_1^T (\lambda_2 x_2) = \lambda_2 x_1^T x_2,$$
and
$$x_1^T A x_2 = (x_1^T A) x_2 = (A^T x_1)^T x_2 \overset{A = A^T}{=} (A x_1)^T x_2 = \lambda_1 x_1^T x_2.$$
Therefore, we have
$$\lambda_2 x_1^T x_2 = \lambda_1 x_1^T x_2.$$
Since $\lambda_1 \ne \lambda_2$, we obtain $x_1^T x_2 = 0$.
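A quick numerical illustration of this fact (a sketch assuming NumPy is available): the eigenvectors of a random symmetric matrix, as returned by numpy.linalg.eigh, are mutually orthogonal.
\begin{verbatim}
import numpy as np

rng = np.random.default_rng(0)
M = rng.standard_normal((5, 5))
A = M + M.T                           # a random symmetric matrix

eigvals, eigvecs = np.linalg.eigh(A)  # columns of eigvecs are eigenvectors
# Pairwise inner products form the identity matrix, i.e., the eigenvectors are orthonormal.
print(np.allclose(eigvecs.T @ eigvecs, np.eye(5)))   # True
# Each column indeed satisfies A x = lambda x.
print(np.allclose(A @ eigvecs, eigvecs * eigvals))   # True
\end{verbatim}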
A matrix $A \in \mathbb{R}^{m\times n}$ is a "rank-1" matrix if $A$ can be expressed as
$$A = xy^T,$$
where $x \in \mathbb{R}^m$ and $y \in \mathbb{R}^n$, with $x \ne 0$ and $y \ne 0$. The rank of a matrix $A \in \mathbb{R}^{m\times n}$ is defined as
$$\mathrm{rank}(A) = \min\left\{ r \;\middle|\; A = \sum_{i=1}^r x_i y_i^T,\ x_i \in \mathbb{R}^m,\ y_i \in \mathbb{R}^n \right\} = \min\left\{ r \;\middle|\; A = \sum_{i=1}^r B_i,\ B_i \text{ is a "rank-1" matrix} \right\}.$$
Examples: $[1, 1; 1, 1]$, $[1, 1; 2, 2]$, and many natural images have the low-rank property. "Low rank" implies that the matrix contains only a small amount of information.
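The examples above can be checked directly; a sketch (assuming NumPy is available) showing that $[1, 1; 2, 2]$ has rank 1 and can be written as a single outer product $xy^T$:
\begin{verbatim}
import numpy as np

A = np.array([[1.0, 1.0],
              [2.0, 2.0]])
print(np.linalg.matrix_rank(A))      # 1

# A single rank-1 term reproduces A: A = x y^T with x = (1, 2)^T, y = (1, 1)^T
x = np.array([[1.0], [2.0]])
y = np.array([[1.0], [1.0]])
print(np.allclose(A, x @ y.T))       # True
\end{verbatim}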
We say "$U \in \mathbb{R}^{m\times n}$ has orthogonal columns" if $U^T U = I$, that is, any two columns $U_{\cdot i}$ and $U_{\cdot j}$ of $U$ satisfy
$$U_{\cdot i}^T U_{\cdot j} = 0 \text{ if } i \ne j; \qquad U_{\cdot i}^T U_{\cdot i} = 1.$$
If we swap any two columns of $U$ to obtain $U'$, then $U'$ still satisfies $U'^T U' = I$.
• $\|Ux\| = \|x\|$ for all $x$.
• $\|U^T y\| \le \|y\|$ for all $y$.
If $U$ is a square matrix and has orthogonal columns, then we call it an "orthogonal matrix". It has some nice properties:
• $U^{-1} = U^T$ (which means that $UU^T = U^T U = I$).
• $U^T$ is also an orthogonal matrix.
• The effect of applying the transformation $U$ to a vector $x$ is to rotate it, that is, $\|Ux\| = \|x\| = \|U^T x\|$.
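As a small sketch (assuming NumPy is available), an orthogonal matrix obtained from a QR factorization of a random square matrix satisfies the properties listed above:
\begin{verbatim}
import numpy as np

rng = np.random.default_rng(0)
# QR factorization of a random square matrix yields an orthogonal Q
Q, _ = np.linalg.qr(rng.standard_normal((4, 4)))

print(np.allclose(Q.T @ Q, np.eye(4)))            # Q^T Q = I
print(np.allclose(np.linalg.inv(Q), Q.T))         # Q^{-1} = Q^T
x = rng.standard_normal(4)
print(np.isclose(np.linalg.norm(Q @ x), np.linalg.norm(x)))    # ||Qx|| = ||x||
print(np.isclose(np.linalg.norm(Q.T @ x), np.linalg.norm(x)))  # ||Q^T x|| = ||x||
\end{verbatim}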
"SVD" is short for "singular value decomposition", which is the most important concept in linear algebra and matrix analysis. The SVD reveals almost all of the structure of a matrix. Any matrix $A \in \mathbb{R}^{m\times n}$ can be decomposed into
$$A = U \Sigma V^T = \sum_{i=1}^r \sigma_i U_{\cdot i} V_{\cdot i}^T,$$
where $U \in \mathbb{R}^{m\times r}$ and $V \in \mathbb{R}^{n\times r}$ have orthogonal columns, and $\Sigma = \mathrm{diag}\{\sigma_1, \sigma_2, \cdots, \sigma_r\}$ is a diagonal matrix with positive diagonal elements. The $\sigma_i$'s are called singular values; they are positive and arranged in decreasing order.
• rank(A) = r;
• $\|Ax\| \le \sigma_1 \|x\|$. Why?
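A brief sketch (assuming NumPy is available) that computes the SVD of a rank-deficient matrix and checks the two bullet points above; the matrix is built as a sum of two random rank-1 terms.
\begin{verbatim}
import numpy as np

rng = np.random.default_rng(0)
# Build a 6 x 4 matrix of rank 2 as a sum of two rank-1 matrices
A = np.outer(rng.standard_normal(6), rng.standard_normal(4)) \
  + np.outer(rng.standard_normal(6), rng.standard_normal(4))

U, s, Vt = np.linalg.svd(A)
r = np.sum(s > 1e-10)
print("rank(A) =", r, "=", np.linalg.matrix_rank(A))     # both equal 2

# ||Ax|| <= sigma_1 ||x|| for random x (equality when x is the top right singular vector)
for _ in range(1000):
    x = rng.standard_normal(4)
    assert np.linalg.norm(A @ x) <= s[0] * np.linalg.norm(x) + 1e-10
print("||Ax|| <= sigma_1 ||x|| verified")
print(np.isclose(np.linalg.norm(A @ Vt[0]), s[0]))       # equality at the top singular vector
\end{verbatim}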
A matrix $B \in \mathbb{R}^{n\times n}$ is positive semi-definite (PSD) if the following conditions are satisfied:
• B is symmetric;
• ∀x ∈ Rn , we have xT Bx ≥ 0.
A positive definite matrix is defined by adding one more condition:
• xT Bx = 0 ⇔ x = 0.
We can also use the following equivalent definition for PSD matrices: a matrix $B \in \mathbb{R}^{n\times n}$ is positive semi-definite (PSD) if the SVD of $B$ can be written as
$$B = U \Sigma U^T,$$
where $U^T U = I$ and $\Sigma$ is a diagonal matrix with nonnegative diagonal elements. Examples of PSD matrices: $I$, $A^T A$.
Assume matrices A and B are invertible. We have the following identity:
B −1 = A−1 − B −1 (B − A)A−1 .
The Sherman-Morrison-Woodbury formula is very useful for calculating matrix inverses:
$$(A + UV^T)^{-1} = A^{-1} - A^{-1} U (I + V^T A^{-1} U)^{-1} V^T A^{-1}.$$
This result is especially important from the perspective of computation. A special case is when $U$ and $V$ are two vectors $u$ and $v$; then it takes the form
$$(A + uv^T)^{-1} = A^{-1} - (1 + v^T A^{-1} u)^{-1} A^{-1} u v^T A^{-1},$$
which can be calculated with complexity $O(n^2)$ if $A^{-1}$ is known.
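A sketch (assuming NumPy is available) of the rank-1 special case: the update of $A^{-1}$ matches a direct inversion of $A + uv^T$ while using only matrix-vector and outer products, hence $O(n^2)$ once $A^{-1}$ is known.
\begin{verbatim}
import numpy as np

rng = np.random.default_rng(0)
n = 5
A = rng.standard_normal((n, n)) + n * np.eye(n)   # shifted to be safely invertible
u = rng.standard_normal((n, 1))
v = rng.standard_normal((n, 1))

A_inv = np.linalg.inv(A)                          # assumed to be known already
# Sherman-Morrison: (A + u v^T)^{-1} = A^{-1} - (1 + v^T A^{-1} u)^{-1} A^{-1} u v^T A^{-1}
correction = (A_inv @ u) @ (v.T @ A_inv) / (1.0 + v.T @ A_inv @ u)
updated_inv = A_inv - correction

print(np.allclose(updated_inv, np.linalg.inv(A + u @ v.T)))   # True
\end{verbatim}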
Sylvester's determinant theorem states that, for $A \in \mathbb{R}^{m\times n}$ and $B \in \mathbb{R}^{n\times m}$,
$$\det(I_m + AB) = \det(I_n + BA).$$
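A one-line numerical check of this identity (a sketch assuming NumPy is available), with random rectangular $A$ and $B$:
\begin{verbatim}
import numpy as np

rng = np.random.default_rng(0)
m, n = 3, 5
A = rng.standard_normal((m, n))
B = rng.standard_normal((n, m))
# det(I_m + A B) = det(I_n + B A)
print(np.isclose(np.linalg.det(np.eye(m) + A @ B),
                 np.linalg.det(np.eye(n) + B @ A)))   # True
\end{verbatim}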
5 Matrix norms (spectral norm, nuclear norm, Frobenius norm)
The Frobenius norm (F-norm) of a matrix $A \in \mathbb{R}^{m\times n}$ is defined as
$$\|A\|_F = \left(\sum_{1\le i\le m,\ 1\le j\le n} |A_{ij}|^2\right)^{1/2} = \left(\sum_i \sigma_i^2\right)^{1/2}.$$
If $A$ is a vector, one can verify that $\|A\|_F = \|A\|_2$.
The inner product $\langle \cdot, \cdot \rangle$ in $\mathbb{R}^{m\times n}$ is defined as
$$\langle X, Y \rangle = \sum_{i,j} X_{ij} Y_{ij} = \mathrm{trace}(X^T Y) = \mathrm{trace}(Y X^T) = \mathrm{trace}(X Y^T) = \mathrm{trace}(Y^T X).$$
An important property of $\mathrm{trace}(AB)$:
$$\mathrm{trace}(AB) = \mathrm{trace}(BA) = \mathrm{trace}(A^T B^T) = \mathrm{trace}(B^T A^T).$$
One may notice that $\langle X, X \rangle = \|X\|_F^2$.
The spectral (operator) norm of a matrix $A \in \mathbb{R}^{m\times n}$ is defined as
$$\|A\|_{\mathrm{spec}} = \max_{\|x\|=1} \|Ax\| = \max_{\|x\|=1,\ \|y\|=1} y^T A x = \sigma_1(A).$$
The nuclear norm of a matrix $A \in \mathbb{R}^{m\times n}$ is defined as
$$\|A\|_{\mathrm{tr}} = \sum_i \sigma_i(A) = \mathrm{trace}(\Sigma),$$
where $\Sigma$ is the diagonal matrix in the SVD $A = U\Sigma V^T$.
An important relationship:
$$\|A\|_{\mathrm{spec}} \le \|A\|_F \le \|A\|_{\mathrm{tr}} \qquad \text{and} \qquad \mathrm{rank}(A)\,\|A\|_{\mathrm{spec}} \ge \sqrt{\mathrm{rank}(A)}\,\|A\|_F \ge \|A\|_{\mathrm{tr}}.$$
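A sketch (assuming NumPy is available) that evaluates all three norms of a random low-rank matrix and confirms the chain of inequalities above:
\begin{verbatim}
import numpy as np

rng = np.random.default_rng(0)
# Random 8 x 6 matrix of rank 3
A = rng.standard_normal((8, 3)) @ rng.standard_normal((3, 6))

s = np.linalg.svd(A, compute_uv=False)
spec = s[0]                      # spectral norm = largest singular value
fro = np.linalg.norm(A, 'fro')   # Frobenius norm = sqrt of sum of squared singular values
nuc = s.sum()                    # nuclear (trace) norm = sum of singular values
r = np.linalg.matrix_rank(A)

print(spec <= fro <= nuc)                         # True
print(r * spec >= np.sqrt(r) * fro >= nuc)        # True
\end{verbatim}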
The dual norm of a matrix norm $\|\cdot\|_A$ is defined as
$$\|Y\|_{A^*} := \max_{X \ne 0} \frac{\langle X, Y \rangle}{\|X\|_A} = \max_{\|X\|_A \le 1} \langle X, Y \rangle. \qquad (1)$$
We have the following properties (think about why they are true):
$$\|X\|_{\mathrm{spec}^*} = \|X\|_{\mathrm{tr}}, \qquad \|X\|_{F^*} = \|X\|_F.$$
6 Matrix and Vector Differential
Let $f(X) : \mathbb{R}^{m\times n} \to \mathbb{R}$ be a function of a matrix $X \in \mathbb{R}^{m\times n}$. Its differential (or gradient) is defined as
$$\frac{\partial f(X)}{\partial X} = \begin{bmatrix}
\frac{\partial f(X)}{\partial X_{11}} & \cdots & \frac{\partial f(X)}{\partial X_{1j}} & \cdots & \frac{\partial f(X)}{\partial X_{1n}} \\
\cdots & \cdots & \cdots & \cdots & \cdots \\
\frac{\partial f(X)}{\partial X_{i1}} & \cdots & \frac{\partial f(X)}{\partial X_{ij}} & \cdots & \frac{\partial f(X)}{\partial X_{in}} \\
\cdots & \cdots & \cdots & \cdots & \cdots \\
\frac{\partial f(X)}{\partial X_{m1}} & \cdots & \frac{\partial f(X)}{\partial X_{mj}} & \cdots & \frac{\partial f(X)}{\partial X_{mn}}
\end{bmatrix}.$$
We provide a few examples below:
$$f(X) = \mathrm{trace}(A^T X) = \langle A, X \rangle \quad\Longrightarrow\quad \frac{\partial f(X)}{\partial X} = A;$$
$$f(X) = \mathrm{trace}(X^T A X) \quad\Longrightarrow\quad \frac{\partial f(X)}{\partial X} = (A + A^T) X;$$
$$f(X) = \frac{1}{2}\|AX - B\|_F^2 \quad\Longrightarrow\quad \frac{\partial f(X)}{\partial X} = A^T (AX - B);$$
$$f(X) = \frac{1}{2}\mathrm{trace}(B^T X^T X B) \quad\Longrightarrow\quad \frac{\partial f(X)}{\partial X} = X B B^T;$$
$$f(X) = \frac{1}{2}\mathrm{trace}(B^T X^T A X B) \quad\Longrightarrow\quad \frac{\partial f(X)}{\partial X} = \frac{1}{2}(A + A^T) X B B^T.$$
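These formulas can be validated with a finite-difference check. Below is a sketch (assuming NumPy is available) for the third example, $f(X) = \frac{1}{2}\|AX - B\|_F^2$ with gradient $A^T(AX - B)$; the same pattern works for the other examples, and the matrix sizes are arbitrary illustrations.
\begin{verbatim}
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((5, 4))
B = rng.standard_normal((5, 3))
X = rng.standard_normal((4, 3))

f = lambda X: 0.5 * np.linalg.norm(A @ X - B, 'fro') ** 2
grad = A.T @ (A @ X - B)              # claimed closed-form gradient

# Finite-difference approximation of each partial derivative d f / d X_ij
eps = 1e-6
num_grad = np.zeros_like(X)
for i in range(X.shape[0]):
    for j in range(X.shape[1]):
        E = np.zeros_like(X)
        E[i, j] = eps
        num_grad[i, j] = (f(X + E) - f(X - E)) / (2 * eps)

print(np.max(np.abs(num_grad - grad)))   # tiny, so the formula checks out
\end{verbatim}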