Lecture from 11.12.2024 | Video: Videos ETHZ
Eigenvalues and Eigenvectors: A Summary
Let’s recap some key properties of eigenvalues and eigenvectors for a matrix $A \in \mathbb{R}^{n \times n}$ (a short numerical sanity check of a few of these is sketched after the list):
- Complex Conjugate Eigenvalue: If $(\lambda, v)$ is an eigenvalue-eigenvector pair of $A$, then $(\bar{\lambda}, \bar{v})$ is also an eigenvalue-eigenvector pair, where $\bar{\lambda}$ and $\bar{v}$ denote the complex conjugates.
- Eigenvalues of $A$ and $A^T$: $A$ and $A^T$ share the same eigenvalues, but their eigenvectors are generally different.
- Eigenvalues of $A^{-1}$: If $A$ is invertible and $(\lambda, v)$ is an eigenvalue-eigenvector pair of $A$, then $(\frac{1}{\lambda}, v)$ is an eigenvalue-eigenvector pair for $A^{-1}$. Note that $\lambda \neq 0$ since $A$ is invertible.
- Linearity of Eigenvalues: Eigenvalues do not behave linearly under matrix addition or multiplication. In general, the eigenvalues of $A + B$ are not the sums of the eigenvalues of $A$ and $B$, and the eigenvalues of $AB$ are not the products of the eigenvalues of $A$ and $B$.
- Gaussian Elimination and Eigenvalues: Gaussian elimination does not preserve eigenvalues.
- Eigenvalues of Orthogonal Matrices: If $Q$ is an orthogonal matrix ($Q^T Q = I$) and $\lambda$ is an eigenvalue of $Q$, then $|\lambda| = 1$.
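A minimal numpy sketch that spot-checks a few of these properties; the matrices below are arbitrary examples, not taken from the lecture:

```python
import numpy as np

# Arbitrary example matrix (not from the lecture).
A = np.array([[2.0, 1.0],
              [1.0, 3.0]])

eig_A = np.linalg.eigvals(A)
eig_AT = np.linalg.eigvals(A.T)
eig_Ainv = np.linalg.eigvals(np.linalg.inv(A))

# A and A^T share the same eigenvalues.
print(np.allclose(np.sort(eig_A), np.sort(eig_AT)))          # True

# The eigenvalues of A^{-1} are the reciprocals of those of A.
print(np.allclose(np.sort(eig_Ainv), np.sort(1.0 / eig_A)))  # True

# Eigenvalues of an orthogonal matrix (here a rotation) have absolute value 1.
theta = 0.7
Q = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])
print(np.allclose(np.abs(np.linalg.eigvals(Q)), 1.0))        # True
```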
Theorem: Basis of Eigenvectors for Distinct Eigenvalues
If $A \in \mathbb{R}^{n \times n}$ has $n$ distinct real eigenvalues, then there exists a basis of $\mathbb{R}^n$ consisting of eigenvectors of $A$. This basis enables diagonalization, simplifying various computations involving $A$. However, what happens when eigenvalues are repeated?
Repeated Eigenvalues: Challenges in Building a Basis
When eigenvalues are repeated, constructing a basis of eigenvectors becomes more challenging.
Definition: The algebraic multiplicity of an eigenvalue is the number of times it appears as a root of the characteristic polynomial.
Example
- $A = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}$: The characteristic polynomial is $\det(A - \lambda I) = \lambda^2$. The eigenvalue 0 has algebraic multiplicity 2. The eigenspace corresponding to $\lambda = 0$ is $\ker(A)$, which is spanned by the vector $e_1 = (1, 0)^T$. Its dimension is 1. Thus, there is only one linearly independent eigenvector.
- $A = \begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix}$: The characteristic polynomial is again $\lambda^2$. $\lambda = 0$ is the only eigenvalue, again with algebraic multiplicity 2. However, the eigenspace is all of $\mathbb{R}^2$ (every non-zero vector is an eigenvector), which has dimension 2. We have two linearly independent eigenvectors $e_1$ and $e_2$ (a short numerical check of both matrices follows the list).
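A quick numpy check of the two matrices above: we count linearly independent eigenvectors via the rank of the eigenvector matrix returned by `np.linalg.eig`:

```python
import numpy as np

A1 = np.array([[0.0, 1.0],
               [0.0, 0.0]])   # first example: repeated eigenvalue 0, eigenspace of dimension 1
A2 = np.zeros((2, 2))         # second example: repeated eigenvalue 0, eigenspace of dimension 2

for A in (A1, A2):
    w, V = np.linalg.eig(A)
    # Number of linearly independent eigenvectors = rank of the eigenvector matrix.
    print(np.round(w, 3), np.linalg.matrix_rank(V))
# A1 -> eigenvalues [0. 0.], rank 1 (no eigenvector basis)
# A2 -> eigenvalues [0. 0.], rank 2 (eigenvector basis exists)
```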
These examples illustrate that a repeated eigenvalue might not yield enough linearly independent eigenvectors to form a basis. This leads to the following definition:
Complete Set of Real Eigenvectors
Definition: A matrix $A \in \mathbb{R}^{n \times n}$ has a complete set of real eigenvectors if there exists a basis of $\mathbb{R}^n$ consisting of eigenvectors of $A$. In our example above, the first matrix does not have such a complete set of eigenvectors, whereas the second matrix does.
When Do We Have a Complete Set of Real Eigenvectors?
A crucial question in linear algebra is whether a given matrix possesses a complete set of eigenvectors that form a basis for the vector space. This property, known as diagonalizability, significantly simplifies matrix computations and analysis.
Definition: Complete Set of Eigenvectors: A matrix $A \in \mathbb{R}^{n \times n}$ has a complete set of real eigenvectors if there exists a basis of $\mathbb{R}^n$ consisting entirely of eigenvectors of $A$.
Let’s examine specific cases:
1. Distinct Eigenvalues
Proposition: If a matrix $A \in \mathbb{R}^{n \times n}$ has $n$ distinct real eigenvalues, then it has a complete set of $n$ linearly independent eigenvectors, hence forming a basis of $\mathbb{R}^n$.
Proof: We have already proved (in the last lecture note [[24 Explicit Fibonacci Formula, Eigenvalue and Eigenvector properties, Trace#Key Properties and Results#Linear Independence of Eigenvectors]]) that eigenvectors corresponding to distinct eigenvalues are linearly independent. If a matrix has $n$ distinct eigenvalues, it consequently has $n$ linearly independent eigenvectors. Since these eigenvectors reside in $\mathbb{R}^n$, $n$ linearly independent vectors form a basis. Therefore, the matrix has a complete set of eigenvectors.
2. Diagonal Matrices
Proposition: A diagonal matrix always has a complete set of linearly independent eigenvectors.
Proof: Let $D = \operatorname{diag}(d_1, \dots, d_n)$ be a diagonal matrix. The characteristic polynomial is given by:
$$\det(D - \lambda I) = (d_1 - \lambda)(d_2 - \lambda) \cdots (d_n - \lambda).$$
Setting the characteristic polynomial to zero, we find that the eigenvalues are the diagonal entries: $\lambda_i = d_i$ for $i = 1, \dots, n$. Now let’s consider the standard basis vectors $e_1, \dots, e_n$ ($e_i$ is the vector with a 1 in the $i$-th position and 0 elsewhere). Observe:
$$D e_i = d_i e_i.$$
Thus, each standard basis vector $e_i$ is an eigenvector corresponding to the eigenvalue $d_i$. Since the standard basis vectors are linearly independent and span $\mathbb{R}^n$, the diagonal matrix has a complete set of linearly independent eigenvectors. Note that if there are repeated entries on the diagonal, we still have a complete set of linearly independent eigenvectors; it’s just that multiple eigenvectors will correspond to the same eigenvalue.
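For instance (a concrete $3 \times 3$ illustration, not from the lecture):
$$D = \begin{pmatrix} 2 & 0 & 0 \\ 0 & 5 & 0 \\ 0 & 0 & 5 \end{pmatrix}, \qquad D e_2 = \begin{pmatrix} 0 \\ 5 \\ 0 \end{pmatrix} = 5\, e_2, \qquad D e_3 = 5\, e_3,$$
so the eigenvalue $5$ is repeated, yet $e_2$ and $e_3$ are still two linearly independent eigenvectors for it.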
3. Repeated Eigenvalues: Algebraic and Geometric Multiplicity
When an eigenvalue is repeated (i.e., its algebraic multiplicity is greater than 1), the situation is more nuanced.
Definition: Algebraic Multiplicity
The algebraic multiplicity of an eigenvalue is the number of times it appears as a root of the characteristic polynomial.
Definition: Geometric Multiplicity
The geometric multiplicity of an eigenvalue $\lambda$ is the dimension of its eigenspace $\ker(A - \lambda I)$, i.e. the number of linearly independent eigenvectors associated with that eigenvalue.
Explanation:
- Algebraic multiplicity counts how many times an eigenvalue appears as a root of the characteristic polynomial. This tells us how many eigenvectors we expect for that eigenvalue.
- Geometric multiplicity counts the maximum number of linearly independent eigenvectors we can actually find for an eigenvalue. This is the dimension of the eigenspace.
To form a basis, we need $n$ linearly independent eigenvectors for an $n \times n$ matrix.
If the geometric multiplicity of an eigenvalue is less than its algebraic multiplicity, we have “missing” eigenvectors—we don’t have enough linearly independent vectors for that eigenvalue to contribute its “full share” to a basis.
Only when the geometric multiplicity equals the algebraic multiplicity for all eigenvalues do we have enough linearly independent eigenvectors to span the entire space and form a complete set, thus making the matrix diagonalizable.
Crucial Condition for a Complete Set
A matrix has a complete set of eigenvectors if and only if for every eigenvalue, its geometric multiplicity is equal to its algebraic multiplicity.
Example (Repeated Eigenvalue, Insufficient Eigenvectors)
Consider again the matrix $A = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}$ from the example above. The characteristic polynomial is $\lambda^2$, so $\lambda = 0$ has algebraic multiplicity 2. However, the eigenspace $\ker(A)$ has dimension 1 (spanned by $e_1$), so the geometric multiplicity is 1. Since the geometric multiplicity is less than the algebraic multiplicity, this matrix does not have a complete set of eigenvectors and is not diagonalizable.
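A small numpy sketch that computes both multiplicities for this example directly from their definitions (the variable names are just illustrative):

```python
import numpy as np

A = np.array([[0.0, 1.0],
              [0.0, 0.0]])
lam = 0.0
n = A.shape[0]

# Algebraic multiplicity: how often lam occurs among the roots of the
# characteristic polynomial (counted with a small numerical tolerance).
alg_mult = int(np.sum(np.isclose(np.linalg.eigvals(A), lam)))

# Geometric multiplicity: dim ker(A - lam*I) = n - rank(A - lam*I)
# by the rank-nullity theorem.
geo_mult = n - np.linalg.matrix_rank(A - lam * np.eye(n))

print(alg_mult, geo_mult)   # 2 1  -> not diagonalizable
```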
Projection Matrices and Complete Sets of Eigenvectors
Projection matrices, representing projections onto subspaces, possess a special eigenstructure that always guarantees a complete set of eigenvectors.
Proposition: Let $P$ be the orthogonal projection matrix onto a subspace $V \subseteq \mathbb{R}^n$. Then $P$ has two eigenvalues, 0 and 1, and it always has a complete set of real eigenvectors that form a basis for $\mathbb{R}^n$.
Proof
Eigenvalues 0 and 1
Let $x \in \mathbb{R}^n$. We can uniquely decompose $x$ as $x = v + w$, where $v \in V$ and $w \in V^\perp$ (the orthogonal complement of $V$). By definition of orthogonal projection, $Px = v$.
- Eigenvalue 1: If $x \in V$, then $Px = x$. Thus, any non-zero vector in $V$ is an eigenvector of $P$ with eigenvalue 1.
- Eigenvalue 0: If $x \in V^\perp$, then $Px = 0$. Thus, any non-zero vector in $V^\perp$ is an eigenvector of $P$ with eigenvalue 0.
Complete Set of Eigenvectors
Let $k = \dim(V)$. Let $q_1, \dots, q_k$ be an orthonormal basis for $V$, and let $q_{k+1}, \dots, q_n$ be an orthonormal basis for $V^\perp$.
- Eigenvectors for eigenvalue 1: The vectors $q_1, \dots, q_k$ are eigenvectors of $P$ with eigenvalue 1 (as shown above).
- Eigenvectors for eigenvalue 0: The vectors $q_{k+1}, \dots, q_n$ are eigenvectors of $P$ with eigenvalue 0 (as shown above).
- Combined Set: The set $\{q_1, \dots, q_n\}$ forms an orthonormal basis for $\mathbb{R}^n$ consisting entirely of eigenvectors of $P$.
- Linear Independence: Since it is a basis for $\mathbb{R}^n$, the set of vectors is linearly independent.
- Spanning Property: The set spans all of $\mathbb{R}^n$, as it is a basis.
Explicit Form of Eigenvectors: The eigenvectors of the projection matrix $P$ are any set of orthonormal vectors that form bases for the subspace $V$ and its orthogonal complement $V^\perp$.
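A minimal numpy sketch, assuming (as an arbitrary example) that $V$ is the plane in $\mathbb{R}^3$ spanned by two chosen vectors; it builds the orthogonal projection $P = QQ^T$ from an orthonormal basis of $V$ and checks the eigenvalues:

```python
import numpy as np

# Arbitrary example: V = span of two vectors in R^3 (not from the lecture).
B = np.array([[1.0, 0.0],
              [1.0, 1.0],
              [0.0, 1.0]])

Q, _ = np.linalg.qr(B)     # columns of Q: an orthonormal basis of V
P = Q @ Q.T                # orthogonal projection onto V

w, U = np.linalg.eigh(P)   # P is symmetric, so eigh applies
print(np.round(w, 6))                   # [0. 1. 1.]  (dim V = 2, dim V^perp = 1)
print(np.linalg.matrix_rank(U) == 3)    # True: the eigenvectors form a basis of R^3
```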
Diagonalization: Leveraging a Complete Set of Eigenvectors
Theorem: Let $A \in \mathbb{R}^{n \times n}$ have a complete set of eigenvectors $v_1, \dots, v_n$ associated with eigenvalues $\lambda_1, \dots, \lambda_n$, respectively. Let $S = \begin{pmatrix} v_1 & \cdots & v_n \end{pmatrix}$ be the matrix whose columns are these eigenvectors. Then:
$$A = S \Lambda S^{-1},$$
where $\Lambda = \operatorname{diag}(\lambda_1, \dots, \lambda_n)$ is a diagonal matrix with the eigenvalues on the diagonal. This process is called diagonalization.
Diagonalization and Powers of A... (MIT)
Proof
- Invertibility of $S$: Since $A$ has a complete set of eigenvectors, these eigenvectors form a basis for $\mathbb{R}^n$, which means they are linearly independent. Consequently, the matrix $S$, having these linearly independent vectors as its columns, is invertible.
- Eigenvalue Equation in Matrix Form: The individual eigenvalue equations $A v_i = \lambda_i v_i$ for $i = 1, \dots, n$ can be expressed concisely in matrix form:
$$A S = A \begin{pmatrix} v_1 & \cdots & v_n \end{pmatrix} = \begin{pmatrix} \lambda_1 v_1 & \cdots & \lambda_n v_n \end{pmatrix}.$$
- Introducing $\Lambda$: Notice that the rightmost expression in the previous step can be written as the product of $S$ and the diagonal matrix $\Lambda$:
$$\begin{pmatrix} \lambda_1 v_1 & \cdots & \lambda_n v_n \end{pmatrix} = \begin{pmatrix} v_1 & \cdots & v_n \end{pmatrix} \begin{pmatrix} \lambda_1 & & \\ & \ddots & \\ & & \lambda_n \end{pmatrix} = S \Lambda.$$
Let’s see why: multiplying $S$ by $\Lambda$ on the right scales the $i$-th column of $S$ by $\lambda_i$, which gives exactly the columns $\lambda_i v_i$.
- Diagonalization: Combining the steps, we have $A S = S \Lambda$. Since $S$ is invertible, we can multiply both sides on the left by $S^{-1}$:
$$S^{-1} A S = \Lambda.$$
This equation demonstrates that $A$ is similar to the diagonal matrix $\Lambda$.
- Rewriting the Equation: Finally, to obtain the diagonalization of $A$, we can multiply both sides of $S^{-1} A S = \Lambda$ on the left by $S$ and on the right by $S^{-1}$:
$$A = S \Lambda S^{-1}.$$
(A numerical check of this factorization is sketched below.)
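A minimal numpy check of the factorization $A = S \Lambda S^{-1}$, using an arbitrary diagonalizable example matrix (not from the lecture):

```python
import numpy as np

A = np.array([[4.0, 1.0],
              [2.0, 3.0]])    # arbitrary example with distinct eigenvalues 5 and 2

lam, S = np.linalg.eig(A)     # columns of S: eigenvectors; lam: eigenvalues
Lam = np.diag(lam)

print(np.allclose(A @ S, S @ Lam))                   # True:  A S = S Lambda
print(np.allclose(A, S @ Lam @ np.linalg.inv(S)))    # True:  A = S Lambda S^{-1}
```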
Similar Matrices
Definition: Two matrices $A$ and $B$ are similar if there exists an invertible matrix $S$ such that $B = S^{-1} A S$.
Proposition: Similar matrices have the same eigenvalues.
Proof
If $B = S^{-1} A S$ and $A v = \lambda v$ with $v \neq 0$, then:
$$B (S^{-1} v) = S^{-1} A S \, S^{-1} v = S^{-1} A v = \lambda \, S^{-1} v.$$
Thus, if $\lambda$ is an eigenvalue of $A$ with eigenvector $v$, then $\lambda$ is an eigenvalue of $B$ with eigenvector $S^{-1} v$.
Diagonalization, when possible, finds a similar matrix that is diagonal, greatly simplifying computations and revealing the core structure of the linear transformation.
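A quick numpy check that a similarity transform $B = S^{-1} A S$ (with an arbitrarily chosen invertible $S$) preserves the eigenvalues:

```python
import numpy as np

A = np.array([[4.0, 1.0],
              [2.0, 3.0]])
S = np.array([[1.0, 2.0],
              [0.0, 1.0]])     # arbitrary invertible matrix

B = np.linalg.inv(S) @ A @ S
print(np.allclose(np.sort(np.linalg.eigvals(A)),
                  np.sort(np.linalg.eigvals(B))))   # True: same eigenvalues
```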
Why Diagonalization Matters
Diagonalization simplifies many matrix computations. For instance, calculating powers of $A$ becomes much easier:
$$A^k = (S \Lambda S^{-1})^k = S \Lambda S^{-1} \, S \Lambda S^{-1} \cdots S \Lambda S^{-1} = S \Lambda^k S^{-1}.$$
Since $\Lambda$ is diagonal, $\Lambda^k$ is simply computed by raising the diagonal entries (eigenvalues) to the power $k$. This simplification extends to other matrix functions, making diagonalization a powerful technique in various applications.
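For example, a short sketch comparing $S \Lambda^k S^{-1}$ against direct repeated multiplication (same arbitrary example matrix as above):

```python
import numpy as np

A = np.array([[4.0, 1.0],
              [2.0, 3.0]])
k = 10

lam, S = np.linalg.eig(A)
Ak_diag = S @ np.diag(lam**k) @ np.linalg.inv(S)   # S Lambda^k S^{-1}
Ak_direct = np.linalg.matrix_power(A, k)           # A multiplied by itself k times

print(np.allclose(Ak_diag, Ak_direct))             # True
```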
Change of Basis and Diagonalization
Diagonalization, when possible, provides a way to simplify the representation of a linear transformation by choosing a basis of eigenvectors. In this basis, the transformation acts as a simple scaling along each eigenvector direction. Let’s explore this concept in detail, starting with the general case of a linear transformation and then specializing to diagonalizable matrices.
General Case: Change of Basis
Consider a linear transformation $T: \mathbb{R}^n \to \mathbb{R}^m$ and two sets of bases:
- $v_1, \dots, v_n$ for $\mathbb{R}^n$ (the “old” basis).
- $w_1, \dots, w_m$ for $\mathbb{R}^m$ (the “new” basis).
Let $V$ and $W$ be the matrices whose columns are the respective basis vectors. These matrices are invertible since their columns are linearly independent. Any vector $x \in \mathbb{R}^n$ can be expressed in the old basis as $x = \alpha_1 v_1 + \dots + \alpha_n v_n$. Let $\alpha = (\alpha_1, \dots, \alpha_n)^T$. Then $x = V \alpha$.
The transformation results in a vector $T(x)$ in $\mathbb{R}^m$, which can be expressed in the new basis as $T(x) = \beta_1 w_1 + \dots + \beta_m w_m$. Similarly, let $\beta = (\beta_1, \dots, \beta_m)^T$. Then $T(x) = W \beta$.
We want to find a matrix $B$ that represents the transformation with respect to these new bases, such that $\beta = B \alpha$. To do this, let $A$ be the matrix representing the linear transformation in the standard bases. Then $T(x) = A x$.
Using the change of basis matrices: to find $B$, we need to express $\beta$ in terms of $\alpha$:
$$W \beta = T(x) = A x = A V \alpha \quad \Longrightarrow \quad \beta = W^{-1} A V \alpha.$$
Thus, the matrix representing the transformation in the new bases is:
$$B = W^{-1} A V.$$
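A minimal numpy sketch of the change-of-basis formula $B = W^{-1} A V$; the bases below are arbitrarily chosen examples:

```python
import numpy as np

A = np.array([[1.0, 2.0],
              [0.0, 3.0]])       # the transformation in the standard bases (R^2 -> R^2)

V = np.array([[1.0, 1.0],
              [0.0, 1.0]])       # columns: the "old" basis of the domain
W = np.array([[2.0, 0.0],
              [1.0, 1.0]])       # columns: the "new" basis of the codomain

B = np.linalg.inv(W) @ A @ V     # the same map expressed in the new bases

# Check on a sample coordinate vector alpha (coordinates in the old basis):
alpha = np.array([3.0, -1.0])
x = V @ alpha                    # the actual vector
beta = B @ alpha                 # coordinates of T(x) in the new basis
print(np.allclose(W @ beta, A @ x))   # True:  W beta = A x = T(x)
```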
Diagonalization: A Special Case of Change of Basis
When $A \in \mathbb{R}^{n \times n}$ is diagonalizable (i.e., it has a complete set of linearly independent eigenvectors), we can choose the eigenvectors as the new basis for both the domain and codomain ($V = W = S$).
Let $S$ be the matrix whose columns are the eigenvectors of $A$. Then, in this eigenvector basis, the transformation is represented by the diagonal matrix $\Lambda = S^{-1} A S$, where $\Lambda$ has the eigenvalues of $A$ on its diagonal.
In this case, the change of basis simplifies the transformation to a scaling along each eigenvector direction, as the matrix representing the transformation becomes diagonal.
Continue here: 26 Symmetric Matrices, Spectral Theorem, Rayleigh Quotient