Lecture from 01.11.2024 | Video: Videos ETHZ

Fundamental Subspaces of a Matrix

In linear algebra, we often work with matrices and the vector spaces associated with them. A key concept is that of a subspace: a non-empty subset of a vector space that is closed under vector addition and scalar multiplication. We can fully characterize a vector space (or subspace) by identifying a basis for it.

The four fundamental subspaces associated with a matrix $A \in \mathbb{R}^{m \times n}$:

  1. Column Space ($C(A)$): The column space of $A$, denoted $C(A)$, is the subspace of $\mathbb{R}^m$ spanned by the columns of $A$. In other words, $C(A)$ consists of all linear combinations of the columns of $A$.

  2. Row Space ($R(A)$): The row space of $A$, denoted $R(A)$, is the subspace of $\mathbb{R}^n$ spanned by the rows of $A$. Equivalently, it is the column space of $A^T$ (the transpose of $A$).

  3. Null Space ($N(A)$): The null space of $A$, denoted $N(A)$, is the subspace of $\mathbb{R}^n$ consisting of all vectors $x$ that satisfy the homogeneous equation $Ax = 0$. The null space represents the set of all inputs to the linear transformation represented by $A$ that produce the zero vector as output.

  4. Left Null Space ($N(A^T)$): The left null space of $A$, denoted $N(A^T)$, is the null space of $A^T$. It is a subspace of $\mathbb{R}^m$. The left null space can be interpreted as the set of all vectors orthogonal to the column space of $A$. This means that for any $y \in N(A^T)$ and any $v \in C(A)$, their dot product is zero: $y^T v = 0$. (Note: we will not look at this further.)

We already have a tool with which we can compute a basis for each of these subspaces: Gauss-Jordan elimination (see Gauss-Jordan Elimination for more).
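
As a quick refresher, here is a minimal sketch (using SymPy; the matrix is an arbitrary stand-in chosen for illustration, not one from the lecture) of how Gauss-Jordan elimination yields the RREF together with the pivot column indices:

```python
import sympy as sp

# An arbitrary sample matrix, chosen only for illustration.
A = sp.Matrix([
    [1, 2, 0, 3],
    [2, 4, 1, 7],
    [3, 6, 1, 10],
])

# rref() performs Gauss-Jordan elimination and returns the RREF
# together with the indices of the pivot columns (0-indexed).
R, pivot_cols = A.rref()
print(R)           # the reduced row echelon form of A
print(pivot_cols)  # (0, 2): pivots in the first and third columns
```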

Column Space

The column space of $A \in \mathbb{R}^{m \times n}$, denoted $C(A)$, is formally defined as

$$C(A) = \{ Ax \mid x \in \mathbb{R}^n \} \subseteq \mathbb{R}^m.$$

It represents the set of all possible linear combinations of the columns of $A$, and thus all possible outputs of the linear transformation represented by $A$.

Finding a Basis for $C(A)$ using Gauss-Jordan Elimination:

  1. Reduce $A$ to Reduced Row Echelon Form (RREF): Perform Gauss-Jordan elimination on the matrix $A$ to obtain its RREF, denoted $R$.
  2. Identify Pivot Columns: The pivot columns in $R$ correspond to the linearly independent columns of $A$. The indices of these pivot columns are crucial.
  3. Extract Original Columns: The corresponding columns in the original matrix $A$ (not the RREF matrix $R$) form a basis for $C(A)$.

Theorem: Let $A$ be an $m \times n$ matrix, and let $R$ be the RREF of $A$ with pivot column indices $j_1 < j_2 < \dots < j_r$. Then the columns $a_{j_1}, \dots, a_{j_r}$ of $A$ form a basis for $C(A)$, and $\dim C(A) = r = \operatorname{rank}(A)$.

Example: Suppose $A$ is a matrix with four columns. After Gauss-Jordan elimination, we obtain its RREF $R$.

The pivot columns are the first and third columns ($j_1 = 1$, $j_2 = 3$). Therefore, the first and third columns of the original matrix $A$ form a basis for $C(A)$. The dimension of the column space is 2, which is also the rank of the matrix $A$.
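
A minimal sketch of this procedure, assuming SymPy and a small stand-in matrix (not the matrix from the lecture's example; it also happens to have its pivots in the first and third columns):

```python
import sympy as sp

# Illustrative stand-in matrix with pivots in columns 1 and 3.
A = sp.Matrix([
    [1, 2, 0, 3],
    [2, 4, 1, 7],
    [3, 6, 1, 10],
])

R, pivot_cols = A.rref()                  # Gauss-Jordan elimination
basis_CA = [A[:, j] for j in pivot_cols]  # pivot columns of the ORIGINAL matrix A
print(pivot_cols)                         # (0, 2) -> first and third columns
print(basis_CA)                           # basis vectors of C(A)
print(len(basis_CA) == A.rank())          # dim C(A) equals rank(A): True
```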

Proof (sketch): The linearly independent columns of $A$ form a basis for $C(A)$, and Gauss-Jordan elimination identifies exactly these columns: row operations preserve linear dependencies among the columns, so the pivot columns of $R$ correspond precisely to the columns of $A$ that are not linear combinations of the columns to their left.

Row Space

The row space of $A$, denoted $R(A)$, is the subspace of $\mathbb{R}^n$ spanned by the rows of $A$. It is equivalent to the column space of the transpose of $A$: $R(A) = C(A^T)$.

Finding a Basis for $R(A)$ using Gauss-Jordan Elimination:

  1. Reduce $A$ to RREF: Perform Gauss-Jordan elimination on the matrix $A$ to obtain its RREF, denoted $R$.
  2. Non-zero Rows form a Basis: The non-zero rows of $R$ form a basis for $R(A)$.

Theorem: Let $A$ be an $m \times n$ matrix, and let $R$ be the RREF of $A$. The non-zero rows of $R$ form a basis for $R(A)$, and $\dim R(A) = \operatorname{rank}(A)$.

Important Note: While the column space basis comes from the original matrix $A$, the row space basis comes from the RREF matrix $R$.

Example: Using the same matrix $A$ as in the column space example and its RREF $R$: the non-zero rows of $R$ form a basis for $R(A)$.

The dimension of the row space is 2, which is also the rank of $A$. Note that although $\dim C(A) = \dim R(A)$, the basis vectors for $C(A)$ and $R(A)$ are different and belong to different vector spaces ($\mathbb{R}^m$ and $\mathbb{R}^n$ respectively).

Key Takeaway: Row operations preserve the row space but not the column space. The RREF simplifies the process of finding a basis for the row space.
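
The same stand-in matrix illustrates the row space procedure (a sketch, not the lecture's example; SymPy also offers A.rowspace() directly):

```python
import sympy as sp

A = sp.Matrix([[1, 2, 0, 3], [2, 4, 1, 7], [3, 6, 1, 10]])  # stand-in matrix
R, _ = A.rref()

# The non-zero rows of the RREF form a basis of the row space R(A).
basis_RA = [R[i, :] for i in range(R.rows) if any(R[i, :])]
print(basis_RA)                 # the non-zero rows [1, 2, 0, 3] and [0, 0, 1, 1]
print(len(basis_RA), A.rank())  # 2 2: dim R(A) equals rank(A)
```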

Invariance of Row Space under Multiplication by an Invertible Matrix

Lemma 4.27: Let $A$ be an $m \times n$ matrix, and let $M$ be an invertible $m \times m$ matrix. Then $R(MA) = R(A)$.

This lemma states that multiplying a matrix by an invertible matrix on the left does not change its row space.

Proof

The core idea is to understand the connection between row spaces and column spaces through transposes:

  • Row space is the column space of the transpose: $R(A) = C(A^T)$.

We’ll use the following notation:

  • $B = A^T$ (transpose of $A$)
  • $N = M^T$ (transpose of $M$)

The proof revolves around showing that $C(B) = C(BN)$, i.e., $R(A) = R(MA)$:

  1. Start with the row space of A: Consider any vector $v$ in the row space of $A$ ($v \in R(A)$).
  2. Express it as a column space vector: Since $R(A) = C(A^T) = C(B)$, we can express $v$ as a linear combination of the columns of $B$: $v = Bw$ for some vector $w$.
  3. Introduce M and relate to the row space of MA: To connect $v$ to the row space of $MA$, we introduce the invertible matrix $M$. Since $M$ is invertible, so is $N = M^T$, and its inverse $N^{-1}$ exists. Let’s introduce a new vector $u$ related to $w$ by the equation $w = Nu$.

This means $u = N^{-1}w$ (multiplying both sides by $N^{-1}$). Now, we can rewrite $v$ in terms of $u$: $v = Bw = B(Nu) = (BN)u$.

  4. Show v is in the column space of BN: Since $v = (BN)u$ is a linear combination of the columns of $BN$, it belongs to the column space of $BN$: $v \in C(BN)$.
  5. Connecting to the row space of MA: Recall that $(MA)^T = A^T M^T = BN$, because the transpose of a product is the product of the transposes in reverse order. Therefore, $C(BN)$ is the row space of $MA$: $v \in R(MA)$.
  6. Equivalence: By following these steps, we’ve shown that:
    • If $v$ is in $R(A)$, then it’s also in $R(MA)$. This implies $R(A) \subseteq R(MA)$.
    • Since $M$ is invertible, we can repeat the same logic using $M^{-1}$ (applied to $MA$) to show that $R(MA) \subseteq R(A)$.
    • Combining both directions, we conclude that $R(A) = R(MA)$.
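
A quick numerical sanity check of Lemma 4.27 (a sketch with an arbitrarily chosen invertible $M$ and the stand-in matrix from above): the row spaces of $A$ and $MA$ coincide exactly when stacking the two matrices does not increase the rank.

```python
import sympy as sp

A = sp.Matrix([[1, 2, 0, 3], [2, 4, 1, 7], [3, 6, 1, 10]])
M = sp.Matrix([[1, 1, 0], [0, 1, 0], [2, 0, 1]])  # invertible: det(M) = 1
assert M.det() != 0

# R(A) = R(MA) iff the stacked matrix has the same rank as A and MA.
stacked = A.col_join(M * A)
print(A.rank(), (M * A).rank(), stacked.rank())  # 2 2 2 -> row spaces are equal
```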

Finding a Basis for $R(A)$ using REF

Theorem 4.28: Let $A$ be an $m \times n$ matrix, and let $R$ be the REF of $A$ with pivot column indices $j_1 < \dots < j_r$. Then the first $r$ rows of $R$ form a basis for the row space $R(A)$, and $\dim R(A) = r = \operatorname{rank}(A)$.

Why This Works:

The theorem builds on the understanding of row operations, REF structure, and the connection between row spaces and column spaces through transposes. Let’s break down the key reasons why this theorem holds true:

  1. Row Operations Preserve Row Space: Gauss-Jordan elimination, which produces the REF, involves row operations. We already know from Lemma 4.27 (see Invariance of Row Space under Multiplication by an Invertible Matrix) that row operations (equivalent to multiplication by invertible matrices) don’t change the row space. Therefore, the row space of $A$ equals the row space of its REF $R$.
  2. Structure of the REF: The REF has the following key properties:
  • Pivot Rows: The first $r$ rows of $R$ are the non-zero rows, where $r$ is the rank of $A$.
  • Zero Rows: The remaining $m - r$ rows are all zero.
  3. Linear Independence: The first $r$ rows of $R$ are linearly independent. This is because each row has a pivot position (leading 1) in a column that is unique to that row. No row can be expressed as a linear combination of the other rows.
  4. Spanning: The first $r$ rows of $R$ span the entire row space $R(A)$. This is because any linear combination of the original rows of $A$ can be rewritten as a linear combination of the rows of $R$, thanks to the row operations.

By using row operations to put $A$ in REF, we achieve two things:

  • We obtain a set of linearly independent vectors (the non-zero rows of $R$).
  • These vectors span the entire row space $R(A)$.

Therefore, these vectors form a basis for the row space of $A$.

Proof

Let $R$ be the result of Gauss-Jordan elimination on $A$. We know that $R = MA$, where $M$ is a product of elementary matrices (invertible). By Lemma 4.27, the row space of $A$ equals the row space of $R$.

The first $r$ rows of $R$ are the non-zero rows of $R$. These rows are linearly independent because each row has a pivot position (leading 1) in a column that is unique to that row. This implies that no row can be expressed as a linear combination of the other rows.

We now show that the first $r$ rows of $R$ span $R(A)$. Let $v \in R(A)$. Since the row space of $A$ equals the row space of $R$, we can write $v$ as a linear combination of the rows of $R$. Let $c$ be a vector containing the coefficients of this linear combination. Then $v = c_1 r_1 + c_2 r_2 + \dots + c_m r_m$, where $r_1, \dots, r_m$ denote the rows of $R$.

The remaining rows $r_{r+1}, \dots, r_m$ are zero rows, so they contribute nothing to this sum. Thus, we can rewrite $v$ as a linear combination of only the first $r$ rows of $R$.

This shows that the first $r$ rows of $R$ span $R(A)$. Since they are also linearly independent, they form a basis for $R(A)$.

Since the row space of $A$ equals the row space of $R$, we can conclude that the first $r$ rows of $R$ (its non-zero rows) form a basis for $R(A)$, and $\dim R(A) = r = \operatorname{rank}(A)$.

Example:

Consider the matrix $A$ and its REF $R$ from the previous examples.

The first two rows of $R$ form a basis for $R(A)$.

We can conclude that $\dim R(A) = 2$ and the rank of $A$ is 2.

Observation: In both examples, the dimension of the row space equals the dimension of the column space (both are 2). This is no coincidence.

Row Rank Equals Column Rank

Theorem 4.29: Let $A$ be an $m \times n$ matrix. Then $\dim R(A) = \dim C(A)$; equivalently, $\operatorname{rank}(A) = \operatorname{rank}(A^T)$.

Why This Works:

This theorem directly follows from the fact that the dimension of the column space of $A$ equals the dimension of the row space of $A$ (both equal the number $r$ of pivots), which in turn equals the dimension of the column space of $A^T$.

  • Dimensionality: We’ve established that $\dim C(A) = r$ and $\dim R(A) = r$, where $r$ is the number of pivots, i.e. the rank of $A$.
  • Transpose Relationship: We also know that $R(A) = C(A^T)$.

Putting these together, we have: $\operatorname{rank}(A) = \dim C(A) = \dim R(A) = \dim C(A^T) = \operatorname{rank}(A^T)$.

Therefore, the row rank of a matrix is always equal to its column rank.

This is a fundamental result in linear algebra, emphasizing the close relationship between row space and column space.
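
A one-line check of Theorem 4.29 on the stand-in matrix (a sketch using SymPy):

```python
import sympy as sp

A = sp.Matrix([[1, 2, 0, 3], [2, 4, 1, 7], [3, 6, 1, 10]])
print(A.rank(), A.T.rank())                     # 2 2: row rank equals column rank
print(len(A.columnspace()), len(A.rowspace()))  # 2 2: dim C(A) = dim R(A)
```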

Nullspace

The nullspace (also called kernel) of a matrix $A \in \mathbb{R}^{m \times n}$, denoted $N(A)$, is the set of all vectors $x \in \mathbb{R}^n$ that satisfy the homogeneous equation $Ax = 0$. Formally: $N(A) = \{ x \in \mathbb{R}^n \mid Ax = 0 \}$.

Key Properties and Interpretations

  • Subspace: $N(A)$ is a subspace of $\mathbb{R}^n$. This can be easily verified by checking closure under addition and scalar multiplication.
  • Solutions to Homogeneous System: The nullspace represents all solutions to the homogeneous system of linear equations $Ax = 0$.
  • Kernel of Linear Transformation: If $A$ represents a linear transformation, $N(A)$ is the kernel of that transformation – the set of all vectors that map to the zero vector.

Invertible Matrices and Nullspace

Lemma 4.33: Let $A$ be an $m \times n$ matrix, and let $M$ be an invertible $m \times m$ matrix. Then $N(MA) = N(A)$.

This lemma states that multiplying a matrix by an invertible matrix on the left does not change its nullspace.

Proof:

We need to show that any vector in $N(A)$ is also in $N(MA)$, and vice-versa.

  1. Suppose $x \in N(A)$: This means $Ax = 0$. Now consider $(MA)x = M(Ax) = M \cdot 0 = 0$. Therefore, $x$ is also in $N(MA)$. This shows $N(A) \subseteq N(MA)$.
  2. Suppose $x \in N(MA)$: This means $MAx = 0$. Since $M$ is invertible, we can multiply both sides by $M^{-1}$: $M^{-1}MAx = M^{-1} \cdot 0$, which simplifies to $Ax = 0$. Thus, $x$ is also in $N(A)$. This shows $N(MA) \subseteq N(A)$.

Since $N(A) \subseteq N(MA)$ and $N(MA) \subseteq N(A)$, we conclude that $N(A) = N(MA)$.

This lemma is crucial for understanding why we can use the RREF of $A$ to find the nullspace of $A$. The RREF is obtained by multiplying $A$ by a series of invertible elementary matrices. Thus, the nullspace of $A$ is the same as the nullspace of its RREF, which is much easier to compute.
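
A short verification of Lemma 4.33 on the stand-in matrix (a sketch; $M$ is the same arbitrary invertible matrix as before): $A$, $MA$ and the RREF of $A$ all have the same nullspace.

```python
import sympy as sp

A = sp.Matrix([[1, 2, 0, 3], [2, 4, 1, 7], [3, 6, 1, 10]])
M = sp.Matrix([[1, 1, 0], [0, 1, 0], [2, 0, 1]])  # invertible
R, _ = A.rref()

# All three nullspaces coincide (same special-solution basis vectors).
print(A.nullspace())
print((M * A).nullspace())
print(R.nullspace())
```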

Finding a Basis for $N(A)$ using Gauss-Jordan Elimination

Theorem 4.35 gives the following procedure for finding a basis of $N(A)$:

  1. Reduce $A$ to REF: Perform Gauss-Jordan elimination on $A$ to obtain its REF, denoted $R$.
  2. Identify Free and Basic Variables: In the REF, identify the pivot columns and the corresponding basic variables. The remaining variables are the free variables.
  3. Express Basic Variables in terms of Free Variables: Express each basic variable as a linear combination of the free variables, based on the equations represented by the rows of $R$.
  4. Form Special Solutions: For each free variable, create a special solution by setting that free variable to 1 and all other free variables to 0. Then, solve for the basic variables using the expressions derived in the previous step.
  5. Special Solutions Form a Basis: The set of special solutions forms a basis for $N(A)$.

Example

Let’s revisit our example matrix $A$ and its REF $R$.

  • Basic and Free Variables: The pivot columns are 1 and 3, so $x_1$ and $x_3$ are basic variables; $x_2$ and $x_4$ are free variables.

  • Express Basic in terms of Free: From the REF, we obtain equations that express the basic variables $x_1$ and $x_3$ in terms of the free variables $x_2$ and $x_4$.

    Notice how we can choose the free variables freely and then read off $x_1$ and $x_3$.

  • Special Solutions:

  • Setting $x_2 = 1$ and $x_4 = 0$, we solve for $x_1$ and $x_3$. This gives the first special solution.

  • Setting $x_2 = 0$ and $x_4 = 1$, we solve for $x_1$ and $x_3$. This gives the second special solution.

  • Basis for N(A): The two special solutions together form a basis for the nullspace $N(A)$.

Therefore, $\dim N(A) = 2$, which is $n - \operatorname{rank}(A) = 4 - 2$ (the number of columns minus the rank).
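
The special-solution construction can be sketched in SymPy as follows (again with the stand-in matrix, whose free variables are also $x_2$ and $x_4$; this is an illustration, not the lecture's computation):

```python
import sympy as sp

A = sp.Matrix([[1, 2, 0, 3], [2, 4, 1, 7], [3, 6, 1, 10]])  # stand-in matrix
R, pivots = A.rref()
n = A.cols
free = [j for j in range(n) if j not in pivots]  # free columns (0-indexed): [1, 3]

# One special solution per free variable: set it to 1, the other free
# variables to 0, and read the basic variables off the pivot rows of R.
special = []
for f in free:
    x = sp.zeros(n, 1)
    x[f] = 1
    for row, p in enumerate(pivots):
        x[p] = -R[row, f]                # basic variable from its pivot row
    assert A * x == sp.zeros(A.rows, 1)  # each special solution solves Ax = 0
    special.append(x)

print(special)                       # matches A.nullspace()
print(len(special) == n - A.rank())  # dim N(A) = n - rank(A): True
```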

The Solution Space of $Ax = b$

Definition 4.39: For a matrix $A \in \mathbb{R}^{m \times n}$ and a vector $b \in \mathbb{R}^m$, the solution space (or solution set) of the linear system $Ax = b$ is defined as the set $\{ x \in \mathbb{R}^n \mid Ax = b \}$. This set contains all vectors $x$ that satisfy the given linear system.

Key Properties and Considerations

  • Subspace (only if b = 0): The solution set is a subspace of $\mathbb{R}^n$ if and only if $b = 0$. If $b \neq 0$, then it is not a subspace because it doesn’t contain the zero vector (since $A \cdot 0 = 0 \neq b$).
  • Affine Subspace (if b ≠ 0): When $b \neq 0$ and a solution exists, the solution set is called an affine subspace. It can be thought of as a subspace that has been “shifted” or “translated” away from the origin.
  • Existence of Solutions: The solution space might be empty (i.e., no solutions exist) if $b$ is not in the column space of $A$.
  • Relationship to Nullspace: The solution space is closely related to the nullspace of $A$. Theorem 4.40 formalizes this relationship.

Structure of the Solution Space

Theorem 4.40: Let $A$ be an $m \times n$ matrix, and let $b \in \mathbb{R}^m$. Let $x_0$ be a particular solution of $Ax = b$ (if one exists). Then: $\{ x \in \mathbb{R}^n \mid Ax = b \} = \{ x_0 + v \mid v \in N(A) \} = x_0 + N(A)$. This theorem states that the solution space consists of all vectors obtained by adding the particular solution $x_0$ to every vector in the nullspace of $A$.

Proof:

We need to show that any vector in the solution set can be written in the form $x_0 + v$, where $v \in N(A)$, and vice-versa.

  1. Let $x$ be a solution of $Ax = b$: This means $Ax = b$. Since $x_0$ is also a solution, $Ax_0 = b$. Subtracting these equations gives: $A(x - x_0) = Ax - Ax_0 = b - b = 0$. This means $x - x_0$ is in the nullspace of $A$. Let $v = x - x_0$. Then $v \in N(A)$, and $x = x_0 + v$. So, any solution can be written in the desired form.

  2. Let $x = x_0 + v$, where $v \in N(A)$: This means $Av = 0$. Now, let’s check if $x$ is a solution to $Ax = b$: $Ax = A(x_0 + v) = Ax_0 + Av = b + 0 = b$. Thus, $x$ is indeed a solution, so $x$ lies in the solution set.

This proves that the solution set has the structure described in the theorem. If a particular solution exists, every other solution can be obtained by adding a vector from the nullspace to that particular solution. This highlights the fundamental relationship between the solution space of a non-homogeneous system and the nullspace of the corresponding matrix.

Intuition:

  • $x_0$: This represents one specific way to reach $b$ through the transformation defined by $A$.
  • $N(A)$: The nullspace captures all the directions along which you can move without changing the result (i.e., $Av = 0$ for every $v \in N(A)$).
  • $x_0 + N(A)$: Adding vectors from the nullspace to the particular solution finds all other solutions. These represent variations of the particular solution that don’t affect the target $b$.
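
A small sketch of Theorem 4.40 in SymPy (the stand-in matrix again; $b$ and the particular solution $x_0$ are chosen by hand for this illustration):

```python
import sympy as sp

A = sp.Matrix([[1, 2, 0, 3], [2, 4, 1, 7], [3, 6, 1, 10]])
b = sp.Matrix([1, 3, 4])      # lies in C(A): b = (first column) + (third column)

x0 = sp.Matrix([1, 0, 1, 0])  # a particular solution: A * x0 = b
assert A * x0 == b

# x0 plus any nullspace vector is again a solution (Theorem 4.40).
for v in A.nullspace():
    assert A * (x0 + v) == b

# SymPy can also return the whole affine solution set x0 + N(A) directly.
x1, x2, x3, x4 = sp.symbols('x1 x2 x3 x4')
print(sp.linsolve((A, b), [x1, x2, x3, x4]))
```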

Dimensions and Structure of Solution Spaces

  1. Dimensionality of the Solution Space: When $Ax = b$ has a solution (i.e., the system is consistent), the solution set has a “dimension” of $n - r$, where $r$ is the rank of $A$ and $n$ is the number of columns in $A$. This dimension intuitively represents the number of “degrees of freedom” in the solution space.

  2. Parallelism to $N(A)$: The solution set can be viewed as a parallel translation of the nullspace $N(A)$. Imagine the nullspace as a subspace (or a “plane” in higher dimensions) passing through the origin. The solution space is obtained by shifting this “plane” by the particular solution $x_0$.

  3. Cases Based on the Rank:

  • $r = m$ (Full Rank): If the rank of $A$ equals the number of rows ($r = m$), then the RREF of $A$ will have no zero rows. This means that the system $Rx = \tilde{b}$ (where $R$ is the RREF of $A$ and $\tilde{b}$ is obtained from $b$ through the same row operations) will always have a solution. Consequently, the original system $Ax = b$ will also have a solution. In this case, the solution space is non-empty, and its dimension is $n - r$, which could be 0 or greater. If it is 0, there is only one solution.

  • $r < m$ (Rank Deficiency): When the rank of $A$ is less than the number of rows ($r < m$), the column space $C(A)$ has a lower dimension than the space it’s embedded in ($\mathbb{R}^m$). Imagine $C(A)$ as a “plane” (or a “line” in a 3D space) within a higher-dimensional space. Since $C(A)$ is “smaller” than the space it occupies, there will be many vectors $b$ that lie outside of this “plane”. For these vectors $b$, there’s no way to express $b$ as a linear combination of the columns of $A$, meaning there’s no solution to the system $Ax = b$.

Example: Rank Deficiency
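
As an illustration of rank deficiency (a sketch with the stand-in matrix, not the lecture's own example): here $\operatorname{rank}(A) = 2 < m = 3$, so $C(A)$ is a proper subspace of $\mathbb{R}^3$ and some right-hand sides $b$ admit no solution at all.

```python
import sympy as sp

A = sp.Matrix([[1, 2, 0, 3], [2, 4, 1, 7], [3, 6, 1, 10]])  # rank 2 < m = 3
x1, x2, x3, x4 = sp.symbols('x1 x2 x3 x4')

b_good = sp.Matrix([1, 3, 4])  # inside C(A): consistent system
b_bad = sp.Matrix([1, 3, 5])   # outside C(A): inconsistent system

print(sp.linsolve((A, b_good), [x1, x2, x3, x4]))  # non-empty affine solution set
print(sp.linsolve((A, b_bad), [x1, x2, x3, x4]))   # EmptySet -> no solution
```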

Continue here: 15 Orthogonal Vectors and Orthogonal Complements of Subspaces