Skip to main content
Logo image

Section 1.5 Existence of Solutions

Recall that any linear system may be expressed as a matrix equation of the form \(A\vb{x} = \vb{b}\text{.}\) Furthermore, this system is consistent if and only if the last column of the augmented matrix \(\mqty[A \amp \bb]\) is not a pivot column. Based on this and our observations in Section 1.4, we can make the following observations.
As before, we use Gaussian elimination (i.e., row reduction) to solve systems.

Example 1.5.2.

Find any solutions of
\begin{align*} 5x - 7y + 3z &= 17\\ -15x + 20y - 9z &= -50 \end{align*}
Solution.
Reducing to an echelon form is enough to determine if the system is consistent and find the number of solutions it has. Using Octave to find the reduced echelon form (see the code cell immediately after this example), we get
\begin{equation*} \begin{bmatrix} 1 & 0 & \frac{3}{5} & 2 \\ 0 & 1 & 0 & -1\end{bmatrix}. \end{equation*}
From the reduced echelon form above, we see that the system must be consistent since the rank of the coefficient matrix is equal to the rank of the augmented matrix. Equivalently, the last column is not a pivot column. Furthermore, there are infinitely many solutions since the rank of the coefficient matrix is less than the total number of columns.
The solution set itself can be written in vector notation as
\begin{equation*} \begin{bmatrix}x \\ y \\ z\end{bmatrix} = \begin{bmatrix}2 \\ -1 \\ 0\end{bmatrix} + z\begin{bmatrix}-\frac{3}{5} \\ 0 \\ 1\end{bmatrix}. \end{equation*}
This, again, is verified below.
In the last example the variable \(z\) led to infinitely many solutions and we were able to write out solution depending on the value of this variable. We call \(z\) a free variable and \(x\) and \(y\) basic variables.

Definition 1.5.3. Basic and Free Variables.

Given a consistent linear system \(A\vb{x}=\vb{b}\text{,}\) the variables corresponding to pivot columns of \(A\) are basic variables and the variables corresponding to non-pivot columns of \(A\) are free variables.
Any solution of the linear system \(A\vb{x} = \vb{b}\) can always be written to depend solely on any free variables as we did in Example 1.5.2. In fact we can go a bit further, still using our answer in Example 1.5.2 as a guide. Using free variables, any solution to \(A\vb{x}=\vb{b}\) can be written as a sum of two components:
\begin{equation*} \vb{x} = \vb{x}_p + \vb{x}_{\text{free}}. \end{equation*}
This notation will change shortly, but the main idea is that one component of the solution will not depend on the free variable and will represent a single solution of the system \(A\vb{x} = \vb{b}\text{.}\) In Example 1.5.2 this would be
\begin{equation*} \vb{x}_{p} = \mqty[2\\-1\\0], \end{equation*}
and it's easy to verify that
\begin{equation*} A\vb{x}_{p} = \mqty[5 & -7 & 3 \\ -15 & 20 & -9]\mqty[2\\-1\\0] = \mqty[17\\-50]. \end{equation*}
The other component of the solution, \(\vb{x}_{\text{free}}\text{,}\) will depend on the free variable. In Example 1.5.2, this component was
\begin{equation*} \vb{x}_{\text{free}} = z\mqty[-\frac{3}{5} \\ 0 \\ 1]. \end{equation*}
As it turns out, this component is not a solution of the original system \(A\vb{x} = \vb{b}\text{.}\) Instead, \(A\vb{x}_{\text{free}} = \vb{0}\text{.}\) This behavior is shared by all consistent systems with free variables, and leads us to introduce the following terminology.

Definition 1.5.4. Associated Homogeneous System.

Given a linear system \(A\vb{x} = \vb{b}\text{,}\) we define the associated homogeneous system to be the system \(A\vb{x} = \vb{0}\text{.}\)
The observations made after Example 1.5.2 can be summarized in the following theorem.
Here, we only prove that \(\vb{x}_{h}\) satisfies the associated homogeneous system. Since \(\vb{x}_{p}\) is a solution of the original system along with \(\vb{x} = \vb{x}_{p} + \vb{x}_{h}\text{,}\) it follows that
\begin{equation*} A\vb{x}_{h} = A(\vb{x}-x_{p}) = \vb{b} - \vb{b} = \vb{0}. \end{equation*}
Therefore \(\vb{x}_{h}\) is a solution of the associated homogeneous system.
Since solutions of homogeneous systems play such an important role in the solution of non-homogeneous systems, we give their solution sets a special name.

Definition 1.5.6. Null Space.

Let \(A\) be a matrix. The null space of \(A\) is the set of all solutions of \(A\vb{x} = \vb{0}\text{.}\) This is denoted by \(\nul{A}\text{.}\)
As with column spaces and row spaces, null spaces are always subspaces.
To show that \(\nul{A}\) is a subspace we need to show that it's closed under linear combinations. So let \(\vb{u},\vb{v}\in\nul{A}\) be arbitrary vectors in the null space and let \(\alpha,\beta\in\RR\) be arbitrary scalars. Our goal is to show that \(\alpha\vb{u} + \beta\vb{v}\in\nul{A}\text{.}\) Thankfully, we can do this very quickly:
\begin{align*} A(\alpha\vb{u} + \beta\vb{v}) &= \alpha A\vb{u} + \beta A\vb{v}\\ &= \vb{0} + \vb{0} \\ &= \vb{0}, \end{align*}
which shows that the linear combination \(\alpha\vb{u}+\beta\vb{v}\) lies in \(\nul{A}\text{.}\)
The concept of the null space is related to that of the column space in Definition 1.4.8, but they are distinct. To be precise, if \(A\) is an \(m\times n\) matrix then
\begin{align*} \col{A} &= \qty{A\vb{x} : \vb{x}\in\RR^n} \subseteq \RR^{m} \\ \nul{A} &= \qty{\vb{x} : A\vb{x} = \vb{0}} \subseteq \RR^{n} \end{align*}

Example 1.5.8. Finding a Null Space.

Let
\begin{equation*} A = \mqty[0 & 5 & 5 & -10 & 0 \\ 2 & -3 & -3 & 6 & 2 \\ 4 & 1 & 1 & -2 & 4]. \end{equation*}
Find \(\nul{A}\text{.}\)
Solution.
We need to find the solution set of \(A\vb{x} = \vb{0}\) which we've done before. The Octave code cell below can be used to solve this system, giving the reduced echelon form for the augmented matrix \(\mqty[A & \vb{0}]\) to be
\begin{equation*} \mqty[1 & 0 & 0 & 0 & 1 & 0 \\ 0 & 1 & 1 & -2 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0]. \end{equation*}
Therefore any solution \(\vb{x}\in\RR^5\) of \(A\vb{x}=\vb{0}\) must look like
\begin{equation*} \vb{x} = \mqty[x_1\\x_2\\x_3\\x_4\\x_5] = \mqty[-x_5\\-x_3+2x_4\\x_3\\x_4\\x_5] = x_3\mqty[0\\-1\\1\\0\\0] + x_4\mqty[0\\2\\0\\1\\0] + x_5\mqty[-1\\0\\0\\0\\1]. \end{equation*}
Note that the above shows that
\begin{equation*} \nul{A} = \spn{\qty{\mqty[0\\-1\\1\\0\\0], \mqty[0\\2\\0\\1\\0], \mqty[-1\\0\\0\\0\\1]}} \end{equation*}
Since these vectors are also linearly independent, it follows that the set
\begin{equation*} \qty{\mqty[0\\-1\\1\\0\\0], \mqty[0\\2\\0\\1\\0], \mqty[-1\\0\\0\\0\\1]} \end{equation*}
is in fact a basis for \(\nul{A}\) and \(\dim{\nul{A}} = 3\text{.}\)
At this point we can make a simple but useful observation. In Example 1.5.8, the dimension of the null space was directly tied to the number of free variables in the system \(A\vb{x} = \vb{0}\text{.}\) The number of basic variables is likewise equal to the number of pivot columns of \(A\text{.}\) Noting that the number of basic variables plus the number of free variables must be the total number of columns of \(A\text{,}\) together with Theorem 1.4.6, we get the Rank-Nullity Theorem.