Observe that multiplying \(A\) by \(B\) on the left results in multiplying the second row of \(A\) by \(5\text{,}\) while multiplying \(A\) by \(C\) on the left results in multiplying the third row of \(A\) by \(-2\text{.}\)
Now we need to return to the question of what \(B\) and \(C\) have in common. Both matrices were obtained from the identity matrix by multiplying one row of the identity by a non-zero constant. Matrices \(B\) and \(C\) were obtained from \(I\) by multiplying one row of \(I\) by \(5\) and \(-2\) respectively. Multiplying \(A\) by \(B\) (or \(C\)) on the left affects \(A\) in the same way.
In general, if a square matrix \(E\) is obtained from the identity matrix \(I\) by multiplying row \(j\) of \(I\) by a non-zero constant \(k\text{,}\) then multiplying an appropriately sized matrix \(A\) on the left by \(E\) results in row \(j\) of \(A\) being multiplied by \(k\text{.}\)
Recall that multiplication of a row of a matrix by a non-zero constant is one of three elementary row operations. Applying such an elementary row operation to \(I\) in order to produce \(E\text{,}\) results in applying the same elementary row operation to \(A\) when \(A\) is multiplied by \(E\) on the left.
As in the previous Exploration, the two matrices have something in common. Both \(D\) and \(F\) were obtained from the identity matrix by adding a multiple of one row to another row.
As you had probably guessed, multiplication by \(D\) resulted in the third row of \(A\) being added to the first, and multiplication by \(F\) produced a matrix by adding \(-2\) times the first row to the second row of \(A\text{.}\) The elementary row operations performed on \(A\) mimic the elementary row operations performed on \(I\) in order to obtain \(D\) and \(F\text{.}\)
In general, if a square matrix \(E\) is obtained from the identity matrix \(I\) by adding \(k\) times row \(j\) of \(I\) to row \(i\text{,}\) then multiplying an appropriately sized matrix \(A\) on the left by \(E\) results in \(k\) times row \(j\) of \(A\) being added to row \(i\) of \(A\text{.}\)
Recall that adding a scalar multiple of one row to another row of a matrix is one of three elementary row operations. Applying such an elementary row operation to \(I\) in order to produce \(E\text{,}\) results in applying the same elementary row operation to \(A\) when \(A\) is multiplied by \(E\) on the left.
The matrices \(B,C,D,F\) above are special because when we multiply them by any appropriately sized matrix \(A\text{,}\) we are performing row operations on \(A\text{.}\)
Suppose \(A\) is an \(m \times n\) matrix. If \(E\) is an \(m \times m\) elementary matrix formed by performing a certain row operation on the \(m \times m\) identity matrix, then multiplying any matrix \(A\) on the left by \(E\) is equivalent to performing that same row operation on \(A\text{.}\) As there are three types of elementary row operations, there are three types of elementary matrices.
Elementary matrices give us a new way of looking at Gauss-Jordan elimination. Suppose it takes \(j\) elementary row operations to transform \(A\) into \(R\text{,}\) its reduced row-echelon form. Then we can represent this reduced row-echelon form as
\begin{equation*}
R = E_j \cdots E_2 E_1 A
\end{equation*}
where each \(E_i\) is the elementary matrix corresponding to the \(i\)th row operation performed on \(A\text{.}\)
It is easy to see that any elementary matrix \(E\) is invertible, because if \(E\) is formed by applying a certain row operation to the identity matrix \(I\text{,}\) then there is a single row operation that may be applied to \(E\) to get \(I\) back. For example, in Explorationย 3.3.2, \(F\) is formed by adding \(-2\) times the first row of the identity to the second row of the identity. It follows that \(F^{-1}\) should be the matrix formed by adding \(2\) times the first row of the identity to the second row of the identity, i.e.
As part of the Practice Problem set you are asked to find the inverse of each of the other elementary matrices in Explorationย 3.3.1, Explorationย 3.3.2 and Explorationย 3.3.3. Once we have accounted for each of the three types of elementary matrices, we will have proven the following theorem.
Suppose \(E\) is obtained from \(I\) by switching rows \(i\) and \(j\text{.}\) To find the inverse of \(E\text{,}\) we need to find a matrix \(F\) such that \(FE=I\text{.}\) To get from \(E\) back to \(I\text{,}\) rows \(i\) and \(j\) of \(E\) must be switched. This can be accomplished by multiplying \(E\) by itself on the left. So, \(E\) is its own inverse. We can use the same line of reasoning to show that the other two types of elementary matrices are also invertible, and their inverses are also elementary matrices. The details are left to the reader.
[Proof of Itemย 1\(\Rightarrow\)Itemย 2]: Suppose \(\mbox{rref}(A)=I\text{.}\) Then \(A\) can be carried to the identity by elementary row operations. So, there exist elementary matrices \(E_1, E_2, \ldots ,E_k\) such that
By Theoremย 3.3.7, elementary matrices are invertible and their inverses are also elementary matrices. Thus, we can write \(A\) as a product of elementary matrices as follows:
[Proof of Itemย 2\(\Rightarrow\)Itemย 3]: Suppose \(A=E_1E_2\ldots E_k\text{,}\) where \(E_1, E_2, \ldots , E_k\) are elementary matrices. In Itemย 3 we proved that \((BC)^{-1} = C^{-1} B^{-1}\text{.}\) By repeated applications of this theorem we have
For each elementary matrix \(E\) below, determine the elementary row operation that results from multiplying a \(3\times n\) matrix \(A\) by \(E\) on the left. Write down \(E^{-1}\) without going through the row-reduction procedure.
Row-reduce \(A\) to find \(\mbox{rref}(A)\text{.}\) Record the elementary row operations as you perform row reduction. You will be able to conclude that \(E_j\cdots E_2E_1A=I\text{.}\) Find the inverse of each \(E_i\) and multiply by the inverses on the left.
In Explorationย 3.3.1, Explorationย 3.3.2 and Explorationย 3.3.3 we performed elementary row operations on \(A\) by multiplying \(A\) by elementary matrices \(B, C, D, F, G\) on the left. Compute \(AB, AC, AD, AF\) and \(AG\text{.}\) Summarize your findings.