Web Homework Gaussian Elimination

Definition 1: A m by n matrix is a rectangular array of numbers with m rows and n columns. One can write images/gauss1.png for an m by n matrix where images/gauss2.png is the entry in row i and column j. Row i of the matrix is the 1 by n matrix images/gauss3.png whose entries are the entries of A with first index i. Similarly, column j is the m by 1 matrix consisting of the entries of A with second index j. Rows and columns are also referred to as row vectors and column vectors respectively. The leading column number of this row is the smallest index j with images/gauss4.png or infinity if there is no such index. If the leading column number k of some row is finite, then column k is called a leading column.

Definition 2:The matrix images/gauss5.png is in echelon form if

  1. The leading column number of rows images/gauss6.png which have at least one non-zero entry is a strictly increasing function of the row index images/gauss7.png .
  2. If some row images/gauss8.png has all its entries zero, then so does every row images/gauss9.png with images/gauss10.png .

Definition 3:A matrix is in reduced echelon form if every leading column images/gauss11.png has a unique non-zero entry and if this entry has value 1.

Of course, the non-zero entry must be in the row which has this column as its leading column.

Definition 4: An elementary row operation is any of the following functions from the set of m by n matrices to the set of m by n matrices:

  1. Multiply every entry in a row by a non-zero number c.
  2. Swap two different rows.
  3. Replace a row with the sum of itself and a constant multiple of another row.

Notice that every elementary row operation has an inverse operation which is also an elementary row operation. For example, if c is the multiplier in the operation of type iii above, the inverse of that operation is one of the same type with multiplier -c.

Definition 5: Two matrices A and B are said to be row equivalent if there is a finite sequence of elementary row operations which maps A into B.

Clearly, two matrices can only be row equivalent if they have the same number of rows and the same numbeer of columns.

Exercise 1: Row equivalence is an equivalence relation, i.e.

  1. (Reflexive) Every matrix A is row equivalent to itself.
  2. (Symmetric) If a matrix A is row equivalent to a matrix B, then B is row equivalent to A.
  3. (Transitive) If A is row equivalent to B and B is row equivalent to C, then A is row equivalent to C.

Because of Exercise 1, we know that the m by n matrices decomposes into a union of row equivalence classes, i.e. It is a disjoint union of sets images/gauss12.png where any two matrices are row equivalent if and only if they belong to the same one of these sets.

Definition 6: A row vector images/gauss13.png is said to is said to be a linear combination of vectors images/gauss14.png if there are constants images/gauss15.png such that images/gauss16.png , i.e. images/gauss17.png for every images/gauss18.png .

Exercise 2: Let images/gauss19.png be a finite set of vectors and S be the set of vectors which are linear combinations of the images/gauss20.png . Show that S is closed under addition and multiplication by scalars.

In general, one would not expect the coefficients images/gauss21.png to be uniquely determined, but it is true in the case where the images/gauss22.png are non-zero rows of a matrix in echelon form. In more detail:

Proposition 1: Let images/gauss23.png be an m by n matrix in echelon form and images/gauss24.png for images/gauss25.png be the rows of images/gauss26.png which have at least one entry non-zero. If images/gauss27.png is a linear combination of these rows, say, images/gauss28.png , then there is a unique set of images/gauss29.png for which this equation is true.

Proof: Suppose that j is the leading column number of row i. Then the equation says images/gauss30.png . Now, images/gauss31.png for all images/gauss32.png because images/gauss33.png is in echelon form and images/gauss34.png because j is the leading column number of row i. This allows one to solve for a unique value of images/gauss35.png in terms of the previous images/gauss36.png (and the constants images/gauss37.png and images/gauss38.png ). Since this is true for each i, the images/gauss39.png are uniquely determined.

Things are particular simple in case of a matrix in reduced echelon form:

Corollary 1: If, in addition, the matrix images/gauss40.png is in reduced echelon form, then images/gauss41.png where j is the leading column number of row i of images/gauss42.png .

Proof: The equation for column j is images/gauss43.png . One has images/gauss44.png for images/gauss45.png and images/gauss46.png . So the equation is just images/gauss47.png .

The leading column number of a linear combination of the rows of a matrix in echelon form is the leading column number of some row of that matrix (or infinity). More precisely:

Proposition 2: Let images/gauss48.png be an m by n matrix in echelon form and images/gauss49.png be a linear combination of the rows images/gauss50.png of images/gauss51.png . Then the leading column number of images/gauss52.png is either infinity or equal to a leading column number of some row of images/gauss53.png .

Proof: Let images/gauss54.png express images/gauss55.png as a linear combination of the non-zero rows of images/gauss56.png . Let images/gauss57.png be the smallest index for which images/gauss58.png is non-zero and images/gauss59.png be the leading column number of images/gauss60.png . If images/gauss61.png does not exist or if the images/gauss62.png is infinity, then the leading column number of images/gauss63.png is infinite. Otherwise, we will show that images/gauss64.png is the leading column number of images/gauss65.png . One has images/gauss66.png . Suppose that images/gauss67.png .

  1. If s is less than i, we have images/gauss68.png and the corresponding term of the sum is zero.
  2. If t is less than the leading column number of row s, then images/gauss69.png and again the corresponding term of the sum is zero. Since the leading column number is an increasing function in a matrix in echelon form, we know that this is true whenever s is bigger than i (since images/gauss70.png ) or when images/gauss71.png and images/gauss72.png . In particular, if images/gauss73.png , then every term of the sum is zero and so images/gauss74.png .
  3. If images/gauss75.png , then the only term in the sum which could be non-zero is the term when s is i: images/gauss76.png . Since both factors are non-zero, the term is non-zero and so j is the leading column number of images/gauss77.png .

Gaussian elimination is essentially an algorithm for finding a reduced echelon matrix in the row equivalence class of a given matrix. More formally:

Proposition 3: Every row equivalence class contains at least one matrix in reduced echelon form.

Proof: See the textbook for the algorithm.

The main result of Chapter 1 of the text is the assertion that regardless of the sequence of elementary row operations, one always arrives at the same reduced echelon form matrix:

Theorem 1: Every row equivalence class contains exactly one matrix in reduced echelon form.

Proof: Suppose a row equivalence class contains two reduced echelon matrices A and B. There is a sequence of elementary row transformations which converts A into B. Each row of B is a linear combination of the non-zero rows of A and, similarly, each row of A is a linear combination of the non-zero rows of B. Since each non-zero row of B is a linear combination of the the rows of A, Proposition 2 says that the finite leading column numbers of B are amongst the finite leading column numbers of A. Similarly, the finite leading column numbers of A are amongst those of B.

The number of non-zero rows of A (or of B) is just the number of finite leading column numbers (because each non-zero row has a leading column number different from that of every other non-zero row). So, we know that A and B have the same zero rows. Further, because the leading column numbers are increasing functions of the row index, we know that a non-zero row of A has the same leading column number as the row of B with the same row index (and vice versa).

Suppose images/gauss78.png expresses row i of A as a linear combination of the non-zero rows of B. By Corollary 1, we know that images/gauss79.png where j is the leading column number of images/gauss80.png . When s = i, this means that images/gauss81.png because the leading column number of images/gauss82.png is the same as that of images/gauss83.png and both A and B have entry 1 in that column. Otherwise, we know that images/gauss84.png because every entry of images/gauss85.png is zero in all columns whose column index is the leading column number of another row. So, images/gauss86.png for every i. But then A = B.


Revised: September 8, 2002
All contents © Copyright 2002 K. K. Kubota. All rights reserved.