Fast Gaussian Elimination with Partial Pivoting for
Matrices with Displacement Structure
y yz
I.Gohb erg T.Kailath and V.Olshevsky
Abstract
2
implementation of Gaussian elimination with partial pivoting is designed Fast O n
for matrices p ossessing Cauchy-like displacement structure. We show how Toeplitz{
like, Toeplitz-plus-Hankel{like and Vandermonde-like matrices can b e transformed into
Cauchy{like matrices by using Discrete Fourier, Cosine or Sine Transform matrices.
2
In particular this allows us to prop ose a new fast O n To eplitz solver GKO,
which incorp orates partial pivoting. A large set of numerical examples showed that
GKO demonstrated stable numerical b ehavior and can be recommended for solving
linear systems, esp ecially with nonsymmetric, inde nite and ill-conditioned p ositive
de nite To eplitz matrices. It is also useful for blo ck To eplitz and mosaic To eplitz
To eplitz{blo ck matrices.
The algorithms prop osed in this pap er suggest an attractive alternative to lo ok-
ahead approaches, where one has to jump over ill-conditioned leading submatrices,
3
which in the worse case requires O n op erations.
Key words. Gaussian elimination, partial pivoting, displacement structure, To eplitz-
like, Cauchy-like, Vandermonde-like, To eplitz-plus-Hankel matrix, Schur algorithm, Levin-
son algorithm.
AMS sub ject classi cation. 15A06, 15A09, 15A23, 15A57, 65F05, 65H10.
0 Intro duction
nn nn
0.1. Displacement structure. Let matrices F , A 2 C be given. Let R 2 C be a
matrix satisfying a Sylvester typ e equation
r R=F R RA=GB; 0.1
fF;Ag
Scho ol of Mathematical Sciences, Tel Aviv University, Ramat Aviv 69978, Israel, e-mail : go-
hb ergmath.tau.ac.il
y
Information Systems Lab oratory, Stanford University, Stanford CA 94305 { 4055, U.S.A., e-mail : tkras-
cals.stanford.edu, olshevskrascals.stanfo d.edu
z
This researchwas supp orted in part by the Army Research Oce under GrantDAAH04-93-G-0029. This
manuscript is submitted for publication with the understanding that the US Government is authorized to
repro duce and distribute reprints for Government purp oses notwithstanding any copyright notation hereon. 1
n n
with some rectangular matrices G 2 C , B 2 C , where number is small in compari-
son with n. The pair of matrices G; B in 0.1 is referred to as a fF; Ag-generator of R and
the smallest p ossible inner size among all fF; Ag-generators is called a fF; Ag-displacement
rank of R . The concept of displacement structure was rst intro duced in [KKM] using Stein
nn nn
typ e displacement op erator r : C !C , given by
fF;Ag
r R=R F RA: 0.2
fF;Ag
The variant 0.1 of displacement equation app eared later in [HR]. The left hand sides of
0.2 and 0.1 are often referred to as To eplitz{like and Hankel{like displacement op erators,
resp ectively. The most general form of displacement structure, which clearly includes 0.1
and 0.2, was intro duced in [KS1] using the op erator
r R = R F RA : 0.3
f ;;F ;Ag
A standard Gaussian elimination scheme applied for triangular factorization of R would
3
require p erforming O n op erations. At the same time displacement structure allows us
to sp eed up the triangular factorization of a matrix, or equivalently, Gaussian elimination.
2
This sp eed up is not unexp ected, since all n entries of a structured matrix are completely
determined by a smaller number 2 n of the entries of its generator fG; B g in the right hand
side of 0.1. Moreover, translating the Gaussian elimination pro cedure into appropriate
2
op erations on generator gives fast O n algorithms. This was rst implicitly done by
Schur in [S] for Hermitian To eplitz matrices and for certain generalizations thereof, which
were called quasi-To eplitz matrices in [LAK]. Then it was progressively extended to more
general displacement structures by Kailath and his colleagues. The most general results
were presented in [KS1] see also [KS2] . The name generalized Schur algorithm was coined
there for any fast implementation of Gaussian elimination exploiting displacement structure
as in 0.3.
0.2. Basic classes of structured matrices. The nowwell known classes of To eplitz{
like matrices, Vandermonde{like, or Cauchy{like matrices, etc, can be intro duced by using
any form 0.1 or 0.2 of displacement equation see e.g. [KKM], [HR], [CK1], [AG],
[GO2] . For our purp oses in this pap er it will be more convenient to exploit the Sylvester
typ e displacement op erator as in 0.1. Particular choices of matrices F and A in 0.1 lead
to the de nitions of basic classes of structured matrices, e.g.,
Toeplitz{like [KKM] : F = Z , A = Z ;
1 1
Toeplitz{plus{Hankel{like : F = Y , A = Y ;
00 11
[HJR], [GK], [SLAK]
Cauchy{like [HR] : F = diagc ; :::; c , A = diagd ; :::; d ;
1 n 1 n
1 1
Vandermonde{like [HR], [GO2] : F = diag ; :::; , A = Z ;
1
x x
n
1
Chebyshev{Vandermonde{like [KO1] : F = diagx ; :::; x , A = Y .
1 n ;
Here
1 0 0
0 0 0
.
.
. .
1 0 0
.
1 0 1 .
.
.
. .
. .
. .
.
0 1 .
; 0.4 ; Y = Z =
. .
;
0 1 0
. .
. .
.
. .
. . . .
. . .
. .
. .
. .
. 0 1
0 0 1 0
0 0 1 2
T T T
i.e. Z is the lower shift {circulant matrix and Y = Z +Z + e e +e e . The reason for
; 0 1 n
0 1 n
such designations may b e seen from the easily veri ed fact that the shift-invariance prop erty
t
of a To eplitz matrix T = implies that that the fZ ;Z g-displacement rank of
i j
1 1
1i;j n
anyTo eplitz matrix do es not exceed 2. Therefore a matrix with low fZ ;Z g{displacement
1 1
rank is referred to as a To eplitz{like matrix. Similar justi cation for the other typ es of
displacement structure will b e given in the main text b elow.
nn
0.3. Transformation of structured matrices and pivoting. Let R 2 C satisfy
nn
the displacement equation 0.1, and let T ;T 2 C be two invertible matrices. It is
1 2