Faster Spectral Algorithms Via Approximation Theory (Slides)
Total Page:16
File Type:pdf, Size:1020Kb
Faster (Spectral) Algorithms via Approximation Theory Nisheeth K. Vishnoi EPFL 1.0 d=0 d=1 0.5 d=5 -1.0 -0.5 d=4 0.5 1.0 d=3 -0.5 d=2 -1.0 Based on a recent monograph with Sushant Sachdeva (Yale) Simons Institute, Dec. 3, 2014 The Goal Many algorithms today rely on our ability to quickly compute good approximations to matrix-function-vector products: e.g., As v; A−1v; exp(−A)v; ... or top few eigenvalues and eigenvectors. Demonstrate How to reduce the problem of computing these primitives to a small number of those of the form Bu where B is a matrix closely related to A (often A itself) and u is some vector. A key feature of these algorithms is that if the matrix-vector product for A can be computed quickly, e.g., when A is sparse, then Bu can also be computed in essentially the same time. The classical area in analysis of approximation theory provides the right framework to study these questions. The Goal Many algorithms today rely on our ability to quickly compute good approximations to matrix-function-vector products: e.g., As v; A−1v; exp(−A)v; ... or top few eigenvalues and eigenvectors. Demonstrate How to reduce the problem of computing these primitives to a small number of those of the form Bu where B is a matrix closely related to A (often A itself) and u is some vector. A key feature of these algorithms is that if the matrix-vector product for A can be computed quickly, e.g., when A is sparse, then Bu can also be computed in essentially the same time. The classical area in analysis of approximation theory provides the right framework to study these questions. The Goal Many algorithms today rely on our ability to quickly compute good approximations to matrix-function-vector products: e.g., As v; A−1v; exp(−A)v; ... or top few eigenvalues and eigenvectors. Demonstrate How to reduce the problem of computing these primitives to a small number of those of the form Bu where B is a matrix closely related to A (often A itself) and u is some vector. A key feature of these algorithms is that if the matrix-vector product for A can be computed quickly, e.g., when A is sparse, then Bu can also be computed in essentially the same time. The classical area in analysis of approximation theory provides the right framework to study these questions. The Goal Many algorithms today rely on our ability to quickly compute good approximations to matrix-function-vector products: e.g., As v; A−1v; exp(−A)v; ... or top few eigenvalues and eigenvectors. Demonstrate How to reduce the problem of computing these primitives to a small number of those of the form Bu where B is a matrix closely related to A (often A itself) and u is some vector. A key feature of these algorithms is that if the matrix-vector product for A can be computed quickly, e.g., when A is sparse, then Bu can also be computed in essentially the same time. The classical area in analysis of approximation theory provides the right framework to study these questions. Approximation Theory Approximation Theory Approximation Theory Approximation Theory Approximation Theory How well can functions be approximated by simpler ones? Uniform (Chebyshev) Approximation by Polynomials/Rationals For f : R 7! R and an interval I; what is the closest a degree d polynomial/rational function can remain to f (x) throughout I inf supjf (x) − p(x)j: p2Σd x2I inf supjf (x) − p(x)=q(x)j: p;q2Σd x2I Σd : set of all polynomials of degree at most d. 150+ years of fascinating history, deep results and many applications. Interested in fundamental functions such as xs ; e−x and 1=x over finite and infinite intervals such as [−1; 1]; [0; n]; [0; 1): For our applications good enough approximations suffice. Approximation Theory How well can functions be approximated by simpler ones? Uniform (Chebyshev) Approximation by Polynomials/Rationals For f : R 7! R and an interval I; what is the closest a degree d polynomial/rational function can remain to f (x) throughout I inf supjf (x) − p(x)j: p2Σd x2I inf supjf (x) − p(x)=q(x)j: p;q2Σd x2I Σd : set of all polynomials of degree at most d. 150+ years of fascinating history, deep results and many applications. Interested in fundamental functions such as xs ; e−x and 1=x over finite and infinite intervals such as [−1; 1]; [0; n]; [0; 1): For our applications good enough approximations suffice. Approximation Theory How well can functions be approximated by simpler ones? Uniform (Chebyshev) Approximation by Polynomials/Rationals For f : R 7! R and an interval I; what is the closest a degree d polynomial/rational function can remain to f (x) throughout I inf supjf (x) − p(x)j: p2Σd x2I inf supjf (x) − p(x)=q(x)j: p;q2Σd x2I Σd : set of all polynomials of degree at most d. 150+ years of fascinating history, deep results and many applications. Interested in fundamental functions such as xs ; e−x and 1=x over finite and infinite intervals such as [−1; 1]; [0; n]; [0; 1): For our applications good enough approximations suffice. Approximation Theory How well can functions be approximated by simpler ones? Uniform (Chebyshev) Approximation by Polynomials/Rationals For f : R 7! R and an interval I; what is the closest a degree d polynomial/rational function can remain to f (x) throughout I inf supjf (x) − p(x)j: p2Σd x2I inf supjf (x) − p(x)=q(x)j: p;q2Σd x2I Σd : set of all polynomials of degree at most d. 150+ years of fascinating history, deep results and many applications. Interested in fundamental functions such as xs ; e−x and 1=x over finite and infinite intervals such as [−1; 1]; [0; n]; [0; 1): For our applications good enough approximations suffice. Approximation Theory How well can functions be approximated by simpler ones? Uniform (Chebyshev) Approximation by Polynomials/Rationals For f : R 7! R and an interval I; what is the closest a degree d polynomial/rational function can remain to f (x) throughout I inf supjf (x) − p(x)j: p2Σd x2I inf supjf (x) − p(x)=q(x)j: p;q2Σd x2I Σd : set of all polynomials of degree at most d. 150+ years of fascinating history, deep results and many applications. Interested in fundamental functions such as xs ; e−x and 1=x over finite and infinite intervals such as [−1; 1]; [0; n]; [0; 1): For our applications good enough approximations suffice. Approximation Theory How well can functions be approximated by simpler ones? Uniform (Chebyshev) Approximation by Polynomials/Rationals For f : R 7! R and an interval I; what is the closest a degree d polynomial/rational function can remain to f (x) throughout I inf supjf (x) − p(x)j: p2Σd x2I inf supjf (x) − p(x)=q(x)j: p;q2Σd x2I Σd : set of all polynomials of degree at most d. 150+ years of fascinating history, deep results and many applications. Interested in fundamental functions such as xs ; e−x and 1=x over finite and infinite intervals such as [−1; 1]; [0; n]; [0; 1): For our applications good enough approximations suffice. Algorithms/Numerical Linear Alg.- f (A)v, Eigenvalues, .. A simple example: Compute As v where A is symmetric with eigenvalues in [−1; 1]; v is a vector and s is a large positive integer. The straightforward way to compute As v takes time O(ms) where m is the number of non-zero entries in A: Suppose xs can be δ-approximated over the interval [−1; 1] Pd i by a degree d polynomial ps;d (x) = i=0 ai x : s Pd i Candidate approximation to A v: i=0 ai A v. Pd i The time to compute i=0 ai A v is O(md). Pd i s k i=0 ai A v − A vk ≤ δkvk since all the eigenvalues of A lie in [−1; 1]; and s ps;d is δ-close to x in the entire interval [−1; 1]. How small can d be? Algorithms/Numerical Linear Alg.- f (A)v, Eigenvalues, .. A simple example: Compute As v where A is symmetric with eigenvalues in [−1; 1]; v is a vector and s is a large positive integer. The straightforward way to compute As v takes time O(ms) where m is the number of non-zero entries in A: Suppose xs can be δ-approximated over the interval [−1; 1] Pd i by a degree d polynomial ps;d (x) = i=0 ai x : s Pd i Candidate approximation to A v: i=0 ai A v. Pd i The time to compute i=0 ai A v is O(md). Pd i s k i=0 ai A v − A vk ≤ δkvk since all the eigenvalues of A lie in [−1; 1]; and s ps;d is δ-close to x in the entire interval [−1; 1]. How small can d be? Algorithms/Numerical Linear Alg.- f (A)v, Eigenvalues, .. A simple example: Compute As v where A is symmetric with eigenvalues in [−1; 1]; v is a vector and s is a large positive integer. The straightforward way to compute As v takes time O(ms) where m is the number of non-zero entries in A: Suppose xs can be δ-approximated over the interval [−1; 1] Pd i by a degree d polynomial ps;d (x) = i=0 ai x : s Pd i Candidate approximation to A v: i=0 ai A v.