AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 08: Floating Point Arithmetic; Condition Numbers

AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 08: Floating Point Arithmetic; Condition Numbers Xiangmin Jiao SUNY Stony Brook Xiangmin Jiao Numerical Analysis I 1 / 11 Outline 1 Floating Point Arithmetic 2 Conditioning and Condition Numbers Xiangmin Jiao Numerical Analysis I 2 / 11 Floating Point Representations Computers can only use finite number of bits to represent a real number I Numbers cannot be arbitrarily large or small (associated risks of overflow and underflow) I There must be gaps between representable numbers (potential round-off errors) Commonly used computer-representations are floating point representations, which resemble scientific notation −1 −p+1 e ±(d0 + d1β + ··· + dp−1β )β ; 0 ≤ di < β where β is base, p is digits of precision, and e is exponent between emin and emax Normalize if d0 6= 0 (except for 0) Gaps between adjacent numbers scale with size of numbers 1−p Relative resolution given by machine epsilon machine = 0:5β For all x, there exists a floating point x0 such that 0 jx − x j ≤ machinejxj Xiangmin Jiao Numerical Analysis I 3 / 11 IEEE Floating Point Representations Single precision: 32 bit I 1 sign bit (S), 8 exponent bits (E), 23 significant bits (M), (−1)S × 1:M × 2E−127 −24 I machine is 2 ≈ 6e − 8 Double precision: 64 bits I 1 sign bit (S), 11 exponent bits (E), 52 significant bits (M), (−1)S × 1:M × 2E−1023 −53 I machine is 2 ≈ e − 16 Special quantities I +1 and −∞ when operation overflows; e.g., x=0 for nonzero x I NaN (Not a Number) is returnedp when an operation has no well-defined result; e.g., 0/0, −1, arcsin(2), NaN Xiangmin Jiao Numerical Analysis I 4 / 11 Machine Epsilon Define fl(x) as closest floating point approximation to x By definition of machine, we have: For all x 2 R, there exists with jj ≤ machine such that fl(x) = x(1 + ) Given operation +, −, ×, and = (denoted by ∗), floating point numbers x and y, and corresponding floating point arithmetic (denoted by ~), we require that x ~ y = fl(x ∗ y) This is guaranteed by IEEE floating point arithmetic Fundamental axiom of floating point arithmetic: For all x; y 2 F, there exists with jj ≤ machine such that x ~ y = (x ∗ y)(1 + ) These properties will be the basis of error analysis with rounding errors Xiangmin Jiao Numerical Analysis I 5 / 11 Outline 1 Floating Point Arithmetic 2 Conditioning and Condition Numbers Xiangmin Jiao Numerical Analysis I 6 / 11 I Round-off error (input, computation), truncation (approximation) error We would like the solution to be accurate, i.e., with small errors The error depends on property (conditioning) of the problem, property (stability) of the algorithm Overview of Error Analysis Error analysis is important subject of numerical analysis Given a problem f and an algorithm f~ with an input x, the absolute error is kf~(x) − f (x)k and relative error is kf~(x) − f (x)k=kf (x)k What are possible sources of errors? Xiangmin Jiao Numerical Analysis I 7 / 11 Overview of Error Analysis Error analysis is important subject of numerical analysis Given a problem f and an algorithm f~ with an input x, the absolute error is kf~(x) − f (x)k and relative error is kf~(x) − f (x)k=kf (x)k What are possible sources of errors? I Round-off error (input, computation), truncation (approximation) error We would like the solution to be accurate, i.e., with small errors The error depends on property (conditioning) of the problem, property (stability) of the algorithm Xiangmin Jiao Numerical Analysis I 7 / 11 Absolute Condition Number Condition number is a measure of sensitivity of a problem Absolute condition number of a problem f at x is kδf k κ^ = lim sup "!0 kδxk≤" kδxk where δf = f (x + δx) − f (x) kδf k Less formally, κ^ = supδx kδxk for infinitesimally small δx If f is differentiable, then κ^ = kJ(x)k where J is the Jacobian of f at x, with Jij = @fi =@xj , and the matrix norm is induced by vector norms on @f and @x Question: What is absolute condition number of f (x) = αx? Question: Is absolute condition number scale invariant? Xiangmin Jiao Numerical Analysis I 8 / 11 Relative Condition Number Relative condition number of f at x is kδf k=kf (x)k κ = lim sup "!0 kδxk≤" kδxk=kxk Less formally, κ = sup kδf k=kδxk for infinitesimally small δx δx kf (x)k=kxk Note: we can use different types of norms to get different condition numbers If f is differentiable, then kJ(x)k κ = kf (x)k=kxk Question: What is relative condition number of f (x) = αx? Question: Is relative condition number scale invariant? In numerical analysis, we in general use relative condition number A problem is well-conditioned if κ is small and is ill-conditioned if κ is large Xiangmin Jiao Numerical Analysis I 9 / 11 Condition Numbers Absolute condition number of a problem f at x is kδf k κ^ = lim sup "!0 kδxk≤" kδxk where δf = f (x + δx) − f (x) kδf k Less formally, κ^ = supδx kδxk for infinitesimally small δx Relative condition number of f at x is kδf k=kf (x)k κ = lim sup "!0 kδxk≤" kδxk=kxk Less formally, κ = sup kδf k=kδxk for infinitesimally small δx δx kf (x)k=kxk Xiangmin Jiao Numerical Analysis I 10 / 11 p I Absolute condition number of f at x is κ^ = kJk = 1=(2 x) F Note: We are talking about the condition number of the problem for a given x p kJk 1=(2 x) I Relative condition number κ = = p = 1=2 kf (x )k=kx k x=x T Example: Function f (x) = x1 − x2, where x = (x1; x2) I Absolute condition number of f at x in 1-norm is κ^ = kJk1 = k(1; −1)k1 = 2 kJk1 2 I Relative condition number κ = = kf (x )k1=kx k1 jx1−x2j= maxfjx1j;jx2jg I κ is arbitrarily large (f is ill-conditioned) if x1 ≈ x2 (hazard of cancellation error) Note: From now on, we will talk about only relative condition number Examples p Example: Function f (x) = x Xiangmin Jiao Numerical Analysis I 11 / 11 I Absolute condition number of f at x in 1-norm is κ^ = kJk1 = k(1; −1)k1 = 2 kJk1 2 I Relative condition number κ = = kf (x )k1=kx k1 jx1−x2j= maxfjx1j;jx2jg I κ is arbitrarily large (f is ill-conditioned) if x1 ≈ x2 (hazard of cancellation error) Note: From now on, we will talk about only relative condition number Examples p Example: Function f (x) = x p I Absolute condition number of f at x is κ^ = kJk = 1=(2 x) F Note: We are talking about the condition number of the problem for a given x p kJk 1=(2 x) I Relative condition number κ = = p = 1=2 kf (x )k=kx k x=x T Example: Function f (x) = x1 − x2, where x = (x1; x2) Xiangmin Jiao Numerical Analysis I 11 / 11 Examples p Example: Function f (x) = x p I Absolute condition number of f at x is κ^ = kJk = 1=(2 x) F Note: We are talking about the condition number of the problem for a given x p kJk 1=(2 x) I Relative condition number κ = = p = 1=2 kf (x )k=kx k x=x T Example: Function f (x) = x1 − x2, where x = (x1; x2) I Absolute condition number of f at x in 1-norm is κ^ = kJk1 = k(1; −1)k1 = 2 kJk1 2 I Relative condition number κ = = kf (x )k1=kx k1 jx1−x2j= maxfjx1j;jx2jg I κ is arbitrarily large (f is ill-conditioned) if x1 ≈ x2 (hazard of cancellation error) Note: From now on, we will talk about only relative condition number Xiangmin Jiao Numerical Analysis I 11 / 11.

AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 08: Floating Point Arithmetic; Condition Numbers

Structured Eigenvalue Condition Numbers and Linearizations for Matrix Polynomials

Condition Number Bounds for Problems with Integer Coefficients*

The Eigenvalue Problem: Perturbation Theory

Inverse Littlewood-Offord Theorems and the Condition Number Of

Computing the Jordan Structure of an Eigenvalue∗

High Probability Analysis of the Condition Number of Sparse Polynomial Systems

Condition Number and Matrices

1 on the Condition of Numerical Problems and the Numbers That

Chapter 9 PRECONDITIONING

CPSC 303: WHAT the CONDITION NUMBER DOES and DOES NOT TELL US Contents 1. Motivating Examples from Interpolation 2 1.1. an N

Structured Eigenvalue Condition Numbers

7.4 Matrix Norms and Condition Numbers This Section Considers the Accuracy of Computed Solutions to Linear Systems