Lecture 13 Gradient Methods for Constrained Optimization

Total Page:16

File Type:pdf, Size:1020Kb

Lecture 13 Gradient Methods for Constrained Optimization Lecture 13 Gradient Methods for Constrained Optimization October 16, 2008 Lecture 13 Outline • Gradient Projection Algorithm • Convergence Rate Convex Optimization 1 Lecture 13 Constrained Minimization minimize f(x) subject x ∈ X • Assumption 1: • The function f is convex and continuously differentiable over Rn • The set X is closed and convex ∗ • The optimal value f = infx∈Rn f(x) is finite • Gradient projection algorithm xk+1 = PX[xk − αk∇f(xk)] starting with x0 ∈ X. Convex Optimization 2 Lecture 13 Bounded Gradients Theorem 1 Let Assumption 1 hold, and suppose that the gradients are uniformly bounded over the set X. Then, the projection gradient method generates the sequence {xk} ⊂ X such that • When the constant stepsize αk ≡ α is used, we have 2 ∗ αL lim inf f(xk) ≤ f + k→∞ 2 P • When diminishing stepsize is used with k αk = +∞, we have ∗ lim inf f(xk) = f . k→∞ Proof: We use projection properties and the line of analysis similar to that of unconstrained method. HWK 6 Convex Optimization 3 Lecture 13 Lipschitz Gradients • Lipschitz Gradient Lemma For a differentiable convex function f with Lipschitz gradients, we have for all x, y ∈ Rn, 1 k∇f(x) − ∇f(y)k2 ≤ (∇f(x) − ∇f(y))T (x − y), L where L is a Lipschitz constant. • Theorem 2 Let Assumption 1 hold, and assume that the gradients of f are Lipschitz continuous over X. Suppose that the optimal solution ∗ set X is not empty. Then, for a constant stepsize αk ≡ α with 0 2 < α < L converges to an optimal point, i.e., ∗ ∗ ∗ lim kxk − x k = 0 for some x ∈ X . k→∞ Convex Optimization 4 Lecture 13 Proof: n Fact 1: If z = PX[z − v] for some v ∈ < , then z = PX[z − τv] for any τ > 0. ∗ Fact 2: z ∈ X if and only if z = PX[z − ∇f(z)]. ∗ These facts imply that z ∈ X if and only if z = PX[z − τ∇f(z)] for any τ > 0. By using the definition of the method and the preceding relation with τ = α, we obtain for any z ∈ X∗, 2 2 kxk+1 − zk = kPX[xk − α∇f(xk)] − PX[z − α∇f(z)k . By non-expansiveness of the projection, it follows 2 2 kxk+1 − zk = kxk − z − α(∇f(xk) − ∇f(z))k 2 T = kxk − zk − 2α(xk − z) (∇f(xk) − ∇f(z)) 2 2 +α k∇f(xk) − ∇f(z)k Convex Optimization 5 Lecture 13 Using Lipschitz Gradient Lemma, we obtain for any z ∈ X∗, 2 2 α 2 kxk+1 − zk ≤ kxk − zk − (2 − αL)k∇f(xk) − ∇f(z)k . (1) L Hence, for all k, α 2 2 2 (2 − αL)k∇f(xk) − ∇f(z)k ≤ kxk − zk − kxk+1 − zk . L By summing the preceding relations from arbitrary K to N, with K < N, we obtain N α X 2 2 2 2 (2−αL) k∇f(xk)−∇f(z)k ≤ kxK−zk −kxN+1−zk ≤ kxK−zk . L k=K Convex Optimization 6 Lecture 13 In particular, setting K = 0 and letting N → ∞, we see that ∞ α X 2 2 (2 − αL) k∇f(xk) − ∇f(z)k ≤ kx0 − zk < ∞. (2) L k=0 As a consequence, we also have lim ∇f(xk) = ∇f(z). (3) k→∞ By discarding the non-positive term in the right hand side of Eq. (1), we have for any z ∈ X∗ and all k, 2 2 2 kxk+1 − zk ≤ kxk − zk + (2 − αL)k∇f(xk) − ∇f(z)k . By summing these relations over k = K, . , N for arbitrary K and N with K < N, we obtain Convex Optimization 7 Lecture 13 N 2 2 X 2 kxN+1 − zk ≤ kxK − zk + (2 − αL) k∇f(xk) − ∇f(z)k . k=K Taking limsup as N → ∞, we obtain ∞ 2 2 X 2 lim sup kxN+1 − zk ≤ kxK − zk + (2 − αL) k∇f(xk) − ∇f(z)k . N→∞ k=K Now, taking liminf as K → ∞ yields 2 2 lim sup kxN+1 − zk ≤ lim inf kxK − zk N→∞ K→∞ ∞ X 2 + (2 − αL) lim k ∇f(xk) − ∇f(z)k K→∞ k=K 2 = lim inf kxK − zk , K→∞ Convex Optimization 8 Lecture 13 where the equality follows in view of the relation in (2). Thus, we have that ∗ the sequence {kxk − zk} is convergent for every z ∈ X . By the inequality in Eq. (1), we have that kxk − zk ≤ kx0 − zk for all k. Hence, the sequence {xk} is bounded, and it has an accumulation point. ∗ Since the scalar sequence {kxk − zk} is convergent for every z ∈ X , the sequence {xk} must be convergent. Suppose now that xk → x.¯ By considering the definition of the iterate xk+1, we have xk+1 = PX[xk − α∇f(xk)]. Letting k → ∞ and using xk → x,¯ and continuity of the gradient ∇f(x), we obtain x¯ = PX[¯x − α∇f(¯x)]. ∗ In view of facts 1 and 2, the preceding relation is equivalent to x¯ ∈ X . Convex Optimization 9 Lecture 13 Modes of Convexity: Strict and Strong • Def. f is strictly convex if for all x 6= y and α ∈ (0, 1) we have f(αx + (1 − α)y) < αf(x) + (1 − α)f(y) • Def. f is strongly convex if there exists a scalar ν > 0 such that ν f(αx + (1 − α)y) ≤ αf(x) + (1 − α)f(y) − α(1 − α)kx − yk2 2 for all x, y ∈ <n and any α ∈ [0, 1]. The scalar ν is referred to as strongly convex constant. The function is said to be strongly convex with constant ν. Convex Optimization 10 Lecture 13 Modes of Convexity: Differentiable Function • Let f : <n → R be continuously differentiable. • Modes of convexity can be equivalently characterized in terms of the linearization properties of the function ∇f : <n → <n. • We have • f is convex if and only if f(x) + ∇f(x)T (y − x) ≤ f(y) for all x, y ∈ <n • f is strictly convex if and only if f(x) + ∇f(x)T (y − x) < f(y) for all x 6= y • f is strongly convex with constant ν if and only if ν f(x) + ∇f(x)T (y − x) + ky − xk2 ≤ f(y) for all x, y ∈ <n 2 Convex Optimization 11 Lecture 13 Modes of Convexity: Gradient Mapping • Let f : <n → R be continuously differentiable. • Modes of convexity can be equivalently characterized in terms of the monotonicity properties of the gradient mapping ∇f : <n → <n. • We have • f is convex if and only if (∇f(x) − ∇f(y))T (x − y) ≥ 0 for all x, y ∈ <n • f is strictly convex if and only if (∇f(x) − ∇f(y))T (x − y) > 0 for all x 6= y • f is strongly convex with constant ν if and only if (∇f(x) − ∇f(y))T (x − y) ≥ ν kx − yk2 for all x, y ∈ <n Convex Optimization 12 Lecture 13 Modes of Convexity: Twice Differentiable Function • Let f : <n → R be twice continuously differentiable. • Modes of convexity can be equivalently characterized in terms of the definiteness of the Hessians ∇2f(x) for x ∈ <n. • We have • f is convex if and only if ∇2f(x) ≥ 0 for all x ∈ <n • f is strictly convex if ∇2f(x) > 0 for all x ∈ <n • f is strongly convex with constant ν if and only if ∇2f(x) ≥ ν I for all x ∈ <n Convex Optimization 13 Lecture 13 Strong Convexity: Implications Let f be continuously differentiable and strongly convex∗ over Rn with constant m • Implications: • Lower Bound on f over Rn: for all x, y ∈ Rn m f(y) ≥ f(x) + ∇f(x)T (y − x) + kx − yk2 (4) 2 2 minimize w/r to y in the right-hand side: 1 f(y) ≥ f(x) − k∇f(x)k2 2m n minimum over y ∈ R : 1 f(x) − f ∗ ≤ k∇f(x)k2 2m • Useful as a stopping criterion (if you know m) ∗ n Strong convexity over R can be replaced by a strong convexity over a set X. Then, all the relations stay valid over the set Convex Optimization 14 Lecture 13 • Relation (4) with x = x0 and f(y) ≤ f(x0) implies that the level set Lf (f(x0)) is bounded • Relation (4) also yields for an optimal x∗ and any x ∈ Rn, m kx − x∗k2 ≤ f(x) − f(x∗) 2 • Last two bullets HWK6 assignment. Convex Optimization 15 Lecture 13 Convergence Rate: Once Differentiable Theorem 3 Let Assumption 1 hold, and assume that the gradients of f are Lipschitz continuous over X with constant L > 0. Suppose that the function is strongly convex with constant m > 0. Then: • A solution x∗ exists and it is unique. • The iterates generated by the gradient projection method with αk ≡ α 2 ∗ and α < L converge to x with geometric rate, i.e., ∗ 2 k ∗ 2 kxk+1 − x k ≤ q kxk − x k for all k with q ∈ (0, 1) depending on m and L. Proof: HWK 6. Convex Optimization 16 Lecture 13 Convergence Rate: Twice Differentiable Theorem 4 Let Assumption 1 hold. Assume that the function is twice continuously differentiable and strongly convex with constant m > 0. Assume also that ∇f 2(x) ≤ L for all x ∈ X. Then: • A solution x∗ exists and it is unique. • The iterates generated by the gradient projection method with αk ≡ α 2 ∗ and α < L converge to x with geometric rate, i.e., ∗ k ∗ kxk+1 − x k ≤ q kxk − x k for all k with q = max{|1 − αm|, |1 − αL}. Convex Optimization 17 Lecture 13 Proof: The q here is different from the one in the preceding theorem.
Recommended publications
  • Section 8.8: Improper Integrals
    Section 8.8: Improper Integrals One of the main applications of integrals is to compute the areas under curves, as you know. A geometric question. But there are some geometric questions which we do not yet know how to do by calculus, even though they appear to have the same form. Consider the curve y = 1=x2. We can ask, what is the area of the region under the curve and right of the line x = 1? We have no reason to believe this area is finite, but let's ask. Now no integral will compute this{we have to integrate over a bounded interval. Nonetheless, we don't want to throw up our hands. So note that b 2 b Z (1=x )dx = ( 1=x) 1 = 1 1=b: 1 − j − In other words, as b gets larger and larger, the area under the curve and above [1; b] gets larger and larger; but note that it gets closer and closer to 1. Thus, our intuition tells us that the area of the region we're interested in is exactly 1. More formally: lim 1 1=b = 1: b − !1 We can rewrite that as b 2 lim Z (1=x )dx: b !1 1 Indeed, in general, if we want to compute the area under y = f(x) and right of the line x = a, we are computing b lim Z f(x)dx: b !1 a ASK: Does this limit always exist? Give some situations where it does not exist. They'll give something that blows up.
    [Show full text]
  • Notes on Calculus II Integral Calculus Miguel A. Lerma
    Notes on Calculus II Integral Calculus Miguel A. Lerma November 22, 2002 Contents Introduction 5 Chapter 1. Integrals 6 1.1. Areas and Distances. The Definite Integral 6 1.2. The Evaluation Theorem 11 1.3. The Fundamental Theorem of Calculus 14 1.4. The Substitution Rule 16 1.5. Integration by Parts 21 1.6. Trigonometric Integrals and Trigonometric Substitutions 26 1.7. Partial Fractions 32 1.8. Integration using Tables and CAS 39 1.9. Numerical Integration 41 1.10. Improper Integrals 46 Chapter 2. Applications of Integration 50 2.1. More about Areas 50 2.2. Volumes 52 2.3. Arc Length, Parametric Curves 57 2.4. Average Value of a Function (Mean Value Theorem) 61 2.5. Applications to Physics and Engineering 63 2.6. Probability 69 Chapter 3. Differential Equations 74 3.1. Differential Equations and Separable Equations 74 3.2. Directional Fields and Euler’s Method 78 3.3. Exponential Growth and Decay 80 Chapter 4. Infinite Sequences and Series 83 4.1. Sequences 83 4.2. Series 88 4.3. The Integral and Comparison Tests 92 4.4. Other Convergence Tests 96 4.5. Power Series 98 4.6. Representation of Functions as Power Series 100 4.7. Taylor and MacLaurin Series 103 3 CONTENTS 4 4.8. Applications of Taylor Polynomials 109 Appendix A. Hyperbolic Functions 113 A.1. Hyperbolic Functions 113 Appendix B. Various Formulas 118 B.1. Summation Formulas 118 Appendix C. Table of Integrals 119 Introduction These notes are intended to be a summary of the main ideas in course MATH 214-2: Integral Calculus.
    [Show full text]
  • Two Fundamental Theorems About the Definite Integral
    Two Fundamental Theorems about the Definite Integral These lecture notes develop the theorem Stewart calls The Fundamental Theorem of Calculus in section 5.3. The approach I use is slightly different than that used by Stewart, but is based on the same fundamental ideas. 1 The definite integral Recall that the expression b f(x) dx ∫a is called the definite integral of f(x) over the interval [a,b] and stands for the area underneath the curve y = f(x) over the interval [a,b] (with the understanding that areas above the x-axis are considered positive and the areas beneath the axis are considered negative). In today's lecture I am going to prove an important connection between the definite integral and the derivative and use that connection to compute the definite integral. The result that I am eventually going to prove sits at the end of a chain of earlier definitions and intermediate results. 2 Some important facts about continuous functions The first intermediate result we are going to have to prove along the way depends on some definitions and theorems concerning continuous functions. Here are those definitions and theorems. The definition of continuity A function f(x) is continuous at a point x = a if the following hold 1. f(a) exists 2. lim f(x) exists xœa 3. lim f(x) = f(a) xœa 1 A function f(x) is continuous in an interval [a,b] if it is continuous at every point in that interval. The extreme value theorem Let f(x) be a continuous function in an interval [a,b].
    [Show full text]
  • Calculus Terminology
    AP Calculus BC Calculus Terminology Absolute Convergence Asymptote Continued Sum Absolute Maximum Average Rate of Change Continuous Function Absolute Minimum Average Value of a Function Continuously Differentiable Function Absolutely Convergent Axis of Rotation Converge Acceleration Boundary Value Problem Converge Absolutely Alternating Series Bounded Function Converge Conditionally Alternating Series Remainder Bounded Sequence Convergence Tests Alternating Series Test Bounds of Integration Convergent Sequence Analytic Methods Calculus Convergent Series Annulus Cartesian Form Critical Number Antiderivative of a Function Cavalieri’s Principle Critical Point Approximation by Differentials Center of Mass Formula Critical Value Arc Length of a Curve Centroid Curly d Area below a Curve Chain Rule Curve Area between Curves Comparison Test Curve Sketching Area of an Ellipse Concave Cusp Area of a Parabolic Segment Concave Down Cylindrical Shell Method Area under a Curve Concave Up Decreasing Function Area Using Parametric Equations Conditional Convergence Definite Integral Area Using Polar Coordinates Constant Term Definite Integral Rules Degenerate Divergent Series Function Operations Del Operator e Fundamental Theorem of Calculus Deleted Neighborhood Ellipsoid GLB Derivative End Behavior Global Maximum Derivative of a Power Series Essential Discontinuity Global Minimum Derivative Rules Explicit Differentiation Golden Spiral Difference Quotient Explicit Function Graphic Methods Differentiable Exponential Decay Greatest Lower Bound Differential
    [Show full text]
  • The Infinite and Contradiction: a History of Mathematical Physics By
    The infinite and contradiction: A history of mathematical physics by dialectical approach Ichiro Ueki January 18, 2021 Abstract The following hypothesis is proposed: \In mathematics, the contradiction involved in the de- velopment of human knowledge is included in the form of the infinite.” To prove this hypothesis, the author tries to find what sorts of the infinite in mathematics were used to represent the con- tradictions involved in some revolutions in mathematical physics, and concludes \the contradiction involved in mathematical description of motion was represented with the infinite within recursive (computable) set level by early Newtonian mechanics; and then the contradiction to describe discon- tinuous phenomena with continuous functions and contradictions about \ether" were represented with the infinite higher than the recursive set level, namely of arithmetical set level in second or- der arithmetic (ordinary mathematics), by mechanics of continuous bodies and field theory; and subsequently the contradiction appeared in macroscopic physics applied to microscopic phenomena were represented with the further higher infinite in third or higher order arithmetic (set-theoretic mathematics), by quantum mechanics". 1 Introduction Contradictions found in set theory from the end of the 19th century to the beginning of the 20th, gave a shock called \a crisis of mathematics" to the world of mathematicians. One of the contradictions was reported by B. Russel: \Let w be the class [set]1 of all classes which are not members of themselves. Then whatever class x may be, 'x is a w' is equivalent to 'x is not an x'. Hence, giving to x the value w, 'w is a w' is equivalent to 'w is not a w'."[52] Russel described the crisis in 1959: I was led to this contradiction by Cantor's proof that there is no greatest cardinal number.
    [Show full text]
  • Convergence Rates for Deterministic and Stochastic Subgradient
    Convergence Rates for Deterministic and Stochastic Subgradient Methods Without Lipschitz Continuity Benjamin Grimmer∗ Abstract We extend the classic convergence rate theory for subgradient methods to apply to non-Lipschitz functions. For the deterministic projected subgradient method, we present a global O(1/√T ) convergence rate for any convex function which is locally Lipschitz around its minimizers. This approach is based on Shor’s classic subgradient analysis and implies generalizations of the standard convergence rates for gradient descent on functions with Lipschitz or H¨older continuous gradients. Further, we show a O(1/√T ) convergence rate for the stochastic projected subgradient method on convex functions with at most quadratic growth, which improves to O(1/T ) under either strong convexity or a weaker quadratic lower bound condition. 1 Introduction We consider the nonsmooth, convex optimization problem given by min f(x) x∈Q for some lower semicontinuous convex function f : Rd R and closed convex feasible → ∪{∞} region Q. We assume Q lies in the domain of f and that this problem has a nonempty set of minimizers X∗ (with minimum value denoted by f ∗). Further, we assume orthogonal projection onto Q is computationally tractable (which we denote by PQ( )). arXiv:1712.04104v3 [math.OC] 26 Feb 2018 Since f may be nondifferentiable, we weaken the notion of gradients to· subgradients. The set of all subgradients at some x Q (referred to as the subdifferential) is denoted by ∈ ∂f(x)= g Rd y Rd f(y) f(x)+ gT (y x) . { ∈ | ∀ ∈ ≥ − } We consider solving this problem via a (potentially stochastic) projected subgradient method.
    [Show full text]
  • Mean Value, Taylor, and All That
    Mean Value, Taylor, and all that Ambar N. Sengupta Louisiana State University November 2009 Careful: Not proofread! Derivative Recall the definition of the derivative of a function f at a point p: f (w) − f (p) f 0(p) = lim (1) w!p w − p Derivative Thus, to say that f 0(p) = 3 means that if we take any neighborhood U of 3, say the interval (1; 5), then the ratio f (w) − f (p) w − p falls inside U when w is close enough to p, i.e. in some neighborhood of p. (Of course, we can’t let w be equal to p, because of the w − p in the denominator.) In particular, f (w) − f (p) > 0 if w is close enough to p, but 6= p. w − p Derivative So if f 0(p) = 3 then the ratio f (w) − f (p) w − p lies in (1; 5) when w is close enough to p, i.e. in some neighborhood of p, but not equal to p. Derivative So if f 0(p) = 3 then the ratio f (w) − f (p) w − p lies in (1; 5) when w is close enough to p, i.e. in some neighborhood of p, but not equal to p. In particular, f (w) − f (p) > 0 if w is close enough to p, but 6= p. w − p • when w > p, but near p, the value f (w) is > f (p). • when w < p, but near p, the value f (w) is < f (p). Derivative From f 0(p) = 3 we found that f (w) − f (p) > 0 if w is close enough to p, but 6= p.
    [Show full text]
  • MATH 162: Calculus II Differentiation
    MATH 162: Calculus II Framework for Mon., Jan. 29 Review of Differentiation and Integration Differentiation Definition of derivative f 0(x): f(x + h) − f(x) f(y) − f(x) lim or lim . h→0 h y→x y − x Differentiation rules: 1. Sum/Difference rule: If f, g are differentiable at x0, then 0 0 0 (f ± g) (x0) = f (x0) ± g (x0). 2. Product rule: If f, g are differentiable at x0, then 0 0 0 (fg) (x0) = f (x0)g(x0) + f(x0)g (x0). 3. Quotient rule: If f, g are differentiable at x0, and g(x0) 6= 0, then 0 0 0 f f (x0)g(x0) − f(x0)g (x0) (x0) = 2 . g [g(x0)] 4. Chain rule: If g is differentiable at x0, and f is differentiable at g(x0), then 0 0 0 (f ◦ g) (x0) = f (g(x0))g (x0). This rule may also be expressed as dy dy du = . dx x=x0 du u=u(x0) dx x=x0 Implicit differentiation is a consequence of the chain rule. For instance, if y is really dependent upon x (i.e., y = y(x)), and if u = y3, then d du du dy d (y3) = = = (y3)y0(x) = 3y2y0. dx dx dy dx dy Practice: Find d x d √ d , (x2 y), and [y cos(xy)]. dx y dx dx MATH 162—Framework for Mon., Jan. 29 Review of Differentiation and Integration Integration The definite integral • the area problem • Riemann sums • definition Fundamental Theorem of Calculus: R x I: Suppose f is continuous on [a, b].
    [Show full text]
  • Generalizing the Mean Value Theorem -Taylor's Theorem
    Generalizing the Mean Value Theorem { Taylor's theorem We explore generalizations of the Mean Value Theorem, which lead to error estimates for Taylor polynomials. Then we test this generalization on polynomial functions. Recall that the mean value theorem says that, given a continuous function f on a closed interval [a; b], which is differentiable on (a; b), then there is a number c in (a; b) such that f(b) − f(a) f 0(c) = : b − a Rearranging terms, we can make this look very much like the linear approximation for f(b) using the tangent line at x = a: f(b) = f(a) + f 0(c)(b − a) except that the term f 0(a) has been replaced by f 0(c) for some point c in order to achieve an exact equality. Remember that the Mean Value Theorem only gives the existence of such a point c, and not a method for how to find c. We understand this equation as saying that the difference between f(b) and f(a) is given by an expression resembling the next term in the Taylor polynomial. Here f(a) is a \0-th degree" Taylor polynomial. Repeating this for the first degree approximation, we might expect: f 00(c) f(b) = �f(a) + f 0(a)(b − a) � + (b − a)2 2 for some c in (a; b). The term in square brackets is precisely the linear approximation. Question: Guess the formula for the difference between f(b) and its n-th order Taylor polynomial at x = a. Test your answer using the cubic polynomial f(x) = x3 + 2x + 1 using a quadratic approximation for f(3) at x = 1.
    [Show full text]
  • Lecture 5 : Continuous Functions Definition 1 We Say the Function F Is
    Lecture 5 : Continuous Functions Definition 1 We say the function f is continuous at a number a if lim f(x) = f(a): x!a (i.e. we can make the value of f(x) as close as we like to f(a) by taking x sufficiently close to a). Example Last day we saw that if f(x) is a polynomial, then f is continuous at a for any real number a since limx!a f(x) = f(a). If f is defined for all of the points in some interval around a (including a), the definition of continuity means that the graph is continuous in the usual sense of the word, in that we can draw the graph as a continuous line, without lifting our pen from the page. Note that this definition implies that the function f has the following three properties if f is continuous at a: 1. f(a) is defined (a is in the domain of f). 2. limx!a f(x) exists. 3. limx!a f(x) = f(a). (Note that this implies that limx!a− f(x) and limx!a+ f(x) both exist and are equal). Types of Discontinuities If a function f is defined near a (f is defined on an open interval containing a, except possibly at a), we say that f is discontinuous at a (or has a discontinuiuty at a) if f is not continuous at a. This can happen in a number of ways. In the graph below, we have a catalogue of discontinuities. Note that a function is discontinuous at a if at least one of the properties 1-3 above breaks down.
    [Show full text]
  • Week 3 Quiz: Differential Calculus: the Derivative and Rules of Differentiation
    Week 3 Quiz: Differential Calculus: The Derivative and Rules of Differentiation SGPE Summer School 2016 Limits Question 1: Find limx!3f(x): x2 − 9 f(x) = x − 3 (A) +1 (B) -6 (C) 6 (D) Does not exist! (E) None of the above x2−9 (x−3)(x+3) Answer: (C) Note the the function f(x) = x−3 = x−3 = x + 3 is actually a line. However it is important to note the this function is undefined at x = 3. Why? x = 3 requires dividing by zero (which is inadmissible). As x approaches 3 from below and from above, the value of the function f(x) approaches f(3) = 6. Thus the limit limx!3f(x) = 6. Question 2: Find limx!2f(x): f(x) = 1776 (A) +1 (B) 1770 (C) −∞ (D) Does not exist! (E) None of the above Answer: (E) The limit of any constant function at any point, say f(x) = C, where C is an arbitrary constant, is simply C. Thus the correct answer is limx!2f(x) = 1776. Question 3: Find limx!4f(x): f(x) = ax2 + bx + c (A) +1 (B) 16a + 4b + c (C) −∞ (D) Does not exist! (E) None of the above 1 Answer: (B) Applying the rules of limits: 2 2 limx!4ax + bx + c = limx!4ax + limx!4bx + limx!4c 2 = a [limx!4x] + blimx!4x + c = 16a + 4b + c Question 4: Find the limits in each case: (i) lim x2 x!0 jxj (ii) lim 2x+3 x!3 4x−9 (iii) lim x2−3x x!6 x+3 2 Answer: (i) lim x2 = lim (jxj) = lim j x j= 0 x!0 jxj x!0 jxj x!0 (ii) lim 2x+3 = 2·3+3 = 3 x!3 4x−9 4·3−9 (iii) lim x2−3x = 62−3·6 = 2 x!6 x+3 6+3 Question 5: Show that lim sin x = 0 (Hint: −x ≤ sin x ≤ x for all x ≥ 0.) x!0 Answer: Given hint and squeeze theorem we have lim −x = 0 ≤ lim sin x ≤ 0 = lim x hence, x!0 x!0 x!0 lim sin x = 0 x to0 Question 6: Show that lim x sin( 1 ) = 0 x!0 x 1 Answer: Note first that for any real number t we have −1 ≤ sin t ≤ 1 so −1 ≤ sin( x ) ≤ 1.
    [Show full text]
  • Tensor Products with Bounded Continuous Functions
    New York Journal of Mathematics New York J. Math. 9 (2003) 69–77. Tensor products with bounded continuous functions Dana P. Williams b b Abstract. We study the natural inclusions of C (X) ⊗ A into C (X, A) and Cb X, Cb(Y ) into Cb(X × Y ). In particular, excepting trivial cases, both these maps are isomorphisms only when X and Y are pseudocompact. This implies a result ofGlicksberg showing that the Stone- Cechˇ compactificiation β(X × Y ) is naturally identified with βX × βY ifand only if X and Y are pseudocompact. Contents 1. Main results 70 2. Glicksberg’s theorem 72 3. Pseudocompact spaces 74 4. Proof of Theorem 3 75 References 77 Suppose that X is a locally compact Hausdorff space and that A is a C∗-algebra. The collection Cb(X, A) of bounded continuous A-valued functions on X is a C∗- algebra with respect to the supremum norm. (When A = C, we write simply Cb(X)). Elements in the algebraic tensor product Cb(X)A will always be viewed as functions in Cb(X, A), and the supremum norm on Cb(X, A) restricts to a C∗- b b norm on C (X) A. Thus we obtain an injection ι1 of the completion C (X) ⊗ A into Cb(X, A): b b ι1 : C (X) ⊗ A→ C (X, A), and we can identify Cb(X) ⊗ A with a subalgebra of Cb(X, A). It is one of the fundamental examples in the theory that ι1 is an isomorphism in the case that X is compact [8, Proposition B.16], and we want to investigate the general case here.
    [Show full text]