Arxiv:2011.09194V1 [Math.OC]

Noname manuscript No. (will be inserted by the editor) Lagrangian duality for nonconvex optimization problems with abstract convex functions Ewa M. Bednarczuk · Monika Syga Received: date / Accepted: date Abstract We investigate Lagrangian duality for nonconvex optimization problems. To this aim we use the Φ-convexity theory and minimax theorem for Φ-convex functions. We provide conditions for zero duality gap and strong duality. Among the classes of functions, to which our duality results can be applied, are prox-bounded functions, DC functions, weakly convex functions and paraconvex functions. Keywords Abstract convexity · Minimax theorem · Lagrangian duality · Nonconvex optimization · Zero duality gap · Weak duality · Strong duality · Prox-regular functions · Paraconvex and weakly convex functions 1 Introduction Lagrangian and conjugate dualities have far reaching consequences for solution methods and theory in convex optimization in finite and infinite dimensional spaces. For recent state-of the-art of the topic of convex conjugate duality we refer the reader to the monograph by Radu Bot¸[5]. There exist numerous attempts to construct pairs of dual problems in nonconvex optimization e.g., for DC functions [19], [34], for composite functions [8], DC and composite functions [30], [31] and for prox-bounded functions [15]. In the present paper we investigate Lagrange duality for general optimization problems within the framework of abstract convexity, namely, within the theory of Φ-convexity. The class Φ-convex functions encompasses convex l.s.c. Ewa M. Bednarczuk Systems Research Institute, Polish Academy of Sciences, Newelska 6, 01–447 Warsaw Warsaw University of Technology, Faculty of Mathematics and Information Science, ul. arXiv:2011.09194v1 [math.OC] 18 Nov 2020 Koszykowa 75, 00–662 Warsaw, Poland E-mail: [email protected] Monika Syga Warsaw University of Technology, Faculty of Mathematics and Information Science, ul. Koszykowa 75, 00–662 Warsaw, Poland E-mail: [email protected] 2 Ewa M. Bednarczuk, Monika Syga functions, and, among others, the classes of functions mentioned above. A comprehensive study of Φ-convex functions can be found in the monographs by Pallaschke and Rolewicz [20] and by Rubinow [29]. The set Φ is a set of functions defined on a given space X, called elementary functions. Φ-convex functions are pointwise suprema of elementary minorizing functions ϕ ∈ Φ (e.g. quadratic, quasi-convex). This corresponds to the classical fact that proper lower semicontinuous convex functions are pointwise suprema of affine minorizing functions. Φ-convexity provides a uni- fying framework for dealing with important classes of nonconvex functions, e.g, paraconvex (weakly convex), DC, and prox-bounded functions. In the context of duality theory Φ-convexity is investigated in a large number of papers, e.g. [1], [3], [12], [16], [25]. The underlying concepts of Φ-convexity are Φ-conjugation and Φ-subdifferen- tiation, which mimic the corresponding constructions of convex analysis, i.e., the Φ-conjugate function and the Φ-subdifferential are defined by replacing, in the respective classical definitions, the linear (affine) functions with elementary functions ϕ ∈ Φ which may not be affine, in general. This motivates the name Convexity without linearity, coined for Φ-convexity by Rolewicz [25]. The subject of the present investigation is the Lagrangian duality for optimization problems involving Φ-convex functions. In particular, we investigate Lagrangian duality for problems involving Φlsc-convex functions (see (1)), i.e., proper lower semicontinuous functions defined on a Hilbert space and minorized by quadratic functions. The class of Φlsc-convex functions embodies many important classes of functions appearing in optimization, e.g. prox-bounded functions [23], DC (difference of convex) functions [35], weakly convex functions [36], paraconvex functions [27] and lower semicontinuous convex (in the classical sense) functions. Within the framework of Φ-convexity, the Lagrange duality has been al- ready investigated on different levels of generality by [9], [14], [21]. Main contributions of the paper are as follows. (i) Theorem 3 provides necessary and sufficient condition for zero duality gap for the pair of Lagrange dual problems LP and LD with the Lagrangian L satisfying some Φ-convexity assumptions, where Φ is any class of elementary functions. This condition, called the intersection property is defined in Definition 4. See also [32]. (ii) Theorem 6 and Theorem 7 provide conditions for zero duality gap for the pair of Lagrange dual problems LP and LD with the Lagrangian L satisfying some Φlsc-convexity assumptions, where the class Φlsc is defined in Example 1.2. (iii) Theorem 8 provides sufficient conditions for the strong duality for problems, where the optimal value function V (see the formula (14)) is paraconvex (weakly convex), and int domV 6= ∅. Title Suppressed Due to Excessive Length 3 (iv) Theorem 9 shows that for constrained optimization problems with finite optimal values, Ψlsc-convex perturbation functions and Φlsc-convex Lagrangians, our main condition i.e. intersection property is satisfied. The organization of the paper is as follows. In Section 2 we recall basic concepts of Φ-convexity. We close Section 2 with a minimax theorem from [32] which is the starting point for our investigations. In Section 3 we introduce the Lagrange function and we discuss the Lagrangian duality for optimization problems involving Φ-convex functions (Theorem 3), we also deliver conditions for the strong duality (Theorem 4). In Section 4 and Section 5 we discuss our duality scheme in the class of Φconv-convex functions nad Φlsc-convex functions, respectively. In Section 6 we discuss our duality scheme for for a particular form of optimization problem. 2 Φ-convexity Let X be a set. A function f : X → Rˆ := R ∪{−∞}∪{+∞} is proper if its domain dom f = {x ∈ X | f(x) < +∞} 6= ∅ and f(x) > −∞ for any x ∈ X. Let Φ be a set of real-valued functions ϕ : X → R and f : X → Rˆ. The set suppΦ(f) := {ϕ ∈ Φ : ϕ ≤ f} is called the support of f with respect to Φ, where, for any g,h : X → Rˆ, g ≤ h ⇔ g(x) ≤ h(x) ∀ x ∈ X. We will use the notation supp(f) whenever the class Φ is clear from the context. Elements of class Φ are called elementary functions. Definition 1 ([12], [20], [29]) A function f : X → Rˆ is called Φ-convex on X if f(x) = sup{ϕ(x): ϕ ∈ supp(f)} ∀ x ∈ X. If the set X is clear from the context, we simply say that f is Φ-convex. A function f : X → Rˆ is called Φ-convex at x0 ∈ X if f(x0) = sup{ϕ(x0): ϕ ∈ supp(f)}. We have supp(f)= ∅ iff f ≡ −∞. By convention, we consider that the function f ≡ −∞ is Φ-convex (c.f. [29]). A Φ-convex function f : X → Rˆ is proper if supp(f) 6= ∅ and the domain of f is nonempty, i.e. dom(f) := {x ∈ X : f(x) < +∞} 6= ∅. If X is a topological space and a given class Φ consists of functions ϕ : X → R which are lower semicontinuous on X, then Φ-convex functions are lower semicontinuous on X ([37]). Note that Φ-convex functions defined above may admit the value +∞ which allows us to consider indicator functions within the framework of Φ-convexity. 4 Ewa M. Bednarczuk, Monika Syga Analogously, we say that f : X → R¯ is Φ-concave on X if f(x) = inf{ϕ(x): ϕ ∈ Φ, f ≤ ϕ}. Clearly, f is Φ-concave on X iff −f is −Φ-convex on X. The following classes of elementary functions are considered. Example 1 1. ∗ Φconv := {ϕ : X → R, ϕ(x)= hℓ, xi + c, x ∈ X, ℓ ∈ X , c ∈ R}, where X is a topological vector space, X∗ is the dual space to X. It is a well known fact (see for example Proposition 3.1 of [13]) that a proper convex lower semicontinuous function f : X → R¯ is Φconv-convex. For the analysis of the class Φ which generates all convex functions [32]. 2. 2 ∗ Φlsc := {ϕ : X → R, ϕ(x)= −akxk +hℓ, xi+c, x ∈ X, ℓ ∈ X , a ≥ 0, c ∈ R}, (1) where X is a normed space. If X is a Hilbert space, then f : X → R¯ is Φlsc- convex iff f is lower semicontinuous and minorized by a quadratic function 2 q(x) := −akxk −c on X (e.g. [29], Example 6.2). The class of Φlsc- convex functions encompasses: prox-bounded functions [22] and weakly convex functions [36], known also under the name paraconvex functions [27] and semiconvex functions [11]. 3. + R 2 ∗ R Φlsc := {ϕ : X → , ϕ(x)= akxk +hℓ, xi+c, x ∈ X, ℓ ∈ X , a ≥ 0, c ∈ }, where X is a normed space. If X is a Hilbert space, then f : X → R¯ is + Φlsc-concave iff f is upper semicontinuous and majorized by a quadratic function q(x) := akxk2 − c on X ([29], Example 6.2). 2.1 Φ-Subgradients Definition 2 An element ϕ ∈ Φ is called a Φ-subgradient of a function f : X → R¯ atx ¯ ∈ dom f, if the following inequality holds f(x) − f(¯x) ≥ ϕ(x) − ϕ(¯x) ∀ x ∈ X. (2) The set of all Φ-subgradients of f atx ¯ is denoted as ∂Φf(¯x). Let ε > 0. An element ϕ ∈ Φ is called a Φ − ε-subgradient of a function f : X → R¯ atx ¯ ∈ dom f, if the following inequality holds f(x) − f(¯x) ≥ ϕ(x) − ϕ(¯x) − ε ∀ x ∈ X. (3) ε The set of all Φ − ε-subgradients of f atx ¯ is denoted as ∂Φf(¯x). Title Suppressed Due to Excessive Length 5 Definition 2 was introduced in [20] and [29]. Definition 3 An element (a, v) ∈ R+ × X is called a Φlsc-subgradient of a function f : X → R¯ atx ¯ ∈ dom f, if the following inequality holds f(x) − f(¯x) ≥ hv, x − x¯i− akxk2 + akx¯k2, ∀ x ∈ X.

Arxiv:2011.09194V1 [Math.OC]

Boosting Algorithms for Maximizing the Soft Margin

On the Dual Formulation of Boosting Algorithms

Lagrangian Duality in Convex Optimization

Applications of LP Duality

Duality Theory and Sensitivity Analysis

Duality Gap Estimation Via a Refined Shapley--Folkman Lemma | SIAM

On the Ekeland Variational Principle with Applications and Detours

Subdifferentiability and the Duality

A Perturbation View of Level-Set Methods for Convex Optimization

Lagrangian Duality and Perturbational Duality I ∗

Bounding the Duality Gap for Problems with Separable Objective

Arxiv:2001.06511V2 [Math.OC] 16 May 2020 R