Linear Programming 1 an Introduction to Linear Programming

18.415/6.854 Advanced Algorithms October 1994 Linear Programming Lecturer: Michel X. Goemans 1 An Introduction to Linear Programming Linear programming is a very important class of problems, both algorithmically and combinatorially. Linear programming has many applications. From an algorithmic point-of-view, the simplex was proposed in the forties (soon after the war, and was motivated by military applications) and, although it has performed very well in prac- tice, is known to run in exponential time in the worst-case. On the other hand, since the early seventies when the classes P and NP were defined, it was observed that linear programming is in NP co-NP although no polynomial-time algorithm was known at ∩ that time. The first polynomial-time algorithm, the ellipsoid algorithm, was only dis- covered at the end of the seventies. Karmarkar’s algorithm in the mid-eighties lead to very active research in the area of interior-point methods for linear programming. We shall present one of the numerous variations of interior-point methods in class. From a combinatorial perspective, systems of linear inequalities were already studied at the end of the last century by Farkas and Minkovsky. Linear programming, and especially the notion of duality, is very important as a proof technique. We shall illustrate its power when discussing approximation algorithms. We shall also talk about network flow algorithms where linear programming plays a crucial role both algorithmically and combinatorially. For a more in-depth coverage of linear programming, we refer the reader to [1, 4, 7, 8, 5]. A linear program is the problem of optimizing a linear objective function in the decision variables, x1 ...xn, subject to linear equality or inequality constraints on the xi’s. In standard form, it is expressed as: n Min cjxj (objective function) j=1 subject to: n aijxj = bi, i =1 ...m (constraints) j=1 x 0, j =1 ...n (non-negativity constraints) j ≥ where a , b ,c are given. { ij i j} A linear program is expressed more conveniently using matrices: Ax = b min cT x subject to x 0 ≥ LP-1 where x1 . Rn 1 x = . × ∈ x n b1 . Rm 1 b = . × ∈ b m c1 . Rn 1 c = . × ∈ c n a11 . m n A = .. R × ∈ a mn 2 Basic Terminology Definition 1 If x satisfies Ax = b, x 0, then x is feasible. ≥ Definition 2 A linear program (LP) is feasible if there exists a feasible solution, otherwise it is said to be infeasible. T T Definition 3 An optimal solution x∗ is a feasible solution s.t. c x∗ = min c x : Ax = b, x 0 . { ≥ } T Definition 4 LP is unbounded (from below) if λ R, a feasible x∗ s.t. c x∗ λ. ∀ ∈ ∃ ≤ 3 Equivalent Forms A linear program can take on several forms. We might be maximizing instead of minimizing. We might have a combination of equality and inequality contraints. Some variables may be restricted to be non-positive instead of non-negative, or be unrestricted in sign. Two forms are said to be equivalent if they have the same set of optimal solutions or are both infeasible or both unbounded. 1. A maximization problem can be expressed as a minimization problem. max cT x min cT x ⇔ − 2. An equality can be represented as a pair of inequalities. T ai x bi T ≤ T ⇔ ai x bi ai x = bi T ≥ ai x bi T ≤ ⇔ a x bi − i ≤ − LP-2 3. By adding a slack variable, an inequality can be represented as a combination of equality and non-negativity constraints. aT x b aT x + s = b , s 0. i ≤ i ⇔ i i i i ≥ 4. Non-positivity constraints can be expressed as non-negativity constraints. To express xj 0, replace xj everywhere with yj and impose the condition y 0. ≤ − j ≥ 5. x may be unrestricted in sign. If x is unrestricted in sign, i.e. non-positive or non-negative, everywhre replace + + x by x x−, adding the constraints x , x− 0. j j − j j j ≥ In general, an inequality can be represented using a combination of equality and non-negativity constraints, and vice versa. T T + T Using these rules, min c x s.t. Ax b can be transformed into min c x c x− + + ≥ } − s.t. Ax Ax− Is = b,x , x−,s 0 . The former LP is said to be in canonical − − ≥ } form, the latter in standard form. Conversely, an LP in standard form may be written in canonical form. min cT x s.t. Ax = b, x 0 is equivalent to min cT x s.t. Ax b, Ax b, Ix 0 . ≥ } ≥ − ≥ − ≥ } A b ′ ′ ′ ′ This may be rewritten as A x b , where A = -A and b = -b . ≥ I 0 4 Example Consider the following linear program: x 2 1 ≥ 3x1 x2 0 min x2 subject to − ≥ x1 + x2 6 ≥ x + 2x 0 − 1 2 ≥ The optimal solution is (4, 2) of cost 2 (see Figure 1). If we were maximizing x2 instead of minimizing under the same feasible region, the resulting linear program would be unbounded since x2 can increase arbitrarily. From this picture, the reader should be convinced that, for any objective function for which the linear program is bounded, there exists an optimal solution which is a “corner” of the feasible region. We shall formalize this notion in the next section. LP-3 3x − x21 > 0 x2 (2,4) −x1 + 2x2 > 0 x 1+ x2 > 6 (2,1) x1 > 2 x1 Figure 1: Graph representing primal in example. An example of an infeasible linear program can be obtained by reversing some of the inequalities of the above LP: x1 2 3x x ≤ 0 1 − 2 ≥ x1 + x2 6 x + 2x ≥ 0. − 1 2 ≤ 5 The Geometry of LP Let P = x : Ax = b, x 0 Rn. { ≥ } ⊆ Definition 5 x is a vertex of P if y =0 s.t. x + y, x y P . ∃ − ∈ ′ Theorem 1 Assume min cT x : x P is finite, then x P, a vertex x such that ′ { ∈ } ∀ ∈ ∃ cT x cT x. ≤ Proof: ′ If x is a vertex, then take x = x. If x is not a vertex, then, by definition, y = 0 s.t. x + y, x y P . Since A(x + y)= b and A(x y)= b, Ay = 0. ∃ − ∈ − LP-4 WLOG, assume cT y 0 (take either y or y). If cT y = 0, choose y such that j s.t. y < 0. Since y = 0 and≤ cT y = cT ( y) = 0,− this must be true for either y or ∃y. j − − Consider x + λy, λ > 0. cT (x + λy) = cT x + λcT y cT x, since cT y is assumed ≤ non-positive. Case 1 j such that y < 0 ∃ j As λ increases, component j decreases until x + λy is no longer feasible. Choose λ = min j:yj<0 xj/ yj = xk/ yk. This is the largest λ such that { } x + λy 0. Since Ay ={ 0, A−(x +} λy)= −Ax + λAy = Ax = b. So x + λy P , ≥ ∈ and moreover x + λy has one more zero component, (x + λy)k, than x. Replace x by x + λy. Case 2 y 0 j j ≥ ∀ By assumption, cT y < 0 and x + λy is feasible for all λ 0, since A(x + λy)= Ax + λAy = Ax = b, and x + λy x 0. But cT (x + λy≥)= cT x + λcT y ≥ ≥ → −∞ as λ , implying LP is unbounded, a contradiction. →∞ Case 1 can happen at most n times, since x has n components. By induction on ′ the number of non-zero components of x, we obtain a vertex x . 2 Remark: The theorem was described in terms of the polyhedral set P = x : Ax = b : x 0 . Strictly speaking, the theorem is not true for P = x : Ax{ b . Indeed, such≥ } a set P might not have any vertex. For example, consider{ P ≥= } (x1, x2):0 x2 1 (see Figure 2). This polyhedron has no vertex, since for any x{ P , we have≤ x≤+ y}, x y P , where y = (1, 0). It can be shown that P has a ∈ − ∈ vertex iff Rank(A)= n. Note that, if we transform a program in canonical form into standard form, the non-negativity constraints imply that the resulting matrix A has full column rank, since A Rank -A = n. I Corollary 2 If min cT x : Ax = b, x 0 is finite, There exists an optimal solution, { ≥ } x∗, which is a vertex. Proof: Suppose not. Take an optimal solution. By Theorem 1 there exists a vertex costing no more and this vertex must be optimal as well. 2 Corollary 3 If P = x : Ax = b, x 0 = , then P has a vertex. { ≥ } ∅ LP-5 x2 (0,1) (0,0) x1 Figure 2: A polyhedron with no vertex. Theorem 4 Let P = x : Ax = b, x 0 . For x P , let A be a submatrix of A { ≥ } ∈ x corresponding to j s.t. xj > 0. Then x is a vertex iff Ax has linearly independent columns. (i.e. Ax has full column rank.) 2 2130 2 3 0 Example A = 7321 x = A = 7 2 , and x is a vertex. 1 x 0005 0 0 0 Proof: Show i ii. ¬ → ¬ Assume x is not a vertex. Then, by definition, y = 0 s.t. x + y, x y P . ∃ − ∈ Let Ay be submatrix corresponding to non-zero components of y. As in the proof of Theorem 1, Ax + Ay = b Ay =0. Ax Ay = b ⇒ − Therefore, A has dependent columns since y = 0. y Moreover, x + y 0 ≥ yj = 0 whenever xj =0. x y 0 ⇒ − ≥ Therefore Ay is a submatrix of Ax.

Linear Programming 1 an Introduction to Linear Programming

Boosting Algorithms for Maximizing the Soft Margin

On the Dual Formulation of Boosting Algorithms

Applications of LP Duality

Duality Theory and Sensitivity Analysis

On the Ekeland Variational Principle with Applications and Detours

Lagrangian Duality

Arxiv:2011.09194V1 [Math.OC]

The Idea Behind Duality Lec12p1, ORF363/COS323

Norm to Weak Upper Semicontinuity of Pre Duality

Lecture 11: October 8 11.1 Primal and Dual Problems

Multiobjective Nonlinear Symmetric Duality Involving Generalized Pseudoconvexity

Lagrangian Duality and Convex Optimization