SIAG/OPT Views and News 24(1)

Total Page:16

File Type:pdf, Size:1020Kb

SIAG/OPT Views and News 24(1) SIAG/OPT Views and News A Forum for the SIAM Activity Group on Optimization Volume 24 Number 1 October 2016 Contents Article Article Evolution of randomness in optimization methods for super- vised machine learning Katya Scheinberg . .1 Evolution of randomness in In Memoriam optimization methods for supervised Jonathan Borwein Adrian Lewis . .8 machine learning Roger Fletcher Sven Leyffer . 10 Katya Scheinberg Christodoulos Achilleus Floudas Industrial and Systems Engineering Depart- Ruth Misener . 12 ment Bulletin Lehigh University Event Announcements ..................................16 Bethlehem, PA Book Announcements .................................. 17 USA Other Announcements [email protected] Howard Rosenbrock Prize 2015 . 18 http://coral.ise.lehigh.edu/katyas Combinatorial Scientific Computing Best Paper Prize 19 2016 SIAM Fellows Announced . .19 Chair's Column 1 Introduction Juan Meza . 20 Machine learning (ML) as a field is relatively new. Its be- Comments from the Editors ginning is usually attributed to the seminal work of Vapnik Jennifer Erway & Stefan M. Wild . 20 [1] from 1995, but it has grown at exponential speed in the past 20 years. One can say that ML is a combination of learning theory, statistics, and optimization and as such has Are you receiving this influenced all three fields tremendously. ML models, while by postal mail? virtually unknown in the optimization community in 1995, are one of the key topics of continuous optimization today. Do you prefer elec- This situation is ironic, in fact, because after Vapnik's work the core of ML models shifted to support vector ma- tronic delivery? chines (SVMs){a well-defined and nice problem that is a con- vex QP with a simple constraint set. On the other hand, today, the optimization problem at the core of many ML models is no longer as clearly defined and is not usually as nice. The main focus of our article is the evolution of the central optimization problem in ML and how different funda- mental assumptions result in different problem settings and algorithmic choices, from deterministic to stochastic. We will consider the early SVM setting, which lasted for about a decade, where the problem was considered to be determin- istic. In the following decade the perspective on ML prob- lems shifted to stochastic optimization. And, over the past couple of years, the perspective has begun changing once Email [email protected]. more: The problem is again considered to be deterministic, gov today to opt out of paper V&N! but stochastic (or perhaps we should say randomized) algo- rithms are often deemed as the methods of choice. We will 2 SIAG/OPT Views and News outline this evolution and summarize some of the key algo- distribution. This quantity is known as the expected risk and rithmic developments as well as outstanding questions. has a clear interpretation as a measure of the quality of a This article is based on one optimizer's perspective of the classifier. All that remains is to find a classifier that mini- development of the field in the past 15 years and is inevitably mizes (1)! somewhat biased. Moreover, the field of optimization in ma- Minimizing (1) is a stochastic optimization problem with chine learning is vast and quickly growing. Hence, this article unknown distribution. Such problems have been the focus of is by no means comprehensive. Most references are omitted, simulation optimization and sample average approximation results are oversimplified, and the technical content is meant techniques for a few decades [3{6]. Since the expectation to be simple and introductory. For a much more comprehen- in (1) cannot be computed, it is replaced by a sample av- sive and technically sound recent review, we refer readers to, erage. Here, there is an important assumption to be made for example, [2]. that will dictate which optimization approaches are viable: Can we sample from fX ; Yg indefinitely during the optimiza- 2 Optimization models in machine learning tion process, or are we limited to a fixed set of sample data Optimization problems arise in machine learning in the de- that is available a priori? The second option is a more prac- termination of prediction functions for some (unknown) set tical assumption and indeed is usually the preferred choice of data-label pairs fX ; Yg, where X ⊂ Rd and Y may con- in machine learning and dictates the difference between the tain binary numbers, real numbers, or vectors. Specifically, optimization in ML and the traditional field of stochastic op- given an input vector x 2 Rd, one aims to determine a func- timization. As we will see later, many optimization methods tion such that when the function is evaluated at x, one ob- for ML are theoretically justified under the assumption of tains the best prediction of its corresponding label y with sampling indefinitely while being applied under the scenario (x; y) 2 fX ; Yg. In supervised learning, one determines of a fixed set of sample data. such a function by using the information contained in a Assume that we are given a data set fX; Y g: X = set fX; Y g of n known and labeled samples, that is, pairs fx1; x2; : : : ; xng ⊂ Rd and Y = fy1; y2; : : : ; yng ⊂ f+1; −1g, i i (xi; yi) 2 fX ; Yg for i 2 f1; : : : ; ng. Restricting attention to with each data point (x ; y ) selected independently at ran- a family of functions fp(w; x): w 2 W ⊆ Rpg parameterized dom from fX ; Yg. We consider a sample average approxi- by the vector w, one aims to find w∗ such that the value mation of (1): p(w∗; x) that results from any given input x 2 X is best at ^ 1 Pn predicting the appropriate label y corresponding to x. For f01(w) = i=1`01(p(w; xi); yi): (2) simplicity we will focus our attention on binary classifica- n tion, where labels y can take values 1 and −1. Note that the This is often referred to as the empirical risk of w. predictors are often called classifiers. We now have a pair of optimization problems: the expected Our aim is to define an optimization problem to find the risk minimization (the problem we actually want to solve), best classifier/predictor; however, we first need to define the measure of quality by which classifiers are compared. The w∗ = arg minw2W f01(w) = E(x;y)∼{X ;Yg[`01(p(w; x); y)]; quality (or lack thereof) of a classifier is typically measured (3) by its loss or prediction error. Given a classifier w, a func- and the empirical risk minimization (the problem we can tion p(w; x) 2 R, and a particular data instance (x; y), the solve), classical 0 − 1 loss is defined as ^ 1 Pn w^ = arg minw2W f01(w) = `01(p(w; xi); yi); (4) ( n i=1 0 if yp(w; x) > 0 p `01(p(w; x); y) = where W ⊂ R . An important question of ML, considered by 1 if yp(w; x) ≤ 0: learning theory, is how the empirical risk minimizerw ^ com- pares with the expected risk minimizer w . A vast amount of Thus, this loss outputs a 0 if the classifiers made a correct ∗ work has been done on bounding these quantities in learning prediction or 1 if it made a mistake. Note that ` (p(w; x); y) 01 theory; see, for example, [7{9]. Here we present simplified is not a convex function of w regardless of the form of p(w; x). bounds, which help us convey the key dependencies. Given a Since the data for which the prediction is needed is not sample set fX; Y g drawn randomly and independently from known ahead of time, we assume that (x; y) is randomly the distribution fX ; Yg, with probability at least 1 − δ, the selected from fX ; Yg according to some unknown distribu- following generalization bound holds for any w: tion.1 The quality of a classifier w can be measured as 0s 1 W + log( 1 ) f01(w) = E(x;y)∼{X ;Yg[`01(p(w; x); y)] (1) jf^ (w) − f (w)j ≤ O δ : (5) 01 01 @ n A = P(x;y)∼{X ;Ygfyp(w; x) > 0g; which is the probability that p(w; x) gives the correct pre- Moreover, forw ^ and w∗, the following esimation bound holds: diction on data if the data is randomly selected from the 0s 1 1 W + log( δ ) 1For notational simplicity, fX ; Yg will hence denote both the distri- jf01(w ^) − f01(w∗)j ≤ O @ A ; (6) bution of (x; y) and the set of all pairs (x; y), according to the context. n Volume 24 Number 1 { October 2016 3 where W 2 < measures the complexity of the set of all possi- to solve), and ble classifiers produced by p(w; ·) for w 2 W . The first bound (5) ensures that at the optimal solutionw ^, the expected risk ^ 1 Pn w^ = arg minw2W f(w) = i=1`(p(w; xi); yi); (8) represents the empirical risk whenever the right-hand side of n (5) is small. The second bound (6) ensures that the qual- called the empirical loss minimization (the problem that can ity of solution of (4) is almost as good as that of the best be solved). Fortunately, one can establish bounds similar classifier from W . Thus, to learn “effectively," we need to to (5) and (6) for f, w∗, andw ^ defined above using convex balance the complexity W and the size of the data set to keep loss functions. The difference between the case of the 0 − 1 the right-hand sides of (5) and (6) small. For instance, in a loss we considered previously and the Lipschitz-continuous \big data" setting, when n is very large, very complex mod- loss is the complexity measure, which is used in place of W.
Recommended publications
  • FOCAPD 2014) Innovation for Sustainability at the Interfaces of Process and Product Design JULY 13-17, 2014 U SUNCADIA RESORT U CLE ELUM, WASHINGTON U
    8TH INTERNATIONAL CONFERENCE ON FOUNDATIONS OF COMPUTER-AIDED PROCESS DESIGN FOCAPDJULY 13-17, 2014 u SUNCADIA RESORT u CLE2014 ELUM, WASHINGTON INNOVATION FOR SUSTAINABILITY AT THE INTERFACES OF PROCESS & PRODUCT DESIGN www.focapd.org FOUNDATIONS OF COMPUTER-AIDED PROCESS DESIGN (FOCAPD 2014) Innovation for Sustainability at the Interfaces of Process and Product Design JULY 13-17, 2014 u SUNCADIA RESORT u CLE ELUM, WASHINGTON u WWW.FOCAPD.ORG CONFERENCE CHAIRS SCOPE OF FOCAPD 2014 MARIO R. EDEN The Foundations of Computer-Aided Process Design Auburn University (FOCAPD) is the premier international conference focusing JOHN D. SIIROLA exclusively on the fundamentals and applications of Sandia National Laboratories computer-aided design for the process industries. Held GAVIN P. TOWLER every five years, FOCAPD 2014 is the eighth in this Honeywell/UOP series, and brings together researchers, educators and practitioners to identify new challenges and opportunities CONFERENCE MANAGER for process and product design. The chemical industry is entering a new phase of rapid evolution. The availability of ROBIN CRAVEN low-cost feedstocks from natural gas is causing renewed [email protected] investment in basic chemicals, while societal pressures for sustainability and energy security continue to be key drivers in technology development and product selection. This dynamic environment creates opportunities to launch new products and processes and to demonstrate new methodologies for innovation, synthesis and design. FOCAPD 2014 will foster constructive interaction among thought leaders from academia, industry, and government and provide a showcase for the latest research in product/ process design. SUNCADIA RESORT FOCAPD 2014 will be located at the Suncadia Resort on the sunny eastern slopes of the Cascades, 85 miles from Seattle, Washington.
    [Show full text]
  • Innovation for Sustainability at the Interfaces of Process & Product Design
    8TH INTERNATIONAL CONFERENCE ON FOUNDATIONS OF COMPUTER-AIDED PROCESS DESIGN FOCAPDJULY 13-17, 2014 u SUNCADIA RESORT u CLE ELUM,2014 WASHINGTON INNOVATION FOR SUSTAINABILITY AT THE INTERFACES OF PROCESS & PRODUCT DESIGN www.focapd.org WELCOME TO FOCAPD 2014 It is our great pleasure and privilege to welcome We hope that you will be inspired by the presen- you to the 8th International Conference on the tations and discussions on the latest research in Foundations of Computer-Aided Process Design product and process design. Delegates from 23 (FOCAPD 2014), held at the Suncadia Resort in countries have traveled to Cle Elum to participate Cle Elum, Washington, USA, July 13-17, 2014. in FOCAPD 2014 and include 35 representatives The FOCAPD series are the premier international from industry and government. conferences focusing exclusively on the fundamen- We are indebted to our conference manager, tals and applications of computer-aided design for Mrs. Robin Craven, along with the staff at the process industries. Suncadia Resort for all of their hard work. We DR. MARIO R. EDEN Held every five years, FOCAPD 2014 is the eighth would also like to acknowledge Mr. Brennen Auburn University in this series, and brings together researchers, edu- Reece of Auburn University for his outstanding cators and practitioners to identify new challenges work in designing and developing all promotional and opportunities for process and product design. materials for the conference including the website, In the context of the conference theme “Innovation flyers, name badges, and this program book. for Sustainability at the Interfaces of Process Additionally, we appreciate the support from the and Product Design” we solicited papers that CACHE Corporation and the AIChE Computing and emphasize the impact of computer-aided design Systems Technology (CAST) Division.
    [Show full text]
  • 18Th European Symposium on Computer Aided Process
    18 TH EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING JUNE 1-4, 2008 CENTRE DE CONGRÈS DE LYON , FRANCE CAPE WP Organised by: FINAL PROGRAMME IFP Supported by: An Event of the CAPE Working LGC - ENSIACET Party of the European Federation On behalf of: of Chemical Engineering Société de Chimie Industrielle Event n° 667 Société Française de Génie des Procédés www.escape18.org CONTENTS SPONSORED BY Introduction . 1 General Information . 2 Committees . 3 Symposium Programme . 4 Synopsis . 4 Sunday 1 June . 8 Monday 2 June . 9 Tuesday 3 June . 15 Wednesday 4 June . 21 Thursday 5 June . 26 List of Posters . 27 Floor Plan, List of Exhibitors . 47 List of Hotels . 48 Access Maps . 49 SYMPOSIUM SECRETARIAT AND ORGANISATION ESCAPE 18 c/o COLLOQUIUM 12 rue de la Croix-Faubin 75011 Paris - France Tel: +33 (0)1 44 64 15 15 Fax: +33 (0)1 44 64 15 16 e-mail: [email protected] ESCAPE 18 WELCOME AT ESCAPE 18 18 TH EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING The main theme for ESCAPE 18 is CAPE for the Users! CAPE systems are to be put in the hands of end users who need functionality and assistance beyond the core scientific and technological capacities of the systems. User-friendliness, online or web-based advice, decision support, knowledge management and organisational issues are important points that must be taken into account when deploying a CAPE system. These issues will be addressed in a special track and industrial case studies illustrating CAPE methods and tools are encouraged. The other six main topics cover the usual scope of ESCAPE 18 conferences, from off-line design to on-line systems, including numerical and computational methods, integrated and multiscale modelling, societal aspects, and CAPE education.
    [Show full text]
  • Scheduling in the Process Industry
    Gesellschaft für Operations Research e.V. __________________________________________________________________________________________________________ Am Steinknapp 14 b, 44795 Bochum, Tel.: 0234 / 46 22 46, Fax: 0234 / 46 22 45 GOR-Arbeitsgruppe: Praxis der Mathematischen Optimierung Prof. Dr. Josef Kallrath, Am Mahlstein 8 D-67273 Weisenheim am Berg Tel: +49 172 747-0689 Fax: +49 1212/5-197197-29 E-mail: [email protected] Herewith we would like to invite you to the joint Workshop of the GOR working groups “Practice of the mathematical optimization” and “Supply Chain Management” in the House of German Physical Society in Bad Honnef (Physikzentrum Bad Honnef, Hauptstr. 5, 53604 Bad Honnef, http://www.pbh.de). This meeting’s topic is Scheduling in the Process Industry The workshop starts on 20.11.2008 at 10:00 and ends on 21.11.2008 at about 17:45 . The working language will be preferably English, since some speakers are expected from abroad. Please note that the participation in a GOR-AG-Workshop for non-members is subject to a registration fee, unless you are a speaker or a host. Please send your confirmation of participation (via e-Mail or Fax is possible) as soon as possible but not later than 10.10.2008. The latest information on the meeting is available on the homepage of the GOR ( http://www.gor-ev.de/ or https://gor.uni- paderborn.de/Members/AG06/ ) under the category Arbeitsgruppen . Yours sincerely, Josef Kallrath & Alexander Lavrov & Herbert Meyr (GOR AG) (ITWM Kaiserslautern) (TU Darmstadt) Vorstand: Bürozeiten: Bankverbindung: Prof. Dr. T. S. Spengler (Vorsitz) Dienstag bis Freitag von 10 bis 13 Uhr Sparkasse Bochum Dipl.-Math.
    [Show full text]
  • Internal Evaluation Report
    HELLENIC REPUBLIC ATHENS UNIVERSITY OF ECONOMICS AND BUSINESS Patission 76, GR-104 34 Athens, Greece • www.dmst.aueb.gr DEPARTMENT OF MANAGEMENT SCIENCE & TECHNOLOGY INTERNAL EVALUATION REPORT UPDATED VERSION SEPTEMBER 2013 Internal Evaluation Report, Department of Management Science and Technology, AUEB TABLE OF CONTENTS Foreword………………………………………………………………………………………………………………… 7 1. The Internal Evaluation Procedure……………………………………………………………………. 8 1.1 Description and Analysis of the Department’s Internal Evaluation Procedure 8 1.2 Positive and Negative Aspects in the Production Process of the Internal Evaluation Report 8 1.3 Suggestions for Improvement of the Quality Evaluation Procedure 8 2. Presentation of the Department………………………………………………………………………… 9 2.1 Geographical Location, Sites and Facilities 9 2.2 Numbers and Growth of the Department 9 2.3 Mission and objectives 10 2.4 Departmental Governance, Structure and Operations 10 3. Study Programs………………………………………………………………………………………………… 12 3-I. Undergraduate Program in Management Science and Technology…………………………………………... 12 3-I.1 How the undergraduate program of studies reflects the objectives of the department and the needs of the society 12 3-I.2 Structure, cohesion and operation of the undergraduate study program 12 3-I.3 Course Assessment 15 3-I.4 International orientation of the Undergraduate Program of Studies 15 3-I.5 Undergraduate internship project 17 3-II. Postgraduate Programs……………………………………………………………………………………………………….. 18 Program 1: MBA International………………………………………………………………………………………………… 18 3-II.1 Title: MBA International 18 3-II.2 Participating Departments 18 3-II.3 Fit of the program of study to departmental and societal needs 18 3-II.4 Structure, organization and cohesion of the Program 19 3-II.5 Examination system 19 3-II.6 Student Selection 20 3-II.7 Financing of the Program 20 3-II.8 International Orientation and Recognition of the Program 20 Program 2: Executive MBA – Master in Business Administration for Executives………………………….
    [Show full text]
  • Floudas – Memorial (October 9Th, 2016)
    Professor Christodoulos A. Floudas – Memorial (October 9th, 2016) Professor Christodoulos Floudas left us for a Final Trip on that sunny afternoon of August 14th – Yes, most of us, we are still waiting … For his car to arrive, for a phone call, for him to appear from the corner … With his big smile, his elegant walk, his spontaneous & intelligent sense of humour and small-talk…. Chris was born in Ioannina, in the north-west mountainous part of Greece called Epirus; He moved to the city of Thessaloniki, the capital of Greek Macedonia, where he earned his first degree in 1982 in chemical engineering from the Aristotle University of Thessaloniki. One thing that not many people know is that - at his early years, he was also a very talented footballer, I mean soccer player – but eventually chose what I would call a more ‘scientific path’. What would have been the equivalent of ‘global optimization’ in the world of soccer? During his undergraduate years in Thessaloniki he met his ‘future’ wife Fotini. Interesting to note here that Fotini - after she met Chris – she decided to study chemistry. Part of the old story & debate between chemists and chemical engineers, I guess. In the summer of 1981, he came here to the US – he completed his PhD in 1985 at Carnegie Mellon University in Pittsburgh, Pennsylvania and in early 1986 [I remember it was a very cold and snowy January day] he joined Princeton University as a young assistant professor, promoted to Associate Professor in 1991 and to Professor in 1994 – where he became the Stephen C.
    [Show full text]