Optimization of Perron Eigenvectors and Applications: from Web Ranking to Chronotherapeutics Olivier Fercoq

Optimization of Perron eigenvectors and applications: from web ranking to chronotherapeutics Olivier Fercoq To cite this version: Olivier Fercoq. Optimization of Perron eigenvectors and applications: from web ranking to chronotherapeutics. Optimization and Control [math.OC]. Ecole Polytechnique X, 2012. English. pastel- 00743187 HAL Id: pastel-00743187 https://pastel.archives-ouvertes.fr/pastel-00743187 Submitted on 18 Oct 2012 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Thèseprésentéepour obtenir le titre de DOCTEUR DE L’ECOLE´ POLYTECHNIQUE Spécialité: Mathématiques appliquées par Olivier Fercoq Optimization of Perron eigenvectors and applications: From web ranking to chronotherapeutics Optimisation de vecteurs propres de Perron et applications : Du référencement de pages web àla chronothérapie Soutenue le 17 septembre 2012 devant le jury composéde : Marianne Akian INRIA Saclay et CMAP Ecole Polytechnique co-directrice Konstantin Avratchenkov INRIA Sophia Antipolis rapporteur Mustapha Bouhtou France TélécomR & D co-directeur Jean Clairambault INRIA Rocquencourt et UniversitéParis 6 président du jury Michel De Lara Ecole des Ponts ParisTech et UniversitéParis-Est examinateur Stéphane Gaubert INRIA Saclay et CMAP Ecole Polytechnique directeur Roberto Tempo Politecnico di Torino examinateur Paul Van Dooren UniversitéCatholique de Louvain rapporteur 2 Summary Search engines play a key role in the World Wide Web. They gather information on the web pages and for each query of a web surfer they give a sorted list of relevant web pages. Internet search engines use a variety of algorithms to sort web pages based on their text content or on the hyperlink structure of the web. Here, we focus on algorithms that use the latter hyperlink structure, called link-based algorithms, among them PageRank, HITS, SALSA and HOTS. The basic notion for all these algorithms is the web graph, which is a digraph with a node for each web page and an arc between nodes i and j if there is a hyperlink from page i to page j. The original problem considered in the present work, carried out as part of a collaboration between INRIA and Orange Labs, is the optimization of the ranking of the pages of a given web site. It consists in finding an optimal outlink strategy maximizing a scalar function of a given ranking subject to design constraints. PageRank, HITS and SALSA are Perron vector rankings, which means that they correspond to the principal eigenvector of a (elementwise) nonnegative matrix. When optimizing the ranking, we thus optimize a scalar utility function of the Perron eigenvector over a set of nonnegative irreducible matrices. The matrix is constructed from the web graph, so controlling the hyperlinks corresponds to controlling the matrix itself. We first study general PageRank optimization problems with a “total income” utility function and design constraints. This case is of particular interest since the value of the PageRank is an acknowledged economic issue. We reduced the PageRank optimization problem to Markov decision problems such that the action sets are implicitly defined as the vertices of polytopes that have a polynomial time separation oracle. We show that such Markov decision problems are solvable in polynomial time and we provide a scalable algorithm for the effective resolution of the PageRank optimization problem on large dataset. Then, we study the general problem of optimizing a scalar utility function of the Perron eigenvector over a set of nonnegative irreducible matrices. This covers all Perron vector rankings, including HITS and SALSA. We show that the matrix of partial derivatives of the objective has a low rank and can be computed by an algorithm with the same convergence properties as the power algorithm used to compute the ranking and so the value of the objective. We give an optimization algorithm that couples power and gradient iterations and prove its convergence to a stationary point of the optimization problem. Considering HOTS 4 as a nonlinear Perron vector, we show that the HOTS algorithm converges with a linear rate of convergence, that the objective of the HOTS optimization problem has a low rank and that the coupled power and gradient algorithm applies. Finally, we extend the domain of application of the Perron eigenvalue and eigenvector optimization methods to the optimization of chemotherapy under the McKendrick model of population dynamics. We consider here that the cells behave differently at an hour of the day or another. We want to take advantage of this feature to minimize the growth rate of cancer cell population while we maintain the growth rate of healthy cell population over a given toxicity threshold. The objective and the constraint can be written as the Floquet eigenvalues of age-structured PDE models with periodic coefficients, and they are approximated by Perron eigenvalues in the discretized problem. We search for locally optimal drug infusion strategies by a method of multipliers, where the unconstrained minimizations are performed using the coupled power and gradient algorithm that we have developed in the context of web ranking optimization. Résumé Les moteurs de recherche jouent un rôleessentiel sur le Web. Ils rassemblent des informations sur les pages web et pour chaque requêted’un internaute, ils donnent une liste ordonnéede pages pertinentes. Ils utilisent divers algorithmes pour classer les pages en fonction de leur contenu textuel ou de la structure d’hyperlien du Web. Ici, nous nous concentrons sur les algorithmes qui utilisent cette structure d’hyperliens, comme le PageRank, HITS, SALSA et HOTS. La notion fondamentale pour tous ces algorithmes est le graphe du web. C’est le graphe orientéqui a un nœud pour chaque page web et un arc entre les nœuds i et j si il y a un hyperlien entre les pages i et j. Le problèmeoriginal considérédans cette thèse,réalisée dans le cadre d’une collaboration entre INRIA et Orange Labs, est l’optimisation du référencement des pages d’un site web donné. Il consiste àtrouver une stratégieoptimale de liens qui maximise une fonction scalaire d’un classement donnésous des contraintes de design. Le PageRank, HITS et SALSA classent les pages par un vecteur de Perron, c’est-à-dire qu’ils correspondent au vecteur propre principal d’une matrice àcoefficients positifs. Quand on optimise le référencement, on optimise donc une fonction scalaire du vecteur propre de Perron sur un ensemble de matrices positives irréductibles. La matrice est construite àpartir du graphe du web, donc commander les hyperliens revient àcommander la matrice elle-même. Nous étudions d’abord un problèmegénérald’optimisation du PageRank avec une fonction d’utilitécorrespondant au revenu total du site et des contraintes de design. Ce cas est d’un intérêtparticulier car pour de nombreux sites la valeur du PageRank est corrélée au chiffre d’affaires. Nous avons réduit le problèmed’optimisation du PageRank àdes problèmesde décisionmarkoviens dont les ensembles d’action sont définis implicitement comme étant les points extrêmesde polytopes qui ont un oracle de séparation polynomial. Nous montrons que de tels problèmesde décisionmarkoviens sont solubles en temps polynomial et nous donnons un algorithme qui passe àl’échelle pour la résolution effective du problèmed’optimisation du PageRank sur de grandes bases de données. Ensuite, nous étudions le problèmegénéralde l’optimisation d’une fonction scalaire du vecteur propre de Perron sur un ensemble de matrices positives irréductibles. Cela couvre tous les classements par vecteur de Perron, HITS et SALSA compris. Nous montrons que la matrice des dérivéespartielles de la fonction objectif a un petit rang et peut êtrecalculée 6 par un algorithme qui a les mêmespropriétésde convergence que la méthode de la puissance utiliséepour calculer le classement. Nous donnons un algorithme d’optimisation qui couple les itérations puissance et gradient et nous prouvons sa convergence vers un point station- naire du problèmed’optimisation. En considérant HOTS comme un vecteur de Perron non linéaire,nous montrons que l’algorithme HOTS converge géométriquement et nous résolvons l’optimisation locale de HOTS. Finalement, nous étendons le domaine d’application des méthodes d’optimisation du vecteur propre et de la valeur propre de Perron àl’optimisation de la chimiothérapie, sous l’hypothèseque les cellules se comportent différemment suivant l’heure de la journée. Nous voulons profiter de cette caractéristique pour minimiser le taux de croissance des cellules cancéreuses tout en maintenant le taux de croissance des cellules saines au dessus d’un seuil de toxicitédonné. L’objectif et la contrainte peuvent s’écrirecomme les valeurs propres de Floquet de modèles d’EDP structurésen âgeavec des coefficients périodiques, qui sont ap- prochéspar des valeurs propres de Perron dans le problèmediscrétisé. Nous cherchons des stratégiesd’injection de médicament localement optimales par une méthode des multiplica- teurs oùles minimisations sans contrainte sont faites en utilisant l’algorithme couplant les itérations puissance et gradient développédans le cadre de l’optimisation du référencement.

Load more