Runtime Multicore Scheduling Techniques for Dispatching Parameterized Signal and Vision Dataflow Applications on Heterogeneous Mpsocs Julien Heulot

Total Page:16

File Type:pdf, Size:1020Kb

Runtime Multicore Scheduling Techniques for Dispatching Parameterized Signal and Vision Dataflow Applications on Heterogeneous Mpsocs Julien Heulot Runtime multicore scheduling techniques for dispatching parameterized signal and vision dataflow applications on heterogeneous MPSoCs Julien Heulot To cite this version: Julien Heulot. Runtime multicore scheduling techniques for dispatching parameterized signal and vi- sion dataflow applications on heterogeneous MPSoCs. Signal and Image processing. INSA deRennes, 2015. English. NNT : 2015ISAR0023. tel-01301642 HAL Id: tel-01301642 https://tel.archives-ouvertes.fr/tel-01301642 Submitted on 12 Apr 2016 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Résumé Abstract Thèse Une tendance importante dans le domaine de l’embarqué est l’intégration An important trend in embedded processing is the integration of de plus en plus d’éléments de calcul dans les systèmes multiprocesseurs increasingly more processing elements into Multiprocessor Systems- sur puce (MPSoC). Cette tendance est due en partie aux limitations des on-Chip (MPSoC). This trend is due in part to limitations in processing puissances individuelles de ces éléments causées par des considérations power of individual elements that are caused by power consumption de consommation d’énergie. Dans le même temps, en raison de leur considerations. At the same time, signal processing applications are sophistication croissante, les applications de traitement du signal ont becoming increasingly dynamic in terms of their hardware resource des besoins en puissance de calcul de plus en plus dynamique. Dans requirements due to the growing sophistication of algorithms to reach la conception et le développement d’applications de traitement de signal higher levels of performance. In design and implementation of multicore multicoeur, l’un des principaux déis consiste à répartir eficacement les signal processing systems, one of the main challenges is to dispatch différentes tâches sur les éléments de calcul disponibles, tout en tenant computational tasks eficiently onto the available processing elements compte des changements dynamiques des fonctionnalités de l’application while taking into account dynamic changes in application functionality and et des ressources disponibles. Une utilisation ineficace peut conduire à une resource requirements. An ineficient use can lead to longer processing 2015 durée de traitement plus longue et/ou une consommation d’énergie plus times and higher energy consumption, making multicore task scheduling élevée, ce qui fait de la répartition des tâches sur un système multicoeur a very dificult problem to solve. une tâche dificile à résoudre. Datalow process network Models of Computation (MoCs) are widely Les modèles de calcul (MoC) lux de données sont communément utilisés used in design of signal processing systems. It decomposes application dans la conception de systèmes de traitement du signal. Ils décomposent la functionality into actors that communicate data exclusively through fonctionnalité de l’application en acteurs qui communiquent exclusivement channels. The interconnection of actors and communication channels is par l’intermédiaire de canaux. L’interconnexion des acteurs et des canaux modeled and manipulated as a directed graph, called a datalow graph. de communication est modélisée et manipulée comme un graphe orienté, There are different datalow MoCs which offer different trade-off between appelé un graphique de lux de données. Il existe différents MoCs de predictability and expressiveness. These MoCs are widely used in design lux de données qui offrent différents compromis entre la prédictibilité et of signal processing systems l’expressivité. Ces modèles de calculs sont communément utilisés dans due to their analyzability and their natural parallel expressivity. la conception de systèmes de traitement du signal en raison de leur analysabilité et leur expressivité naturelle du parallélisme de l’application. In this thesis, we propose a novel scheduling method to address multicore scheduling challenge. This scheduling method determines scheduling Dans cette thèse, une nouvelle méthode de répartition de tâches est decisions strategically at runtime to optimize the overall execution time proposée ain de répondre au déi que propose la programmation of applications onto heterogeneous multicore processing resources. multicoeur. Cette méthode de répartition de tâches prend ses décisions en Applications are described using the Parameterized and Interfaced temps réel ain d’optimiser le temps d’exécution global de l’application. Les Synchronous DataFlow (PiSDF) MoC. The PiSDF model allows describing applications sont décrites en utilisant le modèle paramétrée et interfacé parameterized application, making possible changes in application’s lux de données (PiSDF). Ce modèle permet de décrire une application resource requirement at runtime. At each execution, the parameterized paramétrée en autorisant des changements dans ses besoins en datalow is then transformed into a locally static one used to eficiently ressources de calcul lors de l’exécution. A chaque exécution, le modèle de schedule the application with an a priori knowledge of its behavior. THESE INSA Rennes présentée par lux de données paramétré est déroulé en un modèle intermédiaire faisant The proposed scheduling method have been tested and benchmarked sous le sceau de l’Université européenne de Bretagne apparaitre toute les tâches de l’application ainsi que leurs dépendances. on multiple state-of-the-art applications from computer vision, signal pour obtenir le titre de Julien Heulot processing and multimedia domains. Ce modèle est ensuite utilisé pour répartir eficacement les tâches de DOCTEUR DE L’INSA DE RENNES ECOLE DOCTORALE : MATISSE l’application. La méthode proposé a été testée et validé sur plusieurs Spécialité : Traitement du Signal et de l’Image LABORATOIRE : IETR applications des domaines de la vision par ordinateur, du traitement du signal et du multimédia. Julien HEULOT Thèse soutenue le 24.11.2015 Runtime Multicore devant le jury composé de : Scheduling Techniques Alix MUNIER-KORDON Professeur au LIP6 Paris / Présidente for Dispatching Gilles SASSATELLI Directeur de Recherche au LIRMM, Montpellier / Rapporteur Alain GIRAULT Parameterized Directeur de Recherche à l’INRIA Grenoble / Rapporteur Johan LILIUS Signal and Vision Professeur à l’Abo Akademi (Finlande) / Examinateur John McALLISTER Datalow Applications Maitre de Conférence au Queen’s University of Belfast (UK) / Examinateur Slaheddine ARIDHI on Heterogeneous Docteur, Texas Instruments France / Encadrant Maxime PELCAT MPSoCs Maitre de Conférence à INSA Rennes / Encadrant Jean-François NEZAN N° d’ordre : 15ISAR 27 / D15 - 27 Professeur à l’INSA Rennes / Directeur de thèse Institut National des Sciences Appliquées de Rennes 20, Avenue des Buttes de Coësmes CS 70839 F-35708 Rennes Cedex 7 Tel : 02 23 23 82 00 - Fax : 02 23 23 83 96 Runtime Multicore Scheduling Techniques for Dispatching Parameterized Signal and Vision Dataflow Applications on Heterogeneous MPSoCs Julien Heulot Document protégé par les droits d’auteur Contents Acknowledgements 1 1 Introduction 3 1.1 General Context ................................. 3 1.1.1 Embedded Systems Constraints ..................... 3 1.1.2 Designing an Embedded Systems .................... 4 1.1.3 Embedded Parallel Systems ....................... 4 1.1.4 Dataflow Programming ......................... 5 1.2 Contributions of the Thesis ........................... 5 1.3 Outline ...................................... 5 I Background 7 2 Embedded Parallel Platforms 9 2.1 Introduction .................................... 9 2.2 Parallel Embedded System architecture .................... 10 2.2.1 What is an parallel embedded system ? . 10 2.2.2 Processing Element Types and Platform Heterogeneity . 11 2.2.3 Memory architecture of embedded systems . 12 2.2.3.1 Memory Organization: Shared or Distributed . 12 2.2.3.2 Memory Hierarchy ....................... 12 2.3 Parallel programming methods ......................... 13 2.3.1 Programming Styles ........................... 13 2.3.2 Granularity of Parallelism ........................ 14 2.3.3 Parallelism sources ............................ 15 2.3.4 Parallel Programming techniques .................... 16 2.3.4.1 Multicore scheduling ...................... 16 2.3.4.2 Scheduling approaches ..................... 17 2.3.4.3 An embedded programing runtime: the Open Event Machine 18 2.4 Conclusion ..................................... 19 i ii CONTENTS 3 Dataflow Models of Computation 21 3.1 Introduction .................................... 21 3.2 Dataflow Models of Computation (MoC): an Overview . 22 3.2.1 What is a Model of Computation (MoC) ? . 22 3.2.2 Dataflow MoC Definition ........................ 22 3.2.3 Dataflow MoC Properties ........................ 23 3.3 The Landscape of Dataflow MoCs ........................ 26 3.3.1 Static Dataflow MoCs .......................... 26 3.3.1.1 The Synchronous Dataflow (SDF) MoC . 26 3.3.1.1.1 Consistency and Schedulability of SDF . 27 3.3.1.1.2 A pre-scheduling transformation of SDF graphs . 28 3.3.1.2 The Cyclo-Static Dataflow
Recommended publications
  • Personal Computing, the Notebook Battery Crisis, and Postindustrial
    1 1 <running head>Eisler | Exploding the Black Box 2 <title>Exploding the Black Box 3 <subtitle>Personal Computing, the Notebook Battery Crisis, and Postindustrial 4 Systems Thinking 5 Matthew N. Eisler 6 Matthew N. Eisler studies the relationship between ideology, material practices, and 7 the social relations of contemporary science and engineering at the intersection of energy, 8 environmental, and industrial policy. His first book, Overpotential: Fuel Cells, Futurism, 9 and the Making of a Power Panacea was published by Rutgers University Press in 2012. 10 He has been affiliated with Western University, the Center for Nanotechnology in Society 11 at the University of California at Santa Barbara, the Chemical Heritage Foundation, and 12 the Department of Engineering and Society at the University of Virginia. He is currently a 13 Visiting Assistant Professor in the Department of Integrated Science and Technology at 14 James Madison University. He thanks Jack K. Brown, W. Bernard Carlson, Paul E. 15 Ceruzzi, Michael D. Gordin, Barbara Hahn, Cyrus C.M. Mody, Hannah S. Rogers, and 16 three anonymous referees for their incisive and constructive criticism of earlier drafts of 17 this article. 18 Abstract: 19 Historians of science and technology have generally ignored the role of power 20 sources in the development of consumer electronics. In this they have followed the 21 predilections of historical actors. Research, development, and manufacturing of 22 batteries has historically occurred at a social and intellectual distance from the research, 23 development, and manufacturing of the devices they power. Nevertheless, power source 24 technoscience should properly be understood as an allied yet estranged field of 2 1 electronics.
    [Show full text]
  • Stclang: State Thread Composition As a Foundation for Monadic Dataflow Parallelism Sebastian Ertel∗ Justus Adam Norman A
    STCLang: State Thread Composition as a Foundation for Monadic Dataflow Parallelism Sebastian Ertel∗ Justus Adam Norman A. Rink Dresden Research Lab Chair for Compiler Construction Chair for Compiler Construction Huawei Technologies Technische Universität Dresden Technische Universität Dresden Dresden, Germany Dresden, Germany Dresden, Germany [email protected] [email protected] [email protected] Andrés Goens Jeronimo Castrillon Chair for Compiler Construction Chair for Compiler Construction Technische Universität Dresden Technische Universität Dresden Dresden, Germany Dresden, Germany [email protected] [email protected] Abstract using monad-par and LVars to expose parallelism explicitly Dataflow execution models are used to build highly scalable and reach the same level of performance, showing that our parallel systems. A programming model that targets parallel programming model successfully extracts parallelism that dataflow execution must answer the following question: How is present in an algorithm. Further evaluation shows that can parallelism between two dependent nodes in a dataflow smap is expressive enough to implement parallel reductions graph be exploited? This is difficult when the dataflow lan- and our programming model resolves short-comings of the guage or programming model is implemented by a monad, stream-based programming model for current state-of-the- as is common in the functional community, since express- art big data processing systems. ing dependence between nodes by a monadic bind suggests CCS Concepts • Software and its engineering → Func- sequential execution. Even in monadic constructs that explic- tional languages. itly separate state from computation, problems arise due to the need to reason about opaquely defined state.
    [Show full text]
  • APPLYING MODEL-VIEW-CONTROLLER (MVC) in DESIGN and DEVELOPMENT of INFORMATION SYSTEMS an Example of Smart Assistive Script Breakdown in an E-Business Application
    APPLYING MODEL-VIEW-CONTROLLER (MVC) IN DESIGN AND DEVELOPMENT OF INFORMATION SYSTEMS An Example of Smart Assistive Script Breakdown in an e-Business Application Andreas Holzinger, Karl Heinz Struggl Institute of Information Systems and Computer Media (IICM), TU Graz, Graz, Austria Matjaž Debevc Faculty of Electrical Engineering and Computer Science, University of Maribor, Maribor, Slovenia Keywords: Information Systems, Software Design Patterns, Model-view-controller (MVC), Script Breakdown, Film Production. Abstract: Information systems are supporting professionals in all areas of e-Business. In this paper we concentrate on our experiences in the design and development of information systems for the use in film production processes. Professionals working in this area are neither computer experts, nor interested in spending much time for information systems. Consequently, to provide a useful, useable and enjoyable application the system must be extremely suited to the requirements and demands of those professionals. One of the most important tasks at the beginning of a film production is to break down the movie script into its elements and aspects, and create a solid estimate of production costs based on the resulting breakdown data. Several film production software applications provide interfaces to support this task. However, most attempts suffer from numerous usability deficiencies. As a result, many film producers still use script printouts and textmarkers to highlight script elements, and transfer the data manually into their film management software. This paper presents a novel approach for unobtrusive and efficient script breakdown using a new way of breaking down text into its relevant elements. We demonstrate how the implementation of this interface benefits from employing the Model-View-Controller (MVC) as underlying software design paradigm in terms of both software development confidence and user satisfaction.
    [Show full text]
  • Process Scheduling
    PROCESS SCHEDULING ANIRUDH JAYAKUMAR LAST TIME • Build a customized Linux Kernel from source • System call implementation • Interrupts and Interrupt Handlers TODAY’S SESSION • Process Management • Process Scheduling PROCESSES • “ a program in execution” • An active program with related resources (instructions and data) • Short lived ( “pwd” executed from terminal) or long-lived (SSH service running as a background process) • A.K.A tasks – the kernel’s point of view • Fundamental abstraction in Unix THREADS • Objects of activity within the process • One or more threads within a process • Asynchronous execution • Each thread includes a unique PC, process stack, and set of processor registers • Kernel schedules individual threads, not processes • tasks are Linux threads (a.k.a kernel threads) TASK REPRESENTATION • The kernel maintains info about each process in a process descriptor, of type task_struct • See include/linux/sched.h • Each task descriptor contains info such as run-state of process, address space, list of open files, process priority etc • The kernel stores the list of processes in a circular doubly linked list called the task list. TASK LIST • struct list_head tasks; • init the "mother of all processes” – statically allocated • extern struct task_struct init_task; • for_each_process() - iterates over the entire task list • next_task() - returns the next task in the list PROCESS STATE • TASK_RUNNING: running or on a run-queue waiting to run • TASK_INTERRUPTIBLE: sleeping, waiting for some event to happen; awakes prematurely if it receives a signal • TASK_UNINTERRUPTIBLE: identical to TASK_INTERRUPTIBLE except it ignores signals • TASK_ZOMBIE: The task has terminated, but its parent has not yet issued a wait4(). The task's process descriptor must remain in case the parent wants to access it.
    [Show full text]
  • CS61C : Machine Structures
    inst.eecs.berkeley.edu/~cs61c/su05 Review CS61C : Machine Structures • Benchmarks Lecture #29: Intel & Summary • Attempt to predict performance • Updated every few years • Measure everything from simulation of desktop graphics programs to battery life • Megahertz Myth • MHz ≠ performance, it’s just one factor 2005-08-10 Andy Carle CS 61C L29 Intel & Review (1) A Carle, Summer 2005 © UCB CS 61C L29 Intel & Review (2) A Carle, Summer 2005 © UCB MIPS is example of RISC MIPS vs. 80386 • RISC = Reduced Instruction Set Computer • Address: 32-bit • 32-bit • Term coined at Berkeley, ideas pioneered • Page size: 4KB • 4KB by IBM, Berkeley, Stanford • Data aligned • Data unaligned • RISC characteristics: • Destination reg: Left • Right • Load-store architecture •add $rd,$rs1,$rs2 •add %rs1,%rs2,%rd • Fixed-length instructions (typically 32 bits) • Regs: $0, $1, ..., $31 • %r0, %r1, ..., %r7 • Three-address architecture • Reg = 0: $0 • (n.a.) • RISC examples: MIPS, SPARC, • Return address: $31 • (n.a.) IBM/Motorola PowerPC, Compaq Alpha, ARM, SH4, HP-PA, ... CS 61C L29 Intel & Review (3) A Carle, Summer 2005 © UCB CS 61C L29 Intel & Review (4) A Carle, Summer 2005 © UCB MIPS vs. Intel 80x86 MIPS vs. Intel 80x86 • MIPS: “load-store architecture” • MIPS: “Three-address architecture” • Only Load/Store access memory; rest • Arithmetic-logic specify all 3 operands operations register-register; e.g., add $s0,$s1,$s2 # s0=s1+s2 lw $t0, 12($gp) add $s0,$s0,$t0 # s0=s0+Mem[12+gp] • Benefit: fewer instructions ⇒ performance • Benefit: simpler hardware ⇒ easier
    [Show full text]
  • Scheduling: Introduction
    7 Scheduling: Introduction By now low-level mechanisms of running processes (e.g., context switch- ing) should be clear; if they are not, go back a chapter or two, and read the description of how that stuff works again. However, we have yet to un- derstand the high-level policies that an OS scheduler employs. We will now do just that, presenting a series of scheduling policies (sometimes called disciplines) that various smart and hard-working people have de- veloped over the years. The origins of scheduling, in fact, predate computer systems; early approaches were taken from the field of operations management and ap- plied to computers. This reality should be no surprise: assembly lines and many other human endeavors also require scheduling, and many of the same concerns exist therein, including a laser-like desire for efficiency. And thus, our problem: THE CRUX: HOW TO DEVELOP SCHEDULING POLICY How should we develop a basic framework for thinking about scheduling policies? What are the key assumptions? What metrics are important? What basic approaches have been used in the earliest of com- puter systems? 7.1 Workload Assumptions Before getting into the range of possible policies, let us first make a number of simplifying assumptions about the processes running in the system, sometimes collectively called the workload. Determining the workload is a critical part of building policies, and the more you know about workload, the more fine-tuned your policy can be. The workload assumptions we make here are mostly unrealistic, but that is alright (for now), because we will relax them as we go, and even- tually develop what we will refer to as ..
    [Show full text]
  • Scheduling Light-Weight Parallelism in Artcop
    Scheduling Light-Weight Parallelism in ArTCoP J. Berthold1,A.AlZain2,andH.-W.Loidl3 1 Fachbereich Mathematik und Informatik Philipps-Universit¨at Marburg, D-35032 Marburg, Germany [email protected] 2 School of Mathematical and Computer Sciences Heriot-Watt University, Edinburgh EH14 4AS, Scotland [email protected] 3 Institut f¨ur Informatik, Ludwig-Maximilians-Universit¨at M¨unchen, Germany [email protected] Abstract. We present the design and prototype implementation of the scheduling component in ArTCoP (architecture transparent control of parallelism), a novel run-time environment (RTE) for parallel execution of high-level languages. A key feature of ArTCoP is its support for deep process and memory hierarchies, shown in the scheduler by supporting light-weight threads. To realise a system with easily exchangeable components, the system defines a micro-kernel, providing basic infrastructure, such as garbage collection. All complex RTE operations, including the handling of parallelism, are implemented at a separate system level. By choosing Concurrent Haskell as high-level system language, we obtain a prototype in the form of an executable specification that is easier to maintain and more flexible than con- ventional RTEs. We demonstrate the flexibility of this approach by presenting implementations of a scheduler for light-weight threads in ArTCoP, based on GHC Version 6.6. Keywords: Parallel computation, functional programming, scheduling. 1 Introduction In trying to exploit the computational power of parallel architectures ranging from multi-core machines to large-scale computational Grids, we are currently developing a new parallel runtime environment, ArTCoP, for executing parallel Haskell code on such complex, hierarchical architectures.
    [Show full text]
  • Scheduling Weakly Consistent C Concurrency for Reconfigurable
    SUBMISSION TO IEEE TRANS. ON COMPUTERS 1 Scheduling Weakly Consistent C Concurrency for Reconfigurable Hardware Nadesh Ramanathan, John Wickerson, Member, IEEE, and George A. Constantinides Senior Member, IEEE Abstract—Lock-free algorithms, in which threads synchronise These reorderings are invisible in a single-threaded context, not via coarse-grained mutual exclusion but via fine-grained but in a multi-threaded context, they can introduce unexpected atomic operations (‘atomics’), have been shown empirically to behaviours. For instance, if another thread is simultaneously be the fastest class of multi-threaded algorithms in the realm of conventional processors. This article explores how these writing to z, then reordering two instructions above may algorithms can be compiled from C to reconfigurable hardware introduce the behaviour where x is assigned the latest value via high-level synthesis (HLS). but y gets an old one.1 We focus on the scheduling problem, in which software The implication of this is not that existing HLS tools are instructions are assigned to hardware clock cycles. We first wrong; these optimisations can only introduce new behaviours show that typical HLS scheduling constraints are insufficient to implement atomics, because they permit some instruction when the code already exhibits a race condition, and races reorderings that, though sound in a single-threaded context, are deemed a programming error in C [2, §5.1.2.4]. Rather, demonstrably cause erroneous results when synthesising multi- the implication is that if these memory accesses are upgraded threaded programs. We then show that correct behaviour can be to become atomic (and hence allowed to race), then existing restored by imposing additional intra-thread constraints among scheduling constraints are insufficient.
    [Show full text]
  • Fork-Join Pattern
    Fork-Join Pattern Parallel Computing CIS 410/510 Department of Computer and Information Science Lecture 9 – Fork-Join Pattern Outline q What is the fork-join concept? q What is the fork-join pattern? q Programming Model Support for Fork-Join q Recursive Implementation of Map q Choosing Base Cases q Load Balancing q Cache Locality and Cache-Oblivious Algorithms q Implementing Scan with Fork-Join q Applying Fork-Join to Recurrences Introduction to Parallel Computing, University of Oregon, IPCC Lecture 9 – Fork-Join Pattern 2 Fork-Join Philosophy When you come to a fork in the road, take it. (Yogi Bera, 1925 –) Introduction to Parallel Computing, University of Oregon, IPCC Lecture 9 – Fork-Join Pattern 3 Fork-Join Concept q Fork-Join is a fundamental way (primitive) of expressing concurrency within a computation q Fork is called by a (logical) thread (parent) to create a new (logical) thread (child) of concurrency ❍ Parent continues after the Fork operation ❍ Child begins operation separate from the parent ❍ Fork creates concurrency q Join is called by both the parent and child ❍ Child calls Join after it finishes (implicitly on exit) ❍ Parent waits until child joins (continues afterwards) ❍ Join removes concurrency because child exits Introduction to Parallel Computing, University of Oregon, IPCC Lecture 9 – Fork-Join Pattern 4 Fork-Join Concurrency Semantics q Fork-Join is a concurrency control mechanism ❍ Fork increases concurrency ❍ Join decreases concurrency q Fork-Join dependency rules ❍ A parent must join with its forked children
    [Show full text]
  • An Analytical Model for Multi-Tier Internet Services and Its Applications
    An Analytical Model for Multi-tier Internet Services and Its Applications Bhuvan Urgaonkar, Giovanni Pacificiy, Prashant Shenoy, Mike Spreitzery, and Asser Tantawiy Dept. of Computer Science, y Service Management Middleware Dept., University of Massachusetts, IBM T. J. Watson Research Center, Amherst, MA 01003 Hawthorne, NY 10532 fbhuvan,[email protected] fgiovanni,mspreitz,[email protected] ABSTRACT that is responsible for HTTP processing, a middle tier Java enterprise Since many Internet applications employ a multi-tier architecture, server that implements core application functionality, and a back- in this paper, we focus on the problem of analytically modeling the end database that stores product catalogs and user orders. In this behavior of such applications. We present a model based on a net- example, incoming requests undergo HTTP processing, processing work of queues, where the queues represent different tiers of the by Java application server, and trigger queries or transactions at the application. Our model is sufficiently general to capture (i) the database. behavior of tiers with significantly different performance charac- This paper focuses on analytically modeling the behavior of multi- teristics and (ii) application idiosyncrasies such as session-based tier Internet applications. Such a model is important for the follow- workloads, tier replication, load imbalances across replicas, and ing reasons: (i) capacity provisioning, which enables a server farm caching at intermediate tiers. We validate our model using real to determine how much capacity to allocate to an application in or- multi-tier applications running on a Linux server cluster. Our exper- der for it to service its peak workload; (ii) performance prediction, iments indicate that our model faithfully captures the performance which enables the response time of the application to be determined of these applications for a number of workloads and configurations.
    [Show full text]
  • Communication and Scheduling Model Design of Distributed Workflow Management System
    Advances in Engineering Research, volume 119 2nd International Conference on Automatic Control and Information Engineering (ICACIE 2017) Communication and Scheduling Model Design of Distributed Workflow Management System Jing Zhao Bin Wei School of Science Key Laboratory of Network & Information Security of APF Xijing University Engineering College of APF Xi’an, China, 710123 Xi’an, China, 710123 [email protected] Abstract—The traditional OA system cannot meet the requirements of modern enterprise. Currently the widely used II. DISTRIBUTED WORKFLOW MANAGEMENT SYSTEM distributed workflow management system (DWFMS) has the The WFMC propose a classic definition of workflow: all problem of load balancing, which makes the enterprise efficiency or part of it supported or automated by a computer. The unimproved. In this paper, on the study of the DWFMS, the workflow management system is to support a batch of model design of common communication transmission and professional set workflow, which is defined by the calculated scheduling are analyzed. Then the real-time scheduling strategy process. It is used to support the definition, management and model is put forward and discussed, which based on feedback control. By using this model in DWFMS, the task can be executive the workflow. It enable the business execution controlled effectively, the load balancing performance of the processes to achieve maximum efficiency. system is improved effectively, and the stability of the system is The distributed workflow system is generally divided into ensured. 3 parts: the distribution of workflow architecture, the distribution of workflow engine and the distribution of Keywords—workflow; communication; scheduling model; workflow model. The workflow engine distribution is the core controller of distributed system.
    [Show full text]
  • Computer Scheduling Methods and Their Countermeasures
    Computer scheduling methods and their countermeasures by EDWARD G. COFFMAN, JR* Princeton University Princeton, New Jersey A anu LEONARD KLEINROCK** University of California Los Angeles, California INTRODUCTION sources. In multi-access, multiprogramming systems The simultaneous demand for computer service by throughput may conveniently be measured in terms of members from a population of users generally results computer operating efficiency defined roughly as the in the formation of queues. These queues are con­ percentage of time the computer spends in perform­ trolled by some computer scheduling method which ing user or customer-directed tasks as compared with chooses the order in which various users receive at­ the time spent in performing executive (overhead type) tention. The goal of this priority scheduling algorithm tasks. We shall avoid trying to measure the program­ is to provide the population of users with a high mers' or users' productivity in a multi-access environ­ grade of service (rapid response, resource availabil­ ment as compared with productivity in the usually less ity, etc.(, at the same time maintaining an acceptable flexible but more efficient batch-processing environ­ throughput rate. The object of the present paper is to ment. For discussions on this subject see References discuss most of the priority scheduling procedures 1 and 2 and the bibliography of Reference 3. that have been considered in the past few years, to dis­ With a somewhat different orientation some of these cuss in a coherent way their effectiveness and weak­ topics have been covered elsewhere. In particular, Coffman,4 Greenberger5 and in more detail Estrin nesses in terms of the performance measures men­ 6 tioned above, to describe what the analysis of related and Kleinrock have reviewed the many applications queueing models has been able to provide in the way of queueing theory to the analysis of multiprogram­ of design aids, and in this last respect, to point out ming systems.
    [Show full text]