11 Identifying the Root Causes of Wait States in Large-Scale Parallel Applications
Total Page:16
File Type:pdf, Size:1020Kb
Load more
										Recommended publications
									
								- 
												  Parallel Programming in Unix, Many Programs Run in at the Same Timeparallelism parallel programming! to perform a number of tasks simultaneously! these can be calculations, or data transfer or…! In Unix, many programs run in at the same time handling various tasks! we like multitasking…! we don’t like waiting… "1 simplest way: FORK() Each process has a process ID called PID.! After a fork() call, ! there will be a second process with another PID which we call the child. ! The first process, will also continue to exist and to execute commands, we call it the mother. "2 fork() example #include <stdio.h> // printf, stderr, fprintf! ! #include <sys/types.h> // pid_t ! } // end of child! #include <unistd.h> // _exit, fork! else ! #include <stdlib.h> // exit ! { ! #include <errno.h> // errno! ! ! /* If fork() returns a positive number, we int main(void)! are in the parent process! {! * (the fork return value is the PID of the pid_t pid;! newly created child process)! pid = fork();! */! ! int i;! if (pid == -1) {! for (i = 0; i < 10; i++)! fprintf(stderr, "can't fork, error %d\n", {! errno);! printf("parent: %d\n", i);! exit(EXIT_FAILURE);! sleep(1);! }! }! ! exit(0);! if (pid == 0) {! }// end of parent! /* Child process:! ! * If fork() returns 0, it is the child return 0;! process.! } */! int j;! for (j = 0; j < 15; j++) {! printf("child: %d\n", j);! sleep(1);! }! _exit(0); /* Note that we do not use exit() */! !3 why not to fork fork() system call is the easiest way of branching out to do 2 tasks concurrently. BUT" it creates a separate address space such that the child process has an exact copy of all the memory segments of the parent process.
- 
												  Making ProcessesMaking Processes • Topics • Process creation from the user level: • fork, execvp, wait, waitpid • Learning Objectives: • Explain how processes are created • Create new processes, synchronize with them, and communicate exit codes back to the forking process. • Start building a simple shell. 11/8/16 CS61 Fall 2016 1 Recall how we create processes: fork #include <unistd.h> pid_t ret_pid; ret_pid = fork(); switch (ret_pid){ case 0: /* I am the child. */ break; case -1: /* Something bad happened. */ break; default: /* * I am the parent and my child’s * pid is ret_pid. */ break; } 11/8/16 CS61 Fall 2016 2 But what good are two identical processes? • So fork let us create a new process, but it’s identical to the original. That might not be very handy. • Enter exec (and friends): The exec family of functions replaces the current process image with a new process image. • exec is the original/traditional API • execve is the modern day (more efficient) implementation. • There are a pile of functions, all of which ultimately invoke execve; we will focus on execvp (which is recommended for Assignment 5). • If execvp returns, then it was unsuccessful and will return -1 and set errno. • If successful, execvp does not return, because it is off and running as a new process. • Arguments: • file: Name of a file to be executed • args: Null-terminated argument vector; the first entry of which is (by convention) the file name associated with the file being executed. 11/8/16 CS61 Fall 2016 3 Programming with Exec #include <unistd.h> #include <errno.h> #include <stdio.h> pid_t ret_pid; ret_pid = fork(); switch (ret_pid){ case 0: /* I am the child.
- 
												  Lecture 4: September 13 4.1 Process StateCMPSCI 377 Operating Systems Fall 2012 Lecture 4: September 13 Lecturer: Prashant Shenoy TA: Sean Barker & Demetre Lavigne 4.1 Process State 4.1.1 Process A process is a dynamic instance of a computer program that is being sequentially executed by a computer system that has the ability to run several computer programs concurrently. A computer program itself is just a passive collection of instructions, while a process is the actual execution of those instructions. Several processes may be associated with the same program; for example, opening up several windows of the same program typically means more than one process is being executed. The state of a process consists of - code for the running program (text segment), its static data, its heap and the heap pointer (HP) where dynamic data is kept, program counter (PC), stack and the stack pointer (SP), value of CPU registers, set of OS resources in use (list of open files etc.), and the current process execution state (new, ready, running etc.). Some state may be stored in registers, such as the program counter. 4.1.2 Process Execution States Processes go through various process states which determine how the process is handled by the operating system kernel. The specific implementations of these states vary in different operating systems, and the names of these states are not standardised, but the general high-level functionality is the same. When a process is first started/created, it is in new state. It needs to wait for the process scheduler (of the operating system) to set its status to "new" and load it into main memory from secondary storage device (such as a hard disk or a CD-ROM).
- 
												  Pid T Fork(Void); Description: Fork Creates a Child Process That Differspid_t fork(void); Description: fork creates a child process that differs from the parent process only in its PID and PPID, and in the fact that resource utilizations are set to 0. File locks and pending signals are not inherited. Returns: On success, the PID of the child process is returned in the parent's thread of execution, and a 0 is returned in the child's thread of execution. int execve(const char *filename, char *const argv [], char *const envp[]); Description: execve() executes the program pointed to by filename. filename must be either a binary executable, or a script. argv is an array of argument strings passed to the new program. envp is an array of strings, conventionally of the form key=value, which are passed as environment to the new program. Returns: execve() does not return on success, and the text, data, bss, and stack of the calling process are overwritten by that of the program loaded. pid_t wait(int *status); pid_t waitpid(pid_t pid, int *status, int options); Description: The wait function suspends execution of the current process until a child has exited, or until a signal is delivered whose action is to terminate the current process or to call a signal handling function. The waitpid function suspends execution of the current process until a child as specified by the pid argument has exited, or until a signal is delivered whose action is to terminate the current process or to call a signal handling function. If a child has already exited by the time of the call (a so-called \zombie" process), the function returns immediately.
- 
												  System Calls & SignalsCS345 OPERATING SYSTEMS System calls & Signals Panagiotis Papadopoulos [email protected] 1 SYSTEM CALL When a program invokes a system call, it is interrupted and the system switches to Kernel space. The Kernel then saves the process execution context (so that it can resume the program later) and determines what is being requested. The Kernel carefully checks that the request is valid and that the process invoking the system call has enough privilege. For instance some system calls can only be called by a user with superuser privilege (often referred to as root). If everything is good, the Kernel processes the request in Kernel Mode and can access the device drivers in charge of controlling the hardware (e.g. reading a character inputted from the keyboard). The Kernel can read and modify the data of the calling process as it has access to memory in User Space (e.g. it can copy the keyboard character into a buffer that the calling process has access to) When the Kernel is done processing the request, it restores the process execution context that was saved when the system call was invoked, and control returns to the calling program which continues executing. 2 SYSTEM CALLS FORK() 3 THE FORK() SYSTEM CALL (1/2) • A process calling fork()spawns a child process. • The child is almost an identical clone of the parent: • Program Text (segment .text) • Stack (ss) • PCB (eg. registers) • Data (segment .data) #include <sys/types.h> #include <unistd.h> pid_t fork(void); 4 THE FORK() SYSTEM CALL (2/2) • The fork()is one of the those system calls, which is called once, but returns twice! Consider a piece of program • After fork()both the parent and the child are ..
- 
												  Troubleshoot Zombie/Defunct Processes in UC ServersTroubleshoot Zombie/Defunct Processes in UC Servers Contents Introduction Prerequisites Requirements Components Used Background Information Check Zombies using UCOS Admin CLI Troubleshoot/Clear the Zombies Manually Restart the Appropriate Service Reboot the Server Kill the Parent Process Verify Introduction This document describes how to work with zombie processes seen on CUCM, IMnP, and other Cisco UC products when logged in using Admin CLI. Prerequisites Requirements Cisco recommends that you have knowledge of using Admin CLI of the UC servers: ● Cisco Unified Communications Manager (CUCM) ● Cisco Unified Instant Messaging and Presence Server (IMnP) ● Cisco Unity Connection Server (CUC) Components Used This document is not restricted to specific software and hardware versions. The information in this document was created from the devices in a specific lab environment. All of the devices used in this document started with a cleared (default) configuration. If your network is live, ensure that you understand the potential impact of any command. Background Information The Unified Communications servers are essentially Linux OS based applications. When a process dies on Linux, it is not all removed from memory immediately, its process descriptor (PID) stays in memory which only takes a tiny amount of memory. This process becomes a defunct process and the process's parent is notified that its child process has died. The parent process is then supposed to read the dead process's exit status and completely remove it from the memory. Once this is done using the wait() system call, the zombie process is eliminated from the process table. This is known as reaping the zombie process.
- 
												  Lecture 22 Systems Programming Process ControlLecture 22 Systems Programming Process Control A process is defined as an instance of a program that is currently running. A uni processor system can still execute multiple processes giving the appearance of a multi- processor machine. A call to a program spawns a process. If a mail program is called by n users then n processes or instances are created and executed by the unix system. Many operating systems including windows and unix executes many processes at the same time. When a program is called, a process is created and a process ID is issued. The process ID is given by the function getpid() defined in <unistd.h>. The prototype for pid( ) is given by #include < unistd.h > pid_t getpid(void); In a uni-processor machine, each process takes turns running and for a short duration, a process takes time interval called a timeslice. The unix command ps can be used to list all current process status. ps PID TTY TIME CMD 10150 pts/16 00:00:00 csh 31462 pts/16 00:00:00 ps The command ps lists the process ID (PID), the terminal name(TTY), the amount of time the process has used so far(TIME) and the command it is executing(CMD). Ps command only displays the current user processes. But we can get all the processes with the flag (- a) and in long format with flag (-l) ps –a ps -l ps -al Information provided by each process may include the following. PID The process ID in integer form PPID The parent process ID in integer form STAT The state of the process TIME CPU time used by the process (in seconds) TT Control terminal of the process COMMAND The user command that started the process Copyright @ 2008 Ananda Gunawardena Each process has a process ID that can be obtained using the getpid() a system call declared in unistd.h.
- 
												  Kill a Zombie! a Process Is Called a Zombie Process If the Process HasKill a Zombie! A process is called a zombie process if the process has been completed, but its PID and process entry remains in the Linux process table. A process is removed from the process table when the process is completed, and its parent process reads the completed process' exit status by using the wait() system call. If a parent process fails to call wait() for whatever reason, its child process will be left in the process table, becoming a zombie. To find zombie processes on Linux: $ ps axo stat,ppid,pid,comm | grep -w defunct Z 250 10242 fake-prog <defunct> The above command searches for processes with zombie (defunct) state, and displays them in (state, PPID, PID, command-name) format. The sample output shows that there is one zombie process associated with "fake-prog", and it was spawned by a parent process with PID 250. Killing zombie processes is not obvious since zombie processes are already dead. You can try two options to kill a zombie process on Linux as follows. First, you can try sending SIGCHLD signal to the zombie's parent process using the kill command. Note that the above command gives you PPID (PID of parent process) of each zombie. In our example, PPID of the zombie is 250. $ sudo kill -s SIGCHLD 250 If a zombie process still does not go away, you can kill the parent process (e.g., 250) of the zombie. $ sudo kill -9 250 Once its parent process gets killed, the zombie will be adopted by the init process, which is a parent of all processes in Linux.
- 
												  Operating Systems Processes and ThreadsCOS 318: Operating Systems Processes and Threads Prof. Margaret Martonosi Computer Science Department Princeton University http://www.cs.princeton.edu/courses/archive/fall11/cos318 Today’s Topics Processes Concurrency Threads Reminder: Hope you’re all busy implementing your assignment 2 (Traditional) OS Abstractions Processes - thread of control with context Files- In Unix, this is “everything else” Regular file – named, linear stream of data bytes Sockets - endpoints of communication, possible between unrelated processes Pipes - unidirectional I/O stream, can be unnamed Devices Process Most fundamental concept in OS Process: a program in execution one or more threads (units of work) associated system resources Program vs. process program: a passive entity process: an active entity For a program to execute, a process is created for that program Program and Process main() main() { { heap ... ... foo() foo() ... ... } } stack bar() bar() { { registers ... ... PC } } Program Process 5 Process vs. Program Process > program Program is just part of process state Example: many users can run the same program • Each process has its own address space, i.e., even though program has single set of variable names, each process will have different values Process < program A program can invoke more than one process Example: Fork off processes 6 Simplest Process Sequential execution No concurrency inside a process Everything happens sequentially Some coordination may be required Process state Registers Main memory I/O devices • File system • Communication ports … 7 Process Abstraction Unit of scheduling One (or more*) sequential threads of control program counter, register values, call stack Unit of resource allocation address space (code and data), open files sometimes called tasks or jobs Operations on processes: fork (clone-style creation), wait (parent on child), exit (self-termination), signal, kill.
- 
												  Processes Goals of Today's LectureProcesses Professor Jennifer Rexford http://www.cs.princeton.edu/~jrex 1 Goals of Today’s Lecture • Processes o Process vs. program o Context switching • Creating a new process o Fork: process creates a new child process o Wait: parent waits for child process to complete o Exec: child starts running a new program o System: combines fork, wait, and exec all in one • Communication between processes o Pipe between two processes Redirecting stdin and stdout o 2 1 Processes 3 Program vs. Process • Program o Executable code, no dynamic state • Process o An instance of a program in execution, with its own – Address space (illusion of a memory) • Text, RoData, BSS, heap, stack – Processor state (illusion of a processor) • EIP, EFLAGS, registers – Open file descriptors (illusion of a disk) o Either running, waiting, or ready… • Can run multiple instances of the same program o Each as its own process, with its own process ID 4 2 Life Cycle of a Process • Running: instructions are being executed •Waiting:waiting for some event (e.g., I/O finish) • Ready: ready to be assigned to a processor Create Ready Running Termination Waiting 5 OS Supports Process Abstraction • Supporting the abstraction o Multiple processes share resources o Protected from each other • Main memory User User o Swapping pages to/from disk o Virtual memory Process Process • Processor Operating System o Switching which process gets to run on the CPU o Saving per-process state on a “context switch” Hardware 6 3 When to Change Which Process is Running? • When a process is stalled waiting for I/O o Better utilize the CPU, e.g., while waiting for disk access 1: CPU I/O CPU I/OCPU I/O 2: CPU I/O CPU I/OCPU I/O • When a process has been running for a while o Sharing on a fine time scale to give each process the illusion of running on its own machine o Trade-off efficiency for a finer granularity of fairness 7 Switching Between Processes •Context Process 1 Process 2 o State the OS needs to restart a preempted Waiting process Running Save context .
- 
												  Signals  Kernel Software This Lecture  Non‐Local Jumps  Application Code Carnegie Mellon TodayCarnegie Mellon Introduction to Computer Systems 15‐213/18‐243, Fall 2009 12th Lecture Instructors: Greg Ganger and Roger Dannenberg Carnegie Mellon Announcements Final exam day/time announced (by CMU) 5:30‐8:30pm on Monday, December 14 Cheating… please, please don’t Writing code together counts as “sharing code” – forbidden “Pair programming”, even w/o looking at other’s code – forbidden describing code line by line counts the same as sharing code Opening up code and then leaving it for someone to enjoy – forbidden in fact, please remember to use protected directories and screen locking Talking through a problem can include pictures (not code) –ok The automated tools for discovering cheating are incredibly good …please don’t test them Everyone has been warned multiple times cheating on the remaining labs will receive no mercy Carnegie Mellon ECF Exists at All Levels of a System Exceptions Hardware and operating system kernel Previous Lecture software Signals Kernel software This Lecture Non‐local jumps Application code Carnegie Mellon Today Multitasking, shells Signals Long jumps Carnegie Mellon The World of Multitasking System runs many processes concurrently Process: executing program State includes memory image + register values + program counter Regularly switches from one process to another Suspend process when it needs I/O resource or timer event occurs Resume process when I/O available or given scheduling priority Appears to user(s) as if all processes executing simultaneously Even though most systems
- 
												  Linux/UNIX System Programming FundamentalsLinux/UNIX System Programming Fundamentals Michael Kerrisk man7.org NDC TechTown; August 2020 ©2020, man7.org Training and Consulting / Michael Kerrisk. All rights reserved. These training materials have been made available for personal, noncommercial use. Except for personal use, no part of these training materials may be printed, reproduced, or stored in a retrieval system. These training materials may not be redistributed by any means, electronic, mechanical, or otherwise, without prior written permission of the author. These training materials may not be used to provide training to others without prior written permission of the author. Every effort has been made to ensure that the material contained herein is correct, including the development and testing of the example programs. However, no warranty is expressed or implied, and the author shall not be liable for loss or damage arising from the use of these programs. The programs are made available under Free Software licenses; see the header comments of individual source files for details. For information about this course, visit http://man7.org/training/. For inquiries regarding training courses, please contact us at [email protected]. Please send corrections and suggestions for improvements to this course material to [email protected]. For information about The Linux Programming Interface, please visit http://man7.org/tlpi/. Short table of contents 1 Course Introduction 1-1 2 Fundamental Concepts 2-1 3 File I/O and Files 3-1 4 Directories and Links 4-1 5 Processes 5-1 6 Signals: