Project 2: User Level Thread Library
15-410 Operating Systems
September 19, 2007

1 Overview

An important aspect of design is organizing computations that run concurrently and share memory. Concurrency concerns are paramount when designing multi-threaded programs that share some critical resource, be it some device or piece of memory. In this project you will write a thread library and concurrency primitives. This document provides the background information and specification for writing the thread library and concurrency primitives.

We will provide you with a miniature operating system kernel (called “Pebbles”) which implements a minimal set of system calls, and some multi-threaded programs. These programs will be linked against your thread library, stored on a “RAM disk,” and then run under the supervision of the Pebbles kernel. Pebbles is documented by the companion document, “Pebbles Kernel Specification,” which should probably be read before this one.

The thread library will be based on the thread fork system call provided by Pebbles, which provides a “raw” (unprocessed) interface to kernel-scheduled threads. Your library will provide a basic but usable interface on top of this elemental thread building block, including the ability to join threads. You will implement mutexes and condition variables based on your consideration of the options provided by the x86 instruction set—including, but not limited to, the XCHG instruction for atomically exchanging registers and memory or registers and registers.

2 Goals

• Becoming familiar with the ways in which operating systems support user libraries by providing system calls to create processes, affect scheduling, etc.

• Becoming familiar with programs that involve a high level of concurrency and the sharing of critical resources, including the tools that are used to deal with these issues.

• Developing skills necessary to produce a substantial amount of code, such as organization and project planning.

• Working with a partner is also an important aspect of this project. You will be working with a partner on subsequent projects, so it is important to be familiar with scheduling time to work, a preferred working environment, and developing a good group dynamic before beginning larger projects.

• Coming to understand the dynamics of source control in a group context, e.g., when to branch and merge.

The partner goal is an important one–this project gives you an opportunity to debug not only your code deliverables but also your relationship with your partner. You may find that some of the same techniques apply.

3 Important Dates

Wednesday, September 19th Project 2 begins

Monday, September 24th You should be able to draw a very detailed picture of the parent and child stacks during thr create() at the point when the child does its first PUSHL instruction. In your design multiple pictures may be equally plausible, but it is important that you be able to draw at least one case in detail.

Wednesday, September 26th You should have thread creation working well enough to pass the STARTLE test we provide.

Friday, September 28th You should have thread creation, mutexes, and condition variables working well.

Tuesday, October 2nd If you haven’t at least begun debugging CYCLONE and AGILITY DRILL by this point, you run the risk of turning in a thread library with serious structural flaws and lacking concurrency debugging experience useful for the kernel project.

Friday, October 5th Project 2 due at 23:59:59

4 Thread Library API

The library you will write will contain:

• Thread management calls

• Mutexes and condition variables

• Semaphores

• Readers/writers locks

Please note that all lock-like objects are defined to be “unlocked” when created. Unlike system call stubs (see “Pebbles Kernel Specification”), thread library routines need not be one-per-source-file, but we expect you to use good judgement when partitioning them (and this may influence your grade to some extent). You should arrange that the Makefile infrastructure you are given will build your library into libthread.a (see the README file in the tarball). You may assume that programs which use condition variables will include cond.h, programs which use semaphores will include sem.h, etc.

4.1 Thread Management API

• int thr init( unsigned int size ) - This function is responsible for initializing the thread library. The argument size specifies the amount of stack space which will be available for use by threads created with thr create(). This function returns zero on success, and a negative number on error. The thread library can assume that programs using it are well-behaved in the sense that they will call thr init(), exactly once, before calling any other thread library function (including memory allocation functions in the malloc() family, described below) or invoking the thread fork system call. Also, you may assume that all threads of a task using your thread library will call thr exit() instead of directly invoking the vanish() system call (and that the root thread will call thr exit() instead of returning from main()).

• int thr create( void *(*func)(void *), void *arg ) - This function creates a new thread to run func(arg). This function should allocate a stack for the new thread and then invoke the thread fork system call in an appropriate way. A stack frame should be created for the child, and the child should be provided with some way of accessing its thread identifier (tid). On success the thread ID of the new thread is returned, on error a negative number is returned. You should pay attention to (at least) two stack-related issues. First, the stack pointer should essentially always be aligned on a 32-bit boundary (i.e., %esp mod 4 == 0). Second, you need to think very carefully about the relationship of a new thread to the stack of the parent thread, especially right after the thread fork system call has completed. (A sketch of the stack-allocation step appears at the end of this section.)

• int thr join( int tid, void **statusp ) - This function “cleans up” after a thread, optionally returning the status information provided by the thread at the time of exit. The target thread tid may or may not have exited before thr join() is called; if it has not, the calling thread will be suspended until the target thread does exit. If statusp is not NULL, the value passed to thr exit() by the joined thread will be placed in the location referenced by statusp. Only one thread may join on any given target thread. Other attempts to join on the same thread should return an error promptly. If thread tid was not created before thr join(tid) was called, an error will be returned. This function returns zero on success, and a negative number on error.

• void thr exit( void *status ) - This function exits the thread with exit status status. If a thread other than the root thread returns from its body function instead of calling thr exit(), the behavior should be the same as if the function had called thr exit() specifying the return value from the thread’s body function. Note that status is not a “pointer to a void.” It is frequently not a pointer to anything of any type. Instead, status is a pointer-sized opaque value which the thread library transports uninterpreted from the caller of thr exit() to the caller of thr join().

• int thr getid( void ) - Returns the thread ID of the currently running thread.

• int thr yield( int tid ) - Defers execution of the invoking thread to a later time in favor of the thread with ID tid. If tid is -1, yield to some unspecified thread. If the thread with ID tid is not runnable, or doesn’t exist, then an integer error code less than zero is returned. Zero is returned on success.

Note that the “thread IDs” generated and accepted by your thread library routines (e.g., thr getid(), thr join()) are not required to be the same “thread IDs” which are generated and accepted by the thread-related system calls (e.g., thread fork, gettid(), cas2i runflag()). If you think about how you would implement an “M:N” thread library, or a user-space thread library, you will see why these two name spaces cannot always be the same. Whether or not you use kernel-issued thread ID’s as your thread library’s thread ID’s is a design decision you will need to consider. However, you must not aggressively recycle thread ID’s, as this significantly reduces the utility of, e.g., thr yield().
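To make the stack issues concrete, here is a deliberately incomplete sketch of the stack-allocation step of thr create(). The variable names are assumptions, and everything described in the final comment (building the child’s initial stack contents and invoking thread fork from assembly) is intentionally left out, since that is the part you need to design yourselves.

    #include <stdlib.h>   /* malloc() */

    /* Hypothetical sketch only; names and bookkeeping are made up. */
    static unsigned int thr_stack_size;   /* recorded by thr_init() */

    int thr_create(void *(*func)(void *), void *arg) {
        char *base = malloc(thr_stack_size);   /* one stack region per thread */
        if (base == NULL)
            return -1;

        /* The child's initial %esp: start at the top of the region and
         * round down so that %esp mod 4 == 0. */
        unsigned long esp = (unsigned long)(base + thr_stack_size);
        esp &= ~0x3UL;

        /* What remains: place func, arg, and a "landing" routine (which
         * calls thr_exit() on func's return value) on the child's stack,
         * then invoke the thread_fork system call from assembly so that
         * the child begins running on this new stack.  Not shown here. */
        (void)esp;
        return -1;   /* placeholder until the rest is written */
    }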

4.2 Mutexes

Mutual exclusion locks prevent multiple threads from simultaneously executing critical sections of code. To implement mutexes you may use the XCHG instruction documented on page 3-714 of the Intel Instruction Set Reference. For more information on the behavior of mutexes, feel free to refer to the text, or to the Solaris or Linux pthread mutex init() manual page.

• int mutex init( mutex t *mp ) - This function should initialize the mutex pointed to by mp. The effects of using a mutex before it has been initialized, or of initializing it when it is already initialized and in use, are undefined (and may be startling). This function returns zero on success, and a negative number on error.

• int mutex destroy( mutex t *mp ) - This function should “deactivate” the mutex pointed to by mp. The effects of using a mutex from the time of its destruction until the time of a possible later re-initialization are undefined. If this function is called while the mutex is locked, it should immediately return an error. This function returns zero on success, and a negative number on error.

• int mutex lock( mutex t *mp ) - A call to this function ensures mutual exclusion in the region between itself and a call to mutex unlock(). A thread calling this function while another thread is in an interfering critical section should block until it is able to claim the lock. This function returns zero on success, and a negative number on error.

• int mutex unlock( mutex t *mp ) - Signals the end of a region of mutual exclusion. The calling thread gives up its claim to the lock. This function returns zero on success, and a negative number on error.

For the purposes of this assignment, you may assume that a mutex should be unlocked only by the thread that most recently locked it.1

1 Opinions differ, but you might want to wait until after the “Mars” lecture before solidifying yours.
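As one illustration of the XCHG-based approach, here is a minimal sketch. The mutex layout and the atomic xchg() helper are assumptions, not the required design; a real implementation must also handle initialization, destruction, and error returns, and may well choose a different blocking strategy.

    /* Hypothetical helper: atomically swap val into *addr, returning the
     * old value.  You might instead write this as a separate .S routine. */
    static inline int atomic_xchg(volatile int *addr, int val) {
        __asm__ __volatile__("xchg %0, %1"
                             : "+r" (val), "+m" (*addr)
                             :
                             : "memory");
        return val;
    }

    /* One possible (assumed) mutex layout: 0 = unlocked, 1 = locked. */
    typedef struct mutex {
        volatile int locked;
    } mutex_t;

    int mutex_lock(mutex_t *mp) {
        /* If the old value was nonzero, somebody else holds the lock. */
        while (atomic_xchg(&mp->locked, 1) != 0) {
            /* Optionally defer the CPU here (e.g., via the kernel's
             * yield facility) instead of spinning hot. */
        }
        return 0;
    }

    int mutex_unlock(mutex_t *mp) {
        mp->locked = 0;   /* in this simple sketch, a plain store releases */
        return 0;
    }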

4.3 Condition Variables

Condition variables are used for waiting, for a while, for mutex-protected state to be modified by some other thread(s). A condition variable allows a thread to voluntarily relinquish the CPU so that other threads may make changes to the shared state, and then tell the waiting thread that they have done so. If there is some shared resource, threads may de-schedule themselves and be awakened by whichever thread was using that resource when that thread is finished with it. In implementing condition variables, you may use your mutexes, and the cas2i runflag() system call. For more information on the behaviour of condition variables, you may refer to the Solaris or Linux documentation on pthread cond wait().

• int cond init( cond t *cv ) - This function should initialize the condition variable pointed to by cv. The effects of using a condition variable before it has been initialized, or of initializing it when it is already initialized and in use, are undefined. This function returns zero on success and a number less than zero on error.

• int cond destroy( cond t *cv ) - This function should “deactivate” the condition variable pointed to by cv. The effects of using a condition variable from the time of its destruction until the time of a possible later re-initialization are undefined. If cond destroy() is called while threads are still blocked waiting on the condition variable, then the function should return an error immediately. This function returns zero on success and a number less than zero on error.

• int cond wait( cond t *cv, mutex t *mp ) - The condition-wait function allows a thread to wait for a condition and release the associated mutex that it needs to hold to check that condition. The calling thread blocks, waiting to be signaled. The blocked thread may be awakened by a cond signal() or a cond broadcast(). This function returns zero on success, and a negative number on error. In the case of a successful return from cond wait(), *mp has been re-acquired on behalf of the calling thread.

• int cond signal( cond t *cv ) - This function should wake up a thread waiting on the condition variable pointed to by cv, if one exists. This function returns zero on success, and a negative number on error. Note that “no threads waiting” is not an error condition.

• int cond broadcast( cond t *cv ) - This function should wake up all threads waiting on the condition variable pointed to by cv. This function returns zero on success, and a negative number on error. Note that cond broadcast() should not awaken threads which may invoke cond wait(cv) “after” this call to cond broadcast() has begun execution.2

2 If that sounds a little fuzzy to you, you’re right–but if you think about it a bit longer it should make sense.
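To make the wait/signal protocol concrete, here is one heavily simplified sketch. The queue layout and the cond t fields are assumptions, and the two primitives deschedule unless() and wake() are hypothetical stand-ins for whatever kernel facility you choose (e.g., cas2i runflag()). Their assumed semantics are that deschedule unless(&flag) atomically blocks the caller unless flag is already nonzero, and that wake(tid) makes a blocked thread runnable (and is harmless if the target is not blocked). cond broadcast(), initialization, and error handling are omitted.

    /* All names and layouts below are hypothetical.  Assumes your mutex
     * interface (mutex_t, mutex_lock(), mutex_unlock()) is visible here. */
    void deschedule_unless(int *dont_sleep);   /* hypothetical primitive */
    void wake(int tid);                        /* hypothetical primitive */
    int thr_getid(void);                       /* from your thread library */

    typedef struct cond_waiter {
        int tid;
        int wakeup;                  /* set nonzero by cond_signal() */
        struct cond_waiter *next;
    } cond_waiter_t;

    typedef struct cond {
        mutex_t qlock;               /* protects the waiter queue */
        cond_waiter_t *head, *tail;
    } cond_t;

    int cond_wait(cond_t *cv, mutex_t *mp) {
        cond_waiter_t self;
        self.tid = thr_getid();
        self.wakeup = 0;
        self.next = NULL;

        mutex_lock(&cv->qlock);              /* enqueue ourselves */
        if (cv->tail != NULL) cv->tail->next = &self; else cv->head = &self;
        cv->tail = &self;
        mutex_unlock(&cv->qlock);

        mutex_unlock(mp);                    /* release the caller's mutex */
        deschedule_unless(&self.wakeup);     /* sleep unless already signaled */
        mutex_lock(mp);                      /* re-acquire before returning */
        return 0;
    }

    int cond_signal(cond_t *cv) {
        mutex_lock(&cv->qlock);
        cond_waiter_t *w = cv->head;
        if (w != NULL) {
            cv->head = w->next;
            if (cv->head == NULL) cv->tail = NULL;
            int tid = w->tid;        /* copy before the waiter can return */
            w->wakeup = 1;           /* closes the race with deschedule_unless() */
            wake(tid);
        }
        mutex_unlock(&cv->qlock);
        return 0;
    }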

4.4 Semaphores

As discussed in class, semaphores are a higher-level construct than mutexes and condition variables. Implementing semaphores on top of mutexes and condition variables should be a straightforward but hopefully illuminating experience.

• int sem init( sem t *sem, int count ) - This function should initialize the semaphore pointed to by sem to the value count. Effects of using a semaphore before it has been initialized may be undefined. This function returns zero on success and a number less than zero on error.

• int sem destroy( sem t *sem ) - This function should “deactivate” the semaphore pointed to by sem. Effects of using a semaphore after it has been destroyed may be undefined. If sem destroy() is called while threads are still blocked waiting on the semaphore, then the function should return an error immediately. This function returns zero on success and a number less than zero on error.

• int sem wait( sem t *sem ) - The semaphore wait function allows a thread to decrement a semaphore value, and may cause it to block indefinitely until it is legal to perform the decrement. This function returns zero on success, and a negative number on error.

• int sem signal( sem t *sem ) - This function should wake up a thread waiting on the semaphore pointed to by sem, if one exists, and should update the semaphore value regardless. This function returns zero on success, and a negative number on error.
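For instance, one minimal sketch along those lines, using only the mutex and condition-variable interfaces defined above; the struct layout is an assumption, and sem destroy() plus error handling are omitted:

    /* Assumed layout: a count protected by a mutex, plus a condition
     * variable to wait on when the count is exhausted. */
    typedef struct sem {
        mutex_t lock;
        cond_t  nonzero;
        int     count;
    } sem_t;

    int sem_init(sem_t *sem, int count) {
        if (mutex_init(&sem->lock) < 0 || cond_init(&sem->nonzero) < 0)
            return -1;
        sem->count = count;
        return 0;
    }

    int sem_wait(sem_t *sem) {
        mutex_lock(&sem->lock);
        while (sem->count <= 0)                    /* wait until decrement is legal */
            cond_wait(&sem->nonzero, &sem->lock);
        sem->count--;
        mutex_unlock(&sem->lock);
        return 0;
    }

    int sem_signal(sem_t *sem) {
        mutex_lock(&sem->lock);
        sem->count++;                              /* update the value regardless */
        cond_signal(&sem->nonzero);                /* wake one waiter, if any */
        mutex_unlock(&sem->lock);
        return 0;
    }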


4.5 Readers/writers locks

Readers/writers locks allow multiple threads to have “read” access to some object simultaneously. They enforce the requirement that if any thread has “write” access to an object, no other thread may have either kind of access (“read” or “write”) to the object at the same time. These types of locking behaviors are often called “shared” (for readers) and “exclusive” (for writers) locks. Refer to Section 7.5.2 of the textbook for details.

The generic version of this problem is called the “readers/writers problem.” Two standard formulations of the readers/writers problem exist, called unimaginatively the “first” and “second” readers/writers problems. In the “first” readers/writers problem, no reader will be forced to wait unless a writer has already obtained an exclusive lock. In the “second” readers/writers problem, no new reader can acquire a shared lock if a writer is waiting. You should think through the reasons that these formulations allow starvation of different access types; starvation of writers in the case of the “first” readers/writers problem and starvation of readers in the case of the “second” readers/writers problem.

In addition to a correct implementation of shared and exclusive locking, we expect you to implement a solution that is “at least as good as” a solution to the “second” readers/writers problem. That is, your solution should not allow starvation of writers. Your solution need not strictly follow either of the above formulations: it is possible to build a solution which does not starve any client. No matter what you choose to implement, you should explain what, how, and why.

You may choose which underlying primitives (e.g., mutex/cvar or semaphore) you use to implement readers/writers locks. Once again, you should explain the reasoning behind your choice.

• int rwlock init( rwlock t *rwlock ) - This function should initialize the lock pointed to by rwlock. Effects of using a lock before it has been initialized may be undefined. This function returns zero on success and a number less than zero on error.

• int rwlock destroy( rwlock t *rwlock ) - This function should “deactivate” the lock pointed to by rwlock. Effects of using a lock after it has been destroyed may be undefined. If rwlock destroy() is called while the lock is held by any number of threads or while threads are still blocked waiting on the lock, then the function should return an error immediately. This function returns zero on success and a number less than zero on error.

• int rwlock lock( rwlock t *rwlock, int type ) - The type parameter is required to be either RWLOCK READ (for a shared lock) or RWLOCK WRITE (for an exclusive lock). This function blocks the calling thread until it has been granted the requested form of access. This function returns zero on success, and a negative number on error.

• int rwlock unlock( rwlock t *rwlock ) - This function indicates that the calling thread is done using the locked state in whichever mode it was granted access for. Whether a call to this function does or does not result in a thread being awakened depends on the policy you chose to implement. This function returns zero on success, and a negative number on error.

Note: We will not grade your readers/writers implementation unless your thread library passes a specified series of tests; see Section 10.
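For orientation only, here is the classical shape of a “second”-formulation (writer-priority) lock built on mutexes and condition variables. The field names are assumptions, initialization, destruction, and error handling are omitted, and, as discussed above, you are free to implement a different policy as long as you explain it.

    /* Assumed layout for this sketch.  RWLOCK_READ / RWLOCK_WRITE are the
     * constants named in the API above. */
    typedef struct rwlock {
        mutex_t lock;
        cond_t  readers_ok;      /* signaled when readers may proceed */
        cond_t  writer_ok;       /* signaled when one writer may proceed */
        int     active_readers;
        int     waiting_writers;
        int     writer_active;   /* 0 or 1 */
    } rwlock_t;

    int rwlock_lock(rwlock_t *rw, int type) {
        mutex_lock(&rw->lock);
        if (type == RWLOCK_READ) {
            /* New readers yield to any waiting writer; this is what keeps
             * writers from starving. */
            while (rw->writer_active || rw->waiting_writers > 0)
                cond_wait(&rw->readers_ok, &rw->lock);
            rw->active_readers++;
        } else {  /* RWLOCK_WRITE */
            rw->waiting_writers++;
            while (rw->writer_active || rw->active_readers > 0)
                cond_wait(&rw->writer_ok, &rw->lock);
            rw->waiting_writers--;
            rw->writer_active = 1;
        }
        mutex_unlock(&rw->lock);
        return 0;
    }

    int rwlock_unlock(rwlock_t *rw) {
        mutex_lock(&rw->lock);
        if (rw->writer_active) {
            rw->writer_active = 0;
            if (rw->waiting_writers > 0)
                cond_signal(&rw->writer_ok);      /* hand off to a writer */
            else
                cond_broadcast(&rw->readers_ok);  /* let readers in */
        } else if (--rw->active_readers == 0 && rw->waiting_writers > 0) {
            cond_signal(&rw->writer_ok);          /* last reader out */
        }
        mutex_unlock(&rw->lock);
        return 0;
    }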

4.6 Safety & Concurrency

Please keep in mind that much of the code for this project must be thread safe. In particular the thread library itself should be thread safe. However, by its nature a thread library must also be concurrent. In other words, you may not solve the thread-safety problem with a hammer, such as using a global lock to ensure that only one thread at a time can be running thread library code. In general, it should be possible for many threads to be running each library interface function “at the same time.” As you design your library, your model should be that many system calls “take a while to run.” You should try to avoid situations where “too many” threads are waiting “too long” because of this.

4.7 Distribution Files

To begin working on the project, fetch and unpack the tarball posted on the course web page. Please read the README included therein.

5 Documentation

For each project in 15-410, functions and structures should be documented using doxygen. Doxygen uses syntax similar to Javadoc. The Doxygen documentation can be found on the course website. The provided Makefile has a target called html doc that will invoke doxygen on the source files listed in the Makefile.
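For example, a doxygen-style header for a library function might look like the following; the tags shown are standard doxygen, and the function is just an illustration.

    /** @brief Initializes the thread library.
     *
     *  Must be called exactly once, before any other thread library
     *  function or malloc()-family call.
     *
     *  @param size Amount of stack space available to each thread
     *              created with thr_create().
     *  @return Zero on success, a negative number on error.
     */
    int thr_init(unsigned int size);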

6 Thread Group Library

A commonly used program paradigm involves one or more manager threads overseeing the completion of a large task which has been split into parts assigned to a pool of worker threads. Examples of this model include databases, Apache, and Firefox. Once a worker thread has completed its job, it exits; manager threads dispatch new worker threads based on system load, new requests, and the results obtained by previous worker threads. In this environment it is not convenient for a manager to know which particular worker thread it should next call thr join() on; instead it is convenient to wait until the next thread in the worker pool completes.

We have provided you with a simple library implementing “thread groups.” This library essentially provides an abstraction above the thread library you will write: a compliant program will use thrgrp create() and thrgrp join() instead of calling thr create() and thr join() directly. These functions and their requisite data structures are defined in 410user/lib/thrgrp/thrgrp.c and 410user/lib/inc/thrgrp.h.

• thrgrp group t A structure representing a thread group.

• thrgrp init group(thrgrp group t *tg) This function initializes a thread group. It must be called before the thread group is used for anything. Returns 0 on success, non-zero otherwise.

• thrgrp destroy group(thrgrp group t *tg) This function destroys a thread group, cleaning up all of its memory. This should be called if a thread group isn’t to be used further. The effects of using a thread group after it has been destroyed are undefined. Returns 0 on success, non-zero otherwise.

• thrgrp create(thrgrp group t *tg, void *(*func)(void *), void *arg) This function spawns a new thread (analogous to thr create()) in the thread group tg. The spawned thread must not call thr exit(). Instead, func() should return an exit code (of type void *) which will be made available to a manager thread. Returns 0 on success, non-zero otherwise.

• thrgrp join(thrgrp group t *tg, void **statusp) If there are any unreaped threads in the thread group tg then it will reap one of them, setting *statusp appropriately, and return. If there are no unreaped threads in the group, it will block until one does exit, reap it, and return.
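As a rough illustration of the manager/worker pattern using these interfaces; the worker body, the worker count, and the include paths are made up for this example:

    #include <thrgrp.h>   /* 410user/lib/inc/thrgrp.h */
    #include <stdio.h>

    #define NWORKERS 4    /* arbitrary */

    /* Hypothetical worker: its return value becomes its "exit status". */
    static void *worker(void *arg) {
        int id = (int)arg;
        return (void *)(id * id);
    }

    static int manager(void) {
        thrgrp_group_t tg;
        void *status;
        int i;

        thrgrp_init_group(&tg);
        for (i = 0; i < NWORKERS; i++)
            thrgrp_create(&tg, worker, (void *)i);

        /* Reap the workers in whatever order they happen to finish. */
        for (i = 0; i < NWORKERS; i++) {
            thrgrp_join(&tg, &status);
            printf("a worker finished with status %p\n", status);
        }

        thrgrp_destroy_group(&tg);
        return 0;
    }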

7 The C Library

This is simply a list of the most common library functions that are provided. For details on using these functions please see the appropriate man pages. Other functions are provided that are not listed here; please see the appropriate header files for a full listing of the provided functions.

Some functions typically found in a C I/O library are provided by 410user/lib/libstdio.a. The header file for these functions is 410user/lib/inc/stdio.h, aka #include <stdio.h>.

• int putchar(int c)

• int puts(const char *str)

• int printf(const char *format, ...)

• int sprintf(char *dest, const char *format, ...)

• int snprintf(char *dest, int size, const char *format, ...)

• int sscanf(const char *str, const char *format, ...)

• void lprintf( const char *format, ...)

Note that lprintf() is the user-space analog of the lprintf kern() you used in Project 1.

Some functions typically found in various places in a standard C library are provided by 410user/lib/libstdlib.a. The header files for these functions, in 410user/lib/inc, are stdlib.h, assert.h, and ctype.h.

• int atoi(const char *str)

• long atol(const char *str)

• long strtol(const char *in, const char **out, int base)

• unsigned long strtoul(const char *in, const char **out, int base)

• void panic(const char *format, ...)

• void assert(int expression)

We are providing you with non-thread-safe versions of the standard C library memory allocation routines. You are required to provide a thread-safe wrapper routine with the appropriate name (remove the underscore) for each provided routine. These should be genuine wrappers, i.e., do not copy and modify the source code for the provided routines.

• void *_malloc(size_t size)

• void *_calloc(size_t nelt, size_t eltsize)

• void *_realloc(void *buf, size_t new_size)

• void _free(void *buf)
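For example, a genuine wrapper might look like the following sketch; the header names, the single global lock, and the point at which that lock gets initialized (presumably from thr init()) are all assumptions:

    #include <stddef.h>   /* size_t */
    #include <mutex.h>    /* your mutex interface (assumed header name) */

    /* Prototypes of the provided, non-thread-safe routines. */
    void *_malloc(size_t size);
    void _free(void *buf);

    static mutex_t alloc_lock;   /* initialized during thr_init() */

    void *malloc(size_t size) {
        void *p;
        mutex_lock(&alloc_lock);
        p = _malloc(size);       /* delegate to the provided allocator */
        mutex_unlock(&alloc_lock);
        return p;
    }

    void free(void *buf) {
        mutex_lock(&alloc_lock);
        _free(buf);
        mutex_unlock(&alloc_lock);
    }

    /* calloc() and realloc() wrappers follow the same pattern. */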

You may assume that no calls to functions in the “malloc() family” will be made before the call to thr init(). These functions will typically seek to allocate memory regions from the kernel which start at the top of the data segment and proceed to grow upward. You will thus need to plan your use of the available address space with some care.

Some functions typically found in a C string library are provided by 410user/lib/libstring.a. The header file for these functions is 410user/lib/inc/string.h.

• int strlen(const char *s)

• char *strcpy(char *dest, char *src)

• char *strncpy(char *dest, char *src, int n)

• char *strdup(const char *s)

• char *strcat(char *dest, const char *src)

• char *strncat(char *dest, const char *src, int n)

• int strcmp(const char *a, const char *b)

• int strncmp(const char *a, const char *b, int n)

• void *memmove(void *to, const void *from, unsigned int n)

• void *memset(void *to, int ch, unsigned int n)

• void *memcpy(void *to, const void *from, unsigned int n)

8 Debugging Support Code

The same MAGIC BREAK macro which you used in Project 1 is also available to user code in Project 2 if you #include the user/inc/magic break.h header file. The function call lprintf() may be used to output debugging messages from user programs. Its prototype is in 410user/lib/inc/stdio.h. Also, user code can be symbolically debugged using the Simics symbolic debugger. If you restrict yourself to debugging with printf() it may cost you significant amounts of time.
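For instance, a quick way to log a suspicious condition and stop in the Simics debugger at that exact point; the include lines follow the paths above, and the invariant being checked is of course made up:

    #include <magic_break.h>   /* MAGIC_BREAK, per the header named above */
    #include <stdio.h>         /* lprintf() */

    /* Hypothetical consistency check sprinkled through library code. */
    static void check_queue_length(int len) {
        if (len < 0) {
            lprintf("queue length went negative: %d", len);
            MAGIC_BREAK;       /* drop into the simulator's debugger here */
        }
    }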

9 Deliverables

Implement the functions for the thread library, and concurrency tools conforming to the documented APIs. Hand in all source files that you generate. Make sure to provide a design description in README.dox, including an overview of existing issues and any interesting design decisions you made.

10 Grading Criteria

You will be graded on the completeness and correctness of your project.

A complete project is composed of a reasonable attempt at each function in the API. Also, a complete project follows the prescribed build process, and is well documented. A correct project implements the provided specification. Also, code using the API provided by a correct project will not be killed by the kernel, and will not suffer from inconsistencies due to concurrency errors in the library. Please note that there exist concurrency errors that even carefully-written test cases may not expose. Read and think through your code carefully. Do not forget to consider pathological cases.

The most important parts of the assignment to complete are the thread management, mutex, and condition variable calls. These should be well-designed, solidly implemented, and thoroughly tested with misbehave() (see below). It is probably unwise to devote substantial coding effort to the other parts of the library before the core is reliable. In particular, we will not grade readers/writers implementations for Project 2 submissions which do not pass the “hurdle” subset of the test suite (see the project web page for details).

11 Debugging

11.1 Requests for Help

Please do not ask for help from the course staff with a message like this:

The kernel is killing my threads! Why?

or

Why is my program stuck in malloc()?

An important part of this class is developing your debugging skills. In other words, when you complete this class you should be able to debug problems which you previously would not have been able to handle. Thus, when faced with a problem, you need to invest some time in figuring out a way to characterize it and close in on it so you can observe it in the actual act of destruction. Your reflex when running into a strange new problem should be to start thinking, not to start off by asking for help. Having said that, if a reasonable amount of time has been spent trying to solve a problem and no progress has been made, do not hesitate to ask a question. But please be prepared with a list of details and an explanation of what you have tried and ruled out so far.

11.2 Debugging Strategy

In general, when confronted by a mysterious problem, you should begin with a “story” of what you expect to be happening and measure the system you’re debugging to see where its behavior diverges from your expectations. To do this your story must be fairly detailed. For example, you should have a fairly good mental model of the assembly code generated from a given line of C code. To understand why “a variable has the wrong value” you need to know how the variable is initialized, where its value is stored at various times, and how it moves from one location to another. If you’re confused about this, it is probably good for you to spend some time with gcc -S.

Once your “story” is fleshed out, you will need to measure the system at increasing levels of detail to determine the point of divergence. You will find yourself spending some time thinking about how to pin your code down to observe whether or not a particular misbehavior is happening. You may need to write some code to periodically test data-structure consistency, artificially cause a library routine to fail to observe how your main code responds, log actions taken by your code and write a log-analyzer perl script, etc.

Please note that the user-space memory allocator we provide you with is very similar to the allocator written by 15-213 students in the sense that errors reported by the allocator, or program crashes which take place inside the allocator, are likely to mean that the user of some memory overflowed it and corrupted the allocator’s meta-data. In the other direction, complaints by “lmm” are coming from the kernel’s memory allocator, and probably indicate kernel bugs (see below).

11.3 Reference Kernel Panics and Crashes

If the Pebbles kernel tells you something went horribly wrong and drops you into the debugger, don’t panic. It probably won’t happen to most of you, but we are fully aware that we haven’t nailed the last bug yet... It’s probably a good idea for you to tar up your working directory and make a brief note of what you were doing when the kernel ran into trouble. For example, what sequence of test programs had you run since boot? If you have a short repeatable way of getting the kernel to die that’s excellent, and we’d appreciate a snapshot that lets us reproduce it, even if you then go on to modify your code to make the crash go away. To send us a snapshot, tar it up somewhere in your group’s scratch directory,

tar cfz .../mygroup/scratch/kcrash.somename.tgz .

create a brief summary of how to reproduce it,

$EDITOR .../mygroup/scratch/kcrash.somename.README

and send a brief note to the staff mailing list. While such an event will of course attract our attention, it’s not likely that we can provide a fix in a small number of minutes...you may need to try to guess what went wrong and work around it temporarily, or work on some other part of your project for a while.

12 Strategy

12.1 Suggestions

First, this may be the first time you have written code with this variety and density of concurrency hazards. If so, you will probably find this code much harder to debug than code you’ve written before, i.e., you should allocate more debugging time than usual. Of course, the silver lining in this cloud is that experience debugging concurrent code will probably be useful to you after you leave this class.

Second, several of the thread library functions are much harder than they first appear. It is fairly likely that you will write half the code for a thread library function before realizing that you’ve never written “that kind of code” before. When this happens the best course of action is probably to come to a complete stop, think your way through the problem, and then explain the problem and your proposed solution to your partner. It may also happen that as you write your fifth function you realize your second must be scrapped and re-written.

Third, the Pebbles kernel offers a feature intended to help you increase the solidity of your code. A special system call, void misbehave( int mode ), alters the behavior of the kernel in ways which may expose unwarranted assumptions or concurrency bugs in your library code. Values for mode range from zero (the default behavior) to thirty-one, or you may select -1 for behavior which may be particularly challenging. As you experiment with misbehave(), you may become able to predict or describe the behavior of a particular mode. Each group must keep confidential its own understanding of the meanings of particular mode values.

Fourth, we recommend against splitting the assignment into two parts, working separately until the penultimate day, and then meeting to “put the pieces together.” Instead, we recommend the opposite, namely that you make it a habit to read and talk about each other’s code every few days. You may encounter an exam question related to code your partner wrote.

Fifth, we have observed that a particularly bad division of labor is for one person to write system call stubs, linked lists, queues, and maybe semaphores, while the other person writes everything else. This puts the first person at risk of doing poorly on exams.

Sixth, instead of typing linked-list traversal code 100 times throughout your library, thus firmly and eternally committing yourselves to a linear-time data structure, give some consideration to encapsulation (see the sketch at the end of this section).

Seventh, we strongly recommend that you use a source-control system to manage the evolution and/or devolution of your code. While the complexity of this project does not outright necessitate the use of source control, this is a good opportunity for you to get used to it and set up a work flow with your partner.

Eighth, don’t forget to do an update when make starts beeping at you. If you’re in the middle of debugging a problem, you probably don’t want to switch kernels, but you generally do want to upgrade when we issue new things, because we do so to help. A particularly bad thing to do is to work on your thread library for two weeks using the very oldest kernel and then 15 minutes before the assignment deadline switch to the very newest one and find that one time in a thousand you call new pages() in an improper way which got through before and doesn’t any more. So don’t do that. The update process gives you the power to decide when to import changes, but that means the responsibility lies with you as well.

Ninth, when you are doing design, ask yourself what things your thread library can accomplish in parallel. Also, if you ever find yourself confused by exactly what “before” means, consider what it might mean on a multi-processor machine.
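Here is the kind of tiny interface the sixth suggestion has in mind; the names are made up, and the point is that mutex wait queues, condition-variable wait queues, and join bookkeeping can all share it, while its implementation can later be swapped out without touching its callers:

    /* Hypothetical queue interface. */
    typedef struct qnode {
        struct qnode *next;
    } qnode_t;

    typedef struct queue {
        qnode_t *head, *tail;
    } queue_t;

    void queue_init(queue_t *q) {
        q->head = q->tail = NULL;
    }

    void queue_enqueue(queue_t *q, qnode_t *n) {
        n->next = NULL;
        if (q->tail != NULL)
            q->tail->next = n;
        else
            q->head = n;
        q->tail = n;
    }

    qnode_t *queue_dequeue(queue_t *q) {
        qnode_t *n = q->head;
        if (n != NULL) {
            q->head = n->next;
            if (q->head == NULL)
                q->tail = NULL;
        }
        return n;
    }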

12.2 Suggested Steps

1. Read the handout.

2. Promptly write system call wrappers for one or two system calls and run a small test program using those system calls. This is probably the best way to engage yourself in the project and to get an initial grasp of its scope. Good system calls to begin with are set status() and vanish(), since the C run-time start-up code invokes the exit() library routine, which depends on them. A good second step would be print().

3. Write the remaining system call wrappers (with the exception of thread fork).

4. Design and make a draft version of mutexes and condition variables. In order to do that, you will probably need to perform a hazard analysis of which code sequences in your thread library would suffer if the scheduler switched from executing one of your threads to another. Now would not be a bad time to read the source to the “thread group” library (Section 6).

5. What can you test at this point? Be creative.

6. Think hard about stacks. What should the child’s stack look like before and after a thread fork? In fact, it is probably a good idea for you to draw every detail of the parent’s stack and the child’s stack before and after thread fork. You should reach this point by Monday, September 24th .

7. Write and test thr init() and thr create(). Run the STARTLE test. You should reach this point by Wednesday, September 26th .

8. Write thr exit(). Don’t worry about reporting exit status, yet—it’s tricky enough without that!

9. Test mutexes and condition variables. Try to reach this point by Friday, September 28th .

10. Try all the misbehave() flavors.

11. Write and test thr join().

12. Worry about reporting the exit status.

13. This might be a good point to relax and have fun writing semaphores.

14. Test. Debug. Test. Debug. Test. Sleep once in a while.

15. Try all the misbehave() flavors (again). Note that most of the tests provided to you by the course staff (see 410user/progs/README) are really multiple tests if you think about it... you probably shouldn’t declare a test “passed” until all versions pass. Remember that you should be running CYCLONE and AGILITY DRILL by Tuesday, October 2nd.

16. Design, implement, and test readers/writers locks.

17. Celebrate! You have assembled a collection of raw system calls into a robust and useful thread library.

12.3 Questions & Challenges

Below we briefly discuss common questions about this assignment and issue several optional challenges. It is very important that your implementation be solid, and you should not be diverted from this primary goal by attempting to solve these challenges. However, we are providing this challenge list as a way for interested students to deepen their understanding and sharpen their design skills.

12.3.1 Questions

From time to time we are asked how many threads must be supported by a library implementation. In general the answer is that the thread library should not be a limiting factor—it should be possible to use all available memory for threads, and of course it could happen one day that Pebbles would run on a machine with more memory. In the other direction, if you feel you must impose a static limit on the number of threads (or some other run-time feature), you should document your reasoning and we will attempt to take it into account.

Sometimes we are asked to state a simple requirement about bounded waiting (e.g., “Are we required to implement the bounded waiting presented in the lecture slides?”). Since this is a design class, you should give serious consideration to the issue of bounded waiting and the interplay between bounded waiting and the system environment you will be using. Then you should be in a position to evaluate the necessity of ensuring or approximating bounded waiting and how you might go about doing that. Whatever you choose to do should sensibly balance cost against utility. Your project documentation should briefly but convincingly explain your reasoning.

It has been pointed out to us that, if a thread is killed by the kernel as a result of some improper behavior, your thread library has no way to find out about this and take remedial action. This is true, and thus the thread library cannot be held responsible for gross misbehavior on the part of threads it hosts. In fact, you can probably think of other sorts of errant behavior which your library can’t reasonably protect against.

12.3.2 Challenge: efficient thr getid()

There is an easy way to implement thr getid(), but it is woefully inefficient. Can you do better? We have given you a serious hint.

12.3.3 Challenge: thr init()

Is it really necessary that thr init() be called before malloc()? How might you build malloc() to make that unnecessary? Is it really necessary to require the root thread of a task to explicitly call thr exit()? Is there a way thr init() can arrange for that call to happen automatically? Hint: not all approaches which appear to be solutions to this challenge actually are.

12.3.4 Challenge: “reaper thread”

If you feel you need a “reaper thread,” consider whether it’s really necessary.

12.3.5 Challenge: memory-efficient thr exit()

Since there is no bound on how much time can pass between a thread exiting and its “parent” or “manager” thread calling thr join(), it is undesirable for a “zombie thread” to hold onto large amounts of memory. Can you avoid this situation? There are multiple approaches, with different tradeoffs.
