
The following paper was originally published in the Proceedings of the 3rd Symposium on Operating Systems Design and Implementation, New Orleans, Louisiana, February 1999.

Resource containers: A new facility for resource management in server systems

Gaurav Banga, Peter Druschel (Dept. of Computer Science, Rice University, Houston, TX 77005)
Jeffrey C. Mogul (Western Research Laboratory, Compaq Computer Corporation, Palo Alto, CA 94301)
{gaurav, druschel}@cs.rice.edu    [email protected]

Abstract

General-purpose operating systems provide inadequate support for resource management in large-scale servers. Applications lack sufficient control over scheduling and management of machine resources, which makes it difficult to enforce priority policies, and to provide robust and controlled service. There is a fundamental mismatch between the original design assumptions underlying the resource management mechanisms of current general-purpose operating systems, and the behavior of modern server applications. In particular, the operating system's notions of protection domain and resource principal coincide in the process abstraction. This coincidence prevents a process that manages large numbers of network connections, for example, from properly allocating system resources among those connections.

We propose and evaluate a new operating system abstraction called a resource container, which separates the notion of a protection domain from that of a resource principal. Resource containers enable fine-grained resource management in server systems and allow the development of robust servers, with simple and firm control over priority policies.

1 Introduction

Networked servers have become one of the most important applications of large computer systems. For many users, the perceived speed of computing is governed by server performance. We are especially interested in the performance of Web servers, since these must often scale to thousands or millions of users.

Operating systems researchers and system vendors have devoted much attention to improving the performance of Web servers. Improvements in operating system performance have come from reducing data movement costs [2, 35, 43], developing better kernel algorithms for protocol control block (PCB) lookup [26] and file descriptor allocation [6], improving stability under overload [15, 30], and improving server control mechanisms [5, 21]. Application designers have also attacked performance problems by making more efficient use of existing operating systems. For example, while early Web servers used a process per connection, recent servers [41, 49] use a single-process model, which reduces context-switching costs.

While the work cited above has been fruitful, it has generally treated the operating system's application programming interface (API), and therefore its core abstractions, as a constant. This has frustrated efforts to solve thornier problems of server scaling and effective control over resource consumption. In particular, servers may still be vulnerable to "denial of service" attacks, in which a malicious client manages to consume all of the server's resources. Also, service providers want to exert explicit control over resource consumption policies, in order to provide differentiated quality of service (QoS) to clients [1] or to control resource usage by guest servers in a Rent-A-Server host [45]. Existing APIs do not allow applications to directly control resource consumption throughout the host system.

The root of this problem is the model for resource management in current general-purpose operating systems. In these systems, scheduling and resource management primitives do not extend to the execution of significant parts of kernel code. An application has no control over the consumption of many system resources that the kernel consumes on behalf of the application. The explicit resource management mechanisms that do exist are tied to the assumption that a process is what constitutes an independent activity¹. Processes are the resource principals: those entities between which the resources of the system are to be shared.

¹ We use the term independent activity to denote a unit of computation for which the application wishes to perform separate resource allocation and accounting; for example, the processing associated with a single HTTP request.

Modern high-performance servers, however, often use a single process to perform many independent activities. For example, a Web server may manage hundreds or even thousands of simultaneous network connections, all within the same process. Much of the resource consumption associated with these connections occurs in kernel mode, making it impossible for the application to control which connections are given priority².

² In this paper, we use the term priority loosely to mean the current scheduling precedence of a resource principal, as defined by the scheduling policy based on the principal's scheduling parameters. The scheduling policy in use may not be priority based.

In this paper, we address resource management in monolithic kernels. While microkernels and other novel systems offer interesting alternative approaches to this problem, monolithic kernels are still commercially significant, especially for Internet server applications.

We describe a new model for fine-grained resource management in monolithic kernels. This model is based on a new operating system abstraction called a resource container. A resource container encompasses all system resources that the server uses to perform a particular independent activity, such as servicing a particular client connection. All user and kernel level processing for an activity is charged to the appropriate resource container, and scheduled at the priority of the container. This model allows fairly arbitrary interrelationships between protection domains, threads, and resource containers, and can therefore support a wide range of resource management scenarios.

We evaluate a prototype implementation of this model, as a modification of Digital UNIX, and show that it is effective in solving the problems we described.

2 Typical models for high-performance servers

This section describes typical execution models for high-performance Internet server applications, and provides the background for the discussion in following sections. To be concrete, we focus on HTTP servers and proxy servers, but most of the issues also apply to other servers, such as mail, file, and directory servers. We assume the use of a UNIX-like API; however, most of this discussion is valid for servers based on Windows NT.

An HTTP server receives requests from its clients via TCP connections. (In HTTP/1.1, several requests may be sent serially over one connection.) The server listens on a well-known port for new connection requests. When a new connection request arrives, the system delivers the connection to the server application via the accept() system call. The server then waits for the client to send a request for data on this connection, parses the request, and then returns the response on the same connection. Web servers typically obtain the response from the local file system, while proxies obtain responses from other servers; however, both kinds of server may use a cache to speed retrieval. Stevens [42] describes the basic operation of HTTP servers in more detail.

The architecture of HTTP servers has undergone radical changes. Early servers forked a new process to handle each HTTP connection, following the classical UNIX model. The forking overhead quickly became a problem, and subsequent servers (such as the NCSA httpd [32]) used a set of pre-forked processes. In this model, shown in Figure 1, a master process accepts new connections and passes them to the pre-forked worker processes.

[Fig. 1: A process-per-connection HTTP server with a master process. Diagram: a master process and HTTP slave processes at user level; the listen socket, TCP/IP stack, and pending/established HTTP connections in the kernel.]

Multi-process servers can suffer from context-switching and interprocess communication (IPC) overheads [11, 38], so many recent servers use a single-process architecture. In the event-driven model (Figure 2), the server uses a single thread to manage all connections at the server. (Event-driven servers designed for multiprocessors use one thread per processor.) The server uses the select() (or poll()) system call to simultaneously wait for events on all connections it is handling. When select() delivers one or more events, the server's main loop invokes handlers for each ready connection. Squid [41] and Zeus [49] are examples of event-driven servers.

[Fig. 2: A single-process event-driven server. Diagram: one HTTP server process with a single thread calling select() at user level; the listen socket, TCP/IP stack, and pending/established HTTP connections in the kernel.]

Alternatively, in the single-process multi-threaded model (Figure 3), each connection is assigned to a unique thread. These can either be user-level threads or kernel threads. The thread scheduler is responsible for time-sharing the CPU between the various server threads.

[Fig. 3: diagram residue only: an HTTP server process with multiple HTTP threads; caption not recovered.]

entity. The process is also the "chargeable"
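The accept()/parse/respond cycle described in Section 2 can be sketched in a few lines of Python, whose socket API mirrors the UNIX calls the paper names. This is a toy, loopback-only illustration; the function name serve_one_request and the response text are ours, not from the paper:

```python
import socket
import threading

def serve_one_request(listen_sock):
    # accept() delivers the new connection to the server application
    conn, _addr = listen_sock.accept()
    with conn:
        request = conn.recv(4096).decode()     # wait for the client's request
        first_line = request.split("\r\n")[0]  # e.g. "GET /index.html HTTP/1.1"
        body = "you asked for " + first_line.split()[1]
        # return the response on the same connection
        conn.sendall(("HTTP/1.1 200 OK\r\n"
                      "Content-Length: %d\r\n"
                      "Connection: close\r\n\r\n" % len(body) + body).encode())

# listen on a port for new connection requests (ephemeral port here;
# a real server would use a well-known port such as 80)
listener = socket.socket()
listener.bind(("127.0.0.1", 0))
listener.listen(5)
port = listener.getsockname()[1]

server = threading.Thread(target=serve_one_request, args=(listener,))
server.start()

client = socket.create_connection(("127.0.0.1", port))
client.sendall(b"GET /index.html HTTP/1.1\r\nHost: x\r\n\r\n")
reply = client.recv(4096).decode()
client.close()
server.join()
listener.close()
```

The server thread here handles exactly one request; the per-connection, pre-forked, event-driven, and multi-threaded models described in Section 2 differ precisely in how this cycle is replicated across many simultaneous connections.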
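The event-driven model (Figure 2) can likewise be sketched: a single thread multiplexes the listen socket and all open connections through select(), dispatching a handler for each ready descriptor. A minimal sketch under our own simplifications (echo-style handlers, one request per connection, a fixed client count to terminate the loop):

```python
import select
import socket
import threading

def event_loop(listener, n_clients):
    conns = []
    served = 0
    while served < n_clients:
        # simultaneously wait for events on the listen socket and all connections
        readable, _, _ = select.select([listener] + conns, [], [])
        for sock in readable:
            if sock is listener:
                conn, _addr = listener.accept()   # new connection is ready
                conns.append(conn)
            else:
                data = sock.recv(4096)            # invoke this connection's handler
                sock.sendall(b"echo:" + data)
                conns.remove(sock)
                sock.close()
                served += 1

listener = socket.socket()
listener.bind(("127.0.0.1", 0))
listener.listen(16)
port = listener.getsockname()[1]

t = threading.Thread(target=event_loop, args=(listener, 3))
t.start()

replies = []
for i in range(3):
    c = socket.create_connection(("127.0.0.1", port))
    c.sendall(b"req%d" % i)
    replies.append(c.recv(4096).decode())
    c.close()
t.join()
listener.close()
```

Note that a single thread serves every connection, which is exactly why, under the process-based resource model the paper critiques, the kernel cannot distinguish the per-connection work being done inside that one process.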
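The single-process multi-threaded model (Figure 3) assigns each accepted connection to its own thread. The sketch below uses Python threads for illustration (the paper's model permits either user-level or kernel threads); the acceptor/handle structure and reply format are our own:

```python
import socket
import threading

def handle(conn, tid):
    # each connection is served entirely by its own thread
    with conn:
        data = conn.recv(4096)
        conn.sendall(b"thread-%d:" % tid + data)

def acceptor(listener, n_conns):
    workers = []
    for tid in range(n_conns):
        conn, _addr = listener.accept()   # dispatch the connection to a unique thread
        t = threading.Thread(target=handle, args=(conn, tid))
        t.start()
        workers.append(t)
    for t in workers:
        t.join()

listener = socket.socket()
listener.bind(("127.0.0.1", 0))
listener.listen(8)
port = listener.getsockname()[1]

acc = threading.Thread(target=acceptor, args=(listener, 2))
acc.start()

replies = []
for i in range(2):
    c = socket.create_connection(("127.0.0.1", port))
    c.sendall(b"GET /%d" % i)
    replies.append(c.recv(4096).decode())
    c.close()
acc.join()
listener.close()
```

The thread scheduler, not the application's main loop, decides which connection makes progress; all threads still share one process, and therefore one resource principal in conventional systems.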