A POSIX Threads Kernel

Total Page:16

File Type:pdf, Size:1020Kb

A POSIX Threads Kernel THE ADVANCED COMPUTING SYSTEMS ASSOCIATION The following paper was originally published in the Proceedings of the FREENIX Track: 1999 USENIX Annual Technical Conference Monterey, California, USA, June 6–11, 1999 pk: A POSIX Threads Kernel Frank W. Miller Cornfed Systems, Inc. © 1999 by The USENIX Association All Rights Reserved Rights to individual papers remain with the author or the author's employer. Permission is granted for noncommercial reproduction of the work for educational or research purposes. This copyright notice must be included in the reproduced paper. USENIX acknowledges all trademarks herein. For more information about the USENIX Association: Phone: 1 510 528 8649 FAX: 1 510 548 5738 Email: [email protected] WWW: http://www.usenix.org pk: A POSIX Threads Kernel Frank W. Miller Cornfed Systems, Inc. www.cornfed.com Intro duction pk makes use of a literate programming to ol called noweb. The basic concept is simple. Both do cu- mentation and co de are contained in a single noweb pk is a new op erating system kernel targeted for use le that uses several sp ecial formatting conventions. in real-time and emb edded applications. There are Two to ols are provided. noweave extracts the do c- twonovel asp ects to the pk design: umentation p ortion of the noweb le and gener- A ates a do cumentation le, in this case a L T X le. E notangle extracts the source co de p ortion of the Documentation: The kernel is do cumented us- noweb le and generates a source co de le, in this ing literate programming techniques and the case C source co de. noweb [4] to ol in particular. POSIX Threads with Memory Protection: The The main consequence of using literate program- concurrency mo del is based on the POSIX ming, and noweb in this case, is that changes to the Threads aka Pthreads [2, 3] standard, how- system after initial development are p erformed on ever, the kernel also provides page-based mem- the noweb source les. Since the co de is intermixed ory protection using Memory Management with its asso ciated do cumentation, it is more likely Unit MMU hardware. that the do cumentation will b e up dated at the same time. This short pap er discusses these facets of the pk ker- This is the rst non-trivial pro ject I have under- nel pro ject. The use of literate programming is pre- taken using literate programming, and I have seen sented rst, followed by a brief description of some an evolution in my use of the to ols as I have pro- of the pk design issues. gressed with it. As with many pro jects, it was b e- gun drawing on co de from another pro ject. In this case, I drew on some elements of the Roadrunner op erating system [1]. These were basic elements, Literate Programming like initialization, interrupt pro cessing, and memory management, that were needed to get a new ker- nel up and running quickly. These reused elements Do cumentation is as imp ortant as the software it were not do cumented with noweb initially and some do cuments. This b elief led me to contemplate how remain undo cumented still although my goal is to to do cument the pk kernel design as a primary goal. do cument the entire system using noweb over time. In my exp erience, the biggest problem with gener- ating do cumentation is that it often seems to be The rst new element to b e written was the set of a secondary activity, p erformed after the co de is basic Pthread routines. I rst wrote the co de and written. I b ecame interested in the p otential for only after it was completed and tested to some de- the documentation discipline asso ciated with liter- gree, did I go back and overlay the do cumentation ate programming techniques and decided to make and formatting to turn the C source co de les into use of these techniques with pk. noweb les. This pattern rep eated itself during the implementation of mutexes and condition variables. By discipline, I refer to a structure within whicha noweb do cumentation was added only after the fact. pro ject is p erformed that provides an incentive to generate do cumentation as the co de is b eing writ- It happ ened that once I had completed the Pthreads ten. Literate programming to ols provide a mecha- routines, I decided to investigate the addition of pro- nism that fosters such structure. tected memory to the kernel. Design issues asso ci- small routines or the co de itself maybesointuitive ated with this decision are discussed in the next sec- that only high-level prose is required to get across tion. Continuing here, I want to discuss the implica- their function. tions on do cumentation that presented themselves. It was this decision that resulted in the rst signi - One interesting p oint ab out the use of literate pro- cantchanges to existing source co de that had b een gramming seems to b e that the licensing asso ciated do cumented using noweb. Sp eci cally, the mem- with the source co de must also apply to the do c- ory management co de, which maintains the heap umentation, since the two are linked in the source of available physical pages, and various parts of the les. pk is available under a BSD-style copyright, Pthreads co de needed to b e up dated. which places essentially no restrictions on redistri- bution. A similar pro ject released under the Gnu My rst thought when I went to make changes to Public License GPL would require the changes to the rst source co de le was, \don't worry ab out the do cumentation to be redistributed in addition the do cumentation, you can come back a x that up to changes source co de. later." I had no so oner op ened my second source le when I realized I would forget what I had done if I There are a variety of literate programming to ols didn't take care of the do cumentation. This would available. I evaluated cweb, written by Donald result in a do cument whose prose didn't match the Knuth, and noweb written by Norman Ramsey. co de asso ciated with it. I had to go back and Knuth's cweb generates do cumentation of co de frag- change it. This was the discipline I had hop ed would ments that are \pretty-printed", i.e. they havean present itself. Iwent back and made the do cumen- algorithmic style reminiscent of textb o oks on com- tation changes. puter science theory. The noweb to ols utilize a small set of simple formatting rules and generate co de At rst this felt cumb ersome, it added time to co de fragments that lo ok cosmetically like they were ex- maintenance. However, two unexp ected e ects b e- tracted from a source co de le. This style seemed gan to emerge. First, I found that my design was more in tune with a systems programming pro ject, cleaner. When I mo di ed the co de and changed like an op erating system kernel, and so I decided to the do cumentation, I thought ab out the problem use noweb over cweb. twice. This led in several instances to a more concise change. Second, I found that I could makechanges more quickly in co de that I had not visited in a while. It may seem obvious, but the do cumentation Pthreads and Memory Protection was right there next to the co de, and this allowed me to refamiliarize myself with it more quickly. Ihave now b egun to implement pieces of co de in the noweb source format as they are written for the rst time. pk is based on the POSIX Threads concurrency The p ower of the conciseness e ect I discovered dur- mo del. Pthreads were originally designed under the ing maintenance is also present when writing co de assumption that all of the threads would execute and do cumentation together during an initial imple- in the same address space. In fact, this address mentation. space was intended to be within a UNIX pro cess. However, the Pthreads API is also used in real- The granularity of the do cumentation varies over time kernels that provide their applications a single, di erent parts of the co de. There are several reasons physical address space. pk is also targeted at real- for this. Foremost, di erent areas of the co de have time and emb edded applications, but it augments b een do cumented at di erent times, and the do c- the Pthreads design to include page-based memory umentation for a particular segment of co de might protection using the MMU. Such a design falls some- not be p erformed all at one sitting. This results where in b etween the basic Pthreads mo del and the in sections of co de that are \complete", i.e. they more substantial pro cess concurrency mo del. are do cumented in great enough detail to under- stand all asp ects of their semantics. Other p ortions Since pk is targeted at time-critical applications, are coarser, p erhaps only setup to t into the over- paging and/or swapping to secondary storage can- all structure of the piece of do cumentation, but not not be utilized. This is b ecause of the signi cant yet completed.
Recommended publications
  • The Pretzel User Manual
    The PretzelBook second edition Felix G¨artner June 11, 1998 2 Contents 1 Introduction 5 1.1 Do Prettyprinting the Pretzel Way . 5 1.2 History . 6 1.3 Acknowledgements . 6 1.4 Changes to second Edition . 6 2 Using Pretzel 7 2.1 Getting Started . 7 2.1.1 A first Example . 7 2.1.2 Running Pretzel . 9 2.1.3 Using Pretzel Output . 9 2.2 Carrying On . 10 2.2.1 The Two Input Files . 10 2.2.2 Formatted Tokens . 10 2.2.3 Regular Expressions . 10 2.2.4 Formatted Grammar . 11 2.2.5 Prettyprinting with Format Instructions . 12 2.2.6 Formatting Instructions . 13 2.3 Writing Prettyprinting Grammars . 16 2.3.1 Modifying an existing grammar . 17 2.3.2 Writing a new Grammar from Scratch . 17 2.3.3 Context Free versus Context Sensitive . 18 2.3.4 Available Grammars . 19 2.3.5 Debugging Prettyprinting Grammars . 19 2.3.6 Experiences . 21 3 Pretzel Hacking 23 3.1 Adding C Code to the Rules . 23 3.1.1 Example for Tokens . 23 3.1.2 Example for Grammars . 24 3.1.3 Summary . 25 3.1.4 Tips and Tricks . 26 3.2 The Pretzel Interface . 26 3.2.1 The Prettyprinting Scanner . 27 3.2.2 The Prettyprinting Parser . 28 3.2.3 Example . 29 3.3 Building a Pretzel prettyprinter by Hand . 30 3.4 Obtaining a Pretzel Prettyprinting Module . 30 3.4.1 The Prettyprinting Scanner . 30 3.4.2 The Prettyprinting Parser . 31 3.5 Multiple Pretzel Modules in the same Program .
    [Show full text]
  • ESS — Emacs Speaks Statistics
    ESS — Emacs Speaks Statistics ESS version 5.14 The ESS Developers (A.J. Rossini, R.M. Heiberger, K. Hornik, M. Maechler, R.A. Sparapani, S.J. Eglen, S.P. Luque and H. Redestig) Current Documentation by The ESS Developers Copyright c 2002–2010 The ESS Developers Copyright c 1996–2001 A.J. Rossini Original Documentation by David M. Smith Copyright c 1992–1995 David M. Smith Permission is granted to make and distribute verbatim copies of this manual provided the copyright notice and this permission notice are preserved on all copies. Permission is granted to copy and distribute modified versions of this manual under the conditions for verbatim copying, provided that the entire resulting derived work is distributed under the terms of a permission notice identical to this one. Chapter 1: Introduction to ESS 1 1 Introduction to ESS The S family (S, Splus and R) and SAS statistical analysis packages provide sophisticated statistical and graphical routines for manipulating data. Emacs Speaks Statistics (ESS) is based on the merger of two pre-cursors, S-mode and SAS-mode, which provided support for the S family and SAS respectively. Later on, Stata-mode was also incorporated. ESS provides a common, generic, and useful interface, through emacs, to many statistical packages. It currently supports the S family, SAS, BUGS/JAGS, Stata and XLisp-Stat with the level of support roughly in that order. A bit of notation before we begin. emacs refers to both GNU Emacs by the Free Software Foundation, as well as XEmacs by the XEmacs Project. The emacs major mode ESS[language], where language can take values such as S, SAS, or XLS.
    [Show full text]
  • Introduction
    1 Introduction A compiler translates source code to assembler or object code for a target machine. A retargetable compiler has multiple targets. Machine-specific compiler parts are isolated in modules that are easily replaced to target different machines. This book describes lcc, a retargetable compiler for ANSI C; it fo- cuses on the implementation. Most compiler texts survey compiling al- gorithms, which leaves room for only a toy compiler. This book leaves the survey to others. It tours most of a practical compiler for full ANSI C, including code generators for three target machines. It gives only enough compiling theory to explain the methods that it uses. 1.1 Literate Programs This book not only describes the implementation of lcc,itis the imple- mentation. The noweb system for “literate programming” generates both the book and the code for lcc from a single source. This source con- sists of interleaved prose and labelled code fragments. The fragments are written in the order that best suits describing the program, namely the order you see in this book, not the order dictated by the C program- ming language. The program noweave accepts the source and produces the book’s typescript, which includes most of the code and all of the text. The program notangle extracts all of the code, in the proper order for compilation. Fragments contain source code and references to other fragments. Fragment definitions are preceded by their labels in angle brackets. For example, the code ≡ a fragment label 1 ĭ2 sum=0; for (i = 0; i < 10; i++) increment sum 1 increment sum 1≡ 1 sum += x[i]; sums the elements of x.
    [Show full text]
  • Literate Programming in Java Section 1 Is Always a Starred Section
    304 TUGboat, Volume 23 (2002), No. 3/4 called limbo; in this case, the introduction.) Most Software & Tools sections consist of a short text part followed by a short code part. Some sections (such as this one) contain only text, some others contain only code. Rambutan: Literate programming in Java Section 1 is always a starred section. That just means it has a title: ‘Computing primes’ in this Prasenjit Saha case. The title is supposed to describe a large group Introduction. Rambutan is a literate program- of consecutive sections, and gets printed at the start and on the page headline. Long programs have ming system for Java with TEX, closely resembling CWEB and the original WEB system.* I developed it many starred sections, which behave like chapter using Norman Ramsey’s Spidery WEB. headings. This article is also the manual, as well as an The source for this section begins example of a Rambutan literate program; that @* Computing primes. This is... is to say, the file Manual.w consists of code and In the source, @* begins a starred section, and any documentation written together in the Rambutan text up to the first period makes up the title. idiom. From this common source, the Rambutan system does two things: 2. This is an ordinary (i.e., unstarred) section, javatangle Manual and its source begins extracts a compilable Java applet to compute the @ This is an ordinary... first N primes, and In the source, @ followed by space or tab or newline javaweave Manual begins an ordinary section. In the next section things get more interesting.
    [Show full text]
  • Literate Programming and Reproducible Research
    JSS Journal of Statistical Software January 2012, Volume 46, Issue 3. http://www.jstatsoft.org/ A Multi-Language Computing Environment for Literate Programming and Reproducible Research Eric Schulte Dan Davison University of New Mexico Counsyl Thomas Dye Carsten Dominik University of Hawai`i University of Amsterdam Abstract We present a new computing environment for authoring mixed natural and com- puter language documents. In this environment a single hierarchically-organized plain text source file may contain a variety of elements such as code in arbitrary program- ming languages, raw data, links to external resources, project management data, working notes, and text for publication. Code fragments may be executed in situ with graphical, numerical and textual output captured or linked in the file. Export to LATEX, HTML, LATEX beamer, DocBook and other formats permits working reports, presentations and manuscripts for publication to be generated from the file. In addition, functioning pure code files can be automatically extracted from the file. This environment is implemented as an extension to the Emacs text editor and provides a rich set of features for authoring both prose and code, as well as sophisticated project management capabilities. Keywords: literate programming, reproducible research, compendium, WEB, Emacs. 1. Introduction There are a variety of settings in which it is desirable to mix prose, code, data, and compu- tational results in a single document. Scientific research increasingly involves the use of computational tools.
    [Show full text]
  • Name Synopsis Description Format of Noweb Files
    NOWEB(1) NOWEB(1) NAME notangle, noweave,nountangle − noweb, a literate-programming tool SYNOPSIS notangle [−Rrootname ...] [−filter command] [−L[format]] [file] ... nountangle [−ml|−m3|−c|−c++|−awk|−tex|−f77|−f90|−lisp|−matlab][−Rrootname ...] [−filter command] [−wwidth] [file] ... noweave [options] [file] ... DESCRIPTION Noweb is a literate-programming tool likeKnuth’s WEB, only simpler.Anoweb file contains program source code interleavedwith documentation. When notangle is givenanoweb file, it writes the program on standard output. When noweave is givenanoweb file, it reads the noweb source and produces, on standard output, LaTeX, TeX, troff,or HTML source for typeset documentation. nountangle converts a literate pro- gram into an ordinary program by turning interleaveddocumentation into comments. The file name ‘−’ refers to standard input. FORMATOFNOWEB FILES A noweb file is a sequence of chunks,which may appear in anyorder.Achunk may contain code or docu- mentation. Documentation chunks begin with a line that starts with an at sign (@) followed by a space or newline. Theyhav e no names. Code chunks begin with <<chunk name>>= on a line by itself. The double left angle bracket (<<) must be in the first column. Chunks are terminated by the beginning of another chunk, or by end of file. If the first line in the file does not mark the beginning of a chunk, it is assumed to be the first line of a documentation chunk. Documentation chunks contain text that is ignored by notangle and copied verbatim to standard output by noweave (except for quoted code). noweave can work with LaTeX,plain TeX, troff or HTML.With plain TeX,itinserts a reference to a TeX macro package, nwmac,which defines commands like \chapter and \section.
    [Show full text]
  • Reproducible Research Tools for R
    Reproducible Research Tools for R January 2017 Boriana Pratt Princeton University 1 Literate programming Literate programming (1984) “ I believe that the time is right for significantly better documentation of programs, and that we can best achieve this by considering programs to be works of literature. Hence, my title: ‘Literate Programming’. “ Donald E. Knuth. Literate Programming. The Computer Journal, 27(2):97-111, May1984. http://comjnl.oxfordjournals.org/content/27/2/97.full.pdf+html WEB “… is a combination of two other languages: (1) a document formatting language and (2) a programming language.” 2 terms • Tangle extract the code parts (code chunks), then run them sequentially • Weave extract the text part (documentation chunks) and weave back in the code and code output 3 Noweb Norman Ramsey (1989) – Noweb – simple literate programming tool… https://www.cs.tufts.edu/~nr/noweb/ “Literate programming is the art of preparing programs for human readers. “ Noweb syntax includes two parts: Code chunk: <<chink name>>= - section that starts with <<name>>= Documentation chunk: @ - line that starts with @ followed by a space; default for the first chunk 4 Tools for R 5 Tools for R • Sweave https://stat.ethz.ch/R-manual/R-devel/library/utils/doc/Sweave.pdf What is Sweave? A tool that allows to embed the R code in LaTeX documents. The purpose is to create dynamic reports, which can be updated automatically if data or analysis change. How do I cite Sweave? To cite Sweave please use the paper describing the first version: Friedrich Leisch. Sweave: Dynamic generation of statistical reports using literate data analysis.
    [Show full text]
  • Hyperpro an Integrated Documentation Environment For
    HyperPro An integrated documentation environment for CLP∗ AbdelAli Ed-Dbali LIFO - Universit d’Orlans BP 6759 - 45067 Orlans Cedex - France [email protected], http://www.univ-orleans.fr/SCIENCES/LIFI/Membres/eddbali Pierre Deransart INRIA Rocquencourt - BP 105 - 78153 Le Chesnay - France [email protected], http://contraintes.inria.fr/deransar~ Mariza A. S. Bigonha, Jos´ede Siqueira, Roberto da S. Bigonha DCC - UFMG - Brsil {mariza,jose,bigonha}@dcc.ufmg.br Abstract The purpose of this paper is to present some functionalities of the HyperPro System. HyperPro is a hypertext tool which allows to de- velop Constraint Logic Programming (CLP) together with their doc- umentation. The text editing part is not new and is based on the free software Thot. A HyperPro program is a Thot document written in a report style. The tool is designed for CLP but it can be adapted arXiv:cs/0111046v1 [cs.PL] 19 Nov 2001 to other programming paradigms as well. Thot offers navigation and editing facilities and synchronized static document views. HyperPro has new functionalities such as document exportations, dynamic views (projections), indexes and version management. Projection is a mech- anism for extracting and exporting relevant pieces of code program or of document according to specific criteria. Indexes are useful to find the references and occurrences of a relation in a document, i.e., where ∗In A. Kusalik (ed), Proceedings of the Eleventh Workshop on Logic Programming En- vironments (WLPE’01), December 1, 2001, Paphos, Cyprus. COmputer Research Reposi- tory (http://www.acm.org/corr/), ** your cs.PL/0111046 or cs.SE/0111046**; whole pro- ceedings: cs.PL/0111042.
    [Show full text]
  • Experience with Literate Programming in the Modelling and Validation of Systems
    Experience with Literate Programming in the Modelling and Validation of Systems Theo C. Ruys and Ed Brinksma Faculty of Computer Science, University of Twente. P.O. Box 217, 7500 AE Euschede, The Netherlands. {ruys, brinksma}@cs, utwente, nl Abstract. This paper discusses our experience with literate program- ming tools in the realm of the modelling and validation of systems. We propose the use of literate programming techniques to structure and con- trol the validation trajectory. The use of literate programming is illus- trated by means of a running example using Promela and Spin. The paper can also be read as a tutorial on the application of literate programming to formal methods. 1 Introduction In the past years, we have been involved in several industrial projects concerning the modelling and validation of (communication) protocols [3, 10, 18]. In these projects we used modelling languages and tools - like Pmmela, Spin [5, 7] and UPPAAL [13] - to specify and verify the protocols and their properties. During each of these projects we encountered the same practical problems of keeping track of various sorts of information and data, for example: - many documents, which describe parts of the system; - many versions of the same document; - consecutive versions of validation models; - results of validation runs. The problems of managing such information and data relate to the maintenance problems found in software engineering [15]. This is not surprising as, in a sense, validation using a model checker involves the analysis of many successive versions of the model of a system. This paper discusses our experience with literate programming techniques to help us tackle these information management problems.
    [Show full text]
  • Working with Norman Ramsey: a Guide for Research Students
    Working with Norman Ramsey A Guide for Research Students Norman Ramsey Tufts University Summer 2011 Goals We want to write the best software and publish the best papers in the best conferences and journals. We want our stuff to be the stuff people will read five or ten years from now to learn how to build their own systems. And whether we do compiler construction, language design, productivity tools, run-time systems, type theory, or some other project, we want always to be focused on making the programmer’s life better. Introduction Agreements between junior researchers (undergraduate students, graduate stu- dents, and postdoctoral associates) and their faculty supervisor (me) are often implicit. But an implicit agreement is little better than no agreement. This doc- ument makes things explicit. Much may be common knowledge, but writing down common expectations helps prevent misunderstandings. If you have been invited to join my research group or are already a member, please read the whole thing carefully. If you have questions or concerns, please talk about them; the document, like my research style, is a work in progress. I review this document periodically to make sure that my expectations and obligations are clear and that nothing important is forgotten. In the spirit of full disclosure, I have tried to identify what’s distinctive about working with me—both the good and the bad: • Working with students and contributing to their professional development is the most rewarding part of my job. My ideal is to help students develop into colleagues. • I am ambitious and have high standards, and I expect the same from you.
    [Show full text]
  • Ramsey Guide to Working with Graduate Students
    Working with Norman Ramsey A Guide for Research Students Norman Ramsey Tufts University Winter 2014 Goals We want to write the best software and publish the best papers in the best conferences and journals. We want our stuff to be the stuff people will read five or ten years from now to learn how to build their own systems. And whether we do compiler construction, language design, productivity tools, run-time systems, type theory, or some other project, we want always to focus on making the programmer’s life better. Introduction Agreements between junior researchers (undergraduate students, graduate stu- dents, and postdoctoral associates) and their faculty supervisor (me) are often implicit. But an implicit agreement is little better than no agreement. This doc- ument makes things explicit. Much may be common knowledge, but writing down common expectations helps prevent misunderstandings. If you have been invited to join my research group or are already a member, please read it all carefully. If you have questions or concerns, please talk about them; the document, like my research style, is a work in progress. I review everything periodically to make sure that my expectations and obligations are clear and that nothing important is forgotten. In the spirit of full disclosure, I have tried to identify what’s distinctive about working with me—both the good and the bad: • Working with students and contributing to their professional development is the most rewarding part of my job. My ideal is to help students develop into colleagues. (I also enjoy working with other faculty.) • I am ambitious and have high standards, and I expect the same from you.
    [Show full text]
  • 190 Tugboat, Volume 34 (2013), No. 2 Lippgen: a Presentation
    190 TUGboat, Volume 34 (2013), No. 2 LiPPGen: A presentation generator for sentation does not depend on syntactical constraints. literate-programming-based teaching For a developer this means that the original creative process can be recorded in the literate program, al- Hans-Georg Eßer lowing both top-down and bottom-up approaches. Abstract Instructors can base the presentation on didactic con- siderations, for example, they can first give function Literate programming techniques can be used as a definitions and structure declarations in an incom- teaching method | not only in book form, but also plete form (when the audience does not have the for lectures in the classroom. I present a tool which required knowledge to understand the full versions) helps instructors transform their literate programs and later extend them when the missing information into lecture presentations: LiPPGen, the Literate- has been taught. Programming-based Presentation Generator, takes a Since a book is not helpful in a classroom setting, standard literate program (with LAT X as the docu- E the question arises of how an instructor might create mentation language) as input and lets the instructor lecture slides from a literate program. While it is comfortably generate presentation slides for each possible to copy and paste fragments from the literate code chunk. It then assembles the provided slide program into a presentation and manually add text, texts and the code chunks and turns them into a such a procedure is tedious and will not always lead browser-based presentation. to good results. Also, whenever the author modifies LiPPGen offers unique features in comparison to the original literate program, he or she must also standard presentation programs (such as PowerPoint) manually update the slides.
    [Show full text]