Discriminative Learning for Probabilistic Sequence Analysis

Total Page:16

File Type:pdf, Size:1020Kb

Discriminative Learning for Probabilistic Sequence Analysis Freie Universität Berlin Fachbereich Mathematik und Informatik Dissertation zur Erlangung des akademischen Grades des Doktors der Naturwissenschaften (Dr. rer. nat.) Discriminative learning for probabilistic sequence analysis Jonas Maaskola June 2014 Erstgutachter: Prof. Dr. Martin Vingron Transcriptional Regulation Group Computational Molecular Biology Department Max Planck Institute for Molecular Genetics Honorary Professor of Bioinformatics Department of Mathematics and Computer Science Freie Universität Berlin Zweitgutachter: Prof. Dr. Nikolaus Rajewsky Laboratory for Systems Biology of Gene Regulatory Elements Berlin Institute for Medical Systems Biology Max-Delbrück-Centrum für Molekulare Medizin, Berlin-Buch Professor of Systems Biology Charité Medical School Humboldt Universität Berlin Disputation: 16. April 2015 Contents Abstract 1 Summary 3 Methodology ..................................... 3 Results and contributions .............................. 6 I Biological Background 15 1 Systems biology of gene regulation 17 1.1 State of life science research .......................... 17 1.2 Gene regulatory processes ........................... 18 1.3 Sequencing technologies ............................ 22 2 Published computational biology research 25 2.1 List of publications ............................... 25 2.2 Computational analysis of deep-sequencing data .............. 26 2.3 Population genomics of drosophilid miRNAs ................. 29 2.4 Small RNAs in C. elegans embryogenesis ................... 34 2.5 Small RNAs in S. purpuratus early development ............... 42 2.6 ChIP-Sequencing a cell-cycle regulator and a helicase ............ 45 2.7 Data curation for post-transcriptional research ................ 51 2.8 Splicing regulation ............................... 51 II Probabilistic Sequence Analysis 53 3 Motif and binding site models 57 3.1 Binding motif models .............................. 57 3.2 Binding site models ............................... 64 4 Hidden Markov models 67 4.1 Formal definition ................................ 67 4.2 Fundamental problems and basic inference algorithms ........... 68 4.3 Viterbi algorithm ................................ 69 4.4 Forward-backward algorithm ......................... 71 i Contents 4.5 Scaled forward-backward algorithm ...................... 73 4.6 Likelihood gradient ............................... 75 5 Binding site hidden Markov model 81 5.1 An HMM for binding sites ........................... 81 5.2 Posterior motif occurrence probability .................... 82 6 Learning algorithms 85 6.1 Baum-Welch algorithm ............................. 85 6.2 Gradient learning ................................ 88 6.3 Complexity ................................... 89 III Discriminative Learning for Probabilistic Sequence Analysis 91 7 Statistics for discriminative learning 95 7.1 Contrasts .................................... 95 7.2 Contingency tables of features across contrasts ................ 96 8 Table-based discriminative objective functions 99 8.1 Difference of occurrence frequency ...................... 99 8.2 Matthews correlation coefficient ........................ 100 8.3 Fisher’s exact test ................................ 101 8.4 Normalized enrichment score ......................... 102 8.5 Pearson’s χ2 test for independence ...................... 102 8.6 Mutual information of condition and occurrence ............... 102 9 Probabilistic discriminative objective functions 105 9.1 Difference of log likelihood ........................... 105 9.2 Multiple model classification ......................... 106 9.3 Probability of correct classification ...................... 106 10 Significance of association 109 10.1 Mutual information, likelihood ratio, and χ2 test ............... 109 10.2 Multiple testing correction for motif discovery problems ........... 110 10.3 Significance of association significance filtering ............... 111 11 Discrete optimization of discriminative objectives 113 11.1 Enumerating residually most discriminative words ............. 113 11.2 Identifying discriminative words with degeneracy .............. 114 12 Hybrid learning 119 12.1 Signal and context parameters ......................... 119 12.2 Learning scheme ................................ 119 12.3 Multi-objective learning ............................ 120 ii Contents 13 Discovering multiple motifs 123 13.1 Measures of conditional association ...................... 123 13.2 Discovering multiple motifs .......................... 124 14 Related work 129 14.1 Overview .................................... 129 14.2 DREME: Discriminative Regular Expression Motif Elicitation ........ 129 14.3 YMF: Yeast Motif Finder ............................ 130 14.4 CMF: Contrast Motif Finder ........................... 131 14.5 DME: Discriminating Matrix Enumerator ................... 132 14.6 DIPS: Discriminative PWM Search ....................... 134 14.7 DECOD: Deconvolved Discriminative Motif Discovery ............ 136 14.8 MoAn: Motif Annealer ............................. 137 14.9 DEME: Discriminatively Enhanced Motif Elicitation ............. 139 14.10 Dispom ..................................... 140 14.11 FIRE: Finding Informative Regulatory Elements ............... 141 14.12 Further motif discovery tools .......................... 142 IV Empirical Study of Motif Discovery Methodology 143 15 Synthetic test data 147 15.1 Generation of synthetic data .......................... 147 15.2 Recognizability ................................. 148 16 Performance metrics 151 16.1 Supervised performance metrics ........................ 151 16.2 Summarization ................................. 153 17 Results on synthetic data 155 17.1 Evaluated motif discovery tools ........................ 155 17.2 Summary performance ............................. 156 17.3 Comparing signal-only and discriminative learning ............. 159 17.4 Discriminative filtering ............................. 162 18 PUF RNA-binding protein family 163 18.1 Materials .................................... 164 18.2 Methods ..................................... 165 18.3 Discriminative motifs in PUF RBP data .................... 166 18.4 Comparing MICO, MMIE, and ML learning .................. 168 18.5 Dilution and word-based analyses ....................... 168 19 Alternative splicing regulator RBM10 173 19.1 Materials .................................... 174 19.2 Motif discovery for RBM10 reveals splicing-relevant motifs ......... 174 19.3 Previously reported RBM10 motifs not corroborated ............. 176 iii Contents 20 Mouse embryonic stem cell transcription factors 179 20.1 Materials and methods ............................. 179 20.2 Discriminative motifs in ChIP-Seq data .................... 180 20.3 Spatial distribution of motif occurrences ................... 185 20.4 Contrasting Nanog and Tcf3 against other ChIP-Seq data .......... 186 20.5 Comparing results of DREME, FIRE, and Discrover .............. 186 V Discussion 191 21 Supervised motif discovery performance experiments 193 21.1 Influence of parameters varied in synthetic data ............... 193 21.2 Generative, signal-only learning ........................ 195 21.3 Discriminative learning ............................ 196 21.4 Robustness of hybrid learning ......................... 198 21.5 Comparison to published motif discovery methods .............. 199 22 Sequence motifs in biological data 201 22.1 RIP-Chip and PAR-CLIP data of PUF family RBPs ............... 201 22.2 Alternative splicing regulator RBM10 ..................... 202 22.3 ChIP-Seq data of mouse ESC TFs ........................ 204 23 Outlook 207 23.1 Global optimization .............................. 207 23.2 Rank-based learning - Average rank information ............... 208 23.3 Faster learning ................................. 208 23.4 Additional information sources ........................ 209 23.5 Other applications ............................... 210 24 Conclusions 211 Appendices 213 A Proof of correctness of the scaling procedure 215 A.1 Correctness for the forward matrix recursion ................. 215 A.2 Correctness for the backward matrix recursion ................ 216 B Runtime of HMM inference algorithms 217 C Information theory 219 C.1 Communication systems ............................ 219 C.2 Fundamental quantities of information theory ................ 219 D Likelihood ratio, mutual information and χ2 statistic 223 D.1 The likelihood ratio statistic for goodness of fit ................ 223 D.2 G-test ...................................... 225 iv Contents D.3 Likelihood ratio and mutual information ................... 225 E Limits of Matthews correlation coefficient 227 F Gradient calculus 229 G Running parameters 231 H Synthetic data experiments 235 I PUF RBP family data 245 J Alternative splicing regulator RBM10 253 K Mouse ESC ChIP-Seq data 261 L Research anecdotes 265 L.1 Motif discovery anecdotes ........................... 265 L.2 A fishy smell in ChIP-Seq data ......................... 267 Bibliography 269 Statutory Declaration 299 Zusammenfassung 301 This is revision 66a1c19 of the manuscript, generated July 15, 2015. v List of Figures 1.1 Biogenesis of an mRNA ............................. 21 1.2 Structure of 4-thiouridine and 6-thioguanosine ............... 23 1.3 ChIP-Seq and PAR-CLIP methodologies .................... 24 2.1 Small RNA expression in early development of C. elegans and S. purpuratus 37 2.2 E2F3 and HELLS regulate many common targets, most notably MLL1 .... 46 3.1
Recommended publications
  • The Evolution of Lisp
    1 The Evolution of Lisp Guy L. Steele Jr. Richard P. Gabriel Thinking Machines Corporation Lucid, Inc. 245 First Street 707 Laurel Street Cambridge, Massachusetts 02142 Menlo Park, California 94025 Phone: (617) 234-2860 Phone: (415) 329-8400 FAX: (617) 243-4444 FAX: (415) 329-8480 E-mail: [email protected] E-mail: [email protected] Abstract Lisp is the world’s greatest programming language—or so its proponents think. The structure of Lisp makes it easy to extend the language or even to implement entirely new dialects without starting from scratch. Overall, the evolution of Lisp has been guided more by institutional rivalry, one-upsmanship, and the glee born of technical cleverness that is characteristic of the “hacker culture” than by sober assessments of technical requirements. Nevertheless this process has eventually produced both an industrial- strength programming language, messy but powerful, and a technically pure dialect, small but powerful, that is suitable for use by programming-language theoreticians. We pick up where McCarthy’s paper in the first HOPL conference left off. We trace the development chronologically from the era of the PDP-6, through the heyday of Interlisp and MacLisp, past the ascension and decline of special purpose Lisp machines, to the present era of standardization activities. We then examine the technical evolution of a few representative language features, including both some notable successes and some notable failures, that illuminate design issues that distinguish Lisp from other programming languages. We also discuss the use of Lisp as a laboratory for designing other programming languages. We conclude with some reflections on the forces that have driven the evolution of Lisp.
    [Show full text]
  • Robert Alan Saunders
    SMITHSONIAN INSTITUTION LEMELSON CENTER FOR THE STUDY OF INVENTION AND INNOVATION Robert Alan Saunders Transcript of an interview conducted by Christopher Weaver at National Museum of American History Washington, D.C., USA on 29 November 2018 with subsequent additions and corrections For additional information, contact the Archives Center at 202-633-3270 or [email protected] All uses of this manuscript are covered by an agreement between the Smithsonian Institution and Robert Alan Saunders dated November 29, 2018. For additional information about rights and reproductions, please contact: Archives Center National Museum of American History Smithsonian Institution MRC 601 P.O. Box 37012 Washington, D.C. 20013-7012 Phone: 202-633-3270 TDD: 202-357-1729 Email: [email protected] Web: http://americanhistory.si.edu/archives/rights-and-reproductions Preferred citation: Robert Alan Saunders, “Interview with Robert Alan Saunders,” conducted by Christopher Weaver, November 29, 2018, Video Game Pioneers Oral History Collection, Archives Center, National Museum of American History, Smithsonian Institution, Washington, DC. Acknowledgement: The Smithsonian’s Lemelson Center for the Study of Invention and Innovation gratefully acknowledges financial support from the Entertainment Software Association and Coastal Bridge Advisors for this oral history project. For additional information, contact the Archives Center at 202-633-3270 or [email protected] Abstract Robert Saunders begins discussing his early family life, education, and early exposure to electrical engineering. He next recounts his time at MIT, recalling members of the Tech Model Railroad Club and his work with the TX-0 and PDP-1 computers. Saunders discusses the contributions of Spacewar! team members to the project and his development of the original PDP-1 game controllers.
    [Show full text]
  • Introduction to the Literature on Programming Language Design Gary T
    Computer Science Technical Reports Computer Science 7-1999 Introduction to the Literature On Programming Language Design Gary T. Leavens Iowa State University Follow this and additional works at: http://lib.dr.iastate.edu/cs_techreports Part of the Programming Languages and Compilers Commons Recommended Citation Leavens, Gary T., "Introduction to the Literature On Programming Language Design" (1999). Computer Science Technical Reports. 59. http://lib.dr.iastate.edu/cs_techreports/59 This Article is brought to you for free and open access by the Computer Science at Iowa State University Digital Repository. It has been accepted for inclusion in Computer Science Technical Reports by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. Introduction to the Literature On Programming Language Design Abstract This is an introduction to the literature on programming language design and related topics. It is intended to cite the most important work, and to provide a place for students to start a literature search. Keywords programming languages, semantics, type systems, polymorphism, type theory, data abstraction, functional programming, object-oriented programming, logic programming, declarative programming, parallel and distributed programming languages Disciplines Programming Languages and Compilers This article is available at Iowa State University Digital Repository: http://lib.dr.iastate.edu/cs_techreports/59 Intro duction to the Literature On Programming Language Design Gary T. Leavens TR 93-01c Jan. 1993, revised Jan. 1994, Feb. 1996, and July 1999 Keywords: programming languages, semantics, typ e systems, p olymorphism, typ e theory, data abstrac- tion, functional programming, ob ject-oriented programming, logic programming, declarative programming, parallel and distributed programming languages.
    [Show full text]
  • The University of Chicago Transaction and Message
    THE UNIVERSITY OF CHICAGO TRANSACTION AND MESSAGE: FROM DATABASE TO MARKETPLACE, 1970-2000 A DISSERTATION SUBMITTED TO THE FACULTY OF THE DIVISION OF THE SOCIAL SCIENCES IN CANDIDACY FOR THE DEGREE OF DOCTOR OF PHILOSOPHY DEPARTMENT OF SOCIOLOGY BY MICHAEL C. CASTELLE CHICAGO, ILLINOIS DECEMBER 2017 TABLE OF CONTENTS List of Figures iii List of Tables v List of Abbreviations vi Acknowledgements ix Abstract xi Chapter 1. Theoretical Foundations for Social Studies of Computing 1 Chapter 2. Computing in Organizations: Electronic Data Processing 32 and the Relational Model Chapter 3. The Transaction Abstraction: From the Paperwork Crisis 69 to Black Monday Chapter 4. Brokers, Queues, and Flows: Techniques of 127 Financialization and Consolidation Chapter 5. Where Do Electronic Markets Come From? 186 Chapter 6. The Platform as Exchange 219 References 240 ii LIST OF FIGURES Figure 1. Peirce’s sign-systems. 22 Figure 2. Date and Codd’s diagrammatic comparison of the logical 51 views of a relational database and of a network database Figure 3. Date’s depiction of the Codasyl DBTG network model. 53 Figure 4. From “Generalization: Key to Successful Electronic Data 57 Processing”. Figure 5. A B-tree index for a relation using an integer primary key, 58 as used in the System R relational database. Figure 6. Diagram depicting the dynamic re-balancing of a B-tree upon 59 inserting the value ’9’ into a full leaf. Figure 7. Charles T. Davies’ early transaction abstraction. 66 Figure 8. Jim Gray’s transactions. 66 Figure 9. New York Stock Exchange trading volume, 1970-2005. 68 Figure 10.
    [Show full text]
  • Commencement1985.Pdf (6.060Mb)
    lose of the 109th Academic Year The Johns Hopkins University May 31 , 1985 Digitized by the Internet Archive in 2012 with funding from LYRASIS Members and Sloan Foundation http://archive.org/details/commencement1985 ORDER OF PROCESSION MARSHALS Karl Alexander Jean Eichelberger Ivey Edward J. Bouwer Allyn Kimball Paul R. Daniels Shin Lin Marc Donohue H. Warren Moos Frederick Holborn Selina Sue Prosen Roger A. Horn Fred S. Schock Henry M. Seidel THE GRADUATES MARSHALS Brown Murr Frederic Davidson THE FACULTIES MARSHALS Robert E. Green John Russell-Wood THE DEANS MEMBERS OF THE SOCIETY OF SCHOLARS OFFICERS OF THE UNIVERSITY THE TRUSTEES CHIEF MARSHAL Owen M. Phillips THE CHAPLAINS THE HONORARY DEGREE CANDIDATES THE PROVOST OF THE UNIVERSITY THE CHAIRMAN OF THE BOARD OF TRUSTEES THE PRESIDENT OF THE UNIVERSITY ORDER OF EVENTS STEVEN MULLER President of the University, presiding * * * PRELUDE Gaillard Johann Hermann Schein (1586-1630) A Toye New Sa-Hoo Giles Farnaby (1565-1640) A Classical March Ludwig van Beethoven (1770-1827) PROCESSIONAL The audience is requested to stand as the Academic Procession moves into the area and to remain standing after the Invocation La Mourisque Pavane TlELMAN SUSATO ( ? -1561) Festive marches from Belshazarr, Ezio, Rinaldo and Scipione George Frideric Handel (1685-1759) THE PRESIDENT'S PROCESSION Fanfare Walter Piston (1894-1976) Marches from Judas Maccabaeus and Floridante George Frideric Handel (1685-1759) INVOCATION CLYDE R. SHALLENBERGER Director The Chaplaincy Service The Johns Hopkins Hospital * THE NATIONAL ANTHEM * GREETINGS GEORGE G. RADCLIFFE Chairman of the Board of Trustees * PRESENTATION OF NEW MEMBERS OF THE SOCIETY OF SCHOLARS Jo Eirik Asvall ThomasP.
    [Show full text]
  • The Evolution of Smalltalk from Smalltalk-72 Through Squeak
    The Evolution of Smalltalk From Smalltalk-72 through Squeak DANIEL INGALLS, Independent Consultant, USA Shepherd: Andrew Black, Portland State University, USA This paper presents a personal view of the evolution of six generations of Smalltalk in which the author played a part, starting with Smalltalk-72 and progressing through Smalltalk-80 to Squeak and Etoys. It describes the forces that brought each generation into existence, the technical innovations that characterized it, and the growth in understanding of object-orientation and personal computing that emerged. It summarizes what that generation achieved and how it affected the future, both within the evolving group of developers and users, and in the outside world. The early Smalltalks were not widely accessible because they ran only on proprietary Xerox hardware; because of this, few people have experience with these important historical artifacts. To make them accessible, the paper provides links to live simulations that can be run in present-day web browsers. These simulations offer the ability to run predefined scripts, but also allow the user to go off-script, browse thedetailsof implementation, and try anything that could be done in the original system. An appendix includes anecdotal and technical aspects of how examples of each generation of Smalltalk were recovered, and how order was teased out of chaos to the point that these old systems could be brought back to life. CCS Concepts: · Software and its engineering → General programming languages; · Social and pro- fessional topics → History of programming languages. Additional Key Words and Phrases: Smalltalk, Squeak, Alto, NoteTaker, OOZE, BitBlt, Morphic, EToys, Objects, Blocks, Bytecode, Interpreter, Virtual Machine, Bootstrap 85 ACM Reference Format: Daniel Ingalls.
    [Show full text]
  • Wizards of The
    In our connected data societies, the importance of algorithms and automated systems is obvious. They determine search engines’ rankings, what driverless cars do when a child appears on the road, and stock market changes. Today data-driven algorithms and automated systems are everywhere. While algorithms and automated systems themselves are often a topic of controversy WIZARDS OF THE WEB JAKOB SVENSSON WIZARDS OF THE WEB JAKOB and debate, this book is about the people behind them; it is an account of the cultures, values, and imaginations that guide programmers in their work designing and engineering software and digital technology. Technology, it is argued, is not neutral and developed free of context. And since algorithms and automated WIZARDS systems exercise power in connected data societies, it is pivotal to understand their creators, who could be labelled, it is argued in the book, Wizards of the Web. This book is the result of an ethnographically inspired study based on interviews with software engineers and programmers, observations made at tech head quarters and conferences in Denmark, Sweden, Brazil, Germany, India, and the US, and a OF case study of the introduction of algorithmic automation on the front page of a Scandinavian newspaper. The author, Jakob Svensson, is professor of Media and Communication Studies at Malmö University. The book is part of the research project Behind the Algorithm (funded by the Swedish Research Council, 2018–2020). THE WEB AN OUTSIDER'S JOURNEY INTO TECH CULTURE, PROGRAMMING, AND MATHEMAGICS NordicomNordicom is a centre is a forcentre Nordic for media Nordic research media at the University research of Gothenburg, at the University of Gothenburg,supported supported by the by Nordic the Council Nordic of Ministers.
    [Show full text]
  • The Lenfest Ocean Program
    Agenda Item B.1 Open Comment Period November 2012 The Lenfest Ocean Program Invites you to a presentation of its 2012 report Little Fish Big Impact: Sustaining forage fish and the ecosystems that depend on them November 3, 2012 at 7pm On the site of the Pacific Fishery Management Council Meeting Balboa Bay 2 room, Hilton Orange County 3050 Bristol Street, Costa Mesa, CA 92626 Free and open to the public. Light refreshments will be served. Presentation by Christine Santora, project director, Lenfest Forage Fish Task Force. For further information on the task force, see www.LenfestOcean.org/foragefish. For questions specific to this event, please contact Ben Shouse, [email protected]. Agenda Item B.1 Supplemental Open Comment 2 November 2012 October 23, 2012 Mr. Dan Wolford, Chair Pacific Fisheries Management Council 7700 NE Ambassador Place, Suite 101 Portland, Oregon 97220-1384 Rod McInnis, Regional Administrator National Marine Fisheries Service Regional Administrator 501 West Ocean Boulevard, Suite 4200 Long Beach, CA 90802-4213 RE: Open Public Comment/ Highly Migratory Species, Proposed Drift Gillnet Expansion Dear Chairman Dan Wolford and Regional Administrator McInnis: We the undersigned 36,483 individuals (including 7, 116 California residents; 990 Oregon residents; 1,403 Washington residents; and 119 Idaho residents) urge the National Marine Fisheries Service to halt the current effort to expand the California-based drift gillnet fishery, and shift its efforts toward eliminating the fishery altogether. This fishery entangles and kills over a hundred marine mammals per year, continues to take endangered sea turtles, as well as non-target fish such as blue shark, bigeye thresher shark, striped marlin, ocean sunfish, and others.
    [Show full text]
  • Oral History of Richard Greenblatt
    Oral History of Richard Greenblatt Interviewed by: Gardner Hendrie Recorded: January 12, 2005 Boston, MA Total Running Time: 2:33:00 CHM Reference number: X3131.2005 © 2005 Computer History Museum Q: We have today with us Richard Greenblatt who has graciously agreed to do an oral history for the computer history museum. Tell us a little bit about your family background, your parents, where you were born and brought up. Richard Greenblatt: Well, I was born in Portland, Oregon, but- and my family then moved to Philadelphia, Pennsylvania, where- for a couple years but I actually grew up almost entirely in Columbia, Missouri, which is a university town halfway between St. Louis and Kansas City. It’s-- At the time it was about 30,000 people or so and the University of Missouri had 15 or so thousand university students, two other universities in town so it’s much smaller than the University of Missouri but anyway it was a rather ideal place as I look back on it now <laughs> actually to grow up and it was a college town. There were-- A lot of the teachers were wives of students or whatever and it was quite progressive in a lot of ways and I also felt that I came through at the right time. Just after I was through, they started in with new math and a bunch of other teaching things that I <laughs> didn’t agree with at all. So I got- had the majority of my high school before they switched to proofs, quote, unquote, and other nonsense that I thought was useless.
    [Show full text]
  • The Connection Machine
    The Connection Machine by William Daniel Hillis M.S., B.S., Massachusetts Institute of Technology (1981, 1978) Submitted to the Department of Electrical Engineering and Computer Science in partial fulfillment of the requirements for the degree of Doctor of Philosophy at the Massachusetts Institute of Technology June 1985 @W. Daniel Hillis, 1985 The author hereby grants to M.I.T. permission to reproduce and to distribute publicly copies of this thesis document in whole or in part. Signature of Author. Department of Electrical Engineering and Computer Science May 3, 1985 Certified byG Prc tsor Gerald Sussmnan, Thesis Supervisor Acceptd by 'hairman, Ifepartment Committee MASSACHUSETTS INSTiTUTE OF TECHNOL.OGY MAR 221988 I UDIWaS The Connection Machine by W. Daniel Hillis Submitted to the Department of Electrical Engineering and Computer Science on May 3, 1985 in partial fulfillment of the requirements for the degree of Doctor of Philosophy Abstract Someday, perhaps soon, we will build a machine that will be able to perform the functions of a human mind, a thinking machine. One of the many prob- lems that must be faced in designing such a machine is the need to process large amounts of information rapidly, more rapidly than is ever likely to be possible with a conventional computer. This document describes a new type of computing engine called a Connection Machine, which computes through the interaction of many, say a million, simple identical process- ing/memory cells. Because the processing takes place concurrently, the machine can be much faster than a traditional computer. Thesis Supervisor: Professor Gerald Sussman 2 Contents 1 Introduction 6 1.1 We Would Like to Make a Thinking Machine 6 1.2 Classical Computer Architecture Reflects Obsolete Assumptions 9 1.3 Concurrency Offers a Solution ..
    [Show full text]
  • Experimenting with Programming Languages
    Experimenting with Programming Languages Alessandro Warth VPRI Technical Report TR-2008-003 Viewpoints Research Institute, 1209 Grand Central Avenue, Glendale, CA 91201 t: (818) 332-3001 f: (818) 244-9761 U C Los Angeles Experimenting with Programming Languages A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Computer Science by Alessandro Warth 2009 VPRI Technical Report TR-2008-003 1 c Copyright by ! Alessandro Warth 2009 VPRI Technical Report TR-2008-003 2 The dissertation of Alessandro Warth is approved. Rebecca Allen Junghoo “John” Cho Alan Kay Jens Palsberg Todd Millstein, Committee Chair University of California, Los Angeles 2009 ii VPRI Technical Report TR-2008-003 3 >> To Carolyn, who had no idea what she was getting into when Unwavering Support– she married a Ph.D. student, but by now has completed the requirements for the degree of >>– Doctor of Philosophy in iii VPRI Technical Report TR-2008-003 4 T C 1 Introduction ................................. 1 1.1 OMeta: Experimenting with Language Design . 2 1.2 Worlds: Experimenting with Possibilities . 5 1.3 Statement of the Thesis and Organization of the Dissertation . 7 2 OMeta: An Object-Oriented Language for Pattern-Matching ...... 9 2.1 Introduction . 9 2.2 OMeta: an extended PEG . 11 2.2.1 PEGs, OMeta Style . 12 2.2.2 PEG Extensions . 15 2.2.3 A Note on Memoization . 22 2.3 O is for Object-Oriented . 22 2.3.1 Quick and Easy Language Extensions . 23 2.3.2 Extensible Pattern Matching . 23 2.3.3 Stateful Pattern Matching .
    [Show full text]
  • NBS FIPS Software Documentation
    NATL INST OF STAND & TECH AlllDt. ^7fl3fiE Computer Science National Bureau of Standards and Technology NBS Special Publication 500-94 NBS FIPS NBS Software Documentation PUBLICATIONS NATIONAL BUREAU OF STANDARDS The National Bureau of Standards' was established by an act ot Congress on March 3, 1901. The Bureau's overall goal is to strengthen and advance the Nation's science and technology and facilitate their effective application for public benefit. To this end, the Bureau conducts research and provides: (1) a basis for the Nation's physical measurement system, (2) scientific and technological services for industry and government, (3) a technical basis for equity in trade, and (4) technical services to promote public safety. The Bureau's technical work is per- formed by the National Measurement Laboratory, the National Engineering Laboratory, and the Institute for Computer Sciences and Technology. THE NATIONAL MEASUREMENT LABORATORY provides the national system of physical and chemical and materials measurement; coordinates the system with measurement systems of other nations and furnishes essential services leading to accurate and uniform physical and chemical measurement throughout the Nation's scientific community, industry, and commerce; conducts materials research leading to improved methods of measurement, standards, and data on the properties of materials needed by industry, commerce, educational institutions, and Government; provides advisory and research services to other Government agencies; develops, produces, and distributes
    [Show full text]