Sociolinguistically Driven Approaches for Just Natural Language Processing

Total Page:16

File Type:pdf, Size:1020Kb

Sociolinguistically Driven Approaches for Just Natural Language Processing University of Massachusetts Amherst ScholarWorks@UMass Amherst Doctoral Dissertations Dissertations and Theses April 2021 Sociolinguistically Driven Approaches for Just Natural Language Processing Su Lin Blodgett University of Massachusetts Amherst Follow this and additional works at: https://scholarworks.umass.edu/dissertations_2 Part of the Artificial Intelligence and Robotics Commons Recommended Citation Blodgett, Su Lin, "Sociolinguistically Driven Approaches for Just Natural Language Processing" (2021). Doctoral Dissertations. 2092. https://doi.org/10.7275/20410631 https://scholarworks.umass.edu/dissertations_2/2092 This Open Access Dissertation is brought to you for free and open access by the Dissertations and Theses at ScholarWorks@UMass Amherst. It has been accepted for inclusion in Doctoral Dissertations by an authorized administrator of ScholarWorks@UMass Amherst. For more information, please contact [email protected]. SOCIOLINGUISTICALLY DRIVEN APPROACHES FOR JUST NATURAL LANGUAGE PROCESSING A Dissertation Presented by SU LIN BLODGETT Submitted to the Graduate School of the University of Massachusetts Amherst in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY February 2021 College of Information and Computer Sciences © Copyright by Su Lin Blodgett 2021 All Rights Reserved SOCIOLINGUISTICALLY DRIVEN APPROACHES FOR JUST NATURAL LANGUAGE PROCESSING A Dissertation Presented by SU LIN BLODGETT Approved as to style and content by: Brendan O’Connor, Chair Mohit Iyyer, Member Hanna Wallach, Member Lisa Green, Member James Allan, Chair of the Faculty College of Information and Computer Sciences ACKNOWLEDGMENTS First and foremost, I would like to thank my advisor, Brendan O’Connor, who introduced me to natural language processing and computational sociolinguistics, taught me more than I can say about research, NLP, and writing, and has been endlessly supportive of my many and growing research interests; I consider myself amazingly lucky to have stumbled into his lab. I am also very grateful to my thesis committee members, Mohit Iyyer, Hanna Wallach, and Lisa Green, for their thoughtful feedback and support; in particular, I would like to thank Hanna and Lisa for their invaluable feedback during our collaborations. I would also like to thank Solon Barocas and Hal Daumé III, who along with Hanna were (and continue to be) incredibly generous and intellectually inspiring mentors and collaborators. I am also grateful to Johnny Wei for his thoughtful work during our collaboration. I have been lucky to pursue this PhD alongside my wonderful labmates Abe Handler and Katie Keith, who over the years have been the source of amazing discussions, inspiration, and support. I’ve also been fortunate to be part of a fantastic community at UMass, in- cluding fellow graduate students Lucas Chaufournier, Kaleigh Clary, Janet Guo, Myungha Jang, Neha Kennard, Kalpesh Krishna, Tiffany Liu, Dirk Ruiken, Rian Shambaugh, Emma Tosch, and Kevin Winner. I’m also grateful to friends outside of UMass who were an impor- tant source of research inspiration and support, including Michael Madaio, Ian Stewart, and Zeerak Waseem. Additionally, I’d like to thank the administrative staff at UMass, partic- ularly Leeanne Leclerc, Gwyn Mitchell, Eileen Hamel, and Malaika Ross. I’m also grateful to Sohie Lee, who encouraged me to pursue a PhD. Finally, I am forever indebted to my parents, sister, and Samer for their boundless support, encouragement, and love. iv ABSTRACT SOCIOLINGUISTICALLY DRIVEN APPROACHES FOR JUST NATURAL LANGUAGE PROCESSING FEBRUARY 2021 SU LIN BLODGETT B.A., WELLESLEY COLLEGE M.S., UNIVERSITY OF MASSACHUSETTS AMHERST Ph.D., UNIVERSITY OF MASSACHUSETTS AMHERST Directed by: Professor Brendan O’Connor Natural language processing (NLP) systems are now ubiquitous. Yet the benefits of these language technologies do not accrue evenly to all users, and indeed they can be harmful; NLP systems reproduce stereotypes, prevent speakers of non-standard language varieties from participating fully in public discourse, and re-inscribe historical patterns of linguistic stigmatization and discrimination. How harms arise in NLP systems, and who is harmed by them, can only be understood at the intersection of work on NLP, fairness and justice in machine learning, and the relationships between language and social justice. In this thesis, we propose to address two questions at this intersection: i) How can we conceptualize harms arising from NLP systems?, and ii) How can we quantify such harms? We propose the following contributions. First, we contribute a model in order to collect the first large dataset of African American Language (AAL)-like social media text. We use the dataset to quantify the performance of two types of NLP systems, identifying dispari- ties in model performance between Mainstream U.S. English (MUSE)- and AAL-like text. Turning to the landscape of bias in NLP more broadly, we then provide a critical survey v of the emerging literature on bias in NLP and identify its limitations. Drawing on work across sociology, sociolinguistics, linguistic anthropology, social psychology, and education, we provide an account of the relationships between language and injustice, propose a taxon- omy of harms arising from NLP systems grounded in those relationships, and propose a set of guiding research questions for work on bias in NLP. Finally, we adapt the measurement modeling framework from the quantitative social sciences to effectively evaluate approaches for quantifying bias in NLP systems. We conclude with a discussion of recent work on bias through the lens of style in NLP, raising a set of normative questions for future work. vi TABLE OF CONTENTS Page ACKNOWLEDGMENTS ............................................. iv ABSTRACT .......................................................... v LIST OF TABLES ................................................... xii LIST OF FIGURES.................................................. xv CHAPTER 1. INTRODUCTION ................................................. 1 2. VARIATION ON TWITTER: DEVELOPING A CORPUS OF AAL ............................................................ 4 2.1 Introduction . .4 2.1.1 What is African American Language? . .5 2.2 Identifying AAL from demographics . .6 2.2.1 AAL on Twitter . .6 2.2.2 Twitter and Census data . .7 2.2.3 Direct word-demographic analysis . .8 2.2.4 Mixed-membership demographic language model . .9 2.3 Linguistic validation . 12 2.3.1 Lexical-level validation . 13 2.3.2 Internet-specific orthography . 13 2.3.3 Phonological validation . 14 2.3.4 Syntactic validation . 14 2.4 A dataset for AAL morphosyntactic analysis . 16 2.4.1 Related work . 17 vii 2.4.2 Dataset goals . 19 2.4.3 Data collection and analysis . 22 2.5 Conclusion . 26 3. FAIRNESS IN NLP TOOLS: LANGUAGE IDENTIFICATION ............................................ 27 3.1 Introduction . 27 3.2 Related work . 28 3.3 Twitter and langid.py ............................................ 28 3.3.1 Adapting language identification for AAL. 29 3.4 Commercial systems . 31 3.4.1 Experiments . 31 3.5 Extended evaluation: World languages . 34 3.5.1 Dataset . 34 3.5.2 Experiments . 36 3.5.3 Results and discussion . 37 4. FAIRNESS IN NLP TOOLS: DEPENDENCY PARSING ......... 42 4.1 Introduction . 42 4.2 Related Work . 42 4.2.1 Parsing for Twitter . 42 4.2.2 Parsing for non-standard varieties . 43 4.3 Preliminary analysis: Stanford dependencies. 44 4.4 Extended analysis: Universal dependencies . 45 4.4.1 Dataset . 46 4.4.2 AAL annotation . 46 4.4.3 Non-AAL Twitter annotation . 49 4.4.4 Experiments . 53 4.4.5 Results and analysis . 57 4.5 Discussion and conclusion. 63 5. MEASURING BIAS: A SURVEY OF BIAS IN NLP .............. 64 5.1 Introduction . 64 5.2 A critical analysis of bias in NLP . 65 viii 5.2.1 Method for gathering papers . 66 5.2.2 Findings . 68 5.3 Conclusion . 74 6. LANGUAGE AND JUSTICE ..................................... 75 6.1 Introduction . 75 6.2 Social justice . 75 6.2.1 Social justice and technology. 77 6.3 Overview of language and justice . 80 6.3.1 Language about ........................................... 81 6.3.2 Language by .............................................. 84 6.3.3 Case study: African American Language . 87 6.3.4 Takeaways . 91 7. MEASURING BIAS: A TAXONOMY OF HARMS ............... 93 7.1 Introduction . 93 7.2 Towards a taxonomy of representational harms . 93 7.2.1 Undesirable correlations . 94 7.2.2 A taxonomy of representational harms . 96 7.2.3 Other dynamics and effects . 107 7.2.4 Other taxonomies of harms in NLP . 110 7.3 Discussion and recommendations . 114 7.3.1 Language and unjust social arrangements . 115 7.3.2 Conceptualizations of bias . 116 7.3.3 Language use in practice . 118 7.3.4 Case study . 119 8. MEASURING BIAS: EVALUATING MEASUREMENTS OF BIAS ......................................................... 123 8.1 Introduction . 123 8.2 Measurement modeling . ..
Recommended publications
  • The Girlboss Guide to Killing It Online & Making an Impact in the Digital Space
    AN INTRODUCTORY GUIDE TO MAKING YOUR MARK IN THE DIGITAL SPACE FOR GIRBOSSES, BLOGGERS AND CREATIVES. By Dani Watson A N I N T R O T O M A K I N G Y O U R M A R K O N L I N E B Y D A N I W A T S O N WITH THE AMOUNT OF COMPETITION IN THE ONLINE WORLD, IT CAN BE AN OVERWHELMING TASK TO THINK ABOUT HOW YOU ARE EVER GOING TO MAKE YOU AND YOUR BRAND STAND OUT FROM THE NOISE. WHETHER YOU'RE A BUSINESS LOOKING TO LAUNCH ONLINE, A BLOGGER WHO WANTS TO TURN THEIR BLOG INTO A FULL TIME INCOME OR A BRAND THAT IS LOOKING TO ELEVATE ITS PRESENCE IN THE DIGITAL SPACE, ITS ALL ABOUT FINDING AND CONNECTING WITH YOUR CLIQUE AND ONCE YOU DO, LOVING THEM HARD. I AM A FIRM BELIEVER THAT WHENEVER YOU ARE LOOKING TO CREATE AN INCOME ONLINE, THE COMMUNITY COMES BEFORE THE SELL. PEOPLE HAVE TO KNOW, LIKE AND TRUST YOU BEFORE THEY ARE WILLING TO INVEST. THIS IS WHY IT IS SO IMPORTANT TO BE THINKING ABOUT CREATING NOT ONLY YOUR PRODUCTS/ SERVICES/BLOG, BUT FIGURING OUT WAYS TO MAKE PEOPLE FALL IN LOVE WITH YOU AND YOUR BRAND SO THEY BUY WHATEVER IT IS YOU SELL. I AM SOMEONE WHO THOROUGHLY UNDERSTANDS THE IMPORTANCE OF FINDING YOUR TRIBE ONLINE AND BECOMING KNOWN FOR WHAT YOU DO AS THIS IS EXACTLY WHAT I HAVE BEEN ABLE TO DO WITH MY OWN BUSINESS AND BRAND. WHAT STARTED OFF AS A SMALL BLOG AND SOCIAL MEDIA CONSULTANCY HAS QUICKLY DEVELOPED INTO AN ONLINE EMPIRE THAT SHOWS NO SIGNS OF SLOWING DOWN.
    [Show full text]
  • A Method to Analyze Multiple Social Identities in Twitter Bios
    A Method to Analyze Multiple Social Identities in Twitter Bios ARJUNIL PATHAK, University at Buffalo, USA NAVID MADANI, University at Buffalo, USA KENNETH JOSEPH, University at Buffalo, USA Twitter users signal social identity in their profile descriptions, or bios, in a number of important but complex ways that are not well-captured by existing characterizations of how identity is expressed in language. Better ways of defining and measuring these expressions may therefore be useful both in understanding howsocial identity is expressed in text, and how the self is presented on Twitter. To this end, the present work makes three contributions. First, using qualitative methods, we identify and define the concept of a personal identifier, which is more representative of the ways in which identity is signaled in Twitter bios. Second, we propose a method to extract all personal identifiers expressed in a given bio. Finally, we present a series of validation analyses that explore the strengths and limitations of our proposed method. Our work opens up exciting new opportunities at the intersection between the social psychological study of social identity and the study of how we compose the self through markers of identity on Twitter and in social media more generally. CCS Concepts: • Applied computing Sociology; Additional Key Words and Phrases: Twitter, self-presentation, social identity, computational social science ACM Reference Format: Arjunil Pathak, Navid Madani, and Kenneth Joseph. 2018. A Method to Analyze Multiple Social Identities in Twitter Bios. J. ACM 37, 4, Article 111 (August 2018), 35 pages. https://doi.org/10.1145/1122445.1122456 1 INTRODUCTION A social identity is a word or phrase that refers to a particular social group (e.g.
    [Show full text]
  • Microformats the Next (Small) Thing on the Semantic Web?
    Standards Editor: Jim Whitehead • [email protected] Microformats The Next (Small) Thing on the Semantic Web? Rohit Khare • CommerceNet “Designed for humans first and machines second, microformats are a set of simple, open data formats built upon existing and widely adopted standards.” — Microformats.org hen we speak of the “evolution of the is precisely encoding the great variety of person- Web,” it might actually be more appropri- al, professional, and genealogical relationships W ate to speak of “intelligent design” — we between people and organizations. By contrast, can actually point to a living, breathing, and an accidental challenge is that any blogger with actively involved Creator of the Web. We can even some knowledge of HTML can add microformat consult Tim Berners-Lee’s stated goals for the markup to a text-input form, but uploading an “promised land,” dubbed the Semantic Web. Few external file dedicated to machine-readable use presume we could reach those objectives by ran- remains forbiddingly complex with most blog- domly hacking existing Web standards and hop- ging tools. ing that “natural selection” by authors, software So, although any intelligent designer ought to developers, and readers would ensure powerful be able to rely on the long-established facility of enough abstractions for it. file transfer to publish the “right” model of a social Indeed, the elegant and painstakingly inter- network, the path of least resistance might favor locked edifice of technologies, including RDF, adding one of a handful of fixed tags to an exist- XML, and query languages is now growing pow- ing indirect form — the “blogroll” of hyperlinks to erful enough to attack massive information chal- other people’s sites.
    [Show full text]
  • Microformats, Metadata
    Microformats, Metadata April 29, 2013 1 Introduction 1.1 Content • What are microformats? • Why to use microformats? • How to replace microformats? • Advantages of microformats to own markup. • Disadvantages of microformats.. • Common used microformats • Microformats design patterns. • Resources on microformats. 1.2 What are microformats? • The way how to add simple markup into human readable information to allow computer processing. { acquiring data from documents { indexing { searching { storing { cross-referencing • Microformats are elements of a semantic markup using a plain old semantic (X)HTML (POSH) { Why has been the HTML designed at CERN (Conseil Europ´eenpour la Recherche Nucl´eaire(http://dg-rpc.web.cern.ch/dg-rpc/Scale. html)) v in the year 1991? 1 1.3 Example of Microformat <ol class='xoxo'> <li>Prvn´ıbod <ol> <li>Podbod a</li> <li>Podbod b</li> </ol> </li> <li>Druh´ybod <ol compact="compact"> <li>Podbod c</li> <li>Podbod d</li> </ol> </li> </ol> 1.4 Why to use microformats? Microformats combines some trends: • next logical step in web-design and information architecture progress • the manner how can people and organisations publish semantically rich content without dependency on centralized services • "agreement", that the traditional meta data either failed or their adoption took too long, so there has been a need of a different solution. { microformats use the meta data. :-) • Use of (X)HTML for data. 1.5 How can be microformats replaced? (1) • Including data in own formating in a different namespace { XHTML + Voice Profile (http://www.voicexml.org/specs/multimodal/ x+v/12/) { (X)HTML + SVG { XLink { XHTML+RDF { ... 1.6 How can be microformats replaced? (2) • Advantages: { easy visualisation (direct support in browsers, using CSS for exam- ple), { independent on centralized services.
    [Show full text]
  • Restful Web Apis
    Praise for RESTful Web APIs “This book is the best place to start learning the essential craft of API Design.” —Matt McLarty Cofounder, API Academy “The entire time I read this book, I was cursing. I was cursing because as I read each explanation, I was worried that they were so good that it would be hard to find a better one to use in my own writing. You will not find another work that explores the topic so thoroughly yet explains the topic so clearly. Please, take these tools, build something fantastic, and share it with the rest of the world, okay?” —Steve Klabnik Author, Designing Hypermedia APIs “Wonderfully thorough treatment of hypermedia formats, REST’s least well understood tenet." —Stefan Tilkov REST evangelist, author, and consultant “The best practical guide to hypermedia APIs. A must-have.” — Ruben Verborgh Semantic hypermedia researcher RESTful Web APIs Leonard Richardson and Mike Amundsen Foreword by Sam Ruby RESTful Web APIs by Leonard Richardson and Mike Amundsen with a Foreword by Sam Ruby Copyright © 2013 Leonard Richardson, amundsen.com, Inc., and Sam Ruby. All rights reserved. Printed in the United States of America. Published by O’Reilly Media, Inc., 1005 Gravenstein Highway North, Sebastopol, CA 95472. O’Reilly books may be purchased for educational, business, or sales promotional use. Online editions are also available for most titles (http://my.safaribooksonline.com). For more information, contact our corporate/ institutional sales department: 800-998-9938 or [email protected]. Editors: Simon St. Laurent and Meghan Blanchette Indexer: Judith McConville Production Editor: Christopher Hearse Cover Designer: Randy Comer Copyeditor: Jasmine Kwityn Interior Designer: David Futato Proofreader: Linley Dolby Illustrator: Rebecca Demarest September 2013: First Edition Revision History for the First Edition: 2013-09-10: First release See http://oreilly.com/catalog/errata.csp?isbn=9781449358068 for release details.
    [Show full text]
  • Similarity and Sense Disambiguation of Emoji Using Emojinet
    A Framework to Understand Emoji Meaning: Similarity and Sense Disambiguation of Emoji using EmojiNet A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy By Sanjaya Wijeratne B.Sc. in Information Technology, University of Moratuwa, Sri Lanka, 2009. 2018 Wright State University WRIGHT STATE UNIVERSITY GRADUATE SCHOOL November 19, 2018 I HEREBY RECOMMEND THAT THE DISSERTATION PREPARED UNDER MY SUPER- VISION BY Sanjaya Wijeratne ENTITLED A Framework to Understand Emoji Meaning: Similarity and Sense Disambiguation of Emoji using EmojiNet BE ACCEPTED IN PAR- TIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF Doctor of Philosophy. Amit Sheth, Ph.D. Dissertation Director Michael Raymer, Ph.D. Director, Computer Science and Engineering Ph.D. Program Barry Milligan, Ph.D. Interim Dean of the Graduate School Committee on Final Examination Amit Sheth, Ph.D. Derek Doran, Ph.D. Krishnaprasad Thirunarayan, Ph.D. Wenbo Wang, Ph.D. ABSTRACT Wijeratne, Sanjaya. Ph.D., Department of Computer Science and Engineering, Wright State Uni- versity, 2018. A Framework to Understand Emoji Meaning: Similarity and Sense Disambiguation of Emoji using EmojiNet. Pictographs, commonly referred to as `emoji', have become a popular way to enhance electronic communications. They are an important component of the language used in social media. With their introduction in the late 1990's, emoji have been widely used to enhance the sentiment, emotion, and sarcasm expressed in social media messages. They are equally popular across many social media sites including Facebook, Instagram, and Twitter. In 2015, Instagram reported that nearly half of the photo comments posted on Instagram contain emoji, and in the same year, Twitter reported that the `face with tears of joy' emoji has been tweeted 6.6 billion times.
    [Show full text]
  • Ontology and Vocabulary
    ONTOLOGIES, VOCABULARIES AND VARIOUS TOOLS REPORT An Environment Scan of Tools for Ontologies and Vocabularies Dr David A. McMeekin Contents 1 Introduction ............................................................................................................................... 3 2 Ontology and Vocabulary ........................................................................................................... 3 2.1 Preparing to Build an Ontology ........................................................................................... 4 3 Ontology Tools ........................................................................................................................... 5 3.1 Summary of Ontology and Vocabulary Tools.................................................................... 24 4 Ontologies and Vocabularies ................................................................................................... 25 5 Conclusions .............................................................................................................................. 45 © CRC for Spatial Information, 2015. All rights reserved. 2 1 Introduction This report records on the findings of an environmental scan of ontologies and vocabularies and the tools that can be used to create them. As will be seen in the report, there is a large number of tools that can be used. In preparing this report a very wide search has been carried out. However, it is possible and expected that there are more tools available than what are reported upon here. There
    [Show full text]
  • Enriching Unstructured Media Content About Events to Enable Semi-Automated Summaries, Compilations, and Improved Search by Leveraging Social Networks
    Enriching Unstructured Media Content About Events to Enable Semi-Automated Summaries, Compilations, and Improved Search by Leveraging Social Networks Thomas Steiner Departament de Llenguatges i Sistemes Informàtics Universitat Politècnica de Catalunya A thesis submitted for the degree of Philosophiæ Doctor (PhD) February 2014 1st Advisor: Joaquim Gabarró Vallés Universitat Politècnica de Catalunya, Barcelona, Spain 2nd Advisor: Michael Hausenblas Digital Enterprise Research Institute, Galway, Ireland MapR Technologies, San Jose, CA, USA ii Abstract (i) Mobile devices and social networks are omnipresent Mobile devices such as smartphones, tablets, or digital cameras together with social networks enable people to create, share, and consume enormous amounts of media items like videos or photos both on the road or at home. Such mobile devices—by pure definition—accompany their owners almost wherever they may go. In consequence, mobile devices are omnipresent at all sorts of events to capture noteworthy moments. Exemplary events can be keynote speeches at conferences, music concerts in stadiums, or even natural catastrophes like earthquakes that affect whole areas or countries. At such events—given a stable network connection—part of the event-related media items are published on social networks both as the event happens or afterwards, once a stable network connection has been established again. (ii) Finding representative media items for an event is hard Common media item search operations, for example, searching for the of- ficial video clip for a certain hit record on an online video platform can in the simplest case be achieved based on potentially shallow human-generated metadata or based on more profound content analysis techniques like optical character recognition, automatic speech recognition, or acoustic fingerprint- ing.
    [Show full text]
  • AN Etworkfor G Irlbosses PASSION, PROFITS
    A N e t w o r k f o r G i r l b o s s e s PASSION, PROFITS & PERSONAL DEVELOPMENT; A GUIDE TO CREATING A BUSINESS & LIFESTYLE YOU ARE UTTERLY OBSESSED WITH So amazing to have you here. Step into my office, grab a coffee/glass of wine (time dependent) and be prepared to start looking at life through an entirely different lens. I'm Dani and thank you for following The Clique. So glad to have you in my circle of mind blowing women who are ready to design a business that stands out from the crowds and a lifestyle that they are utterly obsessed with. In The Clique, we are all about doing what we love and doing lots of it. We aren't prepared to settle and we want to live entirely on our own terms. That's exactly what I want for you too. I created The Clique after launching my own successful business online as a social media strategist, a business that allowed me to quit a career that didn't fulfil me, gave me the freedom to work when and from where I chose and earn more than what I ever imagined I ever would. I now help other women achieve exactly the same. Although social media strategy is still a huge part of what I do, my biggest passion now lies in empowering other women with the mindset and strategies they need in order to become their own boss, realise their fullest potential and guiding them through the process of designing a brand that sells.
    [Show full text]
  • Using Microformats and Rdfa As Embeddable Data Formats
    CHAPTER 18 Using Microformats and RDFa As Embeddable Data Formats The central problem that we will study in this chapter is how to embed information in web pages in a way that is easy to understand by both humans and computer programs. The solution that we will consider in depth is microformats, little chunks of structured data that are seamlessly embedded in web pages. (X)HTML is designed primarily to produce a human user interface (via a web browser). However, by carefully following certain conventions (ones that constitute microformats), you can produce (X)HTML out of which data can be unambiguously extracted. The consequence is that it is relatively easy to write computer programs to parse microformats so that the data can be reused in other contexts—giving rise to plenty of mashup possibilities. Moreover, having data embedded right in the context of a user interface is helpful. A user can decide what to do with this data (how to “operate” on a given piece of data) while in the context of normal browsing. Now, the previous paragraph is a bit abstract. What I will do in this chapter is walk you through some concrete examples describing microformats in general. We will use Operator, a Firefox extension, to help parse and view microformats and to create scripts that enable users to take specific actions in response to microformats. Specifically, we will do the following: * Study specific examples of microformats * Look at how to use the Firefox add-on Operator to jump-start your study of microformats * Look at programmatic approaches to consuming and creating microformats * Compare microformats to leading alternatives such as RDFa as a way to embed data in human-readable contexts.
    [Show full text]
  • Restful Web Apis
    CHAPTER 10 The Hypermedia Zoo There are a lot of hypermedia document formats in active use. Some are designed for very specialized purposes—the people who use them may not even think of them as hypermedia formats. Other hypermedia formats are in such common usage that people don’t really think about them at all. In this chapter, I’ll take you on an educational tour of a “zoo” containing the most popular and most interesting hypermedia formats. I won’t be going into a lot of technical detail. Any one of these formats probably isn’t the one you want to use, and I’ve covered many of them earlier in the book. Many of the formats are still under active development, and their details might change. If you’re interested in one of the zoo’s specimens, the next step is to read its formal specification. My goal is to give you a sense of the many forms hypermedia can take, and to show how many times we’ve tackled the basic problems of representing it. The hypermedia zoo is so full that you probably don’t need to define a brand new media type for your API. You should be able to pick an existing media type and write a profile for it. I’ve organized the hypermedia zoo along the lines of my introduction to hypermedia. There’s a section for domain-specific formats (a la Chapter 5), a section for formats whose primary purpose is to implement the collection pattern (a la Chapter 6), and a section for general hypermedia formats (a la Chapter 7).
    [Show full text]
  • LOMOTEY-THESIS.Pdf (2.594Mb)
    ENABLING MOBILE DEVICES TO HOST CONSUMERS AND PROVIDERS OF RESTFUL WEB SERVICES A Thesis Submitted to the College of Graduate Studies and Research In Partial Fulfillment of the Requirements For the Degree of Master of Science In the Department of Computer Science University of Saskatchewan Saskatoon By RICHARD KWADZO LOMOTEY Keywords: mobile Web resources hosting, middleware, cloud computing, REST Copyright Richard Kwadzo Lomotey, March 2012. All rights reserved. PERMISSION TO USE In presenting this thesis in partial fulfillment of the requirements for a Postgraduate degree from the University of Saskatchewan, I agree that the Libraries of this University may make it freely available for inspection. I further agree that permission for copying of this thesis in any manner, in whole or in part, for scholarly purposes may be granted by the professor or professors who supervised my thesis work or, in their absence, by the Head of the Department or the Dean of the College in which my thesis work was done. It is understood that any copying or publication or use of this thesis or parts thereof for financial gain shall not be allowed without my written permission. It is also understood that due recognition shall be given to me and to the University of Saskatchewan in any scholarly use which may be made of any material in my thesis. Requests for permission to copy or to make other use of material in this thesis in whole or part should be addressed to: Head of the Department of Computer Science 176 Thorvaldson Building 110 Science Place University of Saskatchewan Saskatoon, Saskatchewan Canada S7N 5C9 i ABSTRACT The strong growth in the use of mobile devices such as smartphones and tablets in Enterprise Information Systems has led to growing research in the area of mobile Web services.
    [Show full text]