4.1 Feeds and Podcasts.Pptx

Total Page:16

File Type:pdf, Size:1020Kb

4.1 Feeds and Podcasts.Pptx Module 4: Web 2.0 Carlos Delgado Kloos Dep. Ing. Telemática Universidad Carlos III de Madrid Exercise Write down 10 keywords that come to your mind in relation with Web 2.0 Together with your colleagues come up with those that are mentioned more often and better characterize Web 2.0 Reduce these to just 2 keywords: a technical one and a non-technical one MIIT 2009/10: GIMI 4: Web 2.0 2 © Carlos Delgado Kloos, UC3M 2009 1 MIIT 2009/10: GIMI 4: Web 2.0 3 Web 2.0 MIIT 2009/10: GIMI 4: Web 2.0 4 © Carlos Delgado Kloos, UC3M 2009 2 Web 2.0 Read Read-only -write web web MIIT 2009/10: GIMI 4: Web 2.0 5 Web 2.0 MIIT 2009/10: GIMI 4: Web 2.0 6 © Carlos Delgado Kloos, UC3M 2009 3 Web 2.0: The Social Web Services, not packaged software, with cost-effective scalability Control over unique, hard-to-recreate data sources that get richer as more people use them Trusting users as co-developers Harnessing collective intelligence Leveraging the long tail through customer self- service Software above the level of a single device Lightweight user interfaces AND business models MIIT 2009/10: GIMI 4: Web 2.0 7 Harnessing Collective Intelligence James Surowiecki: The Wisdom of Crowds MIT Center for Collective Intelligence – cci.mit.edu – scripts.mit.edu/ ~cci/wiki/ Cooperation Commons – www.cooperationcommons.com MIIT 2009/10: GIMI 4: Web 2.0 8 © Carlos Delgado Kloos, UC3M 2009 4 Overview From Web Pages – … to Feeds – … to Podcasts – … to Widgets – … to Mashups MIIT 2009/10: GIMI 4: Web 2.0 9 From Web Pages to Feeds Carlos Delgado Kloos Dep. Ing. Telemática Universidad Carlos III de Madrid © Carlos Delgado Kloos, UC3M 2009 5 Question What is the key feature that characterizes feeds in relation to web pages? MIIT 2009/10: GIMI 4: Web 2.0 11 Communication Paradigms Through shared variables (shared memory) Proc1 Mem Proc2 web Through message passing Proc1 Proc2 email MIIT 2009/10: GIMI 4: Web 2.0 12 © Carlos Delgado Kloos, UC3M 2009 6 Web: http request client server response MIIT 2009/10: GIMI 4: Web 2.0 13 Push vs. Pull Pull – Web, http Push – PointCast • Company in California (1992-97) • News over a screensaver MIIT 2009/10: GIMI 4: Web 2.0 14 © Carlos Delgado Kloos, UC3M 2009 7 Push or pull? Continuum – E-mail – E-mail with periodic retrieval – IM It’s all about how to launch requests. MIIT 2009/10: GIMI 4: Web 2.0 15 Content Syndication The distribution or redistribution of content to other Web sites Typical content forms include headlines, news, images, and multimedia clips (i.e., video and/or audio) Web sites that produce content place it on other sites for brand awareness or site traffic generation. Web sites that host syndicated content use it to attract visitors to their own sites. MIIT 2009/10: GIMI 4: Web 2.0 16 © Carlos Delgado Kloos, UC3M 2009 8 What is RSS? RSS is a format for syndicating news and the content of news-like sites, e.g. – Major news sites like Wired, – news-oriented community sites like Slashdot, – and personal weblogs Not only news: – the 'recent changes' page of a wiki, – a changelog of CVS checkins, – the revision history of a book Feedreaders or aggregators are used to read feeds MIIT 2009/10: GIMI 4: Web 2.0 17 feed.html <html> Helps the browser identify feeds <head> <link rel="alternate" type="application/rss+xml" href=”feed.xml" title="GIMI News"> </head> <body> Subscribe to the GIMI channel (by cdk): <a href="feed.xml"> <img src="pic/rss.png" border="0" width="20"> </a> </body> Reference to the feed </html> MIIT 2009/10: GIMI 4: Web 2.0 18 © Carlos Delgado Kloos, UC3M 2009 9 feed.xml <?xml version="1.0" ?> <rss version="2.0”> <channel> <title>GIMI news (by cdk)</title> <link>http://www.it.uc3m.es/berto/GIMI</link> <description>News for GIMI</description> <image><url>http://www.it.uc3m.es/imag/logoit3.gif</url></image> <lastBuildDate>Sun, 04 Nov 2007 21:12:03 GMT</lastBuildDate> Dates are important <item> for the <title>Class about XML available</title> reader to <link>http://www.it.uc3m.es/cdk/xml.pdf</link> learn about <description>The slides about XML are available</description> changes <pubDate>Sun, 04 Nov 2007 21:12:03 GMT</pubDate> </item> </channel> Date syntax from RFC 822 </rss> http://asg.web.cmu.edu/rfc/rfc822.html MIIT 2009/10: GIMI 4: Web 2.0 19 RSS Evolution MIIT 2009/10: GIMI 4: Web 2.0 20 © Carlos Delgado Kloos, UC3M 2009 10 RSS Versions MIIT 2009/10: GIMI 4: Web 2.0 Source: Mark Pilgrim, 2002 21 Meaning of RSS Rich Site Summary (RSS 0.91) RDF Site Summary (RSS 0.9 and 1.0) Really Simple Syndication (RSS 2.0) (Resource Subscription Specification) MIIT 2009/10: GIMI 4: Web 2.0 22 © Carlos Delgado Kloos, UC3M 2009 11 Specifications RSS 2.0: – www.rssboard.org/rss-specification RDF Site Summary 1.0: – web.resource.org/rss/1.0/ Atom: – atomenabled.org – tools.ietf.org/html/rfc4287 MIIT 2009/10: GIMI 4: Web 2.0 23 Atom Supported by IETF, Google and iTunes Inspite of that, RSS 2.0 is still preferred The term RSS is often used to include Atom as a possibility as well. MIIT 2009/10: GIMI 4: Web 2.0 24 © Carlos Delgado Kloos, UC3M 2009 12 feed.xml <?xml version="1.0" ?> <feed xmlns="http://www.w3.org/2005/Atom"> <title>GIMI news (by cdk)</title> <link href="http://www.it.uc3m.es/berto/GIMI"/> <subtitle>News for GIMI</subtitle> <icon><link href="http://www.it.uc3m.es/imag/logoit3.gif"/></icon> <updated>2007-11-04T21:12:03Z</updated> Dates are important <entry> for the <title>Class about XML available</title> reader to <link href="http://www.it.uc3m.es/cdk/xml.pdf"/> learn about <summary>The slides about XML are available</summary> changes <updated>2007-11-04T21:12:03Z</updated> </entry> </feed> Date syntax from RFC 3339, compatible with XML Schema MIIT 2009/10: GIMI 4: Web 2.0 25 How to Exchange Feeds? XML formats to exchange lists of web feeds between web feed aggregators OPML – Outline Processor Markup Language – www.opml.org XOXO – eXtensibleOpen XHTML Outlines – microformats.org/wiki/xoxo OML – Outline Markup Language MIIT 2009/10: GIMI 4: Web 2.0 26 © Carlos Delgado Kloos, UC3M 2009 13 From Feeds to Podcasts Carlos Delgado Kloos Dep. Ing. Telemática Universidad Carlos III de Madrid Question What is the key feature that characterizes podcasts in relation to feeds ? MIIT 2009/10: GIMI 4: Web 2.0 28 © Carlos Delgado Kloos, UC3M 2009 14 Enclosures RSS enclosures are a way of attaching multimedia content to RSS feeds by providing the URL of a file such as – an MP3 file to a music recommendation or – a photo to a diary entry The addition of enclosures to RSS was an important prerequisite for the emergence of podcasting How it all started: – www.thetwowayweb.com/payloadsforrss MIIT 2009/10: GIMI 4: Web 2.0 29 Enclosures <rss version="2.0”><channel> <title>Title of page</title> <link>http://www.dom.com</link> <description>Description of page</description> <item> <title>Story about something</title> <link>http://www.dom.com/item1.htm</link> <enclosure url="http://www.dom.com/file.mov" length="320000" type="video/quicktime"/> </item> </channel></rss> MIIT 2009/10: GIMI 4: Web 2.0 30 © Carlos Delgado Kloos, UC3M 2009 15 Media RSS Extends enclosures to handle other media types, such as short films or TV, as well as provide additional metadata with the media http://search.yahoo.com/mrss MIIT 2009/10: GIMI 4: Web 2.0 31 Media RSS <rssversion="2.0” xmlns:media=http://search.yahoo.com/mrss xmlns:creativeCommons= http://backend.userland.com/creativeCommonsRssModule> <channel> <title>My MovieReviewSite</title> <link>http://www.dom.com</link> <description>I reviewmovies.</description> <item> <title>MovieTitle: Isthisa goodmovie?</title> <link>http://www.dom.com/item1.htm</link> <media:contenturl=http://www.dom.com/trailer.mov fileSize="96320" type="video/quicktime" expression="sample"/> <creativeCommons:license> http://www.creativecommons.org/licenses/by-nc/1.0 </creativeCommons:license> <media:rating>nonadult</media:rating> </item> </channel> </rss> MIIT 2009/10: GIMI 4: Web 2.0 32 © Carlos Delgado Kloos, UC3M 2009 16 Vodcast directories http://www.vodcasts.tv/ http://www.vodstock.com MIIT 2009/10: GIMI 4: Web 2.0 33 Assignment Design a web site with – Login for users – Several gadgets: at least • a twitter box • a video • a calendar with events • event announcement with information (including a map) and registration • a feedback form MIIT 2009/10: GIMI 4: Web 2.0 34 © Carlos Delgado Kloos, UC3M 2009 17 iGoogle www.igoogle.com/ig MIIT 2008/09: GIMI 4: Web 2.0 35 iGoogle Gadgets <?xml version="1.0" encoding="UTF-8" ?> <Module> <ModulePrefs title="Hello World!" /> <Content type="html"> <![CDATA[ Hello, world! ]]> </Content> </Module> MIIT 2009/10: GIMI 4: Web 2.0 36 © Carlos Delgado Kloos, UC3M 2009 18 Google Friend Connect Make your site social Allow logins from Example: www.mylatestpiece.com MIIT 2009/10: GIMI 4: Web 2.0 37 Google Friend Connect google.com/friendconnect Easy set-up without programming MIIT 2009/10: GIMI 4: Web 2.0 38 © Carlos Delgado Kloos, UC3M 2009 19 Google Friend Connect Choose the apps you want from a gallery of gadgets created by Google and the OpenSocial developer community – You can get gadgets for photo sharing, concert listings and games, with many more to come Copy the snippets for the gadgets you've chosen Paste the snippets where you want them on your site and just like that, they'll be up and running MIIT 2009/10: GIMI 4: Web 2.0 39 © Carlos Delgado Kloos, UC3M 2009 20 .
Recommended publications
  • Googling the Grey: Open Data, Web Services, and Semantics
    Archaeologies: Journal of the World Archaeological Congress (Ó 2010) DOI 10.1007/s11759-010-9146-4 Googling the Grey: Open Data, Web Services, and Semantics Eric C. Kansa, School of Information, UC Berkeley, Berkeley, CA, USA E-mail: [email protected] RESEARCH Sarah Whitcher Kansa, Alexandria Archive Institute, San Francisco, CA, USA Margie M. Burton and Cindy Stankowski, San Diego Archaeological Center, Escondido, CA, USA ABSTRACT ________________________________________________________________ Primary data, though an essential resource for supporting authoritative archaeological narratives, rarely enters the public record. Lack of primary data publication is also a major obstacle to cultural heritage preservation and the goals of cultural resource management (CRM). Moreover, access to primary data is key to contesting claims about the past and to the formulation of credible alternative interpretations. In response to these concerns, experimental systems have implemented a variety of strategies to support online publication of primary data. Online data dissemination can be a powerful tool to meet the needs of CRM professionals, establish better communication and collaborative ties with colleagues in academic settings, and encourage public engagement with the documented record of the past. This paper introduces the ArchaeoML standard and its implementation in the Open Context system. As will be discussed, the integration and August 2010 online dissemination of primary data offer great opportunities for making archaeological knowledge creation more participatory and transparent. However, different strategies in this area involve important trade-offs, and all face complex conceptual, ethical, legal, and professional challenges. ________________________________________________________________ Volume 6 Number 2 Since Open Context is operated in the United States, the European Union’s database protection laws are less applicable.
    [Show full text]
  • Digital Fountain Erasure-Recovery in Bittorrent
    UNIVERSITÀ DEGLI STUDI DI BERGAMO Facoltà di Ingegneria Corso di Laurea Specialistica in Ingegneria Informatica Classe n. 35/S – Sistemi Informatici Digital Fountain Erasure Recovery in BitTorrent: integration and security issues Relatore: Chiar.mo Prof. Stefano Paraboschi Correlatore: Chiar.mo Prof. Andrea Lorenzo Vitali Tesi di Laurea Specialistica Michele BOLOGNA Matricola n. 56108 ANNO ACCADEMICO 2007 / 2008 This thesis has been written, typeset and prepared using LATEX 2". Printed on December 5, 2008. Alla mia famiglia “Would you tell me, please, which way I ought to go from here?” “That depends a good deal on where you want to get to,” said the Cat. “I don’t much care where —” said Alice. “Then it doesn’t matter which way you go,” said the Cat. “— so long as I get somewhere,” Alice added as an explanation. “Oh, you’re sure to do that,” said the Cat, “if you only walk enough.” Lewis Carroll Alice in Wonderland Acknowledgments (in Italian) Ci sono molte persone che mi hanno aiutato durante lo svolgimento di questo lavoro. Il primo ringraziamento va ai proff. Stefano Paraboschi e Andrea Vitali per la disponibilità, la competenza, i consigli, la pazienza e l’aiuto tecnico che mi hanno saputo dare. Grazie di avermi dato la maggior parte delle idee che sono poi confluite nella mia tesi. Un sentito ringraziamento anche a Andrea Rota e Ruben Villa per l’aiuto e i chiarimenti che mi hanno gentilmente fornito. Vorrei ringraziare STMicroelectronics, ed in particolare il gruppo Advanced System Technology, per avermi offerto le infrastrutture, gli spa- zi e tutto il necessario per svolgere al meglio il mio periodo di tirocinio.
    [Show full text]
  • Foundations of Temporal Text Networks
    Vega et al. Applied Network Science (2018) 3:25 Applied Network Science https://doi.org/10.1007/s41109-018-0082-3 RESEARCH Open Access Foundations of Temporal Text Networks Davide Vega* and Matteo Magnani *Correspondence: [email protected] Abstract InfoLab, Department of Information Three fundamental elements to understand human information networks are the Technology, Uppsala University, Uppsala, Sweden individuals (actors) in the network, the information they exchange, that is often observable online as text content (emails, social media posts, etc.), and the time when these exchanges happen. An extremely large amount of research has addressed some of these aspects either in isolation or as combinations of two of them. There are also more and more works studying systems where all three elements are present, but typically using ad hoc models and algorithms that cannot be easily transfered to other contexts. To address this heterogeneity, in this article we present a simple, expressive and extensible model for temporal text networks, that we claim can be used as a common ground across different types of networks and analysis tasks, and we show how simple procedures to produce views of the model allow the direct application of analysis methods already developed in other domains, from traditional data mining to multilayer network mining. Keywords: Network, Text, Time, Model, Temporal text network, Human information network Introduction A large amount of human-generated information is available online in the form of text exchanged between individuals at specific times. Examples include social network sites, online forums and emails. The public accessibility of several of these sources allows us to observe our society at various scales, from focused conversations among small groups of individuals to broad political discussions involving heterogeneous audiences from large geographical areas (Zhou et al.
    [Show full text]
  • The Girlboss Guide to Killing It Online & Making an Impact in the Digital Space
    AN INTRODUCTORY GUIDE TO MAKING YOUR MARK IN THE DIGITAL SPACE FOR GIRBOSSES, BLOGGERS AND CREATIVES. By Dani Watson A N I N T R O T O M A K I N G Y O U R M A R K O N L I N E B Y D A N I W A T S O N WITH THE AMOUNT OF COMPETITION IN THE ONLINE WORLD, IT CAN BE AN OVERWHELMING TASK TO THINK ABOUT HOW YOU ARE EVER GOING TO MAKE YOU AND YOUR BRAND STAND OUT FROM THE NOISE. WHETHER YOU'RE A BUSINESS LOOKING TO LAUNCH ONLINE, A BLOGGER WHO WANTS TO TURN THEIR BLOG INTO A FULL TIME INCOME OR A BRAND THAT IS LOOKING TO ELEVATE ITS PRESENCE IN THE DIGITAL SPACE, ITS ALL ABOUT FINDING AND CONNECTING WITH YOUR CLIQUE AND ONCE YOU DO, LOVING THEM HARD. I AM A FIRM BELIEVER THAT WHENEVER YOU ARE LOOKING TO CREATE AN INCOME ONLINE, THE COMMUNITY COMES BEFORE THE SELL. PEOPLE HAVE TO KNOW, LIKE AND TRUST YOU BEFORE THEY ARE WILLING TO INVEST. THIS IS WHY IT IS SO IMPORTANT TO BE THINKING ABOUT CREATING NOT ONLY YOUR PRODUCTS/ SERVICES/BLOG, BUT FIGURING OUT WAYS TO MAKE PEOPLE FALL IN LOVE WITH YOU AND YOUR BRAND SO THEY BUY WHATEVER IT IS YOU SELL. I AM SOMEONE WHO THOROUGHLY UNDERSTANDS THE IMPORTANCE OF FINDING YOUR TRIBE ONLINE AND BECOMING KNOWN FOR WHAT YOU DO AS THIS IS EXACTLY WHAT I HAVE BEEN ABLE TO DO WITH MY OWN BUSINESS AND BRAND. WHAT STARTED OFF AS A SMALL BLOG AND SOCIAL MEDIA CONSULTANCY HAS QUICKLY DEVELOPED INTO AN ONLINE EMPIRE THAT SHOWS NO SIGNS OF SLOWING DOWN.
    [Show full text]
  • Specifications for Implementing Web Feeds in DLXS Kevin S
    Hawkins 1/5/2011 5:01:52 PM Page 1 of 5 * SCHOLARLY PUBLISHING OFFICE WHITEPAPER Specifications for implementing web feeds in DLXS Kevin S. Hawkins Executive Summary SPO uses DLXS to deliver nearly all of its publications online, but this software does not offer any functionality for web feeds (commonly called RSS feeds). The components of web feeds are reviewed, and recommendations are made for implementing Atom 1.0 web feeds in SPO publications using a script run during releasing which determines which content to push to the feed. Motivations Members of the editorial board of the Journal of Electronic Publishing (collid jep) and SPO staff have requested that web feed functionality be provided for SPO publications. This would allow end users to subscribe to feeds of not only serial content (the obvious use) but in theory any other collection to which content is added, especially on a regular basis. A feed could provide a single notice that a collection has been updated, perhaps with a table of contents (henceforth a small feed), or instead contain separate notices for each item added to the collection (a large feed). Components of a web feed A web feed operates with three necessary components: • a website that syndicates (or publishes) a feed • a feed standard being used by the website • the end user’s news aggregator (feed reader or news reader), which retrieves the feed periodically. A feed is an XML document containing individual entries (also items, stories, or articles). Syndicating a feed The content provider’s website often has buttons (icons called chicklets) to be used by the end user to add a feed to his or her news aggregator.
    [Show full text]
  • A Method to Analyze Multiple Social Identities in Twitter Bios
    A Method to Analyze Multiple Social Identities in Twitter Bios ARJUNIL PATHAK, University at Buffalo, USA NAVID MADANI, University at Buffalo, USA KENNETH JOSEPH, University at Buffalo, USA Twitter users signal social identity in their profile descriptions, or bios, in a number of important but complex ways that are not well-captured by existing characterizations of how identity is expressed in language. Better ways of defining and measuring these expressions may therefore be useful both in understanding howsocial identity is expressed in text, and how the self is presented on Twitter. To this end, the present work makes three contributions. First, using qualitative methods, we identify and define the concept of a personal identifier, which is more representative of the ways in which identity is signaled in Twitter bios. Second, we propose a method to extract all personal identifiers expressed in a given bio. Finally, we present a series of validation analyses that explore the strengths and limitations of our proposed method. Our work opens up exciting new opportunities at the intersection between the social psychological study of social identity and the study of how we compose the self through markers of identity on Twitter and in social media more generally. CCS Concepts: • Applied computing Sociology; Additional Key Words and Phrases: Twitter, self-presentation, social identity, computational social science ACM Reference Format: Arjunil Pathak, Navid Madani, and Kenneth Joseph. 2018. A Method to Analyze Multiple Social Identities in Twitter Bios. J. ACM 37, 4, Article 111 (August 2018), 35 pages. https://doi.org/10.1145/1122445.1122456 1 INTRODUCTION A social identity is a word or phrase that refers to a particular social group (e.g.
    [Show full text]
  • Despliegue España-América Latina De Broadcatching E-Learning
    Despliegue España-América Latina de Broadcatching e-learning Rafael García Monroy E.T.S.I.T. U.P.M. Departamento Ingeniería de Sistemas Telemáticos e-mail: [email protected] Abstract . Los avances tecnológicos y la eliminación de fronteras que resultan del potente y exponencial despliegue de Internet permiten que compartir vídeos de material educativo de gran calidad sea viable. Ya que las exigencias de los usuarios finales –alumnos- crecen paralelamente a la adopción de los avances mencionados, y las posibilidades reales de mejorar los servicios son una realidad, no queda más que emplear las herramientas existentes para crear nuevos modelos innovadores que mejoren constantemente las plataformas de difusión de conocimiento, pilar del desarrollo sostenible. El siguiente artículo describe el despliegue de un modelo e-learning basado en Broadcatching (BitTorrent, + RSS), a través del cual clases de vídeo grabadas en España podrán ser inmediatamente distribuidas a universidades latinoamericanas con las que se tenga convenio, compartiendo recursos de red y, de gran importancia, diseminando los ficheros educativos con una excelente calidad de contenido. De esta manera, la distancia, el costo de distribución y las limitaciones temporales pasan a segundo plano, permitiendo que los contenidos educativos de primera calidad tengan un acceso de naturaleza universal. Palabras clave: BitTorrent, broadcatching, e-learning, P2P, RSS, vídeo. 1 Introducción redes de distribución. Y es precisamente este el punto que hay que explotar para que el alumno obtenga el La cooperación en cualquier ámbito conlleva al mejor contenido educativo posible: la obtención de desarrollo sostenible, en que las partes implicadas vídeos de clase de alta definición que son pesados, comparten el fruto de la inversión.
    [Show full text]
  • Microformats the Next (Small) Thing on the Semantic Web?
    Standards Editor: Jim Whitehead • [email protected] Microformats The Next (Small) Thing on the Semantic Web? Rohit Khare • CommerceNet “Designed for humans first and machines second, microformats are a set of simple, open data formats built upon existing and widely adopted standards.” — Microformats.org hen we speak of the “evolution of the is precisely encoding the great variety of person- Web,” it might actually be more appropri- al, professional, and genealogical relationships W ate to speak of “intelligent design” — we between people and organizations. By contrast, can actually point to a living, breathing, and an accidental challenge is that any blogger with actively involved Creator of the Web. We can even some knowledge of HTML can add microformat consult Tim Berners-Lee’s stated goals for the markup to a text-input form, but uploading an “promised land,” dubbed the Semantic Web. Few external file dedicated to machine-readable use presume we could reach those objectives by ran- remains forbiddingly complex with most blog- domly hacking existing Web standards and hop- ging tools. ing that “natural selection” by authors, software So, although any intelligent designer ought to developers, and readers would ensure powerful be able to rely on the long-established facility of enough abstractions for it. file transfer to publish the “right” model of a social Indeed, the elegant and painstakingly inter- network, the path of least resistance might favor locked edifice of technologies, including RDF, adding one of a handful of fixed tags to an exist- XML, and query languages is now growing pow- ing indirect form — the “blogroll” of hyperlinks to erful enough to attack massive information chal- other people’s sites.
    [Show full text]
  • Sociolinguistically Driven Approaches for Just Natural Language Processing
    University of Massachusetts Amherst ScholarWorks@UMass Amherst Doctoral Dissertations Dissertations and Theses April 2021 Sociolinguistically Driven Approaches for Just Natural Language Processing Su Lin Blodgett University of Massachusetts Amherst Follow this and additional works at: https://scholarworks.umass.edu/dissertations_2 Part of the Artificial Intelligence and Robotics Commons Recommended Citation Blodgett, Su Lin, "Sociolinguistically Driven Approaches for Just Natural Language Processing" (2021). Doctoral Dissertations. 2092. https://doi.org/10.7275/20410631 https://scholarworks.umass.edu/dissertations_2/2092 This Open Access Dissertation is brought to you for free and open access by the Dissertations and Theses at ScholarWorks@UMass Amherst. It has been accepted for inclusion in Doctoral Dissertations by an authorized administrator of ScholarWorks@UMass Amherst. For more information, please contact [email protected]. SOCIOLINGUISTICALLY DRIVEN APPROACHES FOR JUST NATURAL LANGUAGE PROCESSING A Dissertation Presented by SU LIN BLODGETT Submitted to the Graduate School of the University of Massachusetts Amherst in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY February 2021 College of Information and Computer Sciences © Copyright by Su Lin Blodgett 2021 All Rights Reserved SOCIOLINGUISTICALLY DRIVEN APPROACHES FOR JUST NATURAL LANGUAGE PROCESSING A Dissertation Presented by SU LIN BLODGETT Approved as to style and content by: Brendan O’Connor, Chair Mohit Iyyer, Member Hanna Wallach, Member Lisa Green, Member James Allan, Chair of the Faculty College of Information and Computer Sciences ACKNOWLEDGMENTS First and foremost, I would like to thank my advisor, Brendan O’Connor, who introduced me to natural language processing and computational sociolinguistics, taught me more than I can say about research, NLP, and writing, and has been endlessly supportive of my many and growing research interests; I consider myself amazingly lucky to have stumbled into his lab.
    [Show full text]
  • Database-Driven Web Mashups
    Database-Driven Web Mashups Andrei Vancea Michael Grossniklaus Moira C. Norrie Institute for Information Systems ETH Zurich CH-8092 Zurich, Switzerland {vancea,grossniklaus,norrie}@inf.ethz.ch Abstract news feeds being combined into a single feed. An enterprise mashup uses the general mashup techniques within a com- In most web mashup applications, the content is gener- pany’s own internal applications. It usually combines data ated using either web feeds or an application programming from both internal and external sources. Finally, a business interface (API) based on web services. Both approaches mashup is a combination of all of the above that makes the have limitations. Data models provided by web feeds are result available for a business application. not powerful enough to permit complex data structures to In this paper, we argue that current approaches to the de- be transmitted. APIs based on web services are usually velopment of web mashup applications lack a powerful data different for each web application, and thus different im- model for data interchange. As mentioned above, in most plementations of the APIs are required for each web ser- cases, the content is generated using feeds or an applica- vice that a web mashup application uses. We propose a tion programming interface (API) based on web services. database-driven approach to web mashups that supports In the former case, the data models provided by web feeds integration at the database level and enables mashup de- (RSS or Atom) lack flexibility and do not permit complex velopers to work with a uniform abstract model and have data structures to be transmitted.
    [Show full text]
  • Evaluating Web Service Apis: Finding a Flexible Video Management System
    Evaluating Best-of-Class Web Service APIs for Today's Multi-platform Video Management Solutions By Alan Ramaley, CTO, and Nick Rossi, VP Engineering thePlatform for Media, Inc. ABSTRACT services in a deployment and scale them independently. Video management and publishing platforms are evolving to meet the market’s Areas of focus will include: need for reaching consumers with reliable, high-capacity services – anytime, anywhere, Breadth – an API should expose all on any device. As such, solution providers the functionality in the underlying have to integrate their technology with a service vast set of devices, systems, and Cohesion– a given service should environments—includingauthenticated have a single area of responsibility syndication, third-party websites, mobile Security – we will compare and devices with vastly differing specs, set- contrast five common models topboxes, connectedTVs, smart over-the-top Web standards– support for devices, andthird-party services, such as ad REST,Atom, RSS, and JSON for data networks and content discovery engines. services, and REST and SOAP for business services. Web service application programming Data access – APIs should provide interfaces (APIs) play an integral role in very flexible read and write access to enabling content providers and distributors service data to succeed in a consumer driven market Notifications – with a comparison of that’s in constant flux. Developers at media push vs. pull notification models. companies and TV service providersneed Extending the schema– what to look flexibility and open APIs to adapt to for to make sure a service can changes in TV, online, and mobile video support your custom data. publishing. Scalability– how to build scalability into an API at the core, to allow for This paper provides an in-depth evaluation a 99.99% read SLA of the most important features web service APIs should offer and explains why those Lastly, the paper focuses on some of the best features are important.
    [Show full text]
  • An Introduction to Georss: a Standards Based Approach for Geo-Enabling RSS Feeds
    Open Geospatial Consortium Inc. Date: 2006-07-19 Reference number of this document: OGC 06-050r3 Version: 1.0.0 Category: OpenGIS® White Paper Editors: Carl Reed OGC White Paper An Introduction to GeoRSS: A Standards Based Approach for Geo-enabling RSS feeds. Warning This document is not an OGC Standard. It is distributed for review and comment. It is subject to change without notice and may not be referred to as an OGC Standard. Recipients of this document are invited to submit, with their comments, notification of any relevant patent rights of which they are aware and to provide supporting do Document type: OpenGIS® White Paper Document subtype: White Paper Document stage: APPROVED Document language: English OGC 06-050r3 Contents Page i. Preface – Executive Summary........................................................................................ iv ii. Submitting organizations............................................................................................... iv iii. GeoRSS White Paper and OGC contact points............................................................ iv iv. Future work.....................................................................................................................v Foreword........................................................................................................................... vi Introduction...................................................................................................................... vii 1 Scope.................................................................................................................................1
    [Show full text]