Copyright and Use of This Thesis This Thesis Must Be Used in Accordance with the Provisions of the Copyright Act 1968

Total Page:16

File Type:pdf, Size:1020Kb

Copyright and Use of This Thesis This Thesis Must Be Used in Accordance with the Provisions of the Copyright Act 1968 COPYRIGHT AND USE OF THIS THESIS This thesis must be used in accordance with the provisions of the Copyright Act 1968. Reproduction of material protected by copyright may be an infringement of copyright and copyright owners may be entitled to take legal action against persons who infringe their copyright. Section 51 (2) of the Copyright Act permits an authorized officer of a university library or archives to provide a copy (by communication or otherwise) of an unpublished thesis kept in the library or archives, to a person who satisfies the authorized officer that he or she requires the reproduction for the purposes of research or study. The Copyright Act grants the creator of a work a number of moral rights, specifically the right of attribution, the right against false attribution and the right of integrity. You may infringe the author’s moral rights if you: - fail to acknowledge the author of this thesis if you quote sections from the work - attribute this thesis to another author - subject this thesis to derogatory treatment which may prejudice the author’s reputation For further information contact the University’s Director of Copyright Services sydney.edu.au/copyright Structured Named Entities Nicky Ringland Supervisor: James Curran A thesis submitted in fulfilment of the requirements for the degree of Doctor of Philosophy in the School of Information Technologies at The University of Sydney School of Information Technologies 2016 i Abstract The names of people, locations, and organisations play a central role in language, and named entity recognition (NER) has been widely studied, and successfully incorporated, into natural language processing (NLP) applications. The most common variant of NER involves identifying and classifying proper noun mentions of these and miscellaneous entities as linear spans in text. Unfortunately, this version of NER is no closer to a detailed treatment of named entities than chunking is to a full syntactic analysis. NER, so con- strued, reflects neither the syntactic nor semantic structure of NE mentions, and provides insufficient categorical distinctions to represent that structure. Representing this nested structure, where a mention may contain mention(s) of other entities, is critical for applications such as coreference resolution. The lack of this structure creates spurious ambiguity in the linear approximation. Research in NER has been shaped by the size and detail of the available annotated corpora. The existing structured named entity corpora are either small, in specialist domains, or in languages other than English. This thesis presents our Nested Named Entity (NNE) corpus of named entities and numerical and temporal expressions, taken from the WSJ portion of the Penn Treebank (PTB, Marcus et al., 1993). We use the BBN Pronoun Coreference and Entity Type Corpus (Weischedel and Brunstein, 2005a) as our basis, manu- ally annotating it with a principled, fine-grained, nested annotation scheme and detailed annotation guidelines. The corpus comprises over 279,000 entities over 49,211 sentences (1,173,000 words), including 118,495 top-level entities. Our annotations were designed using twelve high-level principles that guided the development of the annotation scheme and difficult decisions for annotators. We also monitored the semantic grammar that was being induced during annotation, seeking to identify and reinforce common patterns to main- tain consistent, parsimonious annotations. ii The result is a scheme of 118 hierarchical fine-grained entity types and nesting rules, covering all capitalised mentions of entities, and numerical and temporal expressions. Unlike many corpora, we have developed detailed guidelines, including extensive discussion of the edge cases, in an ongoing dia- logue with our annotators which is critical for consistency and reproducibility. We annotated independently from the PTB bracketing, allowing annotators to choose spans which were inconsistent with the PTB conventions and errors, and only refer back to it to resolve genuine ambiguity consistently. We merged our NNE with the PTB, requiring some systematic and one-off changes to both annotations. This allows the NNE corpus to complement other PTB resources, such as PropBank, and inform PTB-derived corpora for other formalisms, such as CCG and HPSG. We compare this corpus against BBN. We consider several approaches to integrating the PTB and NNE annotations, which affect the sparsity of grammar rules and visibility of syntactic and NE structure. We explore their impact on parsing the NNE and merged variants using the Berkeley parser (Petrov et al., 2006), which performs surprisingly well without specialised NER features. We experiment with flattening the NNE annotations into linear NER variants with stacked categories, and explore the ability of a maximum entropy and a CRF NER system to reproduce them. The CRF performs substantially better, but is infeasible to train on the enormous stacked category sets. The flattened output of the Berkeley parser are almost competitive with the CRF. Our results demonstrate that the NNE corpus is feasible for statistical models to reproduce. We invite researchers to explore new, richer models of (joint) parsing and NER on this complex and challenging task. Our nested named entity corpus will improve a wide range of NLP tasks, such as coreference resolution and question answering, allowing automated systems to understand and exploit the true structure of named entities. Acknowledgements The completion of this thesis has been a long journey, and I am immensely glad to have had wonderful travelling companions. Foremost thanks must go to James Curran, without whose encouragement, support and guidance I would never have embarked on this journey. Thank you for introducing me to computer science, for nurturing (and occasionally rekindling) my love of computational linguistics, and for encouraging my passion for education. Thank you to all members of Schwa Lab past and present who have helped me in so many ways. I have been incredibly fortunate to work with such a wonderful group of friends and colleagues. Thanks especially to Ben Hachey, whose extra guidance at the pointy end of things was particularly appreciated, Matt Honnibal, especially for giving me that first glimmer of hope that a lin- guistics student can add that extra ’computational’ adjective, David Vadas, Tara McIntosh, Jonathan Kummerfeld, Stephen Merity, Tim O’Keefe, Daniel Tse, Will Radford, Dominick Ng, James Constable, Glen Pink, and to Tim Dawborn, without whom I would have neither servers to run my experiments on, nor an NER system to evaluate on. Katie Bell, thank you for helping me debug my very first Python program, and for being an inspiration henceforth. Joel Nothman, thank you for the many stimulating discussions, cups of tea, con- tinued encouragement, and for embodying the epitome of a research student. My work is much the better thanks to the good example you have always set. Kellie Webster, thank you for being my thesis writing buddy, for helping me iii iv find words when I was stuck, for keeping me accountable to my deadlines, for the many hours of annotation and proof-reading and for countless other ways your presence and efforts have improved the past few years. My work rests on that of my annotators: Kellie Webster, Vivian Li, Joanne Yang and Kristy Hughes. Thank you for the many hours of discussion, and for battling through all 50,000 WSJ articles with me. I hope that, in time, you can start to enjoy reading newspaper articles again without too many flashbacks. I gratefully acknowledge the funding received from the Tempe Mann Trav- elling Scholarship, which enabled me to take a mini-sabbatical to Edinburgh, where my research was greatly improved by the efforts of Bonnie Webber, Mark Steedman and the entire computational linguistics group. Thanks also to the research communities of Cambridge, especially Stephen Clark, and the Univer- sity of Texas at Austin, especially Jason Baldridge. Thank you also to the many local academics who have advised, guided and influenced me along the way, especially Alan Fekete and Jon Patrick, and to the support, help navigating various bureaucratic hurdles, and friendship from all the SIT staff, especially Josie Spongberg and Evelyn Riegler. To the women of SIT, especially Georgina Wilcox, Emma Fitzgerald, Shagh- ayegh Sharif Nabavi, Mahboobeh Moghaddam, and Tara Babaie, thank you for always being ready with tea and chocolate. To the National Computer Science School and Girls’ Programming Network tutors, students and teachers, whose encouragement was always appreciated, even when phrased as the question: Have you finished yet? The challenges of a PhD can only be overcome when balanced by incredible friends. Thank you all for helping in more ways than you are probably aware, from much-needed stress relief, to technical support, distractions, laughter and copious amounts of chocolate. Jonathan Usmar, thank you for encouraging me to actually give this computer science thing a try. Lachlan Howe and James v Bailey, thank you for the music and camaraderie. Sophia Di Marco, thank you for always being there, and for your unfaltering confidence in me. Finally, thank you to my family for being constantly supportive, for raising me with a love of learning, and for being encouraging and patient even while not understanding my work. And to Sam, perhaps the most patient of all, who has been by my side throughout every minute of this PhD. Thank you. Statement of compliance I certify that: I have read and understood the University of Sydney Student • Plagiarism: Coursework Policy and Procedure; I understand that failure to comply with the Student Pla- • giarism: Coursework Policy and Procedure can lead to the University commencing proceedings against me for poten- tial student misconduct under Chapter 8 of the University of Sydney By-Law 1999 (as amended); this Work is substantially my own, and to the extent that any • part of this Work is not my own I have indicated that it is not my own by Acknowledging the Source of that part or those parts of the Work.
Recommended publications
  • Lmnop All Rights Reserved
    Outside the box design. pg 4 Mo Willems: LAUGH • MAKE • NURTURE • ORGANISE • PLAY kid-lit legend. pg 37 Fresh WelcomeEverywhere I turn these days there’s a little bundle on the fun with way. Is it any wonder that babies are on my mind? If they’re plastic toys. on yours too, you might be interested in our special feature pg 27 on layettes. It’s filled with helpful tips on all the garments you need to get your newborn through their first few weeks. Fairytale holdiay. My own son (now 6) is growing before my very eyes, almost pg 30 magically sprouting out of his t-shirts and jeans. That’s not the only magic he can do. His current obsession with conjuring inspired ‘Hocus Pocus’ on page 17. You’ll find lots of clever ideas for apprentice sorcerers, including the all- © Mo Willems. © time classic rabbit-out-of-a-hat trick. Littlies aren’t left out when it comes to fun. Joel Henriques offers up a quick project — ‘Fleece Play Squares’ on page Nuts and bolts. pg 9 40 — that’s going to amaze you with the way it holds their interest. If you’re feeling extra crafty, you’ll find some neat things to do with those pesky plastic toys that multiply in your home on page 27. Bright is Our featured illustrator this month is the legendary Mo the word on Willems (page 37). If you haven’t yet read one of Mo’s books the streets (and we find that hard to believe) then here’s a good place this autumn.
    [Show full text]
  • En Vogue Ana Popovic Artspower National Touring Theatre “The
    An Evening with Jazz Trumpeter Art Davis Ana Popovic En Vogue Photo Credit: Ruben Tomas ArtsPower National Alyssa Photo Credit: Thomas Mohr Touring Theatre “The Allgood Rainbow Fish” Photo Courtesy of ArtsPower welcome to North Central College ometimes good things come in small Ana Popovic first came to us as a support act for packages. There is a song “Bigger Isn’t Better” Jonny Lang for our 2015 homecoming concert. S in the Broadway musical “Barnum” sung by When she started on her guitar, Ana had everyone’s the character of Tom Thumb. He tells of his efforts attention. Wow! I am fond of telling people who to prove just because he may be small in stature, he have not been in the concert hall before that it’s a still can have a great influence on the world. That beautiful room that sounds better than it looks. Ana was his career. Now before you think I have finally Popovic is the performer equivalent of my boasts lost it, there is a connection here. February is the about the hall. The universal comment during the smallest month, of the year, of course. Many people break after she performed was that Jonny had better feel that’s a good thing, given the normal prevailing be on his game. Lucky for us, of course, he was. But weather conditions in the month of February in the when I was offered the opportunity to bring Ana Chicagoland area. But this little month (here comes back as the headliner, I jumped at the chance! Fasten the connection!) will have a great influence on the your seatbelts, you are in for an amazing evening! fine arts here at North Central College with the quantity and quality of artists we’re bringing to you.
    [Show full text]
  • Preliminary Version of the Text Analysis Component, Including: Ner, Event Detection and Sentiment Analysis
    D4.1 PRELIMINARY VERSION OF THE TEXT ANALYSIS COMPONENT, INCLUDING: NER, EVENT DETECTION AND SENTIMENT ANALYSIS Grant Agreement nr 611057 Project acronym EUMSSI Start date of project (dur.) December 1st 2013 (36 months) Document due Date : November 30th 2014 (+60 days) (12 months) Actual date of delivery December 2nd 2014 Leader GFAI Reply to [email protected] Document status Submitted Project co-funded by ICT-7th Framework Programme from the European Commission EUMSSI_D4.1 Preliminary version of the text analysis component 1 Project ref. no. 611057 Project acronym EUMSSI Project full title Event Understanding through Multimodal Social Stream Interpretation Document name EUMSSI_D4.1_Preliminary version of the text analysis component.pdf Security (distribution PU – Public level) Contractual date of November 30th 2014 (+60 days) delivery Actual date of December 2nd 2014 delivery Deliverable name Preliminary Version of the Text Analysis Component, including: NER, event detection and sentiment analysis Type P – Prototype Status Submitted Version number v1 Number of pages 60 WP /Task responsible GFAI / GFAI & UPF Author(s) Susanne Preuss (GFAI), Maite Melero (UPF) Other contributors Mahmoud Gindiyeh (GFAI), Eelco Herder (LUH), Giang Tran Binh (LUH), Jens Grivolla (UPF) EC Project Officer Mrs. Aleksandra WESOLOWSKA [email protected] Abstract The deliverable reports on the resources and tools that have been gathered and installed for the preliminary version of the text analysis component Keywords Text analysis component, Named Entity Recognition, Named Entity Linking, Keyphrase Extraction, Relation Extraction, Topic modelling, Sentiment Analysis Circulated to partners Yes Peer review Yes completed Peer-reviewed by Eelco Herder (L3S) Coordinator approval Yes EUMSSI_D4.1 Preliminary version of the text analysis component 2 Table of Contents 1.
    [Show full text]
  • Information Retrieval and Web Search
    Information Retrieval and Web Search Text processing Instructor: Rada Mihalcea (Note: Some of the slides in this slide set were adapted from an IR course taught by Prof. Ray Mooney at UT Austin) IR System Architecture User Interface Text User Text Operations Need Logical View User Query Database Indexing Feedback Operations Manager Inverted file Query Searching Index Text Ranked Retrieved Database Docs Ranking Docs Text Processing Pipeline Documents to be indexed Friends, Romans, countrymen. OR User query Tokenizer Token stream Friends Romans Countrymen Linguistic modules Modified tokens friend roman countryman Indexer friend 2 4 1 2 Inverted index roman countryman 13 16 From Text to Tokens to Terms • Tokenization = segmenting text into tokens: • token = a sequence of characters, in a particular document at a particular position. • type = the class of all tokens that contain the same character sequence. • “... to be or not to be ...” 3 tokens, 1 type • “... so be it, he said ...” • term = a (normalized) type that is included in the IR dictionary. • Example • text = “I slept and then I dreamed” • tokens = I, slept, and, then, I, dreamed • types = I, slept, and, then, dreamed • terms = sleep, dream (stopword removal). Simple Tokenization • Analyze text into a sequence of discrete tokens (words). • Sometimes punctuation (e-mail), numbers (1999), and case (Republican vs. republican) can be a meaningful part of a token. – However, frequently they are not. • Simplest approach is to ignore all numbers and punctuation and use only case-insensitive
    [Show full text]
  • Muppets Now Fact Sheet As of 6.22
    “Muppets Now” is The Muppets Studio’s first unscripted series and first original series for Disney+. In the six- episode season, Scooter rushes to make his delivery deadlines and upload the brand-new Muppet series for streaming. They are due now, and he’ll need to navigate whatever obstacles, distractions, and complications the rest of the Muppet gang throws at him. Overflowing with spontaneous lunacy, surprising guest stars and more frogs, pigs, bears (and whatevers) than legally allowed, the Muppets cut loose in “Muppets Now” with the kind of startling silliness and chaotic fun that made them famous. From zany experiments with Dr. Bunsen Honeydew and Beaker to lifestyle tips from the fabulous Miss Piggy, each episode is packed with hilarious segments, hosted by the Muppets showcasing what the Muppets do best. Produced by The Muppets Studio and Soapbox Films, “Muppets Now” premieres Friday, July 31, streaming only on Disney+. Title: “Muppets Now” Category: Unscripted Series Episodes: 6 U.S. Premiere: Friday, July 31 New Episodes: Every Friday Muppet Performers: Dave Goelz Matt Vogel Bill Barretta David Rudman Eric Jacobson Peter Linz 1 6/22/20 Additional Performers: Julianne Buescher Mike Quinn Directed by: Bill Barretta Rufus Scot Church Chris Alender Executive Producers: Andrew Williams Bill Barretta Sabrina Wind Production Company: The Muppets Studio Soapbox Films Social Media: facebook.com/disneyplus twitter.com/disneyplus instagram.com/disneyplus facebook.com/muppets twitter.com/themuppets instagram.com/themuppets #DisneyPlus #MuppetsNow Media Contacts: Disney+: Scott Slesinger Ashley Knox [email protected] [email protected] The Muppets Studio: Debra Kohl David Gill [email protected] [email protected] 2 6/22/20 .
    [Show full text]
  • Children's Television in Asia : an Overview
    This document is downloaded from DR‑NTU (https://dr.ntu.edu.sg) Nanyang Technological University, Singapore. Children's television in Asia : an overview Valbuena, Victor T 1991 Valbuena, V. T. (1991). Children's television in Asia : an overview. In AMIC ICC Seminar on Children and Television : Cipanas, September 11‑13, 1991. Singapore: Asian Mass Communiction Research & Information Centre. https://hdl.handle.net/10356/93067 Downloaded on 23 Sep 2021 22:26:31 SGT ATTENTION: The Singapore Copyright Act applies to the use of this document. Nanyang Technological University Library Children's Television In Asia : An Overview By Victor Valbuena CHILDREN'ATTENTION: ThSe S inTELEVISIONgapore Copyright Act apIpNlie s tASIAo the use: o f thiAs dNo cumOVERVIEent. NanyangW T echnological University Library Victor T. Valbuena, Ph.D. Senior Programme Specialist Asian Mass Communication Research and Information Centre Presented at the ICC - AMIC - ICWF Seminar on Children and Television Cipanas, West Java, Indonesia 11-13 September, 1991 CHILDREN'S TELEVISION IN ASIA: AN OVERVIEW Victor T. Valbuena The objective of this paper is to present an overview of children's AtelevisioTTENTION: The Snin gaiponre CAsiopyrigaht Aanct apdpl iessom to thee u seo off t histh doceu meproblemnt. Nanyang Tsec hnanologdic al issueUniversity sLib rary that beset it . By "children's television" is meant the range of programme materials made available specifically for children — from pure entertainment like televised popular games and musical shows to cartoon animation, comedies, news, cultural perform­ ances, documentaries, specially made children's dramas, science programmes, and educational enrichment programmes — and trans­ mitted during time slots designated for children.
    [Show full text]
  • Happiness Is
    HAPPINESS IS No 2 Vol 13 June 2011 VITAL Page 2 Editorial Inspriational Seminar Page 3 Parenting Night Success in this Issue Internet Safety WelcoMe To the Summer edition of We feature poetry from Gerry Galvin and Seminar happiness is Vital. Find us on facebook and both ‘Medical Matters’ and ‘dear lorraine’ Page 4 Medical Matters follow us on twitter @aidswest for interesting examine issues and concerns affecting readers and inspirational news, videos and and as always we take a look at the latest Page 5 In the News information. medical advancements and news stories Page 6 Dear Lorraine in this issue we revisit AidS West events over relating to hiV from around the world. the past few months including our highly Please check our website for details of Page 7 Poetry with Gerry Galvin successful seminar on ‘coping With upcoming events. i hope you enjoy this issue Adversity’ with inspirational speakers Joan and those long lazy days of Summer in the Page 8 Excerpt from ‘e Freeman, John lonergan and ophelia West of ireland. Keep well, keep safe and Govenor’ haanyama orum. We look back on our above all, for all of us, happiness, each and Page 9 Interview with Ophelia parenting seminar and announce plans for an every day is indeed vital. Parenting seminar tackles tough topics exciting seminar on ‘Parenting and the Tracey Ferguson Page 10 High Times internet’ with full details to be announced e: [email protected] / Find us on Page 11 Useful Services 120 parents get facts and advice on sensitive issues soon. Facebook: www.facebook.com/aidswest AidS WeST held a very successful public seminar on Monday, 21st transmitted infections, the most common being chlamydia where the of March at e Ardilaun hotel.
    [Show full text]
  • Constructing a Lexicon of Arabic-English Named Entity Using SMT and Semantic Linked Data
    Constructing a Lexicon of Arabic-English Named Entity using SMT and Semantic Linked Data Emna Hkiri, Souheyl Mallat, and Mounir Zrigui LaTICE Laboratory, Faculty of Sciences of Monastir, Tunisia Abstract: Named entity recognition is the problem of locating and categorizing atomic entities in a given text. In this work, we used DBpedia Linked datasets and combined existing open source tools to generate from a parallel corpus a bilingual lexicon of Named Entities (NE). To annotate NE in the monolingual English corpus, we used linked data entities by mapping them to Gate Gazetteers. In order to translate entities identified by the gate tool from the English corpus, we used moses, a statistical machine translation system. The construction of the Arabic-English named entities lexicon is based on the results of moses translation. Our method is fully automatic and aims to help Natural Language Processing (NLP) tasks such as, machine translation information retrieval, text mining and question answering. Our lexicon contains 48753 pairs of Arabic-English NE, it is freely available for use by other researchers Keywords: Named Entity Recognition (NER), Named entity translation, Parallel Arabic-English lexicon, DBpedia, linked data entities, parallel corpus, SMT. Received April 1, 2015; accepted October 7, 2015 1. Introduction Section 5 concludes the paper with directions for future work. Named Entity Recognition (NER) consists in recognizing and categorizing all the Named Entities 2. State of the art (NE) in a given text, corresponding to two intuitive IAJIT First classes; proper names (persons, locations and Written and spoken Arabic is considered as difficult organizations), numeric expressions (time, date, money language to apprehend in the domain of Natural and percent).
    [Show full text]
  • Menlo Park Juvi Dvds Check the Online Catalog for Availability
    Menlo Park Juvi DVDs Check the online catalog for availability. List run 09/28/12. J DVD A.LI A. Lincoln and me J DVD ABE Abel's island J DVD ADV The adventures of Curious George J DVD ADV The adventures of Raggedy Ann & Andy. J DVD ADV The adventures of Raggedy Ann & Andy. J DVD ADV The adventures of Curious George J DVD ADV The adventures of Ociee Nash J DVD ADV The adventures of Ichabod and Mr. Toad J DVD ADV The adventures of Tintin. J DVD ADV The adventures of Pinocchio J DVD ADV The adventures of Tintin J DVD ADV The adventures of Tintin J DVD ADV v.1 The adventures of Swiss family Robinson. J DVD ADV v.1 The adventures of Swiss family Robinson. J DVD ADV v.2 The adventures of Swiss family Robinson. J DVD ADV v.2 The adventures of Swiss family Robinson. J DVD ADV v.3 The adventures of Swiss family Robinson. J DVD ADV v.3 The adventures of Swiss family Robinson. J DVD ADV v.4 The adventures of Swiss family Robinson. J DVD ADV v.4 The adventures of Swiss family Robinson. J DVD ADV v.5 The adventures of Swiss family Robinson. J DVD ADV v.5 The adventures of Swiss family Robinson. J DVD ADV v.6 The adventures of Swiss family Robinson. J DVD ADV v.6 The adventures of Swiss family Robinson. J DVD AGE Agent Cody Banks J DVD AGE Agent Cody Banks J DVD AGE 2 Agent Cody Banks 2 J DVD AIR Air Bud J DVD AIR Air buddies J DVD ALA Aladdin J DVD ALE Alex Rider J DVD ALE Alex Rider J DVD ALI Alice in Wonderland J DVD ALI Alice in Wonderland J DVD ALI Alice in Wonderland J DVD ALI Alice in Wonderland J DVD ALI Alice in Wonderland J DVD ALI Alice in Wonderland J DVD ALICE Alice in Wonderland J DVD ALL All dogs go to heaven J DVD ALL All about fall J DVD ALV Alvin and the chipmunks.
    [Show full text]
  • Fasttext-Based Intent Detection for Inflected Languages †
    information Article FastText-Based Intent Detection for Inflected Languages † Kaspars Balodis 1,2,* and Daiga Deksne 1 1 Tilde, Vien¯ıbas Gatve 75A, LV-1004 R¯ıga, Latvia; [email protected] 2 Faculty of Computing, University of Latvia, Rain, a blvd. 19, LV-1586 R¯ıga, Latvia * Correspondence: [email protected] † This paper is an extended version of our paper published in 18th International Conference AIMSA 2018, Varna, Bulgaria, 12–14 September 2018. Received: 15 January 2019; Accepted: 25 April 2019; Published: 1 May 2019 Abstract: Intent detection is one of the main tasks of a dialogue system. In this paper, we present our intent detection system that is based on fastText word embeddings and a neural network classifier. We find an improvement in fastText sentence vectorization, which, in some cases, shows a significant increase in intent detection accuracy. We evaluate the system on languages commonly spoken in Baltic countries—Estonian, Latvian, Lithuanian, English, and Russian. The results show that our intent detection system provides state-of-the-art results on three previously published datasets, outperforming many popular services. In addition to this, for Latvian, we explore how the accuracy of intent detection is affected if we normalize the text in advance. Keywords: intent detection; word embeddings; dialogue system 1. Introduction and Related Work Recent developments in deep learning have made neural networks the mainstream approach for a wide variety of tasks, ranging from image recognition to price forecasting to natural language processing. In natural language processing, neural networks are used for speech recognition and generation, machine translation, text classification, named entity recognition, text generation, and many other tasks.
    [Show full text]
  • Big Dreams Sparked by a Spirited Girl Muppet
    GLOBAL GIRLS’ EDUCATION Big Dreams Sparked by a Spirited Girl Muppet Globally, an estimated 510 million women grow up unable to read and write – nearly twice the rate of adult illiteracy as men.1 To counter this disparity in countries around the world, there’s Sesame Street. Local adaptations of Sesame Street are opening minds and doors for eager young learners, encouraging girls to dream big and gain the skills they need to succeed in school and life. We know these educational efforts yield benefits far beyond girls’ prospects. They produce a ripple effect that advances entire families and communities. Increased economic productivity, reduced poverty, and lowered infant mortality rates are just a few of the powerful outcomes of educating girls. “ Maybe I’ll be a police officer… maybe a journalist… maybe an astronaut!” Our approach is at work in India, Bangladesh, Nigeria, Egypt, South Africa, Afghanistan, and many other developing countries where educational and professional opportunities for women are limited. — Khokha Afghanistan BAGHCH-E-SIMSIM Bangladesh SISIMPUR Brazil VILA SÉSAMO China BIG BIRD LOOKS AT THE WORLD Colombia PLAZA SÉSAMO Egypt ALAM SIMSIM India GALLI GALLI SIM SIM United States Indonesia JALAN SESAMA Israel RECHOV SUMSUM Mexico PLAZA SÉSAMO Nigeria SESAME SQUARE Northern Ireland SESAME TREE West Bank / Gaza SHARA’A SIMSIM South Africa TAKALANI SESAME Tanzania KILIMANI SESAME GLOBAL GIRLS’ EDUCATION loves about school: having lunch with friends, Watched by millions of children across the Our Approach playing sports, and, of course, learning new country, Baghch-e-Simsim shows real-life girls things every day. in situations that have the power to change Around the world, local versions of Sesame gender attitudes.
    [Show full text]
  • Winter/Spring Season
    WINTER/SPRING SEASON Jan 23–24 eighth blackbird Hand Eye __________________________________________ Jan 28–30 Toshiki Okada/chelfitsch God Bless Baseball __________________________________________ Feb 4 and 6–7 Ingri Fiksdal, Ingvild Langgård & Signe Becker Cosmic Body __________________________________________ Feb 11–14 Faye Driscoll Thank You For Coming: Attendance __________________________________________ Feb 18–27 Tim Etchells/Forced Entertainment The Notebook, Speak Bitterness, and (In) Complete Works: Table Top Shakespeare __________________________________________ Mar 5–6 Joffrey Academy of Dance Winning Works __________________________________________ Mar 25–26 eighth blackbird featuring Will Oldham (Bonnie “Prince” Billy) Ghostlight __________________________________________ Mar 31–Apr 3 Blair Thomas & Co. Moby Dick __________________________________________ Apr 7–10 Teatrocinema Historia de Amor (Love Story) __________________________________________ Apr 12 and 14–16 Taylor Mac The History of Popular Music __________________________________________ Apr 28–May 1 Kyle Abraham/ Abraham.In.Motion When the Wolves Came In Museum of Contemporary Art Chicago Mar 31–Apr 3, 2016 Technical director Jim Moore Hurdy-gurdy coffin and Erik Newman Blair Thomas & Co. wind machine constructor Puppet builder and Tyler Culligan Moby Dick set constructor or The Brotherhood of the Monastic Costume constructor Uber Costume ____________________________________________ Order of Ancient Mariners Purges Additional puppet and set construction by Hannah
    [Show full text]