arXiv:2107.04929v1 [cs.CL] 10 Jul 2021 Rying High Degrees of Symbolic and Indexical Meaning Metaphoricity, and Their Dependence on Context [11]
Total pages: 16
File type: PDF, size: 1020 KB
Recommended publications
-
James Bond in the 1980s
Marxism Today, June 1983, p. 37. James Bond in the 1980s, by Tony Bennett.
Casino Royale, the first Bond novel, was published in 1953 when, as David Cannadine has observed, the coronation celebrations provided 'a retrospectively unconvincing reaffirmation of Britain's continued world power status'. The last of Fleming's novels, The Man with the Golden Gun, appeared in 1965, the year of Churchill's funeral, 'self-consciously recognised as the requiem for Britain as a great power'.1 The same period also witnessed a gradual relaxation in the relations between East and West as the acute tensions of the cold war period gave way to the less troubled atmosphere of detente. There had been significant changes in the sphere of gender and sexual relations, too. In the 'permissive society' of the early 1960s,
It is no accident that the Daily Express serialized this particular novel, in which Bond is pitted against the Soviet espionage organisation SMERSH, as, in the late 1950s, Bond functioned primarily as a political hero for the lower middle classes. An exemplary figure of the cold war era, he personified the virtues of Western individualism, effortlessly triumphing over the leaden hand of Soviet bureaucracy. Equally important, particularly after the Suez fiasco and humiliation, Bond offered an imaginary outlet for a historically blocked jingoism. His exploits - warding off the communist threat to world peace single-handedly, reducing the USA, in the person of Felix Leiter, to the role of
scene, a case of history repeating itself that would be farcical were it not so tragic. Great power illusions have been kicked into a macabre half-life again in the aftermath of the Falklands crisis.
-
Corpora: Google Ngram Viewer and the Corpus of Historical American English
EuroAmerican Journal of Applied Linguistics and Languages (E-JournALL), Volume 1, Issue 1, November 2014, pages 48-68. ISSN 2376-905X. DOI: http://dx.doi.org/10.21283/2376905X.1.4. www.e-journall.org
Exploring mega-corpora: Google Ngram Viewer and the Corpus of Historical American English. Eric Friginal (Department of Applied Linguistics and ESL, Georgia State University), Marsha Walker (Language Institute, Georgia Institute of Technology), Janet Beth Randall (American Language Institute, New York University, Tokyo). Received 10 December 2013; received in revised form 17 May 2014; accepted 8 August 2014.
ABSTRACT (EN) The creation of internet-based mega-corpora such as the Corpus of Contemporary American English (COCA), the Corpus of Historical American English (COHA) (Davies, 2011a) and the Google Ngram Viewer (Cohen, 2010) signals a new phase in corpus-based research that provides both novice and expert researchers immediate access to a variety of online texts and time-coded data. This paper explores the applications of these corpora in the analysis of academic word lists, in particular, Coxhead’s (2000) Academic Word List (AWL). Coxhead (2011) has called for further research on the AWL with larger corpora, noting that learners’ use of academic vocabulary needs to be addressed for the AWL to be useful in various contexts. Results show that words on the AWL are declining in overall frequency from 1990 to the present. Implications about the AWL and future directions in corpus-based research utilizing mega-corpora are discussed. Keywords: Google N-gram Viewer, Corpus of Historical American English, mega-corpora, trend studies.
ABSTRACT (ES, translated from Spanish) The creation of internet-based mega-corpora, such as the Corpus of Contemporary American English (COCA), the Corpus of Historical American English (COHA) (Davies, 2011a) and the Google Ngram Viewer (Cohen, 2010), announces a new phase in corpus-based research, since they give both novice and expert researchers immediate access to a wide variety of online texts and time-coded data.
-
The 007Th Minute Ebook Edition
Jacques Stewart. “What a load of crap. Next time, mate, keep your drug tripping private.” A person on Facebook. “What utter drivel” Another person on Facebook. “I may be in the minority here, but I find these editorial pieces to be completely unreadable garbage.” Guess where that one came from. “No, you’re not. Honestly, I think of this the same Bond thinks of his obituary by M.” Chap above’s made a chum. This might be what Facebook is for. That’s rather lovely. Isn’t the internet super? “I don’t get it either and I don’t have the guts to say it because I fear their rhetoric or they’d might just ignore me. After reading one of these I feel like I’ve walked in on a Specter round table meeting of which I do not belong. I suppose I’m less a Bond fan because I haven’t read all the novels. I just figured these were for the fans who’ve read all the novels including the continuation ones, fan’s of literary Bond instead of the films. They leave me wondering if I can even read or if I even have a grasp of the language itself.” No comment.
This ebook is not for sale but only available as a free download at Commanderbond.net. If you downloaded this ebook and want to give something in return, please make a donation to UNICEF, or any other cause of your personal choice. Trespassers will be masticated. Fnarr.
A commanderbond.net ebook, brought to you by commanderbond.net. Jacques I.
-
Exploring Swedish & English fastText Embeddings for NER with the Transformer
arXiv:2007.16007v2 [cs.CL] 17 Apr 2021. Exploring Swedish & English fastText Embeddings for NER with the Transformer. Tosin P. Adewumi, Foteini Liwicki & Marcus Liwicki, [email protected], EISLAB, SRT Department, Luleå University of Technology, Sweden.
Abstract: In this paper, our main contributions are that embeddings from relatively smaller corpora can outperform ones from larger corpora and we make the new Swedish analogy test set publicly available. To achieve a good network performance in natural language processing (NLP) downstream tasks, several factors play important roles: dataset size, the right hyper-parameters, and well-trained embeddings. We show that, with the right set of hyper-parameters, good network performance can be reached even on smaller datasets. We evaluate the embeddings at both the intrinsic and extrinsic levels. The embeddings are deployed with the Transformer in a named entity recognition (NER) task and significance tests are conducted. This is done for both Swedish and English. We obtain better performance in both languages on the downstream task with smaller training data, compared to recently released Common Crawl versions; and character n-grams appear useful for Swedish, a morphologically rich language. Keywords: Embeddings, Transformer, Analogy, Dataset, NER, Swedish.
1. Introduction: The embedding layer of neural networks may be initialized randomly or replaced with pre-trained vectors, which act as lookup tables. One such pre-trained vector tool is fastText, introduced by Joulin et al. (2016). The main advantages of fastText are speed and performance competitive with the state of the art (SotA). Using pre-trained embeddings in deep networks like the Transformer can improve performance.
-
A Collection of Swedish Diachronic Word Embedding Models Trained on Historical Newspaper Data
A Collection of Swedish Diachronic Word Embedding Models Trained on Historical Newspaper Data. Data paper. Simon Hengchen, Nina Tahmasebi. (Author affiliations can be found in the back matter of this article.)
Corresponding author: Simon Hengchen, Språkbanken Text, Department of Swedish, University of Gothenburg, SE, [email protected]
Abstract: This paper describes the creation of several word embedding models based on a large collection of diachronic Swedish newspaper material available through Språkbanken Text, the Swedish language bank. This data was produced in the context of Språkbanken Text’s continued mission to collaborate with humanities and natural language processing (NLP) researchers and to provide freely available language resources, for the development of state-of-the-art NLP methods and tools.
Keywords: word embeddings; semantic change; newspapers; diachronic word embeddings
To cite this article: Hengchen, S., & Tahmasebi, N. (2021). A Collection of Swedish Diachronic Word Embedding Models Trained on Historical Newspaper Data. Journal of Open Humanities Data, 7: 2, pp. 1–7. DOI: https://doi.org/10.5334/johd.22
1 Overview: We release diachronic word2vec (Mikolov et al., 2013) and fastText (Bojanowski et al., 2017) models in their skip-gram with negative sampling (SGNS) architecture. The models are trained on 20-year time bins, with two temporal alignment strategies: independently-trained models for post-hoc alignment (as introduced by Kulkarni et al. 2015), and incremental training (Kim et al., 2014). In the incremental scenario, a model for t1 is trained and saved, then updated with the data from t2.
-
The Lexiculture Papers: English Words and Culture
Wayne State University, Anthropology Faculty Research Publications, Anthropology, 2021. The Lexiculture Papers: English Words and Culture. Stephen Chrisomalis, Wayne State University, [email protected]
Follow this and additional works at: https://digitalcommons.wayne.edu/anthrofrp
Part of the Anthropological Linguistics and Sociolinguistics Commons, Digital Humanities Commons, Discourse and Text Linguistics Commons, and the Linguistic Anthropology Commons.
Recommended Citation: Chrisomalis, Stephen (editor). 2021. The Lexiculture Papers: English Words and Culture. https://digitalcommons.wayne.edu/anthrofrp/4/
This Book is brought to you for free and open access by the Anthropology at DigitalCommons@WayneState. It has been accepted for inclusion in Anthropology Faculty Research Publications by an authorized administrator of DigitalCommons@WayneState.
The Lexiculture Papers: English Words and Culture, edited by Stephen Chrisomalis, Wayne State University, 2021.
Contents:
Introduction (Stephen Chrisomalis)
Lexiculture: A Student Perspective (Agata Borowiecki)
Akimbo (Agata Borowiecki)
Anymore (Jackie Lorey)
Artisanal (Emelyn Nyboer)
Aryan (David Prince)
Big-ass (Nicole Markovic)
Bromance (Alistair King)
Cafeteria (Deanna English)
Childfree (Virginia Nastase)
Chipotle (Gabriela Ortiz)
Cyber (Matthew Worpell)
Discotheque (Ashley Johnson)
Disinterested (Hope Kujawa)
Douche (Aila Kulkarni)
Dwarf (John Anderson)
Eleventy (Julia Connally)
Erectile Dysfunction (Corrinne Sanger)
Eskimo (Valerie Jaruzel)
Expat (Michelle Layton)
Feisty (Kathryn Horner)
-
Google Ngram Viewer by Andrew Weiss
Google Ngram Viewer. By Andrew Weiss, Digital Services Librarian, California State University, Northridge.
Introduction: Google, mass digitization and the emerging science of culturomics. Google Books famously launched in 2004 with great fanfare and controversy. Two stated original goals for this ambitious digitization project were to replace the generic library card catalog with their new “virtual” one and to scan all the books that have ever been published, which by a Google employee’s reckoning is approximately 129 million books (Google 2014; Jackson 2010). Now ten years in, the Google Books mass-digitization project claimed, as of May 2014, to have scanned over 30 million books, a little less than 24% of its stated goal. Considering the enormity of the vision for creating this new type of massive digital library, the progress has been astounding. However, controversies have also dominated the headlines. At the time of the Google announcement in 2004, Jean-Noël Jeanneney, then head of the Bibliothèque nationale de France, famously called out Google’s strategy as excessively Anglo-American and dangerously corporate-centric (Jeanneney 2007). Entrusting the digitization and preservation of a diverse world-wide corpus of books to a corporation, Jeanneney argues, is a dangerous game, especially when ascribing a bottom-line economic equivalent figure to cultural works of inestimable personal, national and international value. In the ten years since his objections were raised, many of his comments have remained prescient and relevant to the problems and issues the project currently faces. Google Books still faces controversies related to copyright, as seen in the current appeal of the Author’s Guild v. -
Google N-Gram Viewer Does Not Include Arabic Corpus! Towards N-Gram Viewer for Arabic Corpus
The International Arab Journal of Information Technology, Vol. 15, No. 5, September 2018, p. 785. Google N-Gram Viewer does not Include Arabic Corpus! Towards N-Gram Viewer for Arabic Corpus. Izzat Alsmadi (Department of Computing and Cyber Security, Texas A&M University, USA) and Mohammad Zarour (Information Systems Department, Prince Sultan University, KSA).
Abstract: Google N-gram viewer is one of the newly published Google services. Google archived or digitized a large number of books in different languages, and populated the corpora from over 5 million books published up to 2008. This Google service allows users to enter queries of words. The tool then charts time-based data that show the frequency of usage of the query words. Although Arabic is one of the most widely spoken languages in the world, it is not included among the corpora indexed by the Google N-gram viewer. This research work discusses the development of a large Arabic corpus and its indexing using N-grams so that it can be included in Google N-gram viewer. A showcase is presented to build a dataset to initiate the process of digitizing the Arabic content and prepare it to be incorporated in Google N-gram viewer. One of the major goals of including Arabic content in Google N-gram is to enrich Arabic public content, which has been very limited in comparison with the number of people who speak Arabic. We believe that adopting the Arabic language in Google N-gram viewer can significantly benefit researchers in different fields related to Arabic language and social sciences. Keywords: Arabic language processing, corpus, Google N-gram viewer. -
Never Say Never Again
Svensk Filmdatabas: Never Say Never Again
Production details:
Production company: Woodcote. A Taliafilm Production in association with Produc ()
Distributor in Sweden (35 mm): AB Svensk Filmindustri ()
Director: Irvin Kershner
Screenplay: Lorenzo Semple, Jr.
Story: Kevin McClory, Jack Whittingham, Ian Fleming
Producer: Jack Schwartzman
Cinematography: Douglas Slocombe
Music: Michel Legrand
Production design: Philip Harrison, Stephen Grimes
Editing: Ian Crafford, Robert Lawrence
Picture format: Panavision (2.35:1)
Color system: Technicolor
Sound system: Dolby Stereo Spectral Recording
Original length in minutes: 137
Censorship number: 124.403
Date: 1983.11.24
Age rating: permitted from age 15
Length: 3681 metres
Swedish premiere:
1983-12-16, Draken, Göteborg, Sweden, 134 minutes
1983-12-16, Scania 1, Malmö, Sweden, 134 minutes
1983-12-16, Draken, Stockholm, Sweden, 134 minutes
1983-12-16, Look, Stockholm, Sweden, 134 minutes
1983-12-16, Lyran, Stockholm, Sweden, 134 minutes
TV broadcasts:
1995-09-17, TV3, Sweden, 129 minutes
2000-08-27, TV3, Sweden, 129 minutes
2001-04-20, TV3, Sweden, 129 minutes
2004-12-18, Kanal 5, Sweden, 129 minutes
Cast:
Sean Connery: James Bond
Klaus Maria Brandauer: Largo
Max von Sydow: Blofeld
Barbara Carrera: Fatima
Kim Basinger: Domino
Bernie Casey: Felix Leiter
Alec McCowen: Q
Edward Fox: M
Pamela Salem: Miss Moneypenny
Rowan Atkinson: Small-Fawcett
Valerie Leon: Lady in the Bahamas
Milan Kirek: Kovacs
Pat Roach: Lippe
Anthony Sharp: Lord Ambrose
Prunella Gee: Patricia
Gavan O'Herlihy: Jack Petachi
Ronald Pickup: Elliot
Robert Rietty, Guido Adorni: Italian ministers
Vincent Marzello: Culpepper
Christopher
-
James Bond's Films
James Bond’s Films. Picture: all-free-download.com
October 5th, 2012 is the Global James Bond Day, celebrating the 50th anniversary of the first James Bond film: Dr. No (ISAN 0000-0000-FD39-0000-2-0000-0000-V), released in the UK on October 5th, 1962. James Bond, code name 007, is a fictional character created in 1953 by writer Ian Fleming, who featured him in twelve novels and two short story collections. Since 1962, 25 theatrical films have been produced around the character of James Bond. Sean Connery was the first James Bond; he also played the role in 6 other films. While waiting for the next James Bond film, Skyfall, directed by Sam Mendes, starring Daniel Craig and due for release in October 2012, ISAN-IA joins the celebration of the Global James Bond Day by publishing a list of the past films dedicated to the most popular secret agent, with their identifying ISANs. Happy Birthday, Mr. Bond!
ISAN International Agency, 1 A rue du Beulet, CH-1203 Geneva, Switzerland. Website: www.isan.org - Tel.: +41 22 545 10 00 - email: [email protected]
Year | Original Title | Director | Actor as James Bond | ISAN
1962 | Dr. No | Terence Young | Sean Connery | 0000-0000-FD39-0000-2-0000-0000-V
1963 | From Russia with Love | Terence Young | Sean Connery | 0000-0000-D3B0-0000-6-0000-0000-J
1964 | Goldfinger | Guy Hamilton | Sean Connery | 0000-0000-39C7-0000-P-0000-0000-0
1965 | Thunderball | Terence Young | Sean Connery | 0000-0000-D3B1-0000-B-0000-0000-4
1967 | You Only Live Twice | Lewis Gilbert | Sean Connery | 0000-0000-7BDF-0000-L-0000-0000-B
1967 | Casino Royale | Ken Hughes, John Huston, Joseph McGrath, Robert Parrish, Val Guest, Richard Talmadge | David Niven | 0000-0000-1C49-0000-4-0000-0000-P
1969 | On Her Majesty's Secret Service | Peter R.
-
Mining Semantics for Culturomics: Towards a Knowledge-Based Approach
Mining semantics for culturomics: towards a knowledge-based approach. Borin, Lars; Dubhashi, Devdatt; Forsberg, Markus; Johansson, Richard; Kokkinakis, Dimitrios; Nugues, Pierre. Published in: UnstructureNLP '13 Proceedings of the 2013 international workshop on Mining unstructured big data using natural language processing. DOI: 10.1145/2513549.2513551. 2013.
Citation for published version (APA): Borin, L., Dubhashi, D., Forsberg, M., Johansson, R., Kokkinakis, D., & Nugues, P. (2013). Mining semantics for culturomics: towards a knowledge-based approach. In UnstructureNLP '13 Proceedings of the 2013 international workshop on Mining unstructured big data using natural language processing (pp. 3-10). Association for Computing Machinery (ACM). https://doi.org/10.1145/2513549.2513551
Total number of authors: 6
General rights: Unless other specific re-use rights are stated the following general rights apply: Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights.
• Users may download and print one copy of any publication from the public portal for the purpose of private study or research.
• You may not further distribute the material or use it for any profit-making activity or commercial gain.
• You may freely distribute the URL identifying the publication in the public portal.
Read more about Creative Commons licenses: https://creativecommons.org/licenses/
Take down policy: If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim.
-
Breaking the Cycle: Die Another Day, Post-colonialism, and the James Bond Film Series
Southern Illinois University Carbondale, OpenSIUC. Articles, Department of Cinema and Photography. Spring 2004. Breaking the Cycle: Die Another Day, Post-colonialism, and the James Bond Film Series. Walter C. Metz, Southern Illinois University Carbondale, [email protected]
Follow this and additional works at: http://opensiuc.lib.siu.edu/cp_articles
Recommended Citation: Metz, Walter C. "Breaking the Cycle: Die Another Day, Post-colonialism, and the James Bond Film Series." ZAA: Zeitschrift für Anglistik und Amerikanistik 52, No. 1 (Spring 2004): 63-77. doi:10.1515/zaa.2004.52.1.63.
This Article is brought to you for free and open access by the Department of Cinema and Photography at OpenSIUC. It has been accepted for inclusion in Articles by an authorized administrator of OpenSIUC. For more information, please contact [email protected].
“Breaking the Cycle”: Die Another Day, Post-Colonialism, and the James Bond Film Series. By Walter Metz. Published in: ZAA: Zeitschrift für Anglistik und Amerikanistik. 52.1 [Spring 2004]. 63-77.
As delighted to watch things blowing up as the next guy, I went to see the latest James Bond installment, Die Another Day (2002), with not much in the way of expectations beyond bombs. As long as the film delivered the usual Bond nonsense--exotic locales, witty one-liners, and explosions--I would be happy enough. On this level, the film does not disappoint: it begins with a stunt-filled sequence in which Bond (played by Pierce Brosnan) surfs twenty-foot waves (more on this later) to land on a beach in North Korea. Bond, we learn, is investigating the trading of African conflict diamonds for hovercraft [1] that glide over the land mines (ironically labeled as “America’s cultural contribution”) in the demilitarized zone, to be used by Colonel Moon, the rebellious son of a top-ranking North Korean general.