Arxiv:2108.10755V1 [Cs.CL]

Total Page:16

File Type:pdf, Size:1020Kb

Arxiv:2108.10755V1 [Cs.CL] More Than Words: Collocation Tokenization for Latent Dirichlet Allocation Models Jin Cheevaprawatdomrong Alexandra Schofield Attapol T. Rutherford Chulalongkorn University Harvey Mudd College Chulalongkorn University [email protected] [email protected] [email protected] Abstract without marked word boundaries, such as Chinese and Thai: tokenizers for these languages may Traditionally, Latent Dirichlet Allocation (LDA) ingests words in a collection of doc- split conceptual units into segments that, while uments to discover their latent topics using functional as standalone words, do not express word-document co-occurrences. However, it the concept of the original text. Meaningful is unclear how to achieve the best results for interpretation of topics can be lost without careful languages without marked word boundaries recombination of these words. such as Chinese and Thai. Here, we explore In this paper, we evaluate three techniques to the use of Pearson’s chi-squared (χ2) test, t- merge multiple adjacent words into conceptually- statistics, and Word Pair Encoding (WPE) to produce tokens as input to the LDA model. unified phrasal tokens prior to LDA model infer- The χ2, t and WPE tokenizers are trained on ence: Pearson Chi-square test, t-statistic, and word Wikipedia text to look for words that should pair encoding (WPE). We apply merging strate- be grouped together, such as compound nouns, gies to different language families including Indo- proper nouns, and complex event verbs. We European language (English), Kra-Dai language propose a new metric for measuring the clus- (Thai) and Sinitic language (Chinese). Inspired tering quality in settings where the vocabular- by silhouette coefficients, we also introduce a new ies of the models differ. Based on this met- ric and other established metrics, we show that method to assess the coherence of topics in a set- topics trained with merged tokens result in ting with variable vocabularies caused by different topic keys that are clearer, more coherent, and pre-processing treatments, which was not possible more effective at distinguishing topics than with previously proposed methods. Using this new those unmerged models. metric and existing topic model evaluations, we 1 Introduction find that all three approaches to merging adjacent words can improve the likelihood, coherence, and Latent Dirichlet allocation (LDA) models topic distinctiveness of LDA models. (Blei et al., 2003) provide useful insights into themes and trends in a large text collection 2 Related Work through the unsupervised inference of topics, or arXiv:2108.10755v1 [cs.CL] 24 Aug 2021 probability distributions over unigram word types Despite their popularity in analyzing large in the corpus. In this model, a topic is often amounts of text data, LDA models are notori- interpreted based on its highest-probability words, ously complex to evaluate. One must evaluate with documents expressed in terms of proportions both the statistical fit of a model and the human- of each topic. Unfortunately, the context in registered thematic coherence of the words found which these tokens arise can be obscured in the to arise in the high-probability words, or keys, bag-of-words rendering of text as unigram counts of a topic, which may not correlate (Chang et al., in documents. For instance, a topic with high 2009). Analyses often combine evaluations of fit probabilities of both “coffee” and “table” is tempt- (Wallach et al., 2009) and automated approxima- ing to interpret as focusing on the furniture item tions of human judgments of coherence (Bouma, “coffee table”, but both words could be frequent 2009; Mimno et al., 2011) based on mutual infor- in a discussion of cafes containing no coffee mation, even with the expectation these may only tables. This problem is amplified in languages somewhat correlate with true human judgments (Lau et al., 2014). A limitation of these existing that even a few co-occurrences can trigger signifi- approaches, however, is that they expect the vo- cance. cabulary and tokenization to remain constant be- Taking inspiration from byte-pair encoding, or tween two models. For our evaluation, we use BPE, we propose an alternative to obtain word- a normalized log likelihood approach to capture pair encoding (WPE) tokens. To do this, we first fit while accounting for changes in vocabulary tokenize a large corpus and then collect bigram (Schofield and Mimno, 2016). counts for all bigrams found in the corpus. Sec- Pre-processing steps can meaningfully al- ond, we merge the most frequent bigram to form a ter the results of the LDA models even in new WPE token. This new bigram is then treated languages with good tokenization heuristics as a word in all occurrences. Next, we continue to such as English (Schofield and Mimno, 2016; repeat the counting and merging process with one Schofield et al., 2017). We believe that languages extra word type. Finally, we obtain a vocabulary that do not have clear tokenization standards de- list of both unigram and WPE tokens. serve investigation into what kind of process- ing is appropriate. Many works recognize that 4 Evaluation Metrics LDA results can be improved when input are Held-Out Likelihood. When multi-word phrases including phrases (Lindsey et al., 2012; Yu et al., are converted to individual tokens, the number of 2013; El-Kishky et al., 2014; Wang et al., 2016; tokens in the document decreases while the size Bin et al., 2018; Li et al., 2018). We consider it of the corpus vocabulary increases. It is therefore valuable to specifically assess approaches to deter- illogical to compare the likelihoods of the word- mining these phrases. token model and WPE-token model directly. In order to normalize the scores between the 3 Collocations and Word Pair Encoding two models that do not have the exact same vo- Collocations consist of two or more words that can cabulary and tokens, we use the log-likelihood express conventional meaning. Since collocations ratio between the LDA model likelihood and can convey information about multi-word entities, the null (unigram) likelihood for each model. context, and word usage, we hypothesize that the In other words, we normalize the LDA model L introduction of multi-word tokens, which capture likelihood ( model) by dividing it with the un- L collocations as unigrams through concatenation, igram likelihood ( unigram) as introduced by can help achieve more useful and coherent topic Schofield and Mimno (2016): models. For languages that do not have clear word boundaries, there is a possible additional benefit to L − L log model log unigram multi-word tokens: it can be hard to intuit whether PTLLnorm = (1) N inferred word boundaries will have a large impact on the final results. Merging adjacent words into New Metric: Concatenation-based Embed- ’multi-word’ tokens may help remedy the potential ding Silhouette (CBES) Previous measures of problem of a segmentation that is not optimal for topic coherence rely on statistics from the training topic modeling purposes. data and assume that the vocabularies are identical Many methods are possible to select colloca- for both models, which is not the case for our set- tions from tokenized text, such as frequency, mean tings. To address this, we propose a new metric and variance, and statistical hypothesis testing. In called a concatenation-based embedding silhou- this paper, we evaluate Pearson’s chi-squared test ette (CBES), which measures the coherence within (χ2) and the t-statistic for word co-occurrence, the same topic and also the distinguishability of two hypothesis tests to determine if two words different topics in the LDA results. CBES extends are collocated significantly more than would oc- silhouette coefficients (Rousseeuw, 1987), a com- cur randomly. To implement these tests, we use mon clustering evaluation metric, by projecting to- the NLTK package to compute (Bird, 2006). We kens and multiword tokens into the same space impose a minimum frequency in the corpus for and computing the silhouette coefficients in this each selected bigram: otherwise, top bigrams from vector space in the usual way. the χ2 test will contain only exceptional rare A good topic should have all of their topic words, as these are expected to co-occur so rarely keys close to each other and away from other %merged χ2: debes jugar, euskaltel euskadi, taare zameen, chetro Corpus Docs Tokens χ2 t WPE ketl, hetch hetchy, ngwat mahop, mullum malarum, NYTimes 80K 2M 15.43 15.99 16.55 pazz jop, phnom penh, eisernen kreuzes, sirimalle SOTU 21K 1M 12.21 12.90 13.53 chettu, kasa vubu, moondram pirai, gjems onstad, Yelp 200K 13M 9.47 10.33 12.16 lettow vorbeck, pather panchali, ioann zlatoust, kud TNC 2K 4M 11.43 11.50 8.40 wafter, poquita ropa, viribus unitis BEST 4K 6M 12.43 12.50 9.53 t: united states, new york, world war, km h, take place, Wongnai 40K 8M 5.68 5.71 4.48 miles km, los angeles, united kingdom, first time, high Prachathai 68K 119M 13.41 13.45 10.36 school, tropical storm, new zealand, war ii, video game, Chinanews 100K 3M 12.62 14.35 10.62 mph km, h mph, north america, air force, two years, Dianping 100K 4M 3.22 3.84 2.59 peak number Douban 200K 2M 4.61 5.18 3.99 WPE: unite state, new york, take place, first time, unite kingdom, follow year, world war ii, also know, next Table 1: A survey of corpora providing the number day, new york city, high school, los angeles, north amer- of documents and tokens, as well as the percentage of ica, even though, new zealand, follow day, become first, also use, year old, take part unigram tokens merged using each approach. Figure 1: Different collocation scoring methods result in different top 20 English collocations.
Recommended publications
  • Download Kud Wafter Digital Version Kud Wafter
    download kud wafter digital version Kud Wafter. The Little Busters' trip was a complete success, and summer vacation has finally begun. Everyone heads home for a bit to see their families. Everyone, that is, except for Riki (who has no family) and Kud (whose family lives in another country). One day, an accident occurs that floods the men's dormitory, which leaves Riki with no place to stay. Luckily, Kud is willing to lend a hand, and she's also looking for a roommate. A spin-off of Little Busters! that focuses on Noumi Kudryavka and Riki Naoe's after story. Difficult Files Site. Shiina is a cheerful, energetic girl without a hint of shyness around strangers. Kanon Air Clannad Planetarian: God's Blessing on this Wonderful World! See All Buying Options. See and discover other items: Archived from the original on June 28, Pages with related products. Uploader: Shaktizilkree Date Added: 28 October 2015 File Size: 64.70 Mb Operating Systems: Windows NT/2000/XP/2003/2003/7/8/10 MacOS 10/X Downloads: 71315 Price: Free* [ *Free Regsitration Required ] The single of " One's Future " was released in April Original Events and Visuals will be Added] in Japanese. PC Game Key KUD Wafter Limited Japan Windows Little Busters Kudryavka Stock. Retrieved July 21, Much of its gameplay is spent on reading the story's narrative and dialogue. Retrieved September 19, Learn more about Amazon Prime. After the release of Little Busters! Download Alexa for your Windows 10 PC for free. Holland Youth Shin Megami Tensei: Views Read Edit View history.
    [Show full text]
  • Out of Base Mini-Blinds at the Mokapu Mall Grand Information, Inhabited the Tickets and Opening
    INSIDE SEMPER FIT MARINE/SAILOR OF Friends of the Marina A-2 THE YEAR AESC Sholarship A-8 B4 Semper Fit Opening B-1 A-8 L va, 27# No. 16 Serving the base of choice for the 21st century April 3011998= y on the Shoppers ring line .. flock to new mall swung open to the public LCpI. Trent Lowry the first time, though. Combat Correspondent "This is so convenient and Bustling like a beehive, close to home," said Mary the official opening of the Ann Juarez, a family mem- Mokapu Mall here ber. "You really don't have Saturday was overrun by to go off base to shop any- swarms of more. You shoppers really can't eager to see ask for any- all the com- thing else." plex has to M W R. offer. planned Merchants events for the and customers entire day, alike were keeping cus- excited with tomers happy Digital photo by Cpl. Barry Melton the potential until 11 p.m. for success the Adults were 3/3 conducts SPT Mall promis- able to get LCpI. Michael Roby, a fire team leader with I Company 3d Battalion, 3d Marines, is next in line to fire the M-203 grenade launcher during es. haircuts at Standard Proficiency Test training at Schofield Barracks. The Marines were graded on their ability to fire various weapons by officers and staff "This is such the new bar- NCOs of 2/3, and 3/3 will use the SPT in preparation for its upcoming deployment in December. a fabulous ber shop building and while the kids center, I feel played the very fortunate new Jet Ski to be here," video game at Housing gets the lead said Pam Digital photo by LCpI.
    [Show full text]
  • Romantic Love and Narrative Form in Japanese Visual Novels and Romance Adventure Games
    arts Article From Novels to Video Games: Romantic Love and Narrative Form in Japanese Visual Novels and Romance Adventure Games Kumiko Saito Department of Languages, Clemson University, Clemson, SC 29634, USA; [email protected] Abstract: Video games are powerful narrative media that continue to evolve. Romance games in Japan, which began as text-based adventure games and are today known as bishojo¯ games and otome games, form a powerful textual corpus for literary and media studies. They adopt conventional literary narrative strategies and explore new narrative forms formulated by an interface with computer- generated texts and audiovisual fetishism, thereby challenging the assumptions about the modern textual values of storytelling. The article first examines differences between visual novels that feature female characters for a male audience and romance adventure games that feature male characters for a female audience. Through the comparison, the article investigates how notions of romantic love and relationship have transformed from the modern identity politics based on freedom and the autonomous self to the decentered model of mediation and interaction in the contemporary era. Keywords: Japanese video games; visual novels; bishojo¯ games; otome games; romance simulation; literature; romance; narrative form; modernity; postmodernity Citation: Saito, Kumiko. 2021. From Novels to Video Games: Romantic 1. Introduction Love and Narrative Form in Japanese With the rise of video games as a new medium for storytelling, scholars have un- Visual Novels and Romance equivocally posed the question, “Are games stories?” (Salen and Zimmerman 2003, p. 378). Adventure Games. Arts 10: 42. Although any computer or video game can be considered a form of popular fiction (Atkins https://doi.org/10.3390/arts10030042 2003, p.
    [Show full text]
  • Testing and Maintenance in the Video Game Industry Today
    Running Head: TESTING AND MAINTAINING VIDEO GAMES 1 Testing and Maintenance in the Video Game Industry Today Anthony Jarman A Senior Thesis submitted in partial fulfillment of the requirements for graduation in the Honors Program Liberty University Spring 2010 TESTING AND MAINTAINING VIDEO GAMES 2 Acceptance of Senior Honors Thesis This Senior Honors Thesis is accepted in partial fulfillment of the requirements for graduation from the Honors Program of Liberty University. ______________________________ Robert Tucker, Ph.D. Thesis Chair ______________________________ Mark Shaneck, Ph.D. Committee Member ______________________________ Troy Matthews, Ed.D. Committee Member ______________________________ Marilyn Gadomski, Ph.D. Assistant Honors Director ______________________________ Date TESTING AND MAINTAINING VIDEO GAMES 3 Abstract Testing and maintenance are important when designing any type of software, especially video games. Since the gaming industry began, testing and maintenance techniques have evolved and changed. In order to understand how testing and maintenance techniques are practiced in the gaming industry, several key elements must be examined. First, specific testing and maintenance techniques that are most useful for video games must be analyzed to understand their effectiveness. Second, the processes used for testing and maintaining video games at the beginning of the industry must be reviewed in order to see how far testing and maintenance techniques have progressed. Third, the potential negative side effects of new testing and maintenance techniques need to be evaluated to serve as both a warning for future game developers and a way of improving the overall quality of current video games. TESTING AND MAINTAINING VIDEO GAMES 4 Testing and Maintenance in the Video Game Industry Today Computers are used in almost every field imaginable today.
    [Show full text]
  • Arxiv:2102.02810V2 [Cs.CL] 9 Jul 2021
    Data Mining and Knowledge Discovery manuscript No. (will be inserted by the editor) Controlling Hallucinations at Word Level in Data-to-Text Generation Clement Rebuffel? · Marco Roberti? · Laure Soulier · Geoffrey Scoutheeten · Rossella Cancelliere · Patrick Gallinari Received: date / Accepted: date Abstract Data-to-Text Generation (DTG) is a subfield of Natural Language Gen- eration aiming at transcribing structured data in natural language descriptions. The field has been recently boosted by the use of neural-based generators which ex- hibit on one side great syntactic skills without the need of hand-crafted pipelines; on the other side, the quality of the generated text reflects the quality of the train- ing data, which in realistic settings only offer imperfectly aligned structure-text pairs. Consequently, state-of-art neural models include misleading statements { usually called hallucinations{ in their outputs. The control of this phenomenon is today a major challenge for DTG, and is the problem addressed in the paper. Previous work deal with this issue at the instance level: using an alignment score for each table-reference pair. In contrast, we propose a finer-grained ap- proach, arguing that hallucinations should rather be treated at the word level. Specifically, we propose a Multi-Branch Decoder which is able to leverage word- ? Equal contribution Cl´ement Rebuffel Sorbonne Universit´e,CNRS, LIP6, F-75005 Paris, France E-mail: clement.rebuff[email protected] Marco Roberti University of Turin, Italy E-mail: [email protected] Laure Soulier Sorbonne Universit´e,CNRS, LIP6, F-75005 Paris, France Geoffrey Scoutheeten BNP Paribas, Paris Rossella Cancelliere University of Turin, Italy arXiv:2102.02810v2 [cs.CL] 9 Jul 2021 Patrick Gallinari Sorbonne Universit´e,CNRS, LIP6, F-75005 Paris, France Criteo AI Lab, Paris 2 C.
    [Show full text]
  • Race 1 1 1 2 2 3 2 4 3 5 3 6 4 7 4 8 5 9 5 10
    HANSHIN SUNDAY,APRIL 5TH Post Time 10:05 1 ! Race Dirt 1800m THREE−YEAR−OLDS Course Record:10Jul.04 1:48.5 F&M DES,WEIGHT FOR AGE,MAIDEN Value of race: 9,680,000 Yen 1st 2nd 3rd 4th 5th Added Money(Yen) 5,100,000 2,000,000 1,300,000 770,000 510,000 Stakes Money(Yen) 0 0 0 Ow. Manabu Yoshitomi 0 S 00000 Life10001M 10001 1 1 51.0 Fuma Izumiya(10.5%,6−3−1−47,57th) Turf10001 I 00000 Ashaka Chan(JPN) Dirt00000L 00000 -Just a Way(0.90) -Elusive City F3,d.b. Ta k a s h i S a i to (14.7%,10−8−5−45,14th) Course00000E 00000 Wht. .Interim .Jioconda 19Mar.17 Taihei Stud Farm Co., Ltd Wet 00000 29Sep.19 HANSHIN NWC T1600Fi 5 4 1:36.7 12th/18 Taisei Danno 51.0 472, In the Mood 1:34.9 <2 1/2> Avail <1/2> Kurino Vincent Ow. Silk Racing Co.,Ltd. 5,300,000 S 10001 Life60213M 50212 1 2 B 53.0 Atsuya Nishimura(7.9%,18−26−18−167,12th) Turf20002 I 00000 Rhone Glacier(JPN) Dirt40211L 00000 -Kurofune(0.92) -Agnes Tachyon F3,g. Manabu Ikezoe(15.1%,11−8−6−48,8th) Course10001E 00000 Wht. .Adelheid .Biwa Heidi 26Mar.17 Tenei Horse Park Ltd Wet 20200 22Feb.20 KOKURA MDN D1700Sl 1111 1:45.5 2nd/16 Atsuya Nishimura 53.0 478! Copano Rakuraku 1:45.3 <1 1/4> Rhone Glacier <6> Shigeru Mokusei 16Feb.20 KOKURA MDN D1700Sl 1112 1:46.2 2nd/16 Atsuya Nishimura 53.0 478# Kurino Nikita 1:46.2 <NK> Rhone Glacier <2 1/2> Behind the Sun 21Dec.19 HANSHIN MDN D1800Go3345 1:59.515th/16YugaKawada 54.0488" A Shin Waltz 1:56.1 <4> Namura Puffin <2> Stella Alba 14Oct.19 KYOTO MDN D1800Go1111 1:55.43rd/10Andrasch Starke 54.0 492# Hakuai Windsor 1:54.4 <1 1/4> Jet Max <5> Rhone Glacier 21Sep.19 HANSHIN MDN T1600Fi 1 1 1:35.5 8th/14 Yuga Kawada 54.0 498# Shadow Blossom 1:34.3 <NK> Cortesia <NK> Lord Bay Leaf Ow.
    [Show full text]
  • Little Busters Pc Download Little Busters! Ecstasy Edition Free Download (V1.2.4) ”
    little busters pc download Little Busters! Ecstasy Edition Free Download (v1.2.4) ”. Both series are available and have gained a large following in many countries outside of Japan as well. How to Download & Install Little Busters! Ecstasy Edition. Click the Download button below and you should be redirected to UploadHaven. Wait 5 seconds and click on the blue ‘download now’ button. Now let the download begin and wait for it to finish. Once Little Busters! Ecstasy Edition is done downloading, right click the .zip file and click on “Extract to Little.Busters.Ecstasy.zip” (To do this you must have 7-Zip, which you can get here). Double click inside the Little Busters! Ecstasy Edition folder and run the exe application. Have fun and play! Make sure to run the game as administrator and if you get any missing dll errors, look for a Redist or _CommonRedist folder and install all the programs in the folder. Little Busters! Ecstasy Edition Free Download. Note: Little Busters Ecstasy contains the extra adult scenes that were added after the original game was created. While all of the dialogue and story is translated in Ecstasy – the extra adult scenes have never been translated. You can press CTRL to speed through the novel. ALT + ENTER will put the game in full-screen mode. Click the download button below to start Little Busters! Ecstasy Edition Free Download with direct link. It is the full version of the game. Don’t forget to run the game as administrator. Little Busters! Converted Edition. Let's live in the moment!This tale of adolescence from Key appears on Nintendo Switch for the first time ever! Since its humble beginnings as a PC game, the "Little Busters!" visual novel has continued to evolve.
    [Show full text]
  • ANIME OP/ED (TV-Versio) Japahari Net - Retsu No Matataki Maximum the Hormone - ROLLING 1000 Toon
    Air Master ANIME OP/ED (TV-versio) Japahari Net - Retsu no matataki Maximum the Hormone - ROLLING 1000 tOON 07 Ghost Ajin Yuki Suzuki - Aka no Kakera flumpool - Yoru wa Nemureru kai? Mamoru Miyano - How Close You Are 91 Days Angela x Fripside - Boku wa Boku de atte ELISA - Rain or Shine TK from Ling Tosite Sigure - Signal Amanchu Maaya Sakamoto - Million Clouds 11eyes Asriel - Sequentia Ange Vierge Ayane - Arrival of Tears Konomi Suzuki - Love is MY RAIL 3-gatsu no Lion Angel Beats BUMP OF CHICKEN - Answer Aoi Tada - Brave Song .dot-Hack Lisa - My Soul Your Beats See-Saw - Yasashii Yoake Girls Dead Monster - My Song Bump of chicken - Fighter Angelic Layer .hack//g.u Atsuko Enomoto - Be my Angel Ali Project - God Diva HAL - The Starry sky .hack//Liminality Anime-gataris See-Saw - Tasogare no Umi GARNiDELiA - Aikotoba .hack//Roots Akagami no Shirayuki hime Boukoku Kakusei Catharsis Saori Hayami - Yasashii Kibou eyelis - Kizuna ni nosete Abenobashi Mahou Shoutengai Megumi Hayashibara - Anata no kokoro ni Akagami no Shirayuki hime 2nd season Saori Hayami - Sono Koe ga Chizu ni Naru Absolute Duo eyelis - Page ~Kimi to Tsuzuru Monogatari~ Nozomi Yamamoto & Haruka Yamazaki - Apple Tea no Aji Akagi Maximum the Hormone - Akagi Accel World May'n - Chase the World Akame ga Kill Sachika Misawa - Unite. Miku Sawai - Konna Sekai, Shiritakunakatta Altima - Burst the gravity Rika Mayama - Liar Mask Kotoko - →unfinished→ Sora Amamiya - Skyreach Active Raid Akatsuki no Yona AKINO with bless4 - Golden Life Shikata Akiko - Akatsuki Cyntia - Akatsuki no hana
    [Show full text]
  • Understanding the Business Model in the Video Game Industry
    Understanding the business model in the video game industry A case study on an independent video game developer MASTER THESIS WITHIN: Business Administration NUMBER OF CREDITS: 30 PROGRAMME OF STUDY: Digital Business AUTHORS: Erik Almér and Gustav Eriksson JÖNKÖPING January 2019 Master Thesis in Business Administration Title: Understanding the business model in the video game industry – a case study on an independent video game developer Authors: Erik Almér and Gustav Eriksson Tutor: Imran Nazir Date: 2019-01-17 Key terms: business model, business model canvas, independent video game developer, case study Abstract Background: Tough competition, time- and resource constraints, and changing consumer demands in the video game industry requires business models that can cope with the pressure. Purpose: The purpose of this thesis is to use business model framework in order to better understand how independent video game developers develop their business models. We aim to contribute to the development of business model literature within the context of independent video game development by further the understanding of how a business model framework can be utilized in this new context. Method: A case study method was used, focusing on a single-case and interviews with participants from the case company. Conclusion: We further develop the BMC by proposing to divide the BMC for independent video game developers into a pre-release and post-release BMC to better describe the business model for an independent video game developer and the business model evolution from pre-release to post- release. i Table of Contents 1. Introduction .......................................................................... 1 1.1 Disposition .............................................................................................. 1 1.2 Background ............................................................................................
    [Show full text]
  • Rewrite: an Experimentation in the Field of Interactive Fiction a Thesis
    Rewrite: An Experimentation in the Field of Interactive Fiction A Thesis Proposal Presented to The Faculty of Arts and Humanities University of Denver by S. Sanaz Fatemi November 2014 Advisor: Rafael Fajardo Author: S. Sanaz Fatemi Title: Rewrite: An Experimentation in the Field of Interactive Fiction Advisor: Rafael Fajardo Proposal Date: November 2014 Abstract An interactive fiction, as a form of video game and virtual reality, tends to be immersive, that is drowning the player as deep as possible into its constructed world. Yet, as a form of experimental literature, or more specifically metafiction, it also tends to be self- conscious of its own form, attracting the player’s attention to the constructed quality of its structure. Therefore, any interactive fiction has an inherently paradoxical structure: it aspires to be immersive, but at the same time, due to its experimental nature, breaks the immersiveness by attracting attention to its particular formal quality. This study uses the theoretical frameworks of Narratology, particularly the theory of metafiction, and ludology, specifically the ideas formed around the notion of immersion, to trace the nature of this paradox back to its origins. Then, it proposes Rewrite, an interactive fiction that is going to be made by the author of this thesis, as a practical example to prove that the metafictional qualities of interactive fictions can be used to produce an even stronger sense of immersion for players. Rewrite is being developed by Ren’Py, an open source visual novel engine that uses Python as its base of scripting language. Creating story worlds that allow genuine immersion has been one of the main ambitions of interactive storytellers.
    [Show full text]
  • Auditory Cognition and Perception of Action Video Game Players Hannah J
    www.nature.com/scientificreports OPEN Auditory cognition and perception of action video game players Hannah J. Stewart1,2, Jasmin L. Martinez1,3, Audrey Perdew1, C. Shawn Green4 & David R. Moore1,5,6* A training method to improve speech hearing in noise has proven elusive, with most methods failing to transfer to untrained tasks. One common approach to identify potentially viable training paradigms is to make use of cross-sectional designs. For instance, the consistent fnding that people who chose to avidly engage with action video games as part of their normal life also show enhanced performance on non-game visual tasks has been used as a foundation to test the causal impact of such game play via true experiments (e.g., in more translational designs). However, little work has examined the association between action video game play and untrained auditory tasks, which would speak to the possible utility of using such games to improve speech hearing in noise. To examine this possibility, 80 participants with mixed action video game experience were tested on a visual reaction time task that has reliably shown superior performance in action video game players (AVGPs) compared to non-players (≤ 5 h/week across game categories) and multi-genre video game players (> 5 h/week across game categories). Auditory cognition and perception were tested using auditory reaction time and two speech-in-noise tasks. Performance of AVGPs on the visual task replicated previous positive fndings. However, no signifcant beneft of action video game play was found on the auditory tasks. We suggest that, while AVGPs interact meaningfully with a rich visual environment during play, they may not interact with the games’ auditory environment.
    [Show full text]
  • Game-Like 3D Visualisation of Air Quality Data
    Multimodal Technologies and Interaction Article Game-Like 3D Visualisation of Air Quality Data Bruno Teles 1, Pedro Mariano 1,2 and Pedro Santana 1,∗ 1 ISCTE-Instituto Universitário de Lisboa, Instituto de Telecomunicações, 1649-026 Lisboa, Portugal; [email protected] (B.T.); [email protected] (P.M.) 2 Centro de Ciências e Tecnologias Nucleares, Instituto Superior Técnico, 2695-066 Bobadela, Portugal * Correspondence: [email protected] Received: 7 August 2020; Accepted: 12 August 2020; Published: 17 August 2020 Abstract: The data produced by sensor networks for urban air quality monitoring is becoming a valuable asset for informed health-aware human activity planning. However, in order to properly explore and exploit these data, citizens need intuitive and effective ways of interacting with it. This paper presents CityOnStats, a visualisation tool developed to provide users, mainly adults and young adults, with a game-like 3D environment populated with air quality sensing data, as an alternative to the traditionally passive visualisation techniques. CityOnStats provides several visual cues of pollution presence with the purpose of meeting each user’s preferences. Usability tests with a sample of 30 participants have shown the value of air quality 3D game-based visualisation and have provided empirical support for which visual cues are most adequate for the task at hand. Keywords: scientific data visualisation; virtual environments; game engine; air quality; sensor network 1. Introduction The growth of urban environments brings many benefits to citizens, but also contributes to an increase in pollution levels and, as a consequence, to the degradation of air quality. To monitor air quality in cities, fixed and costly air sampling stations are often planted in key locations.
    [Show full text]