Tools for Analyzing Talk Part 2: the CLAN Program

Total Page:16

File Type:pdf, Size:1020Kb

Tools for Analyzing Talk Part 2: the CLAN Program Tools for Analyzing Talk Part 2: The CLAN Program August 24, 2021 Brian MacWhinney Carnegie Mellon University https://doi.org/10.21415/T5G10R When citing the use of TalkBank facilities, please use this reference to the last printed version of the CHILDES manual: MacWhinney, B. (2000). The CHILDES Project: Tools for Analyzing Talk. 3rd Edition. Mahwah, NJ: Lawrence Erlbaum Associates This allows us to systematically track usage of the programs and data through scholar.google.com. Part 2: CLAN 2 1 Getting Started ............................................................................................................ 8 1.1 Why you want to learn CLAN ....................................................................................... 8 1.2 Learning CLAN .................................................................................................................. 8 1.3 Installing CLAN – Mac OS X ........................................................................................... 9 1.4 Installing CLAN – Windows .......................................................................................... 9 2 Using the Web ............................................................................................................ 10 2.1 Community Resources ................................................................................................ 10 2.2 Downloading Materials .............................................................................................. 10 2.3 Using the Browsable Database ................................................................................ 10 2.4 Downloading Transcripts and Media .................................................................... 11 3 Tutorial ........................................................................................................................ 12 3.1 The Commands Window ............................................................................................ 12 3.1.1 Setting the Working Directory .......................................................................................... 12 3.1.2 The Recall Button ................................................................................................................... 13 3.1.3 The Progs Menu ...................................................................................................................... 13 3.1.4 The FILE IN Button ................................................................................................................ 13 3.1.5 The TIERS Button ................................................................................................................... 13 3.2 Typing Command Lines .............................................................................................. 14 3.2.1 The Asterisk Wildcard.......................................................................................................... 15 3.2.2 Output Files .............................................................................................................................. 15 3.2.3 Redirection ............................................................................................................................... 15 3.3 Sample Runs ................................................................................................................... 16 3.3.1 Sample KWAL Run ................................................................................................................. 16 3.3.2 Sample FREQ Run .................................................................................................................. 16 3.3.3 Sample MLU Run .................................................................................................................... 17 3.3.4 Sample COMBO Run .............................................................................................................. 18 3.3.5 Sample GEM and GEMFREQ Runs ................................................................................... 18 3.4 Advanced Commands .................................................................................................. 19 3.5 Exercises .......................................................................................................................... 22 3.5.1 MLU50 Analysis ...................................................................................................................... 23 3.5.2 MLU5 Analysis ......................................................................................................................... 25 3.5.3 MLT Analysis ............................................................................................................................ 25 3.5.4 TTR Analysis ............................................................................................................................ 26 3.5.5 Generating Language Profiles ........................................................................................... 26 3.6 Further Exercises .......................................................................................................... 28 4 The Editor .................................................................................................................... 30 4.1 Screencasts ..................................................................................................................... 30 4.2 Text Mode vs. CHAT Mode ......................................................................................... 30 4.3 File, Edit, Format, and Font Menus ......................................................................... 31 4.4 Mode Menu ...................................................................................................................... 31 4.5 Default Window Positioning, Size, and Font Control ....................................... 31 4.6 CA Styles ........................................................................................................................... 32 4.7 Setting Special Colors .................................................................................................. 32 4.8 Searching ......................................................................................................................... 32 4.9 Hiding Tiers .................................................................. Error! Bookmark not defined. 4.10 Send to Sound Analyzer ......................................................................................... 33 4.11 Tiers Menu Items ..................................................................................................... 33 4.12 Running CHECK Inside the Editor ...................................................................... 33 Part 2: CLAN 3 4.13 Preferences and Options ....................................................................................... 34 4.14 Coder Mode ................................................................................................................ 34 4.14.1 Entering Codes ................................................................................................................... 35 4.14.2 Setting Up Your Codes File ............................................................................................ 36 5 Media Linkage ........................................................................................................... 38 5.1 Media Formats ............................................................................................................... 38 5.2 Sonic Mode ...................................................................................................................... 39 5.3 Transcriber Mode ......................................................................................................... 40 5.3.1 Linking to an already existing transcript ..................................................................... 40 5.3.2 To create a new transcript ................................................................................................. 41 5.3.3 Sparse Annotation ................................................................................................................. 42 5.4 Video Linking ................................................................................................................. 42 5.5 Walker Controller ........................................................................................................ 43 5.6 Playback Control ........................................................................................................... 44 5.7 Multiple Video Playback ............................................................................................ 44 5.8 Manual Editing ............................................................................................................... 45 5.9 Video Skipping ............................................................................................................... 45 5.10 Audio Anonymization ............................................................................................. 46 6 Other Features .......................................................................................................... 47 6.1 Supplementary Commands ....................................................................................... 47 6.2 Online Help ..................................................................................................................... 47 6.3 Help for the +sm and +sg switches ........................................................................
Recommended publications
  • The Longest Words Using the Fewest Letters
    The Longest Words Using The Fewest Letters David M. Rheingold / Barnstable, Massachusetts Austin Byl, Evan Crouch, Elizabeth Krodel, Celia Schiffman, Nico Toutenhoofd / Boulder, Colorado Claude Shannon may be remembered as the father of information theory for the formula engraved on his tombstone, but it’s not the only measure of entropy he helped devise. Metric entropy gauges the amount of information proportionate to the number of unique characters in a message. Within the English language, the maximum metric entropy individual words can attain is 0.5283 binary digits (or bits). This is the value of any three-letter word composed of three different letters (e.g., WHO). Multiplying 0.5283 times the number of distinct letters equals 1.58, and 2 raised to the power of 1.58 yields the same number of letters. These values hold true for any trigram with three different letters in a 26-letter alphabet. Measurements differ for written text involving other characters including numbers, punctuation, even spaces. Distinguishing between uppercase and lowercase would double the number of letters to 52. Adjusting for word frequency would further alter values as trigrams like WHO occur far more often in written English than, say, KEY. (How much more often? We found 3,598,284 instances of WHO in Project Gutenberg versus 58,863 of KEY, or 61 times as many.) Our lexicon derives from three primary sources: Merriam-Webster, Webster’s, and the Moby II wordlist, with a combined 382,843 words. Of these we find some 2,000 trigrams made up of three different letters, including WHO and KEY, tied for highest metric entropy.
    [Show full text]
  • Audiovisual Media Annotation Using Qualitative Data Analysis Software: a Comparative Analysis
    The Qualitative Report Volume 23 Number 13 Article 4 3-6-2018 Audiovisual Media Annotation Using Qualitative Data Analysis Software: A Comparative Analysis Liliana Melgar Estrada University of Amsterdam, [email protected] Marijn Koolen Huygens ING, [email protected] Follow this and additional works at: https://nsuworks.nova.edu/tqr Part of the Library and Information Science Commons, and the Quantitative, Qualitative, Comparative, and Historical Methodologies Commons Recommended APA Citation Melgar Estrada, L., & Koolen, M. (2018). Audiovisual Media Annotation Using Qualitative Data Analysis Software: A Comparative Analysis. The Qualitative Report, 23(13), 40-60. https://doi.org/10.46743/ 2160-3715/2018.3035 This Article is brought to you for free and open access by the The Qualitative Report at NSUWorks. It has been accepted for inclusion in The Qualitative Report by an authorized administrator of NSUWorks. For more information, please contact [email protected]. Audiovisual Media Annotation Using Qualitative Data Analysis Software: A Comparative Analysis Abstract The variety of specialized tools designed to facilitate analysis of audio-visual (AV) media are useful not only to media scholars and oral historians but to other researchers as well. Both Qualitative Data Analysis Software (QDAS) packages and dedicated systems created for specific disciplines, such as linguistics, can be used for this purpose. Software proliferation challenges researchers to make informed choices about which package will be most useful for their project. This paper aims to present an information science perspective of the scholarly use of tools in qualitative research of audio-visual sources. It provides a baseline of affordances based on functionalities with the goal of making the types of research tasks that they support more explicit (e.g., transcribing, segmenting, coding, linking, and commenting on data).
    [Show full text]
  • Everyday Intolerance- Racist and Xenophic Violence in Italy
    Italy H U M A N Everyday Intolerance R I G H T S Racist and Xenophobic Violence in Italy WATCH Everyday Intolerance Racist and Xenophobic Violence in Italy Copyright © 2011 Human Rights Watch All rights reserved. Printed in the United States of America ISBN: 1-56432-746-9 Cover design by Rafael Jimenez Human Rights Watch 350 Fifth Avenue, 34th floor New York, NY 10118-3299 USA Tel: +1 212 290 4700, Fax: +1 212 736 1300 [email protected] Poststraße 4-5 10178 Berlin, Germany Tel: +49 30 2593 06-10, Fax: +49 30 2593 0629 [email protected] Avenue des Gaulois, 7 1040 Brussels, Belgium Tel: + 32 (2) 732 2009, Fax: + 32 (2) 732 0471 [email protected] 64-66 Rue de Lausanne 1202 Geneva, Switzerland Tel: +41 22 738 0481, Fax: +41 22 738 1791 [email protected] 2-12 Pentonville Road, 2nd Floor London N1 9HF, UK Tel: +44 20 7713 1995, Fax: +44 20 7713 1800 [email protected] 27 Rue de Lisbonne 75008 Paris, France Tel: +33 (1)43 59 55 35, Fax: +33 (1) 43 59 55 22 [email protected] 1630 Connecticut Avenue, N.W., Suite 500 Washington, DC 20009 USA Tel: +1 202 612 4321, Fax: +1 202 612 4333 [email protected] Web Site Address: http://www.hrw.org March 2011 ISBN: 1-56432-746-9 Everyday Intolerance Racist and Xenophobic Violence in Italy I. Summary ...................................................................................................................... 1 Key Recommendations to the Italian Government ............................................................ 3 Methodology ................................................................................................................... 4 II. Background ................................................................................................................. 5 The Scale of the Problem ................................................................................................. 9 The Impact of the Media ...............................................................................................
    [Show full text]
  • Nvivo 8 Help Using the Software
    NVivo 8 Help Using the Software This is a printable version of the NVivo 8 help that is built into the software. The help is divided into two sections - Using the Software and Working with Your Data. This section of the help contains step-by-step instructions and provides the fundamental information you need to work with the software. To electronically navigate through this help and link to other topics, go to the NVivo Help under the Help menu in your NVivo 8 software. Click a button for related instructions or concepts. Produced November 2008. Copyright © 2008 QSR International Pty Ltd ABN 47 006 357 213. All rights reserved. NVivo and QSR words and logos are trademarks or registered trademarks of QSR International Pty Ltd. Patent pending. Microsoft, .NET, SQL Server, Windows, XP, Vista, Windows Media Player, Word, PowerPoint and Excel are trademarks or registered trademarks of the Microsoft Corporation in the United States and/or other countries. Adobe, .pdf and Flash Player are trademarks or registered trademarks of Adobe Systems Integrated in the United States and/or other countries. QuickTime and the QuickTime logo are trademarks or registered trademarks of Apple Computer, Inc., used under license there from. Crystal Reports is a trademark or registered trademark of Business Objects SA. This information is subject to change without notice. NVivo 8 Help - Using the Software Contents Introduction ............................................................................................................................................................
    [Show full text]
  • Confronting Antisemitism in Modern Media, the Legal and Political Worlds an End to Antisemitism!
    Confronting Antisemitism in Modern Media, the Legal and Political Worlds An End to Antisemitism! Edited by Armin Lange, Kerstin Mayerhofer, Dina Porat, and Lawrence H. Schiffman Volume 5 Confronting Antisemitism in Modern Media, the Legal and Political Worlds Edited by Armin Lange, Kerstin Mayerhofer, Dina Porat, and Lawrence H. Schiffman ISBN 978-3-11-058243-7 e-ISBN (PDF) 978-3-11-067196-4 e-ISBN (EPUB) 978-3-11-067203-9 DOI https://10.1515/9783110671964 This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. For details go to https://creativecommons.org/licenses/by-nc-nd/4.0/ Library of Congress Control Number: 2021931477 Bibliographic information published by the Deutsche Nationalbibliothek The Deutsche Nationalbibliothek lists this publication in the Deutsche Nationalbibliografie; detailed bibliographic data are available on the Internet at http://dnb.dnb.de. © 2021 Armin Lange, Kerstin Mayerhofer, Dina Porat, Lawrence H. Schiffman, published by Walter de Gruyter GmbH, Berlin/Boston The book is published with open access at www.degruyter.com Cover image: Illustration by Tayler Culligan (https://dribbble.com/taylerculligan). With friendly permission of Chicago Booth Review. Printing and binding: CPI books GmbH, Leck www.degruyter.com TableofContents Preface and Acknowledgements IX LisaJacobs, Armin Lange, and Kerstin Mayerhofer Confronting Antisemitism in Modern Media, the Legal and Political Worlds: Introduction 1 Confronting Antisemitism through Critical Reflection/Approaches
    [Show full text]
  • The Impact of Ethnic Group Categorization on Inter-Clan Conflicts in Tanzania
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by International Institute for Science, Technology and Education (IISTE): E-Journals Developing Country Studies www.iiste.org ISSN 2224-607X (Paper) ISSN 2225-0565 (Online) Vol 2, No.11, 2012 Are We the Same or Different? The Impact of Ethnic Group Categorization on Inter-Clan Conflicts in Tanzania Gissa Andrew Mahende Dar es Salaam University College of Education (DUCE), P.O. Box 2329, Dar es Salaam, Tanzania Email: [email protected] Abstract The study sought to examine the impact of Kuria clans’ categorization on perceived similarities and differences among clan and its contribution on inter-clan prejudice. The used in this study were collected from sixty six respondents who were purposively selected from government officials and members of the Kira, Nchari and Renchoka using interview and focus group discussion. The findings indicated that categorization has laid the feeling of indifferences and prejudice among the Kuria clans. The findings also indicated that the deep rooted perceptions of differences have caused each clan to perceive itself as an independent tribe and not clan of the same Kuria ethnic group. Additionally, categorization has lead clan groups to share the available resources based on the existing groups. This evokes prejudices and inter-clan competition over resources which consequently lead to inter- clan conflict. It was concluded that strategies such as seminars, use of media and integrated education can be used to reduce the perceived inter clan differences, cultivate inclusive identity and increase inter clan social cohesion. Key Words: Categorization, Similarities, Differences, Prejudice, Conflicts.
    [Show full text]
  • Jun'ichiro Shojithe Racial Equality Issue and Konoe Fumimaro
    The Racial Equality Issue and Konoe Fumimaro The Racial Equality Issue and Konoe Fumimaro* Jun’ichiro Shoji** Abstract Konoe Fumimaro served three times as Japan’s prime minister in the crucial years before World War II. At the end of World War I, as Japan was preparing to take part in the Paris Peace Conference at Versailles of 1919, Konoe Fumimaro published his famous article titled “Eibei hon’i no heiwashugi o haisu [A Rejection of the Anglo- American-Centered Peace Ideology].” In this article and his observations on post- WWI Europe and the United States, he often praises U.S. President Woodrow Wilson. Konoe clearly was fascinated by and attracted to the American President, and felt a closeness and sympathy with his views. It seems that in some ways the two men had some things in common. The arguments and world views of both men were extremely idealistic. But even Wilson’s ideals, in the world of 1919, where the lingering influence of the old regime remained strong, ultimately did not become a reality at that time. Wilson died before he could complete his dream. Similarly, Konoe’s ideals stood no chance of being realized, given the changes of the times and the development of Japan’s militaristic foreign policy. For both Konoe and Wilson, racial equality in 1919 was a distant ideal. Focusing on Konoe Fumimaro, this article discusses the issues surrounding Japan’s racial equality proposal of 1919. oday I want to discuss some of my own views of the issues surrounding Japan’s racial equality proposal of 1919 by focusing on one individual, Konoe Fumimaro.
    [Show full text]
  • Segmentability Differences Between Child-Directed and Adult-Directed Speech: a Systematic Test with an Ecologically Valid Corpus
    Report Segmentability Differences Between Child-Directed and Adult-Directed Speech: A Systematic Test With an Ecologically Valid Corpus Alejandrina Cristia 1, Emmanuel Dupoux1,2,3, Nan Bernstein Ratner4, and Melanie Soderstrom5 1Dept d’Etudes Cognitives, ENS, PSL University, EHESS, CNRS 2INRIA an open access journal 3FAIR Paris 4Department of Hearing and Speech Sciences, University of Maryland 5Department of Psychology, University of Manitoba Keywords: computational modeling, learnability, infant word segmentation, statistical learning, lexicon ABSTRACT Previous computational modeling suggests it is much easier to segment words from child-directed speech (CDS) than adult-directed speech (ADS). However, this conclusion is based on data collected in the laboratory, with CDS from play sessions and ADS between a parent and an experimenter, which may not be representative of ecologically collected CDS and ADS. Fully naturalistic ADS and CDS collected with a nonintrusive recording device Citation: Cristia A., Dupoux, E., Ratner, as the child went about her day were analyzed with a diverse set of algorithms. The N. B., & Soderstrom, M. (2019). difference between registers was small compared to differences between algorithms; it Segmentability Differences Between Child-Directed and Adult-Directed reduced when corpora were matched, and it even reversed under some conditions. Speech: A Systematic Test With an Ecologically Valid Corpus. Open Mind: These results highlight the interest of studying learnability using naturalistic corpora Discoveries in Cognitive Science, 3, 13–22. https://doi.org/10.1162/opmi_ and diverse algorithmic definitions. a_00022 DOI: https://doi.org/10.1162/opmi_a_00022 INTRODUCTION Supplemental Materials: Although children are exposed to both child-directed speech (CDS) and adult-directed speech https://osf.io/th75g/ (ADS), children appear to extract more information from the former than the latter (e.g., Cristia, Received: 15 May 2018 2013; Shneidman & Goldin-Meadow,2012).
    [Show full text]
  • Current Issues in Qualitative Data Analysis Software (QDAS): a User and Developer Perspective
    The Qualitative Report Volume 23 Number 13 Article 5 3-6-2018 Current Issues in Qualitative Data Analysis Software (QDAS): A User and Developer Perspective Jeanine C. Evers Erasmus University of Rotterdam, [email protected] Follow this and additional works at: https://nsuworks.nova.edu/tqr Part of the Law Commons, Quantitative, Qualitative, Comparative, and Historical Methodologies Commons, and the Social Statistics Commons Recommended APA Citation Evers, J. C. (2018). Current Issues in Qualitative Data Analysis Software (QDAS): A User and Developer Perspective. The Qualitative Report, 23(13), 61-73. https://doi.org/10.46743/2160-3715/2018.3205 This Article is brought to you for free and open access by the The Qualitative Report at NSUWorks. It has been accepted for inclusion in The Qualitative Report by an authorized administrator of NSUWorks. For more information, please contact [email protected]. Current Issues in Qualitative Data Analysis Software (QDAS): A User and Developer Perspective Abstract This paper describes recent issues and developments in Qualitative Data Analysis Software (QDAS) as presented in the opening plenary at the KWALON 2016 conference. From a user perspective, it reflects current features and functionality, including the use of artificial intelligence and machine learning; implications of the cloud; user friendliness; the role of digital archives; and the development of a common exchange format. This user perspective is complemented with the views of software developers who took part in the “Rotterdam Exchange Format Initiative,” an outcome of the conference. Keywords Qualitative Data Analysis Software, QDAS, Artificial Intelligence, Machine Learning, TLA AS.ti, Cassandre, Dedoose, f4analyse, MAXQDA, NVivo, QDA Miner, Quirkos, Transana, Exchange format, Interoperability, Qualitative Data Analysis, Learning Curve QDAS, Textual Data Mining, Cloud services.
    [Show full text]
  • A Software-Assisted Qualitative Content Analysis of News Articles: Example and Reflections
    Volume 16, No. 2, Art. 8 May 2015 A Software-Assisted Qualitative Content Analysis of News Articles: Example and Reflections Florian Kaefer, Juliet Roper & Paresha Sinha Key words: Abstract: This article offers a step-by-step description of how qualitative data analysis software can qualitative content be used for a qualitative content analysis of newspaper articles. Using NVivo as an example, it analysis; method; illustrates how software tools can facilitate analytical flexibility and how they can enhance news articles; transparency and trustworthiness of the qualitative research process. Following a brief discussion NVivo software; of the key characteristics, advantages and limitations of qualitative data analysis software, the qualitative data article describes a qualitative content analysis of 230 newspaper articles, conducted to determine analysis software international media perceptions of New Zealand's environmental performance in connection with (QDAS); climate change and carbon emissions. The article proposes a multi-level coding approach during computer-assisted the analysis of news texts that combines quantitative and qualitative elements, allowing the qualitative data researcher to move back and forth in coding and between analytical levels. The article concludes analysis that while qualitative data analysis software, such as NVivo, will not do the analysis for the (CAQDAS) researcher, it can make the analytical process more flexible, transparent and ultimately more trustworthy. Table of Contents 1. Introduction 2. Literature Review 2.1 Qualitative content analysis (QCA) 2.2 Software-assisted qualitative data analysis 3. A Step-by-Step Approach to Software-Assisted Qualitative Content Analysis of News Articles 3.1 Step 1: Selecting and learning the software 3.1.1 Getting to know the software: Sample project, literature review 3.2 Step 2: Data import and preparation 3.3 Step 3: Multi-level coding 3.3.1 Coding: Inductive vs.
    [Show full text]
  • Multimedia Corpora (Media Encoding and Annotation) (Thomas Schmidt, Kjell Elenius, Paul Trilsbeek)
    Multimedia Corpora (Media encoding and annotation) (Thomas Schmidt, Kjell Elenius, Paul Trilsbeek) Draft submitted to CLARIN WG 5.7. as input to CLARIN deliverable D5.C­3 “Interoperability and Standards” [http://www.clarin.eu/system/files/clarin­deliverable­D5C3_v1_5­finaldraft.pdf] Table of Contents 1 General distinctions / terminology................................................................................................................................... 1 1.1 Different types of multimedia corpora: spoken language vs. speech vs. phonetic vs. multimodal corpora vs. sign language corpora......................................................................................................................................................... 1 1.2 Media encoding vs. Media annotation................................................................................................................... 3 1.3 Data models/file formats vs. Transcription systems/conventions.......................................................................... 3 1.4 Transcription vs. Annotation / Coding vs. Metadata ............................................................................................. 3 2 Media encoding ............................................................................................................................................................... 5 2.1 Audio encoding ..................................................................................................................................................... 5 2.2
    [Show full text]
  • Set Reading Benchmarks in South Africa
    SETTING READING BENCHMARKS IN SOUTH AFRICA October 2020 PHOTO: SCHOOL LIBRARY, CREDIT: PIXABAY Contract Number: 72067418D00001, Order Number: 72067419F00007 THIS PUBLICATION WAS PRODUCED AT THE REQUEST OF THE UNITED STATES AGENCY FOR INTERNATIONAL DEVELOPMENT. IT WAS PREPARED INDEPENDENTLY BY KHULISA MANAGEMENT SERVICES, (PTY) LTD. THE AUTHOR’S VIEWS EXPRESSED IN THIS PUBLICATION DO NOT NECESSARILY REFLECT THE VIEWS OF THE UNITED STATES AGENCY FOR INTERNATIONAL DEVELOPMENT OR THE UNITED STATES GOVERNMENT. ISBN: 978-1-4315-3411-1 All rights reserved. You may copy material from this publication for use in non-profit education programmes if you acknowledge the source. Contract Number: 72067418D00001, Order Number: 72067419F00007 SETTING READING BENCHMARKS IN SOUTH AFRICA October 2020 AUTHORS Matthew Jukes (Research Consultant) Elizabeth Pretorius (Reading Consultant) Maxine Schaefer (Reading Consultant) Katharine Tjasink (Project Manager, Senior) Margaret Roper (Deputy Director, Khulisa) Jennifer Bisgard (Director, Khulisa) Nokuthula Mabhena (Data Visualisation, Khulisa) CONTACT DETAILS Margaret Roper 26 7th Avenue Parktown North Johannesburg, 2196 Telephone: 011-447-6464 Email: [email protected] Web Address: www.khulisa.com CONTRACTUAL INFORMATION Khulisa Management Services Pty Ltd, (Khulisa) produced this Report for the United States Agency for International Development (USAID) under its Practical Education Research for Optimal Reading and Management: Analyze, Collaborate, Evaluate (PERFORMANCE) Indefinite Delivery Indefinite Quantity
    [Show full text]