Story Understanding Through Semantic Analysis and Automatic Alignment of Text and Video

Total Page:16

File Type:pdf, Size:1020Kb

Story Understanding Through Semantic Analysis and Automatic Alignment of Text and Video Story Understanding through Semantic Analysis and Automatic Alignment of Text and Video zur Erlangung des akademischen Grades eines Doktors der Ingenieurwissenschaften der KIT-Fakultät für Informatik des Karlsruher Instituts für Technologie (KIT) genehmigte Dissertation von Makarand Murari Tapaswi aus Goa, Indien Tag der mündlichen Prüfung: 16. Juni 2016 Hauptreferent: Prof. Dr.-Ing. Rainer Stiefelhagen Karlsruher Institut für Technologie Korreferent: Dr. Cordelia Schmid INRIA, Grenoble KIT – Universität des Landes Baden-Württemberg und nationales Forschungszentrum in der Helmholtz-Gemeinschaft www.kit.edu Abstract Stories are the pinnacle of human creativity, and yet a ubiquitous phenomenon. An important element of human communication consists of telling and listening, reading, and nowadays watching stories enacted on film. Among different means of story-telling, videos (TV series and movies) are a very powerful medium as they have the potential to engage multiple human senses. Artificial Intelligence (AI) has made large strides in the last decade, through which several advances have been achieved in both language and vision. However, these fields have been primarily studied separately, and only in the last few years do we see joint analysis rising to prominence. We argue that the ability to model, analyze, understand, and create new stories is a stepping stone for strong AI – a machine that could perform any intellectual task that a human can. Towards this grander goal of story understanding, we seek to build upon a wave of research in the joint study of vision and language. TV series and movies are perfect candidates for such a study, as they are videos produced for the specific purpose of story-telling. In this thesis, we define machine understanding of stories as the ability to perform human-like tasks upon those stories, such as indexing and searching for story events in large collections, summarizing the stories, and answering questions about them. We address the problem of story understanding from three vantage points. First, we introduce the use of novel sources of natural language text that allow to better learn the content of the videos. Next, we propose a visualization technique to obtain a big picture overview of the story conveyed in a video. Finally, we provide a means to examine machine understanding of stories by using question-answering as a surrogate task. iv Subtitles and transcripts have been an excellent source of low-level information for video understanding, however, they are inadequate to understand the story plot. We introduce the use of two diverse forms of natural language text that focus on the story: plot synopses and books. Plot synopses are concise descriptions of the story in the episodes or movies and are obtained easily through crowd sourcing. On the contrary, books, from which the videos are adapted, are large texts that describe the events (characters, scenes, and interactions) in rich detail. Unlike transcripts, the potential of these text sources needs to be unlocked by first aligning the text units with the video. We propose similarity metrics to bridge the gap between the text and video modalities. Using them, we align plot synopsis sentences with individual video shots, and book chapters with video scenes. To this end, we develop several alignment models that attempt to maximize joint similarity while respecting story progression constraints. We test these approaches on two sets of videos for both plots and books and obtain promising alignment performance. The alignment gives rise to applications such as describing video clips using plot sentences or book paragraphs, story-based video retrieval using plots as intermediaries, and even the ability to predict whether a scene from the video adaptation was present in the original book. Our second approach towards improving story understanding is through visualization. We automatically generate StoryGraphs – charts that depict character interactions in an episode and augment them with information about key events. The graph layout is treated as an optimization problem that trades off functionality with aesthetics. We conduct a user experiment and show that such graphs can aid humans in speeding up the search for story-events in a video. Our third important contribution is in the field of assessing machine understanding. Here, we create a large scale question-answering (QA) data set based on movie stories. The data set not only covers simple visual aspects such as “Who”, “What”, and “Where”, but also requires long-range temporal reasoning to answer “Why” and “How” questions. A unique aspect of our QA data set is that answering can be performed using text sources (e.g. plots, subtitles) or video clips. The data set is made available as part of a benchmark challenge. Further, we analyze data set bias, explore the quality of our multiple-choice questions, and propose several techniques for answering. In addition to the primary contributions, we also work on analyzing and creating better meta-data for the videos. In particular, we propose new techniques for scene boundary detection, and improve person identification in TV series. Kurzzusammenfassung Geschichten sind ein Höhepunkt menschlicher Kreativität und noch immer ein uni- verselles Phänomen. Ein wesentlicher Teil der menschlichen Kommunikation besteht aus dem Lesen, Erzählen und Anhören von Geschichten. Heutzutage geschieht dies meist in einer modernen Form, als Fernsehserien und -filme. Von den verschiedenen Möglichkeiten eine Geschichte zu erzählen, sind diese ein sehr leistungsfähiges Medium, da sie mehrere menschliche Sinne beteiligen können. Im letzten Jahrzehnt hat die Entwicklung der Künstlichen Intelligenz (KI) große Fortschritte gemacht, wodurch mehrere Entwicklungen in der Sprach- und Bildverarbeitung möglich wurden. Dennoch wurden diese Fachgebiete bisher hauptsächlich getrennt voneinander untersucht, erst in den letzten Jahren änderte sich dies. Wir betrachten die Fähigkeit, Geschichten zu analysieren, zu verstehen und neu zu erschaffen, als einen wichtigen Schritt auf dem Weg zur Entwicklung starker KI – das heißt einer Maschine, die jegliche in- tellektuelle Aufgabe wie ein Mensch lösen kann. Für die Analyse und das Verständnis von Geschichten wollen wir dabei auf den großen Fortschritten im Bereich der automatischen Sprach- und Bildanalyse aufbauen. In dieser Dissertation definieren wir Maschinelles Verständnis von Geschichten als die Fähigkeit einer Maschine, wie ein Mensch mit Geschichten umzugehen, das heißt zum Beispiel, die Fähigkeit bestimmte Handlungen und Ereignisse zu erkennen und wieder finden zu können, sowie die Fähigkeit Geschichten zusammen zu fassen und Fragen über die Geschichte beantworten zu können. Als Anwendungsbeispiele betrachten wir dabei Fernsehserien und Spielfilme. Wir betrachten das Problem des Verständnisses von Geschichten aus drei verschiedenen Blickwinkeln. Zuerst führen wir die Nutzung neuartiger Bezugsquellen von Video- vi Beschreibungen in natürlicher Sprache ein, welche uns ein besseres automatisches Ver- ständnis des Videoinhaltes ermöglichen. Dann schlagen wir eine Visualisierungstechnik vor, um einen Überblick der Geschichte zu bekommen. Abschließend stellen wir die automatische Beantwortung von Fragen zu Filmen als eine Methode vor, mit deren Hilfe das maschinelle Verständnis von Geschichten bewertet werden kann. Texte, wie z.B. Untertitel und Filmskripte, sind eine exzellente Quelle an ergänzenden Informationen um Videos zu verstehen. Wir stellen zwei unterschiedliche natürlich- sprachige Textquellen vor: Synopsen (plot synopses) und Bücher. Synopsen sind kurze Zusammenfassungen von Serien oder Filmen, sie sind für viele Filme und Serien erhältlich. Im Gegensatz dazu sind Bücher lange Texte, die Ereignisse (Charaktere, Szenen und Inter- aktionen) mit zahlreichen Details beschreiben. Um ihr Potenzial zur inhaltlichen Analyse der Videos zu entfalten, müssen die einzelnen Textabschnitte dieser Textquellen dabei zunächst den Szenen eines Videos zugeordnet werden. Wir erstellen Ähnlichkeitsfunktio- nen, um diese Lücke zwischen Text und Video schließen. Hiermit ordnen wir die Sätze der Synopsen den einzelnen Video-Einstellungen (shots), sowie einzelne Buchkapitel den Videoszenen, automatisch zu. Wir entwickeln hierzu verschiedene Alignment-Modelle, welche unsere Ähnlichkeitsfunktionen maximieren, dabei aber den Ablauf der Geschichte nicht außer Acht lassen. Wir testen diese Ansätze an zwei Datenquellen, sowohl für Synopsen als auch für Bücher und können dabei vielversprechende Ergebnisse für das Alignment erzielen. Dies ermöglicht eine Fülle von Anwendungen, wie die Beschreibung von Video-Clips durch Teile der Synopse oder dem Buch, das Auffinden von Ereignissen in Videos unter Nutzung der Synopse als Zwischenschritt und auch die Fähigkeit fest zu stellen, ob eine Szene einer Videoadaptierung im Buch vorhanden ist. Unser zweiter Ansatz zur Verbesserung des Verständnisses einer Geschichte erfolgt durch Visualisierung. Wir generieren so gennante StoryGraphs, Diagramme, welche die Interaktionen zwischen Personen in einer Folge darstellen und ergänzen diese mit Informationen zu wichtigen Ereignissen. Die Anordnung der Grafik wird dabei als ein Optimierungsproblem betrachtet, welches funktionale und ästhetische Aspekte abwägt. Wir führen eine Benutzerstudie durch und zeigen, dass derartige Grafiken dem Menschen dabei helfen, gesuchte Ereignisse im Video schneller zu finden. Unser dritter wichtiger Beitrag ist im Bereich der Bewertung von Maschinellem Ver-
Recommended publications
  • Some Strands of Wittgenstein's Normative Pragmatism
    Some Strands of Wittgenstein’s Normative Pragmatism, and Some Strains of his Semantic Nihilism ROBERT B. BRANDOM ABSTRACT WORK TYPE In this reflection I address one of the critical questions this monograph is Article about: How to justify proposing yet another semantic theory in the light of Wittgenstein’s strong warnings against it. I see two clear motives for ARTICLE HISTORY Wittgenstein’s semantic nihilism. The first one is the view that philosophical Received: problems arise from postulating hypothetical entities such as ‘meanings’. To 27–January–2018 dissolve the philosophical problems rather than create new ones, Wittgenstein Accepted: suggests substituting ‘meaning’ with ‘use’ and avoiding scientism in philosophy 3–March–2018 together with the urge to penetrate in one's investigation to unobservable depths. I believe this first motive constitutes only a weak motive for ARTICLE LANGUAGE Wittgenstein’s quietism, because there are substantial differences between English empirical theories in natural sciences and semantic theories in philosophy that KEYWORDS leave Wittgenstein’s assimilation of both open to criticism. But Wittgenstein is Meaning and Use right, on the second motive, that given the dynamic character of linguistic Hypothetical Entities practice, the classical project of semantic theory is a disease that can be Antiscientism removed or ameliorated only by heeding the advice to replace concern with Semantic Nihilism meaning by concern with use. On my view, this does not preclude, however, a Linguistic Dynamism different kind of theoretical approach to meaning that avoids the pitfalls of the Procrustean enterprise Wittgenstein complained about. © Studia Humanitatis – Universidad de Salamanca 2019 R. Brandom (✉) Disputatio. Philosophical Research Bulletin University of Pittsburgh, USA Vol.
    [Show full text]
  • Semantic Analysis of the First Cities from a Deconstruction Perspective Doi: 10.23968/2500-0055-2020-5-3-43-48
    Mojtaba Valibeigi, Faezeh Ashuri— Pages 43–48 SEMANTIC ANALYSIS OF THE FIRST CITIES FROM A DECONSTRUCTION PERSPECTIVE DOI: 10.23968/2500-0055-2020-5-3-43-48 SEMANTIC ANALYSIS OF THE FIRST CITIES FROM A DECONSTRUCTION PERSPECTIVE Mojtaba Valibeigi*, Faezeh Ashuri Buein Zahra Technical University Imam Khomeini Blvd, Buein Zahra, Qazvin, Iran *Corresponding author: [email protected] Abstract Introduction: Deconstruction is looking for any meaning, semantics and concepts and then shows how all of them seem to lead to chaos, and are always on the border of meaning duality. Purpose of the study: The study is aimed to investigate urban identity from a deconstruction perspective. Since two important bases of deconstruction are text and meaning and their relationships, we chose the first cities on a symbolic level as a text and tried to analyze their meanings. Methods: The study used a deductive content analysis in three steps including preparation, organization and final report or conclusion. In the first step, we argued deconstruction philosophy based on Derrida’s views accordingly. Then we determined some common semantic features of the first cities. Finally, we presented some conclusions based on a semantic interpretation of the first cities’ identity.Results: It seems that all cities as texts tend to provoke a special imaginary meaning, while simultaneously promoting and emphasizing the opposite meaning of what they want to show. Keywords Deconstruction, text, meaning, urban semantics, the first cities. Introduction to have a specific meaning (Gualberto and Kress, 2019; Expressions are an act of recognizing or displaying Leone, 2019; Stojiljković and Ristić Trajković, 2018).
    [Show full text]
  • Matrix Decompositions and Latent Semantic Indexing
    Online edition (c)2009 Cambridge UP DRAFT! © April 1, 2009 Cambridge University Press. Feedback welcome. 403 Matrix decompositions and latent 18 semantic indexing On page 123 we introduced the notion of a term-document matrix: an M N matrix C, each of whose rows represents a term and each of whose column× s represents a document in the collection. Even for a collection of modest size, the term-document matrix C is likely to have several tens of thousands of rows and columns. In Section 18.1.1 we first develop a class of operations from linear algebra, known as matrix decomposition. In Section 18.2 we use a special form of matrix decomposition to construct a low-rank approximation to the term-document matrix. In Section 18.3 we examine the application of such low-rank approximations to indexing and retrieving documents, a technique referred to as latent semantic indexing. While latent semantic in- dexing has not been established as a significant force in scoring and ranking for information retrieval, it remains an intriguing approach to clustering in a number of domains including for collections of text documents (Section 16.6, page 372). Understanding its full potential remains an area of active research. Readers who do not require a refresher on linear algebra may skip Sec- tion 18.1, although Example 18.1 is especially recommended as it highlights a property of eigenvalues that we exploit later in the chapter. 18.1 Linear algebra review We briefly review some necessary background in linear algebra. Let C be an M N matrix with real-valued entries; for a term-document matrix, all × RANK entries are in fact non-negative.
    [Show full text]
  • Linguistic Relativity Hyp
    THE LINGUISTIC RELATIVITY HYPOTHESIS by Michele Nathan A Thesis Submitted to the Faculty of the College of Social Science in Partial Fulfillment of the Requirements for the Degree of Master of Arts Florida Atlantic University Boca Raton, Florida December 1973 THE LINGUISTIC RELATIVITY HYPOTHESIS by Michele Nathan This thesis was prepared under the direction of the candidate's thesis advisor, Dr. John D. Early, Department of Anthropology, and has been approved by the members of his supervisory committee. It was submitted to the faculty of the College of Social Science and was accepted in partial fulfillment of the requirements for the degree of Master of Arts. SUPERVISORY COMMITTEE: &~ rl7 IC?13 (date) 1 ii ABSTRACT Author: Michele Nathan Title: The Linguistic Relativity Hypothesis Institution: Florida Atlantic University Degree: Master of Arts Year: 1973 Although interest in the linguistic relativity hypothesis seems to have waned in recent years, this thesis attempts to assess the available evidence supporting it in order to show that further investigation of the hypothesis might be most profitable. Special attention is paid to the fact that anthropology has largely failed to substantiate any claims that correlations between culture and the semantics of language do exist. This has been due to the impressionistic nature of the studies in this area. The use of statistics and hypothesis testing to provide mor.e rigorous methodology is discussed in the hope that employing such paradigms would enable anthropology to contribute some sound evidence regarding t~~ hypothesis. iii TABLE OF CONTENTS Page Introduction • 1 CHAPTER I THE.HISTORY OF THE FORMULATION OF THE HYPOTHESIS.
    [Show full text]
  • Philosophy of Language in the Twentieth Century Jason Stanley Rutgers University
    Philosophy of Language in the Twentieth Century Jason Stanley Rutgers University In the Twentieth Century, Logic and Philosophy of Language are two of the few areas of philosophy in which philosophers made indisputable progress. For example, even now many of the foremost living ethicists present their theories as somewhat more explicit versions of the ideas of Kant, Mill, or Aristotle. In contrast, it would be patently absurd for a contemporary philosopher of language or logician to think of herself as working in the shadow of any figure who died before the Twentieth Century began. Advances in these disciplines make even the most unaccomplished of its practitioners vastly more sophisticated than Kant. There were previous periods in which the problems of language and logic were studied extensively (e.g. the medieval period). But from the perspective of the progress made in the last 120 years, previous work is at most a source of interesting data or occasional insight. All systematic theorizing about content that meets contemporary standards of rigor has been done subsequently. The advances Philosophy of Language has made in the Twentieth Century are of course the result of the remarkable progress made in logic. Few other philosophical disciplines gained as much from the developments in logic as the Philosophy of Language. In the course of presenting the first formal system in the Begriffsscrift , Gottlob Frege developed a formal language. Subsequently, logicians provided rigorous semantics for formal languages, in order to define truth in a model, and thereby characterize logical consequence. Such rigor was required in order to enable logicians to carry out semantic proofs about formal systems in a formal system, thereby providing semantics with the same benefits as increased formalization had provided for other branches of mathematics.
    [Show full text]
  • A History of Semantics Naming
    1/27 A history of semantics [in Nick Riemer (ed.) Routledge Handbook of Semantics] Naming [M]an “makes” his territory by naming the “things” in it. (Chatwin 1988: 301) Human beings name things in their environment. The name helps to distinguish and identify the denotatum (thing named) and is essential to communication with fellow humans about 1 such denotata. In Plato’s Cratylus (Plato 1997) c. 385 BCE, Socrates advances the hypothesis that the earliest name-giver (onomatourgos) selected a name that captures the essence of its denotatum, that is in some way iconic as with onomatopoeic bird names like cuckoo or whippoorwill. On this hypothesis the meaning of a word would be ‘natural’ because directly recognizable from the form of the word. Many of the Ancients sought to demonstrate that names are far more descriptive than the facts allow. For example Socrates in Cratylus 406c derives the name Dionusos (god of Bacchanalia) from didous ton oinon ‘giving wine’. In De lingua latina V: 101 (c. 45 BCE) Varro suggests that because the fox is fleet-footed, volpes ‘fox’ is a blend of volare ‘fly’ and pes ‘foot’ (Varro 1938). Isidore of Seville suggested c. 625 CE that oratio ‘utterance’ derives from oris ratio ‘the mouth’s reason’. None of these is correct and many such ‘etymologies’ are utterly absurd (see Allan 2010). Indeed, the implausibility of such accounts was recognized by Socrates in Cratylus (426b-427b, 434e- 435c), but a clear statement that names are symbols which denote by convention is first found some 25 years after Cratylus in Aristotle’s On Interpretation 16a3, 16a20 (Aristotle 1984).
    [Show full text]
  • 11 Semantics
    "'If" '.u' $. Semantics 115 ~' used to describe a product, with 'good for you', but we would not want to ., mclude this association within the basic conceptual meaning of the expres­ sion. Poets and advertisers are, of course, very interested in using terms in such a way that their associative meanings are evoked, and some linguists do lfivestigate this aspect of language use. However, in this chapter we shall be more interested in characterizing what constitutes the conceptual meaning vi terms. Semantic features , So, how would a semantic approach help us to understand something about 11 Semantics Ihe nature of language? One way it might be helpful would be as a means of accounting for the 'oddness' we experience when we read English sentences such as the following: I once referred to a character in one of my cartoons as a "dork" (a popular j The hamburgerate the man insult when I was growing up), but myeditor called me up and said that "dork" My cat studiedlinguistics couldn't be used because it meant "penis." I couldn't believe it. I ran to my A table was listening to some music New Dictionary ofAmerican Slang and, sure enough, he was right. Allthose years of saying or being called a "dork" and I had never really known what it ~otice that the oddness of these sentences does not derive from their syn­ meant. What a nerd. Gary Larson (1989) tactic structure. According to some basic syntactic rules for forming English sentences (such as those presented in Chapter 10), we have well-structured Semantics is the study of the meaning of words, phrases and sentences.
    [Show full text]
  • The Leibnizian Vision in Frank Ankersmit's Philosophy of History
    INTENSION, SUBSTANCE AND CALCULUS – THE LEIBNIZIAN VISION IN FRANK ANKERSMIT’S PHILOSOPHY OF HISTORY Oulu University Faculty of History Master’s thesis 20.5.2019 Aleksi Oja 1 Due to the abundance of referral to the extensive work of Frank Ankersmit and my general reliance on his published books and selected papers as sources, I ask the reader to use the following list of abbreviations of some of the most featured works: NL – Ankersmit, F. R. 1983: Narrative Logic – a Semantic Analysis of the Historian’s Language. Martinus Nijhoff publishers, The Hague, Netherlands. HT – Ankersmit, F. R. 1994. History and Tropology: The Rise and Fall of Metaphor. University of California Press. London, England. HR – Ankersmit, F. R. 2001: Historical Representation. Stanford University Press, California, USA. PR – Ankersmit, F. R. 2002: Political Representation. Stanford University Press, California, USA. SHE – Ankersmit, F. R. 2005: Sublime Historical Experience. Stanford University Press, California, USA. MTR – Ankersmit F. R. 2012: Meaning, Truth and Reference in Historical Representation. Stanford University Press, California, USA. HSI – Ankersmit, F. R. 2013: History as the Science of the Individual. Journal of the Philosophy of History Vol. 7 (3), 396 – 425. WEM – Ankersmit, F. R. 2017: Where the extremes meet. A presently still an unpublished paper, given as the handout for Frank Ankersmit’s opening speech at the Seminar of philosophy of history in October 2017 in Oulu university. 2 Table of Contents Introduction ..............................................................................................................................
    [Show full text]
  • Semantic Analysis on the Use of English Language Slogan
    SEMANTIC ANALYSIS ON THE USE OF ENGLISH LANGUAGE SLOGAN 1Andi Asrifan 2Like Raskova Octaberlina 3Rini Handayani 1,3Universitas Muhammadiyah Sidenreng Rappang 2Universitas Islam Negeri Maulana Malik Ibrahim, Malang, Indonesia Abstract In this study the objectives to be achieved are: (1) To identify what are the types of English slogans. (2) To identify how the semantic meanings of English slogans. The type of research used is qualitative. While the methods and instruments used to collect and analyze and answer all existing problems are qualitative methods and checklist observations. The process of data analysis in this research is carried out in a process starting from data collection and done intensively until the end of the study. Data that researchers get from observations from several sources such as the internet and journals, namely: (1) in general there are 4 types of slogans such as product / advertising slogans, health slogans, environmental slogans and educational slogans. The types of product / advertising slogans are 6 slogans, health slogans 2, environmental slogans 5 and education slogans totaling 8. (2) semantic meanings of 4 types of slogans from internet and journal sources there are 5 namely conceptual meaning, associative meaning, connotative meaning, affective meaning and stylist. Of the 20 slogans education slogans and products that contain more conceptual meaning. The environmental slogans contain more conceptual and affective meanings. Slogans that contain conceptual meaning in the form of a situation context and based on words that have been conceptualized in the dictionary. Key Words: Analysis Semantic, Semantic Meaning, English Slogan. Language is an important component in life so it is not excessive when language is termed as the heart of human life, because language is a very important role in life to support the interaction process.
    [Show full text]
  • Semantic Change, Semantic Theory and Linguistic Relativity
    Gábor Györi Semantic Change, Semantic Theory and Linguistic Relativity Series A: General & Theoretical Papers ISSN 1435-6473 Essen: LAUD 1998 (2., unveränderte Auflage 2006) Paper No. 444 Universität Duisburg-Essen Gábor Györi Jannus Pannonius University, Pecs (Hungary) Semantic Change, Semantic Theory and Linguistic Relativity Copyright by the author Reproduced by LAUD 1998 (2., unveränderte Auflage 2006) Linguistic Agency Series A University of Duisburg-Essen General and Theoretical FB Geisteswissenschaften Paper No. 444 Universitätsstr. 12 D- 45117 Essen Order LAUD-papers online: http://www.linse.uni-due.de/linse/laud/index.html Or contact: [email protected] Gábor Györi Semantic Change, Semantic Theory and Linguistic Relativity 0. Abstract The most general cause usually given for semantic and lexical change is the arising of new communicative needs in a speech community. However, communication is also a cognitive activity: we communicate our mental states, the contents of our minds, etc. Communicative needs must go hand in hand with cognitive needs, since there is no linguistic communication without the underlying mental representations. Because of this, semantic change can only be accounted for by a semantic theory which takes into account human conceptualization as a product of our understanding of bodily experience, as does cognitive semantics. Semantic change exploits and utilizes the common understanding and interpretation of experience which exists on the basis of conventionalized conceptual structures shared by the members of a speech community. However, while the semantic structure of a language is the product of conceptualization processes, it is also true that the input for these conceptualization processes is at least partly the semantic structure of that language.
    [Show full text]
  • An Introduction to Latent Semantic Analysis
    1 Running head: INTRODUCTION TO LATENT SEMANTIC ANALYSIS An Introduction to Latent Semantic Analysis Thomas K Landauer Department of Psychology University of Colorado at Boulder, Peter W. Foltz Department of Psychology New Mexico State University Darrell Laham Department of Psychology University of Colorado at Boulder, Landauer, T. K., Foltz, P. W., & Laham, D. (1998). Introduction to Latent Semantic Analysis. Discourse Processes, 25, 259-284. Introduction to Latent Semantic Analysis 2 Abstract Latent Semantic Analysis (LSA) is a theory and method for extracting and representing the contextual-usage meaning of words by statistical computations applied to a large corpus of text (Landauer and Dumais, 1997). The underlying idea is that the aggregate of all the word contexts in which a given word does and does not appear provides a set of mutual constraints that largely determines the similarity of meaning of words and sets of words to each other. The adequacy of LSA’s reflection of human knowledge has been established in a variety of ways. For example, its scores overlap those of humans on standard vocabulary and subject matter tests; it mimics human word sorting and category judgments; it simulates word–word and passage–word lexical priming data; and, as reported in 3 following articles in this issue, it accurately estimates passage coherence, learnability of passages by individual students, and the quality and quantity of knowledge contained in an essay. Introduction to Latent Semantic Analysis 3 An Introduction to Latent Semantic Analysis Research reported in the three articles that follow—Foltz, Kintsch & Landauer (1998/this issue), Rehder, et al. (1998/this issue), and Wolfe, et al.
    [Show full text]
  • A Pragma-Stylo-Semantic Analysis of Proverbs 26, Verses 4 and 5
    Ghana Journal of Linguistics 5.1: 55-75 (2016) http://dx.doi.org/10.4314/gjl.v5i1.3 A PRAGMA-STYLO-SEMANTIC ANALYSIS OF PROVERBS 26, VERSES 4 AND 5 Joseph Omoniyi Friday-Otun Abstract Although, the majority view of scholars supports the position that the Proverbs of Solomon are influenced by the Instructions of Amenemope of ancient Egypt, the book of Proverbs is believed by Christians to be one of the poetic books of the Bible known for words of wisdom. The first twelve verses of chapter twenty-six are devoted to the fool. With the possibility of being drawn into conversation with the fool, verses four and five become instructive to the wise. The purpose of this study is to demonstrate that a pragma-stylo-semantic interpretation of these verses cancels out the overt contradiction. The semantic and pragmatic presuppositions, as well as the stylistic features of the verses are unravelled to arrive at the underlying connections that unify their interpretations. Also, time-lapse or gap is considered fundamental and analysed using the concept of “appropriateness” in terms of possible compliance with the instructions in the two verses by a single agent. The paper proposes a schematic diagram supportive of the concept of "appropriateness”. The various levels of linguistic, situational, psychological, sociological and cosmological contexts of the text have been examined with the illocutionary forces of the text investigated. These are done with the purpose of arriving at a "global" or "total" meaning. At the end, the analysis indicates a non-contradictory, context- determined phenomenon. Key words: Pragmatics, Stylistics, Semantics, The book of Proverbs, Amenemope, Contexts Introduction Proverbs, Chapter Twenty-Six: V.4: Answer not a fool according to his folly, lest thou also be like him.
    [Show full text]