Generative Linguistics and Neural Networks at 60: Foundation, Friction, and Fusion
Recommended publications
- Paul Smolensky
Vita: PAUL SMOLENSKY
Department of Cognitive Science, 239A Krieger Hall, Johns Hopkins University, Baltimore, MD 21218-2685; (410) 516-5331; [email protected]; cogsci.jhu.edu/directory/paul-smolensky/
Home: 11824 Mays Chapel Road, Timonium, MD 21093-1821; (667) 229-9509
Born May 5, 1955. Citizenship: USA.

DEGREES
Ph.D. in mathematical physics, Indiana University, 1981. M.S. in physics, Indiana University, 1977. A.B. summa cum laude in physics, Harvard University, 1976.

PROFESSIONAL POSITIONS
Partner Researcher, Microsoft Research Artificial Intelligence, Redmond, WA, Dec. 2016–present.
Krieger-Eisenhower Professor of Cognitive Science, Johns Hopkins University, 2006–present.
Full Professor, Department of Cognitive Science, Johns Hopkins University, 1994–2006.
Chair, Department of Cognitive Science, Johns Hopkins University, Jan. 1997–June 1998 (Acting), July 1998–June 2000.
Professor, Department of Computer Science, University of Colorado at Boulder: Full Professor, 1994–95 (on leave, 1994–95); Associate Professor, 1990–94; Assistant Professor, 1985–90.
Assistant Research Cognitive Scientist (Assistant Professor – Research), Institute for Cognitive Science, University of California at San Diego, 1982–85.
Visiting Scholar, Program in Cognitive Science, University of California at San Diego, 1981–82.
Adjunct Professor, Department of Linguistics, University of Maryland at College Park, 1994–2010.
Assistant Director, Center for Language and Speech Processing, Johns Hopkins University, 1995–2008.
Director, NSF IGERT Training Program, Unifying the Science of Language, 2006–2015.
Director, NSF IGERT Training Program in the Cognitive Science of Language, 1999–2006.
International Chair, Inria Paris (National Institute for Research in Computer Science and Automation), 2017–2021.
Visiting Scientist, Inserm-CEA Cognitive Neuroimaging Unit, NeuroSpin Center, Paris, France, 2016.
- Scaffolding Learning of Language Structures with Visual-Syntactic Text Formatting
UC Irvine: Previously Published Works. Permalink: https://escholarship.org/uc/item/6235t25b
Journal: British Journal of Educational Technology, 50(4); ISSN 0007-1013; DOI: 10.1111/bjet.12689. Publication date: 2018. Peer reviewed.
Authors: Youngmin Park, Ying Xu, Penelope Collins, George Farkas and Mark Warschauer.

Youngmin Park is a lecturer at Pusan National University, Korea. She received her Ph.D. degree from the University of California, Irvine, specializing in language, literacy and technology. Ying Xu is a Ph.D. student at the University of California, Irvine, specializing in language, literacy and technology. Penelope Collins is an associate professor at the University of California, Irvine. Her research examines the development of language and literacy skills for children from linguistically diverse backgrounds. George Farkas is a professor at the University of California, Irvine. He has employed a range of statistical approaches and databases to examine the causes and consequences of the reading achievement gap across varying age groups and educational settings. Mark Warschauer is a professor at the University of California, Irvine. He works on a range of research projects related to digital media in education.

Address for correspondence: Ying Xu, University of California Irvine, 3200 Education Bldg, Irvine, CA 92697, USA.
- Asymmetries in Long-Distance Dependencies: A View from Gradient Harmonic Grammar
Hyunjung Lee & Gereon Müller (Universität Leipzig)
Workshop on Long-Distance Dependencies, HU Berlin, October 4–5, 2018

1. Introduction
Claim: Gradient Harmonic Grammar (Smolensky & Goldrick (2016)) offers a new perspective on how to derive three different types of asymmetries as they can be observed with long-distance dependencies in the world's languages:
• asymmetries between movement types
• asymmetries between types of moved items
• asymmetries between types of local domain
Background assumptions: (i) Harmonic Grammar; (ii) gradient representations; (iii) harmonic serialism.

1.1. Harmonic Grammar
Harmonic Grammar (Smolensky & Legendre (2006), Pater (2016)): a version of optimality theory …

1.2. Gradient Representations
… constituent class membership to a degree, and presupposes that instead of standard category symbols like [X], there are weighted category symbols like [αX] (where α ranges over the real numbers in [0,1]). Rules, filters, and other syntactic building blocks are given upper and lower threshold values of α between which they operate.
Note: This way, the concept of varying strength of syntactic categories (see Chomsky (2015) for a recent reappraisal) can be formally implemented in the grammar.
Observation: So far, most of the work on GHG has been in phonology (e.g., Zimmermann (2017), Faust & Smolensky (2017), Kushnir (2018)); but cf. Smolensky (2017), Müller (2017b), Lee (2018) for syntactic applications.

1.3. Harmonic Serialism
Note: Harmonic serialism is a strictly derivational version of optimality theory.
(3) Harmonic serialism (McCarthy (2008), Heck & Müller (2013)):
a. Given some input Ii, the candidate set CSi = {Oi1, Oi2, … Oin} is generated by applying at most one operation to Ii.
b. …
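The formal core of the proposal can be made concrete in a few lines. The sketch below is illustrative only: the constraint names, weights, and activity values are invented rather than taken from the handout, and it assumes the usual GHG convention that harmony is the negated sum of weighted violations, with each violation scaled by the activity α of the offending category.

```python
# A minimal Gradient Harmonic Grammar sketch. All names and numbers are
# invented for illustration; only the arithmetic (weighted, activity-scaled
# violations summed into a harmony score) follows the GHG idea.

CONSTRAINT_WEIGHTS = {"ISLAND": 3.0, "DEP": 1.0}

def harmony(violations):
    """Harmony = -(sum of constraint weight * activity-scaled violation)."""
    return -sum(CONSTRAINT_WEIGHTS[c] * v for c, v in violations.items())

# Two hypothetical candidates: extraction out of a full-strength clause
# [1.0 CP] vs. out of a gradient, weakened one [0.4 CP]. The ISLAND
# violation is scaled by the activity of the crossed category.
candidates = {
    "extract-from-[1.0 CP]": {"ISLAND": 1.0, "DEP": 1.0},
    "extract-from-[0.4 CP]": {"ISLAND": 0.4, "DEP": 1.0},
}

best = max(candidates, key=lambda name: harmony(candidates[name]))
print(best)  # the weakened category incurs the smaller penalty
```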
- Chapter 7: Linguistics as a Science of Structure (Ryan M. Nefdt)
Ryan M. Nefdt, University of the Western Cape

Generative linguistics has changed rapidly over a relatively short period. This has caused many to question its status as a realist scientific theory (Stokhof & van Lambalgen 2011; Lappin et al. 2000). In this chapter, I argue against this conclusion. Specifically, I claim that the mathematical foundations of the science present a different story below the surface. I agree with critics that, due to the major shifts in theory over the past 80 years, linguistics is indeed open to the problem of pessimistic meta-induction, or radical theory change. However, I further argue that a structural realist approach (Ladyman 1998; French 2006) can save the field from this problem and at the same time capture its structural nature. I discuss particular historical instances of theory change in generative grammar as evidence for this interpretation, and finally attempt to extend it beyond the generative tradition to encompass previous frameworks in linguistics.

1 Introduction
The generativist revolution in linguistics started in the mid-1950s, inspired in large part by insights from mathematical logic and in particular proof theory. Since then, generative linguistics has become a dominant paradigm, with many connections to both the formal and natural sciences. At the centre of the newly established discipline was the syntactic or formal engine, the structures of which were revealed through modelling grammatical form. The generativist paradigm in linguistics initially relied heavily upon the proof-theoretic techniques introduced by Emil Post and other formal logicians to model the form language takes (Tomalin 2006; Pullum 2011; 2013).¹ Yet despite these aforementioned formal beginnings, the generative theory of linguistics has changed its commitments quite …

¹ Here my focus will largely be on the formal history of generative syntax, but I will make some comments on other aspects of linguistics along the way.
- Harmonic Grammar with Linear Programming: From Linear Systems to Linguistic Typology
To appear in Phonology 27(1): 1–41 (February 18, 2010)
Christopher Potts (Stanford University), Joe Pater (UMass Amherst), Karen Jesney (UMass Amherst), Rajesh Bhatt (UMass Amherst), Michael Becker (Harvard University)

Abstract: Harmonic Grammar (HG) is a model of linguistic constraint interaction in which well-formedness is calculated in terms of the sum of weighted constraint violations. We show how linear programming algorithms can be used to determine whether there is a weighting for a set of constraints that fits a set of linguistic data. The associated software package OT-Help provides a practical tool for studying large and complex linguistic systems in the HG framework and comparing the results with those of OT. We first describe the translation from harmonic grammars to systems solvable by linear programming algorithms. We then develop an HG analysis of ATR harmony in Lango that is, we argue, superior to the existing OT and rule-based treatments. We further highlight the usefulness of OT-Help, and the analytic power of HG, with a set of studies of the predictions HG makes for phonological typology.

Keywords: Harmonic Grammar, Optimality Theory, linear programming, typology, Lango, ATR harmony, positional markedness, positional faithfulness

1 Introduction
We examine a model of grammar that is identical to the standard version of Optimality Theory (OT: Prince & Smolensky 1993/2004), except that the optimal …

* Our thanks to Ash Asudeh, Tim Beechey, Maitine Bergonioux, Paul Boersma, John Colby, Kathryn Flack, Edward Flemming, Bob Frank, John Goldsmith, Maria Gouskova, Bruce Hayes, René Kager, Shigeto Kawahara, John Kingston, John McCarthy, Andrew McKenzie, Ramgopal Mettu, Alan Prince, Kathryn Pruitt, Jason Riggle, Jim Smith, and Paul Smolensky, and other participants in conferences and courses where this material was presented.
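The translation the abstract describes, from a harmonic grammar to a linear system, can be sketched briefly. The following is not the OT-Help implementation; it is a minimal illustration with invented violation data, using scipy's linprog to look for nonnegative constraint weights under which each attested winner beats its competitor by at least a unit margin.

```python
# A minimal sketch of weight-finding for Harmonic Grammar via linear
# programming. The tableau data are invented for illustration.
import numpy as np
from scipy.optimize import linprog

# One row per winner~loser pair: violations(loser) - violations(winner),
# one column per constraint. Positive entries favor the winner.
diffs = np.array([
    [1, -1],   # loser has one more C1 violation, one fewer C2 violation
    [2,  0],   # loser has two more C1 violations
])

n_constraints = diffs.shape[1]

# Require diffs @ w >= 1 for every pair (each winner wins by a unit margin).
# linprog minimizes c @ w subject to A_ub @ w <= b_ub, so negate both sides.
result = linprog(
    c=np.ones(n_constraints),              # prefer small total weight
    A_ub=-diffs,
    b_ub=-np.ones(diffs.shape[0]),
    bounds=[(0, None)] * n_constraints,    # weights must be nonnegative
)

if result.success:
    print("consistent HG weighting:", result.x)
else:
    print("no weighting fits these data")
```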
- Syntactic Structures and Their Symbiotic Guests: Notes on Analepsis from the Perspective of On-line Syntax
Pragmatics 24:3, 533–560 (2014). International Pragmatics Association. DOI: 10.1075/prag.24.3.05aue
Peter Auer

Abstract
The empirical focus of this paper is on utterances that re-use syntactic structures from a preceding syntactic unit. Next utterances of this type are usually treated as (coordination) ellipsis. It is argued that from an on-line perspective on spoken syntax, they are better described as structural latency: a grammatical structure already established remains available and can therefore be made use of with one or more of its slots being filled by new material. A variety of cases of this particular kind of conversational symbiosis are discussed. It is argued that they should receive a common treatment. A number of features of the general host/guest relationship are discussed.

Keywords: analepsis; on-line syntax; structural latency

"A glance into the first narrative that comes to hand, and a simple consideration, must prove that every earlier utterance of the narrator forms the exposition of all subsequent predicates." (Wegener 1885: 46; translated from the German original)

1. Introduction
The notion of 'ellipsis' is often regarded with some skepticism by interactional linguists: the orientation to 'full' sentences is all too obvious (cf. Selting 1997), and the idea that speakers produce complete sentences just in order to delete some parts of them afterwards surely fails to account for the processual dynamics of sentence production and understanding in time (cf. Kindt 1985). On the other hand, there can be little doubt that speakers often produce utterance units that could not be understood …

* I wish to thank Elizabeth Couper-Kuhlen and Susanne Günthner for their helpful comments on a previous version of this paper.
- LING83600: Context-free grammars
Kyle Gorman

1 Introduction
Context-free grammars (or CFGs) are a formalization of what linguists call "phrase structure grammars". The formalism is originally due to Chomsky (1956), though it has been independently discovered several times. The insight underlying CFGs is the notion of constituency. Most linguists believe that human languages cannot be even weakly described by CFGs (e.g., Shieber 1985). And in formal work, context-free grammars have long been superseded by "transformational" approaches which introduce movement operations (in some camps), or by "generalized phrase structure" approaches which add more complex phrase-building operations (in other camps). However, unlike more expressive formalisms, there exist relatively efficient cubic-time parsing algorithms for CFGs. Nearly all programming languages are described by—and parsed using—a CFG,¹ and CFG grammars of human languages are widely used as a model of syntactic structure in natural language processing and understanding tasks.

Bibliographic note: This handout covers the formal definition of context-free grammars; the Cocke-Younger-Kasami (CYK) parsing algorithm will be covered in a separate assignment. The definitions here are loosely based on chapter 11 of Jurafsky & Martin, who in turn draw from Hopcroft & Ullman (1979).

2 Definitions
2.1 Context-free grammars
A context-free grammar G is a four-tuple ⟨N, Σ, R, S⟩ such that:
• N is a set of non-terminal symbols, corresponding to phrase markers in a syntactic tree.
• Σ is a set of terminal symbols, corresponding to words (i.e., X⁰s) in a syntactic tree.
• R is a set of production rules. …
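The four-tuple definition drops straight into code. The grammar fragment below is a toy invented for illustration, not taken from the handout; it encodes N, Σ, R and S as plain Python containers and derives a random terminal string from S.

```python
# A toy context-free grammar as the four-tuple (N, Sigma, R, S),
# with a naive top-down random derivation. Illustrative only.
import random

N = {"S", "NP", "VP"}                       # non-terminals (phrase markers)
Sigma = {"she", "John", "sleeps", "sees"}   # terminals (words)
R = {                                       # production rules
    "S":  [["NP", "VP"]],
    "NP": [["she"], ["John"]],
    "VP": [["sleeps"], ["sees", "NP"]],
}
S = "S"                                     # start symbol

def derive(symbol):
    """Rewrite a symbol until only terminals remain."""
    if symbol in Sigma:
        return [symbol]
    expansion = random.choice(R[symbol])    # pick one production for symbol
    return [t for part in expansion for t in derive(part)]

print(" ".join(derive(S)))  # e.g. "she sees John"
```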
What Is Optimality Theory? John J
University of Massachusetts Amherst: ScholarWorks@UMass Amherst, Linguistics Department Faculty Publication Series, January 2007
John J. McCarthy, University of Massachusetts, Amherst ([email protected])
Recommended citation: McCarthy, John J., "What is Optimality Theory?" (2007). Language and Linguistics Compass. 93. Retrieved from https://scholarworks.umass.edu/linguist_faculty_pubs/93

Abstract. Optimality Theory is a general model of how grammars are structured. This article surveys the motivations for OT, its core principles, and the basics of analysis. It also addresses some frequently asked questions about this theory and offers suggestions for further reading.

1. Introduction
In 1991, Alan Prince and Paul Smolensky began presenting their work on a new approach to language. By 1993, this new approach had a name — Optimality Theory — and it became known through their widely-circulated manuscript Optimality Theory: Constraint Interaction in Generative Grammar (hereafter, Prince and Smolensky (2004)). The impact of this work on the field of phonology was extensive and immediate; since 1993, it has also stimulated important research in syntax, semantics, sociolinguistics, historical linguistics, and other areas.
- Machines that Morph Logic: Neural Networks and the Distorted Automation of Intelligence as Statistical Inference (Matteo Pasquinelli)
JOURNAL > SITE 1: LOGIC GATE, THE POLITICS OF THE ARTIFACTUAL MIND | 2017

Perceptrons [artificial neural networks] are not intended to serve as detailed copies of any actual nervous system. They are simplified networks, designed to permit the study of lawful relationships between the organization of a nerve net, the organization of its environment, and the "psychological" performances of which the network is capable. Perceptrons might actually correspond to parts of more extended networks in biological systems… More likely, they represent extreme simplifications of the central nervous system, in which some properties are exaggerated, others suppressed.
—Frank Rosenblatt

No algorithm exists for the metaphor, nor can a metaphor be produced by means of a computer's precise instructions, no matter what the volume of organized information to be fed in.
—Umberto Eco

The term Artificial Intelligence is often cited in the popular press, as well as in art and philosophy circles, as an alchemic talisman whose functioning is rarely explained. The hegemonic paradigm to date (also crucial to the automation of labor) is not based on GOFAI (Good Old-Fashioned Artificial Intelligence, which never succeeded at automating symbolic deduction), but on the neural networks designed by Frank Rosenblatt back in 1958 to automate statistical induction. The text highlights the role of logic gates in the distributed architecture of neural networks, in which a generalized control loop affects each node of computation to perform pattern recognition.
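Rosenblatt's perceptron, the statistical-induction engine the essay opposes to GOFAI's symbolic deduction, fits in a dozen lines. The sketch below is schematic rather than Rosenblatt's 1958 formulation in detail: a single threshold unit learns the AND logic gate from labeled examples through error-driven weight updates.

```python
# A schematic perceptron: induction of a logic gate (AND) from data,
# rather than deduction from symbolic rules. Illustrative only.
import numpy as np

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])  # inputs
y = np.array([0, 0, 0, 1])                       # AND truth table

w = np.zeros(2)   # connection weights
b = 0.0           # bias (threshold) term

for _ in range(10):                        # a few passes over the data
    for xi, target in zip(X, y):
        prediction = int(w @ xi + b > 0)   # fire if weighted sum clears 0
        error = target - prediction
        w = w + error * xi                 # nudge weights toward the target
        b = b + error

print(w, b)  # a linear decision rule for AND, found by induction
```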
- Learning-Theoretic Linguistics: Some Examples from Phonology
Bruce Hayes, Department of Linguistics, UCLA (August 13, 2010)

Relation of this talk to this session
I'm not an expert on PDP at all! Inviting me here was quite a stretch. Common theme: learning about language through computational modeling.

Some PDP models
• Letter perception: McClelland and Rumelhart (1981), Rumelhart and McClelland (1982)
• Word recognition and naming: Seidenberg and McClelland (1989)
• Inflectional morphology: Rumelhart and McClelland (1986)
• Speech perception: McClelland and Elman (1986)
• Phonetic categories: Vallabha et al. (2007)

Linguistics in cognitive science
We're externally famous for taking hard-line nativist views. As we view ourselves: obsession with language data and its patterning.

A bit of generative linguistics
The logical problem of language acquisition is at the core of linguistic theory (see, e.g., Chomsky 1965, ch. 1). Premise: language acquisition is an extraordinary feat. Proposed explanation: Universal Grammar + learning mechanisms.

Kinds of UG
Hard UG: grammatical principles in the genome. Soft UG: aspects of language shaped by human nature, e.g. phonetically-grounded phonology (e.g. Hayes, Kirchner and Steriade 2004).

The modeling program for linguistic theory
Learning model – perhaps …
- History and Philosophy of Neural Networks
J. Mark Bishop

Abstract. This chapter conceives the history of neural networks as emerging from two millennia of attempts to rationalise and formalise the operation of mind. It begins with a brief review of early classical conceptions of the soul, seating the mind in the heart; then discusses the subsequent Cartesian split of mind and body, before moving to analyse in more depth the twentieth-century hegemony identifying mind with brain; the identity that gave birth to the formal abstractions of brain and intelligence we know as 'neural networks'. The chapter concludes by analysing this identity - of intelligence and mind with mere abstractions of neural behaviour - by reviewing various philosophical critiques of formal connectionist explanations of 'human understanding', 'mathematical insight' and 'consciousness'; critiques which, if correct, in an echo of Aristotelian insight, suggest that cognition may be more profitably understood not just as a result of [mere abstractions of] neural firings, but as a consequence of real, embodied neural behaviour, emerging in a brain, seated in a body, embedded in a culture and rooted in our world; the so-called 4Es approach to cognitive science: the Embodied, Embedded, Enactive, and Ecological conceptions of mind.

Contents
1. Introduction: the body and the brain
2. First steps towards modelling the brain
3. Learning: the optimisation of network structure
4. The fall and rise of connectionism
5. Hopfield networks
6. The 'adaptive resonance theory' classifier
7. The Kohonen 'feature-map'
8. The multi-layer perceptron
9. Radial basis function networks
10. …
- The Chomsky Enigma
Weekly Worker 655, Thursday January 11 2007

The Chomsky enigma
How is it that a powerful critic of US imperialism has been regarded as a valued asset by the US military? In the first of three articles, Chris Knight of the Radical Anthropology Group begins his examination of the life and work of Noam Chomsky.

Noam Chomsky ranks among the leading intellectual figures of modern times. He has changed the way we think about what it means to be human, gaining a position in the history of ideas - at least according to his supporters - comparable with that of Galileo, Descartes or Newton. Since launching his intellectual assault against the academic orthodoxies of the 1950s, he has succeeded - almost single-handedly - in revolutionising linguistics and establishing it as a modern science.

Such intellectual victories, however, have come at a cost. The stage was set for the "linguistics wars"¹ when Chomsky published his first book. He might as well have thrown a bomb. "The extraordinary and traumatic impact of the publication of Syntactic structures by Noam Chomsky in 1957," recalls one …