Quick viewing(Text Mode)

Fluid Construction Grammar for Historical and Evolutionary Linguistics Pieter Wellens1, Remi Van Trijp2, Katrien Beuls1, Luc Steels2,3

Fluid Construction Grammar for Historical and Evolutionary Linguistics Pieter Wellens1, Remi Van Trijp2, Katrien Beuls1, Luc Steels2,3

Fluid Construction Grammar for Historical and Evolutionary Pieter Wellens1, Remi van Trijp2, Katrien Beuls1, Luc Steels2,3

1VUB AI Lab 2Sony Computer Science 3 ICREA Institute for Pleinlaan 2 Laboratory Paris Evolutionary Biology (UPF-CSIC) 1050 Brussels (Belgium) 6 Rue Amyot PRBB, Dr Aiguidar 88 pieter|katrien@ 75005 Paris (France) 08003 Barcelona (Spain) ai.vub.ac.be [email protected] [email protected]

Abstract as HPSG (Pollard and Sag, 1994). A genera- tive grammar is a model of language competence Fluid Construction Grammar (FCG) is an that licenses well-formed structures and rejects ill- open-source computational grammar for- formed utterances. Such grammars often decide malism that is becoming increasingly pop- on the well- or ill-formedness of utterances by us- ular for studying the history and evolution ing a strong type system that defines a set of fea- of language. This demonstration shows tures and possible values for those features. The how FCG can be used to operationalise the burden of efficient and robust language process- cultural processes and cognitive mecha- ing with a largely rests on the nisms that underly language evolution and shoulders of the language processor. change. A cognitive-functional grammar, on the other hand, functions more like a transducer between 1 Introduction meaning and form. In parsing, such a grammar tries to uncover as much meaning as possible from has been radically trans- a given utterance rather than deciding on its gram- formed over the past two decades by the ad- maticality. In the other direction, the grammar vent of corpus-based approaches. Ever increas- tries to produce intelligible utterances, which are ing datasets, both in size and richness of anno- well-formed as a side-effect if the grammar ad- tation, are becoming available (Yuri et al., 2012; equately captures the conventions of a particular Davies, 2011), and linguists now have more pow- language. A cognitive-functional grammar can which erful tools at their disposal for uncovering best be implemented without a strong type system changes have taken place. In this demonstration, because the set of possible features and values for we present Fluid Construction Grammar (Steels, them is assumed to be open-ended. Efficient and 2011, FCG), an open-source grammar robust language processing also becomes a joint that makes it possible to also address the question responsibility of the grammar and the linguistic how of these changes happened by uncovering the processor. cognitive mechanisms and cultural processes that drive language evolution. 3 Reversible Language Processing FCG combines the expressive power of fea- As a construction grammar, FCG represents all ture structures and unification with the adaptiv- linguistic knowledge as pairings of function and ity and robustnes of machine learners. In sum, form (called constructions). This means that any FCG aims to be an open instrument for de- linguistic item, be it a concrete (see veloping robust and open-ended models of lan- Figure 1) or a schematic construction, shares the guage processing that can be used for both pars- same fundamental representation in FCG. ing and production. FCG can be downloaded at Each construction consists of two poles (a se- http://www.fcg-net.org. mantic/functional one and a syntactic/form one), 2 Design Philosophy each represented as a feature structure. By using a separate semantic and syntactic pole, FCG allows Fluid Construction Grammar is rooted in a the same construction to be efficiently parsed and cognitive-functional approach to language, which produced by the same processing engine by sim- is quite different from a generative grammar such ply changing the direction of application.

127 Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pages 127–132, Sofia, Bulgaria, August 4-9 2013. c 2013 Association for Computational Linguistics Babel web interface http://localhost:8000/

kim-lex (lex)

?top-unit-1611 Linguistic system 1 Linguistic system 2

tag ?meaning-849 ?top-unit-1611 (meaning (== footprints (==0 kim-lex lex) (identify-person sem syn ?kim-1 ?context-243 tag ?form-946 ?person-119) 1. Reconstruction 1. Reconstruction (bind person (form (== (string ?-kim-1 "Kim"))) ?person-119 [kim]))) footprints (==0 kim-lex lex)

?word-kim-1 ?word-kim-1 2. Individual Learning 2. Individual Learning → ?meaning-849 → ?form-946 (?kim-1) args footprints sem-cat ?top-unit-1611 ?top-unit-1611 (==1 kim-lex lex) ((sem-function referring) syn-cat (sem-class person)) ((lex-cat Population Population proper-noun) 3. 3. footprints (syn-function Alignment Alignment (==1 kim-lex lex) nominal))

reset 4. Grammaticalization Figure 1: Lexical construction for the proper noun “Kim” as shown in the FCG web interface. All constructions are mappings between semantic Figure 2: Schematic overview of the experimental (left) and syntactic feature structures (right). methodology for historical and evolutionary lin- guists. The example here shows only two linguis- tic stages but there could be more. FCG processing uses two different kinds of uni- fication called match and merge. The match phase is a conditional phase which checks for applicabil- actual linguistic change over time can be modeled ity of the construction. The merge operation most (van Trijp, 2010; Beuls and Steels, 2013; Wellens closely resembles classical (yet untyped) unifica- and Loetzsch, 2012). tion. In production (i.e. going from meaning to form), the processor will consider a construction’s 4 How to set up an evolutionary semantic pole as a set of conditions that need to be linguistics experiment in FCG? satisfied, and the syntactic pole as additional infor- As the FCG processor can both produce and 1 of 1 12/6/12 11:08 PM mation that can be contributed by the construction. parse utterances it is possible to instantiate not In parsing (i.e. going from form to meaning), the one but a set or population of FCG processors roles of the poles are reversed. (or FCG agents) that can communicatively inter- Since FCG pays a lot of attention to the inter- act with each other. Experiments in historical or action between linguistic knowledge and process- evolutionary linguistics make use of this multi- ing, it makes it possible to investigate the conse- agent approach where all agents engage in situated quences of particular aspects of grammar with re- pairwise interactions (language games) (Steels, gard to representation, production, parsing, learn- 2012b). ing and propagation (in a population of language In this systems demo we will focus on a re- users). For example, a small case system may be cent experiment in the emergence of grammatical easier to represent and produce than a large sys- agreement (Beuls and Steels, 2013). The language tem, but it might also lead to increased ambigu- game consists of two agents in which one agent ity in parsing and learning that the larger system (the speaker) has to describe one or more (max would avoid. Fluid Construction Grammar can three) objects in a scene to the other agent (the bring these differences to the surface for further hearer). Each object can be described by one or computational analysis. more . It follows that without any grammat- It is exactly this ability to monitor the impact of ical marking it would be difficult (often impossi- grammatical choices, that has sparked the interest ble) for the hearer to figure out which words de- of an increasingly wide audience of historical and scribe the same object and thus to arrive at a suc- evolutionary linguists. With FCG, different histor- cessful interpretation. The hypothesis is that the ical stages can be implemented (which addresses introduction of agreement markers helps solve this questions about representation and processing) but ambiguity. FCG also comes bundled with a reflective learn- Next to setting up a language game script the ing framework (Beuls et al., 2012) for learning the methodology consists of operationalizing the lin- key constructions of each stage. That same archi- guistic strategies required for a population to boot- tecture has proven to be adequately powerful to strap and maintain a particular linguistic system (in implement processes of grammaticalization so that this case nominal agreement). Examples of lin-

128 meta-layer processing repair

repair problem problem

diagnostic diagnostic diagnostic diagnostic

!" !"

routine processing

Figure 3: Reflective meta-layer architecture oper- ating as part of an FCG agent/processor. Figure 4: Meaningful marker strategy. guistic systems already investigated include Ger- so that one agent can learn the constructions man case (van Trijp, 2012a; van Trijp, 2013), based on the input of another agent. These the grammatical expression of space (Spranger learning operations are generally divided into and Steels, 2012), the emergence of quantifiers diagnostics and repair strategies (see Fig- (Pauw and Hilferty, 2012) and the expression of ure 3). Diagnostics continually monitor FCG aspect in Russian (Gerasymova et al., 2012) [for processing for errors or inefficiencies and an overview see (Steels, 2011; Steels, 2012a)]. generate problems if they are found. Repair An experiment generally investigates multi- strategies then act on these problems by al- ple linguistic systems of increasing complexity tering the linguistic inventory (e.g. adding, where each system can, but need not, map to a removing or changing constructions). stage along an attested grammaticalization path- way. Most often a stage is introduced in order Population Alignment: There exists a large gap to gradually increase the complexity of the emer- between the cognitive machinary needed for gent dynamics. In this demo we posit four sys- learning an existing linguistic system (step 2) tems/strategies, (1) a baseline purely lexical strat- and bootstrapping, aligning and maintaining egy, (2) a strategy to bootstrap and align formal a complete linguistic system from scratch. In (meaningless) agreement markers, (3) a strategy to this step individual learning operators are ex- bootstrap and align meaningful agreement mark- tended with alignment strategies. ers, and finally (4) a strategy that allows re-use Grammaticalization: Moving from one linguis- of existing lexical constructions as markers (gram- tic system to another is the final step of the maticalization). experiment. The challenge is to find and im- Implementing and linking together all the com- plement the mechanisms that drive grammat- ponents involved in a single system is a highly icalization (Heine and Kuteva, 2007) in line non-trivial undertaking and our methodology pre- with observed grammaticalization pathways. scribes the following four steps to undertake for each system (see also Figure 2). As an example we’ll give a short sketch of one Reconstruction: A full operationalization of all possible game as played in the meaningful marker the constructions (lexical and grammatical) strategy as schematically shown in Figure 4. The involved in the chosen linguistic phenom- sketch shows a context of four objects (O1 to O4), ena. When multiple agents are initialized each described by three features. The speaker with these constructions they should be able chooses topic O1 + O2 which, given his vocab- to communicate successfully with each other. ulary (shown top right), results in uttering “shuq- This stage serves primarily to test and verify fon sizhic zabu”. Words “shuqfon” and “sizhic” intuitions about the different linguistic sys- both describe parts of O1 and “zabu” of O2. In tems. order to explicitly communicate this linking the speaker attaches the markers “-ti” and “-ta” so that Individual Learning: Implementation of learn- their meaning is compatible with the objects they ing algorithms (or re-use of existing ones) are linking as shown in the Figure. This allows

129 Figure 5: A network of constructions. Diamond shaped nodes represent lexical constructions, egg shaped nodes represent grammatical constructions and rectangular nodes represent semantic categories. Arrows can be read as “primes”. For example the preposition between [BETWEEN.PREP] primes the category LOCATIVE RELATION which in turn primes both the [LOCATIVE RELATION] and [SPATIAL PHRASE] constructions. Both of these constructions also require a semantic category [REFERENT]. the hearer to arrive at a single non-ambiguous in- Customizable construction inventory: By de- terpretation. For more details we refer the reader fault, FCG stores all constructions in one to (Beuls and Steels, 2013) and the web demo at large set. FCG however supplies a num- http://ai.vub.ac.be/materials/plos-agreement/. ber of different taxonomies, both for concep- tual and efficiency reasons. One popular op- 5 Features of FCG tion is to organize constructions in smaller subsets (Beuls, 2011) like lexical, morpho- A number of key features of FCG have already logical, functional, etc. Another option is been introduced. Reversible bidirectional process- to use networks (Wellens, 2011) that can ing, a single data representation for all linguistic learn co-occurrence relations between con- knowledge, a reflective meta-layer architecture for structions and “prime” constructions when learning and a multi-agent component for manag- they are likely to apply (see Figure 5). ing multiple interacting FCG instances. Other fea- tures, some of which are unique to FCG, include, Interfaces to external repositories: FCG but are not limited to: can connect to external repositories like Framenet (Baker et al., 1998) and Wordnet Web interface: FCG comes with a rich HTML/AJAX based web interface (Loet- (Miller, 1995) to load thousands of lexical zsch, 2012) where it can show fine-grained entries (Micelli et al., 2009; Wellens and information to the user in a user-friendly Beule, 2010). manner through the use of expandable elements. See Figure 6. Robustness: FCG continues operation as far as it can get even if some constructions do not Customizable processing: Linguistic process- apply (Steels and van Trijp, 2011). Sup- ing is implemented as a search process plied with appropriate diagnostics and repair (Bleys et al., 2011). The user has easy strategies FCG can even recover from errors access to the most important parameters (van Trijp, 2012b). influencing this process. Examples of these are the heuristics and the tests that determine Open source: Best of all, FCG is freely down- whether a node represents an acceptable loadable and open source (http://www.fcg- solution. FCG comes bundled with a library net.org). It is written in Common Lisp of heuristics and goal tests and with a bit (CLOS) and compatible with most popu- of programming skills users can add new lar lisp implementations (SBCL, CCL, Lisp- primitives easily. works, ...).

130 Parsing "block"

Applying construction set (70) in direction

Found a solution initial sem syn expanded search tree node structure top top

application block-morph (morph t) process cxn-applied application result status cxn-applied source sem syn structure top top

applied block-morph (morph t) construction

resulting sem syn structure top top block-83 block- noun- initial lex cat (lex t) (cat t) resulting ((?form-84 form ((string block-83 "block"))) bindings (?block-unit-2 . block-83) (?top-39 . top)) added in first merge block-83

added in second block-83 merge

cxn supplier :ordered-by-label remaining labels (cat gram ) remaining cxns (right-lex speaker-lex unique-lex hearer-lex)

applied noun-cat (cat t) block-lex (lex t) block-morph (morph t) constructions resulting structure block-83 expanded unit footprints (block-lex) meaning ((bind object-class ?class-1 block)) noun-unit- sem syn noun- block- top top ref ?class-1 273 unit-273 83 sem-cat ((sem-function ((value ?sem-function-value-4) (valence (identifier)))) (class (object-class)))

Meaning: ((apply-class ?ref-2 ?src-2 ?class-1) (bind object-class ?class-1 block))

Figure 6: An example of parsing the noun “Block” as shown in the FCG web interface. Users can click on nearly every element to show an expanded version.

The reader is encouraged to take a look at Acknowledgments http://www.fcg-net.org/projects/design-patterns- in-fluid-construction-grammar for a selection of demonstrations of Fluid Construction Grammar. The FCG formalism is being developed at the Ar- tificial Intelligence Laboratory of the Vrije Uni- versiteit Brussel and the Sony Computer Science Laboratory in Paris. Pieter Wellens has been 6 Conclusion supported by the ESF EuroUnderstanding project DRUST funded by FWO and by the Vrije Uni- Fluid Construction Grammar is a mature technol- versiteit Brussel. Katrien Beuls received fund- ogy that can be used by computational linguists ing from a strategic basic research grant from the to complement more traditional corpus-based ap- agency for Innovation by Science and Technol- proaches. FCG builds on many existing and ogy (IWT). Remi van Trijp is funded by the Sony proven technologies and adds new innovations to Computer Science Laboratory Paris. We would the mix resulting in a user friendly, yet powerful also like to thank Michael Spranger for his con- and extensible framework for in-depth investiga- tributions to the FCG formalism. tions in natural language phenomena.

131 References Michael Spranger and Luc Steels. 2012. Emergent functional grammar for space. In Luc Steels, editor, Collin F. Baker, Charles J. Fillmore, and John B. Lowe. Experiments in Cultural Language Evolution, pages 1998. The Berkeley FrameNet Project. In Proceed- 207 – 232. John Benjamins, Amsterdam. ings of the 17th international conference on Compu- tational linguistics, Morristown, NJ, USA. Associa- Luc Steels and Remi van Trijp. 2011. How to make tion for Computational Linguistics. construction grammars fluid and robust. In Luc Steels, editor, Design Patterns in Fluid Construction Katrien Beuls and Luc Steels. 2013. Agent-based Grammar, pages 301–330. John Benjamins, Ams- models of strategies for the emergence and evo- terdam. lution of grammatical agreement. PLoS ONE, 8(3):e58960, 03. Luc Steels, editor. 2011. Design Patterns in Fluid Construction Grammar. John Benjamins. Katrien Beuls, Remi van Trijp, and Pieter Wellens. 2012. Diagnostics and repairs in Fluid Construc- Luc Steels, editor. 2012a. Computational Issues in tion Grammar. In Luc Steels and Manfred Hild, ed- Fluid Construction Grammar, volume 7249 of Lec- itors, Language Grounding in Robots. Springer Ver- ture Notes in Computer Science. Springer, Berlin. lag, Berlin. Luc Steels, editor. 2012b. Experiments in Cultural Katrien Beuls. 2011. Construction sets and unmarked Language Evolution. John Benjamins, Amsterdam. forms: A case study for Hungarian verbal agree- ment. In Luc Steels, editor, Design Patterns in Fluid Remi van Trijp. 2010. Grammaticalization and seman- Construction Grammar, pages 237–264. John Ben- tic maps: Evidence from artificial language evolu- jamins, Amsterdam. tion. Linguistic Discovery, 8:310–326.

Joris Bleys, Kevin Stadler, and Joachim De Beule. Remi van Trijp. 2012a. Not as awful as it seems : Ex- 2011. Search in linguistic processing. In Luc Steels, plaining german case through computational exper- editor, Design Patterns in Fluid Construction Gram- iments in fluid construction grammar. In Proceed- mar, pages 149–179. John Benjamins, Amsterdam. ings of the 13th Conference of the European Chap- ter of the Association for Computational Linguistics, Mark Davies. 2011. N-grams and word frequency pages 829–839. data from the corpus of historical american english (coha). Remi van Trijp. 2012b. A reflective architecture for language processing and learning. In Luc Steels, Kateryna Gerasymova, Michael Spranger, and Katrien editor, Computational Issues in Fluid Construction Beuls. 2012. A language strategy for aspect: En- Grammar. Springer Verlag, Berlin. coding aktionsarten through . In Luc Steels, editor, Experiments in Cultural Language Remi van Trijp. 2013. Linguistic assessment crite- Evolution, pages 257 – 276. John Benjamins. ria for explaining language change: A case study on syncretism in German definite articles. Language Bernd Heine and Tania Kuteva. 2007. The Genesis Dynamics and Change, 3(1). of Grammar: A Reconstruction. Oxford University Press, October. Pieter Wellens and Joachim De Beule. 2010. Priming through constructional dependencies: a case study Martin Loetzsch. 2012. Tools for grammar engineer- in fluid construction grammar. In The Evolution ing. In Luc Steels, editor, Computational Issues of Language ( EVOLANG8), pages 344–351. World in Fluid Construction Grammar. Springer Verlag, Scientific. Berlin. Pieter Wellens and Martin Loetzsch. 2012. Multi- V. Micelli, R. van Trijp, and J. De Beule. 2009. Fram- dimensional meanings in formation. In Luc ing fluid construction grammar. In N.A. Taatgen and Steels, editor, Experiments in Cultural Language H. van Rijn, editors, the 31th Annual Conference Evolution, pages 143–166. John Benjamins, Ams- of the Cognitive Science Society, pages 3023–3027. terdam. Cognitive Science Society. Pieter Wellens. 2011. Organizing constructions in net- George A. Miller. 1995. Wordnet: a lexical database works. In Luc Steels, editor, Design Patterns in for english. Commun. ACM, 38:39–41, November. Fluid Construction Grammar, pages 181–201. John Benjamins, Amsterdam. Simon Pauw and Joseph Hilferty. 2012. The emer- gence of quantifiers. In Luc Steels, editor, Experi- Lin Yuri, Michel Jean-Baptiste, Lieberman Aiden Erez, ments in Cultural Language Evolution, pages 277 – Orwant Jon, Brockman Will, and Slav Petrov. 2012. 304. John Benjamins. Syntactic annotations for the google books ngram corpus. In ACL (System Demonstrations). The As- Carl Pollard and Ivan A. Sag. 1994. Head-Driven sociation for Computer Linguistics. . University of Chicago Press, Chicago.

132