<<

Enhancing FreeLing Rule-Based Dependency with Frames

Marina Lloberes Irene Castellon´ Llu´ıs Padro´ U. de Barcelona U. de Barcelona U. Politecnica` de Catalunya Barcelona, Spain Barcelona, Spain Barcelona, Spain [email protected] [email protected] [email protected]

Abstract explicitly relate the improvements in parser per- formance to the addition of subcategorization. Despite the recent advances in , This paper analyses in detail how subcatego- significant efforts are needed to improve rization frames acquired from an annotated cor- the current parsers performance, such as pus and distributed among fine-grained classes in- the enhancement of the / crease accuracy in rule-based dependency gram- recognition. There is evidence that verb mars. subcategorization frames can contribute to The framework used is that of the FreeLing parser accuracy, but a number of issues re- Dependency Grammars (FDGs) for Spanish and main open. The main aim of this paper is Catalan, using enriched lexical-syntactic informa- to show how subcategorization frames ac- tion about the argument structure of the verb. quired from a syntactically annotated cor- FreeLing (Padro´ and Stanilovsky, 2012) is an pus and organized into fine-grained classes open-source library of multilingual Natural Lan- can improve the performance of two rule- guage Processing (NLP) tools that provide linguis- based dependency grammars. tic analysis for written texts. The FDGs are the core of the FreeLing dependency parser, the Txala 1 Introduction Parser (Atserias et al., 2005). Statistical parsers and rule-based parsers have ad- The remainder of this paper is organized as fol- vanced over recent years. However, significant ef- lows. Section 2 contains an overview of previous forts are required to increase the performance of work related to this research. Section 3 presents current parsers (Klein and Manning, 2003; Nivre the rule-based dependency parser used and the et al., 2006; Ballesteros and Nivre, 2012; Marimon Spanish and Catalan grammars. Section 4 de- et al., 2014). scribes the strategy followed initially to integrate One of the linguistic phenomena which parsers subcategorization into the grammars and how this often fail to handle correctly is the argu- information has been redesigned. Section 5 fo- ment/adjunct distinction (Carroll et al., 1998). For cuses on the evaluation and the analysis of sev- this reason, the main goal of this paper is to eral experiments testing versions of the grammars test empirically the accuracy of rule-based depen- including or discarding subcategorization frames. dency grammars working exclusively with syntac- Finally, the main conclusions and the further re- tic rules or adding subcategorization frames to the search goals arisen from the results of the experi- rules. ments are exposed in Section 6. A number of studies shows that subcategoriza- 2 Related Work tion frames can contribute to improve parser per- formance (Carroll et al., 1998; Zeman, 2002; Mir- There has been an extensive research on parser de- roshandel et al., 2013). Particularly, these studies velopment, and most approaches can be classified are mainly concerned with the integration of sub- as statistical or rule-based. In the former, a statis- categorization information into statistical parsers. tical model learnt from annotated or unannotated The list of studies about rule-based parsers in- texts is applied to build the syntactic tree (Klein tegrating subcategorization information is also ex- and Manning, 2003; Collins and Koo, 2005; Nivre tensive (Lin, 1998; Alsina et al., 2002; Bick, 2006; et al., 2006; Ballesteros and Nivre, 2012), whereas Calvo and Gelbukh, 2011). However, they do not the latter uses hand-built grammars to guide the

201 Proceedings of the Third International Conference on Dependency (Depling 2015), pages 201–210, Uppsala, Sweden, August 24–26 2015. parser in the construction of the tree (Sleator and formation to make a decision (Carroll et al., 1998). Temperley, 1991; Jarvinen¨ and Tapanainen, 1998; Depending on the characteristics of the parser, Lin, 1998). subcategorization assists in this task in a different Concerning the languages this study is based way. Subcategorization information can be used on, some research on Spanish has been performed to assign a probability to every possible syntactic from the perspective of Constraint tree and to rank them in parsers that perform the (Bick, 2006), Unification Grammar (Ferrandez´ whole set of possible syntactic analysis of a partic- and Moreno, 2000), -Driven Structure ular sentence (Carroll et al., 1998; Zeman, 2002; Grammar (Marimon et al., 2014), and Dependency Mirroshandel et al., 2013). Grammar for statistical parsing, both supervised In contrast, subcategorization may help to re- (Carreras et al., 2006) and semi-supervised (Calvo strict the application of certain rules. Then, when and Gelbukh, 2011). For Catalan, a rule-based the parser detects the subcategorization frame in parser based on Constraint Grammar (Alsina et the input sentence, it labels the syntactic tree ac- al., 2002) and a statistical dependency parser (Car- cording to the frame discarding any other possible reras, 2007) are available. analysis (Lin, 1998; Calvo and Gelbukh, 2011). Despite the huge achievements in the area of parsing, argument/adjunct recognition is still a lin- 3 Dependency Parsing in FreeLing guistic problem in which parsers still show low ac- curacy and in which there is still no generalized The rule-based dependency grammars presented consensus in Theoretical Linguistics (Tesniere,` in this article are the core of the Txala Parser (At- 1959; Chomsky, 1965). This phenomenon refers serias et al., 2005), the NLP module in charge to the subcategorization notion, which corre- of Dependency Parsing in the FreeLing library sponds to the definition of the type and the number (Padro´ and Stanilovsky, 2012).1 of arguments of a syntactic head. FreeLing is an open-source project that has been The acquisition of subcategorization frames developed for more than ten years. It is a com- from corpora is one of the strategies for integrat- plete NLP pipeline built on a chain of modules ing information about the argument structure into that provide a general and robust linguistic anal- a parser. Depending on the level of language anal- ysis. Among the available tools, FreeLing offers ysis of the annotated corpus, two main strategies sentence recognition, tokenization, named entity are used in automatic acquisition. recognition, tagging, chunking, dependency pars- If the acquisition is performed over a mor- ing, sense disambiguation, and coreference phosyntactically annotated text, the subcatego- resolution. rization frames are inferred by applying statisti- cal techniques on morphosyntactically annotated 3.1 Txala Parser data (Brent, 1993; Manning, 1993; Korhonen et The Txala Parser is one of the dependency pars- al., 2003). ing modules available in FreeLing. It is a rule- Alternatively, acquisition can be performed based, non-projective and multilingual depen- with syntactically annotated texts (Sarkar and Ze- dency parser that provides robust syntactic anal- man, 2000; O’Donovan et al., 2005; Aparicio et ysis in three steps. al., 2008). Subcategorization acquisition can be Txala receives the partial syntactic trees - performed straightforwardly because the informa- duced by the chunker (Civit, 2003) as input. tion about the argument structure is available in Firstly, the head-child relations are identified us- the corpus. Therefore, this approach generally fo- ing a set of heuristic rules that iteratively decide cuses on the methods for subcategorization frames whether two adjacent trees must be merged, and classification. in which way, until there is only one tree left. Sec- The final classification in a lexicon of frames ondly, it is converted into syntactic dependencies is a computational resource for several NLP tools. according to Mel’cukˇ (1988). Finally, each depen- In the framework which this research focuses on, dency arch of the tree is labelled with a syntactic the integration of the acquired subcategorization is function tag. orientated to the contribution towards building the syntactic tree when the parser has incomplete in- 1http://nlp.cs.upc.edu/freeling/

202 Rules 911 !grup-verb $$ - (sn,subord ˆPR ) Language Total Linking Labelling top left RELABEL - { } English 2961 2239 722 Spanish 4042 3310 732 Catalan 2879 2099 780 Figure 1: Linking rule for relative Galician 178 87 91 Asturian 4438 3842 596

grup-verb dobj Table 1: Sizes of the FDGs d.label=grup-sp p.class=trans d.side=right d.lemma=a|al 3.2 FreeLing Dependency Grammars d:sn.tonto=Human d:sn.tonto!=Building|Place The current version of FreeLing includes rule- based dependency grammars for English, Span- Figure 2: Labelling rule for human direct objects ish, Catalan, Galician and Asturian (see Table 1 for a brief overview of their sizes). In this pa- per, the Spanish and Catalan dependency gram- Net Semantic File, WordNet Synonyms and Hy- mars are described. pernyms and other semantic features predefined The FDGs follow the linguistic basis of syn- by the user). tactic dependencies (Tesniere,` 1959; Mel’cuk,ˇ In the rule illustrated in Figure 2, the direct ob- 1988). However, we propose a different analy- ject label (dobj) is assigned to the link between sis for prepositional (preposition-headed), a verbal head (grup-verb) and a prepositional subordinate clauses (conjunction-headed) and co- phrase (grup-sp) child when the head belongs ordinating structures (conjunction-headed). to the transitive verbs class (trans) and the child A FDG is structured as a set of manually de- is post-verbal (right), the preposition is a (or the fined rules which link two adjacent syntactic par- contraction al), and the head inside the tial trees (linking rules) and assign a syntactic prepositional phrase has the TCO feature Human function to every link of the tree (labelling rules), but not (!=) the features Building or Place according to certain conditions and priority. They (to prevent organizations from being identified as are applied based on this priority: at every step, a direct ). two adjacent partial trees will be attached or will be labelled with a syntactic function tag if their 4 CompLex-VS lexicon for Parsing rule is the highest ranked for which all the condi- tions are met. Following the hypothesis that subcategorization Linking rules can contain four kind of con- frames improve the parsing performance (Carroll ditions, regarding morphological (part-of- et al., 1998), the first version of FDGs included tag), lexical (word form, lemma), syntactic (syn- verbal and nominal frames in order to improve ar- tactic context, syntactic features of lemmas) and gument/adjunct recognition and prepositional at- semantic features (semantic properties predefined tachment (Lloberes et al., 2010). In this paper, by the user). only the verbal lexicon is presented because it is For instance, the rule shown in Figure 1 has pri- the resource used for the argument/adjunct recog- ority 911, and states that a sub-tree marked as a nition task in the grammars. subordinate (subord) whose head is a rel- ative pronoun (PR) attached as a child to the noun 4.1 Initial CompLex-VS lexicon in FDGs phrase (sn) to its left (top left) when these two The initial Computational Lexicon of Verb Sub- consecutive sub-trees are not located to the right of categorization (CompLex-VS) was automatically a (!grup-verb $$). extracted from the subcategorization frames of the Concerning the labelling rules, the set of condi- SenSem Corpus (Fernandez´ and Vazquez,` 2014), tions that the parent or the child of the dependency which contains 30231 syntactically and seman- must meet may refer to morphological (part-of- tically annotated sentences per language, and of speech tag), lexical (word form, lemma), syntac- the Volem Multilingual Lexicon (Fernandez´ et al., tic (lower/upper sub-tree nodes, syntactic features 2002), which has 1700 syntactically and semanti- of lemmas) and semantic properties (EuroWord- cally annotated verbal lemmas per language. The Net Top Concept Ontology -TCO- features, Word- patterns extracted from both resources are orga-

203 nized according to the linguistic-motivated classi- A first experimental evaluation of the Spanish Grammar with the initial subcategorization lexi- mars with the initial CompLex-VS included, the lexicon has been redesigned, proposing a set of more fine-grained subcategorization frame classes in order to represent verb subcategorization in the Figure 3: Example of the CompLex-VS dependency rules in a controlled and detailed way. New syntactic-semantic patterns have been ex- Every lexicon entry contains the syntactic func- tracted automatically from the SenSem Corpus ac- tion of every argument (fs), the grammatical cat- cording to the idea that every verbal lemma with a egory of the head of the argument (cat) and the different subcategorization frame expresses a dif- thematic role (rs). The type of construction ferent meaning. Therefore, a new lexicon entry is (e.g. active, passive, impersonal, etc.) has been created every time an annotated verbal lemma with inferred from the and aspect annotations a different frame is detected. available in the SenSem Corpus. The CompLex-VS contains 3102 syntactic pat- Two non-annotated lexical items of the sentence terns in the Spanish lexicon and 2630 patterns in have also been inserted into the subcategorization the Catalan lexicon (see Section 4.3 for detailed frame because the information that they provide numbers). They are organized into 15 subcate- is crucial for the argument structure configuration gorization frames as well as into 4 subcategoriz- (e.g. the particle ‘se’ and the lexical value of the tion classes. The lexicon is distributed in XML prepositional phrase head). format under the Creative Commons Attribution- In addition, meta-linguistic information has ShareAlike 3.0 Unported License.2 been added to every entry: a unique id and the Certain patterns have been discarded because relative frequency of the pattern in the corpus they are non-prototypical in the corpus (e.g. (freq). A threshold frequency has been estab- left dislocations), they alter the sentence order lished at 7 10 5 (Spanish) and at 8.5 10 5 (Cata- (e.g. relative clauses), or they involve controver- · − · − lan). Patterns below this threshold have been con- sial argument classes (e.g. prepositional phrases sidered marginal in the corpus and they have been seen as arguments or adjuncts depending on the discarded. context). Every pattern contains a link to the frame and As Figure 3 shows, the extracted patterns subcategorization class that they belong to (ref). () have been classified into For example, if an entry has the reference 1:1, it classes according to the whole set of argument means that the pattern corresponds to a monoargu- structures occurring in the corpus (subj for in- mental verb whose unique argument is a . transitive verbs, subj,dobj for transitive verbs, etc.). Simultaneously, frames have been organized 4.3 Integration of CompLex-VS in the FDGs in classes (monoar- From the CompLex-VS, two derived lexicons per gumental, biargumental, triargumental and quatri- language containing the verbal lemmas for every argumental). recorded pattern have been created to be integrated 2http://grial.uab.es/descarregues.php into the FDGs. The CompLex-SynF lexicon con-

204 Frames Spanish Catalan Grammar Spanish Catalan subj 203 386 Bare 450 508 subj,att 3 7 Baseline 732 780 subj,dobj 440 230 SynF 872 917 subj,iobj 37 61 SynF+Cat 869 917 subj,pobj 126 93 subj,pred 45 31 subj,attr,iobj 2 1 Table 3: Labelling rules in the evaluated grammars subj,dobj,iobj 113 72 subj,dobj,pobj 42 34 subj,dobj,pred 21 18 grammar have been created. One version contains subj,pobj,iobj 2 1 the CompLex-SynF lexicon and the other one the subj,pobj,pobj 14 9 subj,pobj,pred 1 0 CompLex-Synf+Cat. subj,pred,iobj 4 5 The old CompLex-VS lexicon classes have subj,dobj,pobj,iobj 1 0 been replaced with the new ones. Specifically, this information has been inserted in the part of the la- Table 2: CompLex-SynF lexicon in numbers belling rules about the syntactic properties of the parent node (observe p.class in Figure 2). Finally, new rules have been added for frames tains the subcategorization patterns generalized by of CompLex-SynF and CompLex-SynF+Cat that the syntactic function (Table 2). The CompLex- are not present in the old CompLex-VS lexicon. SynF+Cat lexicon collects the syntactic patterns Furthermore, some rules have been disabled for combining syntactic function and grammatical frames of the old CompLex-VS lexicon that do category (adjective/noun/prepositional phrase, in- not exist in the CompLex-SynF and CompLex- finitive/interrogative/completive clause). SynF+Cat lexicons (see Table 3 for the detailed The addition of grammatical categories makes it size of the grammars). possible to restrict the grammar rules. For exam- ple, a class of verbs containing the verb quedarse 5 Evaluation (‘to get’) whose argument is a predicative and a prepositional phrase allows the rules to identify An evaluation task has been carried out to test that the prepositional phrase of the sentence Se empirically how the FDGs performance changes ha quedado de piedra (‘[He/She] got shocked’) when subcategorization information is added or is a predicative argument. Furthermore, it allows subtracted. Several versions of the grammars have for discarding the prepositional phrase of the sen- been tested using a controlled annotated linguistic tence Aparece de madrugada (‘[He/She] shows up data set. at late night’) being a predicative argument, al- This evaluation specifically focuses on though aparecer belongs to the class of predicative analysing the results of the experiments qualita- verbs but conveying a as argument. tively. This kind of analysis makes it possible to track the decisions that the parser has made, so While in the CompLex-SynF lexicon the infor- that it is possible to provide an explanation about mation is more compacted (1054 syntactic pat- the accuracy of the FDGs running with different terns classified in 15 frames), in the CompLex- linguistic information. SynF+Cat lexicon the classes are more granular (1356 syntactic patterns organized in 77 frames). 5.1 Experiments Only subcategorization patterns corresponding Four versions of both Spanish and Catalan gram- to lexicon entries referring to the active voice mars are tested in order to assess the differences of have been integrated in the FDGs, since they in- the performance depending on the linguistic infor- volve non-marked word order. Both lexicons also mation added. exclude information about the thematic role, al- though they take into account the value of the head Bare FDG. A version of the FDGs running • (if the frame contains a prepositional argument) without subcategorization frames. and the pronominal verbs (lexical entries that ac- Baseline FDG. A version of the FDGs run- cept ‘se’ particle whose value neither is reflexive • ning with the old CompLex-VS lexicon. nor reciprocal). Two versions of the Spanish dependency gram- SynF FDG. A version of the FDGs running • mar and two versions of the Catalan dependency with the CompLex-SynF lexicon.

205 Tag SenSem ParTes SenSem ParTes Tag Description Spanish Spanish Catalan Catalan adjt Adjunct subj 42.23 34.03 43.03 28.08 attr Attribute dobj 35.77 29.86 34.64 34.25 dobj Direct Object pobj 16.73 13.89 16.56 17.12 iobj Indirect Object iobj 4.64 6.25 4.70 2.05 pobj Prepositional Object pred 0.49 2.08 0.51 0.68 pred Predicative attr 0.14 13.89 0.56 17.81 subj Subject

Table 4: Comparison of the labelling tags distribu- Table 5: Tagset of syntactic functions related to tion in SenSem and ParTes (%) the subcategorization

SynF+Cat FDG. A version of the FDGs run- The metrics used to evaluate the performance of • ning with the CompLex-Synf+Cat lexicon. the several FDGs versions are the following ones:

3 Since this research is focused on the implemen- Accuracy tation of subcategorization information for argu- LAS = correct attachments and labellings ment/adjunct recognition, only the labelling rules total tokens are discussed in this paper (Table 3). However, correct attachments UAS = total tokens metrics related to linking rules are also mentioned correct labellings to provide a general description of the FDGs. LAS2 = total tokens

5.2 Evaluation data Precision system correct tokens To perform a qualitative evaluation, the ParTes test P = system tokens suite has been used (Lloberes et al., 2014). This resource is a multilingual hierarchical test suite Recall of a representative and controlled set of syntactic R = system correct tokens phenomena which has been developed for evalu- gold tokens ating the parsing performance as regards syntactic Both quantitative and qualitative analysis de- structure and word order. tailed in Section 5.4 pay special attention to the It contains 161 syntactic phenomena in Span- metric LAS2, which informs about the number of ish (99 referring to structure and 62 to word order) heads with the correct syntactic function tag. and 147 syntactic phenomena in Catalan (101 cor- Precision and recall metrics of the labelling responding to structure phenomena and 46 to word rules provide information about how the addi- order). tion of verbal subcategorization information con- The current version of ParTes is distributed with tributes to the grammar performance. For this rea- an annotated data set in the CoNLL format. Al- son, in the qualitative analysis, only labelling syn- though this data set is not initially developed for tactic function tags directly related to verbal sub- evaluating the argument/adjunct recognition, the categorization are considered (Table 5). number of arguments and adjuncts contained in ParTes is proportional to the number of arguments 5.4 Accuracy results and adjuncts of the SenSem Corpus (Table 4). The global results of the FDGs evaluation (LAS) Therefore, the ParTes data set is a reduced sam- show that the whole set of evaluated grammars ple of the linguistic phenomena that occur in a score over 80% accuracy in Spanish (Table 6) and larger corpus, which makes ParTes an appropriate around 80% in Catalan (Table 7). resource for this task. In the four Spanish grammar versions (Table 6), the correct head (UAS) has been identified in 5.3 Evaluation metrics 90.01% of the cases. On the other hand, the The metrics have been computed using the tendency changes in syntactic function labelling CoNLL-X Shared Task 2007 script (Nivre et al., (LAS2). The Baseline establishes that 85.54% 2007). The output of the FDGs (system output) of tokens have the correct syntactic function tag. has been compared to the ParTes annotated data 3LAS: Labeled Attachment Score; UAS: Unlabeled At- set (gold standard). tachment Score; LAS2: Label Accuracy

206 Grammar LAS UAS LAS2 root Bare 81.37 90.01 82.86 subj Baseline 83.76 90.01 85.54 mod SynF 84.50 90.01 86.29 SynF+Cat 84.50 90.01 86.29 adjt comp spec spec attr Table 6: Accuracy scores (%) in Spanish * La herramienta con la que trabajan es gratuita Grammar LAS UAS LAS2 The tool with Ø which work-3P is free Bare 78.99 86.84 81.91 Baseline 79.52 86.84 82.85 SynF 81.78 86.84 85.24 root SynF+Cat 81.78 86.84 85.24 subj mod pobj Table 7: Accuracy scores (%) in Catalan comp spec spec attr

However, Bare drops 2.68 scores and SynF and * La herramienta con la que trabajan es gratuita SynF+Cat improve 0.75 scores with respect to the The tool with Ø which work-3P is free baseline. A parallel behaviour is observed in Catalan, al- Figure 4: Example of bare FDGs wrongly la- though the scores are slightly lower than in Span- belling a pobj as adjt (above) and of SynF FDGs ish (Table 7). The four Catalan grammars score correctly labelling it (below) 86.84% in attachment (UAS). The Baseline scores Tag Bare Baseline SynF SynF+Cat 82.85% in syntactic function assignment (LAS2). adjt 59.26 70.27 61.54 61.54 Once again FDGs perform worse without subcat- attr 84.21 71.43 71.43 71.43 egorization information (0.94 points less in Bare dobj 78.26 85.71 87.80 87.80 iobj 100.00 100.00 100.00 100.00 grammar) and better with subcategorization infor- pobj 42.50 77.27 94.74 94.74 mation (2.39 points more in SynF and SynF+Cat). pred 12.50 0.00 33.33 33.33 From a general point of view, accuracy met- subj 90.24 90.91 91.11 91.11 rics show a medium-high accuracy performance of all versions of FDGs in both languages. Specif- Table 8: Labelling precision scores (%) in Spanish ically, these first results highlight that subcatego- rization information helps with the syntactic func- ure 4 shows this dichotomy. tion labelling. However, qualitative results will re- Despite these improvements, some items differ veal how subcategorization influences the gram- from the general tendency. mar performance (Sections 5.5 and 5.6). In Spanish, the improvement of the copulative verbs (attr) is due to lexical information in the 5.5 Precision results Bare FDG, while they keep stable in SynF and As observed in the quantitative analysis (Sec- SynF+Cat. Precision remains the same in the indi- tion 5.4), in both languages most of the syntac- rect object (iobj) because morphological informa- tic function assignments drop in precision when tion is enough to detect dative in singular. subcategorization classes are blocked in the gram- The performance of predicative (pred) in all the mar (Tables 8 and 9), whereas syntactic function grammars is related to the lack or addition of sub- labelling tends to improve when subcategorization categorization. The Baseline FDG subcategoriza- is available. tion classes do not include the same set of verbs as For example, the precision of the prepositional in the evaluation data. For this reason, a generic object (pobj) in both languages drops drastically rule for capturing predicatives (Bare FDG) covers when subcategorization is disabled (Bare). On the lack of verbs in a few cases. The improve- the contrary, the precision improves significantly ment of the coverage with new verbs (SynF and when the rules include subcategorization infor- SynF+Cat) shows an increment of the precision. mation (Baseline). Furthermore, the introduc- Adjunct (adjt) recognition drops for misla- tion of more fine-grained frames helps the gram- bellings with predicative because of the ambiguity mars reach a precision of 94.74% in Spanish and between the participle clause expressing time and 94.12% in Catalan (SynF and SynF+Cat). Fig- a true predicative complement.

207 Tag Bare Baseline SynF SynF+Cat Tag Bare Baseline SynF SynF+Cat adjt 60.71 61.76 62.50 62.50 adjt 57.14 92.86 85.71 85.71 attr 95.65 82.14 95.83 95.83 attr 80.00 100.00 100.00 100.00 dobj 75.00 83.33 84.78 82.98 dobj 83.72 83.72 83.72 83.72 iobj 100.00 100.00 100.00 100.00 iobj 33.33 33.33 44.44 44.44 pobj 61.29 66.67 94.12 94.12 pobj 85.00 85.00 90.00 90.00 pred 50.00 0.00 100.00 100.00 pred 33.33 0.00 33.33 33.33 subj 72.50 71.43 73.81 73.81 subj 75.51 81.63 83.67 83.67

Table 9: Labelling precision scores (%) in Catalan Table 10: Labelling recall scores (%) in Spanish

Tag Bare Baseline SynF SynF+Cat FDGs in Catalan show a parallel behaviour to adjt 50.00 61.76 73.53 73.53 that in Spanish, but they follow the general ten- attr 84.62 88.46 88.46 88.46 dobj 72.00 80.00 78.00 78.00 dency in more cases. SynF and SynF+Cat in- iobj 33.33 33.33 33.33 33.33 crease the precision in all the cases, except for pobj 76.00 56.00 64.00 64.00 pred 100.00 0.00 100.00 100.00 the direct object (dobj) in SynF+Cat. Once more subj 70.73 73.17 75.61 75.61 the prepositional object (pobj) performance raises when subcategorization frames are available. Table 11: Labelling recall scores (%) in Catalan Although a drop in all the cases in the Bare FDG is expected, the attribute (attr) and the predicative (pred) increase the precision because of the same more direct objects. Lower results are due to some reasons as the Spanish grammars. verbs missing. The results of SynF and SynF+Cat are almost Once again there are no significant differences identical. The analysis of their outputs shows between SynF and SynF+Cat, which reinforces that more fine-grained subcategorization classes the idea that grammatical categories do not pro- including grammatical categories do not have a vide new information for capturing new argument contribution to the precision improvement. and adjuncts.

5.6 Recall results 5.7 Analysis of the results The addition of subcategorization information in The whole set of experiments demonstrate that the FDGs also contributes to the improvement, al- subcategorization improves significantly the per- most in all the cases, in Spanish as well as in Cata- formance of the rule-based FDGs. lan (Tables 10 and 11). The use of FDGs without However, some arguments, such as the prepo- subcategorization involves a decrease in the recall sitional object and the predicative, are difficult most of times. to capture without subcategorization information. In Spanish, the Baseline grammar contains very Meanwhile, there are others, such as the attribute, generic rules to capture adjuncts and more fine- that do not need to be handled with subcategoriza- grained subcategorization classes restrict these tion classes. rules. For this reason, the recall slightly drops in Proper subcategorization information also con- SynF and SynF+Cat. As observed in the preci- tributes to capture more arguments and adjuncts. sion metric (Section 5.5), small populated classes The recall scores are stable among the grammars related to predicative arguments make recall drop that use subcategorization information. Secondly, in the baseline. Consequently, generic rules for most of these scores are medium-high precision. predicative labelling in the Bare grammar and Overall, the results show that the new better populated predicative classes in SynF and CompLex-VS is a suitable resource to improve the SynF+Cat allows a recovery in recall. performance of rule-based dependency grammars. FDGs in Catalan show a similar tendency. In The classification of frames proposed is coher- the Bare grammar, prepositional objects and pred- ent with the methodology. Furhtermore, it is an icatives are better captured than in the baseline be- essential resource for the grammars tested since it cause the lack of subcategorization information al- ensures medium-high precision results (compared lows rules to apply in a more irrestrictive way. On to medium precision results in the FDGs using the the other hand, the addition of subcategorization old CompLex-VS). It is important to consider the information does not seem to help with capturing kind of information to define the subcategorization

208 classes because it can be redundant, such as the Acknowledgments combination of syntactic function and grammati- This research arises from the research project cal category. SKATeR (Spanish Ministry of Economy and The CompLex-VS lexicon still needs the inclu- Competitiveness, TIN2012-38584-C06-06 and sion of new verbs, since some arguments for verbs TIN2012-38584-C06-01). missing in the lexicon are not captured properly.

References 6 Conclusions L. Alonso, I. Castellon,´ and N. Tincheva. 2007. Ob- taining coarse-grained classes of subcategorization This paper presented two rule-based dependency patterns for Spanish. In Proceedings of the Interna- grammars in Spanish and Catalan for the FreeLing tional Conference Recent Advances in Natural Lan- NLP library. guage Processing. Besides the grammars, a new subcategoriza- A.` Alsina, T. Badia, G. Boleda, S. Bott, A.` Gil, tion lexicon, CompLex-VS, has been designed us- M. Quixal, and O. Valentn. 2002. CATCG: Un sistema de analisis´ morfosintactico´ para el catalan.´ ing frames acquired from the SenSem Corpus. Procesamiento del Lenguaje Natural, 29. The new frames have been integrated in the argu- ment/adjunct recognition rules of the FDGs. J. Aparicio, M. Taule,´ and M.A. Mart´ı. 2008. AnCora- Verb: A Lexical Resource for the Semantic Anno- A set of experiments has been carried out to test tation of Corpora. In Proceedings of the Sixth In- how the subcategorization information improves ternational Conference on Language Resources and the performance of these grammars. Evaluation. The results show that subcategorization frames J. Atserias, E. Comelles, and A. Mayor. 2005. TXALA ensure a high accuracy performance. In most un analizador libre de dependencias para el castel- lano. Procesamiento del Lenguaje Natural, 35. cases, the old CompLex-VS frames and the new CompLex-VS frames show an improvement. M. Ballesteros and J. Nivre. 2012. MaltOptimizer: A System for MaltParser Optimization. In Proceed- However, the increment is more evident in some ings of the Eight International Conference on Lan- arguments –such as the prepositional object and guage Resources and Evaluation. the predicative– than others, like the complement E. Bick. 2006. A Constraint Grammar-Based Parser in attributive verbs. These results indicate that for Spanish. In Proceedings of TIL 2006 - 4th Work- some arguments necessarily need subcategoriza- shop on Information and Human Language Technol- tion information to be disambiguated, while others ogy. can be disambiguated just with syntactic informa- M.R. Brent. 1993. From Grammar to Lexicon: Un- tion. supervised Learning of Lexical . Computa- Furthermore, the new frames of CompLex- tional Linguistics, 19(2). VS provide better results than the initial ones. H. Calvo and A. Gelbukh. 2011. DILUCT: Analisis´ Therefore, more fine-grained frames (CompLex- Sintactico´ Semisupervisado Para El Espanol˜ . Edi- SynF) contribute to raise the accuracy. Despite torial Academica Espanola. this evidence, fine-grained classes do not neces- X. Carreras, M. Surdeanu, and L. Marquez.` 2006. sarily mean improvement of the parser perfor- Projective Dependency Parsing with Perceptron. In mance. The most fine-grained lexicon (CompLex- Proceedings of the Tenth Conference on Computa- tional Natural Language Learning SynF+Cat), which combines syntactic function . and grammatical category information, neither im- X. Carreras. 2007. Experiments with a Higher-Order proves nor worsens the results of the FDGs. Projective Dependency Parser. In Proceedings of the CoNLL Shared Task Session of EMNLP-CoNLL These conclusions are built on a small set of test 2007. data. Although it is a controlled and representative evaluation data set, these results need to be con- J. Carroll, G. Minnen, and T. Briscoe. 1998. Can Subcategorisation Probabilities Help a Statistical trasted with a larger evaluation data set. Parser? In Proceedings of the 6th ACL/SIGDAT It would be interesting to evaluate how the pars- Workshop on Very Large Corpora. ing performance improves while subcategoriza- N. Chomsky. 1965. Aspects of the Theory of Syntax. tion information is added incrementally. MIT Press.

209 M. Civit. 2003. Criterios de etiquetacion´ y desam- I.A. Mel’cuk.ˇ 1988. Dependency Syntax: Theory and biguacion´ morfosintactica´ de corpus en espanol.˜ In Practice. State U. Press of NY. Coleccion´ de Monograf´ıas de la Sociedad Espanola˜ para el Procesamiento del Lenguaje Natural: 8. So- S.A. Mirroshandel, A. Nasr, and B. Sagot. 2013. En- ciedad Espanola˜ para el Procesamiento del Lenguaje forcing Subcategorization Constraints in a Parser Natural. Using Sub-parses Recombining. In NAACL 2013 - Conference of the North American Chapter of the M. Collins and T. Koo. 2005. Discriminative Rerank- Association for Computational Linguistics. ing for Natural Language Parsing. Computational Linguistics, 31(1). J. Nivre, J. Hall, J. Nilsson, G. Eryigit,ˇ and S. Marinov. 2006. Labeled Pseudo-projective Dependency Pars- A. Fernandez´ and G. Vazquez.` 2014. The SenSem ing with Support Vector Machines. In Proceedings Corpus: an annotated corpus for Spanish and Cata- of the Tenth Conference on Computational Natural lan with information about aspectuality, modality, Language Learning. polarity and factuality. and Lin- guistic Theory, 10(2). J. Nivre, J. Hall, S. Kubler,¨ R. McDonald, J. Nilsson, S. Riedel, and D. Yuret. 2007. The CoNLL 2007 A. Fernandez,´ G. Vazquez, P. Saint-Dizier, F. Bena- Shared Task on Dependency Parsing. In Proceed- mara, and M. Kamel. 2002. The VOLEM Project: ings of the CoNLL Shared Task Session of EMNLP- A Framework for the Construction of Advanced CoNLL 2007. Multilingual Lexicons. In Proceedings of the Lan- guage Engineering Conference. R. O’Donovan, M. Burke, A. Cahill, J. Van Genabith, and A. Way. 2005. Large-Scale Induction and A. Ferrandez´ and L. Moreno. 2000. Slot Unification Evaluation of Lexical Resources from the Penn-II Grammar and Anaphora Resolution. In N. Nicolov and Penn-III . Computational Linguistics, and R. Mitkov, editors, Recent Advances in Natu- 31(3). ral Language Processing II. Selected papers from RANLP 1997. John Benjamins Publishing Co. L. Padro´ and E. Stanilovsky. 2012. Freeling 3.0: Towards wider multilinguality. In Proceedings of T. Jarvinen¨ and P. Tapanainen. 1998. Towards an im- the Eight International Conference on Language Re- plementable dependency grammar. In Proceedings sources and Evaluation. of Workshop on Processing of Dependence-Based Grammars, CoLing-ACL’98. A. Sarkar and D. Zeman. 2000. Automatic Extraction of Subcategorization Frames for Czech. In Proceed- D. Klein and C.D. Manning. 2003. Accurate Unlexi- ings of the 18th Conference on Computational Lin- calized Parsing. In Proceedings of the 41st Annual guistics - Volume 2. Meeting on Association for Computational Linguis- tics - Volume 1. D. Sleator and D. Temperley. 1991. Parsing English with a . In Third International Work- A. Korhonen, Y. Krymolowski, and Z. Marx. 2003. shop on Parsing Technologies. Clustering Polysemic Subcategorization Frame Dis- tributions Semantically. In Proceedings of the 41st L. Tesniere.` 1959. Elements´ de syntaxe structurale. Annual Meeting of the Association for Computa- Klincksieck. tional Linguistics. D. Zeman. 2002. Can Subcategorization Help a Sta- D. Lin. 1998. Dependency-Based Evaluation of MINI- tistical Dependency Parser? In 19th International PAR. In Workshop on the Evaluation of Parsing Sys- Conference on Computational Linguistics. tems, First International Conference on Language Resources and Evaluation. M. Lloberes, I. Castellon,´ and L. Padro.´ 2010. Spanish FreeLing Dependency Grammar. In Proceedings of the Seventh Conference on International Language Resources and Evaluation. M. Lloberes, I. Castellon,´ L. Padro,´ and E. Gonzalez.` 2014. ParTes. Test Suite for Parsing Evaluation. Procesamiento del Lenguaje Natural, 53. C.D. Manning. 1993. Automatic Acquisition of a Large Subcategorization Dictionary from Corpora. In Proceedings of the 31st Annual Meeting of the Association for Computational Linguistics. M. Marimon, N. Bel, and L. Padro.´ 2014. Automatic Selection of HPSG-parsed Sentences for Construction. Computational Linguistics, 40(3).

210