CoMAGD: Annotation of -Depression Relations

Rize Jin1 Jinseon You1 Jin-Woo Chung1 Hee-Jin Lee1 Maria Wolters2 Jong C. Park1* 1School of Computing Korea Advanced Institute of Science and Technology 291 Daehak-ro, Daejeon, Republic of Korea {rizejin, jsyou, jwchung, heejin, park}@nlp.kaist.ac.kr 2School of Informatics University of Edinburgh Edinburgh, UK [email protected]

depression, with consequent rapid accumulation Abstract of candidate (Kao et al., 2011; Piñero et al., 2015). However, the accumulated information is Clinical depression is a mental disorder not yet comprehensive enough to explain the role involving genetics and environmental of genes involved in depression. factors. Although much work studied its genetic causes and numerous candidate DisGeNET (Piñero et al., 2015) is a platform genes have consequently been looked into for discovering associations of genes and complex and reported in the biomedical literature, diseases including depression, defining gene-de- no gene expression changes or mutations pression relations as simple binary relations that regarding depression have yet been consist of geneId, geneSymbol, geneName, dis- adequately collected and analyzed for its easeId, diseaseName, and score, where the score full pathophysiology. In this paper, we is a measure of relevancy based on the supporting present a depression-specific annotated evidence. DEPgenes (Kao et al., 2011) gives a pri- corpus for text mining systems that target oritizing system that uses combined score to rank at providing a concise review of candidate genes for depression. Although depression-gene relations, as well as DEPgenes is a nearly comprehensive candidate capturing complex biological events such gene resource for depression in terms of its vol- as gene expression changes. We describe ume (5,055 candidate genes), its representation the annotation scheme and the conducted concepts are even simpler than DisGeNET and annotation procedure in detail. We discuss thus not quite adequate for the full understanding issues regarding proper recognition of of depression-related phenomena. depression terms and entity interactions In order to fully understand how a particular for future approaches to the task. The gene acts in depression, we need detailed infor- corpus is available at mation about gene expression changes or muta- http://www.biopathway.org/CoMAGD. tions and also how the depression level is changed along with the change in the gene. In this regard, 1 Introduction we anticipate that text mining systems, which can identify and analyze both genes and depression Clinical depression, or major depressive disorder, changes comprehensively from text, would facili- is a mental disorder of the central nervous system tate research on depression much further. Further- with a pathophysiology involving the neocortex. more, if the mined information is annotated and Genetics and environmental factors are known to then made available for reuse, key resources contribute to the development of mood disorders would be identified and constructed more effec- (Nestler et al., 2002). Many biomedical research tively (McDonald and Kelly, 2012; Winnenburg efforts studied the causative factors of genetics in

* Corresponding author

104 Proceedings of the 2015 Workshop on Biomedical Natural Language Processing (BioNLP 2015), pages 104–113, Beijing, China, July 30, 2015. c 2015 Association for Computational Linguistics et al., 2008). Such effort of making relevant cor- were then edited by a domain expert in mental pora has already been made in the studies of genes health. (Kim et al., 2008; Poux et al., 2014) and of com- plex diseases such as cancers (Lee et al., 2013; For the sentences that contain more than one Lee et al., 2014; Pyysalo et al., 2013), but has not pair, we made their copies, matching the number of depression-gene pairs. We call each of these yet been applied to depression. copies a co-occurrence. For example, if there are In this paper, we present a depression-specific three gene names and two depression related annotated corpus, CoMAGD, for future text min- terms in a sentence, the system makes six co-oc- ing systems that target specifically at providing currences for this sentence. comprehensive information of depression-gene relations as well as capturing complex infor- We then tokenized, part-of-speech tagged, and mation such as gene changes and biological parsed the co-occurrences, using the Charniak- Johnson parser (Charniak and Johnson, 2005) events. For this purpose, we follow a multi-fac- eted annotation scheme for cancers (Lee et al., with a biomedical parsing model (McClosky, 2010). The resulting phrase structures were then 2013) while tuning it extensively to depression. In this revised scheme, a piece of annotation is com- converted into dependency structures with the posed of four concepts that together express two Stanford conversion tool (Marneffe et al., 2006). We identified mentions of gene expression events, gene expression changes and depression level or antidepressant effect changes, and the re- changes, using the Turku event extraction system (Björne et al., 2009). Most of the processes above lationship between these two events. We antici- pate that the present corpus and text mined results are included in a preprocessed dataset, or EVEX based on this corpus would contribute meaning- (Landeghem et al., 2012); however, we modified the system and utilized some part of the system fully to the successful exploration of the underly- ing functional correlation between genes and clin- separately where necessary. ical depression. Finally, we performed manual work to validate The rest of the paper is organized as follows. automatically identified co-occurrences in order Section 2 shows the corpus annotation. Section 3 to produce confirmed annotation units, such as manually constructing predicates (i.e., ‘depres- gives details of inter-annotator agreement. Section 4 discusses issues about proper sion of [non-human subjects]’) to filter out false positives from the dictionary matching outputs of recognition of depression terms and entity interactions for future approaches to the task, depression-related terms and manually eliminat- before closing the paper in Section 5. ing false relations (hypothesis sentences). 2.2 A multi-faceted annotation scheme 2 Corpus Annotation We modify a multi-faceted annotation scheme of 2.1 Data collection and pre-processing (Lee et al., 2013), originally designed to represent We collected PubMed IDs (PMIDs) that contain ternary relations among genes, cancers and gene depression related terms in any of the three fields changes, in order to address relations not only be- title, abstract, and keyword, using the query “de- tween depression and genes, but also between an- press* OR dysthymia OR cyclothymia”, and ran- tidepressants and genes, so as to provide more de- domly selected 500 abstracts among them. The tails and enable further insights for follow-up 500 abstracts were then segmented into sentences. studies such as prioritizing depression candidate genes and designing effective treatments and ther- We extracted only the sentences that contain at apy. For example, one may assign a lower weight least one pair of gene and depression/antidepres- to a gene if the gene shows expression changes sant related terms. BANNER (Leaman and Gon- only in antidepressant studies. We also introduce zalez, 2008) and Moara (Neves et al., 2010) were directed causal relations between genes and de- used to identify and normalize gene names. For pression/antidepressants. Identification of the depression and antidepressant terms, the system cause and effect not only reflects the methodolo- used dictionary-based longest matching. The dic- gies of individual studies, but also provides the tionary consists of 303 entries of depression and facts. While the undirected causality claim usually antidepressant related terms collected from NCI is interpreted as a necessary and sufficient clause, Thesaurus and other relevant articles. The entries we find that it could result in false conclusions,

105 Concept Value Definition

Change in Gene Expression increased Expression level of the gene is increased (CGE) decreased Expression level of the gene is decreased The depression level/antidepressant effect is increased as Change in Depression Level increased CGE (CDL) The depression level/antidepressant effect is decreased as or decreased CGE Change in Antidepressant The information about whether or not CGE accompanies Effect unidentifiable the depression level/antidepressant effect change is not (CAE) provided CGE accompanied by CDL/CAE is reported but the cau- none Causality Claim sality between the two is not claimed (CC) g2x The causality is claimed as CGE causes CDL/CAE x2g The causality is claimed as CDL/CAE causes CGE

Table 1: Annotation concept values and their definitions

Sentence CGE CDL CC Example 1. In particular, we found decreased NF-L, PSD95, and SAP102 tran-

scripts in bipolar disorder, and [decreased]e [SAP102]g levels in [major dec. uni. non.

depression]d. [PMID: 15054476] Example 2. In conclusion, chronic forced swim stress was a good animal model of [depression] , and it induced depressive-like behavior and [de- d dec. inc. x2g creased]e [P-Erk2]g in the hippocampus and prefrontal cortex in rats. [PMID: 17050000] Sentence CGE CAE CC

Example 3. [Fluoxetine]a substantially [inhibits]e [CYP2D6]g and probably CYP2C9/10, moderately inhibits CYP2C19 and mildly inhibits dec. uni. x2g CYP3A3/4. [PMID: 9068931] Example 4. [Inhibition] of [neuronal nitric oxide synthase] in the rat hippo- e g dec. inc. g2x campus induces [antidepressant-like]a effects. [PMID: 9068931] Gene names, depression related terms, antidepressant related terms, and the keywords for gene expression change are noted in matching square brackets and marked with subscripts ‘g’, ‘d’, ‘a’, and ‘e’, respectively. Table 2: Examples of annotated co-occurrences especially in the studies of depression. For exam- location information as well, as studies show that ple, depression may decrease the expression level genes may respond differently to the same antide- of a particular gene; however, increasing the ex- pressant if they are in different parts of a body. pression level of that gene may not necessarily re- More details will be discussed in Section 4. duce the symptom. One reason is that the genetic factor is not the only cause of depression. It is also 2.3 Annotation concept believed that, compared to oncogenesis, much The proposed corpus contains four core annota- more genes act together and render a person to be- tion concepts: Change in Gene Expression (CGE), come vulnerable to depression (Belmaker and Change in Depression Level (CDL), Change in Agam, 2008). As such, a more fine-grained anno- Antidepressant Effect (CAE), and Causality tation of causal directions will be essential for Claim (CC). CGE captures whether the expres- more complex diseases such as depression. In an sion level of a gene is ‘increased’ or ‘decreased’. answer to these needs, we use a flexible schema CDL/CAE captures the way how the depression for annotating concepts and ever-changing met- level/antidepressant effect changes together with and facts in genetic studies of depression. The a gene expression level change. If information flexibility would allow the schema to exploit the about such changes is not provided in the sentence,

106 ]>

Table 3: The XML DTD of the corpus we assign ‘unidentifiable’. CC captures whether that the four annotation concepts are semantically the causality between the gene expression change orthogonal, in that the value of a concept can be and the CDL/CAE is claimed in the sentence or identified without knowing the values of the other not, with values ‘none’, ‘x2g’, and ‘g2x’. Each concepts. concept is assigned with one of the pre-specified values to complete a facet of annotation. Table 1 2.4 Corpus statistics shows the pre-specified values and the definitions The corpus consists of 210 annotation units, of the respective values. Three of the four con- where an annotation unit is simply a mention of cepts together complete a piece of annotation that gene expression change that co-occurs with at express information about a gene’s expression least one depression or antidepressant related term level change with a change in depression level or in a sentence. These annotation units are derived antidepressant effect. from 106 different sentences, which in turn are ex- tracted from 73 PubMed abstracts. The corpus Table 2 shows examples of the annotated sen- contains 82 gene types, 5 depression terms, and 20 tences and Table 3 shows the DTD schema of the antidepressant terms (cf. Table 4). corpus. As mentioned earlier, we collected sen- tences from PubMed that describe gene expres- Tables 5 and 6 show the distribution of annota- sion changes in depression/antidepressants. Each tion concept values and the distribution of the an- sentence was presented to the annotators as one or notated genes, respectively. The values of CGE more copies with markings for a gene term, key- show a uniform distribution, whereas the others words for gene expression change, and a depres- show skewed distributions. In particular, for val- sion/antidepressant-related term. The annotators ues of CDL/CAE, ‘unidentifiable’ is frequently read the sentence with such markings and selected chosen (89% for CDL, 87% for CAE). The value proper values for the annotation concepts. Note distribution of the concept CC associated with

107 CAE also exhibits dominance of a single value, or Type Count ‘x2g’. We compared the genes in our corpus with Depression 48 previous studies: 58% (48) and 95% (79) of our annotated genes (83) are included in DisGeNET Major depression 17 and DEPgenes, respectively. Note that DEPgenes Depress. Bipolar disorder 14 only published 169 core genes that exhibit a high Dysthymia 14 chance to be associated with depression from Mood disorder 4 5,055 candidate genes. Antidepressant 47 3 Inter-annotator agreement Fluoxetine 31 Electroconvulsive therapy 4 We annotated the sentence units through two Imipramine 4 main annotation phases (cf. Table 7) and revised Mirtazapine 4 annotation guidelines after each annotation phase. Table 8 shows the IAA values obtained from each Citalopram 3 annotation phase as well as from the whole cor- Escitalopram 3 pus. We measured IAAs in three different ways, Trazodone 3 using simple IAA (the proportion of annotations Lithium 2 in common between two annotators over the total SSRI 2 Antidep. number of annotations provided by either annota- Carbamazepine 1 tor), Cohen’s kappa, and G-index. IAA values from the final phase show that adequate agree- Chlorpromazine 1 ment among the annotators is achieved. The over- Fluvoxamine 1 all IAA values, obtained from the whole corpus, Haloperidol 1 also suggest internal consistency. We resolved all Papaverine 1 disagreements in the published corpus. Perphenazine 1 3.1 Disagreements Quetiapine 1 Reboxetine 1 We identify the following as the major sources for conflicts between the annotators: simple mistakes, Sertraline 1 subjective readings, the use of reasoning, and the Venlafaxine 1 judgements by using prior knowledge. Disagree- Table 4: Statistics of depression/antidepressant re- ment rate is greatly reduced in the second annota- lated terms tion phase, as we revised the guidelines after the completion of the first phase. to CC, but the other interpreted the word as having Simple mistakes are inevitable in manual anno- its literal meaning and assigned ‘none’ to CC. Af- tations, contributing a small number of conflicts ter annotator meeting, the annotators agreed to in- to all the four annotation concepts. In detail, sim- clude instructions on such subjectivity issues in ple mistakes take up 1% (1 out of 142), 8% (11 the annotation guidelines, and the IAA values on out of 142), and 24% (34 out of 142) of the disa- CC show significant improvement in the second greements on CGE, CDL/CAE, and CC values, annotation phase. Subjective readings induce dis- respectively, in Phase 1, and 9% (6 out of 67), 0% agreements on CAE values as well. (0 out of 67), and 3% (2 out of 67) in Phase 2. Example 6. BACKGROUND: Indirect evi- Disagreements also arise from subjective read- dence suggests that loss of brain-derived neu- ings, contributing to most of the disagreements on rotrophic factor (BDNF) from forebrain regions CC values. contributes to an individual's vulnerability for depression, whereas [upregulation]e of [BDNF]g Example 5. [CRF]g is [increased]e during anxi- in these regions is suggested to mediate the ther- ety, [depression]d and pain as well as functional apeutic effect of [antidepressants]a. [PMID: disorders of the pelvic viscera. [PMID: 16697351] 15538210] For the annotation unit in Example 6, one an- For the annotation unit above, one annotator notator interpreted the verb ‘mediate’ as convey- subjectively interpreted the preposition ‘during’ ing the meaning of ‘positive regulation’ and as- as implying a causal relation and assigned ‘x2g’

108 CGE CDL/CAE CC

Inc. Dec. Inc. Dec. Uni. Non. g2x x2g Depress. 54(56%) 43(44%) 4(4%) 7(7%) 86(89%) 56(58%) 8(8%) 33(34%) Antidep. 61(54%) 52(46%) 15(13%) 1(1%) 97(86%) 1(1%) 9(8%) 103(91%) Total 115(55%) 95(45%) 19(9%) 8(4%) 183(87%) 57(27%) 17(8%) 138(65%)

Table 5: Distribution of the annotation concept values

Gene

inc. dec. inc. PRKCAd, MAPK3d, MAPK1d ALB, TNFd,p, IL2d, IL1Bd,p, MAPK1d dec. MAPK1d, BDNFd,p, LEPd, SLC6A4d,p DLG4, NEFLd, DLG3, GFAPd,p, AVPd, PDLIM5d,p, CRHd,p, IL6d,p, CAMK2Ap, Depress. ESR1d,p, NR3C1d,p, TRP, CRHR1d, CAMK2B, IL1Bd,p, TNFd,p, IFNA1d, uni. S100A10d,p, INSd, BDNFd,p, GRM2d, IL2d, AVPd, PDYNp, FCGR3A, CD4d, GRIA3d, SV2A, IGFBP2d, PENK, CD8d, DRD4d,p, PCNTd HTR1Ad,p, CD19, CD8d, GRIN2Ap, GRIN1p inc. TNFd,p HTR1Ad,p, NR3C1d,p, BDNFd,p, dec. CHRM1, NOS1d,p, CYP2D6dp PLCG1d FOSd, IL6d,p, HTR2Ad, ALBd, HTR3Ap, IL1Bd,p, HTR2Ad, TNFd,p, ADRA2Ad,p, HTR1Ad,p, BDNFd,p, HTR1Ad,p, FOSd, FZD3d, ABCB1d,p, PDE4Ad, ABCB1d,p, IGF1d, Antidep. PLA2G4Ap, IL6d,p, CACNA1G, CACNA1I, S100A10d,p, HTR1Bd,p, CREB1d,p, uni. CACNA1H, GSK3Ad, SLC6A3d,p, PRLd, PLA2G4Ap, SYPd, NCAM1d, SLC6A4d,p, KCNK2d, Defa5, VIM, TRA, NTRK2d,p, PLCG1d, SPRd, Hspa9, BRCA1d, CKB, ACTB, GFAPd,p, PDE4Ad, RASEF, PDIA3, SLC6A4d,p, CREB1d,p, CCNA2, CKS1B, BAX CDKN1A, CDKN1B, BCL2d, MAPK1d Genes marked with superscripts d and p are validated with DisGeNET (Piñero et al., 2015) and DEPgenes (Kao et al., 2011), respectively. The reader is referred to the published corpus for more details. Table 6: Distribution of the annotated genes signed ‘increase’ to CAE. However, the other an- Another cause of disagreements was the use of notator interpreted the word as conveying only the reasoning and prior knowledge during annotation. meaning of ‘regulation’ with no directionality and assigned ‘unidentifiable’ to CAE. After annotator Example 8. In the current paper, we propose that the rapid [decrease] in [insulin] level dur- meeting, the CAE value of the annotation unit e g above was set to ‘increase’. ing the postpartum period may be one of the causes of [postpartum mood disorders]d. [PMID: Example 7. Repeated treatment with antide- 16321476] pressant drugs, [imipramine] (Imi) and fluoxe- a For the annotation unit in Example 8, one an- tine (Flu), significantly reduced the plasma cor- notator claimed that there is no association be- ticosterone concentration and [enhanced]e the [BDNF] and CREB levels. [PMID: 16519925] tween the gene insulin and the depression mood g disorders, as he did not find any explicitly stated For the annotation unit above, one annotator in- piece of information. The other annotator, how- terpreted the phrase ‘repeated treatment’ as con- ever, assigned ‘decreased’ to CGE, as he inferred veying the meaning of ‘enhance’ and assigned ‘in- that the mood disorders co-occurs with insulin in crease’ to CAE. However, the other annotator ar- postpartum period. After annotator meeting, the gued that the nature of the antidepressant drugs annotators agreed on ‘decreased’, and added an did not change and assigned ‘unchanged’ to CAE. instruction that allows the inference using logical reasoning to the annotation guidelines.

109 # Phase # Units #Depression #Antidepressant #Genes Data source Phase 1 142 75 67 47 PubMed abstracts Phase 2 68 22 46 42 PubMed abstracts Total/Unique 210/106 97/5 113/20 89/82 PubMed abstracts

Table 7: The annotation phases

CGE CDL/CAE CC Simple Kappa G Simple Kappa G Simple Kappa G Phase 1 1 1 1 0.92 0.69 0.88 0.76 0.47 0.64 Phase 2 0.91 0.81 0.82 1 1 1 0.97 0.93 0.96 Total 0.95 0.91 0.91 0.96 0.85 0.94 0.87 0.7 0.8 Table 8: IAA values

Example 9. All [antidepressants]a [increased]e greatly reduced. In Phase 2, almost all the disa- [c-fos mRNA]g in the central amygdala, as previ- greements were found due to simple errors. Com- ously shown, while c-fos was also increased in pared to the values from Phase 1, IAA values on the anterior insular cortex and significantly de- CDL/CAE and CC from Phase 2 show 13.6% and creased within the septum. [PMID: 15812568] 50.0% increases in terms of G index, respectively. One annotator considered the phrase “All anti- 3.2 Annotation guidelines depressants increased c-fos mRNA” a universal The initial annotation guidelines were taken from affirmative, and just modified the antidepressant Lee et al. (2013). After each annotation phase in term as the universal quantifier, “All antidepres- this work, the annotators held meetings to resolve sants”. However, the other annotator anchored on the disagreements and to revise the guidelines. Ta- the pre-annotated keyword “antidepressants”. Af- ble 9 shows the final version of guidelines. ter annotator meeting, the annotators agreed to specify the quantification type of a term and check 4 Discussion the scope of that quantifier. In this section, we show suggestions to further au- As we refined annotation guidelines after Phase tomating some of the processes described in the 1, the disagreements among the annotators were

# Instruction Annotators should annotate the sentences only if the gene exhibits changes in its expression level 1 and this has relations with the depression or anti-depressant related term Annotators can annotate the relations between CGE and CDL/CAE utilizing linguistic clues and 2 textual evidence 3 Annotators can infer omitted fact utilizing reasoning 4 Annotators should interpret the sentences from an ‘objective point of view’ Annotators need not consider gene expression level changes in healthy people and people with a 5 past history of clinical depression Annotators should not infer information using their prior experience or knowledge about proper- 6 ties of various kinds of depression Annotators should not infer information (i.e., the effects of antidepressants) using their prior 7 knowledge about the functions of genes 8 Annotators should not infer information by using inductive reasoning 9 Annotators need not consider the certainty level of propositions. 10 Annotators need consider universal propositions and particular propositions 11 Annotators should not annotate relations between genes and mania in bipolar disorder

Table 9: Annotation guidelines

110 previous section, especially those of extracting such as pronouns, acronyms, and appositions. depression-gene relations. They may have coped better by using the full re- solved forms of pronouns and acronyms for anno-  ML-based event relation recognition tation, which in turn require the access of preced- Example 10. OBJECTIVE: To examine ing sentences or the whole abstract in the worst whether the pathogenesis of [depression]d is as- case. We also found that text mining tools we used sociated with altered [activation]e and expres- extract both the appositive phrase and the phrase sion of [Rap-1]g, as well as expression of Epac, in apposition, but it would be better to utilize only in depressed suicide victims. [PMID: 16754837] appositives. For example, for the following phrase, we should not annotate the word “Tricyclic anti- Example 10 shows that there are co-occur- depressants” an antidepressant related term, or rences whose depression and gene name pairs annotate “serotonin reuptake” a gene. were identified as correct but whose relation was nonetheless incorrect. The present co-occurrence “Tricyclic antidepressants, selective serotonin has a relation of study description rather than that reuptake inhibitors, and serotonin-noradrenaline of gene expression change event. Besides training reuptake inhibitors, as well as the immediate to come up with the event relation classifier, we precursor of serotonin” can also build a system that automatically filters Instead, we should identify the three apposi- out false relations (i.e., hypothesis sentences) tives as antidepressant related terms, even if they based on the previous work such as topic-classi- were not included in the dictionary. fied relation recognition (Chun et al., 2006; Kili- coglu and Bergler, 2008) and deep-syntactic par-  Sense ambiguity of ‘depression’ ser (Ballesteros et al., 2014; Hara et al., 2005; Masseroli et al., 2006; Skounakis et al., 2008). We also see that using simple dictionary-based matching for detecting depression-related terms  Location and contrasting information produces many ambiguous terms, some of which are not related to the mental disorder at all. In par- Example 11. Animal studies demonstrate that ticular, the term ‘depression’ could also be used in some antipsychotics and [antidepressants] [in- a a situation where a certain amount, value, or func- crease] neurogenesis and [BDNF] expression e g tion is lowered or decreased, among others. We in the hippocampus, which is reduced in volume notice that such cases are frequently observed in in patients with depression or schizophrenia. biomedical texts as exemplified below: [PMID: 16652337] Example 12. Lack of enteral stimulation with Example 11, and Example 9 too, show that lo- PN impairs mucosal immunity and [reduces] cation information turn out to be important in e [IgA] levels through [depression] of GALT cy- studies of depression and genes may respond dif- g d tokines (IL-4 and IL-10) and GALT specific ad- ferently to the same antidepressant in different hesion molecules. [PMID: 16926565] parts of a body. Many annotation units do not ex- plicitly provide such location information. How- Example 13. LTA causes cardiac [depression]d ever, missing such information will lead to con- by [activating]e myocardial TNF-alpha synthe- flicts and even paradoxes among annotated or sis via [CD14]g and induces coronary vascular mined results. disturbances by activating Cox-2-dependent TXA2 synthesis. [PMID: 16043646] Although the annotation concepts of the pre- sented corpus are originally designed to represent In our initial dataset that has 1,251 occurrences relations between gene changes and depres- of depression-related terms obtained via the sim- sion/antidepressant changes, they must be made to ple dictionary-based matching, the term ‘depres- accept other concepts and constantly changing sion’ is found 730 times, which amounts to more metrics in genetic studies of depression. In this re- than half of the entire occurrences. Our corpus sta- gard, we should extend the annotation scheme to tistics in Table 4 also show that ‘depression’ is the include parts of a body as the location and their most frequent depression-related term. This hierarchical relationship information. means that not a few of such terms still have po- tential sense ambiguities. Although we manually  Pronouns, acronyms, and appositions filtered out false positive examples in our corpus, Other difficulties we faced during recognition this issue is still important since it could hinder the were in dealing with grammatical constructions performance of extracting depression-related

111 terms in a fully pipelined system. Although a few Terms in our Terms not in our named entity recognizers for biomedical text have dictionary dictionary been developed (Leaman and Gonzalez, 2008; Term Score Term Score Campos et al., 2013), none of these tools are ca- major 3414 treatment 807 pable of recognizing terms referring to depression, especially identifying ‘depression’ as the mental antidepressant 2533 reuptake 504 disorder, to the best of our knowledge. disorder 1957 serotonin 475 depressive 1615 MDD 464 It is anticipated that the disambiguation of the bipolar 986 psychiatric 450 term ‘depression’ can be addressed with the con- mood 874 rating 356 ventional methods of word sense disambiguation disorders 695 diagnostic 340 with various features such as context information unipolar 523 DSM-IV 312 or external knowledge resources. Our data analy- sis suggests that local semantic features would be tricyclic 441 criteria 301 effective in many cases, among others. In particu- depressed 409 patients 296 lar, the following three types of syntactic con- Table 10: Discriminative terms for documents struction could act as strong indicators for false related to the depressive disorder positives: (1) prepositional phrases, (2) prenomi- nal modifiers, and (3) coordinate constructions. dictionary (on the right column). It is shown that First, prenominal modifiers often signal the con- many of the terms in the latter set are used in the text where some activity or amount is decreased, context of diagnosis or treatment of depression. such as the physical malfunction (“cardiac depres- One of the possible methods is to use terms of this sion”), the object or cause of inhibition (“Orx-B- kind as features for training a binary classifier that induced depression”, “AMPAR depression”), and determines whether a given document containing the degree of decrease (“significant depression”, ‘depression’ discusses the mental disorder or not. “moderate depression”). Second, prepositional phrases provide information about the location or 5 Conclusion inhibition of a biological process (“depression in synapses”, “depression of synaptic transmission”, In this paper, we presented a depression-specific “depression of gamma interferon”). Last, coordi- corpus in support of the development of advanced nate constructions allow for exploiting the seman- text mining systems that target specifically at tic similarity (“depression and anxiety” vs. “long- providing a comprehensive information of depres- term potentiation and depression”). All of these sion-gene relations. The annotation scheme of features are highly local; syntactic dependencies current version can express two events, gene ex- do not cross the boundary of noun phrases. pression changes and depression level or antide- pressant effect changes, and the relationship be- Another possible approach would be to employ tween these two events. The presented corpus the document topic features by assuming that if shows a high inter-annotator agreement. We also the abstract of a document discusses the mental discussed several issues in the domain of depres- disorder, the term ‘depression’ in the abstract is sion and made suggestions to extend the annota- also likely to refer to the mental disorder. In order tion scheme further to resolve conflicts and some- to figure out what kind of terms are best indicative times paradoxes in the acquired knowledge for de- of documents that discuss the depressive disorder, pression. we collected a set of 5,000 Medline abstracts that contain unambiguous domain-specific terms in Acknowledgements our depression term dictionary such as ‘depres- sive disorder’, ‘bipolar disorder’, and ‘antidepres- This work was supported by the National Re- sant’, and also collected another set of 10,000 ab- search Foundation of Korea (NRF) grant funded stracts that do not contain any of those terms in- by the Korea government (MSIP) (No. NRF- cluding ‘depression’. The chi-square statistics are 2014R1A2A1A11052310). employed to measure the discriminative power of terms found in each set of abstracts. Table 10 shows the 10 top-ranked terms for each of two types of term: terms that partially match one of the terms in our depression term dictionary (on the left column) and terms that are not found in the

112 References recognition. In Proceedings of the Pacific Sympo- sium on Biocomputing, 652-663. M. Ballesteros, B. Bohnet, S. Mille, L. Wanner. 2014. Deep-Syntactic Parsing. In Proceedings of the 24th M. C. D. Marneffe, B. MacCartney, C. D. Manning. International Conference on Computational Lin- 2006. Generating typed dependency parses from guistics. 1402-1413 phrase structure parses. In Proceedings of the LREC, 449-454. R. H. Belmaker, G. Agam. 2008. Major depressive dis- order. New England Journal of Medicine, 358:55-68. M. Masseroli, H. Kilicoglu, F. Lang, T. Rindflesch. 2006. Argument-predicate distance as a filter for en- J. Björne, J. Heimonen, F. Ginter, A. Airola, T. hancing precision in extracting predications on the Pahikkala, T. Salakoski. 2009. Extracting complex genetic etiology of disease. BMC Bioinformatics, biological events with rich graph-based features sets. 7:291. In Proceedings of the BioNLP’09 Shared Task on Event Extraction Association for Computational D. McClosky. 2010. Any domain parsing: automatic Linguistics, 10-18. domain adaptation for natural language parsing. PhD Thesis, Brown University, Department of D. Campos, S. Matos, J. L. Oliveira. 2013. Gimli: open Computer Science. source and high-performance biomedical name recognition. BMC Bioinformatics, 14:54. D. McDonald, U Kelly. 2012. The value and benefits of text mining. UK JISC, [Online. Available: E. Charniak, M. Johnson. 2005. Coarse-to-fine n-best http://www.jisc.ac.uk/reports/value-and-benefits- parsing and MaxEnt discriminative reranking. In of-text-mining]. Proceedings of the 43rd ACL, 173-180. E. J. Nestler, M. Barrot, R. J. DiLeone, A. J. Eisch, S. H. Chun, Y. Tsuruoka, J. Kim, R. Shiba, N. Nagata, T. J. Gold, L. M. Monteggia. 2002. Neurobiology of Hishiki, J. Tsujii. 2006. Automatic recognition of depression. , 34:13-25. topic-classified relations between prostate cancer and genes using MEDLINE abstracts. BMC Bioin- M. Neves, J. M. Carazo, A. Pascual-Montano. 2005. formatics, 7(Suppl 3):S4. Moara: a Java library for extracting and normalizing gene and mentions. BMC Bioinformatics, 11: Hee-Jin Lee, Sang-Hyung Shim, Mi-Ryoung Song, 157-169. Hyunju Lee, Jong C. Park. 2013. CoMAGC: a cor- pus with multi-faceted annotations of gene-cancer J. Piñero, N. Queralt-Rosinach, À . Bravo, J. Deu-Pons, relations. BMC Bioinformatics, 14:323. A. Bauer-Mehren, M. Baron, F. Sanz, L. I. Furlong. 2015. DisGeNET: a discovery platform for the dy- Hee-Jin Lee, Tien Cuong Dang, Hyunju Lee, Jong C. namical exploration of human diseases and their Park. 2014. OncoSearch: cancer gene search engine genes. Database, 2015:bav028. with literature evidence. Nucleic Acids Research, 42(W1):W416-W421. S. Poux, M. Magrane, C. N. Arighi, A. Bridge, C. O'Donovan, K. Laiho, The UniProt Consortium. T. Hara, Y. Miyao, J. Tsujii. 2005. Adapting a proba- 2014. Expert curation in UniProtKB: a case study on bilistic disambiguation model of an HPSG parser to dealing with conflicting and erroneous data. Data- a new domain. In Proceedings of IJCNLP, 199-210. base, 2014:bau016. C. F. Kao, Y. S. Fang, Z. Zhao, P. H. Kuo. 2011. Pri- S. Pyysalo, T. Ohta, S. Ananiadou. 2013. Overview of oritization and evaluation of depression candidate the Cancer Genetics (CG) task of BioNLP Shared genes by combining multidimensional data re- Task 2013. In Proceedings of the BioNLP Shared sources. PLoS ONE, 6(4):1-9. Task 2013 Workshop, 58-66. H. Kilicoglu, S. Bergler. 2008. Recognizing specula- M. Skounakis, M. Craven, S. Ray. 2003. Hierarchical tive language in biomedical research articles: a lin- hidden Markov models for information extraction. guistically motivated perspective. BMC Bioinfor- In Proceedings of the Eighteenth International Joint matics, 2008, 9(Suppl 11):S10. Conference on Artificial Intelligence, 427-433. J. Kim, T. Ohta, J. Tsujii. 2008. Corpus annotation for R. Winnenburg, T. Wachter, C. Plake, A. Doms, M. mining biomedical events from literature. BMC Bi- Schroeder. 2008. Facts from text: can text mining oinformatics, 9:10. help to scale-up high-quality manual curation of S. V. Landeghem, K. Hakala, S. Rnnqvist, T. Salakoski, gene products with ontologies? Briefings in Bioin- Y. Peer, F. Ginter. 2012. Exploring biomolecular lit- formatics, 9(6):466-78. erature with EVEX: connecting genes through events, homology and indirect associations. Ad- vances in Bioinformatics, 2012:582765. R. Leaman, G. Gonzalez. 2008. BANNER: an execut- able survey of advances in biomedical named entity

113