Incorporating Trustiness and Collective Synonym/Contrastive Evidence Into Taxonomy Construction

Total Page:16

File Type:pdf, Size:1020Kb

Incorporating Trustiness and Collective Synonym/Contrastive Evidence Into Taxonomy Construction Incorporating Trustiness and Collective Synonym/Contrastive Evidence into Taxonomy Construction #1 #2 3 Luu Anh Tuan , Jung-jae Kim , Ng See Kiong ∗ #School of Computer Engineering, Nanyang Technological University, Singapore [email protected], [email protected] ∗Institute for Infocomm Research, Agency for Science, Technology and Research, Singapore [email protected] Abstract Previous works on the task of taxonomy con- struction capture information about potential tax- Taxonomy plays an important role in many onomic relations between concepts, rank the can- applications by organizing domain knowl- didate relations based on the captured information, edge into a hierarchy of is-a relations be- and integrate the highly ranked relations into a tax- tween terms. Previous works on the taxo- onomic structure. They utilize such information nomic relation identification from text cor- as hypernym patterns (e.g. A is a B, A such as pora lack in two aspects: 1) They do not B) (Kozareva et al., 2008), syntactic dependency consider the trustiness of individual source (Drumond and Girardi, 2010), definition sentences texts, which is important to filter out in- (Navigli et al., 2011), co-occurrence (Zhu et al., correct relations from unreliable sources. 2013), syntactic contextual similarity (Tuan et al., 2) They also do not consider collective 2014), and sibling relations (Bansal et al., 2014). evidence from synonyms and contrastive terms, where synonyms may provide ad- They, however, lack in the three following as- ditional supports to taxonomic relations, pects: 1) Trustiness: Not all sources are trust- while contrastive terms may contradict worthy (e.g. gossip, forum posts written by them. In this paper, we present a method non-experts) (Dong et al., 2015). The trustiness of taxonomic relation identification that of source texts is important in taxonomic rela- incorporates the trustiness of source texts tion identification because evidence from unre- measured with such techniques as PageR- liable sources can be incorrect. For example, ank and knowledge-based trust, and the the invalid taxonomic relation between “American collective evidence of synonyms and con- chameleon” and “chameleon” is mistakenly more trastive terms identified by linguistic pat- popular in the Web than the valid taxonomic rela- tern matching and machine learning. The tion between “American chameleon” and “lizard”, experimental results show that the pro- and statistical methods without considering the posed features can consistently improve trustiness may incorrectly extract the invalid rela- performance up to 4%-10% of F-measure. tion instead of the latter. However, to the best of our knowledge, no previous work considered this 1 Introduction aspect. Taxonomies which serve as backbone of struc- 2) Synonyms: A concept may be expressed in tured knowledge are useful for many applica- multiple ways, for example with synonyms. The tions such as question answering (Harabagiu et previous works mostly assumed that a term repre- al., 2003) and document clustering (Fodeh et al., sents an independent concept, and did not combine 2011). Even though there are many hand-crafted, information about a concept, which is expressed well-structured taxonomies publicly available, in- with multiple synonyms. The lack of evidence cluding WordNet (Miller, 1995), OpenCyc (Ma- from synonyms may hamper the ranking of can- tuszek et al., 2006), and Freebase (Bollacker et didate taxonomic relations. Navigli and Velardi al., 2008), they are incomplete in specific domains, (2004) combined synonyms into a concept, but and it is time-consuming to manually extend them only for those from WordNet, called synsets. or create new ones. There is thus a need for auto- 3) Contrastive terms: We observe that if two matically extracting taxonomic relations from text terms are often contrasted (e.g. A but not B, A corpora to construct/extend taxonomies. is different from B) (Kim et al., 2006), they may 1013 Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pages 1013–1022, Lisbon, Portugal, 17-21 September 2015. c 2015 Association for Computational Linguistics. not have a taxonomic relation. tactic contextual similarity (Tuan et al., 2014), and In this paper, we present a method based on the word embedding (Fu et al., 2014). The main idea state-of-the-art method (Tuan et al., 2014), which behind these techniques is that the terms that are incorporates the trustiness of source texts and asymmetrically similar to each other with regard the collective evidence from synonyms/contrastive to, for example, co-occurrences, syntactic con- terms. Tuan et al. (2014) rank candidate tax- texts, and latent vector representation may have onomic relations based on such evidence as hy- taxonomic relationships. Such methods, however, pernym patterns, WordNet, and syntactic contex- usually suffer from low accuracy, though show- tual similarity, where the pattern matches and the ing relatively high coverage. To get the balance syntactic contexts are found from the Web by us- between the two approaches, Yang and Callan ing a Web search engine. First, we calculate the (2009), Zhu et al. (2013) and Tuan et al. (2014) trustiness score of each data source with the four combine both statistical and linguistic features in following weights: importance (if it is linked by the process of finding taxonomic relations. many pages), popularity (if it is visited by many Most of these previous methods do not consider users), authority (if it is from a creditable Web site) if the source text of evidence (e.g. co-occurrences, and accuracy (if it has many facts). We integrate pattern matches) is trustworthy or not and do not this score to the work of (Tuan et al., 2014) so that combine evidence from synonyms and contrastive evidence for taxonomic relations from unreliable terms as discussed earlier. Related to synonyms, a sources is discarded. few previous works utilize siblings for taxonomy Second, we identify synonyms of two terms (t1, construction. Yang and Callan (2009) use siblings t2), whose taxonomic relation is being scrutinized, as one of the features in the metric-based frame- by matching such queries as “t1 also known as” work which incrementally clusters terms to form against the Web to find t1’s synonyms next to the taxonomies. Wentao et al. (2012) also utilize such query matches (e.g. t in “t1 also known as t”). We sibling feature that, for example of the linguistic then collect the evidence for all taxonomic rela- pattern “A such as B1, B2, and Bn”, if the ··· tions between t10 and t20 , where ti0 is either ti or its concept at the k-th position (e.g. Bk) from pat- synonym (i 1, 2 ), and combine them to calcu- tern keywords (e.g. such as) is a valid sub-concept ∈ { } late the evidence score of the candidate taxonomic (e.g. of A), then most likely its siblings from po- relation between t1 and t2. Similarly, for each pair sition 1 to position k-1 (e.g. B1, , Bk 1) are ··· − of two terms (t1, t2), we collect their contrastive also valid sub-concepts. Bansal et al. (2014) in- evidence by matching such queries as “t1 is not a clude the sibling factors to a structured probabilis- type of t2” against the Web, and use them to pro- tic model over the full space of taxonomy trees, portionally decrease the evidence score for taxo- thus helping to add more evidence to taxonomic nomic relation between contrasting terms. relations. Navigli and Velardi (2004) utilize the synonym feature (i.e. WordNet synsets) for the 2 Related Work process of semantic disambiguation and concept clustering as mentioned above, but not for the pro- The previous methods for identifying taxonomic cess of inducing novel taxonomic relations. relations from text can be generally classified into two categories: linguistic and statistical ap- 3 Methodology proaches. The former approach mostly exploits lexical-syntactic patterns (e.g. A is a B, A such We briefly introduce (Tuan et al., 2014) in Section as B) (Hearst, 1992). Those patterns can be man- 3.1. We then explain how to incorporate trustiness ually created (Kozareva et al., 2008; Wentao et al., (Section 3.2) and collective evidence from syn- 2012) or automatically identified (Navigli et al., onyms (Section 3.3) and from contrastive terms 2011; Bansal et al., 2014). The pattern matching (Section 3.4) into the work of (Tuan et al., 2014). methods show high precision when the patterns are carefully defined, but low coverage due to the 3.1 Overview of baseline (Tuan et al., 2014) lack of contextual analysis across sentences. Tuan et al. (2014) follow three steps to construct a The latter approach, on the other hand, includes taxonomy: term extraction/filtering, taxonomic re- asymmetrical term co-occurrence (Fotzo and Gal- lation identification and taxonomy induction. Be- linari, 2004), clustering (Wong et al., 2007), syn- cause the focus of this paper is on the second step, 1014 taxonomic relation identification, we use the same 3.2 Trustiness of the evidence data methods for the first and third steps as in (Tuan et We introduce our method of estimating the trusti- al., 2014) and will not discuss them here. ness of a given source text in Section 3.2.1 and ex- Given each ordered pair of two terms t1 and t2 plain how to incorporate it into the work of (Tuan from the term extraction/filtering, the taxonomic et al., 2014) in Section 3.2.2. relation identification of (Tuan et al., 2014) calcu- lates the evidence score that t1 is a hypernym of t2 3.2.1 Collecting trustiness score of the (denoted as t t ) based on the following three evidence data 1 2 measures: Given a data source (e.g. Web page), we consider four aspects of trustiness as follows: String inclusion with WordNet (SIWN): This Importance: A data source may be important • measure is to check if t1 is a substring of t2, con- if it is referenced by many other sources.
Recommended publications
  • JVP 26(3) September 2006—ABSTRACTS
    Neoceti Symposium, Saturday 8:45 acid-prepared osteolepiforms Medoevia and Gogonasus has offered strong support for BODY SIZE AND CRYPTIC TROPHIC SEPARATION OF GENERALIZED Jarvik’s interpretation, but Eusthenopteron itself has not been reexamined in detail. PIERCE-FEEDING CETACEANS: THE ROLE OF FEEDING DIVERSITY DUR- Uncertainty has persisted about the relationship between the large endoskeletal “fenestra ING THE RISE OF THE NEOCETI endochoanalis” and the apparently much smaller choana, and about the occlusion of upper ADAM, Peter, Univ. of California, Los Angeles, Los Angeles, CA; JETT, Kristin, Univ. of and lower jaw fangs relative to the choana. California, Davis, Davis, CA; OLSON, Joshua, Univ. of California, Los Angeles, Los A CT scan investigation of a large skull of Eusthenopteron, carried out in collaboration Angeles, CA with University of Texas and Parc de Miguasha, offers an opportunity to image and digital- Marine mammals with homodont dentition and relatively little specialization of the feeding ly “dissect” a complete three-dimensional snout region. We find that a choana is indeed apparatus are often categorized as generalist eaters of squid and fish. However, analyses of present, somewhat narrower but otherwise similar to that described by Jarvik. It does not many modern ecosystems reveal the importance of body size in determining trophic parti- receive the anterior coronoid fang, which bites mesial to the edge of the dermopalatine and tioning and diversity among predators. We established relationships between body sizes of is received by a pit in that bone. The fenestra endochoanalis is partly floored by the vomer extant cetaceans and their prey in order to infer prey size and potential trophic separation of and the dermopalatine, restricting the choana to the lateral part of the fenestra.
    [Show full text]
  • First Report of Ancylostoma Ceylanicum in Wild Canids Q ⇑ Felicity A
    International Journal for Parasitology: Parasites and Wildlife 2 (2013) 173–177 Contents lists available at SciVerse ScienceDirect International Journal for Parasitology: Parasites and Wildlife journal homepage: www.elsevier.com/locate/ijppaw Brief Report First report of Ancylostoma ceylanicum in wild canids q ⇑ Felicity A. Smout a, R.C. Andrew Thompson b, Lee F. Skerratt a, a School of Public Health, Tropical Medicine and Rehabilitation Sciences, James Cook University, Townsville, Queensland 4811, Australia b School of Veterinary and Biomedical Sciences, Murdoch University, Murdoch, Western Australia 6150, Australia article info abstract Article history: The parasitic nematode Ancylostoma ceylanicum is common in dogs, cats and humans throughout Asia, Received 19 February 2013 inhabiting the small intestine and possibly leading to iron-deficient anaemia in those infected. It has pre- Revised 23 April 2013 viously been discovered in domestic dogs in Australia and this is the first report of A. ceylanicum in wild Accepted 26 April 2013 canids. Wild dogs (dingoes and dingo hybrids) killed in council control operations (n = 26) and wild dog scats (n = 89) were collected from the Wet Tropics region around Cairns, Far North Queensland. All of the carcasses (100%) were infected with Ancylostoma caninum and three (11.5%) had dual infections with A. Keywords: ceylanicum. Scats, positively sequenced for hookworm, contained A. ceylanicum, A. caninum and Ancylos- Ancylostoma ceylanicum toma braziliense, with A. ceylanicum the dominant species in Mount Windsor National Park, with a prev- Dingo Canine alence of 100%, but decreasing to 68% and 30.8% in scats collected from northern and southern rural Hookworm suburbs of Cairns, respectively.
    [Show full text]
  • The Common Snapping Turtle, Chelydra Serpentina
    The Common Snapping Turtle, Chelydra serpentina Rylen Nakama FISH 423: Olden 12/5/14 Figure 1. The Common Snapping Turtle, one of the most widespread reptiles in North America. Photo taken in Quebec, Canada. Image from https://www.flickr.com/photos/yorthopia/7626614760/. Classification Order: Testudines Family: Chelydridae Genus: Chelydra Species: serpentina (Linnaeus, 1758) Previous research on Chelydra serpentina (Phillips et al., 1996) acknowledged four subspecies, C. s. serpentina (Northern U.S. and Figure 2. Side profile of Chelydra serpentina. Note Canada), C. s. osceola (Southeastern U.S.), C. s. the serrated posterior end of the carapace and the rossignonii (Central America), and C. s. tail’s raised central ridge. Photo from http://pelotes.jea.com/AnimalFact/Reptile/snapturt.ht acutirostris (South America). Recent IUCN m. reclassification of chelonians based on genetic analyses (Rhodin et al., 2010) elevated C. s. rossignonii and C. s. acutirostris to species level and established C. s. osceola as a synonym for C. s. serpentina, thus eliminating subspecies within C. serpentina. Antiquated distinctions between the two formerly recognized North American subspecies were based on negligible morphometric variations between the two populations. Interbreeding in the overlapping range of the two populations was well documented, further discrediting the validity of the subspecies distinction (Feuer, 1971; Aresco and Gunzburger, 2007). Therefore, any emphasis of subspecies differentiation in the ensuing literature should be disregarded. Figure 3. Front-view of a captured Chelydra Continued usage of invalid subspecies names is serpentina. Different skin textures and the distinctive pink mouth are visible from this angle. Photo from still prevalent in the exotic pet trade for C.
    [Show full text]
  • Arxiv:1809.06223V1 [Cs.CL] 17 Sep 2018
    Unsupervised Sense-Aware Hypernymy Extraction Dmitry Ustalov†, Alexander Panchenko‡, Chris Biemann‡, and Simone Paolo Ponzetto† †University of Mannheim, Germany fdmitry,[email protected] ‡University of Hamburg, Germany fpanchenko,[email protected] Abstract from text between two ambiguous words, e.g., apple fruit. However, by definition in Cruse In this paper, we show how unsupervised (1986), hypernymy is a binary relationship between sense representations can be used to im- senses, e.g., apple2 fruit1, where apple2 is the prove hypernymy extraction. We present “food” sense of the word “apple”. In turn, the word a method for extracting disambiguated hy- “apple” can be represented by multiple lexical units, pernymy relationships that propagate hy- e.g., “apple” or “pomiculture”. This sense is dis- pernyms to sets of synonyms (synsets), tinct from the “company” sense of the word “ap- constructs embeddings for these sets, and ple”, which can be denoted as apple3. Thus, more establishes sense-aware relationships be- generally, hypernymy is a relation defined on two tween matching synsets. Evaluation on two sets of disambiguated words; this modeling princi- gold standard datasets for English and Rus- ple was also implemented in WordNet (Fellbaum, sian shows that the method successfully 1998), where hypernymy relations link not words recognizes hypernymy relationships that directly, but instead synsets. This essential prop- cannot be found with standard Hearst pat- erty of hypernymy is however not used or modeled terns and Wiktionary datasets for the re- in the majority of current hypernymy extraction spective languages. approaches. In this paper, we present an approach that addresses this shortcoming.
    [Show full text]
  • Roberto Navigli Curriculum Vitæ
    Roberto Navigli Curriculum Vitæ Roma, 18 marzo 2021 Part I { General Information Full Name Roberto Navigli Citizenship Italian E-mail [email protected] Spoken languages Italian, English, French Sito Web http://wwwusers.di.uniroma1.it/∼navigli Part II { Education Type Year Institution Notes (Degree, Experience, etc.) Ph.D. 2007 Sapienza PhD in Computer Science, advisor: prof. Paola Velardi. Thesis: "Structural Semantic Interconnections: a Knowledge-Based WSD Algorithm, its Evaluation and Applications", Winner of the 2017 AI*IA \Marco Cadoli" national prize for the best PhD thesis in AI. University 2001 Sapienza Master degree in Computer Science, 110/110 cum laude. graduation Thesis: "An automatic algorithm for domain ontology learning", supervisor: prof. Paola Velardi. Part III { Appointments III(A) { Academic Appointments Start End Institution Position 09/2017 oggi Sapienza full professor, Dipartimento di Informatica, Universit`adi Roma \La Sapienza". 12/2010 08/2017 Sapienza associate professor, Dipartimento di Informatica, Universit`adi Roma \La Sapienza". 03/2007 12/2010 Sapienza assistant professor, Dipartimento di Informatica, Sapienza. 05/2003 03/2007 Sapienza research fellow, Dipartimento di Informatica, Sapienza. III(B) { Other Appointments Start End Institution Position 05/2000 04/2003 YH Reply software engineer S.p.A. Roma 1 III(C) { Research visits and stays Start End Institution Position 01/2015 01/2015 Center for Advanced Studies Invited Visiting Fellow (CAS), LMU, Germany 2010 2012 University of Wolverhampton Visiting
    [Show full text]
  • Download Vol. 11, No. 3
    BULLETIN OF THE FLORIDA STATE MUSEUM BIOLOGICAL SCIENCES Volume 11 Number 3 CATALOGUE OF FOSSIL BIRDS: Part 3 (Ralliformes, Ichthyornithiformes, Charadriiformes) Pierce Brodkorb M,4 * . /853 0 UNIVERSITY OF FLORIDA Gainesville 1967 Numbers of the BULLETIN OF THE FLORIDA STATE MUSEUM are pub- lished at irregular intervals. Volumes contain about 800 pages and are not nec- essarily completed in any one calendar year. WALTER AuFFENBERC, Managing Editor OLIVER L. AUSTIN, JA, Editor Consultants for this issue. ~ HILDEGARDE HOWARD ALExANDER WErMORE Communications concerning purchase or exchange of the publication and all manuscripts should be addressed to the Managing Editor of the Bulletin, Florida State Museum, Seagle Building, Gainesville, Florida. 82601 Published June 12, 1967 Price for this issue $2.20 CATALOGUE OF FOSSIL BIRDS: Part 3 ( Ralliformes, Ichthyornithiformes, Charadriiformes) PIERCE BRODKORBl SYNOPSIS: The third installment of the Catalogue of Fossil Birds treats 84 families comprising the orders Ralliformes, Ichthyornithiformes, and Charadriiformes. The species included in this section number 866, of which 215 are paleospecies and 151 are neospecies. With the addenda of 14 paleospecies, the three parts now published treat 1,236 spDcies, of which 771 are paleospecies and 465 are living or recently extinct. The nominal order- Diatrymiformes is reduced in rank to a suborder of the Ralliformes, and several generally recognized families are reduced to subfamily status. These include Geranoididae and Eogruidae (to Gruidae); Bfontornithidae
    [Show full text]
  • Onetouch 4.0 Scanned Documents
    / Chapter 2 THE FOSSIL RECORD OF BIRDS Storrs L. Olson Department of Vertebrate Zoology National Museum of Natural History Smithsonian Institution Washington, DC. I. Introduction 80 II. Archaeopteryx 85 III. Early Cretaceous Birds 87 IV. Hesperornithiformes 89 V. Ichthyornithiformes 91 VI. Other Mesozojc Birds 92 VII. Paleognathous Birds 96 A. The Problem of the Origins of Paleognathous Birds 96 B. The Fossil Record of Paleognathous Birds 104 VIII. The "Basal" Land Bird Assemblage 107 A. Opisthocomidae 109 B. Musophagidae 109 C. Cuculidae HO D. Falconidae HI E. Sagittariidae 112 F. Accipitridae 112 G. Pandionidae 114 H. Galliformes 114 1. Family Incertae Sedis Turnicidae 119 J. Columbiformes 119 K. Psittaciforines 120 L. Family Incertae Sedis Zygodactylidae 121 IX. The "Higher" Land Bird Assemblage 122 A. Coliiformes 124 B. Coraciiformes (Including Trogonidae and Galbulae) 124 C. Strigiformes 129 D. Caprimulgiformes 132 E. Apodiformes 134 F. Family Incertae Sedis Trochilidae 135 G. Order Incertae Sedis Bucerotiformes (Including Upupae) 136 H. Piciformes 138 I. Passeriformes 139 X. The Water Bird Assemblage 141 A. Gruiformes 142 B. Family Incertae Sedis Ardeidae 165 79 Avian Biology, Vol. Vlll ISBN 0-12-249408-3 80 STORES L. OLSON C. Family Incertae Sedis Podicipedidae 168 D. Charadriiformes 169 E. Anseriformes 186 F. Ciconiiformes 188 G. Pelecaniformes 192 H. Procellariiformes 208 I. Gaviiformes 212 J. Sphenisciformes 217 XI. Conclusion 217 References 218 I. Introduction Avian paleontology has long been a poor stepsister to its mammalian counterpart, a fact that may be attributed in some measure to an insufRcien- cy of qualified workers and to the absence in birds of heterodont teeth, on which the greater proportion of the fossil record of mammals is founded.
    [Show full text]
  • Summary Report of Freshwater Nonindigenous Aquatic Species in U.S
    Summary Report of Freshwater Nonindigenous Aquatic Species in U.S. Fish and Wildlife Service Region 4—An Update April 2013 Prepared by: Pam L. Fuller, Amy J. Benson, and Matthew J. Cannister U.S. Geological Survey Southeast Ecological Science Center Gainesville, Florida Prepared for: U.S. Fish and Wildlife Service Southeast Region Atlanta, Georgia Cover Photos: Silver Carp, Hypophthalmichthys molitrix – Auburn University Giant Applesnail, Pomacea maculata – David Knott Straightedge Crayfish, Procambarus hayi – U.S. Forest Service i Table of Contents Table of Contents ...................................................................................................................................... ii List of Figures ............................................................................................................................................ v List of Tables ............................................................................................................................................ vi INTRODUCTION ............................................................................................................................................. 1 Overview of Region 4 Introductions Since 2000 ....................................................................................... 1 Format of Species Accounts ...................................................................................................................... 2 Explanation of Maps ................................................................................................................................
    [Show full text]
  • CA SQL-Ease for DB2 for Z/OS User Guide
    CA SQL-Ease® for DB2 for z/OS User Guide Version 17.0.00, Second Edition This Documentation, which includes embedded help systems and electronically distributed materials, (hereinafter referred to as the “Documentation”) is for your informational purposes only and is subject to change or withdrawal by CA at any time. This Documentation is proprietary information of CA and may not be copied, transferred, reproduced, disclosed, modified or duplicated, in whole or in part, without the prior written consent of CA. If you are a licensed user of the software product(s) addressed in the Documentation, you may print or otherwise make available a reasonable number of copies of the Documentation for internal use by you and your employees in connection with that software, provided that all CA copyright notices and legends are affixed to each reproduced copy. The right to print or otherwise make available copies of the Documentation is limited to the period during which the applicable license for such software remains in full force and effect. Should the license terminate for any reason, it is your responsibility to certify in writing to CA that all copies and partial copies of the Documentation have been returned to CA or destroyed. TO THE EXTENT PERMITTED BY APPLICABLE LAW, CA PROVIDES THIS DOCUMENTATION “AS IS” WITHOUT WARRANTY OF ANY KIND, INCLUDING WITHOUT LIMITATION, ANY IMPLIED WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, OR NONINFRINGEMENT. IN NO EVENT WILL CA BE LIABLE TO YOU OR ANY THIRD PARTY FOR ANY LOSS OR DAMAGE, DIRECT OR INDIRECT, FROM THE USE OF THIS DOCUMENTATION, INCLUDING WITHOUT LIMITATION, LOST PROFITS, LOST INVESTMENT, BUSINESS INTERRUPTION, GOODWILL, OR LOST DATA, EVEN IF CA IS EXPRESSLY ADVISED IN ADVANCE OF THE POSSIBILITY OF SUCH LOSS OR DAMAGE.
    [Show full text]
  • Ontology Learning and Its Application to Automated Terminology Translation
    Natural Language Processing Ontology Learning and Its Application to Automated Terminology Translation Roberto Navigli and Paola Velardi, Università di Roma La Sapienza Aldo Gangemi, Institute of Cognitive Sciences and Technology lthough the IT community widely acknowledges the usefulness of domain ontolo- A gies, especially in relation to the Semantic Web,1,2 we must overcome several bar- The OntoLearn system riers before they become practical and useful tools. Thus far, only a few specific research for automated ontology environments have ontologies. (The “What Is an Ontology?” sidebar on page 24 provides learning extracts a definition and some background.) Many in the and is part of a more general ontology engineering computational-linguistics research community use architecture.4,5 Here, we describe the system and an relevant domain terms WordNet,3 but large-scale IT applications based on experiment in which we used a machine-learned it require heavy customization. tourism ontology to automatically translate multi- from a corpus of text, Thus, a critical issue is ontology construction— word terms from English to Italian. The method can identifying, defining, and entering concept defini- apply to other domains without manual adaptation. relates them to tions. In large, complex application domains, this task can be lengthy, costly, and controversial, because peo- OntoLearn architecture appropriate concepts in ple can have different points of view about the same Figure 1 shows the elements of the architecture. concept. Two main approaches aid large-scale ontol- Using the Ariosto language processor,6 OntoLearn a general-purpose ogy construction. The first one facilitates manual extracts terminology from a corpus of domain text, ontology engineering by providing natural language such as specialized Web sites and warehouses or doc- ontology, and detects processing tools, including editors, consistency uments exchanged among members of a virtual com- checkers, mediators to support shared decisions, and munity.
    [Show full text]
  • THE ECOLOGY of the ALLIGATOR SNAPPING TURTLE, Macrochelys Femminckii, in OKLAHOMA
    THE ECOLOGY OF THE ALLIGATOR SNAPPING TURTLE, Macrochelys temminckii, IN OKLAHOMA By JIMMY DAREN RIEDLE Bachelor ofScience Emporia State University Emporia, Kansas 1995 Submitted to the Faculty of the Graduate College ofthe Oklahoma State University in partial fulfillment of the requirements for the Degree of MASTER OF SCIENCE December 2001 THE ECOLOGY OF THE ALLIGATOR SNAPPING TURTLE, Macrochelys temminckii, IN OKLAHOMA ----~--=----,D:::::-~Gr~-ge---- II ACKNOWLEDGMENTS There are many individuals and organizations that I would like to thank for their support, encouragement, and motivation towards completing this project. I wish to express my deepest gratitude towards my major advisor, Dr. Stanley F. Fox for his guidance, advice and just giving me a chance. I also would like to thank my other committee members, Dr. David M. Leslie, Jr. and Or. Chuck C. Peterson for their guidance. I extend my deepest gratitude to myoId friend Paul Shipman for many fine field trips in many fine locales and for forcibly coercing me into taking over this project in the first place. I also want to thank Steve Beredzen at Sequoyah National Wildlife Refuge, Oklahoma Department of Wildlife Conservation, and the Oklahoma Cooperative Fish and Wildlife Research Unit for financial, and logistical support. I extend extra appreciation to Shane Kasson, Gary Williams, and the rest of the crew at equoyah National Wildlife Refuge for putting up with me on a daily basis and providing me with the occasional evening entertainment and free beer. I wish to express a special thanks to all the people, both paid employees and volunteers who helped me in the field during the course ofthis project.
    [Show full text]
  • Server New Features Webfocus Server Release 8207 Datamigrator Server Release 8207
    Server New Features WebFOCUS Server Release 8207 DataMigrator Server Release 8207 February 26, 2021 Active Technologies, FOCUS, Hyperstage, Information Builders, the Information Builders logo, iWay, iWay Software, Omni- Gen, Omni-HealthData, Parlay, RStat, Table Talk, WebFOCUS, WebFOCUS Active Technologies, and WebFOCUS Magnify are registered trademarks, and DataMigrator, and ibi are trademarks of Information Builders, Inc. Adobe, the Adobe logo, Acrobat, Adobe Reader, Flash, Adobe Flash Builder, Flex, and PostScript are either registered trademarks or trademarks of Adobe Systems Incorporated in the United States and/or other countries. Due to the nature of this material, this document refers to numerous hardware and software products by their trademarks. In most, if not all cases, these designations are claimed as trademarks or registered trademarks by their respective companies. It is not this publisher's intent to use any of these names generically. The reader is therefore cautioned to investigate all claimed trademark rights before using any of these names other than to refer to the product described. Copyright © 2021. TIBCO Software Inc. All Rights Reserved. Contents 1. Server Enhancements ........................................................ 11 Global WebFOCUS Server Monitoring Console ........................................... 11 Server Text Editor Supports Jump to Next or Previous Difference in Compare and Merge .......14 Upload Message Added in the Edaprint Log ............................................. 15 Displaying the
    [Show full text]