Improving Entity Linking by Modeling Latent Relations Between Mentions

Total Page:16

File Type:pdf, Size:1020Kb

Improving Entity Linking by Modeling Latent Relations Between Mentions Improving Entity Linking by Modeling Latent Relations between Mentions Phong Le1 and Ivan Titov1;2 1University of Edinburgh 2University of Amsterdam fple,[email protected] Abstract sides coreference, there are many other relations between entities which constrain or favor certain Entity linking involves aligning textual alignment configurations. For example, consider mentions of named entities to their corre- relation participant in in Figure 1: if “World Cup” sponding entries in a knowledge base. En- is aligned to the entity FIFA WORLD CUP then tity linking systems often exploit relations we expect the second “England” to refer to a foot- between textual mentions in a document ball team rather than a basketball one. (e.g., coreference) to decide if the linking NEL methods typically consider only corefer- decisions are compatible. Unlike previous ence, relying either on off-the-shelf systems or approaches, which relied on supervised some simple heuristics (Lazic et al., 2015), and systems or heuristics to predict these rela- exploit them in a pipeline fashion, though some tions, we treat relations as latent variables (e.g., Cheng and Roth (2013); Ren et al. (2017)) in our neural entity-linking model. We in- additionally exploit a range of syntactic-semantic duce the relations without any supervision relations such as apposition and possessives. An- while optimizing the entity-linking sys- other line of work ignores relations altogether and tem in an end-to-end fashion. Our multi- models the predicted sequence of KB entities as a relational model achieves the best reported bag (Globerson et al., 2016; Yamada et al., 2016; scores on the standard benchmark (AIDA- Ganea and Hofmann, 2017). Though they are able CoNLL) and substantially outperforms its to capture some degree of coherence (e.g., pref- relation-agnostic version. Its training also erence towards entities from the same general do- converges much faster, suggesting that the main) and are generally empirically successful, the injected structural bias helps to explain underlying assumption is too coarse. For example, regularities in the training data. they would favor assigning all the occurrences of “England” in Figure 1 to the same entity. 1 Introduction We hypothesize that relations useful for NEL Named entity linking (NEL) is the task of as- can be induced without (or only with little) domain signing entity mentions in a text to corresponding expertise. In order to prove this, we encode rela- entries in a knowledge base (KB). For example, tions as latent variables and induce them by opti- consider Figure 1 where a mention “World Cup” mizing the entity-linking model in an end-to-end refers to a KB entity FIFA WORLD CUP. NEL fashion. In this way, relations between mentions is often regarded as crucial for natural language in documents will be induced in such a way as to understanding and commonly used as preprocess- be beneficial for NEL. As with other recent ap- ing for tasks such as information extraction (Hoff- proaches to NEL (Yamada et al., 2017; Ganea and mann et al., 2011) and question answering (Yih Hofmann, 2017), we rely on representation learn- et al., 2015). ing and learn embeddings of mentions, contexts Potential assignments of mentions to entities are and relations. This further reduces the amount regulated by semantic and discourse constraints. of human expertise required to construct the sys- For example, the second and third occurrences of tem and, in principle, may make it more portable mention “England” in Figure 1 are coreferent and across languages and domains. thus should be assigned to the same entity. Be- Our multi-relational neural model achieves an 1595 Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Long Papers), pages 1595–1604 Melbourne, Australia, July 15 - 20, 2018. c 2018 Association for Computational Linguistics England England West_Germany England_national_ England_national Germany_national_ FIFA_World_Cup football_team _football_team football_team FIBA_Basketball_ England_national_ England_national Germany_national_ World_Cup basketball_team _basketball_team basketball_team ... ... ... ... World Cup 1966 was held in England …. England won… The final saw England beat West Germany . located_in coreference beat participant_in Figure 1: Example for NEL, linking each mention to an entity in a KB (e.g. “World Cup” to FIFA WORLD CUP rather than FIBA BASKETBALL WORLD CUP). Note that the first and the sec- ond “England” are in different relations to “World Cup”. improvement of 0.85% F1 over the best re- 2.2 Local and global models ported scores on the standard AIDA-CoNLL Local models rely only on local contexts of men- dataset (Ganea and Hofmann, 2017). Substan- tions and completely ignore interdependencies be- tial improvements over the relation-agnostic ver- tween the linking decisions in the document (these sion show that the induced relations are indeed interdependencies are usually referred to as coher- beneficial for NEL. Surprisingly its training also ence). Let ci be a local context of mention mi and converges much faster: training of the full model Ψ(ei; ci) be a local score function. A local model requires ten times shorter wall-clock time than then tackles the problem by searching for what is needed for estimating the simpler relation- ∗ agnostic version. This may suggest that the in- ei = arg max Ψ(ei; ci) (1) 2 jected structural bias helps to explain regularities ei Ci in the training data, making the optimization task for each i 2 f1; :::; ng (Bunescu and Pas¸ca, 2006; easier. We qualitatively examine induced rela- Lazic et al., 2015; Yamada et al., 2017). tions. Though we do not observe direct counter- A global model, besides using local context parts of linguistic relations, we, for example, see within Ψ(ei; ci), takes into account entity co- that some of the induced relations are closely re- herency. It is captured by a coherence score func- lated to coreference whereas others encode forms tion Φ(E; D): of semantic relatedness between the mentions. Xn ∗ E = arg max Ψ(ei; ci) + Φ(E; D) 2 × × E C1 ::: Cn i=1 2 Background and Related work where E = (e1; :::; en). The coherence score function, in the simplest form, is a sum over 2.1 Named entity linking problem all pairwise scores Φ(ei; ej;D) (Ratinov et al., 2011; Huang et al., 2015; Chisholm and Hachey, Formally, given a document D containing a list of 2015; Ganea et al., 2016; Guo and Barbosa, 2016; mentions m ; :::; m , an entity linker assigns to 1 n Globerson et al., 2016; Yamada et al., 2016), re- each m an KB entity e or predicts that there is no i i sulting in: corresponding entry in the KB (i.e., ei = NILL). Xn Because a KB can be very large, it is stan- ∗ E = arg max Ψ(ei; ci)+ dard to use an heuristic to choose potential can- E2C ×:::×Cn 1 Xi=1 didates, eliminating options which are highly un- Φ(e ; e ;D) (2) likely. This preprocessing step is called candidate i j i=6 j selection. The task of a statistical model is thus re- duced to choosing the best option among a smaller A disadvantage of global models is that exact list of candidates Ci = (ei1; :::; eili ). In what fol- decoding (Equation 2) is NP-hard (Wainwright lows, we will discuss two classes of approaches et al., 2008). Ganea and Hofmann (2017) over- tackling this problem: local and global modeling. come this using loopy belief propagation (LBP), 1596 an approximate inference method based on mes- At the other extreme, feature engineering is al- sage passing (Murphy et al., 1999). Globerson most completely replaced by representation learn- et al. (2016) propose a star model which approxi- ing. These approaches rely on pretrained embed- mates the decoding problem in Equation 2 by ap- dings of words (Mikolov et al., 2013; Penning- proximately decomposing it into n decoding prob- ton et al., 2014) and entities (He et al., 2013; Ya- lems, one per each ei. mada et al., 2017; Ganea and Hofmann, 2017) and often do not use virtually any other hand-crafted 2.3 Related work features. Ganea and Hofmann (2017) showed Our work focuses on modeling pairwise score that such an approach can yield SOTA accuracy functions Φ and is related to previous approaches on a standard benchmark (AIDA-CoNLL dataset). in the two following aspects. Their local and pairwise score functions are T Relations between mentions Ψ(ei; ci) = ei Bf(ci) A relation widely used by NEL systems is corefer- 1 T Φ(ei; ej;D) = − ei Rej (3) ence: two mentions are coreferent if they refer to n 1 the same entity. Though, as we discussed in Sec- e ; e 2 Rd tion 1, other linguistic relations constrain entity as- where i j are the embeddings of entity e ; e B; R 2 Rd×d signments, only a few approaches (e.g., Cheng and i j, are diagonal matrices. The f(c ) Roth (2013); Ren et al. (2017)), exploit any rela- mapping i applies an attention mechanism to c tions other than coreference. We believe that the context words in i to obtain a feature representa- f(c ) 2 Rd) reason for this is that predicting and selecting rel- tions of context ( i . evant (often semantic) relations is in itself a chal- Note that the global component (the pairwise lenging problem. scores) is agnostic to any relations between enti- ties or even to their ordering: it models e ; :::; e In Cheng and Roth (2013), relations between 1 n simply as a bag of entities. Our work is in line with mentions are extracted using a labor-intensive ap- Ganea and Hofmann (2017) in the sense that fea- proach, requiring a set of hand-crafted rules and a ture engineering plays no role in computing local KB containing relations between entities.
Recommended publications
  • Rugby World Cup Quiz
    Rugby World Cup Quiz Round 1: Stats 1. The first eight World Cups were won by only four different nations. Which of the champions have only won it once? 2. Which team holds the record for the most points scored in a single match? 3. Bryan Habana and Jonah Lomu share the record for the most tries in the final stages of Rugby World Cup Tournaments. How many tries did they each score? 4. Which team holds the record for the most tries in a single match? 5. In 2011, Welsh youngster George North became the youngest try scorer during Wales vs Namibia. How old was he? 6. There have been eight Rugby World Cups so far, not including 2019. How many have New Zealand won? 7. In 2003, Australia beat Namibia and also broke the record for the largest margin of victory in a World Cup. What was the score? Round 2: History 8. In 1985, eight rugby nations met in Paris to discuss holding a global rugby competition. Which two countries voted against having a Rugby World Cup? 9. Which teams co-hosted the first ever Rugby World Cup in 1987? 10. What is the official name of the Rugby World Cup trophy? 11. In the 1995 England vs New Zealand semi-final, what 6ft 5in, 19 stone problem faced the English defence for the first time? 12. Which song was banned by the Australian Rugby Union for the 2003 World Cup, but ended up being sang rather loudly anyway? 13. In 2003, after South Africa defeated Samoa, the two teams did something which touched people’s hearts around the world.
    [Show full text]
  • Predicting the Correct Entities in Named-Entity Linking
    Separating the Signal from the Noise: Predicting the Correct Entities in Named-Entity Linking Drew Perkins Uppsala University Department of Linguistics and Philology Master Programme in Language Technology Master’s Thesis in Language Technology, 30 ects credits June 9, 2020 Supervisors: Gongbo Tang, Uppsala University Thorsten Jacobs, Seavus Abstract In this study, I constructed a named-entity linking system that maps between contextual word embeddings and knowledge graph embeddings to predict correct entities. To establish a named-entity linking system, I rst applied named-entity recognition to identify the entities of interest. I then performed candidate gener- ation via locality sensitivity hashing (LSH), where a candidate group of potential entities were created for each identied entity. Afterwards, my named-entity dis- ambiguation component was performed to select the most probable candidate. By concatenating contextual word embeddings and knowledge graph embeddings in my disambiguation component, I present a novel approach to named-entity link- ing. I conducted the experiments with the Kensho-Derived Wikimedia Dataset and the AIDA CoNLL-YAGO Dataset; the former dataset was used for deployment and the later is a benchmark dataset for entity linking tasks. Three deep learning models were evaluated on the named-entity disambiguation component with dierent context embeddings. The evaluation was treated as a classication task, where I trained my models to select the correct entity from a list of candidates. By optimizing the named-entity linking through this methodology, this entire system can be used in recommendation engines with high F1 of 86% using the former dataset. With the benchmark dataset, the proposed method is able to achieve F1 of 79%.
    [Show full text]
  • FIFA and the FIRST WORLD CUP by 1900 Soccer Was Well on Its Way to World Domination
    THE WORLD CUP SOCCER’S GLOBAL CHAMPIONSHIP he World Cup is international Tsoccer’s championship tournament, and it rules the global sports stage. Award-winning author Matt Doeden explores the history of international soccer and covers the World Cup’s greatest moments, from the Save of the Century to Diego Maradona’s Hand of God goal to the United States Women’s National Team’s dominance. The most shocking goals, the greatest upsets, and the fun and fanfare of soccer’s biggest event are all here. REINFORCED BINDING Matt Doeden J MILLBROOK PRESS · MINNEAPOLIS Copyright © 2018 by Lerner Publishing Group, Inc. All rights reserved. International copyright secured. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means— electronic, mechanical, photocopying, recording, or otherwise—without the prior written permission of Lerner Publishing Group, Inc., except for the inclusion of brief quotations in an acknowledged review. Millbrook Press A division of Lerner Publishing Group, Inc. 241 First Avenue North Minneapolis, MN 55401 USA For reading levels and more information, look up this title at www.lernerbooks.com. Main body text set in Adobe Garamond Pro Regular 14/19. Typeface provided by Adobe Systems. Library of Congress Cataloging-in-Publication Data Names: Doeden, Matt. Title: The world cup : soccer’s global championship / By Matt Doeden. Description: Minneapolis : Millbrook Press, [2018] | Series: Spectacular sports | Includes bibliographical references and index. Identifiers: LCCN 2017009220 (print) | LCCN 2017014231 (ebook) | ISBN 9781512498684 (eb pdf) | ISBN 9781512427554 (lb : alk. paper) Subjects: LCSH: World Cup (Soccer)—Juvenile literature.
    [Show full text]
  • Mardy Fish Named Us Davis Cup Captain
    MARDY FISH NAMED U.S. DAVIS CUP CAPTAIN WHITE PLAINS, N.Y., January 9, 2019 – The USTA today announced that former world No. 7 and Davis Cup veteran Mardy Fish has been named the new captain of the U.S. Davis Cup Team. He succeeds Jim Courier to become the 41 st captain in the team’s 120-year history and will make his debut at the newly transformed Davis Cup by BNP Paribas Finals November 18-24 in Madrid, Spain. “Ever since I started playing professionally and started understanding what the Davis Cup was and how special it was, even as a player, I wanted to be the Davis Cup Captain,” Fish said. “I just thought that position was so special – leading the guys and leading the team, building relationships and the team aspect around it. I’m a team-sport athlete stuck in an individual sport, and I love the team aspect of Davis Cup. To even be considered, let alone named the Captain, is incredibly humbling.” In this new era of Davis Cup, the role of Captain will be expanded, with the position working more closely with USTA Player Development throughout the year, as well as traveling to multiple tournaments and camps to support American players, serving as a mentor for American pros and juniors. He will also ensure the U.S. Davis Cup team remains a strong platform to grow the game through the USTA’s Net Generation youth initiative. “Mardy Fish embodies all of the qualities of a successful Davis Cup Captain and will be an invaluable asset to Team USA,” said USTA Chairman of the Board and President Patrick Galbraith.
    [Show full text]
  • Three Birds (In the LLOD Cloud) with One Stone: Babelnet, Babelfy and the Wikipedia Bitaxonomy
    Three Birds (in the LLOD Cloud) with One Stone: BabelNet, Babelfy and the Wikipedia Bitaxonomy Tiziano Flati and Roberto Navigli Dipartimento di Informatica Sapienza Universit`adi Roma Abstract. In this paper we present the current status of linguistic re- sources published as linked data and linguistic services in the LLOD cloud in our research group, namely BabelNet, Babelfy and the Wikipedia Bitaxonomy. We describe them in terms of their salient aspects and ob- jectives and discuss the benefits that each of these potentially brings to the world of LLOD NLP-aware services. We also present public Web- based services which enable querying, exploring and exporting data into RDF format. 1 Introduction Recent years have witnessed an upsurge in the amount of semantic information published on the Web. Indeed, the Web of Data has been increasing steadily in both volume and variety, transforming the Web into a global database in which resources are linked across sites. It is becoming increasingly critical that existing lexical resources be published as Linked Open Data (LOD), so as to foster integration, interoperability and reuse on the Semantic Web [5]. Thus, lexical resources provided in RDF format can contribute to the creation of the so-called Linguistic Linked Open Data (LLOD), a vision fostered by the Open Linguistic Working Group (OWLG), in which part of the Linked Open Data cloud is made up of interlinked linguistic resources [2]. The multilinguality aspect is key to this vision, in that it enables Natural Language Processing tasks which are not only cross-lingual, but also independent both of the language of the user input and of the linked data exploited to perform the task.
    [Show full text]
  • Cross Lingual Entity Linking with Bilingual Topic Model
    Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Cross Lingual Entity Linking with Bilingual Topic Model Tao Zhang, Kang Liu and Jun Zhao Institute of Automation, Chinese Academy of Sciences HaiDian District, Beijing, China {tzhang, kliu, jzhao}@nlpr.ia.ac.cn Abstract billion web pages on Internet and 31% of them are written in other languages. This report tells us that the percentage of Cross lingual entity linking means linking an entity web pages written in English are decreasing, which indicates mention in a background source document in one that English pages are becoming less dominant compared language with the corresponding real world entity in with before. Therefore, it becomes more and more important a knowledge base written in the other language. The to maintain the growing knowledge base by discovering and key problem is to measure the similarity score extracting information from all web contents written in between the context of the entity mention and the different languages. Also, inserting new extracted knowledge document of the candidate entity. This paper derived from the information extraction system into a presents a general framework for doing cross knowledge base inevitably needs a system to map the entity lingual entity linking by leveraging a large scale and mentions in one language to their entries in a knowledge base bilingual knowledge base, Wikipedia. We introduce which may be another language. This task is known as a bilingual topic model that mining bilingual topic cross-lingual entity linking. from this knowledge base with the assumption that Intuitively, to resolve the cross-lingual entity linking the same Wikipedia concept documents of two problem, the key is how to define the similarity score different languages share the same semantic topic between the context of entity mention and the document of distribution.
    [Show full text]
  • Babelfying the Multilingual Web: State-Of-The-Art Disambiguation and Entity Linking of Web Pages in 50 Languages
    Babelfying the Multilingual Web: state-of-the-art Disambiguation and Entity Linking of Web pages in 50 languages Roberto Navigli [email protected] http://lcl.uniroma1.it! ERC StG: MultilingualERC Joint Word Starting Sense Disambiguation Grant (MultiJEDI) MultiJEDI No. 259234 1! Roberto Navigli! LIDER EU Project No. 610782! Joint work with Andrea Moro Alessandro Raganato A. Moro, A. Raganato, R. Navigli. Entity Linking meets Word Sense Disambiguation: a Unified Approach. Transactions of the Association for Computational Linguistics (TACL), 2, 2014. Babelfying the Multilingual Web: state-of-the-art Disambiguation and Entity Linking Andrea Moro and Roberto Navigli 2014.05.08 BabelNet & friends3 Roberto Navigli 2014.05.08 BabelNet & friends4 Roberto Navigli Motivation • Domain-specific Web content is available in many languages • Information should be extracted and processed independently of the source/target language • This could be done automatically by means of high- performance multilingual text understanding Babelfying the Multilingual Web: state-of-the-art Disambiguation and Entity Linking Andrea Moro, Alessandro Raganato and Roberto Navigli BabelNet (http://babelnet.org) A wide-coverage multilingual semantic network and encyclopedic dictionary in 50 languages! Named Entities and specialized concepts Concepts from WordNet from Wikipedia Concepts integrated from both resources Babelfying the Multilingual Web: state-of-the-art Disambiguation and Entity Linking Andrea Moro, Alessandro Raganato and Roberto Navigli BabelNet (http://babelnet.org)
    [Show full text]
  • Named Entity Extraction for Knowledge Graphs: a Literature Overview
    Received December 12, 2019, accepted January 7, 2020, date of publication February 14, 2020, date of current version February 25, 2020. Digital Object Identifier 10.1109/ACCESS.2020.2973928 Named Entity Extraction for Knowledge Graphs: A Literature Overview TAREQ AL-MOSLMI , MARC GALLOFRÉ OCAÑA , ANDREAS L. OPDAHL, AND CSABA VERES Department of Information Science and Media Studies, University of Bergen, 5007 Bergen, Norway Corresponding author: Tareq Al-Moslmi ([email protected]) This work was supported by the News Angler Project through the Norwegian Research Council under Project 275872. ABSTRACT An enormous amount of digital information is expressed as natural-language (NL) text that is not easily processable by computers. Knowledge Graphs (KG) offer a widely used format for representing information in computer-processable form. Natural Language Processing (NLP) is therefore needed for mining (or lifting) knowledge graphs from NL texts. A central part of the problem is to extract the named entities in the text. The paper presents an overview of recent advances in this area, covering: Named Entity Recognition (NER), Named Entity Disambiguation (NED), and Named Entity Linking (NEL). We comment that many approaches to NED and NEL are based on older approaches to NER and need to leverage the outputs of state-of-the-art NER systems. There is also a need for standard methods to evaluate and compare named-entity extraction approaches. We observe that NEL has recently moved from being stepwise and isolated into an integrated process along two dimensions: the first is that previously sequential steps are now being integrated into end-to-end processes, and the second is that entities that were previously analysed in isolation are now being lifted in each other's context.
    [Show full text]
  • Entity Linking
    Entity Linking Kjetil Møkkelgjerd Master of Science in Computer Science Submission date: June 2017 Supervisor: Herindrasana Ramampiaro, IDI Norwegian University of Science and Technology Department of Computer Science Abstract Entity linking may be of help to quickly supplement a reader with further information about entities within a text by providing links to a knowledge base containing more information about each entity, and may thus potentially enhance the reading experience. In this thesis, we look at existing solutions, and implement our own deterministic entity linking system based on our research, using our own approach to the problem. Our system extracts all entities within the input text, and then disam- biguates each entity considering the context. The extraction step is han- dled by an external framework, while the disambiguation step focuses on entity-similarities, where similarity is defined by how similar the entities’ categories are, which we measure by using data from a structured knowledge base called DBpedia. We base this approach on the assumption that simi- lar entities usually occur close to each other in written text, thus we select entities that appears to be similar to other nearby entities. Experiments show that our implementation is not as effective as some of the existing systems we use for reference, and that our approach has some weaknesses which should be addressed. For example, DBpedia is not an as consistent knowledge base as we would like, and the entity extraction framework often fail to reproduce the same entity set as the dataset we use for evaluation. However, our solution show promising results in many cases.
    [Show full text]
  • Named-Entity Recognition and Linking with a Wikipedia-Based Knowledge Base Bachelor Thesis Presentation Niklas Baumert [[email protected]]
    Named-Entity Recognition and Linking with a Wikipedia-based Knowledge Base Bachelor Thesis Presentation Niklas Baumert [[email protected]] 2018-11-08 Contents 1 Why and What?—Motivation. 1.1 What is named-entity recognition? 1.2 What is entity linking? 2 How?—Implementation. 2.1 Wikipedia-backed Knowledge Base. 2.2 Entity Recognizer & Linker. 3 Performance Evaluation. 4 Conclusion. 2 1 Why and What—Motivation. 3 Built to provide input files for QLever. ● Specialized use-case. ○ General application as an extra feature. ● Providing a wordsfile and a docsfile as output. ● Wordsfile is input for QLever. ● Support for Wikipedia page titles, Freebase IDs and Wikidata IDs. ○ Convert between those 3 freely afterwards. 4 The docsfile is a list of all sentences with IDs. record id text 1 Anarchism is a political philosophy that advocates [...] 2 These are often described as [...] 5 The wordsfile contains each word of each sentence and determines entities and their likeliness. word Entity? recordID score Anarchism 0 1 1 <Anarchism> 1 1 0.9727 is 0 1 1 a 0 1 1 political 0 1 1 philosophy 0 1 1 <Philosophy> 1 1 0.955 6 Named-entity recognition (NER) identifies nouns in a given text. ● Finding and categorizing entities in a given text. [1][2] ● (Proper) nouns refer to entities. ● Problem: Words are ambiguous. [1] ○ Same word, different entities within a category—”John F. Kennedy”, the president or his son? ○ Same word, different entities in different categories—”JFK”, person or airport? 7 Entity linking (EL) links entities to a database. ● Resolve ambiguity by associating mentions to entities in a knowledge base.
    [Show full text]
  • KORE 50^DYWC: an Evaluation Data Set for Entity Linking Based On
    Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pages 2389–2395 Marseille, 11–16 May 2020 c European Language Resources Association (ELRA), licensed under CC-BY-NC KORE 50DYWC: An Evaluation Data Set for Entity Linking Based on DBpedia, YAGO, Wikidata, and Crunchbase Kristian Noullet, Rico Mix, Michael Farber¨ Karlsruhe Institute of Technology (KIT) [email protected], [email protected], [email protected] Abstract A major domain of research in natural language processing is named entity recognition and disambiguation (NERD). One of the main ways of attempting to achieve this goal is through use of Semantic Web technologies and its structured data formats. Due to the nature of structured data, information can be extracted more easily, therewith allowing for the creation of knowledge graphs. In order to properly evaluate a NERD system, gold standard data sets are required. A plethora of different evaluation data sets exists, mostly relying on either Wikipedia or DBpedia. Therefore, we have extended a widely-used gold standard data set, KORE 50, to not only accommodate NERD tasks for DBpedia, but also for YAGO, Wikidata and Crunchbase. As such, our data set, KORE 50DYWC, allows for a broader spectrum of evaluation. Among others, the knowledge graph agnosticity of NERD systems may be evaluated which, to the best of our knowledge, was not possible until now for this number of knowledge graphs. Keywords: Data Sets, Entity Linking, Knowledge Graph, Text Annotation, NLP Interchange Format 1. Introduction Therefore, annotation methods can only be evaluated Automatic extraction of semantic information from texts is on a small part of the KG concerning a specific do- an open problem in natural language processing (NLP).
    [Show full text]
  • This Content Is a Part of a Full Book - Tennis for Students of Medical University - Sofia
    THIS CONTENT IS A PART OF A FULL BOOK - TENNIS FOR STUDENTS OF MEDICAL UNIVERSITY - SOFIA https://polis-publishers.com/kniga/tenis-rukovodstvo-za-studenti/ A brief history of tennis 1. Origin and development of the game Ball games were popular in ancient Rome and Greece under the name of a spherical "ball game". In the 11 th and 12 th centuries, the games of "Jacko del Palone" and "Jaco de la Corda" were mentioned, which resembled modern tennis (Todorov 2010; Mashka, Shaffarjik 1989). In the 14 th century, outdoor and indoor courts started to be built in France, where the game of "court – tennis" was played, which was later renamed "palm game", that, dependent on being played inside or outside, was called "short tennis" and " long tennis" (Penchev 1989). Antonio Scaino da Salò’s book "Treatise on the game of the ball" (1555) describes the game instruments - a racquet and a ball - a tight ball of wool, wrapped in leather. It was struck with the palm of the hand, wrapped in a leather belt and a wooden case. A glove was used as well to protect against pain and traumas (Mashka, Shaffarjik 1989). The "court - tennis" game became popular predominantly amongst the nobles in Europe under the name of "Jeu de paume" – palm game, played both indoors and outdoors. “Mirabo” hall, part of the famous Versailles Palace that was transformed at that time by the Sun King - Louis XIV into a main residence of the French kings, exists to this day and had served for that purpose. Fig. 1.
    [Show full text]