8 Knowledge Base Curation 155 8.1 KB Quality Assessment

Total Page:16

File Type:pdf, Size:1020Kb

8 Knowledge Base Curation 155 8.1 KB Quality Assessment Submitted to Foundations and Trends in Databases Machine Knowledge: Creation and Curation of Comprehensive Knowledge Bases Gerhard Weikum Xin Luna Dong Max Planck Institute for Informatics Amazon [email protected] [email protected] Simon Razniewski Fabian Suchanek Max Planck Institute for Informatics Telecom Paris University [email protected] [email protected] August 23, 2021 Abstract Equipping machines with comprehensive knowledge of the world’s entities and their relationships has been a long-standing goal of AI. Over the last decade, large-scale knowledge bases, also known as knowledge graphs, have been automatically constructed from web contents and text sources, and have become a key asset for search engines. This machine knowledge can be harnessed to semantically interpret textual phrases in news, social media and web tables, and contributes to question answering, natural language processing and data analytics. This article surveys fundamental concepts and practical methods for creating and curating large knowledge bases. It covers models and methods for discovering and canon- icalizing entities and their semantic types and organizing them into clean taxonomies. On top of this, the article discusses the automatic extraction of entity-centric properties. To support the long-term life-cycle and the quality assurance of machine knowledge, the article presents methods for constructing open schemas and for knowledge curation. Case studies on academic projects and industrial knowledge graphs complement the survey of concepts and methods. arXiv:2009.11564v2 [cs.AI] 22 Mar 2021 1 Submitted to Foundations and Trends in Databases Contents 1 What Is This All About7 1.1 Motivation . 7 1.2 History of Large-Scale Knowledge Bases . 8 1.3 Application Use Cases . 9 1.4 Scope and Tasks . 12 1.4.1 What is a Knowledge Base . 12 1.4.2 Tasks for KB Creation and Curation . 13 1.5 Audience . 15 1.6 Outline . 17 2 Foundations and Architecture 18 2.1 Knowledge Representation . 18 2.1.1 Entities . 18 2.1.2 Classes (aka Types) . 20 2.1.3 Properties of Entities: Attributes and Relations . 22 2.1.3.1 Attributes of Entities . 23 2.1.3.2 Relations between Entities . 24 2.1.3.3 Higher-Arity Relations . 25 2.1.4 Canonicalization . 27 2.1.5 Logical Invariants . 28 2.2 Design Space for Knowledge Base Construction . 28 2.2.1 Requirements . 29 2.2.2 Input Sources . 33 2.2.3 Output Scope and Quality . 34 2.2.4 Methodological Repertoire . 35 3 Knowledge Integration from Premium Sources 37 3.1 The Case for Premium Sources . 37 3.2 Category Cleaning . 39 3.3 Alignment between Knowledge Sources . 43 3.4 Graph-based Alignment . 45 3.5 Discussion . 46 3.6 Take-Home Lessons . 48 2 Submitted to Foundations and Trends in Databases 4 KB Construction: Entity Discovery and Typing 49 4.1 Problem and Design Space . 49 4.2 Dictionary-based Entity Spotting . 50 4.3 Pattern-based Methods . 52 4.3.1 Patterns for Entity-Type Pairs . 52 4.3.2 Pattern Learning . 53 4.4 Machine Learning for Sequence Labeling . 59 4.4.1 Probabilistic Graphical Models . 59 4.4.2 Deep Neural Networks . 63 4.5 Word and Entity Embeddings . 65 4.6 Ab-Initio Taxonomy Construction . 68 4.7 Take-Home Lessons . 72 5 Entity Canonicalization 74 5.1 Problem and Design Space . 74 5.1.1 Entity Linking (EL) . 74 5.1.2 EL with Out-of-KB Entities . 76 5.1.3 Coreference Resolution (CR) . 77 5.1.4 Entity Matching (EM) . 79 5.2 Popularity, Similarity and Coherence Measures . 80 5.2.1 Mention-Entity Popularity . 80 5.2.2 Mention-Entity Context Similarity . 81 5.2.3 Entity-Entity Coherence . 84 5.3 Optimization and Learning-to-Rank Methods . 86 5.4 Collective Methods based on Graphs . 87 5.4.1 Edge-based EL Methods . 88 5.4.2 Subgraph-based EL Methods . 88 5.4.3 EL Methods based on Random Walks . 90 5.4.4 EL Methods based on Probabilistic Graphical Models . 91 5.5 Methods based on Supervised Learning . 94 5.5.1 Neural EL . 94 5.6 Methods for Semi-Structured Data . 96 5.7 EL Variations and Extensions . 98 5.8 Entity Matching (EM) . 99 5.8.1 Blocking . 101 5.8.2 Matching Rules . 102 5.8.3 EM Workflow . 103 5.9 Take-Home Lessons . 104 3 Submitted to Foundations and Trends in Databases 6 KB Construction: Attributes and Relationships 106 6.1 Problem and Design Space . 106 6.2 Pattern-based and Rule-based Extraction . 108 6.2.1 Specified Patterns and Rules . 108 6.2.1.1 Regex Patterns . 109 6.2.1.2 Inducing Regex Patterns from Examples . 110 6.2.1.3 Type Checking . 110 6.2.1.4 Operator-based Extraction Plans . 110 6.2.1.5 Patterns for Text, Lists and Trees . 112 6.2.2 Distantly Supervised Pattern Learning . 115 6.2.2.1 Statement-Pattern Duality . 115 6.2.2.2 Quality Measures . 117 6.2.2.3 Scale and Scope . 117 6.2.2.4 Feature-based Classifiers . 119 6.2.2.5 Source Discovery . 120 6.3 Extraction from Semi-Structured Contents . 122 6.3.1 Distantly Supervised Extraction from DOM Trees . 124 6.3.2 Web Tables and Microformats . 126 6.4 Neural Extraction . 127 6.4.1 Classifiers and Taggers . 127 6.4.2 Training by Distant Supervision . 128 6.4.3 Transformer Networks . 130 6.5 Take-Home Lessons . 131 7 Open Schema Construction 132 7.1 Problems and Design Space . 132 7.2 Open Information Extraction for Predicate Discovery . 133 7.2.1 Pattern-based Open IE . 135 7.2.2 Extensions of Pattern-based Open IE . 136 7.2.3 Neural Learning for Open IE . 138 7.3 Seed-based Discovery of Properties . 139 7.3.1 Distant Supervision for Open IE from Text . 140 7.3.2 Open IE from Semi-Structured Web Sites . 142 7.3.3 View Discovery in the KB . 144 7.4 Property Canonicalization . 145 7.4.1 Paraphrase Dictionaries . 145 7.4.1.1 Property Synonymy . 146 7.4.1.2 Property Subsumption . 148 7.4.2 Clustering of Property Names . 149 4 Submitted to Foundations and Trends in Databases 7.4.3 Latent Organization of Properties . 151 7.5 Take-Home Lessons . 154 8 Knowledge Base Curation 155 8.1 KB Quality Assessment . 155 8.1.1 Quality Metrics . 155 8.1.2 Quality Evaluation . 156 8.1.3 Quality Prediction . 157 8.2 Knowledge Base Completeness . 160 8.2.1 Local Completeness . 160 8.2.2 Negative Statements . 162 8.3 Rules and Constraints . ..
Recommended publications
  • PERFORMED IDENTITIES: HEAVY METAL MUSICIANS BETWEEN 1984 and 1991 Bradley C. Klypchak a Dissertation Submitted to the Graduate
    PERFORMED IDENTITIES: HEAVY METAL MUSICIANS BETWEEN 1984 AND 1991 Bradley C. Klypchak A Dissertation Submitted to the Graduate College of Bowling Green State University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 2007 Committee: Dr. Jeffrey A. Brown, Advisor Dr. John Makay Graduate Faculty Representative Dr. Ron E. Shields Dr. Don McQuarie © 2007 Bradley C. Klypchak All Rights Reserved iii ABSTRACT Dr. Jeffrey A. Brown, Advisor Between 1984 and 1991, heavy metal became one of the most publicly popular and commercially successful rock music subgenres. The focus of this dissertation is to explore the following research questions: How did the subculture of heavy metal music between 1984 and 1991 evolve and what meanings can be derived from this ongoing process? How did the contextual circumstances surrounding heavy metal music during this period impact the performative choices exhibited by artists, and from a position of retrospection, what lasting significance does this particular era of heavy metal merit today? A textual analysis of metal- related materials fostered the development of themes relating to the selective choices made and performances enacted by metal artists. These themes were then considered in terms of gender, sexuality, race, and age constructions as well as the ongoing negotiations of the metal artist within multiple performative realms. Occurring at the juncture of art and commerce, heavy metal music is a purposeful construction. Metal musicians made performative choices for serving particular aims, be it fame, wealth, or art. These same individuals worked within a greater system of influence. Metal bands were the contracted employees of record labels whose own corporate aims needed to be recognized.
    [Show full text]
  • Only Imagine
    I CAN ONLY IMAGINE I CAN A Memoir ONLY IMAGINE Bart Millard a memoir With Robert Noland Bart Millard with robert noland Imagine_A.indd 7 10/3/17 10:11 AM Appendix 1 YOUR IDENTITY IN CHRIST My mentor, Rusty Kennedy, was integral in discipling me in my walk with Christ. He gave me these seventy- five verses and state- ments while I was unpacking my past and starting to understand who I truly am in Jesus. Ever since then, I have carried these close to my heart. I pray these will minister to you the way they have to me so that you, too, can understand that in Christ, you are free indeed! 1. John 1:12—I am a child of God. 2. John 15:1–5—I am a part of the true vine, a channel (branch) of His life. 3. John 15:15—I am Christ’s friend. 4. John 15:16—I am chosen and appointed by Christ to bear His fruit. 5. Acts 1:8—I am a personal witness of Christ for Christ. 6. Romans 3:24—I have been justified and redeemed. 7. Romans 5:1—I have been justified (completely forgiven and made righteous) and am at peace with God. 8. Romans 6:1–6—I died with Christ and died to the power of sin’s rule in my life. APPENDIX 1 9. Romans 6:7—I have been freed from sin’s power over me. 10. Romans 6:18—I am a slave of righteousness. 11. Romans 6:22—I am enslaved to God.
    [Show full text]
  • Copyrighted Material
    Index Abulfeda crater chain (Moon), 97 Aphrodite Terra (Venus), 142, 143, 144, 145, 146 Acheron Fossae (Mars), 165 Apohele asteroids, 353–354 Achilles asteroids, 351 Apollinaris Patera (Mars), 168 achondrite meteorites, 360 Apollo asteroids, 346, 353, 354, 361, 371 Acidalia Planitia (Mars), 164 Apollo program, 86, 96, 97, 101, 102, 108–109, 110, 361 Adams, John Couch, 298 Apollo 8, 96 Adonis, 371 Apollo 11, 94, 110 Adrastea, 238, 241 Apollo 12, 96, 110 Aegaeon, 263 Apollo 14, 93, 110 Africa, 63, 73, 143 Apollo 15, 100, 103, 104, 110 Akatsuki spacecraft (see Venus Climate Orbiter) Apollo 16, 59, 96, 102, 103, 110 Akna Montes (Venus), 142 Apollo 17, 95, 99, 100, 102, 103, 110 Alabama, 62 Apollodorus crater (Mercury), 127 Alba Patera (Mars), 167 Apollo Lunar Surface Experiments Package (ALSEP), 110 Aldrin, Edwin (Buzz), 94 Apophis, 354, 355 Alexandria, 69 Appalachian mountains (Earth), 74, 270 Alfvén, Hannes, 35 Aqua, 56 Alfvén waves, 35–36, 43, 49 Arabia Terra (Mars), 177, 191, 200 Algeria, 358 arachnoids (see Venus) ALH 84001, 201, 204–205 Archimedes crater (Moon), 93, 106 Allan Hills, 109, 201 Arctic, 62, 67, 84, 186, 229 Allende meteorite, 359, 360 Arden Corona (Miranda), 291 Allen Telescope Array, 409 Arecibo Observatory, 114, 144, 341, 379, 380, 408, 409 Alpha Regio (Venus), 144, 148, 149 Ares Vallis (Mars), 179, 180, 199 Alphonsus crater (Moon), 99, 102 Argentina, 408 Alps (Moon), 93 Argyre Basin (Mars), 161, 162, 163, 166, 186 Amalthea, 236–237, 238, 239, 241 Ariadaeus Rille (Moon), 100, 102 Amazonis Planitia (Mars), 161 COPYRIGHTED
    [Show full text]
  • 2000 Terrapin Invitational Tournament - Division 1 Round 9 Questions by John Nam
    2000 Terrapin Invitational Tournament - Division 1 Round 9 Questions by John Nam Tossups 1. It tells of life at 27 rue de Fleurus, which became a hub of artistic discussion in the early 1900s. Featuring real-life encounters of its author and its narrator with artists from Picasso to Hemingway, this 1932 work was its author's first critical and commercial success. FTP, name this work, which despite its title mainly concerns the life story of its author, Gertrude Stein. Ans: The Autobiography ofAlice B. Toklas 2. First formed in August 1965, their original lineup included Bob Harvey and Jerry Pelonquin, who were soon initially replaced by Alexander Skip Spence and Signe Anderson, then by Spencer Dryden and Jack Casady. Their debut album was moderately successful, a former member of Great Society joined in 1966 and the subsequent release of Surrealistic Pillow that they took off. FTP, name this seminal band whose most famous members were Jorma Kaukonen, Marty Balin, Paul Kantner and Grace Slick. Ans: Jefferson Airplane 3. While studying for the bar in New Salem, Illinois, he began courting the fiancee of John McNamar, who had gone to get his parents in New York. McNamar's fiancee agreed to marry him ifshe could get an "honorable release" from her engagement, but got milk fever and died before McNamar's return in the fall of 1835. Such is the tale of the first love of this man for Ann Rutledge, mildly ironic considering his future reputation for honesty. FTP, name this assassination victim and u.S. statesman. Ans: Abraham Lincoln 4.
    [Show full text]
  • Shallow Crustal Composition of Mercury As Revealed by Spectral Properties and Geological Units of Two Impact Craters
    Planetary and Space Science 119 (2015) 250–263 Contents lists available at ScienceDirect Planetary and Space Science journal homepage: www.elsevier.com/locate/pss Shallow crustal composition of Mercury as revealed by spectral properties and geological units of two impact craters Piero D’Incecco a,n, Jörn Helbert a, Mario D’Amore a, Alessandro Maturilli a, James W. Head b, Rachel L. Klima c, Noam R. Izenberg c, William E. McClintock d, Harald Hiesinger e, Sabrina Ferrari a a Institute of Planetary Research, German Aerospace Center, Rutherfordstrasse 2, D-12489 Berlin, Germany b Department of Geological Sciences, Brown University, Providence, RI 02912, USA c The Johns Hopkins University Applied Physics Laboratory, Laurel, MD 20723, USA d Laboratory for Atmospheric and Space Physics, University of Colorado, Boulder, CO 80303, USA e Westfälische Wilhelms-Universität Münster, Institut für Planetologie, Wilhelm-Klemm Str. 10, D-48149 Münster, Germany article info abstract Article history: We have performed a combined geological and spectral analysis of two impact craters on Mercury: the Received 5 March 2015 15 km diameter Waters crater (106°W; 9°S) and the 62.3 km diameter Kuiper crater (30°W; 11°S). Using Received in revised form the Mercury Dual Imaging System (MDIS) Narrow Angle Camera (NAC) dataset we defined and mapped 9 October 2015 several units for each crater and for an external reference area far from any impact related deposits. For Accepted 12 October 2015 each of these units we extracted all spectra from the MESSENGER Atmosphere and Surface Composition Available online 24 October 2015 Spectrometer (MASCS) Visible-InfraRed Spectrograph (VIRS) applying a first order photometric correc- Keywords: tion.
    [Show full text]
  • Harry Manx Et Les Blues De Bouddha
    J472, & BLUES HarryManx et les blues de Bouddha SERGE TRUFFAUT les injustices.nes'y est pas trompé: Mank est un artiste ptit J e tout nouveau, dernier t{AHqY très singulier. oII m'a art|elé i,l I-ralbum confectionnépar rji;{,X y a deuxans de celapour que Harry Manx, le seul trouba- je \arti.cibe à I'enregistrement dour du blues mâtiné de réso- de son album. Il est tellement nancesindiennes, celles du aimable,facile à aiare, que sous.-continentindien et non c'était commesi nousétions de des Etats-Unis,s'appelle Bread uieux amis. Nous aaonsIa And Buddha.Du pain et Boud- mêmesensibi.lité musi.cale. Lui. dha, et non du pain et des jeux. s'habi,llecolnme un Indien, Arrêtons-nousau tife, puisquî moi.je joue cornmeun Indi.en. a valeur symboliqueautant que gentillessei.ncarnée. II me Toujours est-il qu'après coup, biographique. croyaitpas. (rires)" il m'a demandéde fai.re une Il était une fois un Britan- Bon. Questionblues, si I'on tournéeaaec lui. Ce qu'il y a nique, donc un gentleman,né peut dire les chosesainsi, d'ertraordinaire, c'estqu'on sur I'Ile de Man. où les chats Manx en connaissaitl'ortho- joue dans de grandessalles se distinguent de tous les graphe, la grammaire, la syn- contn e le CarnegieHall, à autres par leur petite queue. taxe. Et comme il est un hom- New York, ou The GreatAm.e- l.es hasardsde I'histoire com- me très singulier, un homme rican Music-Hall, à San Fran- me ceux du destin étant ce qui ne veut pas être le clone cisco.Cette association a beau- qu'ils sont.
    [Show full text]
  • Noel Paul Stookey Bio
    Noel Paul Stookey Bio Singer/songwriter Noel Paul Stookey has been altering both the musical and ethical landscape of this country and the world for decades—both as the “Paul” of the legendary Peter, Paul and Mary and as an independent musician who passionately belie es in bringing the spiritual into the practice of daily life! "unny, irre erently re erent, thoughtful, compassionately passionate, Stookey’s oice is known all across this land: from the “%edding Song” to “&n 'hese 'imes.” Noel and Betty, his bride of over )* years +but who’s counting,, moved with their three daughters to the coast of maine over forty years ago. since that time he’s done the occasional home town benefit and the here-and-there in-state show but ne er had undertaken a concentrated tour in one season. “& recorded all . of my maine concerts last summer - from ogun/uit to eastport - check '01' out on a map” he says, “and, though not all of the ideo or audio made 2prime time2, i was able to collect *3 music ideos for the 4VD and +amazingly, fit all *3 songs on the 7D to create this 1' 08ME: the maine tour package.” 'he songs in this newest release represent a broad range: '0E 71(&N "959: %1;'< +homage to the rigors of enduring the lengthened winter in maine,, a new ersion of %01'S09:NAM9 +a bittersweet =az6 shaded reminiscence of a middle-aged man in denial, originally recorded on the PP&M ?@AA album, "1M&;&1 49; 78:1<8N +a new song that speaks to the immigration issue in compassionate rather than political terms,, %944&NB S8NG +with the 2original2 lyric and a spoken introduction - DVD only, and '0E ;1DY S1YS S09 48N2' ;&KE E1<< +a commentary on a common misperception of the creati e process,.
    [Show full text]
  • Formal Concept Analysis in Knowledge Processing: a Survey on Applications ⇑ Jonas Poelmans A,C, , Dmitry I
    Expert Systems with Applications 40 (2013) 6538–6560 Contents lists available at SciVerse ScienceDirect Expert Systems with Applications journal homepage: www.elsevier.com/locate/eswa Review Formal concept analysis in knowledge processing: A survey on applications ⇑ Jonas Poelmans a,c, , Dmitry I. Ignatov c, Sergei O. Kuznetsov c, Guido Dedene a,b a KU Leuven, Faculty of Business and Economics, Naamsestraat 69, 3000 Leuven, Belgium b Universiteit van Amsterdam Business School, Roetersstraat 11, 1018 WB Amsterdam, The Netherlands c National Research University Higher School of Economics, Pokrovsky boulevard, 11, 109028 Moscow, Russia article info abstract Keywords: This is the second part of a large survey paper in which we analyze recent literature on Formal Concept Formal concept analysis (FCA) Analysis (FCA) and some closely related disciplines using FCA. We collected 1072 papers published Knowledge discovery in databases between 2003 and 2011 mentioning terms related to Formal Concept Analysis in the title, abstract and Text mining keywords. We developed a knowledge browsing environment to support our literature analysis process. Applications We use the visualization capabilities of FCA to explore the literature, to discover and conceptually repre- Systematic literature overview sent the main research topics in the FCA community. In this second part, we zoom in on and give an extensive overview of the papers published between 2003 and 2011 which applied FCA-based methods for knowledge discovery and ontology engineering in various application domains. These domains include software mining, web analytics, medicine, biology and chemistry data. Ó 2013 Elsevier Ltd. All rights reserved. 1. Introduction mining, i.e. gaining insight into source code with FCA.
    [Show full text]
  • Concept Maps Mining for Text Summarization
    UNIVERSIDADE FEDERAL DO ESPÍRITO SANTO CENTRO TECNOLÓGICO DEPARTAMENTO DE INFORMÁTICA PROGRAMA DE PÓS-GRADUAÇÃO EM INFORMÁTICA Camila Zacché de Aguiar Concept Maps Mining for Text Summarization VITÓRIA-ES, BRAZIL March 2017 Camila Zacché de Aguiar Concept Maps Mining for Text Summarization Dissertação de Mestrado apresentada ao Programa de Pós-Graduação em Informática da Universidade Federal do Espírito Santo, como requisito parcial para obtenção do Grau de Mestre em Informática. Orientador (a): Davidson CurY Co-orientador: Amal Zouaq VITÓRIA-ES, BRAZIL March 2017 Dados Internacionais de Catalogação-na-publicação (CIP) (Biblioteca Setorial Tecnológica, Universidade Federal do Espírito Santo, ES, Brasil) Aguiar, Camila Zacché de, 1987- A282c Concept maps mining for text summarization / Camila Zacché de Aguiar. – 2017. 149 f. : il. Orientador: Davidson Cury. Coorientador: Amal Zouaq. Dissertação (Mestrado em Informática) – Universidade Federal do Espírito Santo, Centro Tecnológico. 1. Informática na educação. 2. Processamento de linguagem natural (Computação). 3. Recuperação da informação. 4. Mapas conceituais. 5. Sumarização de Textos. 6. Mineração de dados (Computação). I. Cury, Davidson. II. Zouaq, Amal. III. Universidade Federal do Espírito Santo. Centro Tecnológico. IV. Título. CDU: 004 “Most of the fundamental ideas of science are essentially simple, and may, as a rule, be expressed in a language comprehensible to everyone.” Albert Einstein 4 Acknowledgments I would like to thank the manY people who have been with me over the years and who have contributed in one way or another to the completion of this master’s degree. Especially my advisor, Davidson Cury, who in a constructivist way disoriented me several times as a stimulus to the search for new answers.
    [Show full text]
  • Semantic Search
    Semantic Search Philippe Cudre-Mauroux Definition ter grasp the semantics (i.e., meaning) and the context of the user query and/or Semantic Search regroups a set of of the indexed content in order to re- techniques designed to improve tra- trieve more meaningful results. ditional document or knowledge base Semantic Search techniques can search. Semantic Search aims at better be broadly categorized into two main grasping the context and the semantics groups depending on the target content: of the user query and/or of the indexed • techniques improving the relevance content by leveraging Natural Language of classical search engines where the Processing, Semantic Web and Machine query consists of natural language Learning techniques to retrieve more text (e.g., a list of keywords) and relevant results from a search engine. results are a ranked list of documents (e.g., webpages); • techniques retrieving semi-structured Overview data (e.g., entities or RDF triples) from a knowledge base (e.g., a knowledge graph or an ontology) Semantic Search is an umbrella term re- given a user query formulated either grouping various techniques for retriev- as natural language text or using ing more relevant content from a search a declarative query language like engine. Traditional search techniques fo- SPARQL. cus on ranking documents based on a set of keywords appearing both in the user’s Those two groups are described in query and in the indexed content. Se- more detail in the following section. For mantic Search, instead, attempts to bet- each group, a wide variety of techniques 1 2 Philippe Cudre-Mauroux have been proposed, ranging from Natu- matical tags (such as noun, conjunction ral Language Processing (to better grasp or verb) to individual words.
    [Show full text]
  • SEKI@Home, Or Crowdsourcing an Open Knowledge Graph
    SEKI@home, or Crowdsourcing an Open Knowledge Graph Thomas Steiner1? and Stefan Mirea2 1 Universitat Politècnica de Catalunya – Department LSI, Barcelona, Spain [email protected] 2 Computer Science, Jacobs University Bremen, Germany [email protected] Abstract. In May 2012, the Web search engine Google has introduced the so-called Knowledge Graph, a graph that understands real-world en- tities and their relationships to one another. It currently contains more than 500 million objects, as well as more than 3.5 billion facts about and relationships between these different objects. Soon after its announce- ment, people started to ask for a programmatic method to access the data in the Knowledge Graph, however, as of today, Google does not provide one. With SEKI@home, which stands for Search for Embedded Knowledge Items,weproposeabrowserextension-basedapproachtocrowdsource the task of populating a data store to build an Open Knowledge Graph. As people with the extension installed search on Google.com, the ex- tension sends extracted anonymous Knowledge Graph facts from Search Engine Results Pages (SERPs) to a centralized, publicly accessible triple store, and thus over time creates a SPARQL-queryable Open Knowledge Graph. We have implemented and made available a prototype browser extension tailored to the Google Knowledge Graph, however, note that the concept of SEKI@home is generalizable for other knowledge bases. 1Introduction 1.1 The Google Knowledge Graph With the introduction of the Knowledge Graph, the search engine Google has made a significant paradigm shift towards “things, not strings” [7], as a post on the official Google blog states. Entities covered by the Knowledge Graph include landmarks, celebrities, cities, sports teams, buildings, movies, celestial objects, works of art, and more.
    [Show full text]
  • The Singing Guitar
    August 2011 | No. 112 Your FREE Guide to the NYC Jazz Scene nycjazzrecord.com Mike Stern The Singing Guitar Billy Martin • JD Allen • SoLyd Records • Event Calendar Part of what has kept jazz vital over the past several decades despite its commercial decline is the constant influx of new talent and ideas. Jazz is one of the last renewable resources the country and the world has left. Each graduating class of New York@Night musicians, each child who attends an outdoor festival (what’s cuter than a toddler 4 gyrating to “Giant Steps”?), each parent who plays an album for their progeny is Interview: Billy Martin another bulwark against the prematurely-declared demise of jazz. And each generation molds the music to their own image, making it far more than just a 6 by Anders Griffen dusty museum piece. Artist Feature: JD Allen Our features this month are just three examples of dozens, if not hundreds, of individuals who have contributed a swatch to the ever-expanding quilt of jazz. by Martin Longley 7 Guitarist Mike Stern (On The Cover) has fused the innovations of his heroes Miles On The Cover: Mike Stern Davis and Jimi Hendrix. He plays at his home away from home 55Bar several by Laurel Gross times this month. Drummer Billy Martin (Interview) is best known as one-third of 9 Medeski Martin and Wood, themselves a fusion of many styles, but has also Encore: Lest We Forget: worked with many different artists and advanced the language of modern 10 percussion. He will be at the Whitney Museum four times this month as part of Dickie Landry Ray Bryant different groups, including MMW.
    [Show full text]