Leveraging Entity Types and Properties for Knowledge Graph Exploitation

Total Page:16

File Type:pdf, Size:1020Kb

Leveraging Entity Types and Properties for Knowledge Graph Exploitation DEPARTMENT OF INFORMATICS UNIVERSITYOF FRIBOURG (SWITZERLAND) Leveraging Entity Types and Properties for Knowledge Graph Exploitation THESIS Presented to the Faculty of Science of the University of Fribourg (Switzerland) in consideration for the award of the academic grade of Doctor scientiarum informaticarum by ALBERTO TONON from ITALY Thesis No: 2018 UniPrint 2017 Accepted by the Faculty of Science of the University of Fribourg (Switzerland) upon the recommendation of Prof. Dr. Krisztian Balog and Dr. Gianluca Demartini. Fribourg, May 22, 2017 Thesis supervisor Dean Prof. Dr. Philippe Cudré-Mauroux Prof. Dr. Christian Bochet iii Declaration of Authorship Title: Leveraging Entity Types and Properties for Knowledge Graph Exploitation I, Alberto Tonon, declare that I have authored this thesis independently, without illicit help, that I have not used any other than the declared sources/resources, and that I have explicitly marked all material which has been quoted either literally or by content from the used sources. Signed: Date: v “Educating the mind without educating the heart is no education at all.” — Aristotle Acknowledgments There are many people without whom I would never have achieved this goal. All of them con- tributed in their own way and proved to be essential to my academic growth, to my personal development, and to my mental health. Here I want to thank them all. Thanks to my supervisor, Professor Philippe Cudré-Mauroux. Philippe provided me with inestimable advice and guidance, he also showed great patience, and always understood the issues that each Ph.D. candidate had. One of Philippe’s best achievements is, in my opinion, the creation of a familiar and friendly group where students and researchers can freely interact, provide feedback, collaborate, and have a lot of fun. The path to obtaining a Ph.D. is long and difficult, I think that working in such an environment helped me a lot in dealing with all the difficulties I faced. Special thanks go to Gianluca Demartini and Monica Noselli, my first family in Fribourg. Similarly to Philippe, Gianluca gave me invaluable advice, introduced me to the academic world, came with me to my first conference, and contributed significantly to the research presented in this thesis. I would also like to acknowledge an important component of my daily life in the lab: Dr. Roman Prokofyev. Roman bravely shared an office with me for almost five years, managed to bear me even during deadline marathons and stressful periods. Not less important are all the other members and friends of the lab with whom I shared important moments (in alphabetic order): Alisa, Alyia, Artem, Dingqi, Djellel, Esther, Giuseppe, Ines, Julia, Laura, Marcin, Martin, Michael, Michele, Michelle, Paolo, Ruslan, Sabrina, Victor. You guys are great! My friends in Italy and in Fribourg were also essential during the last five years. I would like, in particular, to mention all friends from “FR Fun!”, and my friends from all times, Alessan- dro, Annalisa, and Lorena. Finally, I owe eternal gratitude to my parents, Gianna and Luigi, my sister Cristiana, my grandparents, Edoardo and Vally, and to Aurora for supporting me unconditionally. They inspired me and helped me becoming what I am. Fribourg, June 2, 2017 Alberto Tonon ix Abstract A Knowledge Graph is a knowledge base containing semi-structured information represented as a graph. Entries (nodes) in a knowledge graph are called entities. Knowledge Graphs today play a central role in the services provided by major Web players. Recent examples include the Google Knowledge Vault, Yahoo!’s Knowledge Graph, and Bing’s Knowledge and Action Graph. In addition, open-source Knowledge Graphs such as DBpedia or Wikidata, are also available for reuse. Being able to tap into a Knowledge Graph and exploit connections among its entities is key to numerous tasks related for example to computational linguistics, information retrieval, and question answering. In this thesis we develop new methods for effectively retrieving entities and for ranking their types. In addition, we present algorithms that improve data quality in knowledge graphs by finding misused entity properties (labeled edges in the Knowledge Graph), and we show how Knowledge Graphs can be used in practice by exploiting them to effectively detect newsworthy events. Finally, we propose a novel evaluation methodology that can be used for continuously evaluating entity-centric retrieval systems. We start this thesis by presenting novel techniques to retrieve entities as responses to keyword queries. This task is called Ad-hoc Object Retrieval, and is essential for effectively exploiting Knowledge Graphs. We show that triplestores can be used to efficiently retrieve relevant responses by exploring the surroundings of entities obtained by applying standard Information Retrieval methods. This leads to significant improvements in average precision and NDCG with little increase in execution time. Subsequently, we introduce the novel task of Type Ranking (TRank), which consists in ranking all types of a given entity based on the textual context in which it appears. Information on entity types that best fit a certain context can be shown to end-users for text understanding, text summarization and for search results diversification. We experimentally demonstrate that approaches for TRank based on the type hierarchy of the Knowledge Graph provide more accurate results. As information on the schema of the Knowledge Graph is exploited by several of our methods, we also studied how to verify the adherence of the relations contained in the knowledge base to their formal specification. Specifically, we propose a method for detecting when properties in the Knowledge Graph are misused or need to be specialized. We present results that show how entropy can be used to detect the misused properties. We then switch to a more applied context by presenting a real world application that makes use of semantic technologies and Knowledge Graphs. The system we developed uses a combination of entity linking, anomaly detection and reasoning to efficiently and effectively detect newsworthy events in microblogs. We show that our xi Abstract system outperforms state-of-the-art methods based on query expansion. Finally, we tackle the problem of how to evaluate entity-centric retrieval systems like those we proposed. In this context, we introduce a new evaluation methodology that uses crowdsourcing to evaluate sets of systems in a fair and continuous fashion, and define techniques to weight the strictness or leniency of different crowds evaluating the retrieved entities. We analyze the benefits and drawbacks of our methodology by comparing AOR systems developed at different points in time and study how standard Information Retrieval metrics, AOR system ranking, and several pooling techniques behave in such a continuous evaluation context. Keywords: Knowledge Graphs, Entities, Entity Types, Data Integration. xii Résumé Un graphe de connaissances est une base de connaissances contenant des informations semi-structurées représentées comme un graphe. Les entrées (noeuds) dans un graphe de connaissances sont appelées entités. De nos jours, les graphes de connaissances jouent un role central dans les servies fournis par les majeurs acteurs du Web. Des examples récents incluent le "Google Knowledge Vault", le graphe de connaissances de Yahoo et le "Knowledge and Action Graph" de Bing. De surcroit, les graphes de connaissances open-source, comme DBpedia ou Wikipedia, sont aussi disponibles pour réutilisation. Etre en mesure de bénéficier des graphes de connaissances et exploiter les connections entre les entités sont la clé pour divers taches reliées, par example au calcul linguistique, l’extraction de connaissances, ou les questions réponses. Dans cette thèse, nous développons des nouvelles méthodes pour la recherche d’entités et pour le classement de leurs types. En plus, nous introduisons des algo- rithmes pour améliorer la qualité des données dans les graphes de connaissances en trouvant les propriétés d’entités mal utilisés (les arêtes étiquetées dans le graphe de connaissance) et nous illustrons comment les graphes de connaissances peuvent être utilisés en pratique en les exploitant dans l’otique de détecter les événements pertinents aux nouvelles (newsworthy). Finalement, nous proposons une nouvelle évaluation méthodologique qui peut être utilisée pour l’évaluation continuelle des entités centrées sur les systèmes d’extraction. Nous commençons cette thèse par présenter les nouvelles techniques pour extraire les entités comme réponses à des requêtes mots-clés. Cette tache, appelée Ad-hoc Object Retrieval, est essentielle pour l’exploitation efficace des graphes de connaissances. Nous montrons aussi que les triplestores peuvent être utilisés pour extraire efficacement les réponses pertinentes en exploitant les voisins des entités obtenus par l’application des méthodes classiques de l’extraction de connaissances. Ceci conduit à des améliorations signifiantes en précision moyenne et NDCG avec une minime augmentation dans le temps d’execution. En consé- quence, nous introduisons une nouvelle tache de classement de type (TRank), qui consiste dans le classement de tous les types d’une certaine entité basée sur le contexte textuel dans lequel elle apparait. Les informations sur les types des entités qui siéent le plus dans un contexte spécifique peuvent être montrées aux utilisateurs finaux pour la compréhension du texte, la récapitulation et la diversification de la recherche
Recommended publications
  • Knowit VQA: Answering Knowledge-Based Questions About Videos
    The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20) KnowIT VQA: Answering Knowledge-Based Questions about Videos Noa Garcia,1 Mayu Otani,2 Chenhui Chu,1 Yuta Nakashima1 1Osaka University, Japan, 2CyberAgent, Inc., Japan {noagarcia, chu, n-yuta}@ids.osaka-u.ac.jp otani [email protected] Abstract We propose a novel video understanding task by fusing knowledge-based and video question answering. First, we in- troduce KnowIT VQA, a video dataset with 24,282 human- generated question-answer pairs about a popular sitcom. The dataset combines visual, textual and temporal coherence rea- /HRQDUG+DYH\RXQRWLFHGWKDW+RZDUGFDQWDNHDQ\WRSLFDQGXVHLWWRUHPLQG soning together with knowledge-based questions, which need \RXWKDWKHZHQWWRVSDFH" of the experience obtained from the viewing of the series to be 6KHOGRQ,QWHUHVWLQJK\SRWKHVLV/HW¶VDSSO\WKHVFLHQWLILFPHWKRG answered. Second, we propose a video understanding model /HRQDUG2ND\+H\+RZDUGDQ\WKRXJKWVRQZKHUHZHVKRXOGJHWGLQQHU" by combining the visual and textual video content with spe- +RZDUG$Q\ZKHUHEXWWKH6SDFH6WDWLRQ2QDJRRGGD\GLQQHUZDVDEDJIXOO cific knowledge about the show. Our main findings are: (i) the RIPHDWORDI%XWKH\\RXGRQ¶WJRWKHUHIRUWKHIRRG\RXJRWKHUHIRUWKHYLHZ incorporation of knowledge produces outstanding improve- ments for VQA in video, and (ii) the performance on KnowIT 9LVXDO +RZPDQ\SHRSOHDUHWKHUHZHDULQJJODVVHV"2QH VQA still lags well behind human accuracy, indicating its 7H[WXDO :KRKDVEHHQWRWKHV:KRKDVEHHQWRWKHVSDFH"SDFH" +R+RZDUGZDUG usefulness for studying current video modelling limitations. 7HPSRUDO +RZGRWKH\+RZGRWKH\ILQLVKWKHFRQYHUVDWLRQ" ILQLVKWKHFRQYHUVDWLRQ"66KDNLQJKDQGVKDNLQJKDQGV Introduction .QRZOHGJH :KRRZQVWKHSODFHZKHUHWKH:KRRZQVWKHSODFHZKHUHWKH\DUHVWDQGLQJ"\DUHVWDQGLQJ"66WXDUWWXDUW Visual question answering (VQA) was firstly introduced in (Malinowski and Fritz 2014) as a task for bringing to- Figure 1: Types of questions addressed in KnowIt VQA. gether advancements in natural language processing and image understanding.
    [Show full text]
  • Semantic Search
    Semantic Search Philippe Cudre-Mauroux Definition ter grasp the semantics (i.e., meaning) and the context of the user query and/or Semantic Search regroups a set of of the indexed content in order to re- techniques designed to improve tra- trieve more meaningful results. ditional document or knowledge base Semantic Search techniques can search. Semantic Search aims at better be broadly categorized into two main grasping the context and the semantics groups depending on the target content: of the user query and/or of the indexed • techniques improving the relevance content by leveraging Natural Language of classical search engines where the Processing, Semantic Web and Machine query consists of natural language Learning techniques to retrieve more text (e.g., a list of keywords) and relevant results from a search engine. results are a ranked list of documents (e.g., webpages); • techniques retrieving semi-structured Overview data (e.g., entities or RDF triples) from a knowledge base (e.g., a knowledge graph or an ontology) Semantic Search is an umbrella term re- given a user query formulated either grouping various techniques for retriev- as natural language text or using ing more relevant content from a search a declarative query language like engine. Traditional search techniques fo- SPARQL. cus on ranking documents based on a set of keywords appearing both in the user’s Those two groups are described in query and in the indexed content. Se- more detail in the following section. For mantic Search, instead, attempts to bet- each group, a wide variety of techniques 1 2 Philippe Cudre-Mauroux have been proposed, ranging from Natu- matical tags (such as noun, conjunction ral Language Processing (to better grasp or verb) to individual words.
    [Show full text]
  • SEKI@Home, Or Crowdsourcing an Open Knowledge Graph
    SEKI@home, or Crowdsourcing an Open Knowledge Graph Thomas Steiner1? and Stefan Mirea2 1 Universitat Politècnica de Catalunya – Department LSI, Barcelona, Spain [email protected] 2 Computer Science, Jacobs University Bremen, Germany [email protected] Abstract. In May 2012, the Web search engine Google has introduced the so-called Knowledge Graph, a graph that understands real-world en- tities and their relationships to one another. It currently contains more than 500 million objects, as well as more than 3.5 billion facts about and relationships between these different objects. Soon after its announce- ment, people started to ask for a programmatic method to access the data in the Knowledge Graph, however, as of today, Google does not provide one. With SEKI@home, which stands for Search for Embedded Knowledge Items,weproposeabrowserextension-basedapproachtocrowdsource the task of populating a data store to build an Open Knowledge Graph. As people with the extension installed search on Google.com, the ex- tension sends extracted anonymous Knowledge Graph facts from Search Engine Results Pages (SERPs) to a centralized, publicly accessible triple store, and thus over time creates a SPARQL-queryable Open Knowledge Graph. We have implemented and made available a prototype browser extension tailored to the Google Knowledge Graph, however, note that the concept of SEKI@home is generalizable for other knowledge bases. 1Introduction 1.1 The Google Knowledge Graph With the introduction of the Knowledge Graph, the search engine Google has made a significant paradigm shift towards “things, not strings” [7], as a post on the official Google blog states. Entities covered by the Knowledge Graph include landmarks, celebrities, cities, sports teams, buildings, movies, celestial objects, works of art, and more.
    [Show full text]
  • Seo Report for Brands July 2019
    VOICE ASSISTANT SEO REPORT FOR BRANDS JULY 2019 GIVING VOICE TO A REVOLUTION Table of Contents About Voicebot About Magic + Co An Introduction to Voice Search Today // 3 Voicebot produces the leading independent Magic + Co. is a next-gen creative services firm research, online publication, newsletter and podcast whose core expertise is in designing, producing, and Consumer Adoption Trends // 10 focused on the voice and AI industries. Thousands executing on conversational strategy, the technology of entrepreneurs, developers, investors, analysts behind it, and associated marketing campaigns. Voice Search Results Analysis // 16 and other industry leaders look to Voicebot each Grading the Experts // 32 week for the latest news, data, analysis and insights Facts defining the trajectory of the next great computing • 25 people, technologists, strategists, and Practical Voice Assistant SEO Strategies // 36 platform. At Voicebot, we give voice to a revolution. creatives with focus on conversational design and technologies. Additional Resources // 39 • 1 Webby Award Nominee and 3 Honorees. • 3 years old and global leader. Methodology • We’ve worked primarily with CPG, but our practice extends into utilities, governments, Consumer survey data was collected online during banking, entertainment, healthcare and is the first week of January 2019 and included 1,038 international in scope. U.S. adults age 18 or older that were representative of U.S. Census demographic averages. Findings were compared to earlier survey data collected in Magic + Co. the first week of September 2018 that included responses from 1,040 U.S. adults. Voice search data results from Amazon Echo Smart Speakers with Alexa, Apple HomePod with Siri, Google Home and a Smartphone with Google Assistant, and Samsung Galaxy S10 with Bixby, were collected in Q2 2019.
    [Show full text]
  • Arxiv:1812.02903V1 [Cs.LG] 7 Dec 2018 Ized Server
    APPLIED FEDERATED LEARNING: IMPROVING GOOGLE KEYBOARD QUERY SUGGESTIONS Timothy Yang*, Galen Andrew*, Hubert Eichner* Haicheng Sun, Wei Li, Nicholas Kong, Daniel Ramage, Franc¸oise Beaufays Google LLC, Mountain View, CA, U.S.A. ftimyang, galenandrew, huberte haicsun, liweithu, kongn, dramage, [email protected] ABSTRACT timely suggestions are necessary in order to maintain rele- vance. On-device inference and training through FL enable Federated learning is a distributed form of machine learn- us to both minimize latency and maximize privacy. ing where both the training data and model training are decen- In this paper, we use FL in a commercial, global-scale tralized. In this paper, we use federated learning in a commer- setting to train and deploy a model to production for infer- cial, global-scale setting to train, evaluate and deploy a model ence – all without access to the underlying user data. Our to improve virtual keyboard search suggestion quality with- use case is search query suggestions [4]: when a user enters out direct access to the underlying user data. We describe our text, Gboard uses a baseline model to determine and possibly observations in federated training, compare metrics to live de- surface search suggestions relevant to the input. For instance, ployments, and present resulting quality increases. In whole, typing “Let’s eat at Charlie’s” may display a web query sug- we demonstrate how federated learning can be applied end- gestion to search for nearby restaurants of that name; other to-end to both improve user experiences and enhance user types of suggestions include GIFs and Stickers. Here, we privacy.
    [Show full text]
  • Knolwedge Acquisi$On from Music Digital Libraries
    Knolwedge Acquision from Music Digital Libraries Sergio Oramas, Mohamed Sordo Mo0vaon • Musicological Knowledge is hidden between the lines • Machines don’t know how to read Why Knowledge Acquisi0on? • Obtain knowledge automacally • Make complex ques0ons • Visualize the informaon • Improve navigaon • Share knowledge 4 Musical Libraries Musical Libraries digital recording, scan Musical Libraries OCR, manual transcrip0on digital recording, scan Musical Libraries Informaon Extrac0on, seman0c annotaon OCR, manual transcrip0on digital recording, scan Musical Libraries Informaon Extrac0on, seman0c annotaon Current DL OCR, manual transcrip0on digital recording, scan Musical Libraries Web search Informaon Extrac0on, seman0c annotaon Current DL OCR, manual transcrip0on digital recording, scan Musical Libraries Web search Informaon Extrac0on, seman0c annotaon Current DL OCR, manual transcrip0on digital recording, scan Seman0c Web • The Semanc Web aims at conver0ng the current web, dominated by unstructured and semi-structured documents into a web of linked data. • Achievements useful for Digital Libraries – Common framework for data representaon and interconnec0on (RDF, ontologies) – Seman0c technologies to annotate texts (En0ty Linking) – Language for complex queries (SPARQL) Wikipedia and DBpedia - Digital Encyclopedia - Knowledge Base - Unstructured - Structured - Keyword search - Query search 13 14 15 DBpedia • Dbpedia example queries – Composers born in Vienna in XVIII Century – American jazz musicians that have wri\en songs recorded by RCA
    [Show full text]
  • Google's Knowledge Graph
    Google’s Knowledge Graph Google has just announced that it will completely reshape the presentation of its search results. Instead of listing websites where people can find the answer to their queries, it will present data, links, pictures, etc. from its own databases, some 500 million “items”. The redesign is said to represent the biggest change in search for the last five years, not just conceptually but also in terms of business impact. The goal, of course, is to make sure that web surfers remain on Google properties instead of clicking away to another site for answers. Internet companies all over the world already complain and regulators can add the move to their list of Google features to investigate from an antitrust perspective. The announcement is also a subtle tactical move, just one day before Facebook’s IPO. It reinforces Google’s positioning as the owner of the “knowledge network” as opposed to the “social network”. It is hard to anticipate what the new ‘product’ will really be but there are a few important pointers. First, it will make connections between the different information elements. At the moment, search provides a list of sources but does not really categorize them. Knowledge Graph will and may even link the sources to one another. People (not least Tim Berners-Lee) have always argued for the, so called ‘semantic web’ to replace the traditional web. Google’s Knowledge Graph is a fine approximation of the idea. Another aspect of the service will, hopefully, have to do with formatting. We may get a more standardized presentation of the information, which is quite important for rapid processing.
    [Show full text]
  • The Rise of KNOWLEDGE GRAPHS
    THOUGHT LEADERSHIP SERIES The Rise of KNOWLEDGE GRAPHS 2019 | MAR THOUGHT LEADERSHIP SERIES | March 2019 2 MAKING NEW CONNECTIONS WITH KNOWLEDGE GRAPHS In today’s business world, time-to-insight and time-to-action are critical competitive differentiators. The demand for quick, easy access to information is growing. However, the challenge of combining data into meaningful information is also mounting—alongside the proliferation of data sources and types. While companies nowadays are investing heavily in initiatives to increase the amount of data at their disposal, most are spending more time finding than analyzing data. Data silos remain a huge problem at many enterprises, and legacy data management technologies and processes are having trouble keeping up with the speed, scalability, and flexibility requirements of new workloads and use cases. Data volumes are exploding in will be driven by specific applications Graph databases, a type of NoSQL organizations and the pressure to exploit and increasing expertise. “NoSQL” data database that employs structures that data for faster time to insight is management approaches, features, and with nodes, edges, and properties to increasing. Along with this store data and places data growth, there is a new a heavy emphasis wave of tools to expand The ability for knowledge graphs to gather on relationships, are what is possible with the growing in use. More proliferation of data and information, relationships, and insights—and recently, the term data types. connect those facts—allows organizations to “knowledge graph” While relational databases in particular has are still both the biggest discern context in data, which is important for gained popularity with source of transactional as well as complying with the introduction of data in most organizations extracting value several high-profile and the greatest source of increasingly stringent data privacy regulations.
    [Show full text]
  • Wikidata As a Knowledge Graph for the Life Sciences
    Washington University School of Medicine Digital Commons@Becker Open Access Publications 2020 Wikidata as a knowledge graph for the life sciences Andra Waagmeester Malachi Griffith Obi L. Griffith et al. Follow this and additional works at: https://digitalcommons.wustl.edu/open_access_pubs FEATURE ARTICLE SCIENCE FORUM Wikidata as a knowledge graph for the life sciences Abstract Wikidata is a community-maintained knowledge base that has been assembled from repositories in the fields of genomics, proteomics, genetic variants, pathways, chemical compounds, and diseases, and that adheres to the FAIR principles of findability, accessibility, interoperability and reusability. Here we describe the breadth and depth of the biomedical knowledge contained within Wikidata, and discuss the open-source tools we have built to add information to Wikidata and to synchronize it with source databases. We also demonstrate several use cases for Wikidata, including the crowdsourced curation of biomedical ontologies, phenotype-based diagnosis of disease, and drug repurposing. ANDRA WAAGMEESTER†, GREGORY STUPP†, SEBASTIAN BURGSTALLER- MUEHLBACHER, BENJAMIN M GOOD, MALACHI GRIFFITH, OBI L GRIFFITH, KRISTINA HANSPERS, HENNING HERMJAKOB, TOBY S HUDSON, KEVIN HYBISKE, SARAH M KEATING, MAGNUS MANSKE, MICHAEL MAYERS, DANIEL MIETCHEN, ELVIRA MITRAKA, ALEXANDER R PICO, TIMOTHY PUTMAN, ANDERS RIUTTA, NURIA QUERALT-ROSINACH, LYNN M SCHRIML, THOMAS SHAFEE, DENISE SLENTER, RALF STEPHAN, KATHERINE THORNTON, GINGER TSUENG, ROGER TU, SABAH UL-HASAN, EGON WILLIGHAGEN, CHUNLEI WU AND ANDREW I SU* Introduction advance efforts by the open-data community to Integrating data and knowledge is a formidable build a rich and heterogeneous network of scien- challenge in biomedical research. Although new tific knowledge. That knowledge network could, scientific findings are being discovered at a in turn, be the foundation for many computa- *For correspondence: asu@ rapid pace, a large proportion of that knowl- tional tools, applications and analyses.
    [Show full text]
  • Summarized by © Lakhasly.Com Real-Time Data Feeds Although It
    Real-time Data Feeds Although it doesn’t promote itself as such, Google is actually a collection of data and a set of tools for working with it. It has progressed from an index of web pages to a central hub for real-time data feeds on just about anything that can be measured such as weather reports, travel reports, stock market and shares, shopping suggestions, travel suggestions, and several other things. Sorting Tools Big Data analysis which implies utilizing tools intended to deal with and comprehend this massive data becomes an integral factor whenever users carry out a search query. The Google’s algorithms run complex calculations intended to match the questions that user entered with all the available data. It will try to determine whether the user is searching for news, people, facts or statistics, and retrieve the data from the appropriate feed. Knowledge Graph Pages Google Knowledge Graph is a tool or database which collects all the data and facts about people, places and things along with proper differentiation and relationship between them. It is then later used by Google in solving our queries with useful answers. Google knowledge graph is user-centric and it provides them with useful relevant information quickly and easily. Literal & Semantic search The main aim of the literal search engine is to find the root of your search phrase by looking for a match for some of the word or entire phrase. The root of the phrase is then examined and explored upon to display better search results. While semantic search engine tries to understand the context of the phrase by analyzing the terms and language in knowledge graph database to directly answer a question with specific information.
    [Show full text]
  • Messageontap: a Suggestive Interface to Facilitate Messaging-Related Tasks
    MessageOnTap: A Suggestive Interface to Facilitate Messaging-related Tasks Fanglin Chen Kewei Xia Human-Computer Interaction Institute Electronic Information School Carnegie Mellon University Wuhan University Pittsburgh, PA, USA Wuhan, China [email protected] [email protected] Karan Dhabalia Jason I. Hong Electrical and Computer Engineering Human-Computer Interaction Institute Carnegie Mellon University Carnegie Mellon University Pittsburgh, PA, USA Pittsburgh, PA, USA [email protected] [email protected] ABSTRACT KEYWORDS Text messages are sometimes prompts that lead to infor- Messaging/Communication, Information Seeking & Search; mation related tasks, e.g. checking one’s schedule, creating Contextual Computing; Personal Data/Tracking; User Expe- reminders, or sharing content. We introduce MessageOnTap, rience Design; Text/Speech/Language; Productivity a suggestive interface for smartphones that uses the text in ACM Reference Format: a conversation to suggest task shortcuts that can stream- Fanglin Chen, Kewei Xia, Karan Dhabalia, and Jason I. Hong. 2019. line likely next actions. When activated, MessageOnTap MessageOnTap: A Suggestive Interface to Facilitate Messaging- uses word embeddings to rank relevant external apps, and related Tasks. In CHI Conference on Human Factors in Computing parameterizes associated task shortcuts using key phrases Systems Proceedings (CHI 2019), May 4–9, 2019, Glasgow, Scotland mentioned in the conversation, such as times, persons, or Uk. ACM, New York, NY, USA, 14 pages. https://doi.org/10.1145/ events. MessageOnTap also tailors the auto-complete dictio- 3290605.3300805 nary based on text in the conversation, to streamline any 1 INTRODUCTION text input. We first conducted a month-long study of mes- Messaging apps are among the most popular apps for smart- saging behaviors (N=22) that informed our design.
    [Show full text]
  • Adding Realtime Coverage to the Google Knowledge Graph
    Adding Realtime Coverage to the Google Knowledge Graph Thomas Steiner1?, Ruben Verborgh2, Raphaël Troncy3, Joaquim Gabarro1, and Rik Van de Walle2 1 Universitat Politècnica de Catalunya – Department lsi, Spain {tsteiner,gabarro}@lsi.upc.edu 2 Ghent University – IBBT, ELIS – Multimedia Lab, Belgium {ruben.verborgh,rik.vandewalle}@ugent.be 3 EURECOM, Sophia Antipolis, France [email protected] Abstract. In May 2012, the Web search engine Google has introduced the so-called Knowledge Graph, a graph that understands real-world entities and their relationships to one another. Entities covered by the Knowledge Graph include landmarks, celebrities, cities, sports teams, buildings, movies, celestial objects, works of art, and more. The graph enhances Google search in three main ways: by disambiguation of search queries, by search log-based summarization of key facts, and by explo- rative search suggestions. With this paper, we suggest a fourth way of enhancing Web search: through the addition of realtime coverage of what people say about real-world entities on social networks. We report on a browser extension that seamlessly adds relevant microposts from the social networking sites Google+, Facebook, and Twitter in form of a panel to Knowledge Graph entities. In a true Linked Data fashion, we inter- link detected concepts in microposts with Freebase entities, and evaluate our approach for both relevancy and usefulness. The extension is freely available, we invite the reader to reconstruct the examples of this paper to see how realtime opinions may have changed since time of writing. 1 Introduction With the introduction of the Knowledge Graph, the search engine Google has made a significant paradigm shift towards “things, not strings” [3], as a post on the official Google blog states.
    [Show full text]