Wonderful!^ ^ Ppedia, a Collaboratively Edited, Free Internet Encyclopedia Run De Nonprofit Organization Wikimedia Foundation, Inc

Total Page:16

File Type:pdf, Size:1020Kb

Wonderful!^ ^ Ppedia, a Collaboratively Edited, Free Internet Encyclopedia Run De Nonprofit Organization Wikimedia Foundation, Inc INFOTECH wrong. Goldsborough (2012) stated that "though it's often pooh- poohed by teachers and editors for its lack of academic rigor, Wikipedia has a self-correcting mechanism that eliminates much inaccuracy" (68). Policies. Three core content policies guide content creation on Wonderful!^ ^ ppedia, a collaboratively edited, free Internet encyclopedia run de nonprofit organization Wikimedia Foundation, Inc.. First, Revisiting Wikipipéaié á articles must present a neutral point of view. Second, the content must be verifiable. Third, original research should not be posted. Vandalism. Although vandalism continues to plague this re- Annette Lamb and Larry Johnson source, an army of volunteers helps to keep entries accurate. About 7 percent of edits to Wikipedia are vandalism, and almost all of egardless of your personal feel- these edits are made by anonymous users (Potthast, 2010). Users who violate rules-such as posting false information, promoting a ings about Wikipedia, it's being product, or adding inappropriate content-are blocked from edit- used by students every day. ing. In addition, the development of user access levels ensures that the most important articles are protected. In a recent Pew Intemet survey (Purcell, 2012), teachers indicated Authorship. Active article creators and editors are known as that Wikipedia was the second most frequently used online tool Wikipedians. While some are amateurs, others are experts in the behind Google. Even though some teachers continue to ban its use areas where they post articles. Many professionals pay close at- for particular assignments, Wikipedia use has increased steadily tention to the articles on topics related to their area of exper- overall (Zickuhr a Rainie, 2011). tise. For instance, the day Pluto was reclassified from planet to Launched a dozen years ago, Wikipedia currently contains mil- dwarf planet, the Pluto page (http://en.wikipedia.org/wiki/Pluto) lions of articles written by volunteers like you. Articles by new was buzzing with activity from those actually participating in the users are reviewed prior to posting and can be edited by others International Astronomical Union vote on the issue. to improve their quality. A community of users has established Sources. Most websites, including Wikipedia, aren't primary or a set of principles to follow in adding content and guidelines for «EdiUngWiMiD«dia... ^ EOuKaiMi Tafk SandMi Pralai« incas WanMl ContAbuttona Legou resolving conflicts. Anici* Talk iMrdi Q Pluto V^IKIPEDJA a * . FmmWdpiKU.ihaliManetdoe«dto WlKIPEDiA Thiiwl ... ... ... -infinMiiií m iittiü IBM MinftiiiinftBKHiiaaitti Pluto, lonr piu»E Sclar Syai !!Sn!^^i^!^l'l!í!íí!r™'t^' EngHsh CunamnanB ÛriginaBrC wdïad aa 1ha ninlh pianst fvorn tha Sufx. Piula waa racalagcuad as a dniari pUflAC 1 anaplulcMl oning lo tha (fBCwary that H íi or^ orw ol aavaral Ivga tiodaa »thin Uw Kuipaf 1 The Free Encyclofiedia mJ^Shop Likaothai namBar« M tha Kuipar Sa«. Piulo 11 eamçotta prlrnarily ol lock Hvl Wa wy) ta 1 4 iioooo+anictos raialivalya mil. adoraiiimalaly «««nth tha maaa ol tha Eanh'a Moon am or^a^hi« ita vohvna. 1 llAaaan« cenric md tigMy iKUrwd oiW DM IMiaa H Ircm 30 to 40 Au (4.4-7.4 bUkm km) 1 Inxntf« Su i Thia cauaaa PkiM lo ptiikidtcitt^ coma doaar ID tte SIM ttvn »«(ituna. A* ol 1 Español ^ Deutsch ASOUtWlopMftl .S La enciclopedia libre / Die Ireie Enzyklopädie "^^^^^^^ Frnmitidi iconryinlOWunbliOOe.PlutoMKilaaalliadaiaplanit.InthalilaigTOt. 1 &40 000+artjctjbs 1 510 000+Aniko) ConttoWuipaiBa Wloaingth t dbcovaiv ol minor planai 2060 Ovoi m Iha outar Solar Syitam and tha | iBCognUon o< Pluto's nWlivaly low mua. iti ilalua as a major «ianat Bagwx lo ba C°" DOItNU quaaMnad. '^ In tha Ma axh anl aarty 2<st caiMuMa. mwiy oCiacta lanllar lo Pkito »wa '™' • .•rf«MliadruaM0i'*'viaamgng • PyCCKHM Français m Iha outar Solar Sysiam. ndably tha icaitwad ilBC ot^acl Eiii lr> ZtX». nfMh la RalalMcnangu a7%tTMn naatwa than PlutO.^'^ On August 24. 2006. iha IrfamaUonal Aaliorontc^ UrMn DIaeswy . L'encyclopédie libre (lAUldaTini•0 wha n mwnc to ba a 'planai' mritrin iha Soiar 3yi1*m Thia aalkMion axckidad Smoaj"' 940000+CTIITOM 1 330 000+articles PvmintnWK Pftitoaia lanol and nídaií H aa a mambar ol tha now calagoy •anuil (àtiwf along urth En> ^ K Aflac tha (Kiaasincalion. Piulo Ma addad 10 iha liai ol minor i^anMi Mid givan ^" •çamtomiaSon OH>an«t«n itwmmCw 134340.^'^^^ A numtw ol aciarllsta hola thai Pkito aMuM coMnia lo t>a Italiano Polski Cna till pao« ctaaainad aa a planat. and that othar Otmt tfanata ihauM tia addad 10 Iha roatar ol (Unat* ""nl«<uuan IMMonuM along-4lh *luto.^î*^ *nnunaMIen «•'okm L'enciclopedia libera V/oIrm encyklopedia 99aoOO+voci Portugués A enciclopódia livre secondary sources. However, this doesn't mean they aren't valu- able. Most of us grew up using tertiary sources such as World Book Fncyclopedia. Tertiary sources collect existing knowledge rather Soarch . Sudim . RBehwdiw • Zookni. RIcaica . Siuk» . Buicar. no«« . ttm • Buaca. SSk K Tim Wim . nouiy« . Oofca • Sak • Hdku . HIaiUni. lUnUt. ii71 . C«ri. Ara . j;l..,. Ciulara . .i: than generating new information. Article content is only as good Hradal' ' Sag. Sariu. nparpara • PaMka - PDIICI ' Cari • via'n . Ttpcaxa - Uoay • Bllatu Suk. Unga-Tnu-M as the sources cited. Reliability. The reliability of Wikipedia compared with other JÍ encyclopedias has been studied extensively over the past decade. One of the most controversial studies compared the accuracy of WICKED OR WONDERFUL? Wikipedia with Fncyclopedia Britannica head to head. It found that they were similar in accuracy (Giles, 2005). A more recent In the early days, scholars charged that Wikipedia was full of in- study comparing the two encyclopedias on the topic of mental accuracies, bias beyond belief, and even harmful to young minds. disorders found that Wikipedia was as good or better than Britan- However, over the years, research has proven these naysayers nica (Reavley, 2012). Ragagopalan[Q: diff spelling in refs] (2011) HI 68 TEACHER LIBRARIAN 40;4 discovered that inaccuracies were rare in • EdiüngWIKipedla... 4 Eduscapes TalK sandbox Wikipedia, and Haigh (2011) concluded ¡Starch that Wikipedia was of sufficient quality Abraham Lincoln for nursing students. While Rector (2008) Prom Wikipedia. Ihe free encydopsdii This article is about the American prasident. For other uass, SOB Abmham Uncoln (disambiguation}. found Wikipedia to be less accurate than Main page Conisnts Abrstum Lincoln N»V«t>rehœm 1ir]i(sn/(rebru80' 13. 1B09-April 15, 1885) was Ihe 16th Abraham Lincoln other sources. Brown (2011) noted that Feaured content President of Ihe United States, setving from March 1861 until his assasslnalion in April 186S. Currant events Uncotn succasstully led Ns courity through its grealsst constitutional, military, and moral crisis Wikipedia was almost always accurate. Random anide - the American Civil War - prasarving the Union while ending slavery and promoiir^ acooomic Donate to Wkipedia and financial modernization. Reared in a poor family ort the westem Irorttier, Uncoln was mostly Academic Research. While it's not ap- Wiklmedia Shop seli-educated, and became a country lawyer, a Whig Party leader. Itlirxiia state legislator during the 1830s, afxj a ore-term member ot the United States House of Représentatives during the propriate to cite Wikipedia in most aca- » In »faction 1640s. Help Aboui Wikipedia After a series of debates in 1858 that gave nationel visibility to his opposition to the expansion demic research, Wikipedia is an excellent Community portal (À slavery. LirKOin lost a Senate race to his arch-rival, Stephen A. Douglas. Uncdn. a modérais Recent changes from a swing state, secured the Republican Party presittential nomination In 1860. With almost starting point for student inquiries and Contad Wikipedia no support in Ihe South. Uncoin swept the North and was alecled president In 1860. His election was the signal for seven southem slave states to declare their secession from the Union artd is pariicularly useful when investigating •" Toolbox form Ihe Confederacy, The departure of the Southemers gave Lincoln's party firm control of What links hers Congress, but no fomiula for compromise or reconciliation was found. Uncoln explained In Ns emerging topics not covered through other Reland changes second inai>gural address; 'Both parties deprec^ed war, but one of them would make war rather Upload nie than let the Natiort survive, and the other would accept war rather than let it perish, and the war Daguotrootypo o( AtnaNtm Urcok^ ol 090 S4. sources. Wikipedia should be only one of Sped ai pagas came." Psnnanent Hide t PiwidMl of tlM UnRiad S many sources used in the triangulation of evidence. By using multiple forms of evi- This article has multiple issues. Please help improve it or dence, the student can ensure the validity discuss these issues on the talli page. and reliability of findings. Many educators • Tl^ neutraiity of this article is disputed. suggest teaching student to use Wikipedia • This article irrcludes a list of references, related reading or properly rather telling them not to use it extemal links, but its sources remain unciear ttecause it (Harouni, 2009; Murley, 2008). iaci(s Iniine citations. (Aug20i0) The key is using Wikipedia effectively Q • This article may contain improper references to seif> is understanding how the resource is con- pubiished sources. (March 2011) structed. • This biographical article needs additionai citations for verification. (March 2012} WHAT YOU SHOULD KNOW School librarians often find themselves in who can edit these pages. A biographies of be addressed to improve the ariicle. These the middle of the Wikipedia debate deal- living persons policy (http://en.wiklpedia. messages may dispute the neutrality of ing with misconceptions and misuse of this org/wiki/Wikipedia:Biographies_of_liv- ariicle, question the factual accuracy, or resource. ing_persons] stresses the imporiance of note the need for additional citations for Let's explore a dozen things you should taking great care when reporiing on living verification.
Recommended publications
  • Realising the Potential of Algal Biomass Production Through Semantic Web and Linked Data
    LEAPS: Realising the Potential of Algal Biomass Production through Semantic Web and Linked data ∗ Monika Solanki Johannes Skarka Craig Chapman KBE Lab Karlsruhe Institute of KBE Lab Birmingham City University Technology (ITAS) Birmingham City University [email protected] [email protected] [email protected] ABSTRACT In order to derive fuels from biomass, algal operation plant Recently algal biomass has been identified as a potential sites are setup that facilitate biomass cultivation and con- source of large scale production of biofuels. Governments, version of the biomass into end use products, some of which environmental research councils and special interests groups are biofuels. Microalgal biomass production in Europe is are funding several efforts that investigate renewable energy seen as a promising option for biofuels production regarding production opportunities in this sector. However so far there energy security and sustainability. Since microalgae can be has been no systematic study that analyses algal biomass cultivated in photobioreactors on non-arable land this tech- potential especially in North-Western Europe. In this paper nology could significantly reduce the food vs. fuel dilemma. we present a spatial data integration and analysis frame- However, until now there has been no systematic analysis work whereby rich datasets from the algal biomass domain of the algae biomass potential for North-Western Europe. that include data about algal operation sites and CO2 source In [20], the authors assessed the resource potential for mi- sites amongst others are semantically enriched with ontolo- croalgal biomass but excluded all areas not between 37◦N gies specifically designed for the domain and made available and 37◦S, thus most of Europe.
    [Show full text]
  • From Cataloguing Cards to Semantic Web 1
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Scientific Open-access Literature Archive and Repository Archeologia e Calcolatori 20, 2009, 111-128 REPRESENTING KNOWLEDGE IN ARCHAEOLOGY: FROM CATALOGUING CARDS TO SEMANTIC WEB 1. Introduction Representing knowledge is the basis of any catalogue. The Italian Ca- talogue was based on very valuable ideas, developed in the late 1880s, with the basic concept of putting objects in their context. Initially cataloguing was done manually, with typewritten cards. The advent of computers led to some early experimentations, and subsequently to the de�nition of a more formalized representation schema. The web produced a cultural revolution, and made the need for technological and semantic interoperability more evi- dent. The Semantic Web scenario promises to allow a sharing of knowledge, which will make the knowledge that remained unexpressed in the traditional environments available to any user, and allow objects to be placed in their cultural context. In this paper we will brie�y recall the principles of cataloguing and lessons learned by early computer experiences. Subsequently we will descri- be the object model for archaeological items, discussing its strong and weak points. In section 5 we present the web scenario and approaches to represent knowledge. 2. Cataloguing: history and principles The Italian Catalogue of cultural heritage has its roots in the experiences and concepts, developed in the late 1880s and early 1900s, by the famous art historian Adolfo Venturi, who was probably one of the �rst scholars to think explicitly in terms of having a frame of reference to describe works of art, emphasizing as the main issue the context in which the work had been produced.
    [Show full text]
  • Download Slides
    a platform for all that we know savas parastatidis http://savas.me savasp transition from web to apps increasing focus on information (& knowledge) rise of personal digital assistants importance of near-real time processing http://aptito.com/blog/wp-content/uploads/2012/05/smartphone-apps.jpg today... storing computing computers are huge amounts great tools for of data managing indexing example google and microsoft both have copies of the entire web (and more) for indexing purposes tomorrow... storing computing computers are huge amounts great tools for of data managing indexing acquisition discovery aggregation organization we would like computers to of the world’s information also help with the automatic correlation analysis and knowledge interpretation inference data information knowledge intelligence wisdom expert systems watson freebase wolframalpha rdbms google now web indexing data is symbols (bits, numbers, characters) information adds meaning to data through the introduction of relationship - it answers questions such as “who”, “what”, “where”, and “when” knowledge is a description of how the world works - it’s the application of data and information in order to answer “how” questions G. Bellinger, D. Castro, and A. Mills, “Data, Information, Knowledge, and Wisdom,” Inform. pp. 1–4, 2004 web – the data platform web – the information platform web – the knowledge platform foundation for new experiences “wisdom is not a product of schooling but of the lifelong attempt to acquire it” representative examples wolframalpha watson source:
    [Show full text]
  • Knowledge Extraction for Hybrid Question Answering
    KNOWLEDGEEXTRACTIONFORHYBRID QUESTIONANSWERING Von der Fakultät für Mathematik und Informatik der Universität Leipzig angenommene DISSERTATION zur Erlangung des akademischen Grades Doctor rerum naturalium (Dr. rer. nat.) im Fachgebiet Informatik vorgelegt von Ricardo Usbeck, M.Sc. geboren am 01.04.1988 in Halle (Saale), Deutschland Die Annahme der Dissertation wurde empfohlen von: 1. Professor Dr. Klaus-Peter Fähnrich (Leipzig) 2. Professor Dr. Philipp Cimiano (Bielefeld) Die Verleihung des akademischen Grades erfolgt mit Bestehen der Verteidigung am 17. Mai 2017 mit dem Gesamtprädikat magna cum laude. Leipzig, den 17. Mai 2017 bibliographic data title: Knowledge Extraction for Hybrid Question Answering author: Ricardo Usbeck statistical information: 10 chapters, 169 pages, 28 figures, 32 tables, 8 listings, 5 algorithms, 178 literature references, 1 appendix part supervisors: Prof. Dr.-Ing. habil. Klaus-Peter Fähnrich Dr. Axel-Cyrille Ngonga Ngomo institution: Leipzig University, Faculty for Mathematics and Computer Science time frame: January 2013 - March 2016 ABSTRACT Over the last decades, several billion Web pages have been made available on the Web. The growing amount of Web data provides the world’s largest collection of knowledge.1 Most of this full-text data like blogs, news or encyclopaedic informa- tion is textual in nature. However, the increasing amount of structured respectively semantic data2 available on the Web fosters new search paradigms. These novel paradigms ease the development of natural language interfaces which enable end- users to easily access and benefit from large amounts of data without the need to understand the underlying structures or algorithms. Building a natural language Question Answering (QA) system over heteroge- neous, Web-based knowledge sources requires various building blocks.
    [Show full text]
  • Datatone: Managing Ambiguity in Natural Language Interfaces for Data Visualization Tong Gao1, Mira Dontcheva2, Eytan Adar1, Zhicheng Liu2, Karrie Karahalios3
    DataTone: Managing Ambiguity in Natural Language Interfaces for Data Visualization Tong Gao1, Mira Dontcheva2, Eytan Adar1, Zhicheng Liu2, Karrie Karahalios3 1University of Michigan, 2Adobe Research 3University of Illinois, Ann Arbor, MI San Francisco, CA Urbana Champaign, IL fgaotong,[email protected] fmirad,[email protected] [email protected] ABSTRACT to be both flexible and easy to use. General purpose spread- Answering questions with data is a difficult and time- sheet tools, such as Microsoft Excel, focus largely on offer- consuming process. Visual dashboards and templates make ing rich data transformation operations. Visualizations are it easy to get started, but asking more sophisticated questions merely output to the calculations in the spreadsheet. Asking often requires learning a tool designed for expert analysts. a “visual question” requires users to translate their questions Natural language interaction allows users to ask questions di- into operations on the spreadsheet rather than operations on rectly in complex programs without having to learn how to the visualization. In contrast, visual analysis tools, such as use an interface. However, natural language is often ambigu- Tableau,1 creates visualizations automatically based on vari- ous. In this work we propose a mixed-initiative approach to ables of interest, allowing users to ask questions interactively managing ambiguity in natural language interfaces for data through the visualizations. However, because these tools are visualization. We model ambiguity throughout the process of often intended for domain specialists, they have complex in- turning a natural language query into a visualization and use terfaces and a steep learning curve. algorithmic disambiguation coupled with interactive ambigu- Natural language interaction offers a compelling complement ity widgets.
    [Show full text]
  • Directions in AI Research and Applications at Siemens Corporate
    AI Magazine Volume 11 Number 1 (1991)(1990) (© AAAI) Research in Progress of linguistic phenomena. The com- Directions in AI Research putational work concerns questions of adequate processing models and algorithms, as embodied in the and Applications at actual interfaces being developed. These topics are explored in the Siemens Corporate Research framework of three projects: The nat- ural language consulting dialogue and Development project (Wisber) takes up research- oriented topics (descriptive grammar formalisms and linguistically adequate grammar specification, handling of Wolfram Buettner, Klaus Estenfeld, discourse, and so on), the data-access Hans Haugeneder, and Peter Struss project (Sepp) focuses on the practi- cal application of state-of-the-art technology, and the work on gram- mar-development environments ■ Many barriers exist today that prevent particular, Prolog extensions; and (4) (Ape) centers on the problem of ade- effective industrial exploitation of current design and analysis of neural networks. quate tools for specifying linguistic and future AI research. These barriers can The lab’s 26 researchers are orga- knowledge sources and efficient pro- only be removed by people who are work- nized into four groups corresponding cessing methods. ing at the scientific forefront in AI and to these areas. Together, they provide Wisber is jointly funded by the know potential industrial needs. Siemens with innovative software The Knowledge Processing Laboratory’s German government and several research and development concentrates in technologies, appropriate applications, industrial and academic partners. the following areas: (1) natural language prototypical implementations of AI The goal of the project is to develop interfaces to knowledge-based systems and systems, and evaluations of new a knowledge-based advice-giving databases; (2) theoretical and experimen- techniques and trends.
    [Show full text]
  • A Framework for Ontology-Based Library Data Generation, Access and Exploitation
    Universidad Politécnica de Madrid Departamento de Inteligencia Artificial DOCTORADO EN INTELIGENCIA ARTIFICIAL A framework for ontology-based library data generation, access and exploitation Doctoral Dissertation of: Daniel Vila-Suero Advisors: Prof. Asunción Gómez-Pérez Dr. Jorge Gracia 2 i To Adelina, Gustavo, Pablo and Amélie Madrid, July 2016 ii Abstract Historically, libraries have been responsible for storing, preserving, cata- loguing and making available to the public large collections of information re- sources. In order to classify and organize these collections, the library commu- nity has developed several standards for the production, storage and communica- tion of data describing different aspects of library knowledge assets. However, as we will argue in this thesis, most of the current practices and standards available are limited in their ability to integrate library data within the largest information network ever created: the World Wide Web (WWW). This thesis aims at providing theoretical foundations and technical solutions to tackle some of the challenges in bridging the gap between these two areas: library science and technologies, and the Web of Data. The investigation of these aspects has been tackled with a combination of theoretical, technological and empirical approaches. Moreover, the research presented in this thesis has been largely applied and deployed to sustain a large online data service of the National Library of Spain: datos.bne.es. Specifically, this thesis proposes and eval- uates several constructs, languages, models and methods with the objective of transforming and publishing library catalogue data using semantic technologies and ontologies. In this thesis, we introduce marimba-framework, an ontology- based library data framework, that encompasses these constructs, languages, mod- els and methods.
    [Show full text]
  • Ontologies to Interpret Remote Sensing Images: Why Do We Need Them?
    Ontologies to interpret remote sensing images : why do we need them? Damien Arvor, Mariana Belgiu, Zoe Falomir, Isabelle Mougenot, Laurent Durieux To cite this version: Damien Arvor, Mariana Belgiu, Zoe Falomir, Isabelle Mougenot, Laurent Durieux. Ontologies to inter- pret remote sensing images : why do we need them?. GIScience and Remote Sensing, Taylor & Francis: STM, Behavioural Science and Public Health Titles, 2019, pp.1-29. 10.1080/15481603.2019.1587890. halshs-02079438 HAL Id: halshs-02079438 https://halshs.archives-ouvertes.fr/halshs-02079438 Submitted on 26 Mar 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. GIScience & Remote Sensing ISSN: 1548-1603 (Print) 1943-7226 (Online) Journal homepage: https://www.tandfonline.com/loi/tgrs20 Ontologies to interpret remote sensing images: why do we need them? Damien Arvor, Mariana Belgiu, Zoe Falomir, Isabelle Mougenot & Laurent Durieux To cite this article: Damien Arvor, Mariana Belgiu, Zoe Falomir, Isabelle Mougenot & Laurent Durieux (2019): Ontologies to interpret remote sensing images: why do we need them?, GIScience & Remote Sensing To link to this article: https://doi.org/10.1080/15481603.2019.1587890 © 2019 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.
    [Show full text]
  • Question Answering
    Question Answering What is Ques+on Answering? Dan Jurafsky Ques%on Answering One of the oldest NLP tasks (punched card systems in 1961) Simmons, Klein, McConlogue. 1964. Indexing and ( ( Dependency Logic for Answering English Quesons. !"#$%&' )&*#'%+,-.'$/#0$ American Documentaon 15:30, 196-204 What do worms eat? Worms eat grass Horses with worms eat grass worms horses worms eat with eat eat grass worms grass what Birds eat worms Grass is eaten by worms birds worms eat eat 2 worms grass Dan Jurafsky Ques%on Answering: IBM’s Watson • Won Jeopardy on February 16, 2011! WILLIAM WILKINSON’S “AN ACCOUNT OF THE PRINCIPALITIES OF WALLACHIA AND MOLDOVIA” Bram Stoker INSPIRED THIS AUTHOR’S MOST FAMOUS NOVEL 3 Dan Jurafsky Apple’s Siri 4 Dan Jurafsky Wolfram Alpha 5 Dan Jurafsky Types of Ques%ons in Modern Systems • Factoid ques+ons • Who wrote “The Universal Declara4on of Human Rights”? • How many calories are there in two slices of apple pie? • What is the average age of the onset of au4sm? • Where is Apple Computer based? • Complex (narrave) ques+ons: • In children with an acute febrile illness, what is the efficacy of acetaminophen in reducing fever? • What do scholars think about Jefferson’s posi4on on dealing with pirates? 6 Dan Jurafsky Commercial systems: mainly factoid quesons Where is the Louvre Museum located? In Paris, France What’s the abbreviaon for limited L.P. partnership? What are the names of Odin’s ravens? Huginn and Muninn What currency is used in China? The yuan What kind of nuts are used in marzipan? almonds What instrument does
    [Show full text]
  • Ques+On Answering
    Queson Answering Debapriyo Majumdar Information Retrieval – Spring 2015 Indian Statistical Institute Kolkata Adapted from slides by Dan Jurafsky (Stanford) and Tao Yang (UCSB) Queson Answering One of the oldest NLP tasks (punched card systems in 1961) Simmons, Klein, McConlogue. 1964. Indexing and Dependency Logic for Answering English Ques%on: Poten%al-Answers: Questions. American Documentation 15:30, 196-204 What do worms eat? Worms eat grass Horses with worms eat grass worms horses worms eat with eat eat grass worms grass what Birds eat worms Grass is eaten by worms birds worms eat eat worms grass 2 Ques%on Answering: IBM’s Watson § Won Jeopardy on February 16, 2011! WILLIAM WILKINSON’S “AN ACCOUNT OF THE PRINCIPALITIES OF WALLACHIA AND MOLDOVIA” Bram Stoker INSPIRED THIS AUTHOR’S MOST FAMOUS NOVEL 3 Apple’s Siri § A seemingly “limited” set of few possible questions § Answers based on contextual parameters 4 Wolfram Alpha, Google 5 Wolfram Alpha But in this case, Google returns a standard list of document links 6 Types of Ques%ons in Modern Systems § Factoid questions – Answers are short – The question can be rephrased as “fill in the blanks” question Examples: – Who directed the movie Titanic? – How many calories are there in two slices of apple pie? – Where is Louvre museum located? § Complex (narrative) questions: – What precautionary measures should we take to be safe from swine flu? – What do scholars think about Jefferson’s position on dealing with pirates? 7 Paradigms for QA § IR-based approaches – TREC QA Track (AskMSR, ISI,
    [Show full text]
  • The Negative Impact of A-Box Materialization on Rdf2vec Knowledge Graph Embeddings
    More is not Always Better: The Negative Impact of A-box Materialization on RDF2vec Knowledge Graph Embeddings Andreea Ianaa, Heiko Paulheima aData and Web Science Group, University of Mannheim, Germany Abstract RDF2vec is an embedding technique for representing knowledge graph entities in a continuous vector space. In this paper, we investigate the effect of materializing implicit A-box axioms induced by subproperties, as well as symmetric and transitive properties. While it might be a reasonable assumption that such a materialization before computing embeddings might lead to better embeddings, we conduct a set of experiments on DBpedia which demonstrate that the materialization actually has a negative effect on the performance of RDF2vec. In our analysis, we argue that despite the huge body of work devotedon completing missing information in knowledge graphs, such missing implicit information is actually a signal, not a defect, and we show examples illustrating that assumption. Keywords RDF2Vec, Embedding, Reasoning, Knowledge Graph Completion, A-box Materialization 1. Introduction A straightforward assumption is that completing miss- ing knowledge in a knowledge graph before computing RDFvec [1] was originally conceived for exploiting knowl- node representations will lead to better results. However, edge graphs in data mining. Since most popular data in this paper, we show that the opposite actually holds: mining tools require a feature vector representation of completing the knowledge graph before computing an records, various techniques have been proposed for cre- RDF2vec embedding actually leads to worse results in ating vector space representations from subgraphs, in- downstream tasks. cluding adding datatype properties as features or creat- ing binary features for types [2].
    [Show full text]
  • Question Answering
    Queson Answering Evangelos Kanoulas [email protected] Question answering EVI Siri Google (Amazon) (Apple) Question answering Question answering h7p://youtu.be/WFR3lOm_xhE?t=20s Connecons to Related Fields • Informaon retrieval • Natural language processing • Databases • Machine learning • Ar%ficial intelligence Queson Answering Types of Ques%ons in Modern Systems • Factoid quesons – Who wrote “The Universal Declaraon of Human Rights”? – How many calories are there in two slices of apple pie? – What is the average age of the onset of au%sm? – Where is Apple Computer based? • Complex (narrave) ques%ons: – In children with an acute febrile illness, what is the efficacy of acetaminophen in reducing fever? – What do scholars think about Jefferson’s posi%on on dealing with pirates? Commercial systems: mainly factoid quesons Where is the Louvre Museum In Paris, France located? What’s the abbreviaon for L.P. limited partnership? What are the names of Odin’s Huginn and Muninn ravens? What currency is used in China? The yuan What kind of nuts are used in almonds marzipan? What instrument does Max drums Roach play? What is the telephone number 650-723-2300 Paradigms for QA • IR-based approaches – TREC; Google • Knowledge-based approaches – Apple Siri; Wolfram Alpha; Amazon Evi • Hybrid approaches – IBM Watson Many ques%ons can already be answered by web search • a IR-based Ques%on Answering • a IR-based Factoid QA Document DocumentDocumentDocument Document Document Indexing Answer Passage Question Retrieval Processing Docume Document Query Document Document
    [Show full text]