Interoperation in Complex Information Ecosystems
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Panel: Large Knowledge Bases
From: AAAI Technical Report SS-02-06. Compilation copyright © 2002, AAAI (www.aaai.org). All rights reserved. Panel: Large KnowledgeBases AdamPease (Teknowledge, chair), Chris Welty (Vassar College), Pat Hayes (U. West Florida), Anthony G. Cohn (U. Leeds), Ken Murray (SRI) Fellbaum, C. (1998). WordNet, An Electronic Lexical Database. MITPress. Abstract Lenat, D., 1995, "Cyc: A Large-Scale Investment in It is estimated that 1-2 exabytes of data is now being KnowledgeInfrastructure". Communicationsof the ACM generated each year, almost all of it in purely digital form 38, no. 1 !, November.See also http://www.c¥c.com (Lymanet. ai. 2000). Properly structured, this information could form a global knowledge base. Currently however, Lyman,P., Varian, H., Dunn, J., Strygin, A., Swearingen, this information exists in manydifferent forms, manyof K., (2000). HowMuch Information?, University California, which are only suitable for humanconsumption, and which Berkeley are largely opaque to computerbased understanding. Majorefforts to build large formal ontologies or address http://www.sims.berkeley.edu/research/project.,ghow-much- issues in their construction have been undertaken funded by info the government in the US such as the DARPAKnowledge Niles, I., & Pease, A., (2001), Towarda Standard Upper Sharing Effort (Patil et al, 1992), High Performance Ontology, in Proceedings of the 2nd International KnowledgeBases (Cohen et. al., 1998), Rapid Knowledge Conference on Formal Ontology in Information Systems Formation (RKF, 2002) and in Europe including Advanced (FOIS-2001). See also http:llonlology.teknowledge.com KnowledgeTechnologies (Shadboit, 2001) and OntoWeb and http:l/suo.ieee.org (OntoWeb,2002), as international standards efforts such the IEEE Standard Upper Ontology (Niles & Pease, 2001) OntoWeb(2002). -
A Data-Driven Framework for Assisting Geo-Ontology Engineering Using a Discrepancy Index
University of California Santa Barbara A Data-Driven Framework for Assisting Geo-Ontology Engineering Using a Discrepancy Index A Thesis submitted in partial satisfaction of the requirements for the degree Master of Arts in Geography by Bo Yan Committee in charge: Professor Krzysztof Janowicz, Chair Professor Werner Kuhn Professor Emerita Helen Couclelis June 2016 The Thesis of Bo Yan is approved. Professor Werner Kuhn Professor Emerita Helen Couclelis Professor Krzysztof Janowicz, Committee Chair May 2016 A Data-Driven Framework for Assisting Geo-Ontology Engineering Using a Discrepancy Index Copyright c 2016 by Bo Yan iii Acknowledgements I would like to thank the members of my committee for their guidance and patience in the face of obstacles over the course of my research. I would like to thank my advisor, Krzysztof Janowicz, for his invaluable input on my work. Without his help and encour- agement, I would not have been able to find the light at the end of the tunnel during the last stage of the work. Because he provided insight that helped me think out of the box. There is no better advisor. I would like to thank Yingjie Hu who has offered me numer- ous feedback, suggestions and inspirations on my thesis topic. I would like to thank all my other intelligent colleagues in the STKO lab and the Geography Department { those who have moved on and started anew, those who are still in the quagmire, and those who have just begun { for their support and friendship. Last, but most importantly, I would like to thank my parents for their unconditional love. -
A Critique of Pure Reason’
151 A critique of pure reason’ DREWMCDERMOTT Yale University, New Haven, CT 06S20, U.S.A. Cornput. Intell. 3. 151-160 (1987) In 1978, Patrick Hayes promulgated the Naive Physics Man- the knowledge that programs must have before we write the ifesto. (It finally appeared as an “official” publication in programs themselves. We know what this knowledge is; it’s Hobbs and Moore 1985.) In this paper, he proposed that an all- what everybody knows, about physics, about time and space, out effort be mounted to formalize commonsense knowledge, about human relationships and behavior. If we attempt to write using first-order logic as a notation. This effort had its roots in the programs first, experience shows that the knowledge will earlier research, especially the work of John McCarthy, but the be shortchanged. The tendency will be to oversimplify what scope of Hayes’s proposal was new and ambitious. He sug- people actually know in order to get a program that works. On gested that the use of Tarskian seniantics could allow us to the other hand, if we free ourselves from the exigencies of study a large volume of knowledge-representation problems hacking, then we can focus on the actual knowledge in all its free from the confines of computer programs. The suggestion complexity. Once we have a rich theory of the commonsense inspired a small community of people to actually try to write world, we can try to embody it in programs. This theory will down all (or most) of commonsense knowledge in predictate become an indispensable aid to writing those programs. -
QUERY-DRIVEN TEXT ANALYTICS for KNOWLEDGE EXTRACTION, RESOLUTION, and INFERENCE by CHRISTAN EARL GRANT a DISSERTATION PRESENTED
QUERY-DRIVEN TEXT ANALYTICS FOR KNOWLEDGE EXTRACTION, RESOLUTION, AND INFERENCE By CHRISTAN EARL GRANT A DISSERTATION PRESENTED TO THE GRADUATE SCHOOL OF THE UNIVERSITY OF FLORIDA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY UNIVERSITY OF FLORIDA 2015 c 2015 Christan Earl Grant To Jesus my Savior, Vanisia my wife, my daughter Caliah, soon to be born son and my parents and siblings, whom I strive to impress. Also, to all my brothers and sisters battling injustice while I battled bugs and deadlines. ACKNOWLEDGMENTS I had an opportunity to see my dad, a software engineer from Jamaica work extremely hard to get a master's degree and work as a software engineer. I even had the privilege of sitting in some of his classes as he taught at a local university. Watching my dad work towards intellectual endeavors made me believe that anything is possible. I am extremely privileged to have someone I could look up to as an example of being a man, father, and scholar. I had my first taste of research when Dr. Joachim Hammer went out of his way to find a task for me on one of his research projects because I was interested in attending graduate school. After working with the team for a few weeks he was willing to give me increased responsibility | he let me attend the 2006 SIGMOD Conference in Chicago. It was at this that my eyes were opened to the world of database research. As an early graduate student Dr. Joseph Wilson exercised superhuman patience with me as I learned to grasp the fundamentals of paper writing. -
Usage-Dependent Maintenance of Structured Web Data Sets
Usage-dependent maintenance of structured Web data sets Dissertation zur Erlangung des akademischen Grades eines Doktors der Naturwissenschaften (Dr. rer. nat) am Institut f¨urInformatik des Fachbereichs Mathematik und Informatik der Freien Unviersit¨atBerlin vorgelegt von Dipl. Inform. Markus Luczak-R¨osch Berlin, August 2013 Referent: Prof. Dr.-Ing. Robert Tolksdorf (Freie Universit¨atBerlin) Erste Korreferentin: Natalya F. Noy, PhD (Stanford University) Zweite Korreferentin: Dr. rer. nat. Elena Simperl (University of Southampton) Tag der Disputation: 13.01.2014 To Johanna. To Levi, Yael and Mili. Vielen Dank, dass ich durch Euch eine Lebenseinstellung lernen durfte, \. die bereit ist, auf kritische Argumente zu h¨oren und von der Erfahrung zu lernen. Es ist im Grunde eine Einstellung, die zugibt, daß ich mich irren kann, daß Du Recht haben kannst und daß wir zusammen vielleicht der Wahrheit auf die Spur kommen." { Karl Popper Abstract The Web of Data is the current shape of the Semantic Web that gained momentum outside of the research community and becomes publicly visible. It is a matter of fact that the Web of Data does not fully exploit the primarily intended technology stack. Instead, the so called Linked Data design issues [BL06], which are the basis for the Web of Data, rely on the much more lightweight technologies. Openly avail- able structured Web data sets are at the beginning of being used in real-world applications. The Linked Data research community investigates the overall goal to approach the Web-scale data integration problem in a way that distributes efforts between three contributing stakeholders on the Web of Data { the data publishers, the data consumers, and third parties. -
Conceptualization and Visualization of Tagging and Folksonomies
Conceptualization and Visualization of Tagging and Folksonomies Von der Fakultät für Ingenieurwissenschaften, Abteilung Informatik und Angewandte Kognitionswissenschaft der Universität Duisburg-Essen zur Erlangung des akademischen Grades Doktor der Ingenieurwissenschaften (Dr.-Ing.) genehmigte Dissertation von Steffen Lohmann aus Hamburg 1. Gutachter: Prof. Dr. Maria Paloma Díaz Pérez 2. Gutachter: Prof. Dr.-Ing. Jürgen Ziegler Tag der mündlichen Prüfung: 27.11.2013 Hinweis: Diese Dissertation ist im Rahmen eines binationalen Promotionsverfahrens (Cotutelle) in Kooperation mit der Universidad Carlos III de Madrid entstanden. Abstract Tagging has become a popular indexing method for interactive systems in the past decade. It offers a simple yet effective way for users to organize an ever increasing amount of digital information for themselves and/or others. The linked user vocabulary resulting from tagging is known as folksonomy and provides a valuable source for the retrieval and exploration of digital resources. Although several models and representations of tagging have been proposed, there is no coherent conceptualization that provides a comprehensive and pre- cise description of the concepts and relationships in the domain. Furthermore, there is little systematic research in the area of folksonomy visualization, and so folksonomies are still mainly depicted as simple tag clouds. Both problems are related, as a well-defined conceptualization is an important prerequisite for the interoperable use and visualization of folksonomies. The thesis addresses these shortcomings by developing a coherent conceptualiza- tion of tagging and visualizations for the interactive exploration of folksonomies. It gives an overview and comparison of tagging models and defines key concepts of the domain. After a comprehensive review of existing tagging ontologies, a unified and coherent conceptualization is presented that incorporates the best parts of the reviewed ontologies. -
L Dataspaces Make Data Ntegration Obsolete?
DBKDA 2011, January 23-28, 2011 – St. Maarten, The Netherlands Antilles DBKDA 2011 Panel Discussion: Will Dataspaces Make Data Integration Obsolete? Moderator: Fritz Laux, Reutlingen Univ., Germany Panelists: Kazuko Takahashi, Kwansei Gakuin Univ., Japan Lena Strömbäck, Linköping Univ., Sweden Nipun Agarwal, Oracle Corp., USA Christopher Ireland, The Open Univ., UK Fritz Laux, Reutlingen Univ., Germany DBKDA 2011, January 23-28, 2011 – St. Maarten, The Netherlands Antilles The Dataspace Idea Space of Data Management Scalable Functionality and Costs far Web Search Functionality virtual Organization pay-as-you-go, Enterprise Dataspaces Admin. Portal Schema Proximity Federated first, DBMS DBMS scient. Desktop Repository Search DBMS schemaless, near unstructured high Semantic Integration low Time and Cost adopted from [Franklin, Halvey, Maier, 2005] DBKDA 2011, January 23-28, 2011 – St. Maarten, The Netherlands Antilles Dataspaces (DS) [Franklin, Halevy, Maier, 2005] is a new abstraction for Information Management ● DS are [paraphrasing and commenting Franklin, 2009] – Inclusive ● Deal with all the data of interest, in whatever form => but semantics matters ● We need access to the metadata! ● derive schema from instances? ● Discovering new data sources => The Münchhausen bootstrap problem? Theodor Hosemann (1807-1875) DBKDA 2011, January 23-28, 2011 – St. Maarten, The Netherlands Antilles Dataspaces (DS) [Franklin, Halevy, Maier, 2005] is a new abstraction for Information Management ● DS are [paraphrasing and commenting Franklin, 2009] – Co-existence -
Exploring Digital Preservation Strategies Using DLT in the Context Of
Forget-me-block Exploring digital preservation strategies using Distributed Ledger Technology in the context of personal information management By JAMES DAVID HACKMAN Department of Computer Science UNIVERSITY OF BRISTOL A dissertation submitted to the University of Bristol in accordance with the requirements of the degree of Master of Science by advanced study in Computer Science in the Faculty of Engineering. 15TH SEPTEMBER 2020 arXiv:2011.05759v1 [cs.CY] 2 Nov 2020 EXECUTIVE SUMMARY eceived wisdom portrays digital records as guaranteeing perpetuity; as the New York Times wrote a decade ago: “the web means the end of forgetting” [1]. The Rreality however is that digital records suffer similar risks of access loss as the analogue versions they replaced - but through the mechanisms of software, hardware and organisational change. The first two of these mechanisms are straightforward. Software change relates to how data is encoded - for instance later versions of Microsoft Word often cannot access documents written with earlier versions [2]. Likewise hardware formats obsolesce; even popular technologies such as the floppy disk reach a point where accessing data on these formats becomes increasingly difficult [3]. The third mechanism is however more abstract as it relates to societal structures, and ironically is often generated as a by-product of attempts to escape the first two risks. In our efforts to rid ourselves of hardware and software change these risks are often delegated to specialised external parties. Common use cases are those of conveying information to a future self, e.g. calendars, diaries, tasks, etc. These applications, categorised as Personal Information Management (PIM) [4, p. -
Inconsistency Robustness for Logic Programs Carl Hewitt
Inconsistency Robustness for Logic Programs Carl Hewitt To cite this version: Carl Hewitt. Inconsistency Robustness for Logic Programs. Inconsistency Robustness, 2015, 978-1- 84890-159. hal-01148496v6 HAL Id: hal-01148496 https://hal.archives-ouvertes.fr/hal-01148496v6 Submitted on 27 Apr 2016 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Copyright Inconsistency Robustness for Logic Programs Carl Hewitt This article is dedicated to Alonzo Church and Stanisław Jaśkowski Abstract This article explores the role of Inconsistency Robustness in the history and theory of Logic Programs. Inconsistency Robustness has been a continually recurring issue in Logic Programs from the beginning including Church's system developed in the early 1930s based on partial functions (defined in the lambda calculus) that he thought would allow development of a general logic without the kind of paradoxes that had plagued earlier efforts by Frege, etc.1 Planner [Hewitt 1969, 1971] was a kind of hybrid between the procedural and logical paradigms in that it featured a procedural embedding of logical sentences in that an implication of the form (p implies q) can be procedurally embedded in the following ways: Forward chaining When asserted p, Assert q When asserted q, Assert p Backward chaining When goal q, SetGoal p When goal p, SetGoal q Developments by different research groups in the fall of 1972 gave rise to a controversy over Logic Programs that persists to this day in the form of following alternatives: 1. -
IKRIS Scenarios Inter-Theory (ISIT)
IKRIS Scenarios Inter-Theory (ISIT) Jerry Hobbs with the IKRIS Scenarios Working Group Contributions from Danny Bobrow, Chris Deaton, Mike Gruninger, Pat Hayes, Arun Majumdar, David Martin, Drew McDermott, Sheila McIlraith, Karen Myers, David Morley, John Sowa, Marco Valtorta, and Chris Welty. IKRIS Scenarios Fundamentals Events Most ontologies dealing with processes or scenarios will have something corresponding to events. PSL uses the term "activity_occurrence" for events. Cyc uses the term "Event". Other possible terms have drawbacks. "Action" connotes an event that has an agent, so it is only a subclass of events. "Process" and "Procedure" connote complex events that have subevents, and we need a term that will cover primitive events as well. One can argue that "activity" and "activity occurrence" have the same problem. For these reasons, we have chosen the term "Events". Types and Tokens We can refer to both event types and event tokens. PSL uses the terms "activity" and "activity_occurrence" for these two. Cyc uses the terms "EventType" and "Event". Another possible pair of term is "EventClass" and "EventInstance", but this biases one toward interpretation of types as classes, which may not be desirable (see below). It may seem desirable to make the type-token distinction explicit in the terms, and thus use "EventType" and "EventToken". But as Pat Hayes has pointed out, an event token may be a token of many types of events. A specific event may be a token of a running event type, an exercising event type, a winning event type, and numerous other types. Essentially, "token" is a relation between specific events and event types. -
Data in Context: Aiding News Consumers While Taming Dataspaces
DBCrowd 2013: First VLDB Workshop on Databases and Crowdsourcing Data In Context: Aiding News Consumers while Taming Dataspaces Adam Marcus∗ , Eugene Wu, Sam Madden MIT CSAIL marcua, sirrice, madden @csail.mit.edu { } ...were it left to me to decide whether we should have a gov- reasons: 1) A lack of space or time, as is common in minute- ernment without newspapers, or newspapers without a gov- by-minute reporting 2) The article is a segment in a multi- ernment, I should not hesitate a moment to prefer the latter. part series, 3) The reader doesn’t have the assumed back- — Thomas Jefferson ground knowledge, 4) A newsroom is resources-limited and can not do additional analysis in-house, 5) The writer’s ABSTRACT agenda is better served through the lack of context, or 6) The context is not materialized in a convenient place (e.g., We present MuckRaker, a tool that provides news consumers there is no readily accessible table of historical earnings). with datasets and visualizations that contextualize facts and In some cases, the missing data is often accessible (e.g, on figures in the articles they read. MuckRaker takes advantage Wikipedia), and with enough effort, an enterprising reader of data integration techniques to identify matching datasets, can usually analyze or visualize it themselves. Ideally, all and makes use of data and schema extraction algorithms to news consumers would have tools to simplify this task. identify data points of interest in articles. It presents the Many database research results could aid readers, par- output of these algorithms to users requesting additional ticularly those related to dataspace management. -
From Web 2.0 to Web
Web 2.0 and Web 3.0 and its Applications in Library Services Dr. Jagdish Arora Advisor, NBA Email: [email protected] Evolution of library technology (adapted from Noh (2012) Library 2.0 and Library 3.0 • Web 1.0 (Tims Berner Lee) • Semantic Web - 2001 (Tims Berner Lee) • Web 2.0 (Tim O'Reilly, 2005) • Web 3.0 John Markoff, New York Times, 2006) • Web 4.0 (Distant Dream?) • Library 2.0 coined by Michael Casey in 2006 • Library 3.0 • Library 4.0 This talk is addressed to: • Librarian 2.0 • Librarian 3.0 1996 (1 L W; 50 L U) 2006 (10 Cr W; 50 L) 2016 (100 Cr W; 250 Cr U) Web 1.0 to Web 4.0: Process of Evolution Subtitle comes here Web 1.0: Read Web 2.0: Read & Write Web 3.0: Read, Write & Web 4.0: Read, Write, /Awareness / Static (2006) Execute (2016) Execute & Concur (1996) • Web Connecting • Web Connecting • Intelligent interaction • Unidirectional / People / Human Knowledge & between machines and Passive transmission Centric Participative Intelligence users • Limited Interaction Web • Semantic Web: Web of • Internet of Things (IoT) with Users • Dynamic, Interactive Data; Virtual World • Human are upgraded with • No Content and Collaborative • Social Computing technology extension Creation Creation of Information Environment (Always On) • Social Networking • Multi-directional Sites (SNS) • Semantic connections • Bi-directional • Data filtered • AI Source: .... 6 Web 2.0 Web 2.0 is About Cultivating Communities . Shared Picture Shared Video Shared News Shared Shared Bookmarks Everything • SmugMug • YouTube •Digg • Photobucket •Twitter •MySpace • Vimeo •Redddit • Google Photos • Dailymotion •Pinterest •Facebook •Delicious • Snapfish • Twitch •StumbleUpon •Twitter •Grow News • Flickr • LiveLeak •Dribble •Friendster •Newsvine • ImageShack • Break •Orkut • Metacafe •Slashdot • Fotolog •Pocket •Snapchat Its About being in the User’s Space .