Large-Scale Knowledge Representation Resources for Cognitive Science Research

Large-Scale Knowledge Representation Resources for Cognitive Science Research

Large-scale Knowledge Representation Resources for Cognitive Science Research George A. Miller Martha S. Palmer Department of Psychology, Princeton University Department of Computer and Information Science, 1-S-5 Green Hall, Princeton, NJ 08544 USA University of Pennsylvania 3330 Walnut Street, Philadelphia, PA 19104-6389 USA Charles J. Fillmore International Computer Science Institute, University of Doug Lenat California at Berkeley Cycorp, Inc. 1947 Center St., Suite 600, Berkeley, CA 94704-1198 Suite 100, 3721 Executive Center Drive, Austin, TX USA 78731 USA Pat Hayes Institute for Human and Machine Cognition 40 South Alcaniz Street, Pensacola, FL 32502 Motivation Treebank syntactic structures: The Proposition Bank, or PropBank. PropBank consists of argument labels for the One of the unique features of cognitive science has been semantic roles of individual verbs and similar predicating its emphasis on understanding how people represent and use expressions such as participial modifiers and knowledge. Unfortunately, knowledge representation can nominalizations. This talk will describe the PropBank verb be quite difficult if one is starting from scratch. Having “off semantic role annotation being done at Penn for both the shelf” representation systems that can be used for English and Chinese. The annotation process will be investigations can make new types of cognitive modeling discussed as well as the use of existing lexical resources efforts possible, at a scale that would otherwise be such as WordNet, Levin classes and VerbNet. Comparisons impossible. The participants in this symposium each with similar projects, including the FrameNet Project at represent a different approach to building such Berkeley and the Prague Tectogrammatics project, will be representational resources. made. George A. Miller: WordNet is a lexical database that Doug Lenat: Cognitive Science research could benefit from contains 146,900 English nouns, verbs, adjectives, and large-scale representational building blocks. One such adverbs that are now organized by semantic relations into building block is a broad ontology of, well, everything. 117,500 meanings, where a meaning is represented by a set Another one, resting on that, is a formal axiomatization of of synonyms (a synset) that can be used to express that most of the meaning of most of those concepts; i.e., millions meaning. An entry in WordNet consists of a synset, a of axioms about those hundreds of thousands of terms. definitional gloss, and (sometimes) one or more sentences These two pieces have been worked on for twenty years -- illustrating usage. The semantic relations used to organize and the better part of a person-century of effort -- as Cyc. words and entries are synonymy and antonymy, hyponymy, It's been highly proprietary so far, with a small tip of its troponymy and hypernymy, meronymy and holonymy. A ontology exposed as OpenCyc. Starting this summer, currently active project, the disambiguation of definitional though, Cyc is being made available in its entirety for R&D glosses, will be discussed. purposes for free, courtesy of Ron Brachman at DARPA's IPTO. In this presentation I will briefly describe this Charles J. Fillmore: The FrameNet project is morphing ResearchCyc ontology and KB, and some initial utilities from a lexicon-building project to a system capable of provided with it (inference engine, English-Cyc lexicon, providing a layer or two of semantic annotation for full interfaces), how this might fit in with the other sentences. My remarks will summarize the kinds of infrastructure elements being described in this panel, and information the FrameNet database can provide now, and how these might leverage your work. what it should be able to offer in the not too distant future, for researchers in language engineering and cognitive Pat Hayes science. FrameNet is moving toward greater coverage of Pat Hayes, the discussant for this symposium, has the the lexicon, adaptation to specialist vocabularies, more unique distinction of having inventing two of the most systematic treatment of multiword expressions, and widely-used representations for change, the situation provisions for incorporating a wide variety of (non-core) calculus (with John McCarthy) and histories. He is a grammatical constructions. Fellow of the American Association for Artificial Intelligence and the Cognitive Science Society. Martha S. Palmer: Recently, a consensus has been achieved as to a task-oriented level of semantic representation to be layered on top of the existing Penn 19.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    1 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us