Main Conference Program

Monday, June 5

9:00–9:10 Opening Session
9:10–10:10 Keynote Speaker I: Joshua Goodman
    Email and Spam and Spim and Spat
10:10–10:40 Break

Machine Translation I

10:40–11:05 Capitalizing Machine Translation
    Wei Wang, Kevin Knight and Daniel Marcu
11:05–11:30 Do we need phrases? Challenging the conventional wisdom in Statistical Machine Translation
    Chris Quirk and Arul Menezes
11:30–11:55 Improved Statistical Machine Translation Using Paraphrases
    Chris Callison-Burch, Philipp Koehn and Miles Osborne
11:55–12:20 Segment Choice Models: Feature-Rich Models for Global Distortion in Statistical Machine Translation
    Roland Kuhn, Denis Yuen, Michel Simard, Patrick Paul, George Foster, Eric Joanis and Howard Johnson

Inference and Entailment

10:40–11:05 Effectively Using Syntax for Recognizing False Entailment
    Rion Snow, Lucy Vanderwende and Arul Menezes
11:05–11:30 Learning to recognize features of valid textual entailments
    Bill MacCartney, Trond Grenager, Marie-Catherine de Marneffe, Daniel Cer and Christopher D. Manning
11:30–11:55 Acquisition of Verb Entailment from Text
    Viktor Pekar
11:55–12:20 Acquiring Inference Rules with Temporal Constraints by Using Japanese Coordinated Sentences and Noun-Verb Co-occurrences
    Kentaro Torisawa

Named Entity Recognition

10:40–11:05 Role of Local Context in Automatic Deidentification of Ungrammatical, Fragmented Text
    Tawanda Sibanda and Ozlem Uzuner
11:05–11:30 Exploiting Domain Structure for Named Entity Recognition
    Jing Jiang and ChengXiang Zhai
11:30–11:55 Named Entity Transliteration and Discovery from Multilingual Comparable Corpora
    Alexandre Klementiev and Dan Roth
11:55–12:20 Reducing Weight Undertraining in Structured Discriminative Learning
    Charles Sutton, Michael Sindelar and Andrew McCallum

12:20–1:50 Lunch

Short Papers: Machine Translation, Multi-Lingual Speech

1:50–2:05 Spectral Clustering for Example Based Machine Translation
    Rashmi Gangadharaiah, Ralf Brown and Jaime Carbonell
2:05–2:20 Bridging the Inflection Morphology Gap for Arabic Statistical Machine Translation
    Andreas Zollmann, Ashish Venugopal and Stephan Vogel
2:20–2:35 Arabic Preprocessing Schemes for Statistical Machine Translation
    Nizar Habash and Fatiha Sadat
2:35–2:50 Thai Grapheme-Based Speech Recognition
    Paisarn Charoenpornsawat, Sanjika Hewavitharana and Tanja Schultz
2:50–3:05 Story Segmentation of Broadcast News in English, Mandarin and Arabic
    Andrew Rosenberg and Julia Hirschberg
3:05–3:20 Word Pronunciation Disambiguation using the Web
    Eiichiro Sumita and Fumiaki Sugaya

Short Papers: Discourse/Dialogue

1:50–2:05 Agreement/Disagreement Classification: Exploiting Unlabeled Data using Contrast Classifiers
    Sangyun Hahn, Richard Ladner and Mari Ostendorf
2:05–2:20 Using Phrasal Patterns to Identify Discourse Relations
    Manami Saito, Kazuhide Yamamoto and Satoshi Sekine
2:20–2:35 Evaluating Centering for Sentence Ordering in Two New Domains
    Nikiforos Karamanis
2:35–2:50 Computational Modelling of Structural Priming in Dialogue
    David Reitter, Frank Keller and Johanna D. Moore
2:50–3:05 Museli: A Multi-Source Evidence Integration Approach to Topic Segmentation of Spontaneous Dialogue
    Jaime Arguello and Carolyn Rose
3:05–3:20 Automatic Recognition of Personality in Conversation
    François Mairesse and Marilyn Walker

Short Papers: Retrieval, Language Models

1:50–2:05 Using the Web to Disambiguate Acronyms
    Eiichiro Sumita and Fumiaki Sugaya
2:05–2:20 Lycos Retriever: An Information Fusion Engine
    Brian Ulicny
2:20–2:35 BioEx: A Novel User-Interface that Accesses Images from Abstract Sentences
    Hong Yu and Minsuk Lee
2:35–2:50 Selecting relevant text subsets from web-data for building topic specific language models
    Abhinav Sethy, Panayiotis Georgiou and Shrikanth Narayanan
2:50–3:05 Factored Neural Language Models
    Andrei Alexandrescu and Katrin Kirchhoff
3:05–3:20 Quantitative Methods for Classifying Writing Systems
    Gerald Penn and Travis Choma

3:20–3:50 Break

Word Alignment

3:50–4:15 A Maximum Entropy Approach to Combining Word Alignments
    Necip Fazil Ayan and Bonnie J. Dorr
4:15–4:40 Alignment by Agreement
    Percy Liang, Ben Taskar and Dan Klein
4:40–5:05 Word Alignment via Quadratic Assignment
    Simon Lacoste-Julien, Ben Taskar, Dan Klein and Michael I. Jordan

Semantics I

3:50–4:15 An Empirical Study of the Behavior of Active Learning for Word Sense Disambiguation
    Jinying Chen, Andrew Schein, Lyle Ungar and Martha Palmer
4:15–4:40 Unknown word sense detection as outlier detection
    Katrin Erk
4:40–5:05 Understanding Temporal Expressions in Emails
    Benjamin Han, Donna Gates and Lori Levin

Parsing I

3:50–4:15 Partial Training for a Lexicalized-Grammar Parser
    Stephen Clark and James Curran
4:15–4:40 Effective Self-Training for Parsing
    David McClosky, Eugene Charniak and Mark Johnson
4:40–5:05 Multilingual Dependency Parsing using Bayes Point Machines
    Simon Corston-Oliver, Anthony Aue, Kevin Duh and Eric Ringger

Tuesday, June 6

Parsing II

9:00–9:25 Multilevel Coarse-to-Fine PCFG Parsing
    Eugene Charniak, Mark Johnson, Micha Elsner, Joseph Austerweil, David Ellis, Isaac Haxton, Catherine Hill, R. Shrivaths, Jeremy Moore, Michael Pozar and Theresa Vu
9:25–9:50 A Fully-Lexicalized Probabilistic Model for Japanese Syntactic and Case Structure Analysis
    Daisuke Kawahara and Sadao Kurohashi
9:50–10:15 Fully Parsing the Penn Treebank
    Ryan Gabbard, Seth Kulick and Mitchell Marcus

Discourse

9:00–9:25 Exploiting Semantic Role Labeling, WordNet and Wikipedia for Coreference Resolution
    Simone Paolo Ponzetto and Michael Strube
9:25–9:50 Identifying and Analyzing Judgment Opinions
    Soo-Min Kim and Eduard Hovy
9:50–10:15 Learning to Detect Conversation Focus of Threaded Discussions
    Donghui Feng, Erin Shaw, Jihie Kim and Eduard Hovy

Spoken and Acoustic Aspects of Language

9:00–9:25 Towards Automatic Scoring of Non-Native Spontaneous Speech
    Klaus Zechner and Isaac Bejar
9:25–9:50 Unsupervised and Semi-supervised Learning of Tone and Pitch Accent
    Gina-Anne Levow
9:50–10:15 Learning Pronunciation Dictionaries: Language Complexity and Word Selection Strategies
    John Kominek and Alan W Black

10:15–10:45 Break

Machine Translation II

10:45–11:10 Relabeling Syntax Trees to Improve Syntax-Based Machine Translation Quality
    Bryant Huang and Kevin Knight
11:10–11:35 Grammatical Machine Translation
    Stefan Riezler and John T. Maxwell III
11:35–12:00 Synchronous Binarization for Machine Translation
    Hao Zhang, Liang Huang, Daniel Gildea and Kevin Knight

Dialogue

10:45–11:10 Modelling User Satisfaction and Student Learning in a Spoken Dialogue Tutoring System with Generic, Tutoring, and User Affect Parameters
    Kate Forbes-Riley and Diane Litman
11:10–11:35 Comparing the Utility of State Features in Spoken Dialogue Using Reinforcement Learning
    Joel Tetreault and Diane Litman
11:35–12:00 Backoff Model Training using Partially Observed Data: Application to Dialog Act Tagging
    Gang Ji and Jeff Bilmes

Relation Extraction

10:45–11:10 Exploring Syntactic Features for Relation Extraction using a Convolution Tree Kernel
    Min Zhang, Jie Zhang and Jian Su
11:10–11:35 Integrating Probabilistic Extraction Models and Data Mining to Discover Relations and Patterns in Text
    Aron Culotta, Andrew McCallum and Jonathan Betz
11:35–12:00 Preemptive Information Extraction using Unrestricted Relation Discovery
    Yusuke Shinyama and Satoshi Sekine

12:00–1:30 Lunch

Best Paper And Plenary Demo Presentations

1:30–2:00 Probabilistic Context-Free Grammar Induction Based on Structural Zeros
    Mehryar Mohri and Brian Roark
2:00–2:30 Prototype-Driven Learning for Sequence Models
    Aria Haghighi and Dan Klein
2:30–3:00 Plenary demos
    InfoMagnets: Making Sense of Corpus Data
        Jaime Arguello and Carolyn Rose
    Question Answering with Web, Mobile and Speech Interfaces
        Edward Whittaker, Joanna Mrozinski and Sadaoki Furui
    From Pipedreams to Products and Promise!
        Janet Baker and Patri Pugliese

3:00–3:15 Break

Posters and Demos

3:15–5:15 Posters
    A Comparison of Tagging Strategies for Statistical Information Extraction
        Christian Siefkes
    A Finite-State Model of Georgian Verbal Morphology
        Olga Gurevich
    A Maximum Entropy Framework that Integrates Word Dependencies and Grammatical Relations for Reading Comprehension
        Kui Xu, Helen Meng and Fuliang Weng
    Answering the question you wish they had asked: The impact of paraphrasing for Question Answering
        Pablo Duboue and Jennifer Chu-Carroll
    Comparing the roles of textual, acoustic and spoken-language features on spontaneous-conversation summarization
        Xiaodan Zhu and Gerald Penn
    Evolving optimal inspectable strategies for spoken dialogue systems
        Dave Toney, Johanna Moore and Oliver Lemon
    Exploiting Variant Corpora for Machine Translation
        Michael Paul and Eiichiro Sumita
    Gesture Improves Coreference Resolution
        Jacob Eisenstein and Randall Davis
    Illuminating Trouble Tickets with Sublanguage Theory
        Svetlana Symonenko, Steven Rowe and Elizabeth D. Liddy
    Improved Affinity Graph Based Multi-Document Summarization
        Xiaojun Wan and Jianwu Yang
    Investigating Cross-Language Speech Retrieval for a Spontaneous Conversational Speech Collection
        Diana Inkpen, Muath Alzghool, Gareth Jones and Douglas Oard
    Measuring Semantic Relatedness Using People and WordNet
        Beata Beigman Klebanov
    MMR-based Active Machine Learning for Bio Named Entity Recognition
        Seokhwan Kim, Yu Song, Kyungduk Kim, Jeong-Won Cha and Gary Geunbae Lee
    NER Systems that Suit User’s Preferences: Adjusting

