Book of Abstracts

xx Table of Contents Session O1 - Machine Translation & Evaluation . 1 Session O2 - Semantics & Lexicon (1) . 2 Session O3 - Corpus Annotation & Tagging . 3 Session O4 - Dialogue . 4 Session P1 - Anaphora, Coreference . 5 Session: Session P2 - Collaborative Resource Construction & Crowdsourcing . 7 Session P3 - Information Extraction, Information Retrieval, Text Analytics (1) . 9 Session P4 - Infrastructural Issues/Large Projects (1) . 11 Session P5 - Knowledge Discovery/Representation . 13 Session P6 - Opinion Mining / Sentiment Analysis (1) . 14 Session P7 - Social Media Processing (1) . 16 Session O5 - Language Resource Policies & Management . 17 Session O6 - Emotion & Sentiment (1) . 18 Session O7 - Knowledge Discovery & Evaluation (1) . 20 Session O8 - Corpus Creation, Use & Evaluation (1) . 21 Session P8 - Character Recognition and Annotation . 22 Session P9 - Conversational Systems/Dialogue/Chatbots/Human-Robot Interaction (1) . 23 Session P10 - Digital Humanities . 25 Session P11 - Lexicon (1) . 26 Session P12 - Machine Translation, SpeechToSpeech Translation (1) . 28 Session P13 - Semantics (1) . 30 Session P14 - Word Sense Disambiguation . 33 Session O9 - Bio-medical Corpora . 34 Session O10 - MultiWord Expressions . 35 Session O11 - Time & Space . 36 Session O12 - Computer Assisted Language Learning . 37 Session P15 - Annotation Methods and Tools . 38 Session P16 - Corpus Creation, Annotation, Use (1) . 41 Session P17 - Emotion Recognition/Generation . 43 Session P18 - Ethics and Legal Issues . 45 Session P19 - LR Infrastructures and Architectures . 46 xxi Session I-O1: Industry Track - Industrial systems . 48 Session O13 - Paraphrase & Semantics . 49 Session O14 - Emotion & Sentiment (2) . 50 Session O15 - Semantics & Lexicon (2) . 51 Session O16 - Bilingual Speech Corpora & Code-Switching . 52 Session P20 - Bibliometrics, Scientometrics, Infometrics . 54 Session P21 - Discourse Annotation, Representation and Processing (1) . 55 Session P22 - Evaluation Methodologies . 57 Session P23 - Information Extraction, Information Retrieval, Text Analytics (2) . 59 Session P24 - Multimodality . 61 Session P25 - Parsing, Syntax, Treebank (1) . 64 Session O17 - Evaluation Methodologies . 66 Session O18 - Semantics . 67 Session O19 - Information Extraction & Neural Networks . 68 Session O20 - Dialogue, Emotion, Multimodality . 69 Session: I-P1 - Industry Track - Industrial Systems . 70 Session P26 - Language Acquisition & CALL (1) . 71 Session P27 - Less-Resourced/Endangered Languages (1) . 73 Session P28 - Lexicon (2) . 74 Session P29 - Linked Data . 76 Session P30 - Infrastructural Issues/Large Projects (2) . 77 Session P31 - MultiWord Expressions & Collocations . 78 Session I-O2: Industry Track - Human computation in industry . 81 Session O21 - Discourse & Argumentation . 82 Session O22 - Less-Resourced & Ancient Languages . 83 Session O23 - Semantics & Evaluation . 84 Session O24 - Multimodal & Written Corpora . 85 Session P32 - Document Classification, Text Categorisation (1) . 86 Session P33 - Morphology (1) . 88 Session P34 - Opinion Mining / Sentiment Analysis (2) . 89 Session P35 - Session Phonetic Databases, Phonology . 92 Session P36 - Question Answering and Machine Reading . 92 Session P37 - Social Media Processing (2) . 95 Session P38 - Speech Resource/Database (1) . 97 Session O25 - Social Media & Evaluation . 100 Session O26 - Standards, Validation, Workflows . 101 Session O27 - Treebanks & Parsing . 102 Session O28 - Morphology & Lexicons . 103 Session P39 - Conversational Systems/Dialogue/Chatbots/Human-Robot Interaction (2) . 104 Session P40 - Language Modelling . 106 xxii Session P41 - Natural Language Generation . 107 Session P42 - Semantics (2) . 109 Session P43 - Speech Processing . 111 Session P44 - Summarisation . 114 Session P45 - Textual Entailment and Paraphrasing . 116 Session O29 - Language Resource Infrastructures . 116 Session O30 - Digital Humanities & Text Analytics . 118 Session O31 - Crowdsourcing & Collaborative Resource Construction . 119 Session O32 - Less-Resourced Languages Speech & Multimodal Corpora . 120 Session P46 - Dialects . 121 Session P47 - Document Classification, Text Categorisation (2) . 123 Session P48 - Information Extraction, Information Retrieval, Text Analytics (3) . 124 Session P49 - Machine Translation, SpeechToSpeech Translation (2) . 126 Session P50 - Morphology (2) . 129 Session P51 - Multilinguality . 130 Session P52 - Part-of-Speech Tagging . 131 Session O33 - Lexicon . 133 Session O34 - Knowledge Discovery . 134 Session O35 - Multilingual Corpora & Machine Translation . 135 Session O36 - Corpus Creation, Use & Evaluation (2) . 136 Session P53 - Conversational Systems/Dialogue/Chatbots/Human-Robot Interaction (3) . 137 Session.

Book of Abstracts

An Analysis of Sentence Segmentation Features For

Talk Bank: a Multimodal Database of Communicative Interaction

Machine-Translation Inspired Reordering As Preprocessing for Cross-Lingual Sentiment Analysis

Sentence Boundary Detection for Handwritten Text Recognition Matthias Zimmermann

A Bayesian Framework for Word Segmentation: Exploring the Effects of Context

Multiple Segmentations of Thai Sentences for Neural Machine Translation

The Main Features of the E-Glava Online Valency Dictionary

A Topic-Aligned Multilingual Corpus of Wikipedia Articles for Studying Information Asymmetry in Low Resource Languages

Towards a Korean Dbpedia and an Approach for Complementing the Korean Wikipedia Based on Dbpedia

The Culture of Wikipedia

Student Research Workshop Associated with RANLP 2011, Pages 1–8, Hissar, Bulgaria, 13 September 2011

Multilingvální Systémy Rozpoznávání Řeči a Jejich Efektivní Učení