IV Symposium on Languages, Applications and Technologies

June 18th and 19th, 2015
Universidad Complutense de Madrid, Spain

Editors: José-Luis Sierra-Rodríguez, José Paulo Leal, Alberto Simões

ISBN: 978-84-606-8762-7

Schedule and Table of Contents

Thursday, 18th June

Time Page

8h45   Registration
9h15   Opening Session

9h30   Keynote I (chair: José Paulo Leal)
       The role of ontologies in machine-machine communication ...... 1
       Asunción Gómez
10h40  Coffee Break

Session I: Document Processing (chair: José João Almeida)
11h00  Tree String Path Subsequences Automaton and Its Use for Indexing XML Documents ...... 3
       Eliška Šestáková and Jan Janoušek
11h25  A structural approach to assess graph-based exercises ...... 13
       José Paulo Leal and Rúben Sousa
11h50  Automatic generation of CVs from Online Social Networks ...... 23
       Sérgio Maia Dias, Alda Lopes Gançarski and Pedro Rangel Henriques
12h10  Knowledge Extraction from Requirements Specification ...... 29
       Eduardo Barra and Jorge Morato

Session II: Domain-Specific Languages (chair: Marjan Mernik)
12h30  Domain Specific Languages for Data Mining: A Case Study for Educational Data Mining ...... 35
       Alfonso de la Vega, Diego García-Saiz, Marta Zorrilla and Pablo Sanchez
12h55  WSDLUD: A Metric to Measure the Understanding Degree of WSDL Descriptions ...... 45
       M. Berón, H. Bernardis, E. Miranda, D. Riesco, M. J. Varanda Pereira and P. Henriques
13h20  Towards the generation of graphical modelling environments aided by patterns ...... 55
       Antonio Garmendia, Ana Pescador, Esther Guerra and Juan De Lara
13h40  Lunch


Session III: Tools for Natural Language Speech and Text Processing (chair: Hugo Gonçalo Oliveira)
15h00  Speech Features for Discriminating Stress Using Branch and Bound Wrapper Search ...... 61
       Mariana Julião, Jorge Silva, Ana Aguiar, Helena Moniz and Fernando Batista
15h25  Oriya Morphological Analyzer Using Lttoolbox ...... 71
       Itisree Jena, Himani Chaudhry and Dipti Misra Sharma
15h50  Yet another suite of multilingual NLP tools ...... 81
       Marcos Garcia and Pablo Gamallo
16h10  Coffee Break

Session IV: Web Technologies and Practical Cases (chair: Miguel Areias)
16h35  Odin: A Service for Gamification of Learning Activities ...... 91
       José Paulo Leal, José Carlos Paiva and Ricardo Queirós
17h00  SplineAPI: Building a REST API for NLP services ...... 101
       Nuno Vieira, Alberto Simões and Nuno Ramos Carvalho
17h25  Engaging researchers in data management with LabTablet, an electronic laboratory notebook ...... 111
       Ricardo Carvalho Amorim, João Aguiar Castro, João Rocha Da Silva and Cristina Ribeiro
19h30  Visit to Madrid Old Town “Madrid de los Austrias”
21h00  Dinner


Friday, 19th June

Time Page

Session V: Semantic and Text Classification (chair: Pablo Gamallo)
9h30   Exploiting Twitter for the Semantic Enrichment of Telecommunication Alarms ...... 117
       Hugo Gonçalo Oliveira, João Marques and Luís Cortesão
9h55   Meaning Inference of Abbreviations Appearing in Clinical Studies ...... 127
       E. Chondrogiannis, V. Andronikou, E. Karanastasis and T. Varvarigou
10h20  Experiments on Enlarging a Lexical Ontology ...... 137
       Alberto Simões and José João Almeida
10h40  Using Unstructured Profile Information for Gender Classification of Portuguese and English Twitter users ...... 143
       Marco Vicente, João Carvalho and Fernando Batista
11h00  Coffee Break

11h20  Keynote II (chair: José-Luis Sierra)
       The Application of Grammar Inference to Software Language Engineering ...... 149
       Marjan Mernik

Session VI: Human-Computer Language Processing (chair: Maria João Varanda Pereira)
12h30  Combining Processing with Racket ...... 151
       Hugo Correia and António Menezes Leitão
12h55  On Extending a Full-Sharing Multithreaded Tabling Design with Batched Scheduling ...... 163
       Miguel Areias and Ricardo Rocha
13h20  An AST-based Tool, Spector, for Plagiarism Detection: The Approach, Functionality, and Implementation ...... 173
       Vítor T. Martins, Pedro Rangel Henriques and Daniela da Cruz
13h40  Lunch


Session VII: Semantic Web and Ontologies (chair: Pedro Rangel Henriques)
15h00  Efficient representation of RDF datasets ...... 179
       Jakub Swacha and Szymon Grabowski
15h25  Reducing large semantic graphs to improve semantic relatedness ...... 189
       José Paulo Leal and Teresa Costa
15h50  Enrichment and On-demand Population of Nutritional Ontologies from XML Diets Repository ...... 199
       Vanesa Espín, Manuel Noguera and Maria V. Hurtado
16h15  Coffee Break

Session VIII: Grammars (chair: Jan Janousek)
16h35  Browsing the Parse Space ...... 209
       Daniel Rodríguez-Cerezo and José-Luis Sierra
17h00  Assessing Attribute Grammars’ Quality: metrics and a tool ...... 219
       João Cruz, Pedro Rangel Henriques and Daniela Da Cruz
17h20  A Syntax-Directed Model Transformation Framework based on Attribute Grammars ...... 225
       Antonio Sarasa-Cabezuelo and José-Luis Sierra

17h40  Lightning Talks and Demo Session (chair: Alberto Simões)

19h00 Closing Session

Preface

This volume contains the proceedings of the 4th edition of SLATE, the Symposium on Languages, Applications and Technologies, held at the Complutense University of Madrid, Spain, on 18th–19th June, 2015.

We use languages all the time. First, to communicate among ourselves. Later, to communicate with computers. And, more recently, with the advent of networks, we found a way to make computers communicate among themselves. All these forms of communication use languages, different languages that nevertheless share many similarities. In SLATE we are interested in discussing these languages.

Since languages are such a broad subject, SLATE is organized in three main tracks:

– HHL Track — Processing Human-Human Languages
  HHL is a forum dedicated to the discussion of research projects and ideas involving natural language processing and their industrial application. In 2015 this track is chaired by Alberto Simões.

– HCL Track — Processing Human-Computer Languages
  HCL is a forum where researchers, developers and educators exchange ideas and information on the latest academic or industrial work on language design, processing, assessment and applications. The SLATE’15 HCL chair is José-Luis Sierra-Rodríguez.

– CCL Track — Processing Computer-Computer Languages
  The main goal of CCL is to provide a broad space for discussion about the XML markup language: examples of usage and associated technologies. In 2015 this track is chaired by José Paulo Leal.

In this 4th edition we received a total of 40 full paper submissions and 7 short paper submissions. After a thorough peer-review process, in which each paper was reviewed by three anonymous reviewers, 17 papers were accepted as full papers (a full-paper acceptance rate of about 42%). In addition, 10 full papers were invited to be resubmitted as short papers, and 9 short papers were finally accepted for publication and presentation at the symposium (a short-paper acceptance rate of about 52%).

The SLATE’15 presentations have been divided into the following eight sessions: Document Processing (CCL), Domain-Specific Languages (HCL), Tools for Natural Language Speech and Text Processing (HHL), Web Technologies and Practical Cases (CCL), Semantic and Text Classification (HHL), Human-Computer Language Processing (HCL), Semantic Web and Ontologies (CCL), and Grammars (HCL). In addition, a Lightning Talks and Demo Session has also been included to allow late-breaking presentations and tool demos. Finally, the SLATE 2015 program also includes two keynotes: one on the application of grammar inference to software language engineering, by Marjan Mernik from the University of Maribor, Slovenia, and another on the role of ontologies in machine-machine communication, by Asunción Gómez from the Technical University of Madrid, Spain.

The organizers of SLATE 2015 want to thank the many people without whom this event would never have been possible. In particular, the UCM's Faculty of Philology for serving as the venue of the Symposium; the UCM's School for sponsoring the keynotes through its PhD Program Conference Cycle; the UCM's General Foundation for being in charge of the finances of the event; the Madrid City Council for organizing the Symposium's social activity; the Santander-UCM research grants for partially funding one of the keynotes; the members of the ILSA (Implementation of Language-Driven Software and Applications) Research Group for their collaboration in the local organization of the event; Springer for giving us the opportunity of editing a post-proceedings volume as part of the CCIS series; the COMLAN and COMSIS journals for accepting submissions of revised and extended versions of the best papers presented at the symposium; the EasyChair conference management system; the Program Committee members for spending their time reviewing the papers and writing the reports; the authors of the submitted papers for their contribution and interest in the symposium; and, finally, all the participants who came to Madrid for such a fruitful meeting.

José-Luis Sierra-Rodríguez, José Paulo Leal, Alberto Simões

Committees

General Chair

– José-Luis Sierra-Rodríguez (Universidad Complutense de Madrid, Spain)

HHL Program Committee

– Alberto Simões (Universidade do Minho, Portugal, track chair)
– António Teixeira (Universidade de Aveiro, Portugal)
– Brett Drury (Universidade de São Paulo, Brasil)
– Fernando Baptista (Instituto Universitário de Lisboa, Portugal)
– Hugo Gonçalo Oliveira (Universidade de Coimbra, Portugal)
– Jörg Tiedemann (Uppsala Universitet, Sweden)
– Jorge Baptista (Universidade do Algarve, Portugal)
– José João Almeida (Universidade do Minho, Portugal)
– Lluís Padró (Universitat Politècnica de Catalunya, Spain)
– Octavian Popescu (IBM, Watson Research Centre, NY, USA)
– Pablo Gamallo (Universidade de Santiago de Compostela, Spain)
– Thiago Pardo (Universidade de São Paulo, Brasil)
– Ulrich Heid (Universität Stuttgart, Germany)
– Xavier Gómez Guinovart (Universidade de Vigo, Spain)

HCL Program Committee

– Alda Lopes Gançarski (Institut National des Télécommunications, France)
– António Menezes Leitão (Universidade Técnica de Lisboa, Portugal)
– Bostjan Slivnik (Univerza v Ljubljani, Slovenia)
– Casiano Rodriguez-Leon (Universidad de La Laguna, Spain)
– Daniela da Cruz (Universidade do Minho, Portugal)
– Ivan Lukovic (University of Novi Sad, Serbia)
– Guido Wachsmuth (Delft University of Technology, Netherlands)
– Jan Kollar (Technical University of Kosice, Slovakia)
– Jan Janousek (Czech Technical University, Czech Republic)
– Jaroslav Poruban (Technical University of Kosice, Slovakia)
– Jean-Christophe Filliâtre (Laboratoire de Recherche en Informatique, France)
– João Paiva Cardoso (Universidade do Porto, Portugal)
– José Luis Sierra (Universidad Complutense de Madrid, Spain, track chair)
– Josep Silva (Universidad Politécnica de Valencia, Spain)
– Maria João Varanda Pereira (Instituto Politécnico de Bragança, Portugal)
– Mario Beron (Universidad Nacional de San Luis, Argentina)
– Marjan Mernik (Univerza v Mariboru, Slovenia)
– Nuno Oliveira (Universidade do Minho, Portugal)
– Nuno Ramos (Universidade do Minho, Portugal)
– Paulo Matos (Instituto Politécnico de Bragança, Portugal)
– Pedro Rangel Henriques (Universidade do Minho, Portugal)
– Ricardo Rocha (Universidade do Porto, Portugal)
– Salvador Abreu (Universidade de Évora, Portugal)
– Simão Melo de Sousa (Universidade da Beira Interior, Portugal)

CCL Program Committee

– Alda Lopes Gançarski (Institut Mines-Telecom, Telecom SudParis, CNRS UMR Samovar, France)
– Alexander Paar (TWT GmbH Science & Innovation, Germany)
– Cristina Ribeiro (Universidade do Porto, Portugal)
– Daniel Diaz (University of Paris 1 - Pantheon Sorbonne, France)
– Eugenijus Kurilovas (Centre of Information Technologies in Education, Lithuania)
– Gabriel David (Universidade do Porto, Portugal)
– Giovani Librelotto (Universidade Federal de Santa Maria, Brazil)
– João Correia Lopes (Universidade do Porto, Portugal)
– José Carlos Ramalho (Universidade do Minho, Portugal)
– José Paulo Leal (Universidade do Porto, Portugal, track chair)
– Pedro Rangel Henriques (Universidade do Minho, Portugal)
– Peter Sloep (Open Universiteit, Netherlands)
– Ricardo Queirós (Instituto Politécnico do Porto, Portugal)
– Salvador Abreu (Universidade de Évora, Portugal)

Additional reviewers

– Antonio Navarro (Universidad Complutense de Madrid, Spain)
– Antonio Sarasa-Cabezuelo (Universidad Complutense de Madrid, Spain)
– Félix Buendía (Universidad Politécnica de Valencia, Spain)
– Helena Moniz (INESC-ID, Portugal)
– Maha Khemaja (ISSATSo - University of Sousse, Tunisia)
– Marcos Garcia (Universidade de Santiago de Compostela, Spain)
– Miguel Anxo Solla Portela (Universidade de Vigo, Spain)
– Vicente Blanco (Universidad de La Laguna, Spain)

Organization Committee

– Pedro Rangel Henriques (Universidade do Minho, Portugal)
– José Paulo Leal (Universidade do Porto, Portugal)
– Alberto Simões (Universidade do Minho, Portugal)
– Maria João Varanda (Instituto Politécnico de Bragança, Portugal)
– José-Luis Sierra-Rodríguez (Universidad Complutense de Madrid, Spain)
– Antonio Sarasa-Cabezuelo (Universidad Complutense de Madrid, Spain)
– Antonio Pareja-Lora (Universidad Complutense de Madrid, Spain)
– Ana Fernandez-Pampillon (Universidad Complutense de Madrid, Spain)
– Daniel Rodríguez-Cerezo (Universidad Complutense de Madrid, Spain)

The role of ontologies in machine-machine communication

Asunción Gómez

Universidad Politécnica de Madrid, Spain
[email protected]

Abstract. In this talk I will analyse how the use of ontologies has evolved during the last two decades, moving from the notion of a shared understanding of a particular domain, used to communicate humans with computers, towards a key modelling component that allows machine-machine communication and negotiation across different domains and spoken languages.

Biography. Prof. Dr. Asunción Gómez-Pérez is Full Professor at the Universidad Politécnica de Madrid. She is the Director of the Artificial Intelligence Department (2008) and Director of the OEG at UPM (1995). Her main research areas are: Ontological Engineering, Semantic Web and Knowledge Management. She led at UPM the following EU projects: Ontoweb, Esperonto, Knowledge Web, NeOn, SEEMP, OntoGrid, ADMIRE, DynaLearn, SemSorGrid4Env, SEALS and MONNET. She coordinated OntoGrid and now she is coordinating SemSorGrid4Env and SEALS. She is also leading at UPM projects funded by Spanish agencies. The most relevant are: España Virtual, WEBn+1, GeoBuddies and the Spanish network on Semantic Web. She has published more than 150 papers and she is the author of one book on Ontological Engineering and co-author of a book on Knowledge Engineering. She has been co-director of the summer school on Ontological Engineering and the Semantic Web from 2003 up to now. She was program chair of ASWC’09, ESWC’05 and EKAW’02 and co-organiser of many workshops on ontologies.


Tree String Path Subsequences Automaton and Its Use for Indexing XML Documents

Eliška Šestáková and Jan Janoušek ⋆

Department of Theoretical Computer Science, Faculty of Information Technology, Czech Technical University in Prague, Thákurova 9, 160 00 Praha 6, Czech Republic
{Eliska.Sestakova,Jan.Janousek}@fit.cvut.cz

Abstract. The theory of text indexing is well-researched, which does not hold for theories of indexing other data structures, such as trees for example. In this paper a simple method of indexing a tree for subsequences of string paths in the tree by a finite automaton is presented. The use of the index is shown on indexing XML documents for XPath descendant-or-self axis inspired queries. Given a subject tree T with n nodes, the tree is preprocessed and an index, which is a directed acyclic subsequence graph for a set of strings, is constructed. The searching phase uses the index, reads an input string path subsequence inspired by the specific XPath query of size m and computes the list of positions of all occurrences of P in the tree T. The searching is performed in time O(m) and does not depend on n. Although the number of distinct valid queries is O(2^n), the size of the index is O(h^k), where h is the height of the tree T and k is the number of its leaves. Moreover, we discuss that in the case of indexing a common XML document the size of the index is even smaller, O(h × 2^k).

1 Introduction

Indexing a data subject preprocesses the subject and constructs an index that allows queries related to the content of the subject to be answered efficiently. For example, occurrences of input patterns in the subject can be located repeatedly and quickly, in time typically not depending on the size of the subject. The theory of text indexing, which is a result of Stringology research [5, 6], is well-researched and uses many sophisticated data structures: the suffix tree and the suffix array are the most widely used structures for text indexing for substrings, providing efficient solutions for a wide range of applications. The Directed Acyclic Word Graph [2], also known as the suffix (or factor) automaton, is another elegant structure. An index of a text for subsequences is represented by a subsequence automaton [1], which is also referred to as a Directed Acyclic Subsequence Graph (DASG).

⋆ This research has been partially supported by the Czech Science Foundation (GAČR) as project No. GA-13-03253S and by the Technology Agency of the Czech Republic (TAČR) as project No. TA03010964 in the α programme.


The theories of indexing other data structures, such as trees, have not been developed in as much detail and for as many indexing problems as in the case of indexing texts, although many practical applications serving as indexes of trees exist. Among others, a theory of indexing a data structure allows one to understand the problem better, to find efficient solutions for particular indexing problems, and to combine various indexes for the construction of indexes for unions, intersections, concatenations and other operations. In this last aspect, the use of the theory of formal languages and automata is especially helpful.

An XML document represents a hierarchical tree structure. To be able to retrieve the data from XML documents efficiently, various query languages, such as XPath, XPointer and XLink, have been designed. Indexing the structure of XML data is an effective way to accelerate XML query processing and several indexes of XML documents have been introduced. These indexes can be divided into the following four categories. 1. Graph-based methods construct a structural path summary that can be used to improve query efficiency, especially for single path queries. To this category belong DataGuides [8], 1-Index [17], PP-Index [20], F&B-Index [12] and MTree [18]. 2. Sequence-based methods transform both the source data and the query into sequences; querying XML data is therefore equivalent to finding subsequence matches. To this category belong ViST [22] and PRIX [19]. 3. Node coding methods apply a coding strategy that assigns codes to each node so that the relationships among nodes can be evaluated by computation. To this category belongs, for example, the XISS [13] method. 4. Adaptive methods can adapt their structure to suit the query workload and therefore index only the frequently used queries. To this category belongs the APEX Index [4], for example. Each of these methods has its own advantages and disadvantages; however, shortcomings do exist: path-based methods often lack support for complex queries; sequence-based methods are likely to generate approximate solutions, thus requiring a great deal of validation; node coding methods are very difficult to apply to ever-changing data sources; and adaptive methods perform poorly on non-frequent queries.

In this paper we show that automata can be used efficiently for the purpose of indexing XML documents. Here, we consider supporting only linear XPath queries. However, the techniques described here are relevant to the general XPath processing problem, for two reasons. First, processing linear expressions is a subproblem of processing more complex queries, as we can decompose them into linear fragments. Second, this can be seen as a building block for more powerful pushdown automata that are able to process branching queries [14]. Moreover, it is easy to combine the index presented in this paper with our linear index of a tree for tree patterns, which represent connected subgraphs in a tree [11].

We introduce the Tree String Path Subsequences Automaton, which accepts and indexes all subsequences of string paths in the tree, i.e. all linear XPath queries using the descendant-or-self axis (//). The searching phase uses the index, reads an input string path subsequence inspired by the specific XPath query of size m


and computes the list of positions of all occurrences of P in the tree T. The searching is performed in time O(m) and does not depend on n. Although the number of distinct valid queries is O(2^n), the size of the index is O(h^k), where h is the height of the tree T and k is the number of its leaves. Moreover, we discuss that in the case of indexing a common XML document the size of the index is even smaller, O(h × 2^k). Throughout the paper we use the notation defined in standard texts such as [9].

2 Tree String Path Subsequences Automaton

As stated in the Introduction, a Tree String Path Subsequences Automaton efficiently evaluates all linear XPath queries where just the descendant-or-self axis (//) is used. For an XML document of size n, the automaton processes a query of size m in time linear in m and not depending on n. The most similar approaches among XML indexing techniques are graph-based methods constructing a structural path summary [8, 20], which usually need further tree traversal to support queries containing the // axis. Furthermore, our index is based on the idea of automata, which makes it easy to understand.

We model an XML document as an ordered labelled tree where nodes correspond to elements and edges represent element inclusion relationships. Hence, we only consider the structure of XML documents and, therefore, ignore attributes and the text in the leaf nodes. A node in an XML data tree is represented by a pair (label, id), where id and label represent its identifier and tag name, respectively. Without loss of generality, we have chosen to use a preorder numbering scheme to uniquely assign an identifier to each of the tree nodes.

Example 1. Consider the following sample of an XML document D. Figure 1 shows its corresponding XML data tree T.

<HOUSES>
  <HOUSE>
    <LORD>Eddard Stark</LORD>
    <SIGIL>Direwolf</SIGIL>
    <SEAT>Winterfell</SEAT>
    <VASSALS>
      <HOUSE>
        <LORD>Rickard Karstark</LORD>
        <SEAT>Karhold</SEAT>
      </HOUSE>
    </VASSALS>
  </HOUSE>
  <HOUSE>
    <LORD>Daenerys Targaryen</LORD>
    <SIGIL>Dragon</SIGIL>
  </HOUSE>
</HOUSES>


Fig. 1. XML data tree of an XML document from Example 1. Nodes are denoted by their preorder numbers.

As we attempt to index linear queries only, we can omit the branching structure and describe the XML data tree by means of its linear fragments, called String Paths. To satisfy queries with the // axis, we are interested in (non-empty) subsequences of String Paths.

Definition 1 (String Path). A String Path P = p1 p2 ... pk, k ≥ 1, over an XML data tree T is a linear path leading from the root p1 to a leaf pk. Each element pi of the path is associated with an identifier, denoted by pi.id, corresponding to the preorder number of the element.

Definition 2 (String Paths Set). Let D and T be an XML document and its XML data tree respectively. The set of all String Paths over T is called String Paths Set, denoted as PT = {P1, P2 ...Pw}, where w = wD.

Example 2. Consider the XML data tree in Figure 1. The String Paths Set PT is defined as follows, with the identifier associated with each element shown in parentheses.

PT = { HOUSES(1) HOUSE(2) LORD(3),
       HOUSES(1) HOUSE(2) SIGIL(4),
       HOUSES(1) HOUSE(2) SEAT(5),
       HOUSES(1) HOUSE(2) VASSALS(6) HOUSE(7) LORD(8),
       HOUSES(1) HOUSE(2) VASSALS(6) HOUSE(7) SEAT(9),
       HOUSES(1) HOUSE(10) LORD(11),
       HOUSES(1) HOUSE(10) SIGIL(12) }
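For illustration, the following short Python sketch (our own, not part of the paper; the function name is an assumption) parses such a document with the standard xml.etree.ElementTree module, numbers the elements in preorder and enumerates the String Paths of Definitions 1 and 2.

import xml.etree.ElementTree as ET

def string_paths(xml_text):
    # List the String Paths of a document as sequences of (label, preorder id)
    # pairs, ignoring attributes and text content, as the paper does.
    root = ET.fromstring(xml_text)
    counter = [1]
    paths = []

    def visit(element, prefix):
        pid = counter[0]            # preorder identifier of this element
        counter[0] += 1
        path = prefix + [(element.tag, pid)]
        children = list(element)
        if not children:            # a leaf closes a String Path
            paths.append(path)
        for child in children:
            visit(child, path)

    visit(root, [])
    return paths

# For the document D of Example 1, the first String Path returned is
# [('HOUSES', 1), ('HOUSE', 2), ('LORD', 3)].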


Definition 3 (Subsequence of a String Path). Let P = p1p2 ...pk be a String Path. A subsequence of P is any sequence of elements pi obtainable by deleting zero or more elements from P .

Example 3. Let P = HOUSES HOUSE LORD be a String Path. There are 7 non-empty subsequences of P: HOUSES HOUSE LORD, HOUSES HOUSE, HOUSES LORD, HOUSE LORD, HOUSES, HOUSE, LORD.
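A direct way to check this count is the following small Python sketch (ours, for illustration only): it enumerates the distinct non-empty subsequences of a label sequence.

from itertools import combinations

def non_empty_subsequences(labels):
    # All distinct non-empty subsequences of a String Path given as labels.
    subs = set()
    for r in range(1, len(labels) + 1):
        for positions in combinations(range(len(labels)), r):
            subs.add(tuple(labels[i] for i in positions))
    return subs

print(len(non_empty_subsequences(["HOUSES", "HOUSE", "LORD"])))   # prints 7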

A Tree String Path Subsequences Automaton is a Subsequence Automaton [1] for a set of strings, namely the String Paths of an XML data tree representing the XML document being indexed. The automaton solving the problem of subsequences for both single and multiple strings is also referred to as a DASG and is further studied in [7, 10]. Therefore, we propose the XML index problem as another application area of DASGs.

There are three building algorithms for a DASG for a set of strings available: right-to-left [1], left-to-right and on-line [10]. However, none of them is based on a subset construction, which gives sets of positions as answers to queries. Therefore, we propose a construction of a Tree String Path Subsequences Automaton consisting of two steps. First, deterministic Subsequence Automata accepting the set of non-empty subsequences of each String Path in PT are constructed using subset construction. Second, the Tree String Path Subsequences Automaton is built as the union of the constructed Subsequence Automata.

To build the deterministic Subsequence Automaton, we propose two building algorithms: Algorithm 1 and Algorithm 2 (a direct subset construction of the deterministic Subsequence Automaton). The resulting automata are used to build the Tree String Path Subsequences Automaton by Algorithm 3.

Data: A String Path P = p1 p2 ... pk.
Result: A deterministic finite automaton accepting all non-empty subsequences of P.
1. Construct a finite automaton M1 = (Q1, A, δ1, 0, F1) accepting all prefixes of P:
   (a) Q1 = {0, p1.id, p2.id, ..., pk.id},
   (b) A is the set of distinct element labels in P with the // prefix added,
   (c) δ1(i − 1, //pi) = pi.id, ∀i = 1, 2, ..., k,
   (d) F1 = {1, 2, ..., k}.
2. Insert ε-transitions into the automaton M1 leading from each state to its next state. The resulting automaton is M2 = (Q2, A, δ2, 0, F2), where
   (a) Q2 = Q1, F2 = F1,
   (b) δ2 = δ1 ∪ δ′ and δ′(i − 1, ε) = i, ∀i = 1, 2, ..., k.
3. Eliminate all ε-transitions. The resulting automaton is M3.
4. Construct a deterministic finite automaton M equivalent to M3 using the standard determinization algorithm based on subset construction.

Algorithm 1: Construction of a Subsequence Automaton for a single String Path.


In order to speed up the Subsequence Automaton construction, we simplify the previous algorithm and propose a direct subset construction of a deterministic Subsequence Automaton for a single String Path, described by Algorithm 2.

Definition 4 (A set of occurrences of an element in a String Path). Let P = p1 p2 ... pk be a String Path and e be an element occurring at one or more positions in P (i.e., pi = e for some i). The set of occurrences of the element e in P is a totally ordered set OP(e) = {o | o = pi.id ∧ pi = e, i = 1, 2, ..., k}. The ordering is the ordering of the element preorder identifiers as natural numbers.

Definition 5 (ButFirst). Let P be a String Path and O(e) = {o1, o2, ..., ok} be the set of occurrences of an element e in P. Then we define the function ButFirst(O(e)) = {o2, ..., ok}.

Data: A String Path P = p1 p2 ... pk.
Result: A deterministic finite automaton accepting all non-empty subsequences of P.
1. Let E = {e1, e2, ..., ez}, where z ≤ k, be the set of element labels occurring in P.
2. ∀ei ∈ E compute O(ei) (the set of occurrences of ei in P).
3. Construct a finite automaton M = (Q, A, δ, q0, F) accepting all prefixes of the string P:
   (a) Q = {q0, q1, ..., qk},
   (b) A = {//e1, //e2, ..., //ez},
   (c) q0 = 0 and ∀pi, where i = 1, 2, ..., k:
       i. set state qi = O(pi),
       ii. add δ(qi−1, //pi) = qi,
       iii. O(pi) = ButFirst(O(pi)),
   (d) F = {q1, q2, ..., qk}.
4. Insert additional transitions into the automaton M:
   (a) ∀qi ∀ej, where i = 0, 1, ..., k − 1 and j = 1, 2, ..., z:
       i. add δ(qi, //ej) = qs, if there exists such s > i where δ(qs−1, //ej) = qs ∧ ¬∃r, i < r < s, with δ(qr−1, //ej) = qr.

Algorithm 2: A direct subset construction of a Subsequence Automaton for a single String Path.
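As an informal illustration, the following Python sketch (our own rendering, not the authors' implementation; it assumes the String Path is given as a list of (label, preorder id) pairs) follows Algorithm 2: each state qi is labelled by the remaining occurrences of pi, and from any state the automaton jumps on //e to the first later position labelled e.

def subsequence_automaton(path):
    # path: list of (label, preorder_id) pairs, e.g. [("HOUSES", 1), ("HOUSE", 2), ("LORD", 3)]
    k = len(path)
    occurrences = {}                                   # label -> remaining occurrence ids
    for label, pid in path:
        occurrences.setdefault(label, []).append(pid)

    state_ids = {0: frozenset()}                       # q0 is the initial state
    for i, (label, pid) in enumerate(path, start=1):
        state_ids[i] = frozenset(occurrences[label])   # step 3(c)i: qi = O(pi)
        occurrences[label].pop(0)                      # step 3(c)iii: ButFirst(O(pi))

    delta = {}                                         # (state index, "//label") -> state index
    labels = {label for label, _ in path}
    for i in range(k):                                 # steps 3(c)ii and 4: jump to the
        for label in labels:                           # first occurrence of label after i
            for s in range(i + 1, k + 1):
                if path[s - 1][0] == label:
                    delta[(i, "//" + label)] = s
                    break

    accepting = set(range(1, k + 1))                   # every non-initial state accepts
    return state_ids, delta, accepting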

Theorem 1. Given a String Path P = p1 p2 ... pk, Algorithm 2 correctly constructs a deterministic finite automaton accepting all non-empty subsequences of P.

Proof. In [21].

We now introduce the definition of a Flexible String Path query and its validity, which is useful for describing the language that the Tree String Path Subsequences Automaton accepts.


Definition 6 (Flexible String Path query). A Flexible String Path query over an XML document D is a linear expression generated by the following context-free grammar:

G = ({Q, E}, A, P, Q)

where A = {//e1, //e2, ..., //en} and e1, e2, ..., en are the different element names that occur in D. The production rules P are defined as follows:
1. Q → E | QQ
2. E → //e1 | //e2 | ... | //en

Definition 7 (Validity of a Flexible String Path query). A Flexible String Path query Q over an XML document D is valid if and only if there exists a non-empty set of elements in D satisfying the query. Otherwise, Q is called invalid.

Example 4. Let D be the XML document from Example 1. The Flexible String Path query //HOUSE//LORD is valid because there exist three elements satisfying the query. However, the query //SIGIL//LORD is invalid as no elements satisfy the query.

Definition 8. Let D be an XML document. A Tree String Path Subsequences Automaton accepts all valid Flexible String Path queries of D.
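As a tiny illustration of this grammar (our sketch, not from the paper), a query string can be split into its //-prefixed steps; any string that does not round-trip through this decomposition is not a Flexible String Path query.

import re

def query_steps(query):
    # Split a Flexible String Path query such as "//HOUSE//LORD" into its steps.
    steps = re.findall(r"//([^/]+)", query)
    if not steps or "//" + "//".join(steps) != query:
        raise ValueError("not a Flexible String Path query")
    return steps

print(query_steps("//HOUSE//LORD"))     # ['HOUSE', 'LORD']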

Data: A String Paths Set PT = {P1, P2, ..., Pl}.
Result: A deterministic finite automaton accepting all valid Flexible String Path queries.
1. Construct finite automata Mi = (Qi, Ai, δi, 0, Fi) accepting the set of non-empty subsequences of Pi, nSub(Pi), for i = 1, 2, ..., l, using Algorithm 2.
2. Construct the deterministic finite Tree String Path Subsequences Automaton M = (Q, A, δ, 0, F) accepting the set nSub(PT) = nSub(P1) ∪ nSub(P2) ∪ ... ∪ nSub(Pl).

Algorithm 3: Construction of a Tree String Path Subsequence Automaton for an XML document D and its corresponding XML data tree T .
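A possible rendering of step 2 is sketched below (again in Python, under our own naming and with no merging of equivalent states, so it may have more states than the paper's automaton): the union is determinised by a product-style subset construction in which a combined state keeps one component per String Path, and its answer is the union of the preorder identifiers carried by the live, non-initial components. Here per_path is a list of (state_ids, delta, accepting) triples such as those produced by the earlier subsequence_automaton sketch.

from collections import deque

def tree_automaton(per_path):
    alphabet = {symbol for _, delta, _ in per_path for _, symbol in delta}
    start = tuple(0 for _ in per_path)
    transitions = {}
    answers = {start: frozenset()}
    queue = deque([start])
    while queue:
        state = queue.popleft()
        for symbol in alphabet:
            successor = tuple(
                None if comp is None else delta.get((comp, symbol))
                for comp, (_, delta, _) in zip(state, per_path))
            if all(comp is None for comp in successor):
                continue                          # the query cannot be extended this way
            transitions[(state, symbol)] = successor
            if successor not in answers:
                ids = set()
                for comp, (state_ids, _, _) in zip(successor, per_path):
                    if comp not in (None, 0):
                        ids |= state_ids[comp]    # preorder ids answered by this component
                answers[successor] = frozenset(ids)
                queue.append(successor)
    return transitions, answers

# Answering a valid query such as //HOUSE//LORD then amounts to following two
# transitions from the start state and reading the answer set of the state reached.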

Example 5. Let D and T be the XML document and XML data tree from Example 1 and Figure 1, respectively. The corresponding Tree String Path Subsequences Automaton accepting all valid Flexible String Path queries, constructed by Algorithm 3, is shown in Figure 2.

3 Time and space complexities

It is clear that the number of valid Flexible String Path queries is exponential in the number of nodes of the XML data tree. Each state of a Tree String


Path Subsequences Automaton corresponds to the answer of a single query or of a collection of queries. Although the number of different queries accepted by the Tree String Path Subsequences Automaton is exponential, usually many of the queries are equivalent (i.e., their result sets of elements are equal).

Therefore, the equivalence problem for queries is closely related to the problem of determining the number of states of a Tree String Path Subsequences Automaton. That is, if we know the number of unique query answers, we can construct a deterministic automaton answering all queries using exactly this number of states. On the other hand, we can obviously use the Tree String Path Subsequences Automaton to decide the equivalence of two queries and even to determine equivalence classes.

The containment and equivalence problems for a fragment of the XPath query language were studied in [15, 16]. For linear XPath queries using only the descendant-or-self axis //, a PTIME containment algorithm was provided by Buneman et al. in [3].

Fig. 2. Deterministic Tree String Path Subsequence Automaton.

From another point of view, we can examine the number of states of a Tree String Path Subsequences Automaton as the size of a DASG for a set of strings. For k strings of length h, the number of states can be trivially bounded by O(h^k) (the size of a product of k automata with O(h) states each). The running time for a query of length m becomes O(m). The lower bound for k > 2 texts is not known, while


Crochemore and Tronicek [7] showed that Ω(h^2) states are required for k = 2 in the worst case. Considering the XML index problem, k is the number of leaves in an XML data tree and h is its tree height. Moreover, for a common XML document, in which the same nodes can only appear at the same level of the document, the size of the index is even smaller: O(h × 2^k). This is an asymptotic upper bound and we note that the size of the index is much smaller for many XML documents, according to our experimental results [21].

4 Conclusion

The Tree String Path Subsequences Automaton has been introduced. This automaton is suitable for indexing XML documents for XPath descendant-or-self axis inspired queries, and it can easily be combined with other tree indexes based on automata theory.

References

[1] Ricardo A. Baeza-Yates. Searching subsequences. Theoretical Computer Science, 78(2):363–376, 1991.
[2] Anselm Blumer, J. Blumer, David Haussler, Andrzej Ehrenfeucht, M. T. Chen, and Joel I. Seiferas. The smallest automaton recognizing the subwords of a text. Theor. Comput. Sci., 40:31–55, 1985.
[3] Peter Buneman, Susan Davidson, Wenfei Fan, Carmem Hara, and Wang-Chiew Tan. Reasoning about keys for XML. In Giorgio Ghelli and Gösta Grahne, editors, Database Programming Languages, volume 2397 of Lecture Notes in Computer Science, pages 133–148. Springer Berlin Heidelberg, 2002.
[4] Chin-Wan Chung, Jun-Ki Min, and Kyuseok Shim. APEX: An adaptive path index for XML data. In Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, SIGMOD ’02, pages 121–132, New York, NY, USA, 2002. ACM.
[5] Maxime Crochemore, Christophe Hancart, and Thierry Lecroq. Algorithms on Strings. Cambridge University Press, 2007.
[6] Maxime Crochemore and Wojciech Rytter. Text Algorithms. Oxford University Press, 1994.
[7] Maxime Crochemore and Zdenek Tronicek. On the size of DASG for multiple texts. In Alberto H. F. Laender and Arlindo L. Oliveira, editors, String Processing and Information Retrieval, volume 2476 of Lecture Notes in Computer Science, pages 58–64. Springer Berlin Heidelberg, 2002.
[8] Roy Goldman and Jennifer Widom. DataGuides: Enabling query formulation and optimization in semistructured databases. 1997.
[9] John E. Hopcroft, Rajeev Motwani, and Jeffrey D. Ullman. Introduction to Automata Theory, Languages, and Computation. Addison-Wesley, Boston, 2nd edition, 2001.


[10] H. Hoshino, A. Shinohara, M. Takeda, and S. Arikawa. Online construction of subsequence automata for multiple texts. In String Processing and Information Retrieval, 2000. SPIRE 2000. Proceedings. Seventh International Symposium on, pages 146–152, 2000.
[11] Jan Janoušek, Bořivoj Melichar, Radomír Polách, Martin Poliak, and Jan Trávníček. A full and linear index of a tree for tree patterns. In Helmut Jürgensen, Juhani Karhumäki, and Alexander Okhotin, editors, Descriptional Complexity of Formal Systems - 16th International Workshop, DCFS 2014, Turku, Finland, August 5–8, 2014. Proceedings, volume 8614 of Lecture Notes in Computer Science, pages 198–209. Springer, 2014.
[12] Raghav Kaushik, Philip Bohannon, Jeffrey F. Naughton, and Henry F. Korth. Covering indexes for branching path queries. In Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, SIGMOD ’02, pages 133–144, New York, NY, USA, 2002. ACM.
[13] Quanzhong Li and Bongki Moon. Indexing and querying XML data for regular path expressions. In Proceedings of the 27th International Conference on Very Large Data Bases, VLDB ’01, pages 361–370, San Francisco, CA, USA, 2001. Morgan Kaufmann Publishers Inc.
[14] Bořivoj Melichar, Jan Janoušek, and Tomáš Flouri. Arbology: trees and pushdown automata. Kybernetika, 48(3):402–428, 2012.
[15] Gerome Miklau and Dan Suciu. Containment and equivalence for an XPath fragment. In Proceedings of the Twenty-first ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS ’02, pages 65–76, New York, NY, USA, 2002. ACM.
[16] Gerome Miklau and Dan Suciu. Containment and equivalence for a fragment of XPath. J. ACM, 51(1):2–45, January 2004.
[17] Tova Milo and Dan Suciu. Index structures for path expressions. In Catriel Beeri and Peter Buneman, editors, Database Theory ICDT’99, volume 1540 of Lecture Notes in Computer Science, pages 277–295. Springer Berlin Heidelberg, 1999.
[18] P. Mark Pettovello and Farshad Fotouhi. MTree: An XML XPath graph index. In Proceedings of the 2006 ACM Symposium on Applied Computing, SAC ’06, pages 474–481, New York, NY, USA, 2006. ACM.
[19] P. Rao and B. Moon. PRIX: indexing and querying XML using Prüfer sequences. In Data Engineering, 2004. Proceedings. 20th International Conference on, pages 288–299, March 2004.
[20] Nan Tang, J. X. Yu, M. T. Özsu, and Kam-Fai Wong. Hierarchical indexing approach to support XPath queries. In Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on, pages 1510–1512, April 2008.
[21] Eliška Šestáková. Indexing XML documents. Master’s thesis, Czech Technical University in Prague, Faculty of Information Technology, Prague, 2015.
[22] Haixun Wang, Sanghyun Park, Wei Fan, and Philip S. Yu. ViST: A dynamic index method for querying XML data by tree structures. In Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, SIGMOD ’03, pages 110–121, New York, NY, USA, 2003. ACM.

A structural approach to assess graph-based exercises

José Paulo Leal and Rúben Sousa

CRACS & INESC-Porto LA, Faculty of Sciences, University of Porto, Portugal [email protected] [email protected]

Abstract. This paper proposes a structure-driven approach to assess graph-based exercises. Given two graphs, a solution and an attempt of a student, this approach computes a mapping between the node sets of both graphs that maximizes the student's grade, as well as a description of the differences between the two graphs. The proposed algorithm uses heuristics to test the most promising mappings first and prune the remaining ones when it is certain that a better mapping cannot be computed. The proposed algorithm is applicable to any type of document that can be parsed into its graph-inspired data model. This data model is able to accommodate diagram languages, such as UML or ER diagrams, for which this kind of assessment is typically used. However, the motivation for developing this algorithm is to combine it with other assessment models, such as the test case model used for assessment. The proposed algorithm was validated with thousands of graphs with different features produced by a synthetic data generator. Several experiments were designed to analyse the impact of different features such as graph size and the amount of difference between solution and attempt.

Keywords: automatic assessment, graph comparison, graph-based exercises

1 Introduction

Graphs are mathematical structures that model relationships among objects. They can be used in a wide range of domains such as network topology, software architecture and digital circuit design, just to mention a few. Diagrams are an apt example of a document type with a graph-based representation that requires automatic assessment. However, graphs can also be used for assessing exercises where the relationship among parts is important but not determinant. Finite Deterministic Automata (FDA) and even programming languages are examples of this kind of assessment.

The assessment of an FDA should be twofold [2], based on the recognized strings and on the structure of the state automaton. If an FDA recognizes all the strings it should, and only those, then it must be correct. Otherwise, examples


of strings that should be recognized but are not, and vice versa, can be automatically generated. However, these examples seldom help to overcome the error. Helpful feedback must pinpoint what is wrong, for instance, which nodes are missing or which transitions must be removed. This can be achieved using graph assessment, since the structure of an FDA can be represented by a state automaton [2].

Programming language assessment would also benefit from a similar approach. The standard way of assessing a program [4] is to compile it and then execute it with a set of test cases. A program is considered correct if it compiles without errors and the output of each execution is what is expected. If it is incorrect then the most this approach can provide are examples of input that generate wrong output. It cannot pinpoint the errors in the code of the program. An attempt to make this kind of assessment should be based on the structure of the program, specifically on its abstract semantic graph.

The ultimate goal of the research presented in this paper is to define a general methodology for assessing graph-based exercises, applicable to a wide range of domains including FDAs and programming languages, but also diagrams. The objective of this paper is to propose a graph assessment algorithm and to evaluate its efficiency for graphs with the sizes typically used in automated assessment.

The proposed assessment algorithm is based on the graph structure. This means that it actually computes the mapping between the node sets of both graphs that best preserves their edges. To avoid checking a large number of mappings it iterates over them, testing the most promising first. The mappings are iterated in an order that allows the algorithm to prune the majority of them, once it is ensured that the remaining mappings cannot produce a better one. The iteration order is driven by the types and properties of nodes.

The remainder of this paper is organized as follows. Section 2 surveys the existing literature on the assessment of graph-based exercises. Section 3 describes the proposed algorithm, including the definition of the data structures used to represent graph-based exercises and their assessment. This approach is validated in Section 4 using a graph generator to test the applicability of the proposed algorithm. Finally, Section 5 highlights both the main contributions of this paper and the work ahead to apply this form of assessment in different scenarios.

2 Related work

This section surveys the existing literature on automatic diagram assessment. To the best of the authors' knowledge, no general algorithm for the assessment of graph-based documents has yet been proposed. The existing proposals are targeted solely at diagrams, and focus mostly on their labels rather than on their structure.

Most of the available automatic diagram assessment systems were designed for a specific diagram type. Examples of the single diagram types addressed by existing systems are deterministic finite automata (DFA) [2, 6], UML class


diagrams [1, 7], UML use case diagrams [10] and Entity-Relationship diagrams [3], among others.

All these systems determine a mapping between the nodes of the solution graph and the nodes of the student's answer. The easiest approach is to use a fixed set of labels in both graphs. For instance, the exercise descriptions used in the assessment system proposed by Soler [7] for UML class diagrams require fixing the class names used by students. Finding a mapping between the node sets of both graphs is thus straightforward.

A variant of this approach is the assessment in stages that Ali et al. [1] proposed. This system will not advance to the next stage until the current one is completely correct; otherwise it reports feedback on what is wrong or missing. The considered stages are: structural analysis, verification process and language checking. The first stage compares the number of nodes, attributes and operations, and their types. The second stage checks whether connections have the correct source and target types. Knowing that the graph structure is correct (by the two stages above), the system checks whether the labels in nodes and attributes are nouns and those in operations are verbs.

The automata-based graph analyser of Shukur and Mohamed [6] works in a way that is similar to that presented above. The system does two types of evaluation: static and dynamic. The static analysis is made by comparing the global number of states, the number of initial and final states and the number of connections. The dynamic analysis is made by testing two sets of strings. One of the sets is composed of strings that the model should accept: if any of them is rejected, the graph is not correct. The second set is composed of strings that should be rejected: conversely, if any of these strings is accepted, the graph is not correct.

Thomas, Smith and Waugh [9, 8] propose a system with similarities to the approach presented in this paper. It is a generic system able to handle different diagram types. Elements can be represented as boxes or circles and connections as lines. The system tries to match the elements of the students' answers to the elements of the solution. A similarity measure is computed for each pair of nodes and edges, and with that value the system decides what is (or is not) a valid match. If the similarity is high, the system assumes the match is correct; if the similarity is low, the match is marked as not correct. The assessment algorithm described in the next section proposes a way to find the best match without these assumptions.

3 Graph assessment algorithm

The objective of the algorithm described in this section is to assess an exercise represented as an extended graph by comparing it with a standard solution, also represented as an extended graph. The assessment consists in determining a set of differences between both extended graphs. These differences can be summarized in a grade, a numerical value within a fixed range (e.g. 0 to 100). If the set of differences is empty then the attempt of the student receives the maximum grade; otherwise each difference introduces a penalty according to its


type. For instance, a missing node may have a higher impact on the grade than a missing edge. Wrong types and missing or wrong properties also have their own penalties, depending on the graph-based language being assessed.

The basic approach is to determine the mapping between the node sets of both extended graphs that maximizes the grade. This can be solved by a simple generate-and-test strategy: if one generates all the possible mappings between both extended graphs, for each mapping one can determine the differences between both graphs and compute a grade. After iterating through all possible mappings one can select the one that produces the highest grade.

A solution and an attempt with equal sizes, i.e. with an equal number of nodes, are a particular case. In general these graphs have different sizes, since the student may have omitted nodes or edges, or introduced unneeded ones. In this case the approach is to reduce the number of nodes in one graph until both have the same size; edges connecting the removed nodes are also removed.

This section details several parts of the proposed algorithm. Subsection 3.1 introduces the definitions of extended graph and graph differences, and defines the computation of a grade from a set of graph differences. Subsection 3.2 explains how node mappings are generated and pruned to enable an efficient assessment.

3.1 Data structures

The proposed algorithm processes two extended graphs: a standard solution and a student attempt. A simple graph G = (N, E) is defined by a set of nodes and a set of edges, where an edge is a pair of nodes. In an extended graph both nodes and edges have a type and a set of properties. An extended graph is a multigraph, in the sense that a pair of nodes may have more than one edge, possibly with different types.

Node and edge types capture the essential features of a graph-based language. Take UML diagrams, for instance. Each kind of diagram combines nodes and edges of particular kinds. A use case diagram has actor and use case as node types, and associations, dependencies and generalizations as edge types. The features that are not captured by types are encoded as properties. Properties are simply name-value pairs. Consider an association in a UML class diagram; it may have a navigability, multiplicities, roles and other kinds of properties.

The assessment of an extended graph against another is a set of differences. The most relevant differences, such as the insertion and deletion of nodes, are detected when both graphs are made the same size. The rest of the differences are computed based on a mapping between the node sets of extended graphs of equal size. Consider a mapping m : N → N′ and a node a ∈ N from the extended graph used as solution. If a and m(a) are indistinguishable then no difference is added to the set. Otherwise, a difference of a certain kind is signaled: if the types of a and m(a) differ, m(a) has a wrong type; if a property of a and m(a) differs, a property insertion, deletion or wrong value is signaled.

The assessment restricted to nodes plays an important role in the proposed algorithm, since it is quicker to compute and is used in heuristics. However,


a complete assessment must also consider edges. When nodes are removed to force both graphs to have the same size, the deletion of the arcs connecting them is also marked. The rest of the arcs depends on the mapping: for each (a, b) ∈ E an (m(a), m(b)) ∈ E′ is expected, and vice versa. Otherwise edge insertions, omissions and wrong types, as well as edge property differences, are also marked.

Given a set of differences it is possible to compute a grade. The empty difference set has the maximum grade (e.g. 100). Each kind of difference has a certain penalty, and a grade is computed by subtracting these penalties from the maximum grade. Penalties depend on their kind and on the size of the graph. In general a difference in a node should have a higher impact than a difference in an edge, but ultimately this depends on the graph-based language. There are a number of weights that have to be tuned for a particular language, using actual grades given by expert teachers as a benchmark. The same penalty has different impacts according to the solution graph size. For instance, a missing node will have a higher impact on a small graph than on a large graph.

Grades computed from a set of differences are much more than just the final output of the assessment algorithm. They are essential to control it, in particular the node contribution to the grade, as is explained in the next subsection.
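To make this data model concrete, the following Python sketch (our illustration; the class names and penalty values are assumptions and not the authors' Java implementation) shows one way to represent extended graphs and to turn a set of differences into a grade.

from dataclasses import dataclass

@dataclass(frozen=True)
class Node:
    ident: str
    type: str
    properties: tuple = ()        # name-value pairs, e.g. (("navigability", "yes"),)

@dataclass(frozen=True)
class Edge:
    source: str                   # ident of the source node
    target: str                   # ident of the target node
    type: str
    properties: tuple = ()

@dataclass
class ExtendedGraph:
    nodes: dict                   # ident -> Node
    edges: list                   # list of Edge; a multigraph may repeat node pairs

# Illustrative penalty table; the paper notes that the actual weights must be
# tuned per graph-based language against grades given by expert teachers.
PENALTIES = {"node_missing": 10, "node_extra": 10, "node_wrong_type": 5,
             "edge_missing": 5, "edge_extra": 5, "edge_wrong_type": 3,
             "property_diff": 2}

def grade(differences, maximum=100):
    # differences: iterable of (kind, detail) pairs produced by the assessment
    return max(0, maximum - sum(PENALTIES[kind] for kind, _ in differences))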

3.2 Node mappings

The general strategy for assessing an extended graph against another is to determine the mapping between them that produces the highest grade. Due to the large number of possible mappings it is important to have heuristics to consider the most promising ones first, and to have a criterion to prune most of them.

The node component of the assessment outweighs the edge component, although its computational complexity is much smaller. If both graphs have n nodes, there are n^2 pairs of nodes, although these can be combined into n! mappings. If one iterates over the set of mappings by decreasing order of their node contributions to the grade (i.e. with increasing penalties), then the first mappings have a higher chance of being the best than those appearing afterwards.

The first step in generating these mappings is to compute the contribution to the grade of individual node mappings. The initial mapping candidate is constructed from the individual mappings with the best grade (fewest differences) for each node in the standard node set. Note that this is a mapping candidate; it may not be a valid candidate if two different nodes are mapped to the same node. The rest of the individual mappings are generated by decreasing order of their contributions to the grade.

SLATE’2015 18 José Paulo Leal and Rúben Sousa

New mappings are generated by replacing individual node mappings with an alternative. To ensure that the node contribution of the mapping decreases monotonically, a sequence of target differences is explored in increasing order. The first grade difference to be explored is zero; that is, all sets of alternatives with a cumulative difference of zero to the best node grade are tested before all others. Then sets of alternatives with a cumulative difference of 1, 2, and so forth are tested, until all possible sets of alternatives are explored. The fact that grades are integer values is fundamental to this approach.

The number of mappings generated in this way is actually more than n! (where n is the number of nodes of the graphs), since some of the mappings are invalid and are discarded. A mapping is invalid if it is not a bijective function. Hence, this process of generating mappings is only worthwhile if it can be pruned early, thus avoiding the generation of most of the mappings. After a number of iterations, the best mapping produces a grade g_best. The current mapping's grade is g = g_nodes + g_edges. If g_nodes + g_max_edges < g_best then it is certain that a better grade cannot be achieved with the remaining mappings, since they all have a node contribution smaller than g_nodes.

The node mapping generation process described above assumes that both graphs have equal sizes, which in general is not the case. If one graph has n nodes and the other has m nodes, with n > m, then there are n!/m! different ways to make them equal. Again, the approach is to delete first the nodes that are least expected in the mapping, pruning the tail of the node deletion list once it is certain that those alternatives cannot contribute to a mapping better than the one determined so far.

The individual mappings are also used for selecting the order in which nodes are removed. For instance, if a single node has to be removed then the first attempt goes with the node that produces the worst contribution when mapped to any other, followed by the nodes with increasing contribution. If a pair of nodes has to be removed then a similar approach is taken, considering the combined contribution of those nodes. Since the groups of nodes to be removed are selected in increasing order of their contribution to the grade, a similar pruning condition is also used in the graph reducing procedure.
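The following simplified Python sketch (ours; it generates and sorts all permutations up front, which the paper's incremental generator deliberately avoids, and all function names are assumptions) shows the search order and the pruning condition described above.

from itertools import permutations

def best_mapping(sol_nodes, att_nodes, node_grade, edge_grade, max_edge_grade):
    # node_grade(m) and edge_grade(m) score a mapping m given as a dict from
    # solution nodes to attempt nodes; both graphs already have the same size.
    candidates = [dict(zip(sol_nodes, perm)) for perm in permutations(att_nodes)]
    candidates.sort(key=node_grade, reverse=True)    # most promising mappings first

    best, best_grade = None, float("-inf")
    for mapping in candidates:
        g_nodes = node_grade(mapping)
        if g_nodes + max_edge_grade < best_grade:
            break                                    # no remaining mapping can do better
        g = g_nodes + edge_grade(mapping)
        if g > best_grade:
            best, best_grade = mapping, g
    return best, best_grade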

4 Validation

The graph assessment algorithm described in the previous section was implemented in Java 1.8. This implementation was used in a number of experiments to validate the applicability of the proposed approach to the assessment of exercises on graph-based languages.

The validation was conducted using synthetic graphs. This approach contrasts with the validations described in the existing literature on diagram assessment systems, surveyed in Section 2. Most of the referred authors use actual exercises and student attempts, or a corpus with a large number of diagrams. The reason for choosing synthetic data to validate this approach is twofold. Firstly, it is not intended for a specific graph-based language and should be


Hence, it is important to test it with a wide range of settings, varying the number of types and properties, as will happen with different graph-based languages. Secondly, it is important to test the limits of the proposed approach, in terms of graph sizes and amount of difference between solution and attempt graphs, for which a large number of graph pairs is required.

4.1 Graph generator

The graph generator is a component that produces synthetic graphs for testing and validating the proposed graph assessment algorithm. This component is used to generate both a solution graph and attempts near a given solution. The attempt graph cannot be just another random graph; it must be close enough to the solution to produce a meaningful assessment. The graph generator follows the builder design pattern. It has a number of settings that control the minimum and maximum number of nodes, types and properties. The number of edges for a graph with n nodes ranges between n − 1 and n(n + 1)/2, since these are the minimum and maximum number of edges for a connected graph with n nodes. When a new graph is requested, its nodes and arcs, with their respective types and properties, are randomly generated within these boundaries. Graphs used in graph-based languages are typically connected graphs. Thus, the generator ensures that generated graphs are connected. It computes the connected components of the graph and, while there is more than one, it replaces a redundant edge in one component with an edge to a node in a different component. A redundant edge in a connected component is one that can be removed without breaking connectivity. As explained above, the graph generator can also be used to produce graphs within the vicinity of a given graph, i.e. with a given number of variations in the number of nodes, edges, types and parameters. Within these boundaries the generator inserts or removes nodes, changes types, and inserts, removes or changes properties. Since the graphs produced this way model student attempts, they may be disconnected graphs. When producing a graph variant to model a student attempt, the generator also produces a set of differences. This set of differences uses the same type of data structure returned by the assessment method. Hence it is straightforward to compare the differences detected by the assessment method with those produced by the generator. This comparison validates the algorithm and its implementation. Not all student attempts are wrong. Some may be equivalent to the standard solution, and this situation must also be tested. Nevertheless, it would be highly improbable for the two graphs to have nodes and arcs in exactly the same order. Comparing two exactly equal graphs could influence the performance of the algorithm. Thus, the nodes and arcs of variant graphs are always shuffled, even if some differences were actually introduced.
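The connectivity repair can be illustrated with a small, self-contained sketch over an adjacency matrix; the representation and helper methods are assumptions made for this example and do not mirror the generator's actual implementation.

import java.util.*;

// While the graph has several connected components, an edge whose removal does not
// disconnect its component ("redundant") is dropped and an edge joining two
// different components is added, which keeps the edge count stable when such an
// edge exists.
final class ConnectivityRepair {

    static void makeConnected(boolean[][] adj) {
        List<List<Integer>> comps;
        while ((comps = components(adj)).size() > 1) {
            int[] redundant = findRedundantEdge(adj);
            if (redundant != null)
                adj[redundant[0]][redundant[1]] = adj[redundant[1]][redundant[0]] = false;
            int a = comps.get(0).get(0), b = comps.get(1).get(0);
            adj[a][b] = adj[b][a] = true;               // bridge two components
        }
    }

    // an edge is redundant if removing it leaves the component count unchanged
    private static int[] findRedundantEdge(boolean[][] adj) {
        int before = components(adj).size();
        for (int i = 0; i < adj.length; i++)
            for (int j = i + 1; j < adj.length; j++)
                if (adj[i][j]) {
                    adj[i][j] = adj[j][i] = false;
                    boolean ok = components(adj).size() == before;
                    adj[i][j] = adj[j][i] = true;
                    if (ok) return new int[]{i, j};
                }
        return null;                                    // the graph is a forest: no redundant edge
    }

    private static List<List<Integer>> components(boolean[][] adj) {
        int n = adj.length;
        boolean[] seen = new boolean[n];
        List<List<Integer>> comps = new ArrayList<>();
        for (int s = 0; s < n; s++) {
            if (seen[s]) continue;
            List<Integer> comp = new ArrayList<>();
            Deque<Integer> stack = new ArrayDeque<>();
            stack.push(s);
            seen[s] = true;
            while (!stack.isEmpty()) {
                int u = stack.pop();
                comp.add(u);
                for (int v = 0; v < n; v++)
                    if (adj[u][v] && !seen[v]) { seen[v] = true; stack.push(v); }
            }
            comps.add(comp);
        }
        return comps;
    }
}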


4.2 Experiments

The implementation of the proposed graph assessment algorithm and the synthetic graph generator described in the previous subsection were used in a series of experiments designed to answer the following questions.

– Up to what graph size can this algorithm be used? The graph homeomorphism problem is an NP problem neither known to be solvable in polynomial time nor known to be NP-complete [5], but most likely its complexity is high enough to prevent the use of this approach on graphs above a certain size.
– Do heuristics have a significant impact on performance? The heuristics were designed to explore the most promising mappings first, but they have an initial cost and depend on the effectiveness of the pruning criteria.
– What is the impact of weights on performance? The algorithm is driven by grades, and the heuristics rely on the contribution of nodes to grades. The balance between the weight of node and edge grades is bound to influence performance.
– What is the impact of domain-specific data? The algorithm was designed to take advantage of the types and properties assigned to nodes and edges. This data makes nodes and edges easier to identify, and the algorithm should perform better as more of it is available.
– How dissimilar can solutions and attempts be? If the attempt and the solution are completely dissimilar then it makes no sense to compute differences between them, and the grade should be zero. However, the assessment algorithm should perform well for attempts within a certain range of the solution.

The experiments that answered these questions ran on a 4-core computer with 8 i7-3630QM logical CPUs at 2.40 GHz and 8 GB of RAM. For each setting the experiment was repeated with 100 different random pairs of graphs. In most cases the assessment of a pair of graphs executed well below half a second. Occasionally some pairs of graphs took longer, hence a timeout of 2 seconds was used. In these cases the assessment was considered incomplete, although the result obtained within the allotted time is correct in more than 50% of the cases. The first experiment addressed the size of the graphs that can be assessed with the proposed algorithm. Alur et al. argue that the graphs used in exercises are usually small, with fewer than 10 nodes [2], and thus this complexity is not a serious problem. This appears to be the case in DFA, the domain they studied, and also in many other domains, such as UML class and use case diagrams. However, an Entity-Relationship exercise to model a simple database may have more than 20 nodes, for instance. The results obtained with hundreds of equivalent graph pairs show that the proposed algorithm deals with orders of up to 30. Another experiment addressed the impact of pruning. For that purpose a variant of the mapping iterator was implemented. This iterator returns all the possible mappings, without the initial overhead required by the iterator of the proposed algorithm.


The rest of the algorithm was kept unchanged. This variant was tested with equivalent graphs but could only complete the assessment of graphs of order 6 or lower. Pairs of graphs with a larger number of nodes always produce an incomplete assessment. This compares with the optimized iterator, which can assess most graph pairs up to order 30. The third experiment addressed the impact of weights, in particular the balance between the node and edge contributions to the assessment. Since the heuristics rely on the node contributions to perform pruning, a larger contribution of edges decreases efficiency. It should be noted that the actual weights will depend on the specific graph-based language and on the particular grading criteria defined by the teacher. In any event, it is expected that nodes contribute at least half of the grade and in general more than that. In fact, an equal weight of nodes and edges produces an assessment in less than 200 ms for graph pairs with up to 28 nodes, and the results improve as the weight of nodes increases, as expected. The percentage of incomplete assessments is always less than 5% and lowers as the weight of edges lowers. The impact of domain-specific data, i.e. the information provided by types and properties, was also tested. Although the proposed algorithm is based on the structure of the graphs, their nodes and edges, the heuristics use types and properties to distinguish them and improve efficiency. However, it should be noted that the number of types and properties depends on the graph-based language and cannot be controlled by the algorithm. As expected, the worst results were obtained with just one or two different types (a single type is equivalent to no types). With three to five types, graph pairs of up to order 30 are assessed in 50 ms. The incomplete assessments reached 8% for graphs of order 28 with 3 types, but were less than 2% for all orders up to 30 with 4 or 5 types. The experiments described above were performed with pairs of equivalent graphs to determine the impact of features. In these experiments it was checked that the algorithm found no differences between the graphs. The rest of the experiments were performed with different graphs, and it was validated that the algorithm recovers the differences introduced by the generator. The algorithm was tested with solution graphs with sizes up to 30 and attempt graphs with a size variation of up to 8 nodes. The execution time in these assessments is below 40 ms, with a tendency to increase with larger size differences. The number of incomplete assessments is below 5% for solution graphs with up to 25 nodes and a difference in number of nodes of less than 7.
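The 2-second budget mentioned above can be enforced, for instance, with a standard Java future. The snippet below is a generic illustration of that set-up, not the harness actually used in the experiments.

import java.util.concurrent.*;

// Run a task with a fixed 2-second budget; on timeout the caller falls back to
// the best result computed so far, and the assessment is marked as incomplete.
final class TimeBoxed {
    static <T> T run(Callable<T> task, T bestSoFar) throws InterruptedException, ExecutionException {
        ExecutorService pool = Executors.newSingleThreadExecutor();
        Future<T> future = pool.submit(task);
        try {
            return future.get(2, TimeUnit.SECONDS);   // completed within the budget
        } catch (TimeoutException e) {
            future.cancel(true);                      // incomplete assessment
            return bestSoFar;
        } finally {
            pool.shutdownNow();
        }
    }
}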

5 Conclusions and future work

The main contribution of this paper is an algorithm for assessing graphs driven by their structure. It computes both a grade and an explanation, a data object that can be serialized into a natural language text or used as input by other systems. The assessment algorithm determines the best mapping between nodes in a solution graph and nodes in an attempt graph.


The mapping is the best in the sense that it maximizes the student's grade. The algorithm validation ensured its efficiency for connected graphs with up to 30 nodes, which should cover the needs of exercise assessment. It suggests that automatic assessment systems for diagrams can easily be implemented based on this algorithm. The next step is to validate assessment systems, rather than just the assessment algorithm, by using them with actual graph-based languages and actual students. An experiment is already scheduled for the last month of the current school year with students of a software architecture course. A parser for the XML documents produced by the DIA diagram editor1 is already in development. The motivation for this assessment methodology is to blend it with other assessment methodologies, notably the test-based assessment used with programming languages. This will require the study of the existing document type definitions for abstract semantic graphs of programming languages used in introductory programming courses, such as Java, C/C++, Python and C#, and of the existing tools for extracting abstract semantic graphs.

Acknowledgments. Project NORTE-07-0124-FEDER-000059 is financed by the North Portugal Regional Operational Programme (ON.2 O Novo Norte), under the National Strategic Reference Framework (NSRF), through the European Regional Development Fund (ERDF), and by national funds, through the Portuguese funding agency, Fundação para a Ciência e a Tecnologia (FCT).

References

1. Ali, N.H., Shukur, Z., Idris, S.: A design of an assessment system for UML class diagram. In: Computational Science and its Applications (ICCSA 2007), International Conference on, pp. 539–546. IEEE (2007)
2. Alur, R., D'Antoni, L., Gulwani, S., Kini, D., Viswanathan, M.: Automated grading of DFA constructions. In: Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence, pp. 1976–1982. AAAI Press (2013)
3. Batmaz, F., Hinde, C.J.: A diagram drawing tool for semi-automatic assessment of conceptual database diagrams (2006)
4. Douce, C., Livingstone, D., Orwell, J.: Automatic test-based assessment of programming: A review. Journal on Educational Resources in Computing (JERIC) 5(3), 4 (2005)
5. Hell, P., Nešetřil, J.: Graphs and homomorphisms. Oxford University Press (2004)
6. Shukur, Z., Mohamed, N.F.: The design of ADAT: A tool for assessing automata-based assignments. Journal of Computer Science 4(5), 415 (2008)
7. Soler, J., Boada, I., Prados, F., Poch, J., Fabregat, R.: A web-based e-learning tool for UML class diagrams. In: Education Engineering (EDUCON), 2010 IEEE, pp. 973–979. IEEE (2010)
8. Thomas, P., Smith, N., Waugh, K.: Automatically assessing diagrams. In: Proceedings of the IADIS International Conference e-Learning, vol. 2009 (2009)
9. Thomas, P., Waugh, K., Smith, N.: Automatically assessing free-form diagrams in e-assessment systems. In: 1st HEA Aiming for Excellence in STEM Learning and Teaching Annual Conference, Imperial College London (2012)
10. Vachharajani, V., Pareek, J.: A proposed architecture for automated assessment of use case diagrams. International Journal of Computer Applications 108(4), 35–40 (December 2014)

1 http://dia-installer.de/

Automatic generation of CVs from Online Social Networks

Sérgio Maia Dias1,2, Alda Lopes Gançarski4,5, and Pedro Rangel Henriques1,3

1 University of Minho, Department of Computer Science, CCTC, Campus de Gualtar, Braga, Portugal
2 [email protected] 3 [email protected]
4 Institut Telecom, Telecom SudParis, CNRS SAMOVAR, 9 rue Charles Fourier, 91011 Évry, France
5 [email protected]

Abstract. Since the explosion of social media use, with user information being disseminated and dynamically updated, Curriculum Vitae (CV) documents have started to be automatically generated, compiling that information and returning it to the user, usually in PDF format. However, existing CV generation tools do not use a standard CV structure format, which should be generic enough for common user needs but also offer domain-specific components for certain work environments, such as academia and research. Another difficulty with most existing tools is that they return the CV in a printable file format that is not easily editable. In this paper, we introduce CVGenie, a system to automatically generate CVs from information available in Online Social Networks. The system uses the EuroPass CV standard, extended with domain-specific components. The CV file format is the XML dialect of EuroPass, because not only is it editable, but it also allows for interoperability with other applications.

1 Introduction

In a professional environment, individuals need a way to expose their employment history, qualifications and education, to prove their worth in a competitive marketplace. In an academic environment, individuals also have this requirement, although it is tailored to the nature of the individuals' work, i.e., more focused on their research and teaching. Previously, this requirement was satisfied by a curriculum vitae (CV), a document that contains an overview of the aforementioned aspects of an individual's career. With the emergence of the web, those aspects started to be exposed on personal or institutional web pages. Then, with the popularity of social media, professional-oriented Online Social Networks (OSN) started to emerge and became essential for personal promotion in the labor market. In these social networks, users share their professional experience and the projects (themes and collaborators) they were or are involved with.


An example of these OSNs is LinkedIn [10], where users enumerate the companies or institutions they have worked for in the past, list the projects they have been involved in, describe their education, and create an explicit network of past or present collaborators. Other examples, in more academically focused social networks, are Academia [1] and ResearchGate [12], where the users' publications and projects are displayed. Nonetheless, this method of self-marketing is insufficient for some processes; to apply for a position at a company, candidates are usually required to present their professional experience in a succinct manner, ordinarily in a document for that purpose, i.e., a CV. It is thus clear that there is a need to use the information available in these new forms of professional exposure for the automatic generation of CVs. This paper describes ongoing work on the creation of CVGenie, a tool that, using the information available online on specific social networking platforms, like LinkedIn [10] or Academia [1], generates CVs for the end user, who can then use them as needed. This paper is organised as follows: Section 2 introduces existing CV formats. Existing systems that perform CV generation are analysed in Section 3. Section 4 describes the proposed system, showing its main features and architecture. We finish the paper with a conclusion and the planned future work.

2 Curriculum Vitae Standard Formats

Nowadays, several CV formats exist, defined by different institutions or countries. Although there is no globally accepted standard, the EuroPass format [8] comes close: it is an initiative by the Directorate-General for Education and Culture of the European Union to standardize CV documents, and it is widely accepted in several countries of the European Union, like Portugal and Spain. Other CV formats are more focused on specific areas of expertise, such as the academic format of The Career Center of the University of Washington (UW's Academic CV) [2], or the College Art Association's Visual Artist Format (CAA's Artist CV) [3]. Since these formats are tailor-made for specific individuals or specific situations, they are not universally accepted. Moreover, they are also not very strict, since they do not define a clear set of rules that a CV must comply with, delegating that responsibility to the individual writing the document. CV formats vary according to the individual's context (geographical location, area of expertise) and the needed level of detail. However, a core set of information is common among most formats, as shown in Table 1. UW's Academic CV and CAA's Artist CV are examples of formats dedicated to a specific work domain. In turn, EuroPass is a popular general-purpose format used by individuals in a wide range of areas of expertise, which is why it is our system's CV format. Domain-specific content and the desired level of detail will be integrated in EuroPass through adequate extensions.


                    EuroPass        UW's Academic CV   CAA's Artist CV
Name                Yes             Yes                Yes
Gender              Yes, optional   No                 No
Date of Birth       Yes, optional   No                 Yes, optional
Addresses           Yes, optional   Yes                Yes, optional
Contacts            Yes             Yes                Yes
Education           Yes             Yes                Yes
Work Experience     Yes             Yes                Yes
Skill Set           Yes             Yes                Yes
Affiliations        Yes, optional   Yes, optional      Yes, optional
Seminars            Yes, optional   Yes, optional      Yes, optional
Publications        Yes, optional   Yes                Yes
References          Yes, optional   Yes, optional      Yes, optional

Table 1. Common information for existing formats

3 Existing Solutions for CV Generation from Online Social Networks

Several applications (software packages) exist for CV generation from OSN, like Yevgeniy Brikman's Resume Builder [11], DoYouBuzz.com [6], VisualCV [13], Create-CV.com [4] and EGrabber's ResumeGrabber Suite [7]. These applications are capable of extracting information from OSN, LinkedIn being the most common source of data. Some, like Create-CV.com, even support other social networking and messaging services, like Facebook, Twitter or Skype, although most of these are either poor sources of professional information or require extensive mechanisms for extracting data, which makes them unfeasible. The main objective of some of these applications is to collect the information and make it available online with a structure commonly associated with CVs, with the option of further editing it with the integrated tools and possibly exporting it as a file ready for printing. This is the case of Resume Builder, DoYouBuzz.com and VisualCV. Create-CV.com simply allows exporting the end result as a file and does not allow displaying it online. These solutions focus on being complete and self-sufficient once the information has been retrieved from the data sources. In doing so they become less useful, because they do not interoperate with other systems and are not able to integrate different complementary functionalities. In addition, the files generated by those solutions cannot easily be reused by other systems. For example, none of these tools generates files in a format that is easily editable externally, or that follows a standard like EuroPass (which uses an XML-based format) and can therefore be integrated with other systems. EGrabber's ResumeGrabber Suite is a substantially different solution, since it is not focused on the individual described in the CV as the end user, but is a tool to be used in the context of an organization for the integration of CV information of the organization's collaborators or job applicants.


Import info from online media:
  Resume Builder: Yes, from LinkedIn. DoYouBuzz: Yes, from LinkedIn and Viadeo. VisualCV: Yes, from LinkedIn. Create CV: Yes, from LinkedIn, Facebook, Twitter, Skype, etc. (although it only retrieves basic information).
Import info from files:
  Resume Builder: No. DoYouBuzz: No. VisualCV: Yes, from PDF and Word. Create CV: No.
Export CV in read-only file format:
  Resume Builder: Yes, in PDF. DoYouBuzz: Yes, in PDF, although it only exports the basic information. VisualCV: Yes, in PDF. Create CV: Yes, in PDF.
Export CV in editable file format:
  Resume Builder: No. DoYouBuzz: Yes, in Word, although it only exports basic information. VisualCV: No. Create CV: No.
Has an integrated editor:
  Resume Builder: Yes. DoYouBuzz: Yes, with many features. VisualCV: Yes, with many features. Create CV: No.
Can share CV online:
  Resume Builder: Yes, through a link. DoYouBuzz: Yes, through a link, social networks or publicly on search engines. VisualCV: Yes, through a link, social networks or publicly on search engines. Create CV: Yes, through a link.
Includes info for specific areas of expertise:
  Resume Builder: No. DoYouBuzz: No. VisualCV: No. Create CV: No.

Table 2. Comparison of features of the identified CV generation systems

Table 2 depicts the features of each tool, establishing a comparison and identifying the main issues that were detected. From Table 2, we see that the main objective of these systems is to collect the individual's data, allow it to be easily edited inside the system, and easily share it online or export it in a read-only format. The biggest flaws of these systems are:

– In general, they do not allow exporting the CV in an editable format.
– In general, they only import information from the most popular source of professional data, i.e., LinkedIn.
– They only include in the CV the core set of information identified in Table 1; more specific information, like detailed academic information, is not considered.

These aspects are considered in the proposed CVGenie system.


4 CVGenie System: Requirements and Architecture

The projected system is designed for users who want to generate their own CVs from the information about themselves available online. The features that shall be provided by the system are the following:
– Import information from OSN: the information required for the CV will be obtained from the selected online social networks (LinkedIn, Academia, ResearchGate and Behance). The latter will be used mainly for information for the academic and artistic sections of the CV.
– Import information from files: the system will be able to import information from files in the standard EuroPass XML format as well as from LaTeX files that use the EuropeCV [5] package.
– Export CV in an editable format: the system will be able to export CV documents in the EuroPass XML format, making it possible to later import these documents into EuroPass-compatible systems (a brief sketch is given at the end of this section).
– Export CV in a read-only format: the system will be able to export CV documents as PDF files or simple HTML websites.
– Include information for specific areas of expertise: extensions to EuroPass will be proposed to support specific formats like UW's Academic CV and CAA's Artist CV.
The proposed CVGenie architecture, presented in Fig. 1, is similar to the OAIS model [9]; it provides the same standard features, like information ingestion, data management and storage, and knowledge dissemination. The sources of data for the CVGenie tool will be offline media, which includes both EuroPass XML documents and LaTeX documents using the EuropeCV package, and online media from user profiles in various online social networks such as LinkedIn, Behance, Academia and ResearchGate. This information will be ingested by the system: it will be interpreted and stored. Afterwards, the user can export the information in an editable format, namely the EuroPass XML format, or in a read-only format, which will include both PDF documents and simple HTML websites.
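As an illustration of the editable export listed above, the following sketch writes a small CV fragment with the standard Java StAX API. The element names are invented for this example and do not follow the actual EuroPass XML schema; a real exporter would have to comply with the official schema definitions.

import java.io.StringWriter;
import javax.xml.stream.XMLOutputFactory;
import javax.xml.stream.XMLStreamWriter;

// Writes an editable XML fragment for one work-experience entry imported from an OSN.
public class CvXmlExportSketch {
    public static void main(String[] args) throws Exception {
        StringWriter out = new StringWriter();
        XMLStreamWriter xml = XMLOutputFactory.newInstance().createXMLStreamWriter(out);
        xml.writeStartDocument("UTF-8", "1.0");
        xml.writeStartElement("CurriculumVitae");          // hypothetical root element
        xml.writeStartElement("WorkExperience");
        xml.writeStartElement("Position");
        xml.writeCharacters("Software Engineer");          // value imported from an OSN profile
        xml.writeEndElement();
        xml.writeStartElement("Employer");
        xml.writeCharacters("Example Corp");
        xml.writeEndElement();
        xml.writeEndElement();
        xml.writeEndElement();
        xml.writeEndDocument();
        xml.close();
        System.out.println(out);                           // editable XML, reusable by other tools
    }
}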

5 Conclusion and future work

The system we propose in this paper is dedicated to automatic CV generation using user information from OSN. Our system is intended to extract information from the most popular OSN, while returning the CV information in a generic standard format that can support domain-specific extensions if needed. The standard CV structure adopted is EuroPass, in its XML format, but other file formats can be produced, like PDF or HTML.

SLATE’2015 28 Sérgio Maia Dias, Alda Lopes Gançarski and Pedro Rangel Henriques

Fig. 1. The proposed system architecture

As future work, we intend to perform the following tasks:

– Formally define extensions to the EuroPass format dedicated to specific domains.

– Carry out a large and rigorous system validation and evaluation with users from different interest domains having accounts on different OSN.

References

1. Academia.edu - Share research, 2014. http://www.academia.edu
2. Academic Careers Curriculum Vitae - The Career Center of the University of Washington, 2014. http://careers.washington.edu/ifiles/all/files/docs/gradstudents/pdfs/AcademicCareers-Curriculum_Vitae_07-08.pdf
3. Standards and Guidelines, College Art Association (CAA), 2014. http://www.collegeart.org/guidelines/visartcv
4. Create my CV online for free, 2015. http://Create-CV.com
5. CTAN: Package europecv, 2015. http://www.ctan.org/pkg/europecv
6. DoYouBuzz: Your best resume, 2015. http://DoYouBuzz.com
7. Software to Import Resumes, 2015. http://www.egrabber.com/resumegrabbersuite
8. Europass: Curriculum Vitae, 2014. europass.cedefop.europa.eu/en/documents/curriculum-vitae
9. ISO 14721:2012 - Space data and information transfer systems - Open archival information system (OAIS) - Reference model, 2015. http://www.iso.org/iso/home/store/catalogue_ics/catalogue_detail_ics.htm?csnumber=57284
10. LinkedIn: World's Largest Professional Network, 2014. http://www.linkedin.com
11. Turn your LinkedIn Profile into a Resume - Resume Builder, 2015. http://resume.linkedinlabs.com
12. ResearchGate, 2014. http://www.researchgate.net
13. VisualCV - Online CV Builder and Professional Resume CV Maker, 2015. http://VisualCV.com

Knowledge Extraction from Requirements Specification

Eduardo Barra1,* and Jorge Morato1

1Universidad Carlos III de Madrid, Avda. Universidad 30, 28911 Leganés, Madrid, Spain. [email protected] [email protected]

Abstract. One of the main artifacts in Requirements Engineering is the Requirements Specification (RS). Throughout the life cycle of the RS, the need arises to extract knowledge in order to facilitate communication with stakeholders. However, this process is not usually efficient. In the different proposals for the representation of an RS, conflicts, coupling and redundancy of requirements often arise. The Aspect-Oriented paradigm provides principles to address a multidimensional modelling that avoids overlapping between requirements. Knowledge Engineering is proposed to provide an organized modelling of the knowledge, based on the representation of ontologies, that allows an efficient extraction of knowledge from the requirements. An experimental study has been developed to assess its efficiency when compared with classical methods.

Keywords: Requirements Engineering · Requirements Specification · Aspect-Oriented · Knowledge Engineering · Ontology · Knowledge Extraction.

1 Introduction

The most important artifact used to transmit knowledge between the different activities of any Requirements Engineering (RE) process is the Technical Specification Document (TSD) in natural language, which is often supplemented with graphical models for better understanding. The Requirements Specification (RS) is the section of a TSD that contains the knowledge specification of a software product. The internal organization of an RS is aimed at reducing the complexity of its semantics and improving the global understanding of the requirements for the effective transmission of knowledge. However, there is wide agreement on organizing requirements in a simplistic way, into just two types: functional and non-functional. In this convention the group of functional requirements is accepted by most analysts; however, the organization of non-functional requirements is ambiguous and subjective. Different researchers claim that, in fact, they impose restrictions on the functional requirements [1]. Therefore, they should be viewed as properties of the future software product [2] or as requirements that indicate quality [3]. We think that the Aspect-Oriented (AO) paradigm [4] provides the resources to develop models that allow a multidimensional organization without compromising its integrity. Therefore, the representation of an RS with this paradigm reduces the conflicts, couplings and redundancy between requirements. This reduction in the complexity of a software product facilitates the extraction of knowledge, its reuse and its management.


The discipline of Ontology Engineering (OE) is often applied to Knowledge Engineering (KE). The best example is Semantic Web technology, where ontologies allow creating an explicit and formal specification of knowledge, managed through computers, that provides reusability and shareability [5, 6]. Therefore, our hypothesis is that the representation of an RS through ontologies is an important factor in obtaining an efficient representation of knowledge. With this background, this research proposes guidelines according to the AO paradigm, based on principles of KE applied to Requirements Engineering (RE), for the development of an RS. OE provides a multidimensional semantic model with a well-founded organization that facilitates the extraction of knowledge. In accordance with these ideas, the rest of the paper is structured as follows. Sec. 2 briefly discusses related work on the modeling of an RS based on the AO paradigm and OE. In Sec. 3, some guidelines for the modeling of an RS are proposed and exemplified. In Sec. 4, an evaluation is developed to demonstrate that the representation with the guidelines is effective and efficient in extracting knowledge from an RS. Finally, Sec. 5 presents the conclusions and future work.

2 Related Work

In recent years, many studies have shown the potential of the AO paradigm focused on the early stages of software development. These proposals are grouped under the term Aspect-Oriented Requirements Engineering (AORE) [7] [8]. The first step in this research has been to study the most representative approaches in AORE, in order to find well-founded approaches to include in our work. In 2003, Rashid et al. proposed using the XML language to index the description of the requirements for the management of concerns [9]. In another proposal, "Multi-Dimensional Separation of Concerns in Requirements Engineering" [10], the suggested solution for RE is the separation of concerns into multidimensional categories, in order to provide a categorization modeled from different points of view. Yu and Prado's work [11] proposes aspect-orientation techniques to manage objectives and suggests mechanisms to avoid conflicts between requirements. Another proposal, which combines aspect-oriented analysis and design [12], highlights the need to decompose the requirements into more elementary ones. Finally, the research carried out by Jacobson and Ng shows that when the concepts of the AO paradigm are related to "use cases", a positive influence on identifying concerns can be observed [13]. In these proposals, different contributions to consider were found, but also different problems. These approaches only consider the modelling of requirements as descriptions in natural language, where a requirement may belong to different aspects. The modelling of requirements in these proposals lacks a clear technique for the modelling of knowledge, generating conflict, coupling and semantic redundancy. AORE proposals are aimed at the identification, separation and composition of concerns, but they are not oriented towards representing knowledge in an efficient way. Additionally, different ontology-driven (OE) approaches to support RE have been analyzed [14, 15]. Although modeling the knowledge of the domain through ontologies addresses an obvious need, there is a lack of well-established natural language techniques to represent the internal semantic relationships among requirements.


Besides, the need to develop organizational structures to represent the Requirements Specification (RS) is usually overlooked.

3 Guidelines for the Semantic Modeling of a RS

The first stage of the guidelines proposed in this work focuses on the importance of a sound and adequately substantiated organization in the development of an RS. In this case, we propose the creation of a structure to specify the requirements. This structure is designed in accordance with different viewpoints. The main goal is to provide a way to develop an RS that improves the understanding of the requirements as a whole. In this regard, we propose the use of an Architecture Viewpoint (AVP) that provides a schema for developing the RS. The AVP is modeled on a domain ontology that provides a guide for modelling responsibilities. The goal is to identify the viewpoints that will group both the dominant concerns and the viewpoints related through crosscutting concerns. The high-level viewpoints proposed in the AVP act as containers of lower-level viewpoints, in a recursive nesting down to the last-level viewpoint. In the second stage, the early concerns of a software product are modeled according to the AVP base, in an organized knowledge representation. These guidelines intend to help model the early knowledge using ontologies, leaving as a secondary activity the elaboration of a description in natural language to facilitate understanding by non-expert stakeholders. The modelling of concerns through Ontology Engineering has its major support in the Web Ontology Language (OWL). This is a recommendation introduced in 2004 by the W3C for building the Semantic Web, and it is the most popular language for the semantic description of ontologies. Accordingly, OWL is the language selected in these guidelines for the modelling of an RS. The application of the guidelines involves modeling the knowledge of the requirements, typical of an RS, with the specification of the properties between concerns. There are two main types of properties that OWL represents as relationships: "Object Properties" and "Data Properties". The most important relationships in these guidelines are the "Object Properties", used to model the semantic relationships. The modelling of the early knowledge of a software product assisted by these guidelines also involves the use of natural language, in order to allow the developer a richer description of the concerns, in the same way as is done with typical requirements. This information is added to the "Annotation Properties". An efficient representation of a specification requires splitting its components into elements like concerns, information entities, roles, conditions, and so on. Usually specifications are expressed in natural language (NL). Unfortunately, NL is complex and prone to ambiguities. An example may explain some of these difficulties of modeling processes in an RS. Figure 1 shows the decomposition of a concern expressed in natural language.


Fig. 1. Constraints related to the concern “Incidence Data Register”
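To give a flavour of what such a modelling looks like in practice, the next sketch uses the OWL API to declare a concern, relate it to a constraint through an object property, and attach its natural-language description as an annotation. The namespace, class names and comment text are invented for illustration and do not reproduce the guidelines' actual ontology.

import org.semanticweb.owlapi.apibinding.OWLManager;
import org.semanticweb.owlapi.model.*;
import org.semanticweb.owlapi.vocab.OWLRDFVocabulary;

// Models a concern as an OWL class, links it to a constraining concern via an
// object property, and keeps its natural-language description as rdfs:comment.
public class ConcernModelling {
    public static void main(String[] args) throws OWLOntologyCreationException {
        String ns = "http://example.org/rs#";                       // hypothetical namespace
        OWLOntologyManager man = OWLManager.createOWLOntologyManager();
        OWLDataFactory df = man.getOWLDataFactory();
        OWLOntology ont = man.createOntology(IRI.create("http://example.org/rs"));

        OWLClass register = df.getOWLClass(IRI.create(ns + "IncidenceDataRegister"));
        OWLObjectProperty constrainedBy =
                df.getOWLObjectProperty(IRI.create(ns + "isConstrainedBy"));
        OWLClass constraint = df.getOWLClass(IRI.create(ns + "AccessControlConstraint"));

        man.addAxiom(ont, df.getOWLDeclarationAxiom(register));
        man.addAxiom(ont, df.getOWLSubClassOfAxiom(register,
                df.getOWLObjectSomeValuesFrom(constrainedBy, constraint)));

        // the natural-language description goes into an annotation property
        OWLAnnotation comment = df.getOWLAnnotation(
                df.getOWLAnnotationProperty(OWLRDFVocabulary.RDFS_COMMENT.getIRI()),
                df.getOWLLiteral("Registers the data of an incidence reported by a user."));
        man.addAxiom(ont, df.getOWLAnnotationAssertionAxiom(register.getIRI(), comment));
    }
}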

4 Evaluation

In order to show the efficiency of the guidelines for RS creation, we developed an experiment comparing our guidelines with a classical development. The proposed case study was to develop an application to manage a "stadium for athletics events". In step 1 of the experiment, a solution was developed with the method described (hereafter referred to as specification B). Besides, a description of the case study was given to 32 teams. The project was given to students of Software Engineering in the final year of the degree in computer science. Each team comprised 5 members, and they were allowed to base their solution on the IEEE 830 or ESA PSS-05 standards. The teams had to specify the TSD of a software product under a classical development process. The specifications developed under a classical methodology were analyzed by five researcher-teachers, all of them experts in RE. The experts selected the two best ones, which we call specifications A and C for short. The second step of the experiment involved extracting shared crosscutting concerns, dominant concerns and common information entities from the different solutions. Next, the concerns were identified by the experiment referee in each of the three solutions developed. In the third step, we asked six software analysis experts from different software factories to manage the three specifications when facing a major update of the software. They were therefore required to obtain the same knowledge from the three specifications. In order to avoid a result biased by learning of the domain, the specifications were given to the experts in a specific order. In so doing, the first specification given is not at a disadvantage in terms of effectiveness of change when compared with the other specifications. The first specification given was the one developed with the classical methodology, named A. When the update was finished, specification B, made with the guidelines, was given to the experts, and finally specification C. In the part of the experiment concerning the extraction of knowledge from the specifications developed under the classical process (A and C), every expert analyst received the Technical Specification Document (TSD) with its corresponding digital archive of the RS, to be used in requirements management. In the part of the experiment concerning the extraction of knowledge from the specification developed under our guidelines, the six expert analysts received an OWL archive. Tools for ontologies, such as editors, reasoners and a search engine, were provided to the experts to carry out the update. The time to work on each specification was limited to 30 minutes.


In step 4, the results were compared and analyzed. The evaluation was made by quantifying the concerns to update that were correctly identified by the expert analysts out of the total knowledge proposed in each specification. The result is shown in Figure 2. We can clearly see that a specification of a software product developed in the early stages with the guidelines allows knowledge to be extracted more efficiently.

Fig. 2. Percentage of correct concerns identified according to their type

5 Conclusions and Future Works

The first proposal of the presented approach is to use the AO paradigm concepts to obtain a multidimensional modeling. This step allows us to avoid conflicts, such as couplings and redundancy, in representing the RS. These conflicts appear in a recurrent way when modelling an RS under classical methodologies. An additional goal is to prevent problems due to ambiguities when representing the RS in natural language. In this sense, the KE concepts allow solving the problem of knowledge representation in the early stages of a software product. One of the contributions of our approach is the proposal of guidelines that improve the RE process by creating an RS model that facilitates knowledge extraction. The guidelines are based on improving the semantic representation of the requirements with a multidimensional knowledge organization. Several approaches exist for RS modelling based on AO and KE. They typically lack knowledge organization and structure. Therefore, they are neither efficient nor effective in defining the knowledge that different stakeholders require. We developed an experiment with the proposed guidelines to study the effectiveness of the approach. The result of this evaluation showed that the solution proposed in this work substantially improves knowledge extraction in the early stages, without requiring a large investment of resources to try it. As future work we are developing guidelines for the creation of the AVP in a systematic way, which will allow us to structure and organize those aspects that support the evolution of a software product.


In this regard, the automation of the guidelines will provide the support for the creation of the AVP needed by software factories, where the resulting AVP could be reused within the same product line. To complement this work, we intend to create a tool to manage concerns. This tool must provide the resources that a typical requirements management tool offers, but respecting the concepts of the conceptual AO model in a clear way. That is, through a virtual guide it will support the description of the requirements while modelling the attributes with the necessary concerns that describe their knowledge.

References

1. Sommerville, I.: Software Engineering. Pearson Education, Boston (2005)
2. Jacobson, I., Booch, G., Rumbaugh, J.: The Unified Software Development Process. Addison-Wesley, Reading (MA) (1999)
3. Doerr, J., Kerkow, D., Koenig, T., Olsson, T., Suzuki, T.: Non-functional requirements in industry. In: 13th IEEE Int. Conf. Requirem. Engin. Proc., pp. 373–382. IEEE, NY (2005)
4. Kiczales, G., Lamping, J., Mendhekar, A., Maeda, C., Lopes, C.-V., Irwin, J.: Aspect-Oriented Programming. In: ECOOP '97, vol. 1241, pp. 220–242. Springer, Berlin (1997)
5. Berners-Lee, T., Hendler, J., Lassila, O.: The Semantic Web. Scientific American, 284(5), 29–37 (2001)
6. Brewster, C., O'Hara, K.: Knowledge representation with ontologies: the present and future. IEEE Intelligent Systems, 19(1), 72–81 (2004)
7. Grundy, J.: Aspect-Oriented Requirements Engineering for Component-Based Software Systems. In: Requirements Engineering, IEEE Internat. Conf., p. 84. IEEE, NY (1999)
8. Rashid, A., Sawyer, P., Moreira, A.M.D., Araújo, J.: Early Aspects: A Model for Aspect-Oriented Requirements Engineering. In: Proceedings International Conference on Requirements Engineering, pp. 199–202. IEEE, NY (2002)
9. Rashid, A., Moreira, A., Araújo, J.: Modularisation and composition of aspectual requirements. In: Proc. 2nd Int. Conf. Aspect-oriented Soft. Dev., pp. 11–20. ACM, Boston (2003)
10. Moreira, A., Rashid, A., Araujo, J.: Multi-Dimensional Separation of Concerns in Requirements Engineering. In: 13th IEEE Conf. Requir. Eng., pp. 285–296. IEEE, NY (2005)
11. Yu, Y., do Prado Leite, J.C.S., Mylopoulos, J.: From Goals to Aspects: Discovering Aspects from Requirements Goal Models. In: Requirements Engineering Conf., 12th IEEE International, pp. 38–47. IEEE, NY (2004)
12. Baniassad, E., Clarke, S.: Theme: an approach for aspect-oriented analysis and design. In: Proc. ICSE'04, pp. 158–167. IEEE, Washington (2004)
13. Jacobson, I., Ng, P.-W.: Aspect-Oriented Software Development with Use Cases. Addison-Wesley, NJ (2005)
14. Kaiya, H., Saeki, M.: Using Domain Ontology as Domain Knowledge for Requirements Elicitation. In: 14th IEEE Int. Requir. Eng., pp. 189–198. IEEE, Los Alamitos (CA) (2006)
15. Jureta, I.J., Mylopoulos, J., Faulkner, S.: Revisiting the core ontology and problem in requirements engineering. In: RE '08, pp. 71–80. IEEE, Los Alamitos (CA) (2008)

Domain Specific Languages for Data Mining: A Case Study for Educational Data Mining

Alfonso de la Vega, Diego García-Saiz, Marta Zorrilla, and Pablo Sánchez

Dpto. Ingeniería Informática y Electrónica, Universidad de Cantabria, Santander (Spain)
[email protected], {diego.garcia,p.sanchez,marta.zorrilla}@unican.es

Abstract. Nowadays, most companies and organizations rely on computer systems to run their work processes. Therefore, the analysis of how these systems are used can be an important source of information for improving these work processes. In the era of Big Data, this is perfectly feasible with current state-of-the-art data analysis tools. Nevertheless, these data analysis tools cannot be used by general users, as they require a deep and sound knowledge of the algorithms and techniques they implement. In other areas of computer science, domain-specific languages have been created to abstract users from the low-level details of complex technologies. Therefore, we believe the same solution could be applied to data analysis tools. This article explores this hypothesis by creating a Domain-Specific Language (DSL) for the educational domain.

1 Introduction

Nowadays, most work processes in companies and organizations are supported by a software system. Thus, the way in which people interact with these systems somehow reflects how these processes are actually executed. Therefore, a careful analysis of this interaction can help to find flaws in these processes that might be removed [4]. For instance, let us suppose a company wants to reduce the number of products that are returned after having been shipped. In this scenario, the company managers might be interested in getting answers to questions such as: "What features do the products returned by customers share?"; or "What is the profile of the unsatisfied customers?". Decision makers need to know these answers before adopting corrective actions. Currently, it is feasible to perform this data analysis by using Big Data [3] technologies. For instance, the profiles of the unsatisfied customers can be computed using clustering techniques [8]. Nevertheless, these techniques require a sound knowledge of the algorithms and mathematical foundations they use. However, average decision makers do not have this knowledge. For example, to execute a clustering, the user might need to know how a clustering algorithm like k-means [1] works, how its parameters must be configured, or what the advantages of k-means are compared to other clustering algorithms.


Since decision makers lack this knowledge, they need to rely on third parties to carry out these data analysis processes, which increases costs and decreases productivity. In other areas of software development, Domain-Specific Languages (DSLs) [9, 14] have been created in order to allow users without expertise in a certain technology to use it. This is achieved by abstracting away the low-level details of the underlying technology and by using a syntax and terminology familiar to the end user. Therefore, we propose to build DSLs for data analysis. These DSLs would allow decision makers to formulate queries about the performance of a business process using a syntax and terminology familiar to them. Then, these queries would be automatically transformed into invocations of specific data analysis algorithms. The DSL syntax should hide all the details associated with data analysis techniques from the end user, who may remain unaware of how these techniques are used. This article explores the feasibility of this idea by showing how a DSL with these characteristics can be developed for the e-learning domain. The objective of this DSL is to analyze the performance of a course hosted on an e-learning platform, such as Moodle, by using data, like the students' activity, gathered by this kind of platform. The final users of the DSL will be teachers and instructors, so it must use a syntax and terminology familiar to them. Similar DSLs might be created for other domains following the process described in this article. For the development of the DSL, we make use of modern model-driven engineering techniques. More specifically, we follow the development process proposed by Kleppe [9]. After this introduction, this article is structured as follows: Section 2 describes the domain our DSL targets. Section 3 comments on related work. Sections 4 and 5 explain how a DSL for the educational domain has been developed. Finally, Section 6 discusses the benefits of this work and concludes this article.

2 Educational Data Mining

The first step in developing a DSL is to know for what purpose the DSL will be used. In our case, we are interested in knowing what kind of questions decision makers would like to ask. In this article, we focus on the Educational Data Mining domain, so we describe it briefly. Data Mining is the process of discovering interesting patterns and knowledge from large amounts of data [8]. In the last few years, it has been applied to the educational domain, in what is known as Educational Data Mining (EDM) [13]. Educational Data Mining aims to take advantage of the data gathered by e-learning platforms, such as BlackBoard or Moodle, which store data related to the activity carried out by the students of their courses. Educational Data Mining is defined as "an emerging discipline, concerned with developing methods for exploring the unique types of data that come from educational settings, and using those methods to better understand students, and the settings which they learn in." [13].


The discovered information can be useful for teachers and instructors in order to improve the performance of their teaching-learning processes. For instance, at the beginning of a course, a teacher might be interested in knowing what kinds of student profiles exist. Based on the obtained information, the teacher might adapt the course before it starts in order to tune it for these students. Thus, at the beginning of the course, the teacher could ask: "What are the profiles of my students?". This information can be computed by using clustering techniques [8] on the students' data. When the course finishes, teachers are usually worried about the students that have not passed the course. Therefore, they would like to refine the previous question and ask: "What are the profiles of the students who have not passed?". As before, this information can be computed using clustering techniques on the students' data, but removing from the data set those students that have passed. Moreover, teachers are obviously interested in asking "What are the reasons why my students failed?". This might be partially answered by applying classification rules [8] to the students' data, analyzing the student activity logs to find out these reasons. Obviously, most teachers know nothing about clustering and classification rules, so they cannot use these techniques directly by themselves. This is the reason behind the aim of hiding these details from the end user. The next section analyses whether this objective can be achieved using current state-of-the-art techniques.
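For reference, the kind of low-level invocation that such questions ultimately translate into, and that the DSL is meant to hide from teachers, might look like the following Weka-based sketch. The dataset file name and the choice of SimpleKMeans with three clusters are assumptions made only for this illustration.

import weka.clusterers.SimpleKMeans;
import weka.core.Instances;
import weka.core.converters.ConverterUtils.DataSource;

// Direct Weka call answering "What are the profiles of my students?" via clustering.
public class StudentProfilesSketch {
    public static void main(String[] args) throws Exception {
        Instances students = new DataSource("students.arff").getDataSet();  // hypothetical dataset
        SimpleKMeans clusterer = new SimpleKMeans();
        clusterer.setNumClusters(3);              // a parameter the DSL would hide or self-compute
        clusterer.buildClusterer(students);
        System.out.println(clusterer);            // cluster centroids describe the profiles
    }
}

Writing and tuning this kind of code is precisely the expertise that cannot be expected from instructors.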

3 Related Work

In general, to the best of our knowledge, little work has been done on making data analysis techniques more usable for decision makers. The approaches that tackle this issue can be grouped into two sets. The first group aims to assist decision makers in the process of defining a data analysis process. For instance, [5] defines a method where users are prompted with different questions, which guide them to the definition of a data mining process that fits their needs. For example, a question could be whether the decision maker is interested in computing the profiles of a certain dataset. If so, the user is asked for the more detailed information that is required to execute this task. Some of these questions might be confusing. For example, the user might be asked how a certain data item is represented, whether as a string or as a numerical value. In addition, answering these questions can be a tedious process. In [2], a query-by-example based language is defined. In basic query-by-example, the decision maker constructs a prototype of the answer he or she would like to get. This prototype is a table, where each column represents an attribute of the desired answer. These columns can be constrained to certain values, which are used to filter certain results. [2] enhances this table with specific columns to execute data mining processes. Again, the information we need to supply in these columns requires some knowledge of the underlying data analysis technique to be applied, so the user is not completely unaware of these low-level details.


Moreover, the construction of these prototypes is based on data warehouse concepts, such as OLAP (On-Line Analytical Processing) [16]. Average decision makers also often lack this kind of knowledge. In the second group, software applications with prebuilt data mining processes that can be directly executed by decision makers are developed. An example of this strategy is e-Learning Web-Miner (eLWM) [17]. eLWM is a web-based application whose objective is to allow instructors to analyse the performance of a course hosted on an e-Learning platform. More specifically, at the moment eLWM offers instructors answers to three different queries: (1) what kinds of resources are frequently used together (e.g., forum, mail) in each learning session; (2) what are the profiles of the different sessions carried out by students; and (3) what are the profiles of the students enrolled in a course. In this case, the main limitation is that the set of queries is fixed and they cannot be refined. For instance, if we wanted to compute the profiles of assignments that are failed, or the profiles of students that do not pass the course, we would need to modify the application. By developing DSLs for data analysis, we expect to overcome these shortcomings. The next sections describe how this task is accomplished for the educational domain.

4 Grammar Specification

As previously mentioned, we follow the process proposed by [9] for the development of the DSL. According to this process, the first step in building a DSL is to specify its grammar. The next subsections describe how this step is accomplished. The definition of a grammar for a DSL, following a model-driven perspective, implies the definition of an abstract syntax and a concrete syntax. The abstract syntax specifies the grammar of a language independently of how this grammar is represented. The concrete syntax specifies a specific rendering, either textual or visual, of the abstract syntax. We describe both elements below.

4.1 Abstract Syntax

Abstract syntaxes are usually specified using metamodels [11]. A metamodel can be considered a model of the syntax of a language. For the construction of this metamodel, we have used Ecore [15], which is the de facto standard language for metamodeling. Figure 1 shows the metamodel for our DSL syntax. According to this metamodel, our language allows us to write queries. A Query has a QueryClause. In the figure, two query clauses are depicted: ShowProfile and FindReasonsFor. A query clause can be viewed as a command that hides a data mining technique. Moreover, each query has an associated DataSet, which must be available in a well-defined location. A data source can also have an associated filter. A filter is a boolean expression that selects the subset of instances of a data source that satisfy that expression.


Fig. 1. Abstract Syntax of our DSL
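Since the metamodel diagram itself is not reproduced in this text, the following plain-Java rendering approximates the metaclasses and relations it depicts; it is a simplification, not the Ecore-generated code.

// Simplified view of the metamodel of Figure 1 (approximation, not generated EMF code).
interface Query          { QueryClause getQueryClause(); DataSet getDataSet(); }
interface QueryClause    { }                               // abstract query clause
interface ShowProfile    extends QueryClause { }           // backed by a clustering technique
interface FindReasonsFor extends QueryClause {             // backed by classification rules
    BooleanExpression getCondition();
}
interface DataSet {
    String getName();
    BooleanExpression getFilter();                         // optional filter, may be absent
}
interface BooleanExpression { }                            // structure omitted, as in the paper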

The abstract syntax for boolean expressions is not shown in this article for the sake of simplicity and brevity, as it is probably familiar to the reader. Filters are used to apply a query clause to a specific subset of a data source. For instance, an instructor might be interested in selecting students that: (1) do not pass a course; (2) drop out; or (3) are above or below a certain age, among other options. Obviously, these filters must be written using the attributes of the data source. For instance, if students' age is not stored in the database, it cannot be used in a filter. In the case of the FindReasonsFor clause, an additional condition is required, because the goal of this query is to compute the reasons why certain instances of the data source satisfy a certain condition. As before, this condition is a boolean expression. After developing the abstract syntax of our DSL, the next step is to specify its concrete syntax, which is described in the following subsection.

4.2 Concrete Syntax

We have selected a textual syntax to define our DSL. However, this issue needs to be further investigated, as some decision makers might prefer a graphical notation. For the definition of the textual concrete syntax, we have used Xtext [6], which allows defining a textual syntax for an Ecore metamodel. With this framework, a grammar is defined following a notation similar to EBNF (Extended Backus-Naur Form), but where the production rules are enhanced with constructions that create instances of metaclasses from the metamodel as the grammar is parsed. Figure 2 shows the concrete syntax for our DSL. Lines 00-02 specify the namespace and name for the grammar; include a convenience package called


00 grammar es.unican.dslEdm.Dsl
01   with org.eclipse.xtext.common.Terminals

02 import "EdDataMiningMetamodel"
03 import "http://www.eclipse.org/emf/2002/Ecore" as ecore

04 Query returns Query:
05   queryClause=QueryClause 'of' dataSet=DataSet;

06 QueryClause returns QueryClause:
07   ShowProfile | FindReasonsFor;

08 ShowProfile returns ShowProfile:
09   'show_profile';

10 FindReasonsFor returns FindReasonsFor:
11   'find_reasons_for' condition=BooleanExpression;

12 DataSet returns DataSet:
13   name=ID ('with' filter=BooleanExpression)?;

...

Fig. 2. Textual Concrete Syntax for our DSL

Terminals, provided by Xtext; and specify that the grammar will be based on the EdDataMiningMetamodel metamodel, which is depicted in Figure 1.

Then, Lines 04-05 specify Query as the entry point of our grammar. A Query is composed of a QueryClause, followed by the of keyword and the specification of a DataSet. Both lines are equivalent to the EBNF rule Query ::= QueryClause "of" DataSet. Moreover, in Line 04, the returns Query clause specifies that an instance of the Query metaclass (see Figure 1) must be created when this production rule is executed. Furthermore, the results of executing the QueryClause and DataSet production rules must be assigned to the queryClause and dataSet attributes of the Query metaclass, respectively.

Similarly, a QueryClause can be either a ShowProfile or a FindReasonsFor (Lines 06-07). In the first case, the query clause is simply written using the show_profile keyword (Lines 08-09). In the second case, after the keyword find_reasons_for, a boolean expression that serves as condition for evaluating the query is required (Lines 10-11). Finally, a DataSet is simply represented by an identifier plus an optional filter definition (Lines 12-13). This identifier must correspond to an available dataset. This constraint is checked by means of external rules.

Once the grammar has been specified, a full editor for our grammar, with syntax colouring, helpers and automatic formatting, as well as type-checking and validation capabilities, can be automatically generated by Xtext. Using this editor, queries such as those shown in Figure 3 can be written.


00 show_profile of Students;

01 show_profile of Students with courseOutcome=fail;

02 find_reasons_for courseOutcome=fail of Students;

Fig. 3. Queries written using the DSL

Thus, instructors can now write queries to analyse course performance by using a terminology that is familiar to them. The next step is to provide execution capabilities to these queries, which is achieved by translating them into Java code.

5 Query execution

To compute the result of a query, data mining techniques are used. For instance, to identify profiles in a dataset, clustering techniques must be chosen. Therefore, the strategy to execute a query is to transform it into a Java code snippet which invokes a prebuilt implementation of the corresponding data mining algorithm. In our case, these prebuilt implementations are provided by Weka [7], a data mining tool suite.

It should be taken into account that most data mining algorithms require the specification of a set of input parameters, which are necessary for tuning the algorithm. For instance, most clustering techniques require the specification of the number of clusters to be built. Obviously, if the ultimate goal of the DSL is to abstract the end user from data mining techniques, it cannot be expected that the end user provides the values for these parameters. Therefore, these parameters have to be self-computed. Currently, there is a research area inside the data mining field, known as parameter-less data mining, that aims to build self-configuring data mining algorithms. These techniques will be used whenever possible.

To illustrate how this code generation process works, we describe how Show Profile queries are transformed into Java code. For the sake of clarity, the code generated after processing the query show_profile of Students with courseOutcome=fail; will be used as an example. This piece of code is shown in Figure 4. For Show Profile queries, the code generation process works as follows:

1. First, the dataset to be analysed is loaded, using the corresponding Weka helper classes (Figure 4 Lines 04-05). The name of the dataset is obtained from the attribute name of the Dataset metaclass. As previously indicated, the parser must check that a dataset with that name exists before processing the query.


00 package processes;
01 import weka.[...]

02 public class ClusteringSnippet {
03   public static void main(String[] args) throws Exception {

04     DataSource source = new DataSource("Students.arff");
05     Instances ins = source.getDataSet();

06     Instances insFiltered = ins;
07     Attribute attrcourseOutcome = ins.attribute("courseOutcome");
08     String valuecourseOutcome = "fail";
09     RemoveWithValues filtercourseOutcome = new RemoveWithValues();
10     filtercourseOutcome.setAttributeIndex(Integer.toString(attrcourseOutcome.index() + 1));
11     filtercourseOutcome.setNominalIndices(Integer.toString(attrcourseOutcome.indexOfValue(valuecourseOutcome) + 1));
12     filtercourseOutcome.setInputFormat(ins);
13     filtercourseOutcome.setInvertSelection(true); // matching entries
14     insFiltered = Filter.useFilter(insFiltered, filtercourseOutcome);

15     XMeans xm = new XMeans();
16     xm.setMinNumClusters(2);
17     xm.setMaxNumClusters(20);
18     xm.buildClusterer(insFiltered);
19     HelperFunctions.save(xm); } }

Fig. 4. Code generated after processing a query

2. In case the dataset has an associated filter, the code to perform this filtering is generated. Thus, the boolean expression which defines the filter must be transformed into the corresponding Weka code that filters a dataset based on the value of certain attributes. Figure 4, Lines 06-14, shows the code that is generated for filtering those students whose courseOutcome is fail.
3. Then, the code for executing the clustering algorithm on the loaded dataset is generated (Figure 4, Lines 15-18). As clustering algorithm, xMeans [12] is used. The advantage of this algorithm is that it can estimate the number of clusters that should be created for a particular dataset. xMeans only requires that this number is bound to a certain range. So, if the lower and upper bounds of this range are set to proper values, the xMeans algorithm can be used as a self-configuring algorithm. Since teachers expect to find at least two different student groups, 2 is a reasonable lower bound. For a normal course, 20 clusters is high enough to be considered an upper bound. So, the responsibility of determining the number of clusters to be created is assigned to the xMeans algorithm, which automatically calculates it.
4. Finally, the result of the xMeans algorithm is placed in an output file, which is read by a visualization tool in order to render the results in a user-friendly way.

The code generation process for the find_reasons_for queries would be similar, but a different data mining algorithm would be invoked. This code generation


process has been implemented using code generation templates. More specifically, we have used EGL (Epsilon Generation Language) [10], which is a language for generating code from Ecore-based models.

With this last step, the development of our DSL for Educational Data Mining is finished. DSLs for analyzing data in other domains can be developed following the same process. The next section discusses whether this DSL satisfies the objectives of this work and concludes this article.

6 Conclusions

This article has shown how a Domain-Specific Language for Educational Data Mining can be developed. This DSL allows teachers and instructors of courses hosted in e-learning platforms to analyze the performance of their teaching-learning processes by analyzing the data contained in such a platform. The DSL approach provides two benefits as compared to current state-of-the-art techniques.

First, the DSL abstracts low-level details of data analysis techniques, so it can be used by instructors without any knowledge of data analysis techniques, helping to bridge the gap between data analysis tools and decision makers. The DSL syntax only contains high-level keywords and references to entities and attributes of the target domain data model. Thus, the DSL uses a terminology that should be known by the decision makers, who in the educational domain are instructors and teachers.

Secondly, the DSL is flexible enough to support the elaboration of arbitrarily complex new queries. This is an advantage as compared to approaches that develop tools able to compute only concrete tasks. eLWM [17] is an example of such a tool. As commented in Section 3, using eLWM the students' profiles of a course can be computed. However, this analysis cannot be modified for computing, for example, the profiles of students above a certain age. Similarly, we cannot compute profiles of other entities, such as assignments, without requiring a modification of the tool.

Thus, the DSL offers a more flexible interaction as compared to domain-specific data mining tools. There are limitations, as it is not possible to use building blocks, that is, query clauses, that have not been previously defined. The included set of clauses should cover the feasible questions decision makers have previously formulated. In addition, the queries must be written following the syntactic rules of the grammar, as the need to parse them with a computer prevents the usage of most informal expressions present in natural language.

As future work, we expect to add more query options to the DSL for the educational domain, as well as to develop DSLs for other domains. More specifically, we are interested in developing DSLs for the performance analysis of work processes in the public administration.


References

1. Arthur, D., Vassilvitskii, S.: K-means++: The advantages of careful seeding. In: Proceedings of the 18th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA). pp. 1027–1035. New Orleans (Louisiana, USA) (January 2007)
2. Azevedo, A., Santos, M.: Binding Data Mining to Final Business Users of Business Intelligence Systems. 1st Int. Conference on Intelligent Systems and Applications (Intelli), pp. 7–12 (April-May 2012)
3. Baesens, B.: Analytics in a Big Data World: The Essential Guide to Data Science and its Application. Wiley (2014)
4. Bughin, J., Chui, M., Manyika, J.: Clouds, Big Data and Smart Assets: Ten Tech-enabled Business Trends to Watch. McKinsey Quarterly, pp. 1–14 (2010)
5. Espinosa, R., García-Saiz, D., Zorrilla, M., Zubcoff, J.J., Mazón, J.N.: Enabling Non-expert Users to Apply Data Mining for Bridging the Big Data Divide. In: Ceravolo, P., Accorsi, R., Cudre-Mauroux, P. (eds.) Data-Driven Process Discovery and Analysis, Lecture Notes in Business Information Processing, vol. 203, pp. 65–86 (2015)
6. Eysholdt, M., Behrens, H.: Xtext: Implement Your Language Faster than the Quick and Dirty Way. In: Companion to the 25th Annual Conference on Object-Oriented Programming, Systems, Languages, and Applications (SPLASH/OOPSLA). pp. 307–309. Reno/Tahoe (Nevada, USA) (October)
7. Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA Data Mining Software: An Update. SIGKDD Explorations Newsletter 11(1), 10–18 (June 2009)
8. Han, J.: Data Mining: Concepts and Techniques. Morgan Kaufmann (2005)
9. Kleppe, A.: Software Language Engineering: Creating Domain-Specific Languages Using Metamodels. Addison-Wesley Professional (2008)
10. Kolovos, D.S., Paige, R.F., Rose, L.M., Williams, J.R.: Integrated Model Management with Epsilon. In: France, R.B., Küster, J.M., Bordbar, B., Paige, R.F. (eds.) Proc. of the 7th European Conference on Modelling Foundations and Applications (ECMFA). pp. 391–392. Lecture Notes in Computer Science, Birmingham (England, United Kingdom) (June 2011)
11. Kühne, T.: Matters of (Meta-)Modeling. Software and System Modeling 5(4), 369–385 (December 2006)
12. Pelleg, D., Moore, A.: X-means: Extending K-means with Efficient Estimation of the Number of Clusters. In: Proceedings of the 17th International Conference on Machine Learning. pp. 727–734. Morgan Kaufmann (2000)
13. Romero, C., Ventura, S.: Data mining in education. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 3(1), 12–27 (2013)
14. Sierra, J.L.: Language-Driven Software Development (Invited talk). In: Pereira, M.J.V., Leal, J.P., Simões, A. (eds.) 3rd Symposium on Languages, Applications and Technologies. OpenAccess Series in Informatics (OASIcs), vol. 38, pp. 3–12 (2014)
15. Steinberg, D., Budinsky, F., Paternostro, M., Merks, E.: EMF: Eclipse Modeling Framework. Addison-Wesley Professional, 2 edn. (December 2008)
16. Wrembel, R., Koncilia, C.: Data Warehouses and OLAP: Concepts, Architectures and Solutions. IRM Press (2006)
17. Zorrilla, M., García-Saiz, D.: A service-oriented architecture to provide data mining services for non-expert data miners. Decision Support Systems 55(1), 399–411 (2013)

WSDLUD: A Metric to Measure the Understanding Degree of WSDL Descriptions

Mario Marcelo Berón1, Hernán Bernardis1, Enrique Miranda1, Daniel Riesco1, Maria João Varanda Pereira2, and Pedro Rangel Henriques3

1 Universidad Nacional de San Luis, Argentina
2 Dpt. Informática/Centro Algoritmi, IPB, Bragança, Portugal
3 Dpt. Informática/Centro Algoritmi, Universidade do Minho, Braga, Portugal

{mberon,hbernardis,driesco}@unsl.edu.ar; [email protected]; [email protected]

Abstract. In this article, WSDL Understanding Degree (WSDLUD), a metric aimed at measuring a priori the understandability of WSDL (Web Services Description Language) descriptions, is presented. In order to compute WSDLUD, all the static information available in a WSDL description is collected. This information is submitted to an evaluation process based on a method named LSP (Logic Scoring of Preference). This evaluation process outputs a Global Preference value that indicates the satisfaction level of the WSDL description regarding the evaluation focus, in this case, the understanding degree.

Keywords: WSDL; Web Services Comprehension; LSP

1 Introduction

Nowadays, Web Services (WS) are fundamental software artifacts for building service-oriented applications4. Organizations increasingly produce web services, which are used by other organizations to build new software systems aimed at solving business demands. Web services have an associated description which specifies the data types used, the operations provided, their inputs and outputs, and the technology used to accomplish the communication with other high-level and low-level software elements. These descriptions are published on the Internet, and organizations can retrieve them and decide whether some of those services are useful for building the software they need [12].

Web Services are software packages and therefore they must be comprehended for maintenance tasks (bug fixing, adaptation, evolution,

4 According to the World Wide Web Consortium (W3C, for details see http://www.w3.org/), a WS is: a software application identified by a URI, whose interfaces and bindings are capable of being defined, described, and discovered as XML artifacts. A WS supports direct interactions with other software agents using XML-based messages exchanged via Internet-based protocols.


etc.). The primary information source to accomplish this task is the respective WSDL (Web Service Description Language)5 description. Although there are several resources from which it is possible to collect information about a Web Service, the WSDL description is the first that the user employs to analyse its usefulness for his purposes. Furthermore, web service descriptions are interesting because they provide high-level abstraction data which can be very useful to simplify the understanding of the web services.

As said above, a standard language used to write web service descriptions is WSDL. This language is a dialect of XML with well-defined rules to specify each component. Being an XML-based language, it is tedious to read such a description, and therefore a tool is needed to assist the software engineer in this task. In this context, many tools can be found that are oriented to facilitate the inspection of WSDL descriptions, transform them to a different WSDL version, compute several metrics, produce user-friendly visualizations, etc. However, to the best of our knowledge, only a few are oriented to help their understanding.

Taking this into consideration, in this article WSDLUD (WSDL Understanding Degree) is presented. WSDLUD is a metric aimed at providing, a priori, a measurement of the understanding complexity of a WSDL description. For calculating WSDLUD, the Logic Scoring of Preference (LSP) method [7, 14] is used. LSP is a multi-criteria evaluation method; it requires a Criteria Tree, an Aggregation Structure and a set of Elementary Criteria Functions to be defined. By systematically combining such elements, this method produces a satisfaction level that indicates, in this case, the understanding degree of a WSDL description.

In order to apply LSP and compute WSDLUD, the WSDL description must be statically analysed and all the information available must be retrieved. This information is submitted to different evaluation procedures in order to obtain satisfaction values6. To perform these processes, both compilation and natural language processing techniques are required. The first is used to retrieve formal elements from the WSDL source code. The second is employed to gather semantic information from unstructured information sources.

The article is organized as follows. Section 2 describes the work tightly related with the research topics presented here. Section 3 defines the WSDLUD evaluation structures. Section 4 presents the case studies, where it is possible to observe the results obtained by applying WSDLUD to some test cases available from W3C. Section 5 closes the paper with some conclusions and future work.

2 Related Work

WSDL description analysis is based on static and behavioral information. Traditional approaches are oriented to computing metrics to compare and evaluate a set of program parameters [2, 13, 15].

5 http://www.w3.org/TR/wsdl20/
6 values in [0,1] (or [0,100])


Considering static information, the authors of [13] have defined metrics for organizational security. In this context, the authors claim that the easier a WSDL description is to understand, the easier it will be to carry out fraudulent actions against the organization. On account of that, the authors compute the understanding level of a WSDL description and, if it is high, they define approaches to diminish its readability. The second kind of approach, based on behavioral information, is concerned with measuring the WSDL description considering the complexity of the operations and messages involved. The more complex the operations and messages are, the more complex it will be to understand the WSDL description [9]. It is also possible to find works that use ad-hoc approaches. They are based on traditional object-oriented metrics to measure quality attributes of WSDL descriptions [5, 6].

The WSDLUD metric, defined in this article, differs from those found in the literature in several aspects. First, all formal elements of the WSDL description (types, port types, bindings, services) are considered, and for each one of them the understanding degree is measured. Second, the understanding of a WSDL description can be made easier if the informal information (that provided by the identifiers and documentation) gives useful semantic information about the description's domain. For this reason, several metrics to measure the quality of the identifiers and documentation of the description are defined and calculated. Third, the value of our metric is produced by the combination of other metrics (those mentioned before) which consider both formal and informal information. We used these metrics to measure WSDL descriptions and obtain a final value for each of them. This final value is computed by using a multi-criteria method. This method is parameterizable, allowing the engineer's experience to be reflected in the evaluation mechanism. Finally, as a side effect, the process used to compute WSDLUD can also be used: i) to provide a ranking of WSDL descriptions by understandability, and ii) to build chart-based visualizations that allow analysing the results and discovering possibilities to improve the understanding of a WSDL description.

To finish this section, it is important to notice that, to the best of our knowledge, a metric with the characteristics mentioned above has not been described in the literature. So, we believe that the work reported here is a valid contribution to the comprehension of WSDL specifications.

3 WSDLUD

In this section, all the concepts and processes involved in the definition and measurement of WSDLUD are described in detail.


3.1 WSDL Description Criteria Tree

The criteria tree of a WSDL description7 is composed of the following characteristics: i) Type Understanding Degree, ii) Message Understanding Degree, iii) Port Type Understanding Degree, iv) Binding Understanding Degree and v) Service Understanding Degree. Each characteristic has an associated sub-criteria tree which takes into consideration the specific properties of the evaluated element. In the next paragraphs, each of the characteristics mentioned above is developed and its Criteria Tree is explained.

Type Understanding Degree. This characteristic is composed of the following attributes: Number of Primitive Types, Number of Complex Types, Documentation Quality, Type Name Quality and Number of Fields. Clearly, a primitive type8 will be easier to understand than a complex type9. A primitive type can be deduced from its identifier and the explanations provided by the language manual. A complex type is more difficult to grasp because it is composed of several identifiers, each open to many analyses, and the explanations in the language manual are not enough. In this context, if the documentation provided is poor or missing, comprehension will be even more difficult.

Message Understanding Degree. This characteristic can be evaluated taking into consideration the following attributes: Message Documentation Quality, Message Name Quality and Part Understanding Degree. Concerning the first two elements, they provide relevant information when some semantic information can be extracted from them. For the third, the following components of a part are considered: name, element name and type; the sub-characteristic Part Understanding Degree can thus be divided into the attributes Part Name Quality, Part Element Name Quality and Part Type Understanding Degree. All these attributes must also be considered when the message understandability needs to be measured.

Port Type Understanding Degree. This characteristic has the following attributes: Port Type Name Quality, Port Type Documentation Quality and Port Operation Understanding Degree. The first two are important because they provide semantic information when they are well defined. Semantic information can also be extracted from the port operations by measuring the Port Type Operation Understanding Degree. The definition of this characteristic follows the same approach as the message parts; in other words, for each simple operation element we consider some

7 These characteristics were extracted from a WSDL specification provided by W3C
8 A Primitive Type is a type provided by the language, for example: text, integer, real, boolean, etc.
9 A Complex Type is a type defined by the user


attributes like name, documentation, parameters, etc. (more details about the disaggregation of this sub-characteristic can be found in [3]).

Binding Understanding Degree. This characteristic is composed of the following attributes: Binding Name Quality, Binding Documentation Quality, Binding Type Understanding Degree and Binding Operation Understanding Degree. Once more, the name quality and the documentation quality are important characteristics, measured through the attributes Binding Name Quality and Binding Documentation Quality. The other two attributes are already defined in other characteristics: Binding Type Understanding Degree is defined in Type Understanding Degree, and Binding Operation Understanding Degree is defined in Port Type Understanding Degree. For this reason, during the evaluation process we re-use the values obtained in previous computations.

Service Understanding Degree. A service is made available by a WSDL description. A service has a name and documentation, and it is composed of ports. For analysing the Service Understanding Degree, it is necessary to measure Service Name Quality, Service Documentation Quality and Service Port Understanding Degree in the service context.

3.2 Aggregation Structure

As the LSP method states [14], the satisfaction values that result from applying the Elementary Criteria Functions to the measurable attributes must be aggregated in order to obtain the Global Preference. This Global Preference represents the satisfaction of the object under evaluation. As seen in Subsection 3.1, we propose a Criteria Tree for each WSDL element (type, message, port, etc.). For each of these Criteria Trees, we developed a specific Aggregation Structure. To illustrate the approach and to save space, Fig. 1 only shows the Aggregation Structure for the characteristic Message Understanding Degree.

We used a partial absorption LSP function (composed of the operators A and SQU10) to aggregate Message Documentation Quality and Message Name Quality. This kind of asymmetric compound operator is used when some input values could be zero (non-mandatory input). It is necessary because, in many cases, messages do not have good documentation (sometimes none at all). A medium conjunctive operator (CA) is used to compute the Message Understanding Degree Global Preference. This kind of operator is employed when the input requirements are mandatory: if one of the input values is zero, the operation result will be zero. The weights express the relative importance of each input preference. As the message documentation and name provide more significant semantic information, their combined weight is 70%, as opposed to Part Understanding Degree, which provides less semantic information (its weight is 30%).
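The paper does not spell out the operator definitions, but in the LSP literature [8, 14] the generalized conjunction/disjunction operators are commonly realized as weighted power means, which may help to read Fig. 1 (this is background from the LSP literature, not part of the authors' contribution):

\[
  P(x_1,\dots,x_n) \;=\; \Big(\sum_{i=1}^{n} w_i\, x_i^{\,r}\Big)^{1/r},
  \qquad \sum_{i=1}^{n} w_i = 1,
\]

where the exponent r sets the andness of the operator: r = 1 gives the arithmetic mean (A), r = 2 the square mean (SQU), and partial conjunctions such as CA use smaller (negative) exponents, which is why a zero mandatory input drags the aggregated result to zero.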

10 A is the arithmetic mean and SQU the square mean. All the LSP operators are better explained in [8]


Fig. 1. Message Understanding Degree Aggregation Structure.

3.3 Information Extraction Techniques and Elementary Criteria Functions

The information extraction techniques and the Elementary Criteria Functions are the most important features of the evaluation process described next. The former obtain the information and perform all the analyses needed to get each attribute value of the Criteria Tree. The latter map each of these values into a satisfaction level, i.e., a value in the interval [0,1] (or [0,100]). This value represents the satisfaction degree of the attribute for the object under evaluation, according to the sensibility and experience of the authors.

Information Extraction Techniques. The approach used to extract information from a WSDL description combines compilation techniques, natural language processing algorithms and strategies to compute indicators [4]. The first are implemented using DOM (Document Object Model), an XML parser which explicitly builds an internal representation of the analysed XML source code. Several traversals are applied to this internal representation to gather the desired information. The identifiers and the documentation are extracted by using compilation techniques. In order to retrieve semantic information, IdA (Identifier Analysis) [1] is used. IdA is a tool that applies algorithms to divide, expand and find a meaning for the identifiers of a program. Finally, with the goal of providing a measure of the understanding degree of a WSDL description, NESSy [11] was used. NESSy is a tool to evaluate software based on the LSP method.

For attributes like Type Name Quality (see in Algorithm 1 the computation of the satisfaction level of the Type Name Quality criterion), Message Name Quality or Binding Name Quality, we use identifier analysis techniques. The purpose of this analysis is to discover the relation between the names and the concepts of the problem domain. The name quality is higher when its related words are meaningful. The result of these techniques is a percentage which indicates the satisfaction level for a particular name quality.

For attributes like Type Documentation Quality, Message Documentation Quality, Binding Documentation Quality, etc., we use documentation analysis techniques. The main goal of this kind of attribute is to measure the usefulness of the information provided by the element's documentation (IdA also is


used to carry out this task). The analysis techniques gather the documentation and return a percentage which represents the satisfaction level for the attribute under study. First, the documentation is split into words; then the irrelevant words are deleted. The next step consists of analysing each word and counting those that have a useful meaning. The result is obtained by carrying out the following computation: Number of Words with Meaning / Number of Words.

Algorithm 1: Satisfaction Level of the Type Name Quality Criterion
  input : typeName, a string which represents a type name.
  output: Satisfaction Level, a percentage that indicates the satisfaction level of the criterion Type Name Quality.
  Data: wordSet, stopWords: sets of words.
  Data: pal: a string which represents a word extracted from the type name.
  Data: wordsWithMeaning: an integer variable which counts the number of words extracted from the type name which have a meaning.

  wordSet ← division(typeName);
  stopWords ← extractStopWords(wordSet);
  wordSet ← wordSet − stopWords;
  wordsWithMeaning ← 0;
  foreach w in wordSet do
      pal ← expand(w);
      if hasMean(pal) then
          wordsWithMeaning ← wordsWithMeaning + 1;
      end
  end
  return wordsWithMeaning / |wordSet|;
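As an illustration only, the following Java sketch mirrors Algorithm 1. The helper methods (splitIdentifier, isStopWord, expand, hasMeaning) are hypothetical stand-ins for the splitting, expansion and dictionary lookup performed by IdA; this is not the authors' implementation.

  import java.util.*;

  /** Minimal sketch of the Type Name Quality criterion (assumed helpers, not the IdA implementation). */
  public class NameQuality {

      /** Returns the satisfaction level in [0,100] for a type name. */
      public static double typeNameQuality(String typeName) {
          List<String> words = splitIdentifier(typeName);   // e.g. "courseOutcome" -> [course, outcome]
          words.removeIf(NameQuality::isStopWord);          // drop irrelevant words
          if (words.isEmpty()) {
              return 0.0;
          }
          int wordsWithMeaning = 0;
          for (String w : words) {
              String expanded = expand(w);                  // e.g. "msg" -> "message"
              if (hasMeaning(expanded)) {                   // dictionary lookup
                  wordsWithMeaning++;
              }
          }
          return 100.0 * wordsWithMeaning / words.size();
      }

      // --- Hypothetical helpers; real versions would rely on IdA and a dictionary ---
      private static List<String> splitIdentifier(String id) {
          return new ArrayList<>(Arrays.asList(id.split("(?<=[a-z])(?=[A-Z])|_")));
      }
      private static boolean isStopWord(String w) { return w.length() <= 1; }
      private static String expand(String w) { return w.toLowerCase(); }
      private static boolean hasMeaning(String w) { return w.length() > 2; } // placeholder check
  }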

Elementary Criteria Functions. In this evaluation process, the majority of the Elementary Criteria Functions are direct mappings, since most of the attribute values are computed by the extraction techniques. They take as input the strings to be analysed and return a percentage value that can be directly mapped to a satisfaction value.

4 Case Study

This section presents the evaluation of five WSDL descriptions using LSP and the structures defined in Section 3 [10]. All descriptions belong to web services frequently used by information systems: i) Google Web APIs11 provides operations to perform Google searches; ii) Create Queue (Amazon)12 offers a reliable, highly scalable hosted queue for storing messages as they travel between computers; iii) Airport13 provides useful information about all world airports (e.g. airport codes, names, countries, country

11 https://code.google.com/p/dic/downloads/detail?name=GoogleSearch.wsdl
12 http://queue.amazonaws.com/doc/2009-02-01/QueueService.wsdl
13 http://www.webservicex.com/airport.asmx?wsdl


codes, latitude, longitude, etc.); iv) Global Weather14 gets weather reports for all major cities around the world; and v) OFAC15 aids banks in meeting the requirements of the US Treasury Department's Office of Foreign Assets Control (OFAC).

High-Level Characteristic   Google    Weather   Amazon    Airport   OFAC
Types U. D.                 60,2665   71,5131   68,8148   72,2303   40,5846
Messages U. D.              69,1173   83,3624   79,753    77,4924   58,8801
Port Types U. D.            75,7194   81,4166   82,1289   81,8902   45,3519
Bindings U. D.              75,5258   79,3457   82,2505   79,5241   42,755
Services U. D.              78,9946   79,6724   89,4138   79,7011   42,0794
Final Scores                71,5594   77,0112   80,1496   78,0884   45,4495

Table 1. Partial and global evaluation of WSDL

Table 1 shows the global understanding degree for each WSDL description. Each Global Preference was computed by aggregating all the characteristic preferences with the logical operator CA (this function simulates simultaneity), with the weight equally distributed among the characteristics (20% for each one). The choice of this operator is due to the fact that all WSDL components (type, message, port type, etc.) must be understandable: if one of them is incomprehensible, the whole WSDL will be difficult to understand.

As can be seen in Table 1, almost all WSDL descriptions are very similar in terms of understanding degree, except for the OFAC WSDL description. This is because that description has numerous identifiers with acronyms, which decreases the satisfaction levels. Weather and Airport define each type using a few primitive and complex types. Furthermore, they use explicit and unambiguous identifiers. On the other hand, Google uses a number of primitive and complex types that exceeds the established thresholds. The majority of the message parts of the Weather WSDL use primitive types, and this fact raises its Messages Understanding Degree satisfaction value. In general, the Amazon WSDL presents more documentation than the others in different parts, like messages, types, port types and services. This makes it the most understandable WSDL of the case study.

From another point of view, this set of metrics was proposed to measure each component individually inside a WSDL description. In this sense, we could compare, for example, all elements of a kind that a WSDL contains (e.g. types, messages or services) in order to analyse them individually. This could be useful for maintainability or re-structuring purposes.

14 http://wsf.cdyne.com/WeatherWS/Weather.asmx?WSDL
15 http://www.webservicex.net/OFACSDN.asmx?WSDL


In this context, we measured the quality of three different messages of the Create Queue (Amazon) WSDL description; the results can be seen in Table 2.

Sub-characteristic   SendMessageResponse   RemovePermissionRequest   DeleteMessageResponse
M. Doc. Quality      0                     0                         0
M. Name Quality      100                   100                       100
M. Parts U. D.       60,9759               93,6933                   73,1726
Final Scores         73,17                 83,5379                   77,6729

Table 2. Messages individual measurement of Create Queue (Amazon) WSDL description.

As can be seen, the RemovePermissionRequest message is the most understandable of these three messages and SendMessageResponse the least. This is basically due to the Message Part Understanding Degree satisfaction values. This is a comparative analysis that allows identifying the most critical parts of the description. If we wanted to analyse the results individually, we would say that a score of less than 50% marks a description as a candidate for improvement.

5 Conclusion

In this article, WSDLUD, a metric to measure the understanding degree of WSDL descriptions, was defined. In order to compute WSDLUD, other metrics were also specified. Their main goal is to provide an estimation of the understanding degree of each description part. Each part is associated with an importance level specified by the engineer. Both values (understanding degree and importance level) are used by LSP (a multi-criteria evaluation method) to produce a global value which represents the desired WSDL description understanding degree.

We believe that our approach is novel because it makes it possible to analyse each part of a particular WSDL description as well as the global understanding degree. More importantly, the engineer's experience can be included in the evaluation process in order to get more significant results. All the detailed information provided by our system can be used to identify the most critical parts of the description and the chances for quality improvement. In some cases, the description can be simplified or made more readable. In other cases, the complexity of the description is fully dependent on the domain complexity and there is no chance for improvement.

As future work we intend to: i) improve the Criteria Tree (CT) and Aggregation Structure (AS); ii) extend the work presented in this paper to WSDL 2.0; and iii) apply a similar analysis to study business processes specified with BPEL (Business Process Execution Language).


References

1. Azcurra, J., Berón, M., Montejano, G., Farnese, A., Henriques, P., Pereira, M.: AId: Uma Ferramenta para Análise de Identificadores de Programas Java. In: Congreso Nacional de Ingeniería Informática/Sistemas de Información, 2014. pp. 880–892 (Noviembre 2014)
2. Bernardis, H., Berón, M., Riesco, D., Henriques, P.R.: Extracción de información y cálculo de métricas en WSDL 1.1 y 2.0. In: Congreso Nacional de Ingeniería Informática/Sistemas de Información. pp. 963–974 (Noviembre 2014)
3. Berón, M., Henriques, P.R., Riesco, D., Pereira, M.J.V.: On the Comprehension of WSBPEL Programs. Tech. rep., Universidad Nacional de San Luis - Universidade do Minho (2015)
4. Carvalho, N.R.: An Ontology Toolkit for Problem Domain Concept Location in Program Comprehension. Ph.D. thesis, Escola de Engenharia, Universidade do Minho (2014)
5. Coscia, L.O., Crasso, M., Mateos, C., Zunino, A.: Estimating Web Service interface quality through conventional object-oriented metrics. CLEI Electronic Journal 16(1) (2013), http://www.clei.org/cleiej/paper.php?id=258
6. Coscia, L.O., Mateos, C., Crasso, M., Zunino, A.: Refactoring code-first Web Services for early avoiding WSDL anti-patterns: Approach and comprehensive assessment. Sci. Comput. Program. 89, 374–407 (2014), http://dx.doi.org/10.1016/j.scico.2014.03.015
7. Dujmovic, J.: Continuous Preference Logic for System Evaluation. IEEE Transactions on Fuzzy Systems 15(6), 1082–1099 (Dec 2007)
8. Dujmovic, J.: Characteristic forms of generalized conjunction/disjunction. In: Fuzzy Systems, 2008. FUZZ-IEEE 2008 (IEEE World Congress on Computational Intelligence). IEEE International Conference on. pp. 1075–1080. IEEE (2008)
9. Kumar, R., Indraveni, K., Goel, A.K.: Automation of detection of security vulnerabilities in Web Services using dynamic analysis. In: 9th Int. Conf. on Internet Technology and Secured Transactions (ICITST). pp. 334–336 (Dec 2014)
10. Liu, L., Sun, T., Fang, W., Liu, N.: Usability evaluation of the subway train dispatching system. In: Information Science and Technology (ICIST), 2011 International Conference on. pp. 1123–1128 (March 2011)
11. Miranda, E., Berón, M., Montejano, G., Pereira, M.J.V., Henriques, P.R.: NESSy: a New Evaluator for Software Development Tools. In: 2nd Symposium on Languages, Applications and Technologies, SLATE 2013, June 20-21, 2013 - Porto, Portugal. pp. 21–37 (2013), http://dx.doi.org/10.4230/OASIcs.SLATE.2013.21
12. Newcomer, E.: Understanding Web Services: XML, WSDL, SOAP, and UDDI. Addison-Wesley Professional (2002)
13. Sripairojthikoon, P., Senivongse, T.: Concept-based readability measurement and adjustment for web services descriptions. In: 16th Int. Conf. on Advanced Communication Technology (ICACT). pp. 378–388 (Feb 2014)
14. Su, S., Dujmovic, J., Batory, D.S., Navathe, S.B., Elnicki, R.: A Cost-benefit Decision Model: Analysis, Comparison and Selection of Data Management. ACM Trans. Database Syst. 12(3), 472–520 (Sep 1987), http://doi.acm.org/10.1145/27629.33403
15. Tibermacine, O., Tibermacine, C., Cherif, F.: A Practical Approach to the Measurement of Similarity between WSDL-based Web Services. RNTI: Revue des Nouvelles Technologies de l'Information, Special Issue CAL 2013 (RNTI-L-7), 03–18 (2014)

Towards the generation of graphical modelling environments aided by patterns

Antonio Garmendia, Ana Pescador, Esther Guerra, and Juan de Lara

Universidad Autónoma de Madrid (Spain)

Abstract. Model-Driven Engineering (MDE) promotes the use of models to conduct all phases of software development in an automated way. Such models are described using Domain Specific Modelling Languages (DSMLs). While the definition of DSMLs and their supporting environments are recurring activities in MDE, they are mostly developed ad-hoc from scratch. This paper proposes the use of patterns to describe the abstract and concrete graphical syntax of DSMLs, and to automate the generation of a graphical modelling environment for them.

Keywords: Model-Driven Engineering, Domain Specific Languages, Patterns, Graphical Modelling Environments

1 Introduction

Model-Driven Engineering (MDE) promotes a model-centric approach to software development. While models can be described using general-purpose modelling languages, like UML, the use of Domain Specific Modelling Languages (DSMLs) focussed on the particularities of a domain is frequent [5]. Hence, the creation of DSMLs is recurrent in MDE, for which one needs to describe their abstract and concrete syntax and their semantics, and to develop a suitable modelling environment for them. Although there are software frameworks to ease the development of textual and graphical environments [5–7], the creation of DSMLs is mostly an ad-hoc process lacking the ability to build on existing knowledge coming from the construction of similar DSMLs.

To simplify the creation of DSMLs, we propose their assisted construction by means of patterns. In particular, domain patterns describe recurring concepts common to a domain, and concrete syntax patterns gather standard representation options for DSMLs and enable the synthesis of modelling environments. As a proof of concept, we show a prototype implementation for Eclipse.

2 Overview

The design of a DSML encompasses several aspects, including abstract syntax, concrete syntax, and semantics. In addition, editing DSML models is usually performed using a dedicated environment providing services like model persistence, conformance checking, and other more advanced ones. We propose the use of


Fig. 1: Domain pattern (left). Graph-based concrete syntax pattern (right).

patterns to address all these aspects, to facilitate and speed up their definition. For lack of space, we focus on patterns dealing with the abstract and concrete syntax, as well as the generation of modelling environments from them.

To deal with the abstract syntax, we propose domain patterns, gathering typical requirements of similar languages within a domain and documenting their variability. There may be patterns for workflow languages, arithmetical/logical expressions, variants of state machines, query languages, and component-based architectural languages, among others. A DSML may use several domain patterns, customized for a given need, and extended with other domain-specific concepts. These patterns may help to build a DSML more quickly and trustworthily, in a constructive way. As an example, Fig. 1 (left) shows a simplified domain pattern for state machines. Pattern elements have a cardinality, which governs how many times they can be instantiated (1 if no cardinality is specified). For instance, any application of the state-machine pattern should have one SimpleState, while it may lack InitialState and FinalState.

On the other side, concrete syntax patterns characterize families of similar representations [1], like textual, graphical, tabular or form-based. In the case of a graphical syntax, aspects like layouting or zooming may be configured. Moreover, concrete syntax patterns can be used to automate the generation of editors supporting the defined syntax (which otherwise should be implemented by hand), and can be attached to domain patterns in order to define different default visualization options for them. As an example, Fig. 1 (right) shows a simplified pattern for graph-based representations. This pattern permits assigning graphical elements (Circles, Rectangles, etc.) to elements in the DSML meta-model.

Altogether, in order to define DSMLs, we propose a reutilization-based, pattern-centric approach, which we have implemented in our prototype tool DSL-tao (http://jdelara.github.io/DSL-tao/). DSL-tao enables the construction of meta-models, where some meta-model parts can be defined through the application of existing patterns in a repository. Basic pattern application is performed in three steps. First, a pattern is selected and a wizard guides the designer in its application (see window 1 in Fig. 2 for the wizard of the state-machine

SLATE’2015 Towards the generation of graphical modelling environments aided by patterns 57


Fig. 2: The wizard for pattern application (1). Applied pattern (2).

pattern). In this step, variants and attached patterns can also be selected (see next section). Then, the designer can bind meta-model elements to pattern elements. Finally, the unbound pattern elements are automatically created in the meta-model, annotated with their participant role in the pattern (window 2).

The next section presents two ways to describe and generate graphical environments for DSMLs using patterns.

3 Defining graphical DSMLs through patterns

We propose two ways to describe the graphical syntax of a DSML. In the first one, domain patterns have attached a default visualization, which the DSML designer just reuses. This option profits from commonly agreed means to represent domain patterns (e.g., state machines, or component-based systems). In the second option, a dedicated wizard is used to apply a graphical pattern over the elements of the DSML meta-model. This approach is to be used when the DSML needs a non-predefined, or special, syntax.

Using the visual syntax attached to domain patterns. Domain patterns may have attached concrete syntaxes, accounting for typical representations of the domain concepts. For instance, Fig. 2 shows the application of the state-machine pattern. The pattern has three concrete syntax patterns attached: one for the standard graph-based representation, another for its representation as tables, and another using forms. Designers can select one of them. In this way, when the domain pattern is applied, the concrete syntax pattern will be automatically instantiated as well. Thus, this approach permits predefining a set of concrete visualizations, which can be reused "as is" by DSML designers.

Using the dedicated custom wizard. Sometimes, the designer requires fine-grained control of the concrete syntax for the DSML, or he has not used domain patterns with an attached concrete syntax. In such cases, the designer can still use a concrete syntax pattern to automate the generation of a modelling environment, for which he needs to map meta-model elements to concrete representations in the selected pattern. Since the application of concrete syntax patterns has many specificities (like selecting figures for nodes and decorations for edges), patterns may provide dedicated wizards for their application. For instance, the graph-based concrete syntax pattern has a


Fig. 3: Dedicated wizard for assigning a graph-based concrete syntax.

customized wizard that implements heuristics to decide which classes will be represented as nodes, which ones as edges, the attributes to display, and the nodes that are containers of other nodes. Then, the designer can refine the inferred concrete syntax and fine-tune the visual representation for nodes and edges. The left of Fig. 3 shows the wizard to customize the following heuristics:

– Root strategies: These are alternatives to select the root class to be used in diagrams. The root class is usually a class that contains all elements of the model, directly or indirectly. The strategy Contains more classes counts how many classes each class contains, and selects the one that contains most (a sketch of this strategy is given after this list). The strategy Class with no parents suggests classes that are not contained in other classes. Both strategies are based on the tree of containment references defined in the meta-model. The last strategy (Modularity pattern) selects as root the meta-model classes annotated as Unit by a modularity pattern [4] (not shown in this paper) that allows organizing models in a modular way.
– Label selection: These heuristics are used to decide the data that node-like classes will display close to the node representation. The strategy First string attribute displays the first string attribute of the class, and Identifier of the class its identifier. The strategy Parameter string attribute receives several input strings, and selects the attribute whose name contains some of them.
– Arc strategies: They are used to select edge-like classes. In this case, we select the classes that define two non-containment references with lower bound 0 or 1 and upper bound 1. These two references will be mapped to the source and target of the edge representation for the class. While the first strategy (Simple direction arc strategy) selects the source and target references randomly, the second one (Parameter direction arc strategy) takes into account possible naming conventions (e.g., source or src for the source reference).
– Link selection strategies: These strategies identify the references that will be displayed graphically as edges, and those that will be represented as a container for the elements that conform to the type of the reference. If the strategy Containment references as links is selected, all containment references will be represented as links, while if the selected strategy is Containment references as compartments, they will be shown as containers.
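As an illustration of the first root strategy, the following Java sketch counts, for each class of an Ecore meta-model, how many classes it reaches through containment references and proposes the one reaching most as root. It is a minimal sketch built on the standard EMF API under simplifying assumptions (no tie-breaking, single package), not DSL-tao's actual implementation.

  import java.util.*;
  import org.eclipse.emf.ecore.EClass;
  import org.eclipse.emf.ecore.EClassifier;
  import org.eclipse.emf.ecore.EPackage;
  import org.eclipse.emf.ecore.EReference;

  /** Sketch of the "Contains more classes" root strategy (simplified, illustrative only). */
  public class RootStrategy {

      /** Proposes as root the EClass that (transitively) contains the largest number of classes. */
      public static EClass proposeRoot(EPackage pkg) {
          EClass best = null;
          int bestCount = -1;
          for (EClassifier c : pkg.getEClassifiers()) {
              if (c instanceof EClass) {
                  int count = containedClasses((EClass) c, new HashSet<>()).size();
                  if (count > bestCount) {
                      bestCount = count;
                      best = (EClass) c;
                  }
              }
          }
          return best;
      }

      /** Collects the classes reachable from 'from' through containment references. */
      private static Set<EClass> containedClasses(EClass from, Set<EClass> visited) {
          for (EReference ref : from.getEAllContainments()) {
              EClass target = ref.getEReferenceType();
              if (visited.add(target)) {
                  containedClasses(target, visited);
              }
          }
          return visited;
      }
  }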


The wizard uses the heuristics to infer the optimal concrete representation of meta-model elements, which is proposed to the designer in a second step (see right of Fig. 3). Then, the designer is allowed to modify the inferred syntax, as well as fine-tune the concrete visualization for nodes and edges, to customize the decorations for the start and end of edges, the types of figures for nodes, and their size and colour. This last step is not shown in the figure. Finally, although we have presented the wizard for the graph-based concrete syntax pattern, the same idea could be used to implement further strategies for this or other concrete syntax patterns. Currently, we support tabular and form-based representations, in addition to graph-based ones.

The generated graphical environment. The modelling environment for a DSML can be synthesized from its meta-model. For this purpose, DSL-tao invokes the code generators of the services associated to the applied patterns. For graphical concrete syntax patterns, the generator creates an Eclipse plugin that uses the Sirius graphical framework [6] as backend. Thus, once the meta-model is annotated with the concrete syntax pattern, a Sirius .odesign model is generated. This model describes the shapes for nodes, the style for edges, the mappings of graphical elements to meta-model elements, the elements in the palette, and the actions to be performed when palette elements are invoked. Technically, this model is created using a model transformation. Then, the Sirius model is packaged in a plugin, which is contributed to the modelling environment of the DSML.

Fig. 4: Generated graphical modelling environment.

Fig. 4 shows the generated graphical environment for the meta-model shown to the right of Fig. 2, which was created by instantiating the default concrete syntax pattern attached to the domain pattern for state machines.

4 Related work

There are many tools to develop graphical modelling environments for different applications, like meta-CASE tools [5], diagram sketching [2] or multi-formalism modelling and simulation [3]. The advent of Eclipse has promoted frameworks to


construct visual editors as plugins, like GMF1, Eugenia2, Spray3, Graphiti4, or Sirius [6]. All these tools are model-based, except Graphiti, which provides a Java API for coding. Some generate artefacts for other lower-level approaches, like Eugenia, which is built atop GMF, and Spray, atop Graphiti. In our case, DSL-tao produces graphical editors based on Sirius. All frameworks use code generation except Sirius, which is interpreted. The way of specifying the concrete syntax varies: Eugenia requires annotating the meta-model elements, Spray uses a textual DSL, GMF and Sirius require building models that describe the concrete syntax, and Graphiti requires programming. Our approach is closer to Eugenia, as our pattern applications result in meta-model annotations. However, our domain patterns can attach concrete syntax styles, which speeds up the generation of graphical environments. This feature is unique among the mentioned tools.

5 Conclusions and future work

We have presented a pattern-based approach to the development of graphical DSMLs. The approach is supported by a tool which permits applying patterns from a repository and the automatic generation of a modelling environment. We are currently working on defining new patterns, and on developing further services for graphical environments, like support for layers and abstractions.

Acknowledgements. Work supported by the Spanish Ministry of Economy and Competitiveness (TIN2011-24139 and TIN2014-52129-R), the R&D programme of the Madrid Region (S2013/ICE-3006), and the EU commission (FP7-ICT-2013-10, #611125).

References

1. P. Bottoni and A. Grau. A suite of metamodels as a basis for a classification of visual languages. In VL/HCC, pages 83–90, 2004.
2. F. Brieler and M. Minas. A model-based recognition engine for sketched diagrams. J. Vis. Lang. Comput., 21(2):81–97, 2010.
3. J. de Lara and H. Vangheluwe. AToM3: A tool for multi-formalism and meta-modelling. In FASE, volume 2306 of LNCS, pages 174–188. Springer, 2002.
4. A. Garmendia, E. Guerra, D. S. Kolovos, and J. de Lara. EMF splitter: A structured approach to EMF modularity. In XM@MoDELS, volume 1239 of CEUR, pages 22–31. CEUR-WS.org, 2014.
5. S. Kelly and J. Tolvanen. Domain-specific modeling - enabling full code generation. Wiley, 2008.
6. Sirius. https://eclipse.org/sirius/.
7. Xtext. http://www.eclipse.org/Xtext/.

1 https://wiki.eclipse.org/Graphical_Modeling_Framework
2 http://eclipse.org/epsilon/doc/eugenia/
3 https://code.google.com/a/eclipselabs.org/p/spray/
4 http://eclipse.org/graphiti/

Speech Features for Discriminating Stress Using Branch and Bound Wrapper Search

Mariana Julião1, Jorge Silva1, Ana Aguiar1, Helena Moniz2,3, and Fernando Batista2,4

1 Instituto de Telecomunicações, Rua Dr. Roberto Frias, s/n, Porto 4200-465, Portugal. http://www.it.pt
2 INESC-ID, Lisboa, Portugal
3 FLUL/CLUL, Universidade de Lisboa, Portugal
4 ISCTE - Instituto Universitário de Lisboa, Portugal
[email protected], [email protected], [email protected], [email protected], [email protected]

Abstract. Stress detection from speech is a less explored field than Automatic Emotion Recognition and it is still not clear which features are better stress discriminants. VOCE aims at doing speech classification as stressed or not-stressed in real-time, using acoustic-prosodic features only. We therefore look for the best discriminating feature subsets from a set of 6285 features: 6125 features extracted with the openSMILE toolkit and 160 Teager Energy Operator (TEO) features. We use a mutual information filter and a branch and bound wrapper heuristic with an SVM classifier to perform feature selection. Since many feature sets are selected, we analyse them in terms of chosen features and classifier performance, concerning also true positive and false positive rates. The results show that the best feature types for our application case are Audio Spectral, MFCC, PCM and TEO. We reached results as high as 70.36% for generalisation accuracy.

Keywords: Stress, emotion recognition, ecological data, feature sets, feature selection.

1 Introduction

The motivations for detecting stress from speech range from it being a non-intrusive way to detect stress, to ranking emergency calls [6], or improving speech recognition systems, since it is known that environmentally induced stress leads to failures in speech recognition systems [11]. Public speaking is said to be "the most common adult phobia" [13], showing the relevance of a tool to improve public speaking. In VOCE3, we target developing such a tool, by developing algorithms to identify emotional stress from live speech. In particular, the VOCE corpus comes mainly from public speaking events that occur within an academic context, like presentations of coursework or research seminars. The envisioned coaching application requires detecting emotional stress in live speech in near real time, to give the user timely feedback, which requires adapting the computational costs to the limited memory and computational resources available. Reducing the number of features used for classification reduces the amount of data to collect, the number of features to be extracted and the complexity of the classifier, leading to a reduction in the memory and computational resources used. Additionally, feature selection can increase the classifier's accuracy [10]. Thus, in this paper, we focus on identifying such reduced feature sets based on their performance as stress discriminators.

The Fundamental Frequency, F0, is the most consensual feature for stress discrimination [25, 16, 7, 12], but several metrics for energy and formant changes have been proposed, often represented by Mel-Frequency Cepstral Coefficients (MFCCs) [25, 15, 6]. Frequency and amplitude perturbations (Jitter and Shimmer) and other measures of voice quality, like the Noise to Harmonics Ratio and the Subharmonics to Harmonics Ratio [22, 20], have also been used. Teager Energy Operator-based features have also been shown to perform well on speech under stress [25, 26].

In this work, we start from the union of two feature sets: the group of features extracted using the openSMILE toolkit [19], and the group of TEO-based features, detailed in Sect. 3.2. We filter these feature sets with Mutual Information and then use a branch-and-bound wrapper to explore the space of possible feature sets. Finally, we analyse the best feature sets chosen on the various branches for the most frequently chosen feature categories.

3 http://paginas.fe.up.pt/~voce

2 Speech Corpus and Data Annotation

The VOCE corpus [2] currently consists of 38 raw recordings from volunteers aged 19 to 49. Data is recorded in an ecological environment, concretely during academic presentations4. Speech was automatically segmented into utterances, according to a process described in [5]. Annotation into stressed or neutral classes was performed per speaker, based on the mean heart rate [4]. Utterances in the third quartile of mean heart rates for that speaker are annotated as stressed, while the remaining ones are annotated as neutral. Using an ecologically collected corpus imposes an unavoidable trade-off between the quality of the recording and the spontaneity of the speaker. Higher quality of the recording not only allows more reliable feature extraction in general, but also impacts the performance of the segmentation algorithms we use to split the speech into sentence-like units (utterances) and to do text transcription, necessary for the extraction of TEO features. For these reasons, we chose only 21 raw recordings for this work.

4Please refer to [3] for details on the collection methodology.


Table 1. Dataset demographic data. PSE: Public Speaking Experience, rated from 1 (little experience) to 5 (large experience).

Train Set                          Test Set
Age  Gender  PSE  #Utts            Age  Gender  PSE  #Utts
26   male    2    56               24   male    3    97
22   male    2    39               19   male    2    61
24   male    3    36               19   male    3    86
21   male    3    38               19   female  3    64
22   male    3    32               23   female  4    71
22   male    3    25               19   female  3    63
25   male    2    54
19   male    3    12
21   male    3    22
21   female  3    51
24   female  5    27
22   female  2    37
21   female  3    32
21   female  3    18
19   female  3    28

For these speakers, 1457 valid utterances were obtained5. The set of utterances is divided into 15 speakers (507 utterances) for training and 6 speakers (442 utterances) for testing. Since the number of stressed utterances corresponds to approximately 1/4 of the total, we randomly down-sampled the train data in order to balance the two classes, which led to the mentioned 507 utterances. During feature selection, the classifier was trained on 354 utterances and tested on 153 utterances. These utterances belonged to the train set. Table 1 characterises the dataset concerning age, gender, public speaking experience, and the number of utterances considered6. We performed outlier detection on each feature using the Hampel identifier [14] with t = 10. The outliers were then replaced by the mean value of the feature excluding outliers, and feature values were scaled to the interval [0;1].
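The outlier-handling step can be summarised in a short sketch. The following is a minimal illustration, assuming the features are stored in a NumPy matrix with one row per utterance; the function name, the 1.4826 MAD scale factor of the standard Hampel identifier and the column-wise layout are our assumptions, not details given in the paper.

```python
import numpy as np

def hampel_clean_and_scale(X, t=10.0):
    """Per-feature Hampel outlier replacement followed by [0, 1] scaling.

    X: array of shape (n_utterances, n_features). A value is treated as an
    outlier when its distance to the feature median exceeds t * 1.4826 * MAD;
    outliers are replaced by the mean of the remaining values of the feature.
    """
    X = np.asarray(X, dtype=float).copy()
    for j in range(X.shape[1]):
        col = X[:, j]
        med = np.median(col)
        mad = np.median(np.abs(col - med))
        outliers = np.abs(col - med) > t * 1.4826 * mad
        if outliers.any() and not outliers.all():
            col[outliers] = col[~outliers].mean()
        lo, hi = col.min(), col.max()
        # Scale the cleaned feature to [0, 1] (constant features become 0).
        X[:, j] = (col - lo) / (hi - lo) if hi > lo else 0.0
    return X
```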

3 Methodology

Figures 1(a) and 1(b) illustrate the workflow for speech segmentation and feature selection, respectively.

5 Remaining utterances after discarding 94 utterances with length of less than 1 second or more than 25 seconds.
6 Please note that the stated number of utterances on the train set corresponds to the one actually used after discarding a part of the neutral utterances, and not to the number of utterances in the natural set.


(a) Speech segmentation process.

(b) Feature selection process.

Fig. 1. Workflow

3.1 Acoustic-prosodic features

The set of Functional Features is "obtained by applying statistic functionals to the low-level descriptors (LLD) computed over the segment" [9, 19], and provides a total of 6125 utterance level features [19]. The LLD are "a set of 128 frame level features extracted each 10 ms from the signal" [9]. These features and their extraction processes are described in [8] and [18]. The openSMILE toolkit is capable of extracting a very wide range of acoustic-prosodic features and has been applied with success in a number of paralinguistic classification tasks [17]. It has been used in the scope of this study to extract a feature vector containing 6125 speech features, by applying segment-level statistics (means, moments, distances) over a set of energy, spectral and voicing related frame-level features.

3.2 Teager Energy Operator features

We extracted two TEO-based features, the Normalized TEO autocorrelation envelope and the Critical band based TEO autocorrelation envelope, as in [25]. The literature in which these features are presented performs the feature extraction on small voiced segments usually called "tokens" [25]. To work equivalently, we performed phone recognition with the delimitation of each phone [1] and used only voiced sounds. These correspond to the phones represented by the Portuguese SAMPA symbols ‘i’, ‘e’, ‘E’, ‘a’, ‘6’, ‘O’, ‘o’, ‘u’, ‘@’, ‘i∼’, ‘e∼’, ‘6∼’, ‘o∼’, ‘u∼’, ‘aw’, ‘aj’, ‘6∼j∼’, ‘v’, ‘z’, ‘Z’, ‘b’, ‘d’, ‘g’, ‘m’, ‘n’, ‘J’, ‘r’, ‘R’, ‘l’, ‘L’ [23, Chap. IV.B].

These features are extracted per frame. The length of each frame is about 10 ms, depending on the feature to extract. Each phone usually contains many frames and each utterance normally has many phones. Therefore, since we want values per utterance, we take each feature extracted over all phones and apply statistics to it. These statistics are: mean, standard deviation, skewness, kurtosis, first quartile, median, third quartile, and inter-quartile range. This process is also illustrated in Fig. 1(a). The first two columns in Table 2 summarise the feature types considered in this work7.
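Since these eight statistics fully define how frame-level TEO values are collapsed into utterance-level features, a brief sketch may help. It assumes the per-frame values of the voiced phones of one utterance have already been pooled into a single array; the function name and the SciPy-based implementation are illustrative only.

```python
import numpy as np
from scipy import stats

def utterance_statistics(frame_values):
    """Collapse the per-frame values of one TEO feature, pooled over the
    voiced phones of an utterance, into the eight statistics listed above."""
    x = np.asarray(frame_values, dtype=float)
    q1, median, q3 = np.percentile(x, [25, 50, 75])
    return {
        "mean": x.mean(),
        "std": x.std(ddof=1),
        "skewness": stats.skew(x),
        "kurtosis": stats.kurtosis(x),
        "q1": q1,
        "median": median,
        "q3": q3,
        "iqr": q3 - q1,
    }
```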

Table 2. Feature types: Id, Name, number of features of each type selected by the MI filter, and number of features of each type chosen by the best sets T.A.1, T.A.2, G.A., Se., Sp., and Comb. (* type never selected by the best sets; † type always selected by the best sets).

Id  Type                         MI   T.A.1  T.A.2  G.A.  Se.  Sp.  Comb.
1   F0final*                     10       0      0     0    0    0      0
2   TEO†                         17       3      2     1    3    2      3
3   audSpec Rfilt†              187       8      8     6    7    6      6
4   audspec*                      6       0      0     0    0    0      0
5   audspecRasta*                 4       0      0     0    0    0      0
6   jitterDDP                     6       0      0     1    0    0      0
7   jitterLocal*                  7       0      0     0    0    0      0
8   logHNR                        8       1      0     0    0    1      1
9   mfcc sma†                   119       7      8     4    2    3      4
10  pcm Mag fband                17       0      0     1    2    1      0
11  pcm Mag harmonicity          14       1      0     0    0    0      0
12  pcm Mag psySharpness†         6       3      2     1    1    1      2
13  pcm Mag spectralEntropy       6       0      1     0    0    0      1
14  pcm Mag spectralFlux          8       0      1     0    0    0      0
15  pcm Mag spectralKurtosis*     7       0      0     0    0    0      0
16  pcm Mag spectralRollOff      22       1      0     0    0    0      0
17  pcm Mag spectralSkewness*     1       0      0     0    0    0      0
18  pcm Mag spectralSlope         6       0      1     1    0    0      0
19  pcm Mag spectralVariance*    10       0      0     0    0    0      0
20  pcm RMSenergy                 6       1      1     1    1    1      0
21  pcm zcr                       8       0      0     0    1    0      0
22  shimmerLocal*                 8       0      0     0    0    0      0
23  voicingFinalUnclipped*        4       0      0     0    0    0      0

4 Searching for the Best Feature Sets

As already stated, we apply one filter to reduce the dimensionality from the initial 6285 features (the openSMILE functionals plus the TEO features) before applying the wrapper with a Support Vector Machine (SVM) classifier with a radial basis function kernel and C=100⁸, using the Python library scikit-learn.

7 The generic designation "type" is the result of aggregating Low Level Descriptor features with their derived functionals (e.g., quartiles, percentiles, means, maxima, minima). This procedure is, in our perspective, a way to better group and interpret the performance of the features.
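As a minimal illustration of this classification setup, the snippet below trains the RBF-kernel SVM with C=100 on one candidate feature subset; the helper name, the data layout and the use of scikit-learn defaults for the remaining hyper-parameters are our assumptions.

```python
from sklearn.svm import SVC

def evaluate_subset(X_train, y_train, X_dev, y_dev, feature_idx):
    """Train the RBF-kernel SVM (C=100) on one candidate feature subset and
    return its accuracy on the held-out split used during feature selection."""
    clf = SVC(kernel="rbf", C=100)   # remaining hyper-parameters: library defaults
    clf.fit(X_train[:, feature_idx], y_train)
    return clf.score(X_dev[:, feature_idx], y_dev)
```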

4.1 Filter: Mutual Information

There are several metrics and algorithms to compute the relevance of features in a dataset, and the choice of this metric may hugely impact the final subset of features. However, since there is a lack of a priori knowledge about the adequacy of filter metrics to specific datasets [24], we based our choice on the work of Sun and Li [21], which showed good classification results for Mutual Information (MI), a metric that measures the mutual dependence between two random variables. Since MI is based on the probability distribution of discrete variables and our features have continuous values, we had to define a binning. We (1) defined five binning possibilities: 50, 100, 250, 500 or 1000 bins; (2) computed MI for each feature and each binning possibility; (3) kept the features whose MI value belonged to the highest quartile for all binning options. Their distribution per feature type corresponds to the third column in Table 2.
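A possible reading of this filtering step is sketched below; the helper name, the use of scikit-learn's mutual_info_score and the equal-width binning via NumPy are assumptions, since the paper does not state which MI implementation was used.

```python
import numpy as np
from sklearn.metrics import mutual_info_score

def mi_filter(X, y, bin_counts=(50, 100, 250, 500, 1000)):
    """Keep the features whose MI with the class label lies in the top
    quartile for every binning option, mirroring steps (1)-(3) above."""
    n_features = X.shape[1]
    keep = np.ones(n_features, dtype=bool)
    for bins in bin_counts:
        mi = np.empty(n_features)
        for j in range(n_features):
            # Discretise the continuous feature before computing MI.
            edges = np.histogram_bin_edges(X[:, j], bins=bins)
            mi[j] = mutual_info_score(y, np.digitize(X[:, j], edges))
        keep &= mi >= np.percentile(mi, 75)
    return np.where(keep)[0]
```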

4.2 Wrapper

We designed a branch and bound wrapper to search the space of feature sets obtained from the MI filter for the combination of features that delivers the best classifier performance. This wrapper starts by searching all combinations of sets of up to 10 features, keeping all that are within 1.5% accuracy of the best solution found so far. Larger feature sets are obtained by expanding the previously kept solutions with blocks of features not yet in the sets. Every time a feature subset is tested with a classification algorithm, a score is produced, which in this case is the accuracy. Subsets are kept and expanded if the expansion improves the previous accuracy. This search runs until the work list of feature sets with new combinations empties. This wrapper provides a better exploration of the feature set space than traditional forward and backward wrappers. Since the search space for our wrapper is much bigger than for most wrapper methods, we used parallel programming techniques to improve the throughput of the algorithm, using Python's multiprocessing package.
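The following is a simplified sketch of such an expanding wrapper, under several assumptions: seeds of three features instead of up to ten, no parallelism, and a generic score_fn callback (e.g., the SVM helper sketched earlier); it is meant to convey the keep-and-expand strategy, not to reproduce the authors' implementation.

```python
from itertools import combinations

def wrapper_search(features, score_fn, seed_size=3, tolerance=0.015):
    """Simplified keep-and-expand wrapper: score all small seed subsets, keep
    those within `tolerance` of the best, then grow kept subsets with unused
    features for as long as the expansion improves their accuracy."""
    kept = {s: score_fn(s) for s in combinations(features, seed_size)}
    best = max(kept.values())
    kept = {s: acc for s, acc in kept.items() if acc >= best - tolerance}
    work = list(kept)

    while work:
        subset = work.pop()
        for f in features:
            if f in subset:
                continue
            expanded = tuple(sorted(subset + (f,)))
            if expanded in kept:
                continue
            acc = score_fn(expanded)
            if acc > kept[subset]:        # keep only improving expansions
                kept[expanded] = acc
                work.append(expanded)
    return kept
```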

5 Results and Discussion

The mutual information filter selected 487 features, distributed into types as described in the third column of Table 2. After choosing the best 280 feature sets with training accuracies below 85% from 20 processors, we looked at their distribution by feature types, which is shown in Fig. 2.

8This value was found empirically to produce the best classification results.


Among these 280 feature sets we looked for the ones having the best scores in each of the considered metrics9: Train Accuracy, Generalisation Accuracy, Sensitivity (Se), Specificity (Sp)10, and a Combined Metric defined as

CombinedMetric = (Se + Sp) / 2 − |Sp − Se|    (1)

The need for this metric follows from the fact that our goal is not only to have a good generalisation accuracy, but also to have high sensitivity and high specificity at the same time. This is relevant since, as we have an imbalanced test set, with many more neutral utterances than stressed utterances, it can happen that high generalisation results are due to high values of true positives, while true negatives are neglected, which is the kind of scenario we want to avoid. In Table 3, each line corresponds to the best feature subset for which the metric specified in the first column was found to be maximum. The last two lines correspond to baseline results, meaning the classification for the whole set of features and for the set of MI-filtered features. Columns T.A.1, T.A.2, G.A., Se., Sp., and Comb. in Table 2 correspond to the best feature sets according to each of these metrics, as shown in Table 3. Each of these columns in Table 2 gives the number of features of each type (each line corresponds to a feature type).

Table 3. Metrics for the Best Subsets as percentage

Set          Train Acc.  Gen. Acc.  Sens.   Spec.   Comb.   # features
Train Acc.   84.97       61.76      59.81   62.39   58.53   25
Train Acc.   84.97       62.22      52.34   65.37   45.82   24
Gen. Acc.    81.70       70.36      33.64   82.09    9.42   16
Sensitivity  81.70       59.28      71.96   55.22   46.85   17
Specificity  81.05       70.14      31.78   82.39    6.47   15
Combined     81.70       64.03      61.68   64.78   60.14   17
Complete     —           63.12      50.47   67.16   42.13   6285
MI           —           60.86      45.79   65.67   35.85   487

Table 3 conveys the following information:

– The sets with the best train accuracy do not correspond to the ones with the best generalisation accuracy. Actually, they have the second worst generalisation results among these sets.

9 Generalisation Accuracy, Sensitivity and Specificity are computed on the test set.
10 With TP the number of True Positives, TN the number of True Negatives, FP the number of False Positives and FN the number of False Negatives, Sensitivity = TP / (TP + FN) and Specificity = TN / (TN + FP).


– The set with the best generalisation accuracy, as well as the set with the best specificity, although having very good generalisation accuracies, have very low sensitivities. This is the kind of imbalance we want to avoid.
– The same train accuracy can correspond to sets of very different quality. We see that for a train accuracy of 81.70% we have the best generalisation accuracy, the best sensitivity and the best combined metric. Looking at the other columns in the table, we see that only the line for the Combined Metric has acceptable results in both sensitivity and specificity (a short sketch of these computations is given after this list).
– These best reduced sets often achieve better results than both the complete set and the filtered set, while being much smaller, which is very good for the envisioned real-time public speaking coaching application.
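For concreteness, the sketch below computes sensitivity, specificity and the Combined Metric of Eq. (1) from confusion-matrix counts; the example counts are invented to mimic a high-specificity, low-sensitivity subset and are not the paper's actual confusion matrix.

```python
def stress_metrics(tp, tn, fp, fn):
    """Sensitivity, specificity and the Combined Metric of Eq. (1),
    computed from test-set confusion-matrix counts."""
    se = tp / (tp + fn)
    sp = tn / (tn + fp)
    combined = (se + sp) / 2 - abs(sp - se)
    return se, sp, combined

# Invented counts mimicking a high-specificity, low-sensitivity subset:
# the combined score is heavily penalised despite a good overall accuracy.
print(stress_metrics(tp=36, tn=275, fp=60, fn=71))
```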

Fig. 2. Heatmap of feature type frequencies on each subset (x-axis: the 280 vector subsets; y-axis: the 23 feature types of Table 2; the colour scale gives the number of features of each type in the subset).

The features selected by the Mutual Information filter are, grosso modo, the ones reported in the literature for other languages (e.g., [12, 26]). Those features encompass pitch information, mostly final movements of pitch, audio spectral differences, voice quality features (jitter, shimmer, and harmonics-to-noise ratio) and TEO features, the latter usually described as very robust across genders and languages. As for PCMs and MFCCs, these features are very transversal in speech processing tasks and highly informative for a wide range of tasks, so it is not surprising that they are informative for stress detection as well. The features selected by the Mutual Information filter give us a more complete characterization of stress predictors. From this set, the ones that are systematically chosen in the best feature sets by the wrapper are mostly TEO, MFCCs and audio spectral differences. TEO and MFCC features are also reported by [26], for English and Mandarin, as the most informative ones, even more than pitch itself.


6 Conclusions

We have used a corpus of ecologically collected speech to search for the best speech features that discriminate stress. Starting from 6125 features extracted with the openSMILE toolkit and 160 Teager Energy features, we used a mutual information filter to obtain a reduced subset for stress detection. Next, we searched for the best feature set using a branch and bound wrapper with SVM classifiers. Our results provide further evidence that the features resulting from the Mutual Information filtering process are robust for stress detection tasks, independently of the language, and highlight the importance of voice quality features for stress prediction, mostly high jitter and shimmer and low harmonics-to-noise ratio, parameters typically associated with creaky voice. Our best result compares well with related work.

Acknowledgments. This work was supported by national funds through Fundação para a Ciência e Tecnologia (FCT) by project VOCE (Voice Coach for Reduced Stress) PTDC/EEA-ELC/121018/2010, UID/CEC/50021/2013, and Post-doc grant SFRH/PBD/95849/2013.

References

1. Abad, A., Astudillo, R.F., Trancoso, I.: The L2F spoken web search system for MediaEval 2013. In: Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, Barcelona, Spain, October 18-19, 2013 (2013)
2. Aguiar, A., Kaiseler, M., Meinedo, H., Almeida, P., Cunha, M., Silva, J.: VOCE corpus: Ecologically collected speech annotated with physiological and psychological stress assessments. In: Calzolari, N. (Conference Chair), Choukri, K., Declerck, T., Loftsson, H., Maegaard, B., Mariani, J., Moreno, A., Odijk, J., Piperidis, S. (eds.) Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14). European Language Resources Association (ELRA), Reykjavik, Iceland (May 2014)
3. Aguiar, A.C., Kaiseler, M., Meinedo, H., Abrudan, T.E., Almeida, P.R.: Speech stress assessment using physiological and psychological measures. In: Mattern, F., Santini, S., Canny, J.F., Langheinrich, M., Rekimoto, J. (eds.) UbiComp (Adjunct Publication). pp. 921–930. ACM (2013)
4. Allen, M.T., Boquet, A.J., Shelley, K.S.: Cluster analyses of cardiovascular responsivity to three laboratory stressors. Psychosomatic Medicine 53(3), 272–288 (1991)
5. Batista, F., Moniz, H., Trancoso, I., Mamede, N.J.: Bilingual experiments on automatic recovery of capitalization and punctuation of automatic speech transcripts. IEEE Transactions on Audio, Speech, and Language Processing 20(2), 474–485 (2012)
6. Demenko, G.: Voice stress extraction. Proceedings of the Speech Prosody 2008 Conference (2008)
7. Demenko, G., Jastrzebska, M.: Analysis of voice stress in call centers conversations. Proc. of Speech Prosody, 6th International Conference, Shanghai, China (2012)
8. Eyben, F., Wöllmer, M., Schuller, B.: openSMILE: the Munich versatile and fast open-source audio feature extractor. In: Bimbo, A.D., Chang, S.F., Smeulders, A.W.M. (eds.) ACM Multimedia. pp. 1459–1462. ACM (2010)
9. Ferreira, J., Meinedo, H.: VOCE project stress feature survey technical report 2. Tech. rep., L2F, INESC-ID, Lisboa, Portugal (November 2013)
10. Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. Journal of Machine Learning Research 3, 1157–1182 (2003)
11. Hansen, J.H., Bou-Ghazale, S.E., Sarikaya, R., Pellom, B.: Getting started with the SUSAS: A speech under simulated and actual stress database. Technical Report: RSPL-98-10 (1998)
12. Hansen, J.H., Patil, S.A.: Speech under stress: Analysis, modeling and recognition (2007)
13. Miller, T.C., Stone, D.N.: Public speaking apprehension (PSA), motivation, and affect among accounting majors: A proof-of-concept intervention. Issues in Accounting Education 24(3), 265–298 (2009)
14. Pearson, R.K. (ed.): Exploring Data in Engineering, the Sciences, and Medicine. Oxford University Press (2011)
15. Sarikaya, R., Gowdy, J.N.: Subband based classification of speech under stress. In: ICASSP. pp. 569–572 (1998)
16. Scherer, K.R., Grandjean, D., Johnstone, T., Klasmeyer, G., Bänziger, T.: Acoustic correlates of task load and stress. In: Hansen, J.H.L., Pellom, B.L. (eds.) INTERSPEECH. ISCA (2002)
17. Schuller, B., Steidl, S., Batliner, A., Burkhardt, F., Devillers, L., Müller, C., Narayanan, S.: Paralinguistics in speech and language: state-of-the-art and the challenge. Comput. Speech Lang. 27(1), 4–39 (Jan 2013)
18. Schuller, B., Batliner, A., Seppi, D., Steidl, S., Vogt, T., Wagner, J., Devillers, L., Vidrascu, L., Amir, N., Kessous, L., Aharonson, V.: The relevance of feature type for the automatic classification of emotional user states: low level descriptors and functionals. In: INTERSPEECH. pp. 2253–2256. ISCA (2007)
19. Schuller, B., Steidl, S., Batliner, A., Nöth, E., Vinciarelli, A., Burkhardt, F., van Son, R., Weninger, F., Eyben, F., Bocklet, T., Mohammadi, G., Weiss, B.: The Interspeech 2012 speaker trait challenge. In: INTERSPEECH. ISCA (2012)
20. Sun, X.: A pitch determination algorithm based on subharmonic-to-harmonic ratio. In: the 6th International Conference of Spoken Language Processing. pp. 676–679 (2000)
21. Sun, Z., Li, Z.: Data intensive parallel feature selection method study. 2014 International Joint Conference on Neural Networks (IJCNN) pp. 2256–2262 (Jul 2014)
22. Vogt, T., André, E., Wagner, J.: Automatic recognition of emotions from speech: a review of the literature and recommendations for practical realisation. In: LNCS 4868. pp. 75–91 (2008)
23. Wells, J.: Handbook of Standards and Resources for Spoken Language Systems. Mouton de Gruyter (1997)
24. Wolpert, D.H.: The Lack of A Priori Distinctions Between Learning Algorithms. Neural Computation 8(7), 1341–1390 (Oct 1996)
25. Zhou, G., Hansen, J., Kaiser, J.: Nonlinear feature based classification of speech under stress. Speech and Audio Processing, IEEE Transactions on 9 (2001)
26. Zuo, X., Fung, P.N.: A cross gender and cross lingual study on acoustic features for stress recognition in speech. In: Proceedings 17th International Congress of Phonetic Sciences (ICPhS XVII), Hong Kong. pp. 2336–2339 (2011)

Oriya Morphological Analyzer Using Lttoolbox

Itisree Jena, Himani Chaudhry, and Dipti Misra Sharma

International Institute of Information Technology, Hyderabad, India
{itisree,himani}@research.iiit.ac.in, [email protected]

Abstract. A morphological analyzer is an essential tool for many NLP applications. Developing a fully-fledged morphological analyzer (MA) tool for an agglutinative language like Oriya is a challenging task. This paper deals with the development of a MA for Oriya, a resource poor language. The MA is being developed using the paradigm approach. It consists of various paradigms under which nouns, pronouns, adjectives, verbs and indeclinables are classified. The paradigms have been created for inflected forms using an XML based morphological dictionary from the Lttoolbox package. At present, a total of 10,840 words have been entered into the dictionary. In the course of the paper we talk about the design and implementation of the MA. We also talk about the issues and limitations experienced in developing it using Lttoolbox.

Keywords: Oriya Morphological Analyzer; Apertium; Lttoolbox; Compound Verb; Paradigm approach; Morphological Analyzer

1 Introduction

Efforts have been under way for many years to develop NLP tools such as Morphological Analyzers (MA), Part-of-Speech (POS) taggers, spell checkers and so on for Indian languages (ILs), to assist tasks such as Machine Translation. These efforts have especially focused on computational morphology, since ILs are morphologically quite rich. Developing an Oriya MA is therefore important to help build these tools for the language. This work presents the design and development of a MA for Oriya. The official language of the state of Odisha (Orissa), Oriya, now officially pronounced 'Odia', belongs to the eastern branch of the Indo-Aryan sub-family of the Indo-European language family. It has the status of the sixth classical language in India, and around 31 million people speak it. "Oriya is a syntactically head-final and morphologically agglutinative language" [15]; thus, quite some information is contained in morphological structures in Oriya.

Nouns in Oriya are generally characterized by inflectional categories like number, gender and case, and also take articles and number classifiers. The definite articles '-ti' and '-taa' occur only with singular nouns. Its plural markers include '-maane', '-gudaa', '-gudika' and '-gudaaka'. The plural marker '-maane' is only added to animate nouns, e.g. 'pua+maane' (son+s). It cannot be added to human proper nouns or to inanimate nouns; thus, we cannot say 'kaatha+maane' (wood+s). Oriya has natural gender that is not reflected in agreement with grammatical categories like verbs, for example baagha (tiger) and baaghuNi (tigress). Examples are represented in roman transliteration. In Oriya, "adjectives which precede the nouns in attributive position do not show any agreement with the nouns except in a few cases where the adjective agrees with the noun in gender" [12]. For example, kaLaa baLada (black bull), kaaLi gaaii (black cow). Oriya finite verbs are marked for person, number, tense, aspect and mood. They agree with subject nouns, and this is reflected by an agreement marker attached to the end of the main verb. For example:

aame khaa-il-u
we eat-pst-agr.
'We ate'

1.1 Related Work

Various methods have been adopted for morphological analysis in natural language processing. The Brute Force method, the Root Driven approach and the Affix Stripping method are some of the methods evolved typically for the analysis of ILs. MAs developed using the paradigm approach include the Hindi MA by Bharati et al. (1995) [3] and the Marathi MA by Bapat et al. (2010) [2]; the latter combines a paradigm-based inflectional system with finite state machines for modeling the morphotactics. The Marathi derivational MA by Vaidya et al. (2009) [16], the Tamil MA by Parameshwari (2011) [13] and the Bengali MA by Faridee et al. (2009) [7] all adopt the paradigm approach using Lttoolbox to develop their MA, which is similar to our work, discussed later in this paper. Further, Oriya MAs have been developed by Shabadi (2003) [15] and Sahoo (2003) [14] using deterministic Finite State Automata (FSA), where the FSA recognize whether an input string of morphemes is an appropriate Oriya word or not. They do this by plugging each form into the FSA, using two-level morphology. These works propose a model which can provide lexical, morphological and syntactic information for each lexical unit in the analyzed word form. The second approach followed for Oriya is our work using Lttoolbox from the Apertium toolkit, which we reported in Jena et al. (2011) [8].

2 Current Work

2.1 Approach

We have adopted the paradigm based approach to create a MA for Oriya. Paradigms are employed to represent the inflectional regularities of lexical units in a language [6]. A paradigm is a set of related word forms which follow the same set of spelling rules and take the same kind of affixes. “Paradigm approach is well suited for agglutinative language nature” [1]. Oriya being an agglutinative language, the paradigm approach seems to work well for it.


2.2 Resources Used

Lexical Resources. The foremost requisite for a MA is a root word dictionary. But, Oriya being a resource poor language, an online root word dictionary wasn’t available for it. We, thus, manually created the dictionary using the following resources:

– ‘Taruna Sabdakosha’ [9], an Oriya dictionary.
– ‘A Synchronic Grammar of Oriya’ [12].
– A corpus of 2,720,400 words from the Central Institute of Indian Languages, Mysore (CIIL), our major resource for the database of the dictionary and also for the training and testing data for our MA.

The root word dictionary was created using the lexical resources mentioned above. Initially, we used a frequency based list from the CIIL Oriya corpus and added root words to it from the ‘Taruna Sabdakosha’, to enhance it. Currently the dictionary contains 10,840 root words, details of which are:

1. Nouns - 5,031
2. Pronouns - 18
3. Adjectives - 930
4. Verbs - 4,537
5. Adverbs - 179
6. Postpositions - 40
7. Conjunctions - 36
8. Clitics - 15
9. Particles - 14
10. Interjections - 40

Tool. We used the Lttoolbox [11] package from the Apertium [6] toolkit to develop the Oriya MA. Lttoolbox is a well known NLP package used to build tools like morphological analyzers and morphological generators. It is free software, released under the terms of the GNU General Public License, and it uses an XML-based format to represent linguistic data. Paradigms are created inside it using some of the elements of its morphological dictionary. Further, a morphological dictionary can be used for both a morphological analyzer and a morphological generator, depending on the direction in which it is read by the system.

2.3 Data Development for Oriya Morph Analyzer

Oriya Morphological Dictionary in Lttoolbox. The Oriya morphological dictionary consists of declension and conjugation patterns of words in the XML format used in Lttoolbox. The dictionary has four sections, of which the two main ones are the paradigm definition section and the dictionary section; the alphabet and symbol definition sections are the other two. The declensions and conjugations used are based on parameters such as gender, number, person, case and vibhakti (case marker) for nouns and pronouns, and gender, number, person and the suffix string taken as TAM (tense, aspect and modality) for verbs [3].


Classification of Paradigms. Paradigms have been created for the open class categories like nouns, verbs and adjectives and, later on, for closed class categories like postpositions and conjunctions. The words that have identical grammatical information make up one paradigm class. However, all words with similar endings/suffixes may not follow the same paradigm. For instance, the two verbs 'khaa' (eat) and 'gaa' (sing) fall in the same paradigm as they take similar inflections, but the verb 'jaa' (go) falls in a different paradigm though it has the same ending. This is because the verb 'jaa' (go) changes its root form when it takes the past tense inflection: 'jaa' (go) becomes 'gali' (go+past), whereas 'gaa' (sing) becomes 'gaaili' (sing+past). There are some parts of speech like adverbs, conjunctions, postpositions, clitics etc. that remain uninflected, so we have listed them directly in our dictionary. Currently the morphological dictionary contains 14 paradigms for nouns, 9 paradigms for pronouns, 13 paradigms for adjectives and 13 paradigms for verbs.
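To make the paradigm idea concrete, here is a toy sketch (in Python, purely for illustration; the real system uses Lttoolbox XML). The suffix strings, feature label and paradigm names are invented; only the behaviour that 'khaa' and 'gaa' share a paradigm while 'jaa' inflects on an altered stem follows the text.

```python
# Invented suffixes and names; only the grouping behaviour follows the text:
# 'khaa' and 'gaa' share a paradigm, while 'jaa' inflects on an altered stem.
PARADIGMS = {
    "khaa-like": {"past": "ili"},
    "jaa-like": {"past": "ali"},
}

LEXICON = {                     # lemma -> (stem used for inflection, paradigm)
    "khaa": ("khaa", "khaa-like"),
    "gaa": ("gaa", "khaa-like"),
    "jaa": ("g", "jaa-like"),
}

def inflect(lemma, feature):
    stem, paradigm = LEXICON[lemma]
    return stem + PARADIGMS[paradigm][feature]

print(inflect("gaa", "past"), inflect("jaa", "past"))   # gaaili gali
```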

3 Evaluation and Result

We conducted three experiments to evaluate our MA, which we discuss shortly. Since a MA produces more than one answer, we found it appropriate to carry out a more detailed evaluation of the MA than just computing precision and recall values, since "Precision-Recall gives general overall impression about the performance of a system" [10]. A more detailed evaluation is necessary to know what kinds of words are over-analyzed, which are under-analyzed, and so on. This is discussed in detail in subsections 3.3 and 3.4.

3.1 Evaluation I

Here we focus on the overall coverage of our MA (Table 1). A corpus of 11,368 words (non-unique) was taken (section 2.2) in order to evaluate the overall coverage of the morph in a random test data environment.

Table 1. Results: The overall coverage.

Total no. of words    11,368
Recognized words       8,303
Unrecognized words     3,065
Coverage              73.03 %

It must be noted that the coverage here is based on a small dictionary size of 10,840 root words. The class of recognized words includes the cases where the tool gave an analysis (irrespective of whether the analysis was correct, partially correct or wrong), while the class of unrecognized words comprises those cases where the morph analyzer did not give an output or analysis.


3.2 Results and Error Analysis

In Table 1 we see that 3065 words remained unrecognized by our MA, which forms 26.97%. These words can be easily accounted for (Table 2 shows the breakup of the unanalyzed words). Out of this 26.97%, out of vocabulary (OOV) words (which include foreign words, proper nouns and numerals) form 29.81% and noise (meaningless characters/words occurring in the corpus) takes up 6.62%. The remaining words fall into causative verbs (2.34%) and 'others' (61.20%). Since causative verbs are currently not being handled, these remain unanalyzed. 'Others' in Table 2 are Oriya words that remain unanalyzed because they have yet to be entered in the morphological dictionary; these form a major part of the unrecognized words. Therefore, the two major categories that affect the coverage of the MA are OOV & noise (36.43%) and 'Others' (61.20%) of the 3065 unrecognized words. With a small dictionary size of 10,840 words, the MA's coverage is 73.03%, and increasing the dictionary size can further improve the coverage.

Table 2. Error Analysis.

S.No  Unanalyzed Words  Occurrences      %
1.    Numerals                  164    5.35
2.    Foreign Words             338   11.02
3.    Proper Nouns              412   13.44
4.    Causative Verbs            72    2.34
5.    Others                   1876   61.20
6.    Noise                     203    6.62
      Total                    3065   100 %

3.3 Evaluation II

When an MA produces output, the output may be of 6 possible types:

1. correct output, e.g. ABCD/ABCD.
2. some wrong output added to correct output, e.g. ABCD/ABCDE.
3. some correct output missed, e.g. ABCD/ABC.
4. some correct output missed and some wrong output added, e.g. ABCD/ABCE.
5. all incorrect output, e.g. ABCD/EFG.
6. no output, e.g. ABCD/No Output.

These six cases help us decide which aspect of morphology needs further attention for improvement. To evaluate an MA, a manually created gold standard is needed, containing all possible analyses of the words. In the above examples, 'ABCD' is the gold standard data and the rest is the machine's output. To create the gold standard data to evaluate our MA, we randomly took 1066 words from the CIIL Oriya corpus. The data was tagged using the Sanchay annotation interface (an open source platform for working on languages, with components like a text editor with customizable support for languages and encodings, annotation interfaces, etc.), in Shakti Standard Format (SSF), a highly readable representation for storing language analysis [4]. After we ran our MA on this randomly selected corpus, the Apertium-produced morphological analysis was compared with the gold standard data as the reference. Table 3 shows the results of the type-wise evaluation of the accuracy against the gold-standard corpus.
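The six output types can be assigned mechanically by comparing the gold and system analyses of each word. The sketch below assumes both are available as sets of analysis strings; this representation and the function name are ours, not part of the evaluation tooling described in the paper.

```python
def output_type(gold, system):
    """Assign one of the six evaluation types by comparing the gold-standard
    analyses of a word with the analyses produced by the MA."""
    gold, system = set(gold), set(system)
    if not system:
        return 6                  # no output
    if system == gold:
        return 1                  # fully correct
    if not system & gold:
        return 5                  # all incorrect
    if gold < system:
        return 2                  # correct output plus extra wrong analyses
    if system < gold:
        return 3                  # some correct analyses missed
    return 4                      # missed some and added some wrong ones
```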

Table 3. Results: Type wise evaluation of the accuracy against a gold-standard corpus.

S.No  Types   Gold/Output      Count  % Count
1.    type:1  ABCD/ABCD          754    70.73
2.    type:2  ABCD/ABCDE          41     3.84
3.    type:3  ABCD/ABC            25     2.34
4.    type:4  ABCD/ABCE          100     9.38
5.    type:5  ABCD/EFG             0     0
6.    type:6  ABCD/No Output     146    13.69
      Total tokens              1066    100 %

In Table 3, Type:1 gives fully correct output (70.73% of the total count), whereas Type:2, Type:3 and Type:4 give partially correct output (15.56% of the cases). Further, the coverage of the tool is 86.30%. As mentioned earlier, Type:4 consists of some correct output and some wrong output (partially correct output); we notice that Type:4 (9.38%) has the highest contribution among the cases with partially correct output, compared to the other types with partially correct output (Type:2, 3.84%, and Type:3, 2.34%). Type:6 includes the cases where the MA fails to give an output.

3.4 Evaluation III

In the third evaluation we focused on the accuracy of only two features, 'root' and 'category', instead of all of the features. This is because for some applications only these two features are taken into consideration; other feature structure values may not be important for them. Thus, evaluation III also reports the accuracy of the MA for such applications. For evaluation III we took the same data sets that were used in evaluation II. Table 4 shows the results of the type-wise evaluation of the accuracy for these two features. We see that the percentage count of Type:1 increased to 80.01% in evaluation III, whereas the percentage count of Type:4 decreased to 1.78% (a drop of 7.60%). Thus, considering only the root and category features shows an overall higher accuracy of the MA. The coverage remains the same for both evaluations.


Table 4. Results: Type wise evaluation of the accuracy for ‘root’ and ‘category’.

S.No  Types   Gold/Output      Count  % Count
1.    type:1  ABCD/ABCD          853    80.01
2.    type:2  ABCD/ABCDE          21     1.96
3.    type:3  ABCD/ABC            27     2.53
4.    type:4  ABCD/ABCE           19     1.78
5.    type:5  ABCD/EFG             0     0
6.    type:6  ABCD/No Output     146    13.69
      Total tokens              1066    100 %

4 Challenges and Limitations

Foreign Words. As seen in 3.2, foreign words remain unrecognized, and thus unanalyzed, by our MA, since they are not part of the database. The presence of foreign words in ILs is a widely occurring phenomenon, given the high degree of code switching in ILs, and they cause the coverage of the MA to go down. They are not part of the Oriya morph dictionary since they are foreign language words and cannot be included in an 'Oriya' dictionary. A possible solution would be to create a separate dictionary for them. However, though a workaround for the problem, this is not a very good option either, as it would call for capturing too many irregularities by way of the inflections they take (or do not take). Capturing these irregularities falls outside the purview of our MA, as it would entail entering all these types of inflections in the dictionary. Since their taking of inflections is a productive process, this may make the task more complex, and may also prevent generalization.

Analyzing Oriya Compound Verbs. Though simple verbs could be handled with relative ease by creating paradigms for them in Lttoolbox, handling Oriya compound verbs (CV) proved quite a challenge for us. Before we discuss the issues we came across, we briefly discuss CVs in general and Oriya CVs in particular. A compound verb consists of two verbs (v1, v2), yet acts as a single verb. One of its components is a 'secondary' verb, which carries inflections like gender, number, person, tense, aspect and modality, and the other is the 'main' verb, which carries most of the semantics of the compound and determines its arguments. The 'secondary' verbs "cannot be said to be predicating fully, though they are clearly not entirely devoid of semantic predicative power" [5]. Forming compounds is a highly productive process in ILs. In languages like Hindi and Oriya, the secondary verbs are generally a small set that form compounds with the 'main' verbs.

Structure and Behaviour of Oriya Compound Verbs. We have identified 13 'secondary' verbs in Oriya: 'jaa' (go), 'de' (give), 'ne' (take), 'pakaa' (throw), 'bas' (sit), 'paar' (can), 'aas' (come), 'pad' (fall), 'uth' (awake), 'chaal' (walk), 'saar' (finish), 'aaN' (bring) and 'he' (happen). In an Oriya CV the stem vowel '-i' attaches to the 'main' verb, which in turn is followed by a 'secondary' verb from the limited number of verb roots that occur as 'secondary' verbs. For example:

se so-i-pad-il-aa
he sleep-stemvowel-fall-pst-agr.
'He fell asleep'

The stem vowel '-i' is different from an aspectual marker, though both have the same form. The difference between them is that the aspectual marker is followed only by an auxiliary verb, while the non-aspectual marker, which is a 'stem vowel', is followed by a secondary verb [12]. An example of this is:

a. se lekh-i-ch-i
   he write-prs-aux-agr.
   'He has written'

b. aame khaa-i-de-l-u
   we eat-stemvowel-give-pst-agr.
   'we have finished eating'

Also, while in simple verbs the inflectional suffixes attach to the main verb root, in CVs the inflectional suffixes attach to the secondary verb, since Oriya is an agglutinative language and the two verbs occur together. Thus the two verbs together arrive at a derived root, and so we get two roots, a 'main' root and a 'derived' root. Since Oriya CVs are different from simple verbs in structure and behaviour, we cannot analyze them like simple verbs even though they occur as a single entity most of the time. Also, since Oriya CVs are composed of two verbs agglutinated together that arrive at a derived root, the output of our MA should give this inflectional information for each derived root, in order to capture the information about their structure and the derivation happening in them. For example, in 'khaaidelaa' (finished eating), the root khaa 'eat' is where we get the information of the action 'eat'. When the secondary verb attaches to the main verb, another root, 'khaaide', is derived. Our morph's output for the derived verb 'khaaidelaa' should thus be:

^khaaidelaa/root:khaa $

However, this may not be a feasible solution for us, since information pertaining to such output would have to be incorporated in the dictionary for each CV, making this a cumbersome task. Besides, since there would not be any scope for generalization here, this would defeat the purpose of using the paradigm approach.

Another solution would be the Apertium way: using nested paradigms to handle derivational forms, since Oriya CVs are composed of combinations of verbs from the set of Oriya (main) verbs. "The use of nested paradigms is to facilitate the processes of derivation followed by inflection." [16] Here, the paradigms of secondary verbs would be 'called' upon from within the main verbs' paradigms to arrive at their compounds. For instance, the verb 'khaaideichi' is derived from the main verb 'khaa' (eat) and the secondary verb 'de' (give) to form a compound, so the paradigm for 'de' is called from within the paradigm of the verb 'khaa'. Likewise, other secondary verbs would be 'called' from within the paradigm 'khaa' to form compounds. However, not all verbs of a paradigm class take the same secondary verbs to form compounds. There are verbs that fall under the same paradigm (since they share the same types of inflections) but form compounds with different sets of secondary verbs. For example, the verbs 'khaa' and 'gaa' are classified under the same paradigm class, but they take different secondary verbs. Thus, if we call all the secondary verbs that go with 'khaa' within the paradigm 'khaa', then, while processing, the analyzer gives a similar output for 'gaa' also, though they do not take the same secondary verbs. We say 'gaaiuthilaa' 'started singing (suddenly)' but we do not say 'khaaiuthilaa' 'started eating (suddenly)'. This thus also leads to some ungrammatical structures. It needs a mention here that, though the nested paradigm approach may work for a MA, from the perspective of generation it may lead to ungrammatical structures. Since the analyzer and generator are obtained from a single morphological dictionary (depending on the direction it is read from: left to right for the analyzer and right to left for the generator, as given by [6]), a different resolution is needed. Therefore, based on the discussion above, we conclude that using nested paradigms does not seem to be the best option for the analysis of Oriya CVs.

The third, and very simple, approach to handling Oriya CVs in our MA is to enter the derived roots of the CVs in the dictionary section of the morphological dictionary. A dictionary entry contains the root or 'lemma', its part of speech, the part that is common to all inflected forms (the 'lemma cut') and the paradigm name. We add the derived root and the lemma cut of the derived root in the place where this information is entered in the dictionary. This saves us the task and effort of preparing separate paradigms for the compound verbs. For example, the dictionary entry for the CV 'khaaidelaa' (finished eating) with the derived root 'khaaide' would use the lemma cut 'khaaid' (a hedged sketch of what such an entry could look like is given after this paragraph); the lemma cut in the entry indicates which paradigm, from among the ones defined in the <pardefs> section, the derived root belongs to. Here, the derived root 'khaaide' falls under the paradigm for the root 'de', since the CV 'khaaidelaa' takes the inflections of the secondary verb 'de'. Thus, referring to the 'de' paradigm through the <par> element saves us the effort of listing all the inflected forms of the derived root/lemma in the morphological dictionary entry. The output our MA would give for the above example is: ^khaaide/khaaide $
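The original entry is not reproduced in the text above, so the following is only a hedged sketch, in Lttoolbox's XML dictionary format, of what such an entry could look like; the paradigm name de__vblex and the exact element layout are assumptions, not the paper's actual dictionary content.

```xml
<!-- Hypothetical dictionary-section entry for the derived root 'khaaide':
     the invariant part (lemma cut) is 'khaaid', and the <par> element points
     to an assumed paradigm defined for the secondary verb 'de' in <pardefs>. -->
<e lm="khaaide">
  <i>khaaid</i>
  <par n="de__vblex"/>
</e>
```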


5 Conclusion and Future Work

We presented a paradigm-based MA for Oriya using Lttoolbox from the Apertium toolkit. Currently it handles only inflectional morphology, and nouns, pronouns, adjectives, verbs, compound verbs and indeclinables have been included in its morphological dictionary. Since the MA is currently at a preliminary stage, adding the remaining categories and increasing the dictionary size for the existing categories will improve its performance and increase its coverage. The Oriya MA can also be used in the future to build other NLP tools for Oriya, such as a part-of-speech tagger, chunker, spell checker or machine translation system, which would be useful resources for the language.

References

1. Antony, P.J. and Soman, K.P.: Computational morphology and natural language parsing for Indian languages: a literature survey. International Journal of Computer Science and Engineering Technology, pp. 136–146 (2012)
2. Bapat, M., Gune, H., and Bhattacharyya, P.: A paradigm-based finite state morphological analyzer for Marathi. In 23rd International Conference on Computational Linguistics, pp. 26–34 (2010)
3. Bharati, A., Chaitanya, V., Sangal, R., and Ramakrishnamacharyulu, K.: Natural Language Processing: A Paninian Perspective. Prentice-Hall of India (1995)
4. Bharati, A., Sangal, R., and Sharma, D.: SSF: Shakti Standard Format guide. Technical report, IIIT Hyderabad (2007)
5. Butt, M.: The light verb jungle. In Workshop on Multi-Verb Constructions (2003)
6. Forcada, M., Bonev, B., Rojas, S., Ortiz, J., Sánchez, G., Martínez, F., Armentano-Oller, C., Montava, M., and Tyers, F.: Documentation of the open-source shallow-transfer machine translation platform Apertium (2008)
7. Faridee, A.Z.M. and Tyers, F.M. and others: Development of a morphological analyser for Bengali. Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos (2009)
8. Jena, I., Chaudhury, S., Chaudhry, H., and Sharma, D.: Developing Oriya morphological analyzer using Lt-toolbox. Information Systems for Indian Languages, pp. 124–129 (2011)
9. Kar, K.C.: Taruna Sabdakosha. Volume 1, Grantha Mandir, Cuttack, Orissa, India (2000)
10. Kulkarni, A. and Shukla, D.: Sanskrit morphological analyzer: Some issues. In Bh. K Festschrift volume by LSI (2009)
11. Lttoolbox, http://wiki.apertium.org/wiki/Lttoolbox
12. Mahapatra, B.P.: A Synchronic Grammar of Oriya. Udaya Narayana Singh, Central Institute of Indian Languages, Mysore, India (2007)
13. Parameshwari, K.: An implementation of Apertium morphological analyzer and generator for Tamil. Parsing in Indian Languages, p. 41 (2011)
14. Sahoo, K.: Oriya nominal forms: a finite state processing. In Conference on Convergent Technologies for Asia-Pacific Region, pp. 730–734 (2003)
15. Shabadi, K.: Finite state morphological processing of Oriya verbal forms. In Proceedings of the EACL-2003 Workshop on Computational Linguistics for the Languages of South Asia: Expanding Synergies with Europe, pp. 49–56 (2003)
16. Vaidya, A. and Sharma, D.: Using paradigms for certain morphological phenomena in Marathi. In 7th International Conference on NLP (ICON), pp. 132–139 (2009)

Yet another suite of multilingual NLP tools

Marcos Garcia and Pablo Gamallo

Centro Singular de Investigación en Tecnoloxías da Información (CiTIUS)
Universidade de Santiago de Compostela, Galiza (Spain)
{marcos.garcia.gonzalez, pablo.gamallo}@usc.es

Abstract. This paper presents the current development of a multilingual suite for Natural Language Processing. It consists of a sentence chunker, a tokenizer, a PoS-tagger, a dictionary-based lemmatizer and a Named Entity Recognizer (both for enamex and numex expressions). The architecture of the pipeline and the main resources used for its development are described. Besides, the PoS-tagger and the Named Entity Recognizer are evaluated against several state-of-the-art systems. The experiments performed in Portuguese and English show that, in spite of its simplicity, our system competes with some well known tools for NLP. It is entirely written in Perl and distributed under a GPL license.

Keywords: natural language processing, PoS-tagging, named entity recognition, Portuguese, English

1 Introduction

This paper presents CitiusTools, a multilingual suite for Natural Language Processing (NLP) which performs the following tasks: sentence chunking, tokenization, PoS-tagging, lemmatization and Named Entity Recognition (NER). The suite is entirely written in Perl and distributed under a GPL license.1 The paper presents the architecture of the pipeline as well as its adaptation to Portuguese and English (the Spanish version was introduced in [6]). A set of experiments is also presented, aimed at assessing the performance of the PoS-tagger and the NE classifier modules. The results show that, in spite of its simplicity, our system behaves quite well when compared to some state-of-the-art suites such as Stanford CoreNLP or FreeLing. Besides, it performs noticeably better than the models provided by other systems such as OpenNLP. Sect. 2 introduces some related work. Then, the architecture of the system is presented in Sect. 3. Sect. 4 shows the external resources used for its adaptation to Portuguese and English, while Sect. 5 contains the performed experiments. Finally, Sect. 6 describes the main conclusions of this paper.

2 Related Work

In the last years, several open-source NLP suites have been published and made available to users. Some of them provide models for languages such as Portuguese and English (evaluated in this paper), while others include analyzers for other varieties such as Spanish, Chinese, German or Arabic.

Stanford CoreNLP [11] is one of the best known suites, including modules like tokenizers, PoS-taggers, named entity recognizers, coreference resolution systems and syntactic parsers. It is written in Java and has been developed mainly for English, but recently models have been published for languages such as Spanish, Chinese, German or Arabic.

FreeLing [12] is a suite of language analyzers (written in C++) which includes modules similar to those of the Stanford system, and also has tools for other tasks such as phonetic encoding. Most FreeLing modules analyze data in Catalan, Spanish, Portuguese, English or French (among others).

Another toolkit for NLP analysis written in Java is OpenNLP,2 which performs the most common NLP tasks. Models are available for several languages, including English, Spanish or German.

Finally, IXA pipes [1] (a modular set also written in Java) performs tokenization, PoS-tagging, NER and parsing. Among the languages covered by this tool (depending on the module) are Spanish, English, Basque, Italian or Galician.

The system presented in this paper is, to the best of our knowledge, the first one written entirely in Perl. It provides a simple, efficient and ready-to-use set of NLP tools with a performance close to the state of the art.

1 http://proxectos.citius.usc.es/hpcpln/index.php

3 Architecture

Our system consists of five modules that can be applied in a pipeline in order to perform NLP tasks. The current version contains the following tools:

3.1 Sentence chunker

This module is composed of a language-dependent list of abbreviations and a set of Finite State Automata (FST) aimed at identifying sentence boundaries. The automata detect entities such as URLs, e-mail addresses and other elements containing dots that are not in sentence-ending position. Also, abbreviations ending in a dot character (e.g., Dr., corp., etc.) are not marked as sentence boundaries (except if their context is covered by a FST). The output of this module is the input text with one sentence per line.

3.2 Tokenizer

The next module of the suite splits each identified sentence into its tokens. It is a rule-based tokenizer enriched with a few language-dependent adaptations. First, the tokenizer identifies compound punctuation (such as ellipsis) and other punctuation inside numerical expressions. After that, a simple blank-space tokenizer is applied (which also splits off the punctuation that does not belong to larger expressions).

2 http://opennlp.apache.org/


Then, a battery of language-dependent rules is applied in order to split contractions (e.g., don't > do/not, in English), verb+pronoun forms (e.g., mantém-se > mantém/se, in Portuguese) and other elements which are useful for further NLP analysis. Note that some forms can be ambiguous between a contracted and a non-contracted element: desse, in Portuguese, could be a single-token form of the verb dar (to give), or a contracted form of a preposition and a demonstrative (de/esse). As the decision to split these forms depends on their PoS-tag, the tokenizer does not split them. Thus, as in other works [8], these forms are analyzed by the PoS-tagger, which will split them (or not) according to the selected PoS-tag. Those cases where an element of the contraction may represent two different tokens (with a different PoS-tag, e.g., I'd > I would or I had, in English) are also split in this step, but the lemma will be provided by the disambiguation of the PoS-tagger. The output of this module is a vector of tokens representing each previously identified sentence.
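Such language-dependent rules can be kept in a small table per language. The entries below are a hypothetical sketch, not the actual rule set of the suite; ambiguous Portuguese forms such as desse are deliberately absent, since they are left untouched for the PoS-tagger:

  # hypothetical English contraction table
  my %en_rules = (
      "don't" => [ 'do', 'not' ],
      "can't" => [ 'can', 'not' ],
      "I'd"   => [ 'I', "'d" ],   # 'd resolved to would/had later, by the tagger
  );

  sub expand_token {
      my ($token) = @_;
      return @{ $en_rules{$token} || [ $token ] };
  }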

3.3 PoS-tagger

The PoS-tagger assigns a morphosyntactic tag (from a set of predefined tags, the tagset) to each token. This module is a Bayesian classifier based on bigrams of tokens. It uses additive smoothing, a common component of Bayesian classifiers. In order to label a token, the classifier computes the probability of each tag t_i linked to the token, taking into account a set of contextual features A_1, ..., A_n:

P(t_i \mid A_1, \ldots, A_n) = P(t_i) \prod_{i=0}^{N} P(A_i \mid t_i) \qquad (1)

The best set of features, selected in preliminary tests, was the following:

– t_{i-1}: the PoS-tag of the previous token.
– t_{i+1}: the PoS-tag of the next token.
– (k_i, t_{i-1}): the co-occurrence of the ambiguous token k_i together with the tag of the previous token.
– (k_i, t_{i+1}): the co-occurrence of the ambiguous token k_i together with the tag of the next token.

The model needs to be trained with a labeled corpus and a dictionary with the possible PoS-tags for each known token. The algorithm disambiguates the tokens from left to right, so the left context of an ambiguous token has already been labeled. Accordingly, the features concerning the tag of the next token (t_{i+1}) include the probabilities of the different tags that could be associated with that token. This strategy is similar to the Hidden Markov Model (HMM) algorithm proposed in [2]. The main difference is that our system handles PoS-tagging as an individual classification problem (token by token), instead of searching for the best sequence of PoS-tags. Computational efficiency is the main reason for using this simple approach.
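The per-token disambiguation can be sketched as follows. This is a simplified illustration: the hash references $p_tag, $p_feat and $lexicon are hypothetical stand-ins for the probabilities estimated from the training corpus and for the dictionary, and a small constant replaces the additive smoothing actually used:

  # pick the most probable tag for an ambiguous token given its context
  sub best_tag {
      my ($token, $prev_tag, $next_tags, $p_tag, $p_feat, $lexicon) = @_;
      my ($best, $best_score) = (undef, 0);

      for my $tag (@{ $lexicon->{$token} }) {              # candidate PoS-tags
          my $score = $p_tag->{$tag} || 1e-9;              # P(t_i)
          my @feats = ("prev:$prev_tag", "tok+prev:$token:$prev_tag");
          push @feats, "next:$_", "tok+next:$token:$_" for @$next_tags;
          $score *= ($p_feat->{$tag}{$_} || 1e-9) for @feats;   # product of P(A_i | t_i)
          ($best, $best_score) = ($tag, $score) if $score > $best_score;
      }
      return $best;
  }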


The tagsets of the PoS-taggers follow the EAGLES guidelines [10]. For Portuguese, a tagset with 193 elements is used; the English tagset has 27 tags. Both include 9 extra tags for punctuation. The difference between these tagsets comes from the complex verbal conjugation and nominal inflection of Portuguese. Note, however, that the classifier does not use the 193 elements in Portuguese: it uses just 21 tags for disambiguating the morphosyntactic category (e.g., noun, adjective) of each word. The remaining information (gender, number, tense, etc.) is then taken from the labeled dictionary. The output of the PoS-tagger is the input vector enriched with a morphosyntactic label for each token.

3.4 Named Entity Identifier

The next module of the pipeline is an FST-based identifier of numex and enamex (named entity) expressions. Before starting the identification process, this module takes advantage of a lemmatized dictionary (see Section 4) in order to assign a lemma to each token. It also uses the predicted PoS-tag for disambiguating tokens with different lemmas depending on their morphosyntactic category. For identifying numex expressions (in our system: dates, currencies, numbers, measures and quantities), a set of language-dependent FSTs is applied, covering the most common forms of representing these elements in each language. The named entities (enamex expressions) are identified taking into account both their capitalization and possible functional words inside them (e.g., Banco de Portugal). In order to better identify the boundaries of the enamex expressions, this module also needs a list of words which can be both common words at sentence-beginning position and the first element of a named entity (e.g., Neves, which can be a capitalized noun and a proper noun, a surname or a location, in Portuguese). These ambiguous forms are obtained semi-automatically using dictionaries and lists of gazetteers. The output of this module is the input vector enriched with the identification of the numex and enamex expressions, as well as with the lemmas provided by the dictionary.
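A rough sketch of the enamex boundary detection follows (illustrative only: the real module is FST-based and also handles the sentence-initial ambiguous forms mentioned above; the functional-word list is a placeholder):

  # collect candidate enamex spans: capitalized tokens optionally joined by
  # functional words (e.g. "Banco de Portugal")
  my %functional = map { $_ => 1 } qw(de da do das dos e);

  sub enamex_candidates {
      my (@tokens) = @_;
      my (@spans, @current);
      for my $tok (@tokens) {
          if ($tok =~ /^\p{Lu}/) {                        # capitalized token
              push @current, $tok;
          } elsif (@current && $functional{lc $tok}) {    # functional word inside a name
              push @current, $tok;
          } else {
              pop @current while @current && $functional{ lc $current[-1] };
              push @spans, join(' ', @current) if @current;
              @current = ();
          }
      }
      pop @current while @current && $functional{ lc $current[-1] };
      push @spans, join(' ', @current) if @current;
      return @spans;
  }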

3.5 Named Entity Classifier

The named entity classifier module assigns each enamex one of the following labels: person, organization, location or misc (miscellaneous). In order to classify an entity, this module uses large lists of encyclopedic gazetteers together with a set of rules for semantic disambiguation. The gazetteers were automatically extracted from structured resources such as Freebase3 and DBpedia,4 and enriched with semi-structured knowledge obtained from the infoboxes and category trees of Wikipedia.5

3 http://www.freebase.com
4 http://www.dbpedia.org


The gazetteers consist of four lists of entities (one for each semantic category). Besides, the system also uses small lists of trigger words, which are nouns that can subclassify an entity (e.g., "singer" for the class person or "company" for organization). The trigger words were also automatically extracted from the category trees of Wikipedia. Finally, a list of the most frequent personal names for each language (which are not common nouns) is used. The disambiguation rules are applied using the following strategy for each named entity:

1. If the entity appears in only one of the gazetteer lists, it is classified with the class it belongs to.
2. If the entity appears in several lists (or if it does not appear in any), the context is analyzed. This context includes two windows (before and after) of three tokens each. If a trigger word is found in the context, the entity is classified as belonging to the trigger word class (with some restrictions, such as trigger words in prepositional phrases: "Caixa Geral" will not be labeled as person even if the trigger word "president" occurs in the context, as in president of Caixa Geral).
3. If the entity starts with (or is) a frequent personal name present in the list, it is classified as person.
4. If the entity is ambiguous (it appears in more than one list or contains trigger words from different classes) and it cannot be disambiguated by its context, the most probable class is selected (prior probability), computed from the distribution of the gazetteers in Wikipedia.
5. If the context is not enough to disambiguate the entity, a rule verifies whether it contains a trigger word or the first token of a gazetteer entry inside. If there is more than one option, the gazetteers are preferred over the trigger words, and in case of ambiguity the prior probability is also computed.
6. If the previous rules cannot classify the entity, it is labeled as misc.
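The rule cascade above can be summarized in a simplified sketch such as the following (hypothetical data structures: $gazetteers maps each class to a set of entities, $triggers maps trigger words to classes, $priors holds the prior class probabilities and $first_names lists frequent personal names; the restrictions of rule 2 and the gazetteer-token check of rule 5 are omitted):

  # return one of person, organization, location or misc for an enamex
  sub classify_enamex {
      my ($entity, $context, $gazetteers, $triggers, $priors, $first_names) = @_;

      my @lists = grep { $gazetteers->{$_}{$entity} } keys %$gazetteers;
      return $lists[0] if @lists == 1;                          # rule 1

      for my $word (@$context) {                                # rule 2 (+-3 tokens)
          return $triggers->{$word} if exists $triggers->{$word};
      }

      my ($first) = split /\s+/, $entity;                       # rule 3
      return 'person' if grep { $_ eq $first } @$first_names;

      if (@lists > 1) {                                         # rule 4: prior probability
          my ($best) = sort { $priors->{$b} <=> $priors->{$a} } @lists;
          return $best;
      }

      for my $word (split /\s+/, $entity) {                     # rule 5 (trigger words only)
          return $triggers->{$word} if exists $triggers->{$word};
      }
      return 'misc';                                            # rule 6
  }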

Note that the rules are mainly language-independent. In our case, only one rule had to be changed when adapting the system to English: a trigger word inside an entity appears in final position, instead of at the beginning, as in Portuguese (National Museum versus Museu Nacional). Even though the performance of this module depends on the quality and coverage of the gazetteers, the use of contextual features together with the combination of rules that analyze the internal form of each entity allows the system to keep a reasonable accuracy even for unknown forms.

4 Resources

This section briefly describes the external resources used by the different NLP modules of our system. Tab. 1 includes a summary of these data.

5 http://www.wikipedia.org


Table 1. Summary of the size of the resources: dictionaries, PoS-tagger training corpora, NER testing corpora and total number of gazetteers.

Language    Dictionary  PoS-tagger (train)  NER (test)  Gazetteers
Portuguese  1.250M      130k                75k         100k
English     350k        1M                  524k        1.5M

4.1 Portuguese

For training the PoS-tagger for Portuguese (and also for extracting some of the lists described above), we used the FreeLing dictionary based on the Label-Lex lexicon [4]. It consists of ≈ 1.25 million token-tag pairs from about 120k lemmas. For training the PoS-tagger we used a subset of the CoNLL version of Bosque 8.0, with about 130k tokens.6 For testing, we used a different subset of the Bosque and three small corpora of European Portuguese (EP) news, Brazilian Portuguese (BP) news and Wikipedia articles. For testing the named entity classification, we used both a subset of the labeled version of the Bosque (≈ 20k tokens) and the Corpus-Web (with about 55k tokens of different varieties of Portuguese) [9]. In order to build the gazetteers, the Portuguese version of Wikipedia was used for extracting entity names. Apart from that, large lists of countries and cities were also merged, together with the most common names and surnames in Portuguese and other freely available lists of gazetteers (such as the FreeLing data), generating the following lists: 59,421 person entities, 14,197 organizations, 34,590 locations and 838 misc gazetteers.

4.2 English

For English, the morph english dict.v1.4 was used, with about 350k token-tag pairs from ≈ 77.5k lemmas.7 For training and testing the PoS-tagger we used the Brown corpus, with ≈ 1.2 million tokens:8 ≈ 1 million tokens were randomly selected for training, while the tests were carried out on the remaining 200k tokens. Both the dictionary and the corpora had to be adapted and converted to the same tagset. The classification of named entities was evaluated using two corpora: the IEER,9 with 68,402 tokens and annotation of person, location and organization entities (not misc), and the SemCor corpus,10 with 455,597 tokens and annotation of the four enamex classes. The PoS-tags of this last corpus had been automatically predicted (not manually revised).

6 http://www.linguateca.pt/floresta/CoNLL-X/
7 ftp://ftp.cis.upenn.edu/pub/xtag/morph-1.5/morph-1.5.tar.gz
8 http://clu.uni.no/icame/brown/bcm.html
9 http://www.itl.nist.gov/iad/894.01/tests/ie-er/er_99/er_99.htm
10 http://www.gabormelli.com/RKB/SemCor_Corpus


The English gazetteers were extracted from Freebase and DBpedia, and enriched with lists of countries and capitals and the most common names and surnames in this language. The final versions had the following sizes: 922,767 entries for person, 126,334 for organization, 351,151 for location and 94,525 for misc.

5 Evaluation

This section describes the evaluation experiments performed on the two main modules of the system: the PoS-tagger (CitiusTagger) and the NE classifier (CitiusNEC). The experiments were carried out in Portuguese and English, using three NLP suites for comparison: FreeLing (for Portuguese), and Apache OpenNLP and Stanford CoreNLP (for English).11 It is important to note that some results are not strictly comparable, since we used the models provided by each software package. On the one hand, these models were trained with different resources (corpora, lexicons, gazetteers, etc.) and use different tagsets (quickly adapted for these experiments). On the other hand, the alignment between the gold standard and the test files also introduces variation in the results (as shown below). Thus, the objective of this evaluation is not to determine the best system for PoS-tagging and NE classification of Portuguese and English texts, but to obtain a fair comparison of our system analyzing the same data as other NLP suites.

5.1 PoS-tagger

The first set of experiments compared the performance of the PoS-tagger in Portuguese and English.

Table 2. PoS-tagging results (precision) for Portuguese.

Corpus         Size    CitiusTagger  FreeLing
Bosque         80,881  96.07         96.62
EP News        13,964  96.70         97.76
BP News        11,476  95.73         96.99
Wikipedia      17,149  95.76         96.13
Macro-average  —       96.06         96.88
Micro-average  —       96.06         96.72

Tab. 2 contains the results for Portuguese. Our Bayesian PoS-tagger was compared to the HMM model of FreeLing [12, 8], analyzing the four corpora mentioned in Section 4. The results include the precision (true positives / (true positives + false negatives)) on each corpus, as well as the macro- and micro-average values.

11 The output of each system as well as the gold-standard files can be obtained in the following url: http://gramatica.usc.es/~marcos/slate15.zip.


The macro-average is the harmonic mean of the results from each corpus, while the micro-average is computed from the sum of all the true and false positives and negatives over all corpora. When compared to the HMM model, our system behaves quite similarly on every corpus (with a maximum difference of -1.2 points in BP News), with average results of 96% precision. Note that this comparison is strict, since both the gold standard and the testing corpora were perfectly aligned. Besides, the tagset of our system and that of FreeLing are almost identical.

In English, the Bayesian PoS-tagger was compared to three different models (on one corpus): the maximum entropy and perceptron classifiers of OpenNLP (1 and 2, respectively) and the Stanford POS Tagger (maximum entropy) [13]. The output of the external systems (OpenNLP and Stanford) was automatically converted to the tagset of the gold standard.

Table 3. PoS-tagging results (precision) for English. OpenNLP 1 is a maximum entropy model, while OpenNLP 2 is a perceptron classifier. The test corpus has a size of 209,406 tokens.

CitiusTagger  OpenNLP 1  OpenNLP 2  Stanford
93.55         91.72      90.93      91.12

The results (Tab. 3) show that our PoS-tagger performs as well as the maximum entropy and perceptron models. Actually, the precision of the Bayesian model is almost 2% higher, but the evaluation cannot be strict: some minority tags (e.g., FW for foreign words) appear in the gold standard but not in the tagsets of these taggers (and vice versa). However, these experiments (together with the Portuguese ones) suggest that the Bayesian model achieves high performance despite its simplicity.

5.2 Named Entity Classifier

Concerning NE classification, the Portuguese system was also compared to the FreeLing AdaBoost classifier [3, 7] on two corpora: Bosque and Corpus-Web. Tab. 4 shows the results of these two classifiers on the referred corpora. In Bosque, our system achieved slightly better results than the AdaBoost classifier, while in Corpus-Web the FreeLing module obtained better results. Again, the average results show that a simple system (based on resources and rules) has a performance similar to that of a supervised classifier. In English, the resource-based method was compared to OpenNLP (Name Finder models)12 and to the Stanford NER (a CRF with distributional similarity features in an IOB2 classifier)13 [5].

12 http://opennlp.sourceforge.net/models/english/namefind/
13 http://nlp.stanford.edu/software/conll.distsim.iob2.crf.ser.gz


Table 4. Named entity classification results (f-score) for Portuguese. NEs refers to the number of full enamex entities (not tokens) in each corpus.

Corpus         Tokens  NEs    CitiusNEC  FreeLing
Bosque         19,579  1,027  90.07      88.89
Corpus-Web     55,305  3,666  73.76      75.31
Micro-average  —       —      81.92      82.10
Macro-average  —       —      77.33      78.22

Table 5. Named entity classification results (f-score) for English.

Corpus         Tokens   NEs    CitiusNEC  OpenNLP  Stanford
IEER           68,402   3,384  75.95      52.77    75.86
SemCor         455,597  9,696  58.81      44.85    65.57
Macro-average  —        —      63.38      48.90    70.72
Micro-average  —        —      63.23      47.10    68.63

The output of these systems was automatically converted to the CoNLL IOB format (used in both the IEER and SemCor corpora). The results of the named entity classifiers (Tab. 5) show that on the IEER corpus our system performs as well as the Stanford model, while on SemCor the Stanford model outperforms ours by more than 7%. On average, our resource-based classifier performed much better (≈ 5%) than the OpenNLP system, while the Stanford classifier outperformed ours by 5%–7% f-score.

Finally, a test was carried out to measure the processing speed of the evaluated systems. They were used to label a Spanish corpus of 100,000 tokens (on an Intel Core2 2.5GHz processor with 4GB of RAM running Debian Jessie). The systems needed the following times to apply the full pipeline (sentence chunker, tokenizer, PoS-tagger and NER): OpenNLP (only NER): 1m48s; FreeLing: 2m27s; CitiusTools: 2m38s; and Stanford CoreNLP: 11m25s.

In sum, the evaluations performed on the two main modules of our pipeline, CitiusTagger and CitiusNEC, suggest that they achieve very good results (some of them comparable to state-of-the-art systems) despite their simplicity and their quick adaptation to Portuguese and English. This is in accordance with the results obtained for Spanish, as described in [6].

6 Conclusions and Further Work

This paper presented the current version of CitiusTools, a multilingual suite for NLP which includes modules for the most common tasks in this field. The modules, written in Perl, combine rule-based and supervised models which take advantage of external resources such as lexicons, labeled corpora and large lists of gazetteers.


Two of the modules (the PoS-tagger and the NER) were evaluated in Portuguese and English, and compared to some of the best NLP tools available for these languages. The results showed that the performance of our system is similar to the state of the art, even though it was quickly adapted to these languages. In current work, we are adapting all the modules of the suite to two new languages (Galician and French), and in further work we expect to include a deterministic module for coreference resolution.

References

1. Agerri, R., Bermudez, J., Rigau, G.: IXA pipeline: Efficient and Ready to Use Multilingual NLP tools. In Proceedings of the 9th Language Resources and Evaluation Conference (LREC 2014), Reykjavik (2014).
2. Brants, T.: TnT – A Statistical Part-of-Speech Tagger. In Proceedings of the 6th Conference on Applied Natural Language Processing (ANLP). Association for Computational Linguistics (2000).
3. Carreras, X., Màrquez, Ll., Padró, Ll.: A Simple Named Entity Extractor using AdaBoost. In Proceedings of the Conference on Natural Language Learning (CoNLL 2003) Shared Task. Edmonton (2003).
4. Eleutério, S., Ranchhod, E., Mota, C., Carvalho, P.: Dicionários Electrónicos do Português. Características e Aplicações. In Actas del VIII Simposio Internacional de Comunicación Social, pp. 636–642, Santiago de Cuba (2003).
5. Finkel, J., Grenager, T., Manning, C.: Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005), pp. 363–370 (2005).
6. Gamallo, P., Pichel, J.C., Garcia, M., Abuín, J.M., Fernández Pena, T.: Análisis morfosintáctico y clasificación de entidades nombradas en un entorno Big Data. Procesamiento del Lenguaje Natural, 53, pp. 17–24 (2014).
7. Garcia, M.: Extracção de Relações Semânticas. Recursos, Ferramentas e Estratégias. PhD Thesis, University of Santiago de Compostela (2014).
8. Garcia, M., Gamallo, P.: Análise Morfossintáctica para Português Europeu e Galego: Problemas, Soluções e Avaliação. Linguamática, 2(2), pp. 59–67 (2010).
9. Garcia, M., Gamallo, P.: Multilingual corpora with coreferential annotation of person entities. In Proceedings of the 9th edition of the Language Resources and Evaluation Conference (LREC 2014), pp. 3229–3233, Reykjavik (2014).
10. Leech, G., Wilson, A.: Recommendations for the Morphosyntactic Annotation of Corpora. Technical Report, EAGLES: Expert Advisory Group on Language Engineering Standards (1996).
11. Manning, C., Surdeanu, M., Bauer, J., Finkel, J., Bethard, S.J., McClosky, D.: The Stanford CoreNLP Natural Language Processing Toolkit. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014): System Demonstrations, pp. 55–60 (2014).
12. Padró, Ll.: Analizadores Multilingües en FreeLing. Linguamática, 3(2), pp. 13–20 (2011).
13. Toutanova, K., Klein, D., Manning, C., Singer, Y.: Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network. In Proceedings of the Human Language Technology Conference and the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), pp. 252–259, Edmonton (2003).

Odin: A Service for Gamification of Learning Activities

José Paulo Leal1, José Carlos Paiva2, and Ricardo Queirós3

1 CRACS & INESC-Porto LA, Faculty of Sciences, University of Porto, Porto, Portugal [email protected] 2 CRACS & INESC-Porto LA, Faculty of Sciences, University of Porto, Porto, Portugal [email protected] 3 CRACS & INESC-Porto LA & DI/ESEIG/IPP, Porto, Portugal [email protected]

Abstract. Existing gamification services have features that preclude their use by e-learning tools. Odin is a gamification service that mimics the API of state-of-the-art services without these limitations. This paper describes Odin, its role in an e-learning system architecture requiring gamification, and details its implementation. The validation of Odin involved the creation of a small e-learning game, integrated in a Learning Management System (LMS) using the Learning Tools Interoperability (LTI) specification.

Keywords: Gamification, E-Learning, Game services, Interoperability

1 Introduction

The use of game concepts and mechanics in non-game contexts is an effective way to engage users. Gamification is currently a watchword in different domains, from marketing to e-learning [2]. The massive use of this approach led to the concept of gamification as a service, provided by major players such as Google and Microsoft. These services leverage their large user bases to provide support for game progress mechanics such as points, leaderboards and badges, without requiring a specific authentication from the client application.

Gamification services are a great advantage to small web and tablet based applications, in particular to games. The game progress mechanics provided by these services are also relevant in e-learning. However, e-learning systems are typically deployed in environments with a single sign-on managed by an academic institution. It would be unacceptable to require students to have an account with a third party such as Google, for instance. The purpose of the Odin service is to provide a gamification service similar to the state of the art, without requiring registration of the end users. Its API is inspired by the Google Play Game Service (GPGS), with minor adjustments regarding user identification.

The remainder of this paper is organised as follows. Section 2 reviews the state of the art in game services. Section 3 introduces the Odin service, its design and implementation.


Section 4 describes its evaluation using a small serious game as a case study. Finally, Section 5 summarizes the contributions of this research.

2 Game Services

The video game industry is one of the fastest growing sectors of the worldwide economy [8]. According to the research company Gartner, global video game sales will reach $111.1 billion in 2015, due in part to the growth in mobile game play and the recent release of the new generation of game consoles. In order to increase engagement and player retention, video games include several common features such as leaderboards and achievements. The massive use of this approach and the impressive growth in the number of players led to the concept of gamification as a service, later materialized in the Game Backend as a Service (GBaaS). The approach is simple: instead of replicating the implementation of the game features in each version of the game for the various platforms, GBaaS adhere to a service-oriented architecture providing cross-platform game services that let developers easily integrate popular gaming features such as achievements, leaderboards, remote storage and real-time multiplayer in mobile games.

While the concept of "winners and losers" can hinder the motivation of students [7], gamification is currently being applied with relative success in e-learning [1, 6]. The integration of game concepts in learning environments helps students to remain focused and to fulfil their course goals. However, the implementation of gamification in these domains is often trapped in ad-hoc solutions or supported by specific platforms (for instance, the badges in Moodle), instead of using approaches such as those provided by GBaaS.

In the following subsections we briefly summarize the main common game features that can be applied to the teaching-learning process. Then, we compare six GBaaS regarding social and technical features. This study is part of an effort to select a GBaaS on which to base the development of a service for gamification of learning activities.

2.1 Game concepts

Games are more interesting when players are able to achieve goals and compete against other players. These features foster retention and competitiveness, and are also applicable in the gamification of e-learning activities. The following list enumerates the most common game concepts:

Leaderboards are databases that keep scores. They allow users to post their scores in a game and compare themselves with other players' scores. They measure the success of a player in a game.

Achievements are goals/challenges set in a game that players manage to accomplish. Achievements give players a motivation to keep playing, to earn as many as possible, and a way to compare themselves with other players. The fulfilment of a goal may enhance the status of the player or unlock access to other levels, for instance.


Multiplayer is a play mode that allows several players to simultaneously cooperate or compete in a game. This feature supports a range of other sub-features, such as challenges, where players compete with each other on either a score challenge or an achievement challenge, and matchmaking games for real-time, turn-based, or self-hosted matches.

Saved games allow the remote storage (in the cloud) of game data, for instance, the state and the player's progress in the game.

Quests are periodic game challenges that players can complete to earn rewards. This way, developers can launch periodic challenges to their gaming communities.

Gifts allow players to send/request game resources or items to/from friends (for instance, in their Google+ circles).

Matchmaking automatically sets up game matches and finds opponents based on parameters set by the game developer. Usually only a specific number of players can be matched at the same time.

2.2 Game Backend Services

A Backend-as-a-Service (BaaS) is a cloud computing service model acting as a middleware component that allows developers to connect their web and mobile applications to cloud services via application programming interfaces (API) and software development kits (SDK). BaaS features include cloud storage, push notifications, server code, user and file management, social networking integration, location services, as well as many other backend services. These services have their own APIs, allowing them to be integrated into applications in a fairly simple way [3].

A Game-Backend-as-a-Service (GBaaS) is a subset of a BaaS that includes cross-platform solutions for the typical game concepts identified in the previous subsection. During the development process of a game (or a generic application), developers must choose between building their own back-end services or using an available game back-end platform. The latter option is usually preferred, since GBaaS include several services specifically tailored for game development. These services allow developers to focus on the game logic by freeing them from implementing boilerplate features.

The following subsections compare several GBaaS according to their social and technical features. Given the number of GBaaS found (32), it would be impracticable to study them all. Therefore, eight GBaaS were chosen: Google Play Game Services, Yahoo Backend Game Service, GameUp, Flox, GameSparks, Fresvii, Kumakore and Photon. These features are summarized in Table 1.

Social game features

The studied GBaaS provide developers with social game services accessed through cross-platform APIs. These features make the gameplay more competitive and collaborative, and improve social engagement. Analysing Table 1, one concludes that almost all GBaaS support leaderboards, multiplayer game mode and cloud storage. Other features, such as quests and matchmaking, are not yet widely supported, probably due to their novelty.


Table 1. Social and Technical game features

Types      Features      Google  Yahoo  GameUp  GSparks  Fresvii  Photon
Social     Leaderboards  yes     no     yes     yes      yes      yes
           Achievements  yes     yes    yes     yes      no       no
           Multiplayer   yes     yes    no      yes      yes      yes
           Save Data     yes     yes    yes     yes      yes      yes
           Quests        yes     no     no      yes      no       yes
           Gifts         yes     yes    no      yes      no       yes
           Matchmaking   no      no     yes     no       yes      yes
Technical  WS            REST    -      REST    REST     -        REST
           Res. format   JSON    -      JSON    JSON     -        JSON
           Auth          G+ and other social logins (Yahoo, Facebook, Twitter), varying per GBaaS
           Platforms     Android, iOS, JavaScript, C#, C++, .NET, ActionScript, and the Unity, Cocos2D and Marmalade engines/tools, varying per GBaaS

Technical game features

The studied GBaaS offer cloud services through APIs and SDKs for various platforms. Regarding authentication, almost all GBaaS use the same strategy: before the game can make any calls to the game services, it must first establish an asynchronous connection with the backend servers and authenticate with the game services. Some GBaaS require that players have an account on specific backends (GPGS requires that users have a Google account). Others, such as GameSparks, provide a simple mechanism that allows games to implement social login without any additional code, allowing gamers, for instance, to sign in using a Facebook or Twitter account and start playing.

The majority of the GBaaS provide an HTTP RESTful API. The data in all HTTP store operations (PUT and POST) is required to be valid JSON, and all response data from the GBaaS also comes back in JSON format. Regarding the REST API reference, in the authors' opinion GPGS has the most complete and best documented API.

In addition to the REST API, most GBaaS also support mobile platforms. There are SDKs for Android, iOS, and even FirefoxOS (GameUp) mobile native apps. Game engines are also supported, and most GBaaS offer SDKs for major game engines such as Unity, as well as for cross-platform game development tools such as Marmalade and Cocos2D.

3 Odin

This section describes Odin, a gamification RESTful web service to be used by educational institutions. It provides (1) score submission, (2) leaderboard listing, (3) quests for players, and (4) awards to players for in-game accomplishments, as well as some minor services to manage institutions, players, leaderboards, quests and achievements.


Odin is based on a standard gamification API but takes a different approach to authentication. Institutions, rather than end-users, are the ones that require authentication. Once an institution is authenticated, Odin grants it permission to manage scores, quests and achievements for its users. The next subsections present the architecture of Odin and its main components, and describe its data model and service API.

3.1 Architecture

Odin is a RESTful web service that allows institutions to consume gamification resources from their web applications. The web applications initialize sessions in Odin through authentication built on top of the OAuth2 authorization protocol. Then they request particular actions from the server, identified by a specific URI and an HTTP method such as POST, GET, PUT or DELETE.

Fig. 1. Sequence diagram representing a common request to Odin (the client is redirected to the authorization server, authenticates and approves the release of a token, then presents that token with each HTTP request; Odin validates the authorization, retrieves or modifies the data stored in the Redis database, and answers with a JSON response).

Figure 1 presents a sequence diagram that summarizes the interactions of Odin with other systems when a request is made by the client. Firstly, the HTTP request made by the client is subject to a security filter that checks whether the institution is authenticated. If the institution is not authenticated or authorized to access Odin resources, it is redirected to the authorization server, where it authenticates and approves the release of a token with the authorization proof. The generated token (with an expiration time) is sent to the client, and the client presents the access token to Odin. When the client is authenticated and authorized, the request is passed to the JAX-RS REST interface implemented using Jersey (described in the next subsection) and forwarded to the mapped resource. From the resource layer it is forwarded to the service layer, passing through a security layer which intercepts it to check authorization and roles, ensuring that only authorized institutions have access to the services.


The service layer responds to the request with the data persisted in Redis (described in the next subsection) through the Jedis client (using the REdis Serialization Protocol) and the Ohm library implementation for Java. The response sent to the client is a JSON object representing the resource type modified or requested (each resource type may have one or more data representations). Whenever a fresh token is needed, the client can request it from the authorization server.

3.2 Frameworks and Tools

Odin uses Jersey, an open-source framework that is the reference implementation of the Java API for RESTful Web Services (JAX-RS), extending it with additional features and utilities that further simplify RESTful service development. Among other features, Jersey provides a Core Server to build RESTful services based on annotations, support for JSON and for the Java Architecture for XML Binding, as well as a Core Client to easily create clients that communicate with REST services.

Data storage relies on the Redis NoSQL database, which provides an open-source and advanced key-value storage and cache solution. It is a high-performance alternative to traditional Relational Database Management Systems (RDBMS) [5] for storing and accessing large amounts of data. Redis is sometimes described as a data structure server, since keys can contain strings, hashes, lists, sets and sorted sets. As a NoSQL database it focuses on performance and scalability rather than on guaranteeing the atomicity, consistency, isolation and durability (ACID) properties. Redis was selected for the backend due to its ability to store large amounts of non-critical data very efficiently.

In order to integrate Redis in Odin, the data layer resorts to the Jedis client, as well as an object-hash mapping library, named JOhm, to store and retrieve objects from Redis with a higher level of abstraction and thus simplicity. JOhm is the Java implementation of the well-known Ohm library and aims to be minimally invasive, relying only on reflection aided by annotation hooks for persistence.

3.3 Data Model

The data model of Odin consists of seven main entities: institution, player, leaderboard, score, quest, achievement and session, related as denoted in the UML class diagram of Figure 2.

An institution is the entity that manages games and all related data, and so it is the one which needs authentication and/or authorization. Thus, it needs to store an id and a password to authenticate, and also a token to check the validity of the session. Whenever an institution authenticates, a session is created and linked to it (through the institutionId). This session contains the creation time, the last access time and a state indicator (active or inactive).

The institution needs to represent its students. As this is a gamification model, they are abstracted as players, and so they have a playerId that identifies them to the institution, a displayName that is the name to show on the leaderboard, a full name, and a representation of their experience info with level, points acquired and points needed to level up.


Fig. 2. Class diagram of the data model of Odin. The entities and their attributes are the following:

Institution: institutionId : String; name : String; password : String; token : String
Session: institutionId : String; sessionId : String; createTime : Date; lastAccessedTime : Date; active : boolean
Player: playerId : String; displayName : String; avatarImageUrl : String; name : Object; experienceInfo : Object; title : String
Leaderboard: leaderboardId : String; name : String; order : String; currentVersion : int; playerLevels : Object[]
Achievement: achievementId : String; currentSteps : int; achievementState : String; lastUpdatedTimestamp : long; experiencePoints : long
Quest: questId : String; name : String; description : String; state : String; start : Date; end : Date
Score: player : Player; leaderboard : Leaderboard; scoreValue : float; rank : Object; timeSpan : timeSpan; writeTimestamp : long; scoreTag : String

As the player progresses in the game, (s)he will possibly win achievements. An achievement has a number of required steps and a state (hidden, revealed or unlocked). When a player reveals one, (s)he receives the associated number of experience points. A player can also accept and fulfill quests. A quest is characterized by a name, a description, a state (upcoming, open, accepted, completed, failed, expired or deleted), and a start and end date.

One of the most important parts of this model is the leaderboard. It contains more than a list of sorted scores: it also contains data related to a game, such as a list of information on the levels available in the game/leaderboard. These parts are joined since a single leaderboard is required for each game, and they depend on the existence of each other. Scores, related to a leaderboard and a player, are also stored. Each score has a floating point value, a timespan (daily, weekly or all-time score), a timestamp and a rank (its position on the leaderboard).

3.4 Service API

The integration of Odin with other systems relies on REST calls to set and retrieve data. It follows the Google Web API Reference for the achievements, leaderboards, players, quests and scores resources. The only differences are that all these resource URI paths are relative to gamify/institutions/institutionId and that, whenever an authenticated player is referenced in a function, it is replaced by a sub-path of the form /players/playerId right after institutionId in the resource path URI.


The institution resource is added to the set of resources. It contains the functions shown in Table 2.

Table 2. Institutions resource API reference. URIs are relative to /gamify

Function  HTTP request
insert    POST /institutions
get       GET /institutions/institutionId

The insert function inserts the institution given in the request body. The get function retrieves the institution resource given its id.
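For illustration, the following Perl sketch shows how a client could call the insert function and then submit a score following the path convention described above. The host name, identifiers, JSON fields and token are hypothetical, and the MathGamify client described in Section 4 actually uses the Jersey client in Java:

  use strict;
  use warnings;
  use HTTP::Tiny;
  use JSON::PP qw(encode_json decode_json);

  my $base  = 'http://odin.example.org/gamify';          # hypothetical host
  my $token = 'ACCESS-TOKEN-FROM-THE-AUTHORIZATION-SERVER';
  my $http  = HTTP::Tiny->new(default_headers => {
      'Authorization' => "Bearer $token",
      'Content-Type'  => 'application/json',
  });

  # insert: POST /institutions (institution given in the request body)
  my $res = $http->post("$base/institutions",
      { content => encode_json({ institutionId => 'example-school', name => 'Example School' }) });

  # score submission for a player, with paths relative to
  # /gamify/institutions/institutionId and the player given as a sub-path
  $res = $http->post(
      "$base/institutions/example-school/players/player42/leaderboards/mathgamify/scores",
      { content => encode_json({ scoreValue => 42 }) });

  print decode_json($res->{content})->{scoreValue}, "\n" if $res->{success};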

4 Evaluation

For the validation of the gamification service described in the previous section, a simple multiplication game was created. This game – MathGamify – can be used by primary school children to learn multiplication tables. MathGamify generates two random numbers, the first between 1 and the current game level and the second between 1 and 10. Then the student/player has the opportunity to answer with the product of the two numbers. The score is accumulated in the ratio of the player's level until the player misses, in which case the score is reset to zero.

MathGamify acts as a tool provider for a Learning Management System (LMS). The integration of MathGamify with the LMS relies on the Learning Tools Interoperability (LTI) specification. When the LMS launches MathGamify, the LTI parameters are sent as part of the HTTP POST request. On receiving the request, MathGamify uses the LTI Wrapper [4] package to process the LTI communication and extract the user id, name and level. The last is a custom parameter defined in the external tool configuration of the LMS.

MathGamify consumes two types of resources from Odin: score submission and listing of scores. Once the player answers a question, MathGamify communicates the score to Odin, using the Jersey Client to issue the REST call, and the grade to the LMS using LTI. This grade is a value between 0 and 1, calculated in the following way: if there is a custom parameter custom_max_score, then the grade is the score divided by custom_max_score; otherwise it is the number of correct answers divided by the total number of tries. When MathGamify initializes its GUI, and every time a score is submitted, the score listing is updated with the data returned from Odin.

One of the key components is the LTI Wrapper, which implements both sides of the LTI communication. This component receives LTI requests from the LMS and issues LTI requests to the LMS. The GUI component of MathGamify was developed using the Google Web Toolkit (GWT), an open-source Java software development framework that allows fast development of AJAX applications in Java. The GWT code is organised in two main packages, the server and the client.


The server package includes all the service implementations triggered by the user interface. These implementations are responsible for (1) the logic of the game, (2) the communication with Odin and (3) the communication with the LMS through the LTI wrapper. The selected LMS was Moodle 2.8.

The implementation of MathGamify demonstrates the efficacy of the proposed approach in coping with the extra requirements of a serious game integrated in a typical e-learning ecosystem, where authentication is provided by an LMS. To complement its validation, Odin was also tested regarding its efficiency. The latency of the Odin service was tested on two of its functions: (1) submitting a single score and (2) listing all scores in a leaderboard. Each test consisted of 1000 samples of calls to the same function, and all numbers stated below are averages per sample. Initially the tests were run locally, on the same machine as the Odin server, using the Grizzly Test Container provided by Jersey, so there was no network latency. The average time for (2) was around 40 ms (the leaderboard had 6 scores when the test was run); in the worst case it took 461 ms. Test (1) took an average of 22 ms, and the worst case took 385 ms. The same tests were then repeated on an external server. During these tests an average network latency of 23 ms was observed. In this setting, test (1) consumed an average of 67 ms, and the average time for (2) was 587 ms (the leaderboard had 1000 scores).

The tool used to measure the time spent was ContiPerf, a lightweight testing utility that allows the user to easily turn JUnit 4 test cases into performance tests. It is based on annotations, like JUnit 4's test configuration.

5 Conclusions

Game concepts and mechanics are a useful way to engage students in e-learning activities. These kinds of features are already provided by game backend services that leverage their authentication services and massive user bases. However, gamification services that rely on external authentication are not adequate for e-learning systems that already operate in a single sign-on ecosystem.

Odin is a gamification service developed for the requirements of e-learning systems. It was designed to authenticate clients rather than end-users and thus can be integrated with the e-learning systems typically found in educational institutions. The MathGamify system is a proof of concept that illustrates how serious games acting as tool providers for an LMS interact with the services of Odin.

The authors plan to integrate Odin in a learning environment for solving programming exercises. Odin itself will be subject to improvements. The current version provides web services for exposing the gamification service to clients. The next version will also provide a web interface to register institutions and allow them to manage their resources.


Acknowledgments. Project "NORTE-07-0124-FEDER-000059" is financed by the North Portugal Regional Operational Programme (ON.2 O Novo Norte), under the National Strategic Reference Framework (NSRF), through the European Regional Development Fund (ERDF), and by national funds, through the Portuguese funding agency, Fundação para a Ciência e a Tecnologia (FCT).

References

1. Burguillo, J.C.: Using game theory and competition-based learning to stimulate student motivation and performance. Comput. Educ. 55(2), 566–575 (Sep 2010), http://dx.doi.org/10.1016/j.compedu.2010.02.018
2. Hamari, J., Koivisto, J., Sarsa, H.: Does gamification work? – a literature review of empirical studies on gamification. In: System Sciences (HICSS), 2014 47th Hawaii International Conference on, pp. 3025–3034. IEEE (2014)
3. Janssen, C.: Backend-as-a-service (BaaS). Tech. rep., Techopedia, http://www.techopedia.com/definition/29428/backend-as-a-service-baas (2014)
4. Queirós, R., Leal, J.P., Campos, J.: Sequencing educational resources with seqins. Computer Science and Information Systems 11(4), 1479–1497 (2014)
5. Seeger, M.: Key-value stores: a practical overview. Computer Science and Media, Stuttgart (2009)
6. Siddiqui, A., Khan, M., Akhtar, S.: Supply chain simulator: A scenario-based educational tool to enhance student learning. Comput. Educ. 51(1), 252–261 (Aug 2008), http://dx.doi.org/10.1016/j.compedu.2007.05.008
7. Vansteenkiste, M., Deci, E.L.: Competitively contingent rewards and intrinsic motivation: Can losers remain motivated? Motivation and Emotion 27, 273–299 (2003), http://dx.doi.org/10.1023/A:1026259005264
8. Zackariasson, P., Wilson, T.: The Video Game Industry: Formation, Present State, and Future. Taylor & Francis (2012), http://books.google.pt/books?id=lgiQNdc-DOwC

SplineAPI: A REST API for NLP services

Nuno Vieira, Alberto Simões, and Nuno Ramos Carvalho

Centro Algoritmi — Universidade do Minho — Portugal [email protected], [email protected], [email protected]

Abstract. Modern applications often use Natural Language Processing (NLP) techniques and algorithms to provide sets of rich features. Researchers who come up with these algorithms often implement them for case studies, evaluation or as proofs of concept. These implementations are, in most cases, freely available for download and use. Nevertheless, these implementations do not comprise final software packages, with extensive installation instructions and detailed usage guides. Most lack a proper installation mechanism and library dependency tracking. The programming interfaces are usually limited to usage through the command line, or support just a few programming languages. To overcome these shortcomings, this work aims to develop a new web platform that makes a set of common operations available to third-party applications, which can use it to quickly access NLP-based processes. Of course, this platform still relies on the same tools mentioned before as the base support for specific requests. Nevertheless, the end user will not need to install them or learn their specific Application Programming Interfaces (API). For this to be possible, the architectural solution is to implement a RESTful API that hides all the tool details behind a simple API that is common or, at least, coherent across the different tools.

Keywords: Natural Language Processing, REST API, web service, DSL

1 Introduction

Natural Language Processing (NLP) techniques are being used in very different types of applications. Some companies mine social communities to find out what their customers think about their products or services [3]. Others make their information available in different languages by using machine translation techniques [9]. Newspapers and other news agencies use NLP techniques to summarise news and cluster them by specific areas, or based on their similarities [5].

Any one of these applications requires a stack of NLP tools to work. This stack can be very different from application to application, but might include common tasks like: language identification, text segmentation, sentence tokenization, part-of-speech tagging, dependency parsing, probabilistic translation, dictionary querying, or named entity detection, just to mention some [6].


Although there are some NLP toolkits that include a good number of tools for most of these tasks [4, 1], developers are likely to need other tools that are not directly available. This leads to the installation of different tools. If the developers need to support a wide range of languages, this list of tools is prone to grow, as some tools are not language independent, or because they do not include training data for some of the required languages.

These requirements lead to the need of installing a variety of tools to have a complete NLP stack. Unfortunately, most of these installations are not as simple as they should be, as most of their developers are more interested in using the tools and adding new features than in documenting their usage and installation, or providing good installation procedures. This leads to the need of dealing with different kinds of installation problems, and of learning each tool's application programming interface (API).

Although our NLP team is small, we have been dealing with this problem for some time, and therefore we propose a tool and a service to hide all these details from the end-user, making these libraries available as web services based on the REST philosophy. Of course, if the web services are themselves using those same tools, someone will need to deal with the installation procedure and will need to learn their usage. But this process is done only once, and once the installed tools are available as a simple web service, application development is faster and application deployment gets easier.

As a side benefit, having a different server running some tools helps with distribution. Even if at the moment we have the system working on a single server, it is simple to distribute the tools between different machines. Nevertheless, the process of making these tools available through a web service is not straightforward, as one needs to deal with time-consuming processes that cannot easily be served using a single HTTP request, given timeouts, as well as problems of service abuse, load distribution, and others.

In this paper we present SplineAPI, which is both a service, which we are making available for free, and a platform, for anyone to replicate this kind of service on their own servers. Section 2 compares our proposal with other services already available on the Web. Section 3 presents our design goals as well as the SplineAPI architecture and implementation. Section 4 concludes with future work.

2 Related Work

The idea of making APIs available through web services is not new. There are several platforms that make NLP processes available online, each with its own characteristics and targeting different kinds of users. They range from simple tools that allow a single kind of task to be performed, to fully featured sites with a diverse set of functionalities.

In this section we compare our main goals with some of the tools already available. We focused mainly on tools that have more similarities with our approach; therefore, we looked mainly at tools that include more than one kind of task and target more than one type of user, and we then considered their popularity.


The main differences between the analysed platforms and our goals are:

– some of the platforms are not NLP specific, like Mashape. They just work as a proxy that hides some of the web service requirements (such as user authentication and quota management). Nevertheless, there is no information about how the real service is implemented, nor whether its architecture is generic enough to be configured for other requirements;
– other platforms, like Text-Processing, although they allow different types of services, base all of them on one single tool (in this case, NLTK). Again, no information is given on the system implementation and on how it can be adapted to other tools and, in particular, to functionalities not available in NLTK;
– and finally, mono-application services. Some are available together in a similar place, like CORE API by TextAlytics, but there is no integration or homogeneity between the different offered services.

During the development of SplineAPI our main goal is to have an extensible system, to be used by anyone interested in offering web services, that can be easily configured and monitored.

3 Design Goals and Architecture Details

The main goal is to create a solution that minimizes the challenges developers face when trying to take advantage of the large set of NLP tools already available.

In today's connected world, applications are no longer running only on the client machine, nor are they running only server-side. They are distributed among the client machine, the server machine and other machines that might help in the process. Therefore, our goal is to help the conversion of NLP tools into web services. Although the tool installation may be a challenge that the administrator of these services needs to deal with, we intend to make the API construction easy, resorting to a set of Domain Specific Languages (DSL).

With the idea of creating a web API, it was necessary to decide on the best architecture to implement it. The easiest and cleanest method to make all the NLP tools available is to build a web service. Inside the web service world there are various architectural options, depending on how we want to provide the service. The most popular are the Simple Object Access Protocol (SOAP) and Representational State Transfer (REST), each one with its own advantages and disadvantages depending on the objective in mind. For SplineAPI, the obvious choice was REST [2, 8].

REST is more and more popular, and the main benefit it offers is the optimization for stateless interactions which, in this case, is an essential feature, because the platform handles specific requests and responses based on text data, which do not require any connection state.


To the users, REST is the simplest way to query a service because it is less verbose and easier to understand, as it bases its interaction with the clients on well-known HTTP commands. With the platform's architecture decided, it was then fundamental to investigate the best way of developing all the connections between the tools and the service, and the software technologies needed to make everything work.

3.1 Spline Architecture

Figure 1 shows our solution's architecture. The server is composed of three main components: the Spline REST server, the NLP tools and their interface definitions, and a quota database.

Fig. 1. Spline architecture (clients connect over the Internet to the Spline REST server, which uses the NLP tools, their definition files, and the quota database).

NLP Tools and Definition Files Different NLP tools communicate in different ways with the user. Some tools are command-line applications that read information from a file, or from the standard input, and produce results in another file, or on the standard output. Others are library-based, meaning that they expose an API that can be used to process information and obtain the desired output. In order to deal with these different kinds of tools, each tool interface is described in an XML file. This XML file is processed and a Perl module is created. This Perl module is responsible for the interaction with the Spline REST server, as detailed in Section 3.2.


Listing 1.1. XML example for the Tokenization Service based on the FreeLing Perl library. The definition specifies:

  Name:        FreeLing Tokenizer
  Route:       tokenizer
  Description: Process of breaking a stream of text up into tokens.
  Cost:        1
  Parameter:   text ("The text to be tokenized", required)
  Uses:        FL3 'pt'

  Code:
    my $pt_tok = Lingua::FreeLing3::Tokenizer->new("pt");
    my $tokens = $pt_tok->tokenize($text, to_text => 1);
    return $tokens;

  Test (input "I will be tokenized."):
    ok($result->[0] eq 'I', "Test the first word");
    ok((scalar @{$result}) == 5, "Test the result length");

The XML structure follows an XML Schema that allows the validation of the XML file. The schema also defines the domain of specific elements and attributes, which allows easy verification of the XML semantics. Listing 1.1 presents an example of an XML definition file. It describes the interface for a tokenization service based on the FreeLing [7] library. The XML file is composed of three main parts:

– The meta-data for the service, which includes its name, the back-end tool and the service route (basically, the path used for the service URL). It also includes a brief explanation of the service goals, the service usage cost (if applicable), as well as which parameters should be used to request an operation. Each parameter is described in terms of its name, whether it is required, and its default value, together with a brief explanation of its meaning. When adding new services, we are aware that further options will be needed. Namely, some services might work by uploading text files, and in those cases a special parameter type will be needed to differentiate file parameters from standard text ones.


– A description of how the parameters supplied by the users will be used to compute a result. At the moment this is done using Perl code or Bash commands. In the first case, there are two sections, one describing the Perl packages that need to be loaded, and another with the code that is executed. For Bash commands, only the executed-code section should be used. Again, we are aware that different tools will place different demands on our generator, and therefore this section of the XML definition file might need further options in the future.
– Finally, the file includes a set of tests that allow the service programmer or the server administrator to check whether all services are working properly. These tests include an input for the service and a set of assertions over the obtained output. At the moment these tests are written directly in Perl, but we have been working on incorporating a JSON querying language such as JsonPath1 or JSONiq2.

The structure of the Perl module generated from these XML definition files is presented later, in Section 3.2.
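
The schema validation mentioned above can be done with standard tooling before any module is generated; the following is a minimal sketch using XML::LibXML (the file names are illustrative, not the actual Spline files):

use XML::LibXML;

# Parse a service definition and check it against the XML Schema that
# constrains its elements and attributes.
my $doc    = XML::LibXML->load_xml(location => 'tokenizer.xml');
my $schema = XML::LibXML::Schema->new(location => 'spline-service.xsd');

eval { $schema->validate($doc) };   # validate() dies on schema violations
die "Invalid service definition: $@" if $@;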

Quota database Although our service is designed to be stateless, meaning that no connection state is kept between requests, we want to record information on service usage, in order to track users and the most used services and, if possible, to distribute different services among different servers, so that highly used services are hosted on different hardware. On the one hand, each service defines how much a request to it costs. This cost can be a constant or can be defined according to the amount of data to be processed. On the other hand, each client has an amount of quota to be used, based on a cost limit. This quota can differ according to the status of the client or, possibly, according to a paid plan. Of course, there is also the possibility to turn off quota management completely. To make this possible, a coin strategy was created. Each user has a limited daily amount of coins that he can spend freely. The functionalities differ in their processing time, but all have a text-based parameter that can be small or big and, based on that, we stipulated a panoply of cost indicators that vary with the length of the text and the functionality itself. For this to work, it was fundamental to create a stateless authentication process to identify and manage all the users and their requests.
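
A minimal sketch of such a quota check, assuming a simple relational table (the table and column names are illustrative, not the actual Spline schema):

use DBI;

my $dbh = DBI->connect('dbi:SQLite:dbname=quota.db', '', '', { RaiseError => 1 });

# Deduct the cost of a request from the user's daily coins; refuse the
# request when the remaining quota is insufficient.
sub charge_request {
    my ($api_token, $cost) = @_;

    my ($left) = $dbh->selectrow_array(
        'SELECT coins_left FROM quota WHERE api_token = ?', undef, $api_token);
    return 0 if !defined $left || $left < $cost;

    $dbh->do('UPDATE quota SET coins_left = coins_left - ? WHERE api_token = ?',
             undef, $cost, $api_token);
    return 1;
}

# A cost that grows with the size of the text parameter, for example:
sub text_cost {
    my ($base_cost, $text) = @_;
    return $base_cost + int(length($text) / 1000);   # one extra coin per ~1000 characters
}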

Spline REST server Considering that Perl is a programming language well suited to processing textual data, with a large set of interfaces to other programming languages, it was the language chosen for the back-end server implementation.

1 An XPath-like language for JSON, available from: http://goessner.net/articles/JsonPath/ (Last visited: 15-04-2015). 2 A very complete and expressive query language for JSON, available from: http://www.jsoniq.org/ (Last visited: 15-04-2015).


The server is implemented in Perl, using the Dancer2 Web Framework [10]. The interaction with the NLP tools is done using Perl modules generated automatically from the already mentioned XML definition files. These modules are loaded automatically by the server, making all services available. The server is responsible for querying the quota database and updating it according to the user requests. When accessed using the standard HTTP protocol, it presents common web pages documenting the services that are available (according to the loaded modules) and their interfaces. This strategy allows the easy creation of new services, just by creating an XML definition file, converting it into a Perl module (and, in some cases, doing some editing of the generated module) and restarting the web server. The new module will be loaded, and its description and documentation will be made available on the website automatically.
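
As a rough illustration of how the generated modules plug into such a server, the sketch below shows a single Dancer2 route dispatching requests to a generated module (this is not the actual Spline code; the service registry and the reliance on main_function are assumptions based on Listing 1.2):

use Dancer2;

# Hypothetical registry mapping routes to generated modules, assumed to be
# loaded at start-up; the real server builds this list automatically.
my %services = ( tokenizer => 'Spline::FreeLing::Tokenizer' );

post '/:service' => sub {
    my $name   = route_parameters->get('service');
    my $module = $services{$name} or send_error("Unknown service", 404);

    my $params = body_parameters->as_hashref;

    # Delegate to the generated module, which returns a JSON string; the
    # real server would also run param_function and check the quota first.
    content_type 'application/json';
    return $module->can('main_function')->($params);
};

start;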

3.2 Perl Module Generation

As already mentioned, the XML definition file is processed and "compiled" into a Perl module. The Perl module includes information about the service itself (namely, the meta section of the XML definition file) and a set of methods that are used both for configuring the service and for performing the operations required to provide the service. The module generation is template-based. The meta-information is converted into an associative array (a hash, in Perl terminology), and the Perl code is embedded in a subroutine. The generated Perl module can be edited manually, to perform any special tweaks or improvements that might be necessary. Listing 1.2 shows the relevant portions of the generated Perl module. Each module should implement a programming interface (called a Role, in the Perl world), making available functions to access some of the needed data. Some of these functions have default behaviour and, as such, the code generator creates stub functions that can then be edited by the user. This means that the XML description can be used just to bootstrap the module. The Perl module should also include a main function that receives the request as a dictionary and returns an answer as a Perl structure. This structure will then be converted into JSON and sent to the client. In the Perl community, a module is usually shipped together with a set of tests. Therefore, the test information available in the XML definition file is used to generate such tests, like the one presented in Listing 1.3. These tests can be used both for testing the Perl module locally and for testing the production service (in order to guarantee that all the services are running correctly).
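
A minimal sketch of this generation step, assuming Template Toolkit (the template name, variable names and output path are illustrative, not the actual Spline generator):

use Template;

my %index_info = ( hash_token => 'tokenizer', cost => 1 );   # meta section parsed from the XML
my $main_code  = "return \$tokens;";                         # code section parsed from the XML

my $tt = Template->new({ INCLUDE_PATH => 'templates' });

$tt->process(
    'module.tt',
    {
        package    => 'Spline::FreeLing::Tokenizer',
        index_info => \%index_info,
        main_code  => $main_code,
    },
    'lib/Spline/FreeLing/Tokenizer.pm',
) or die $tt->error;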


Listing 1.2. Module generated by the XML example.

package Spline::FreeLing::Tokenizer;

use FL3 'pt';

my %index_info = (
    hash_token  => 'tokenizer',
    parameters  => {
        api_token => {
            description => "The token to identify the user",
            required    => 1,
        },
        text => {
            description => "The text to be tokenized",
            required    => 1,
        },
    },
    description => "Process of breaking a stream of text up into tokens.",
    cost        => 1,
);

sub get_token { return $index_info{hash_token} }

sub get_info { return \%index_info }

sub cost_function { return $index_info{cost} }

sub param_function {
    # return 0 or 1 depending on the validation of the request
    return 1;
}

sub main_function {
    my ($input_params) = @_;
    my $tokens = freeling_tokenizer($input_params);
    return encode_json $tokens;
}

sub freeling_tokenizer {
    my ($input_params) = @_;
    my $text = $input_params->{text};
    return unless $text;

    my $pt_tok = Lingua::FreeLing3::Tokenizer->new("pt");
    my $tokens = $pt_tok->tokenize($text, to_text => 1);
    return $tokens;
}

1;


Listing 1.3. Tests generated by the XML example.

use strict;
use warnings;
use HTTP::Tiny;
use JSON;

use Test::More tests => 2;

my $host = $ENV{SPLINE_HOST} || 'localhost';
my $port = $ENV{SPLINE_PORT} || 8080;

my %params = ();
$params{api_token} = 'a_token';
$params{text}      = 'I will be tokenized.';

my $got = HTTP::Tiny->new->post_form(
    "http://" . $host . ":" . $port . "/tokenizer", \%params);
my $result = decode_json($got->{content});

ok($result->[0] eq 'I', "Test the first word");
ok((scalar @{$result}) == 5, "Test the result length");

4 Conclusions

In this document we present the architecture of a module-based server for REST services. The motivation for its development is the need to make NLP-related operations easily available, without all the problems that their usual configuration and installation entail. Although the whole framework is ready and some services are already available (http://spline.di-um.org/), we are aware that different tools will pose different problems to manage. In fact, we are already aware of some of the challenges we will face:

– Some tools need to receive whole files (for example, XML files) that are not practical to send as a standard parameter. This means the system should be able to deal with multipart POST encoding. This will be transparent to the user, as most programming languages make this kind of request easy to perform, and the Dancer2 framework makes it simple to manage the required multiple file uploads.
– Some other tools take too long to complete their jobs. This is a problem because of the typical HTTP timeouts, and because it is not practical to keep connections open for long periods of time. With that in mind, our approach will be based on a service worker that acts on a job queue. When a lengthy process is requested, the server will add the job to the queue.


At the same time, it will answer the client with a temporary URI where the service results will be placed. That URI is automatically created with a JSON file stating that the process is queued, being processed, or complete. The client can then poll the server, knowing at any time the status of its request. When the job is complete, the URI will be updated with information about where the results are available (be it another JSON file or any other kind of resulting file). To carry out this task, it will be necessary to create a daemon that manages the queue in temporal order and executes, in the background, each one of its elements. Each item on that queue is basically a DSL entry that indicates the instructions the daemon will run; afterwards, the daemon updates the JSON file with the result information. Beyond these development challenges that we intend to address in Spline, we will face other problems as soon as the server starts to be widely used, namely computational load and server balancing.
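
A minimal sketch of the planned enqueue-and-poll flow (the file layout and field names are illustrative assumptions, not an implemented design):

use JSON;

# Enqueue a lengthy job and immediately return a status URI to the client.
sub enqueue_job {
    my ($service, $params) = @_;

    my $job_id = time() . '-' . int(rand(1_000_000));
    my $status = "status/$job_id.json";

    # Initial status file the client will poll.
    open my $fh, '>', $status or die $!;
    print {$fh} encode_json({ status => 'queued' });
    close $fh;

    # Job description the background daemon will pick up and execute.
    open my $queue, '>>', 'jobs.queue' or die $!;
    print {$queue} encode_json({ id => $job_id, service => $service, params => $params }), "\n";
    close $queue;

    return encode_json({ status_uri => "/$status" });
}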

Acknowledgements: This work has been partly supported by FCT - Fundação para a Ciência e a Tecnologia within the Project Scope UID/CEC/00319/2013.

References

1. Cunningham, H., Maynard, D., Bontcheva, K.: Text Processing with GATE. Gateway Press CA (2011)
2. Fielding, R.T.: Representational State Transfer (REST). Ph.D. thesis, University of California, Irvine (2000), https://www.ics.uci.edu/~fielding/pubs/dissertation/fielding_dissertation.pdf
3. Liu, B.: Sentiment Analysis: Mining Opinions, Sentiments, and Emotions. Cambridge University Press (2015)
4. Loper, E., Bird, S.: NLTK: the Natural Language Toolkit. In: Proceedings of the ACL-02 Workshop on Effective Tools and Methodologies for Teaching Natural Language Processing and Computational Linguistics - Volume 1, pp. 63–70. ETMTNLP '02, Association for Computational Linguistics (2002)
5. Mani, I., Maybury, M.T.: Advances in Automatic Text Summarization, vol. 293. MIT Press (1999)
6. Martin, J., Jurafsky, D.: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, 2nd Edition. Prentice Hall (2009)
7. Padró, L.: Analizadores multilingües en FreeLing. Linguamática 3(2), 13–20 (December 2011)
8. Pautasso, C., Zimmermann, O., Leymann, F.: RESTful web services vs. "big" web services: making the right architectural decision. In: Proceedings of the 17th International Conference on World Wide Web, pp. 805–814. ACM (2008)
9. Rychtyckyj, N.: Machine translation for manufacturing: a case study at Ford Motor Company. In: Proceedings of the 18th Conference on Innovative Applications of Artificial Intelligence - Volume 2, pp. 1728–1735. IAAI'06, AAAI Press (2006), http://dl.acm.org/citation.cfm?id=1597122.1597130
10. Sukrieh, A.: Dancer2::Manual - A gentle introduction to Dancer2 (2013), http://search.cpan.org/~sukria/Dancer2-0.10/lib/Dancer2/Manual.pod

Engaging researchers in data management with LabTablet, an electronic laboratory notebook

Ricardo Carvalho Amorim1, João Aguiar Castro1, João Rocha da Silva1, and Cristina Ribeiro2

1 Faculdade de Engenharia da Universidade do Porto/INESC TEC
{[email protected], [email protected], [email protected]}
2 DEI—Faculdade de Engenharia da Universidade do Porto/INESC TEC
[email protected]

Abstract. Dealing with research data management is a complex task, and recent mandates prompt researchers to actively participate in this activity. Emergent research data platforms are proposing workflows to motivate researchers to take an active role in the management of their data. Other tools, such as electronic laboratory notebooks, can be embedded in the laboratory environment to ease the collection of valuable data and metadata as soon as it is available. This paper reports an extension of the previously developed LabTablet application to gather data and metadata from different research domains. Along with this extension, we present a case study from the social sciences, concerning the identification of the data description requirements for one of its domains. We argue that the LabTablet can be crucial to engage researchers in data organization and description. After starting the process, researchers can then manage their data in Dendro, a staging platform with stronger, collaborative management capabilities, and become able to export their annotated datasets to selected research data repositories.

1 Introduction

With increasing amounts of research data being produced every year [3], institutions are implementing guidelines and workflows to preserve them, in a similar way to what is already current practice with publications [5]. Nevertheless, this approach can pose some barriers to the dissemination and reuse of such datasets, as a consequence of the lack of metadata [9] that is essential for other researchers to understand the origin and production context of a specific dataset. Likewise, gathering domain-level metadata at the deposit stage can be a very demanding (and time-consuming) task for curators, who are sometimes responsible for more than one research domain. Researchers play a key role in the description of their data [6], as they have the best knowledge of their production environment, and can add metadata to their data from the early stages. Existing platforms for research data management, such as Figshare3 or Zenodo4, already support simple descriptive metadata, but the barrier between them and the researchers' working environment is still high.

3 http://figshare.com/ 4 http://zenodo.org/


It is therefore recognized that important data and metadata are still temporarily stored in fragile locations such as personal computers and laboratory notebooks [8]. Ultimately, even with guidelines for data management in place, some of these resources never reach the deposit stage. In this paper, we present LabTablet as an application to help researchers gather data and metadata during experimental runs or field trips, and directly export them to a staging repository—in our case Dendro [4]—responsible for creating a collaborative, description-oriented approach to research data management. With this approach, we can handle research data better and provide the conditions for capturing metadata as soon as it becomes available. At the end of a research project, Dendro is capable of creating and exporting the dataset package to existing platforms for data preservation, which can also take advantage of the included metadata to improve the dataset's visibility.

2 Research data management

In the course of their research activities, researchers produce both raw and processed data that support their conclusions towards the project goals. These resources are sometimes neglected after the publication of the results, removing the link between project results and the data that supported them. Managing research data has evolved to include tasks besides storage and bit preservation, ensuring a proper handling of research outputs to facilitate their retrieval and long-term preservation. Furthermore, as with research publications, the deposit of research assets in repositories has to be accompanied by a comprehensive description—metadata—to facilitate their retrieval and interpretation. Ideally, when a dataset is provided with sufficient metadata, others will be able to reuse it [9]. An important result is the credit that data producers get from the publication, and data reuse can also reduce costs inherent to the research activity.

2.1 Data description

Datasets and publications have different requirements with respect to description. Considering the different scenarios in which datasets are produced, we can identify sets of possible metadata descriptors that can be directly related to each specific research domain and, at the same time, extend the basic, high-level ones used to describe publications. For each research domain, the description possibilities vary, and thus data repositories are evolving to comply with this required flexibility [2]. Well-known metadata schemas, such as Dublin Core, have been considered fit for a broad scope of applications and allowed the emergence of protocols for exchanging metadata and enhancing the visibility of publications. The OAI-PMH5 is the best known, and is widely used to index the contents of different repositories, allowing their resources to be presented in publication search engines. Basic descriptors, such as title, description and author, can be added by a designated curator and provide the link between data and publications, but when considering the broad possibilities for description in each of the domains, this task has to include researchers. Actively involving researchers in the description of their data faces some limitations, as the platforms created for this purpose must also take into account usability requirements and offer features that meet

5 https://www.openarchives.org/pmh/


their goals as researchers, such as receiving credit for their data and sharing them with their peers.

2.2 Researchers’ engagement in data management

In the course of research activities, researchers often resort to personal computers to store collected data and to their laboratory notebooks to record any observations or context. With the increasing amounts of research data, these approaches pose some risks in terms of data preservation, as well as barriers to their dissemination and reuse. In the past few years, several platforms have emerged to integrate the research environment, with some of them being actively used by several communities [2]. These platforms aim to implement established protocols for data preservation and dissemination, while featuring easy-to-use interfaces along with collaborative environments. The assessment of several existing platforms showed that issues such as data ownership, dataset description and dissemination are already a concern, although these platforms are still regarded as a final location for dataset deposit. Staging platforms such as Dendro, on the other hand, aim at creating management tools closer to the researchers' daily routines and offer a place where they can collaboratively store and describe data. At the end of the research activity, researchers can export the resulting resources to the final repositories, which often aim at long-term preservation.

3 Electronic laboratory notebooks

We have highlighted the importance of data management repositories, both as staging environments and as research data preservation solutions. As many researchers resort to field trips or experimental runs to gather data, there is still a gap between data production and data deposit in the mentioned platforms. Electronic laboratory notebooks can fill this gap, allowing researchers to record and deposit data directly from the laboratory, while mitigating the risk of losing such records during the process [7]. Nevertheless, the existing solutions tend to focus on a particular domain or offer limited functionality, not taking advantage of some of the available sources of metadata.

3.1 LabTablet

Taking advantage of the growing popularity of handheld devices, LabTablet was developed as an electronic laboratory notebook to help researchers describe their data as soon as the project starts. Besides having an easy-to-use interface, the underlying representation for each metadata record follows established standards, ensuring a streamlined curation process before the final deposit in a repository. The first version of this project was mainly focused on gathering metadata in the field, relying on previously built application profiles, and therefore on a set of descriptors for a specific domain. In either version, LabTablet is capable of uploading each dataset to Dendro, the chosen staging platform, from which it can later be exported to any preservation solution. This approach allows curators to have standards-compliant metadata records upon deposit but, more importantly, domain-level metadata, which would otherwise be lost, is properly maintained. After a preliminary evaluation with researchers from the ecological domain [1], a new approach was developed, extending the metadata capabilities of this application and including mechanisms to also gather opportunity data (observations collected by chance while performing some other activity).


Opportunity data can be directly linked to the researchers' field trips and be enriched by the use of the tablet's built-in sensors to gather data from the available sources such as the camera, GPS or accelerometer. In addition to those, LabTablet also allows voice recordings, sketches, and collecting the track of a field trip, later exporting the results to a compliant format6. Furthermore, researchers can also import other types of data (namely spreadsheets) from their own personal computers. To take advantage of the device, and considering a wider set of research domains, additional features were also implemented, such as the capability of managing forms or surveys and filling them in directly in the application. The workflow for such a process relies on the researcher creating a model and instantiating it whenever a subject is interviewed. The same applies to other activities that require some kind of form or survey, such as routine evaluations and the observation of experimental runs. The gathered data is then exported to files that are compatible with common statistical analysis tools such as Excel or SPSS. At the end of each field trip (or whenever the researcher finds it convenient), the application can sync the collected resources with a repository where researchers are able to share them with their team or the community, and can ensure that they are stored in the appropriate location, under their institution's supervision. As with metadata records, the created package can follow any guidelines, namely the structure of a Submission Information Package (SIP), from the Open Archival Information System model7.
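
As an illustration of the kind of track export mentioned above, the sketch below serialises a list of recorded coordinates as a minimal KML LineString (this is not LabTablet code; it is written in Perl only for consistency with the other listings in these proceedings):

# Build a minimal KML document from [longitude, latitude] pairs.
sub track_to_kml {
    my (@points) = @_;
    my $coords = join ' ', map { "$_->[0],$_->[1],0" } @points;

    return join "\n",
        '<?xml version="1.0" encoding="UTF-8"?>',
        '<kml xmlns="http://www.opengis.net/kml/2.2">',
        '  <Document><Placemark>',
        '    <name>Field trip track</name>',
        "    <LineString><coordinates>$coords</coordinates></LineString>",
        '  </Placemark></Document>',
        '</kml>', '';
}

# Example: print track_to_kml([-8.61, 41.15], [-8.60, 41.16]);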

4 Social Sciences: a case study

As part of an ongoing partnership, a researcher from the social sciences domain was interviewed to assess the data management needs of this specific domain8. During this interview, a set of questions was used to evaluate aspects such as metadata needs and possible constraints on data sharing.

4.1 The social sciences domain

The interview revealed the researcher's awareness of the recent evolution of data management guidelines in this area. However, due to the nature of their data, these were never applied. Studies in this area are mainly focused on evaluating phenomena in different social groups, directly interacting with them either through field observations, structured or unstructured interviews, or content analysis. During these activities, the produced data is mainly of a qualitative nature, with a small portion of quantitative data as well. Qualitative data is, in this case, mostly related to observations or notes whose contents are fully dependent on the producer, whereas quantitative data results from surveys and questionnaires. Concerning the publication of research data, the researcher pointed out some limitations, as some projects are not expected to disclose data and some datasets are sensitive and need to be treated (anonymized) prior to their disclosure. Still, for some projects, pursuing data disclosure would benefit both parties, as they would be able to cite datasets in publications and their peers could access and reuse such data.

6 A KML-based representation (https://developers.google.com/kml/), containing a set of connected coordinates, for instance. 7 http://www.iso.org/iso/catalogue_detail.htm?csnumber=57284 8 The survey for this evaluation was based on the Data Curation Toolkit, available at http://datacurationprofiles.org/


In this regard, LabTablet proved to be capable of handling all these needs in terms of data production, as well as helping researchers identify some useful metadata that can be added. During the course of this interview, a set of basic Dublin Core elements proved to be a good fit for the description needs in this domain. Nevertheless, for a deeper data description, other schemas should also be included. As the researcher was not familiar with data description, some descriptors could be misused, capturing actual data rather than metadata.

4.2 Preparing for data description

After identifying the basic description needs and suggesting an initial profile for this purpose, we proceeded to identify other domain-level descriptors. In this field, the Data Documentation Initiative (DDI)9 proved to have a suitable set of descriptors for social sciences domains, namely10:

• Data Collection Methodology — to specify which methodology was used to collect the samples or questionnaires. This proved to be a recurrent scenario, as researchers often work with a small set of methodologies;
• Data Source — to identify the source of the collected data, including the associated project. As some of the projects could include partnerships with other data providers, this descriptor was chosen to support such a specification;
• Sample Size — to state the dimension of the sample or the number of interviewees during a field session.

The selected descriptors allow a better understanding of the dataset under consideration. Identifying the methodology will, for instance, enable other researchers to search for datasets that were obtained through a specific method, and the same happens for the other descriptors, such as the Sample Size. After this selection of descriptors, we proceeded to create the ontology for this domain. Along with the descriptors from the Data Documentation Initiative, we included high-level descriptors from the Dublin Core profile as well. This ontology can be loaded at any time into the LabTablet application and be used to describe data in this area.
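
Purely as an illustration (the labels below are shorthand for the Dublin Core and DDI elements discussed above, and the values are invented), a record built with this profile could be serialised as follows:

use JSON;

# Illustrative record only: shorthand descriptor labels and invented values.
my %record = (
    'Title'                       => 'Interviews on community practices',
    'Creator'                     => 'Social sciences research team',
    'Data Collection Methodology' => 'Semi-structured interviews',
    'Data Source'                 => 'Field sessions of the associated project',
    'Sample Size'                 => 12,
);

print to_json(\%record, { pretty => 1, canonical => 1 });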

5 Conclusions

By analyzing different research domains, we can identify many differences concerning data management practices. While some groups have data management procedures in force, most are still far from addressing the issue, mostly due to the nature of their data rather than to their motivation. The researcher from our case study recognized the added value in automatically exporting the daily produced data to a centralized location, where it could be properly handled and edited. Additionally, some specialists in the field advise against using any kind of note-taking tool during the interviews, so as not to influence the interviewee; the researcher, however, considered it very important to be able to record or transcribe the interviews in the background.

9 http://www.ddialliance.org/ 10 Not all the descriptors are depicted here


We are testing the collection of metadata during the whole research workflow with several research teams. It is already clear that devices and tools that make the process easier for researchers can make the difference between a process regarded as an extra burden and one where researchers perceive the benefits and get involved.

6 Acknowledgements

Project SIBILA - Towards Smart Interacting Blocks that Improve Learned Advice, reference NORTE-07-0124-FEDER000059, funded by the North Portugal Regional Operational Programme (ON.2–O Novo Norte), under the National Strategic Reference Framework (NSRF), through the European Regional Development Fund (ERDF), and by national funds, through the Portuguese funding agency, Fundação para a Ciência e a Tecnologia (FCT). João Rocha da Silva is also supported by research grant SFRH/BD/77092/2011, provided by the Portuguese funding agency, Fundação para a Ciência e a Tecnologia (FCT).

References

1. Ricardo Carvalho Amorim, João Aguiar Castro, João Rocha da Silva, and Cristina Ribeiro. LabTablet: semantic metadata collection on a multi-domain laboratory notebook. In Metadata and Semantics Research, pages 193–205. Springer, 2014.
2. Ricardo Carvalho Amorim, João Aguiar Castro, João Rocha da Silva, and Cristina Ribeiro. A Comparative Study of Platforms for Research Data Management: Interoperability, Metadata Capabilities and Integration Potential. In New Contributions in Information Systems and Technologies, pages 101–111. Springer International Publishing, 2015.
3. Christine L. Borgman. Advances in Information Science: The Conundrum of Sharing Research Data. Journal of the American Society for Information Science and Technology, 63(6):1059–1078, 2011.
4. João Rocha da Silva, João Aguiar Castro, Cristina Ribeiro, and João Correia Lopes. The Dendro research data management platform: Applying ontologies to long-term preservation in a collaborative environment. In Proceedings of the iPres 2014 Conference, 2014.
5. Clifford A. Lynch. Institutional repositories: essential infrastructure for scholarship in the digital age. Association for Research Libraries. Bimonthly Report no. 226, 2003.
6. Liz Lyon. Dealing with Data: Roles, Rights, Responsibilities and Relationships (2007). Consultancy Report. UKOLN, (June):1–65, 2011.
7. Jason T. Nickla and Matthew B. Boehm. Proper laboratory notebook practices: protecting your intellectual property. Journal of Neuroimmune Pharmacology, 6(1):4–9, March 2011.
8. Carol Tenopir, Suzie Allard, Kimberly Douglass, Arsev Umur Aydinoglu, Lei Wu, Eleanor Read, Maribeth Manoff, and Mike Frame. Data Sharing by Scientists: Practices and Perceptions. PLoS ONE, 6(6):e21101, 2011.
9. Craig Willis, Jane Greenberg, and Hollie White. Analysis and Synthesis of Metadata Goals. Journal of the American Society for Information Science and Technology, 63(8):1505–1520, 2012.

Exploiting Twitter for the Semantic Enrichment of Telecommunication Alarms

Hugo Gonçalo Oliveira1, João Marques1, and Luís Cortesão2

1 CISUC, Department of Informatics Engineering, University of Coimbra, Portugal
2 Portugal Telecom Inovação e Sistemas, Aveiro, Portugal
[email protected], [email protected], [email protected]

Abstract. Every day, several different alarms are triggered in a telecommunications network. Inspired by works that mine useful information from Twitter, we aim at exploiting this resource for enriching those alarms. We assume that, during the alarms, Twitter users would mention potential causes, and that network customers would tweet to complain about the quality of their service. We explored a set of alarms and tweets from the same period of time and came to the conclusion that tweets on potential causes of the alarms are hard to find. The most significant findings are that, during an alarm, there are more tweets related to rain events, as well as more tweets containing swearing, which is a possible sign of complaint.

Keywords: information extraction, event detection, social network mining, Twitter, telecommunication alarms

1 Introduction

The increasing popularity of social networks such as Twitter or Facebook has made social media a relevant part of people's lives. These networks are highly accessible and have hundreds of millions of users all over the world, who read, post, and share real-time messages at a fast pace. They have thus become relevant sources of information, also suitable for exploitation by computational tools that acquire precious knowledge, such as people's opinion on certain subjects [10], current trends, or general events [3]. Inspired by works that mine useful information from Twitter (see Section 2), our project aims at exploiting this social network for semantically enriching alarms triggered by technical problems in a telecommunications network. Ideally, Twitter would provide relevant information and contribute to a better understanding of the alarm's cause (eg. natural disaster, accident, concentration of people), thus leading to additional measures by the network managers to minimize negative consequences. It could further be used for analysing the impact of the alarms on the network customers (eg. whether they lead to complaining tweets). To this end, we have used all the alarms triggered during an entire month, for the wired and mobile networks of Portugal Telecom, one of the major telecommunication operators in Portugal. For the same period of time, we collected all the tweets we could obtain from Twitter's public streaming API, published in Portugal and written in Portuguese.


This paper reports on several experiments with the previous datasets, explored in order to gather more insights on the data and assess the suitability of Twitter for the task at hand. These preliminary experiments involved the combination of the alarm data with the Twitter data, to identify tweets posted at the same time and from the same place as an alarm, as well as a shallow analysis of the tweets' text, for further exploitation. Though we do not see this work as finished, the performed experiments have shown that we have a very challenging goal. In fact, looking for useful tweets has proved to be like looking for a needle in a haystack. This might be due to the small population of Portugal, or to the lower popularity of Twitter in our country, as compared to other countries where experiments of this kind were quite successful. This adds to the small size of the sample of tweets that we can get for free and to the long duration of some alarms. Despite all these issues, there is much to report, and we strongly believe that some of the presented results might be relevant for other researchers using Portuguese tweets for different purposes. The remainder of the paper starts with an overview of information extraction from Twitter, with a focus on event detection. Following that, we describe our dataset of alarms and tweets. After this, we present the experiments performed to investigate whether the alarms could be semantically enriched by Twitter, including a manual classification of tweets published during alarms according to their utility, a search for potentially relevant keywords, and the automatic classification of tweets according to mentioned events. We end by speculating on possible reasons for the lack of useful tweets, and discuss future directions for this research, which should move on to explore other available data sources.

2 Background and Related Work

Twitter is a microblogging social network, with ≈288M monthly active users and 500M messages (tweets) sent every day3. This overwhelming number of tweets, available every second with fresh information, made Twitter an attractive medium for research on text mining and information extraction (IE). But Twitter has specific features that make it particular and increase complexity, resulting in poor performance by traditional tools. Tweets: use informal language, with many abbreviations; ignore some grammatical rules and conventions (eg. they rarely use capital letters); are limited to 140 characters; and use hashtags (#) for additional context. So, specific natural language processing (NLP) tools were developed for IE from Twitter (eg. named entity recognizers [7]), used in tasks such as opinion mining [10] or event detection [3, 11]. TwiCal [11] extracts event calendars from Twitter, based on probabilistic latent variable models. Events are characterised by their name, date, description, and type (eg. sports, politics, meeting). TwiCal assumes that the involved entities and the event date play an important role (eg. events on the same date tend to be of the same type).

3 https://about.twitter.com/company


Other approaches to event detection cluster tweets according to their timestamp, location, hashtags and text [18, 3]. This topic has been applied to a varied range of more specific tasks, such as sub-event identification in football matches [1], crime prediction [17], mining user vacation plans and revealing medical conditions [9], estimating disease rates and alcohol sales volumes [4], or natural crisis management. On the latter, Twitter has been used for tracking forest fires [8], for reporting [13], detecting [6] and assessing the damages [2] of earthquakes, or for the early detection of tsunamis [19]. Those systems exploit the real-time nature of Twitter and the fast spreading of information it provides. They also consider space and time information for detecting natural disasters. Yet, although the publication time is obtained in a straightforward manner, some researchers report that it does not always match the real-world spread of the disaster [8]. Identifying the event location might be even more problematic because: not all tweets have explicit coordinates attached; the coordinates are not always very accurate; and tweets are sometimes posted from a different location than the disaster. This is why there is work on geo-tagging tweets with unavailable coordinates, with applications to crisis management [5]. Despite a close relation to the described works, we are not aware of Twitter mining for improving the description of alarms in a telecommunications network, only for events in general news [16]. Moreover, none of the previous approaches targets Portuguese, so their adaptation would require the development of NLP tools specifically for Portuguese tweets. We should still mention that Portuguese tweets have been exploited for topic detection [12], sentiment analysis [15], and even for hazard management, such as predicting flu incidence [14].

3 Datasets explored

Our main goal is to use Twitter to semantically enrich alarms triggered in the telecommunications network of Portugal Telecom (PT). Our hypothesis is that Twitter users would tweet about the possible causes of the alarm, which is potentially useful for network management. Alternatively, customers would complain about their quality of service (eg. slow/unavailable network). Before starting to develop, we engaged in exploratory work, in order to become more familiar with the domain of our problem and make a preliminary assessment of the suitability of Twitter for this purpose. For this, we explored two datasets: one with all the alarms triggered during a month and, for the same period, another with all the tweets written in Portuguese and published in Portugal (within API limits). This section describes both datasets, followed by information on their combination.

3.1 Description

The alarm dataset, provided by PT's Alarmistics team, has a total of 873k alarms, triggered between 30th September and 29th October 2014 by the Alarm Manager system. Each alarm is characterised by the following properties: creation time (date), archive time (date), local code (according to a 3-level hierarchy: network group, local network, station area), technology (eg. 3G, 4G, IPTV), entity and problem.


The duration of an alarm is obtained by the difference between the archive and the creation times. The last two properties were obfuscated due to privacy reasons. Moreover, we noticed that alarms in the same place and during an overlapping period of time were common. Therefore, we clustered them, such that alarms in the same cluster share the local code, entity and problem fields, and occur in overlapping time periods. After this procedure, we were left with 551,513 alarms. For the same period of time, we collected 498,896 tweets which, according to Twitter, were written in Portuguese and published in Portugal. This was done through Twitter's public Streaming API4, a service that provides a continuous stream of tweets, in real time, corresponding to a random sample of all public statuses, estimated at 1% of all the published tweets. Each tweet is characterised by the following relevant properties, among others: ID, timestamp, location, coordinates, text, language. To enable queries on the datasets, a relational database was created with a table for the alarms and another for the tweets, both populated with their data.
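
The clustering can be sketched as a simple interval merge per key (an illustrative sketch only; the actual implementation is not described here, and field names and epoch-second timestamps are assumptions):

# Merge alarms that share the same key (local code, entity, problem) and whose
# [created, archived] intervals overlap.
sub cluster_alarms {
    my (@alarms) = @_;                  # each alarm: { key, created, archived }
    my %by_key;
    push @{ $by_key{ $_->{key} } }, $_ for @alarms;

    my @clusters;
    for my $group (values %by_key) {
        my @sorted  = sort { $a->{created} <=> $b->{created} } @$group;
        my $current = { %{ shift @sorted } };
        for my $alarm (@sorted) {
            if ($alarm->{created} <= $current->{archived}) {    # overlap: extend the cluster
                $current->{archived} = $alarm->{archived}
                    if $alarm->{archived} > $current->{archived};
            } else {                                            # gap: close the cluster
                push @clusters, $current;
                $current = { %$alarm };
            }
        }
        push @clusters, $current;
    }
    return @clusters;
}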

3.2 Pairing Alarms and Tweets

In order to match tweets to potentially related alarms, each alarm was paired with tweets published at the same time and location. We observed a huge dispersion in the duration of an alarm: on average, 74 minutes, with a standard deviation of 411 minutes. An alarm may last for only a few minutes, but can also take several days. For instance, 188,082 alarms lasted for less than 5 minutes, and 353,867 for less than 10. At the other end, 5,708 alarms lasted for more than one day, and 158 for more than one week. Since Twitter users might notice the alarm cause before it is triggered, and might keep talking about it after the alarm has been archived, for each alarm we also paired tweets published between 15 minutes before the alarm creation and 60 minutes after the archive time. Since an alarm could lead to denial of service, this also accounts for the fact that affected users may only be able to use the Internet once the problem is solved. Only tweets that matched either the second (local network) or the third level (station area) of the alarm's location hierarchy were considered. These are typically the name of a city and the parish or city area. The top level (network group) is typically the name of a district and was considered to be too general. Figure 1 shows an alarm and some tweets matched this way. After this, 236,227 tweets were matched with alarms, based on their timestamp and location name, which is about 47% of all the tweets in the dataset. This is a little less surprising if we add that the 67,605 alarms from the clustered dataset with at least one matching tweet lasted, on average, for about 28 hours, with a standard deviation of 78 hours. Most of the alarms without matching tweets lasted for only a few minutes and were triggered for small locations. As relevant tweets may be published from different locations than the alarm, we also matched those published at the same time and mentioning the alarm location.

4 https://dev.twitter.com/streaming/public


Created              Archived             ...  Technology  LocalNetwork  StationArea
2014-10-10 00:31:59  2014-10-11 13:09:22  ...  TEC X       LOC Y         LOC Z

TwitterID           Timestamp            Location  Text
520662632240783361  2014-10-11 13:09:22  LOC Z     Adoro quando me ignoram (I love it when people ignore me)
520680458729029635  2014-10-10 20:50:19  LOC Z     Será que se eu ligar a televisão consigo ver? (I wonder, if I turn on the TV, will I be able to watch?)
520712340216758272  2014-10-10 22:01:09  LOC Z     Vou tomar banho ?? (I am going to take a shower ??)

Fig. 1. Alarm and tweets published from the alarm's location, while it was on. The real alarm location names and technology are not provided for confidentiality reasons.

We did not use a named entity recogniser because we are not aware of such a system that is available and trained on Portuguese tweets. Given the specificities of this kind of text, where location names and other entities are frequently uncapitalized or abbreviated, we would get a noisy analysis either way, so we chose the simplest approach for this exploration. Only 7,891 tweets were matched this way, and most of the results using this subset are not significant.
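
The time- and location-based pairing can be sketched as a query over the relational database of Section 3.1 (an illustrative sketch: the table and column names are assumptions, and the date arithmetic uses SQLite syntax):

use DBI;

my $dbh = DBI->connect('dbi:SQLite:dbname=alarms.db', '', '', { RaiseError => 1 });

# Tweets published from the alarm's local network or station area, between
# 15 minutes before creation and 60 minutes after archive time.
my $sth = $dbh->prepare(q{
    SELECT t.id, t.timestamp, t.location, t.text
      FROM tweets t
      JOIN alarms a ON t.location IN (a.local_network, a.station_area)
     WHERE a.id = ?
       AND t.timestamp BETWEEN datetime(a.created,  '-15 minutes')
                           AND datetime(a.archived, '+60 minutes')
});

for my $alarm_id (@{ $dbh->selectcol_arrayref('SELECT id FROM alarms') }) {
    $sth->execute($alarm_id);
    while (my $tweet = $sth->fetchrow_hashref) {
        # store the (alarm, tweet) pair for later analysis
    }
}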

4 Experimentation

This section reports on experiments performed to assess the suitability of Twitter for semantically enriching the telecommunication alarms. They should be seen as exploratory experiments that aim to provide useful insights for the future of this project and for other researchers willing to exploit Twitter.

4.1 Manual labelling of tweets

To get some clues about the kind of tweets we could expect during an alarm, we generated two random samples of 200 tweets matching the time of an alarm: (a) one also matching the location; (b) another mentioning the name of the location. For each sample, we manually tagged the tweets as follows: (a) mentions an occurrence that could be the cause of the alarm; (b) complains about the service quality; (c) not relevant for our task. The majority of the tweets were not relevant at all for this task. None of the samples contained a potential complaint. The first sample contained five tweets mentioning a meteorological event, such as rain, and the second contained six mentioning meteorological events or loss of electricity. Figures 2 and 3 display those tweets and their rough translations. We further confirmed that the second sample was noisy and that some location names were matched by accident. There are locations in Portugal named luz (light/electricity) and guia (drive), in Figure 3, among others, such as tomar (to take).

4.2 Keyword search

For additional insights on the kind of tweets and their possible relations to the alarms, we queried our database for tweets with specific keywords, which could be somehow related to potentially problematic events. Those included words related to the categories listed below:


esta a chover, outra vez? (it is raining, again?)
ta a trovejar bue, vou morrer aqui (there is much thunder going on, I'm going to die here)
Ta um relâmpago (there is a lightning)
Mandei um sms a minha mãe que esta no quarto ao lado a dizer que esta a chover (I sent a sms to my mother who is in next door's bedroom saying it is raining)
hoje apanhei uma chuva do crl (today I caught heavy rain)

Fig. 2. Tweets published in the same location and at the same time as an alarm, mentioning a meteorological event.

E o sol apareceu! #greenfest #figodaindia #icecream #sol FIARTIL (Feira de Artesanato do Estoril) http://t.co/M9K8xGo8TB (And the sun came out! #greenfest #figodaindia #icecream #sol FIARTIL (Feira de Artesanato do Estoril) http://t.co/M9K8xGo8TB)
O tempo está a decidir se chove ou se faz sol ???????? Sobral de Monte Agraço http://t.co/RZlMUZsayi (The weather is deciding whether it rains or it is sunny ???????? Sobral de Monte Agraço http://t.co/RZlMUZsayi)
Em Oeiras está trovoada, a vida é bela (In Oeiras there is thunder, life is beauty)
Guia até à casa da Cheila, tive lá um bocado e voltei para casa, nem deu para sair porque chuva (Drive to Cheila's, I was there for a bit and came back home, it wasn't even possible to leave because it was raining)
Eh pah! Ja e a 2da vez que falha a luz esta semana... (Sheesh! It is the second time electricity fails this week...)
Sou so eu que nao tenho luz? (Is it just me who has no electricity?)

Fig. 3. Tweets published at the same time as an alarm, mentioning both the location of the alarm and a possible cause of the alarm. Location names are in bold.

– Bad weather: chuva (rain), chover (to rain), trovoada (thunder), cheia (flood), vento (wind), ...;
– Networks: net (short for network), tv (tv), telefone (phone), telemóvel (mobile phone), meo (short for PT's network), luz (electricity), ...;
– Generic problems: problema (problem), acidente (accident), ...;
– Swearing: merda (shit), bosta (shit), ...

For each word, we compared the frequency of its mentions in tweets matching the alarms' location and time against tweets not matched with any alarm. Table 1 has a selection of those numbers, together with the results of a Z test – statistically significant at the 95% confidence level if Z > 1.96. This showed that, although the proportion of several words is slightly higher during the alarms, only the differences for the rain-related and the swearing words are statistically significant. We may thus speculate that there are more alarms when it is raining, and that people swear more during an alarm, possibly complaining about their quality of service. We cannot draw additional conclusions.
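
The reported scores are consistent with a standard pooled two-proportion z-test; the short sketch below reproduces the first value of Table 1, using the counts for 'chuva' and the totals of matched tweets (236,227) and remaining tweets (498,896 − 236,227 = 262,669):

# Pooled two-proportion z-test: compares x1/n1 against x2/n2.
sub z_score {
    my ($x1, $n1, $x2, $n2) = @_;
    my ($p1, $p2) = ($x1 / $n1, $x2 / $n2);
    my $p = ($x1 + $x2) / ($n1 + $n2);          # pooled proportion
    return ($p1 - $p2) / sqrt($p * (1 - $p) * (1 / $n1 + 1 / $n2));
}

# 'chuva': 447 of 236,227 matched tweets vs 280 of 262,669 unmatched ones;
# prints roughly 7.64, in line with the 7.6390 reported in Table 1.
printf "%.4f\n", z_score(447, 236_227, 280, 262_669);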

4.3 Case study: complaining about the network service

Although the difference was not significant, we manually labelled all the tweets published during an alarm and mentioning the word 'net', to check how many were possible alarm causes and/or complaints. Since this word denotes only the data network, we did not consider tweets matched to an alarm on the IP television (IPTV) network.


Word        # Match  Prob    # No match  Prob     Z Score  Significant @95%
chuva           447  0.19%          280  0.11%     7.6390  Yes
chover          335  0.14%          219  0.08%     6.1880  Yes
trovoada        123  0.05%           70  0.03%     4.5586  Yes
cheia           898  0.38%          934  0.36%     1.4321  No
vento            56  0.02%           49  0.12%     1.2281  No
luz             151  0.06%          150  0.06%     0.9788  Yes
meo              42  0.02%           72  0.03%    -2.2473  No
net             525  0.22%          590  0.22%    -0.1773  No
acidente         34  0.01%           23  0.01%     1.8599  Yes
problema        355  0.15%          360  0.14%     1.2328  No
tv              178  0.08%          255  0.10%    -2.6023  No
telemóvel       655  0.28%          743  0.28%    -0.3729  No
merda          3727  1.58%         3818  1.45%     3.5884  Yes
crl            2625  1.11%         2660  1.01%     3.3944  Yes
puta           1267  0.54%         1276  0.49%     2.5042  Yes
foda            656  0.28%          627  0.24%     2.7154  Yes

Table 1. Some keywords and their occurrences in tweets matched with alarms by the local name ("Match" columns) vs. their occurrences in the remaining tweets ("No match" columns).

Of the 301 tweets labelled, 132 (≈43%) were possible complaints. Figure 4 shows some of them, starting with the only two that additionally mention potential alarm causes: weather and electricity loss.

A minha net está como o tempo lá fora. Uma valente merda (My net is like the weather outside. A real shit)
Fiquei sem luz e sem net assim de repente, q cena do mal (I was without electricity and net out of a sudden, what a bad thing)
A net em minha casa decidiu não funcionar ???? (The net in my house decided not to work ????)
A net está super lenta (The net is super slow)
A net da OPERATOR X tem tado uma bela merda (OPERATOR X's net has been a real shit)
Ke nervos.. A net esteve 4 horas sem dar -.- (So nervous.. The net was 4 hours without working -.-)
@im a mermaiid queriia ter vindo mais cedo mas a net num deu a tarde toda :c (@im a mermaiid I wanted to have come earlier but the net did not work the entire afternoon :c)
a minha net é uma merda -.- (my net is shit -.-)

Fig. 4. Tweets published in the same location and at the same time as an alarm, containing the word 'net' in a complaint about the quality of the network.

To check whether complaining on Twitter is a common practice, we selected a random sample of 100 tweets that did not match any alarm and contained the word 'net'. This time, the percentage of complaints was 31%, which is still high, but significantly lower than 43%. Here, we should recall that not all Twitter users are PT customers, and their network providers might have problems at different times. The name of the network operator is hard to identify because it is rarely mentioned in the tweet – notice Table 1, which shows only 114 mentions of 'meo', a short name for PT, in the full set of tweets. In addition to this, there might be problems that affect only the personal network of a specific customer and, consequently, do not trigger any alarm.

4.4 Event classification in tweets

In a final experiment, we were inspired by some of the works referred to in Section 2 on event detection from Twitter. We used a set of text classifiers, developed in a previous project, for identifying types of events mentioned in tweets.


Twelve classifiers were used, one learned for each of the following event types: accident, celebration, ceremony, concert, exhibition, judicial, manifest, meeting, nature, political, show and sports. Each classifier had been trained with 200 Portuguese tweets, half manually labelled as positive and the other half as negative, for its event type. All the training tweets were retrieved from Twitter during January and February 2014 and manually selected to fill the 50% proportion of positive and negative examples for each event type. A single tweet could belong to one, more, or no type. Labelled tweets were imported into the Mallet toolkit5, which converts input text to features and includes several text classification algorithms out of the box. In this case, the Maximum Entropy algorithm was used, because it led to the best results in a 10-fold cross validation. For each event type, accuracy ranged from 76% (show) and 78% (celebration) to 88% (sports) and 91% (concert). Accident and nature were both at 84%. Although the classifiers are quite rudimentary – they rely only on Mallet's black box for identifying the features and learning from them – we used the results of the tweet classification to test whether any kind of event was more frequently mentioned during an alarm. The obtained results for the most relevant event types are presented in Table 2. They show that, except for the nature type, all events are mentioned in the same proportion during and not during an alarm. Events of the nature type are more frequently mentioned in tweets matched with alarms. According to a Z-test, the latter result is also statistically significant. This is in agreement with the previous experiments, where we noticed that rain-related words were used more frequently during an alarm.

Table 2. Tweets classified automatically according to mentioned event types, during and not during an alarm.

Event         Match #   Match Prob   No match #   No match Prob   Z Score    Significant @95%
accident      69        0.03%        73           0.03%            0.2963    No
celebration   332       0.14%        361          0.14%            0.2942    No
ceremony      163       0.07%        187          0.07%           -0.2918    No
concert       413       0.17%        515          0.20%           -1.7378    No
nature        332       0.14%        302          0.11%            2.5311    Yes
show          92        0.04%        117          0.04%           -0.9646    No
sports        116       0.05%        138          0.05%           -0.5369    No
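The paper does not detail the exact test; assuming a standard two-proportion Z-test over the match and no-match counts, the computation would look roughly as follows. The total tweet counts passed in the example are illustrative placeholders, since Table 2 only reports the proportions.

# Hedged sketch of a two-proportion Z-test, assuming the standard pooled
# formulation; n values below are illustrative totals, not reported numbers.
from math import sqrt

def two_proportion_z(x1, n1, x2, n2):
    """Z score for H0: p1 == p2, using the pooled proportion."""
    p1, p2 = x1 / n1, x2 / n2
    p = (x1 + x2) / (n1 + n2)                      # pooled proportion
    se = sqrt(p * (1 - p) * (1 / n1 + 1 / n2))     # standard error under H0
    return (p1 - p2) / se

# Example: 'nature' tweets during vs. not during an alarm (totals assumed).
z = two_proportion_z(332, 240_000, 302, 270_000)
print(f"z = {z:.2f}, significant at 95% = {abs(z) > 1.96}")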

5 Concluding remarks

We have reported on an exploratory work towards the utilization of Twitter for semantically enriching alarms triggered in a Portuguese telecommunications network. Preliminary experiments, presented in this paper, showed that, although not very often, users tweet about the weather and complain about the quality of their network connection. These experiments were repeated more recently with different intervals for matching tweets and alarms (e.g., different times before creation and after archiving, minimum alarm duration set to 5 seconds and maximum to 24 hours) and for a different period (March 2015).

5 http://mallet.cs.umass.edu/


Even though, due to the better weather, rain-related tweets were much less frequent in March, they were still statistically more frequent during an alarm. On the other hand, the assumptions on the complaints were not confirmed. Useful tweets are always mixed into a very noisy set, which makes it harder to identify them. This suggests that Twitter is not the most suitable means for achieving our purpose, at least in Portugal. Despite its growing popularity, several reports one can find on the Web show that Twitter has not had as much penetration in Portugal as in other countries6. Other facts that contribute to this negative result are: the limited number of tweets available through Twitter's public streaming API, which are just a sample of all the tweets, more precisely, an estimated 1%; and the long duration of the matched alarms (on average 28 hours), which adds too much noise and increases the number of irrelevant tweets matched. We should also not ignore the small population of Portugal, as compared to other countries, such as the United States of America. Future lines of work towards our goal include the exploitation of additional information sources to minimize the strong limitations of Twitter. These should include news websites and others that provide structured information on weather alerts and cultural events. Regarding Twitter, we will repeat the exploratory experiments on similar datasets for specific periods of time around known events. We will also consider the technology that triggered the alarm (mobile network or television). We should also invest some time in our classifiers, possibly by rearranging the set of event types, selecting new training datasets, and exploiting additional features beyond Mallet's. Finally, it would definitely be useful to train an additional classifier for complaining tweets. This would help PT measure the impact of their alarms on their customers' satisfaction.

Acknowledgements This work was developed in the scope of a project funded by Portugal Telecom Inovação e Sistemas, under the cooperation and innovation programme between PT and academic organisations.

References

1. Alonso, O., Shiells, K.: Timelines as summaries of popular scheduled events. In: Proc of 22nd Intl Conf on World Wide Web, Companion. pp. 1037–1044. WWW '13, WWW / ACM, Geneva, Switzerland (2013)
2. Avvenuti, M., Cresci, S., Marchetti, A., Meletti, C., Tesconi, M.: Ears (earthquake alert and report system): A real time decision support system for earthquake crisis management. In: Proc of 20th ACM SIGKDD Intl Conf on Knowledge Discovery and Data Mining. pp. 1749–1758. KDD '14, ACM, New York, NY, USA (2014)
3. Becker, H., Naaman, M., Gravano, L.: Beyond trending topics: Real-world event identification on Twitter. In: Proc of 5th Intl Conf on Weblogs and Social Media. ICWSM '11, AAAI Press (2011)

6 Although we could not find a specific study on the usage of Twitter in Portugal, our country is never listed in the top countries in terms of percentage of Twitter users. Also, in the World Map in http://www.beevolve.com/twitter-statistics/#b1 (retrieved in March 2015) Portugal had one of the lightest shades of blue, which corresponds to the countries with fewer Twitter users.


4. Culotta, A.: Lightweight methods to estimate influenza rates and alcohol sales volume from Twitter messages. Language Resources and Evaluation 47(1), 217–238 (2013)
5. Ghahremanlou, L., Sherchan, W., Thom, J.A.: Geotagging twitter messages in crisis management. The Computer Journal (2014)
6. Guy, M., Earle, P., Ostrum, C., Gruchalla, K., Horvath, S.: Integration and dissemination of citizen reported and seismically derived earthquake information via social network technologies. In: Proc of 9th Intl Conf on Advances in Intelligent Data Analysis. pp. 42–53. IDA'10, Springer (2010)
7. Li, C., Weng, J., He, Q., Yao, Y., Datta, A., Sun, A., Lee, B.S.: TwiNER: Named entity recognition in targeted Twitter stream. In: Proc of 35th Intl ACM SIGIR Conf on Research and Development in Information Retrieval. pp. 721–730. SIGIR '12, ACM, New York, NY, USA (2012)
8. Longueville, B.D., Smith, R.S., Luraschi, G.: "OMG, from here, i can see the flames!": a use case of mining location based social networks to acquire spatio-temporal data on forest fires. In: Zhou, X., Xie, X. (eds.) Proc of 2009 Intl Workshop on Location Based Social Networks (GIS-LBSN). pp. 73–80. ACM (2009)
9. Mao, H., Shuai, X., Kapadia, A.: Loose tweets: An analysis of privacy leaks on Twitter. In: Proc of 10th Annual ACM Workshop on Privacy in the Electronic Society. pp. 1–12. WPES '11, ACM (2011)
10. Pak, A., Paroubek, P.: Twitter as a corpus for sentiment analysis and opinion mining. In: Proc of 7th Intl Conf on Language Resources and Evaluation. LREC'10, ELRA, Valletta, Malta (May 2010)
11. Ritter, A., Mausam, Etzioni, O., Clark, S.: Open domain event extraction from Twitter. In: Proc of 18th ACM SIGKDD Intl Conf on Knowledge Discovery and Data Mining. pp. 1104–1112. KDD'12, ACM (2012)
12. Rosa, H., Carvalho, J.P., Batista, F.: Detecting a tweet's topic within a large number of Portuguese Twitter trends. In: Proc of 3rd Symposium on Languages, Applications and Technologies. pp. 185–199. OASICS, Schloss Dagstuhl (June 2014)
13. Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes Twitter users: Real-time event detection by social sensors. In: Proc of 19th Intl Conf on World Wide Web. pp. 851–860. WWW '10, ACM, New York, NY, USA (2010)
14. Santos, J.C., Matos, S.: Predicting flu incidence from portuguese tweets. In: Proc of Intl Work-Conference on Bioinformatics and Biomedical Engineering. pp. 11–18. IWBBIO'13, Copicentro Editorial (2013)
15. Souza, M., Vieira, R.: Sentiment analysis on twitter data for portuguese language. In: Proc of 10th Intl Conf on Computational Processing of the Portuguese Language. pp. 241–247. PROPOR'12, Springer (2012)
16. Tanev, H., Ehrmann, M., Piskorski, J., Zavarella, V.: Enhancing event descriptions through twitter mining. In: Breslin, J.G., Ellison, N.B., Shanahan, J.G., Tufekci, Z. (eds.) Proc of 6th Intl Conf on Weblogs and Social Media. ICWSM '12, AAAI Press (2012)
17. Wang, X., Gerber, M.S., Brown, D.E.: Automatic crime prediction using events extracted from Twitter posts. In: Proc of 5th Intl Conf on Social Computing, Behavioral-Cultural Modeling and Prediction. pp. 231–238. SBP'12, Springer (2012)
18. Wang, Y., Xie, L., Sundaram, H.: Social event detection with clustering and filtering. In: Working Notes Proceedings of MediaEval 2011 Workshop. vol. 807. CEUR-WS.org (2011)
19. Zielinski, A., Middleton, S.E., Tokarchuk, L.N., Wang, X.: Social media text mining and network analysis for decision support in natural crisis management. In: Proc of 10th Intl Conf on Information Systems for Crisis Response and Management. pp. 840–845. ISCRAM 2013, Karlsruher Institut fur Technologie (2013)

Meaning Inference of Abbreviations Appearing in Clinical Studies

Efthymios Chondrogiannis1, Vassiliki Andronikou1, Efstathios Karanastasis1, Theodora Varvarigou1

1 National Technical University of Athens, 9 Heroon Politechniou Str, 15773, Athens, Greece {chondrog, vandro, ekaranas}@mail.ntua.gr, [email protected]

Abstract. The number of publicly available clinical studies is constantly increasing, forming a rather promising corpus of documents for clinical research purposes. However, the abbreviations used in these studies pose a serious barrier to any text mining technique. This paper presents a study conducted in the above domain, which used specifically developed tools and mechanisms in order to process a number of randomly selected documents from clinicaltrialsregister.eu. The analysis performed indicated that abbreviations appear at a large scale without their long form (aka expansion). In order to assess an abbreviation's true meaning, it is necessary to utilize the appropriate corpus of documents, apply innovative algorithms and techniques to detect its possible expansions, and accordingly select the appropriate one. Furthermore, the discrimination power of tokens has a distinctive role in abbreviation construction, and hence it can facilitate the detection of acronym-type abbreviations. Additionally, the expressions in which abbreviations appear, as well as the preceding or following text, are of primary importance for selecting the appropriate meaning.

Keywords: Abbreviations, Expansion, Clinical Studies, Semantic Analysis, Corpus Annotation

1 Introduction

Clinical studies provide the means for bringing new chemical products to the market or collecting new evidence for existing interventions. The studies publicly available at the EU Clinical Trials Register (EUCTR) [3] and ClinicalTrials.gov (CTGV) [5] sites exceed 150 thousand, with more than 20 thousand new studies being registered every year. Due to the enormous size of the corpus of studies, which is constantly increasing, it is difficult or even impossible to manually process these documents. Consequently, the application of innovative or state-of-the-art text mining techniques is necessary. Abbreviations (ABR) comprise an important part of a clinical study. They intend to provide short forms of often long texts (aka expansions) so that authors can efficiently use them in the rest of the document.


On the other hand, detecting the expansion (EXP) which an abbreviation stands for is essential for its comprehension. In general, ABRs have one or more meanings (aka senses), each of which can be expressed with one or more EXPs. In the context of this work, EXPs that are quite similar to one another after the application of state-of-the-art string processing techniques (e.g., Upper Limit of Normal and Upper-Normal Limit) are considered identical. Consequently, there is often a one-to-one correspondence between the EXP and its sense, since the possible EXPs of each ABR are composed of totally different words. Detecting the meaning of ABRs mentioned in clinical studies is quite difficult for humans and even more so for software agents, since they should be able to cope with cases in which ABRs have been explicitly defined in the document (often following specific patterns, i.e. abbreviation syntactic cues) as well as cases in which their EXP is missing (i.e., not mentioned anywhere in the document). The work presented in this paper focuses on the results of a software-based analysis regarding the correct meaning inference of ABRs contained in clinical study documents. The document is structured as follows. Section 2 summarizes related work in the domain of ABR detection and EXP provision. Section 3 briefly describes the study approach, methodology and involved tools. Section 4 analytically presents the main study findings, while section 5 expands this discussion to secondary findings and parameters that could be taken into account and are likely to be covered in future work. Finally, section 6 provides a summary of the work's key points.

2 Related Work

Biomedical ABRs have been extensively studied so far, and various algorithms and techniques have been proposed for detecting their EXP. Text alignment approaches attempt to match ABRs with their corresponding EXPs based on the characters used, such as the algorithm proposed by Schwartz and Hearst [2]. Park and Byrd [17] have also proposed a rule-based approach for detecting ABR-EXP pairs based on the patterns they belong to. The outcome of text alignment techniques can be further improved if syntactic information is used [15]. Machine learning approaches have also been used for ABR recognition, such as the supervised machine learning used for creating an ABR dictionary from MEDLINE [14]. Statistical approaches (e.g., ADAM [16]) can also be used for abbreviation recognition purposes, on condition that the corresponding EXPs appear frequently enough. Consequently, in order to provide valuable results, they demand a large number of biomedical articles, while they also need an adequate amount of computer resources and time. In practice, they can complement acronym-type ABR recognition techniques by detecting those ABR-EXP pairs where there is no similarity among the characters being used, as for example in the MBA system presented in [9]. In order to find the meaning of ABRs when their EXP is missing, the simplest solution is to assign to every instance of an "unknown" ABR the most commonly used meaning [10]. Alternatively, for abbreviation sense disambiguation, a supervised machine learning system can be used, such as the one presented in [11], using a variety of parameters including but not limited to MeSH terms [6] and Concept Unique Identifiers [12].


3 Study Methodology and Tools

For the analysis of the ABRs used in clinical studies, 141 documents from EUCTR were randomly selected and accordingly the ABRs' meaning was specified. Specifically, in order to create a highly variable corpus of documents which adequately represents the ABRs used in clinical studies, all available documents from the EUCTR web site were downloaded by means of a developed software component and accordingly classified into categories based on the number of different ABRs that they contained. Then one or more documents from each category were randomly selected, taking into account the percentage of documents they represent and the total number of documents to be selected. In order to precisely determine the meaning of ABRs with the least human effort required, a web application was specifically developed (Fig. 1) which enables users to interact with ABRs (highlighted in green) and accordingly specify their meaning as well as the specific part(s) of the documents in which the ABRs appear with the meaning provided. In general, ABRs appear with the same sense throughout a document [1]. However, the elements which the provided meaning refers to should be precisely determined, in order to handle such exceptional cases (if any) in which an ABR has different meanings depending on the document section it is used in. A characteristic example is the Roman numeral IV, which may also denote "intravenous" administration of a chemical substance in another sentence. Additionally, in some cases, an ABR may appear in a phrase (e.g. HIV-positive or HIV-negative contain the ABR HIV), and hence it should be specified that the ABR retains its meaning.

Fig. 1. A screenshot from the application developed for ABR-EXP specification purposes.

Fig. 1 shows a screenshot of the web application developed. When an ABR is specified in the document, the application directly provides its EXP using the Schwartz and Hearst algorithm; otherwise, it enables users to search publicly available sources for the EXP. When all ABRs have been specified, the tool automatically generates an XML document which includes the provided user data and the details of the study (ID, title, URL, etc.) the data came from. Throughout the ABR definition process, the tool enables users to provide additional data for each ABR-EXP pair,


such as whether the EXP has been specified in the document or not and how confident they are in the data they provided. The confidence value is an essential parameter for correctly determining the ABRs' meaning. More precisely, all ABR-EXP pairs with a medium or low confidence value, including those ABRs whose meaning was unknown to the biomedical experts, undergo a review process in which one or more medical or clinical experts are contacted in order for their meaning to be undoubtedly determined.
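For reference, the core character-matching step of the Schwartz and Hearst algorithm [2] mentioned above can be sketched as follows. This is an illustrative reimplementation based on the published description, not the exact code used in the web application.

def find_best_long_form(short_form: str, long_form: str):
    """Sketch of the Schwartz & Hearst character-matching step: scan both
    strings from right to left, requiring every short-form character to
    appear in the long form, and the first one to start a word."""
    s = len(short_form) - 1
    l = len(long_form) - 1
    while s >= 0:
        c = short_form[s].lower()
        if not c.isalnum():          # skip punctuation in the short form
            s -= 1
            continue
        # Move left until the character matches; the first short-form
        # character must additionally start a word of the long form.
        while l >= 0 and (long_form[l].lower() != c or
                          (s == 0 and l > 0 and long_form[l - 1].isalnum())):
            l -= 1
        if l < 0:
            return None              # no valid long form found
        s -= 1
        l -= 1
    # Cut the candidate at the start of the word where matching stopped.
    start = long_form.rfind(" ", 0, l + 1) + 1
    return long_form[start:]

print(find_best_long_form("ULN", "values above the Upper Limit of Normal"))
# -> 'Upper Limit of Normal'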

4 Results Analysis and Main Findings

a) Expansion availability. The software-based analysis of the data provided indicated that there were on average 13.53 distinct ABRs in every clinical study, of which 27.25% had been specified in the document whereas the remaining 72.75% had been provided without their EXP. This analysis deliberately excluded both common Latin ABRs (e.g., i.e., etc.) and units of measurement (mg, kg, etc.), which often appear without their EXP, as well as the ABRs which appear in the title of each section (e.g., MedDRA) and hence are common to all clinical studies. Concerning specified ABRs, they often followed their EXP within parentheses or brackets (94%) or vice versa (4%), while only 2% of specified ABRs came from a different syntactic cue. However, it should be noted that in 98% of the cases where an ABR was enclosed within parentheses or brackets the EXP was to be found in the preceding text, while only in 55% of the cases where the ABR was followed by text or a phrase enclosed within parentheses or brackets did that text contain the EXP. In the case of ABRs provided without their EXP, the whole corpus of clinical studies available at both the EUCTR and CTGV sites was subsequently downloaded (exceeding a total of 200 thousand documents) and further processed by another component. This analysis indicated that 70% of the user-inferred ABR-EXP pairs had been specified in another clinical study. For the remaining 30% of user-inferred pairs, a subsequent analysis indicated that the majority of EXPs could be found in corresponding PubMed [7] articles (i.e., documents in which the same abbreviation appeared in either title or abstract). However, a small number of ABR-EXP pairs was not found, since they were clinical-trial specific (e.g., LSLV: Last Subject, Last Visit), and hence the corresponding ABRs could either not be found at all or were used in the PubMed articles with a different meaning.

b) Pairs classification. The correspondence between the ABR and EXP characters usually plays an important role in detecting the EXP of acronym-type ABRs. In order to evaluate this, the ABR-EXP pairs specified in all of the above cases were further examined and classified into categories based on the correspondence between ABR characters and EXP tokens (sequences of characters separated by one or more white spaces). The analysis indicated that they can be divided into three categories, with a few examples for each one presented in Table 1. The first category encompasses those ABR-EXP pairs in which all EXP tokens have contributed to the ABR construction. More precisely, the first character of each EXP token matches the corresponding ABR character, whereas the rest of the ABR characters, if any, also appear in the EXP tokens in the same order (examples 1–5).


The second category encompasses those pairs in which at least one EXP token does not participate (i.e., with its first character) in the ABR construction. In general, such tokens are function words, such as articles and prepositions, which enable authors to form grammatically correct human language expressions but do not actually add meaning to a phrase or sentence (examples 7 and 8). However, a considerable number of such tokens do not belong in this category (examples 9-12).

Table 1. Classification of ABR-EXP pairs, along with examples for each category.

Tightly linked:
1. CNS = Central Nervous System. The ABR consists of the first character of each EXP token.
2. CrCl = Creatinine Clearance. The ABR consists of the first two characters of each EXP token.
3. CVA = Cerebrovascular Accident. The ABR consists of the first character of each EXP token, along with one additional character from the first token.
4. TZD = Thiazolidinedione. The first ABR and EXP characters match, while the rest of the ABR characters appear in the EXP token in the same order.
5. LTP2 = Lactate Turnpoint 2. The ABR consists of the first character of each EXP token, including the number presented. The ABR character "P" also appears in the 2nd token.
6. SD1 = Study Day one. The Arabic number "1" matches the English word "one", while the rest of the ABR characters match the first character of the EXP tokens.

Loosely linked:
7. ULN = Upper Limit, of Normal. Tokens "of" (stop word) and "," (punctuation) do not contribute to the ABR construction.
8. LVLS = Last Visit of the Last Subject. Tokens "of" and "the" (stop words) do not contribute to the ABR construction.
9. PDE-5 = Phosphodiesterase type 5. Token "type" does not contribute to the abbreviation construction.
10. DSM-IV = Diagnostic and Statistical manual of Mental disorders, 4th edition. Tokens "and", "of" (stop words), "manual", "disorders" and "edition" do not contribute to the abbreviation construction.
11. ACDA = Acid Citrate Dextrose solution A. Token "solution" does not contribute to the abbreviation construction.
12. CABG = Coronary Artery Bypass Graft procedure. Token "procedure" does not contribute to the abbreviation construction.

Not / partially linked:
13. EKG = Electrocardiogram. "EKG" stands for "Elektrokardiogramm" (German).
14. SUKL = State Institute for Drug Control. "SUKL" stands for "Státní ústav pro kontrolu léčiv" (Czech).
15. DL = Dazit. "DL" stands for "Desloratadine"; "Dazit" is a trade name.
16. AZT = Zidovudine. "AZT" stands for "Azidothymidine"; "Zidovudine" is the INN.
17. MDX010 = Ipilimumab. "MDX-010" is the product code for "Ipilimumab" (the INN).
18. C15 = Blood and lymphatic diseases. "C15" is the MeSH code for "Hemic and lymphatic diseases".

The third category includes the pairs in which the ABR has partial or no similarity with its EXP from a character point of view, since it comes from another EXP rather than the one provided in the document.


For instance, the EXP provided may be in English whereas the ABR comes from another phrase (conceptually the same) expressed in another language (examples 13 and 14). Also, a chemical product has several names (i.e., chemical name, generic name(s), trade name(s)) and hence the ABR may come from an EXP other than the one provided in the document (examples 15 and 16). A special case comprises codes, which are often arbitrarily assigned to concepts when they are introduced into a coding system. Such codes have been primarily designed for referencing purposes or communication among software agents. However, they may still be found in documents, such as codes from widely used classification systems (e.g., MeSH, example 18) and especially codes assigned to chemical products before entering the market (example 17). Numbers have a distinctive role in the EXP detection process. In general (excluding codes) they appear in both the ABR and the EXP. However, they may take different forms, including Roman and Arabic numbers, ordinal numbers, percentages, as well as their corresponding phrases in the English language (example 6). In example 10, the Roman number is used in the ABR whereas the corresponding Arabic number appears in its EXP. However, it should be noted that the sequence of characters "IV" is separated from the rest of the ABR characters. The punctuation characters used in the ABR (e.g., white spaces or hyphen characters), as well as changes in character case (from upper to lower) or type (from letter to number), may point to groups of abbreviation characters that should be examined together.
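The three categories above can be operationalised with a simple alignment test. The sketch below is an illustrative check (not the authors' implementation) that decides whether an ABR can be built from the EXP tokens, first requiring every token to contribute and then allowing tokens to be dropped; number-to-word matches such as "1" vs "one" and backtracking over repeated characters are deliberately left out.

from itertools import combinations

def aligns(chars, tokens):
    """True if the ABR characters can be produced by taking the first
    character of every token plus, optionally, further characters of each
    token, always in left-to-right order (greedy on first occurrences)."""
    if not tokens:
        return not chars
    head, rest = tokens[0], tokens[1:]
    if not chars or chars[0] != head[0]:
        return False

    def consume(remaining, pos):
        if aligns(remaining, rest):            # stop using this token
            return True
        if remaining and pos < len(head):      # or absorb one more character
            j = head.find(remaining[0], pos)
            if j != -1 and consume(remaining[1:], j + 1):
                return True
        return False

    return consume(chars[1:], 1)

def classify(abbr, expansion):
    tokens = [t.lower() for t in expansion.split() if any(c.isalnum() for c in t)]
    chars = [c for c in abbr.lower() if c.isalnum()]
    if aligns(chars, tokens):
        return "tightly linked"
    for kept_size in range(len(tokens) - 1, 0, -1):   # try dropping tokens
        for kept in combinations(tokens, kept_size):
            if aligns(chars, list(kept)):
                return "loosely linked"
    return "not / partially linked"

print(classify("CVA", "Cerebrovascular Accident"))   # tightly linked
print(classify("ULN", "Upper Limit of Normal"))      # loosely linked
print(classify("EKG", "Electrocardiogram"))          # not / partially linked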

c) Tokens importance. Concerning the ABR-EXP pairs that belong to the second category, it was observed that an ABR can be "tightly" linked with its EXP if, apart from punctuation characters and function words, one or more tokens are ignored. The analysis of the ignored tokens indicated that, in general, they are words which are not as important as the other EXP tokens. The importance of each token was measured by taking into account its discrimination power across the whole corpus of documents (i.e., studies-count). More precisely, the number of studies each token appeared in was counted (i.e., studies-token-exists) and accordingly its importance was calculated, based on expression (1). The more frequently a token appears, the less important it is. In order to overcome the morphological variants of tokens, their stems were used, based on the Porter stemming algorithm [4]. Also, the importance of tokens was normalized so that it takes values in the range from 0 (not important) to 1 (very important).

Token-Importance = log10(Studies-Count / Studies-Token-Exists)    (1)

Fig. 2 presents the importance of the ignored tokens (blue) as well as the importance of the remaining EXP tokens (red). 98% of the ignored tokens have importance in the range [0.0–0.6], with their average importance being close to 0.2. On the other hand, the importance of the remaining EXP tokens covers the whole range. This, in turn, indicates that in many cases tokens that are not very important or informative participate in the construction of an ABR. However, in 77% of the cases in which an omission was necessary, the average importance of the ignored tokens was lower than the average importance of the remaining EXP tokens,


while in almost every case the most important EXP token had contributed one or more characters to the construction of the ABR.
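A minimal sketch of the importance measure in expression (1) follows, assuming a plain document-frequency count over Porter-stemmed tokens; the corpus passed to the function is an illustrative placeholder for the set of study texts.

# Sketch of expression (1): IDF-like token importance over stemmed tokens,
# normalised to [0, 1]. The input list of study texts is hypothetical.
import math
import re
from collections import Counter
from nltk.stem import PorterStemmer

stemmer = PorterStemmer()

def token_importance(corpus):
    studies_count = len(corpus)
    doc_freq = Counter()
    for study in corpus:
        stems = {stemmer.stem(t) for t in re.findall(r"[A-Za-z]+", study.lower())}
        doc_freq.update(stems)                    # studies-token-exists
    raw = {s: math.log10(studies_count / df) for s, df in doc_freq.items()}
    top = max(raw.values()) or 1.0
    return {s: value / top for s, value in raw.items()}   # normalise to [0, 1]

# Usage (study_texts is a hypothetical list of study document strings):
# importance = token_importance(study_texts)
# Frequent, uninformative tokens such as "of" or "type" end up with low scores.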

Fig. 2. Classification of the EXP tokens based on their importance.

The above analysis clearly indicates that authors, when constructing an ABR, tend to ignore the "non-important" tokens. This observation is further supported by the fact that, in 95% of the cases in which alignment is achieved after ignoring one or more tokens, the number of ABR characters was 3 or more. In other words, if the authors had not omitted such tokens, the length of the ABR would have been more than 4 or 5 characters, which in turn would have made it difficult to remember and use in the rest of the document. Another important observation is the fact that the importance of the ignored tokens is affected by the length of the ABR as well as of the corresponding EXP. More precisely, as their length increases, the average importance of the ignored tokens also increases (Fig. 3). Consequently, in order to keep the ABR length small, the authors omit words which in other cases (e.g., when constructing the ABR of a shorter phrase) would not have been ignored.

Fig. 3. The average importance of EXP tokens being ignored for each group of ABR.

d) Abbreviation expressions. The analysis of documents indicated that ABRs may appear in plural form while they may also participate in one or more expressions.


More precisely, an ABR may be used along with other words (e.g., HIV-positive) and/or prefixes (e.g., post-PTA) to form compounds, the meaning of which is affected by the non-ABR components. Based on the examined corpus of documents, 2.1 ABRs on average participate in an ABR expression. Consequently, it is essential to detect the internal ones, which may or may not have been specified in the document. In the latter case, their EXP should be sought in another source, but it is more probable to find the EXP of the internal ABR rather than that of the whole expression.

Table 2. Dominant ABR Expressions used in Clinical Studies.

Expression        Example(s)             Expression     Example(s)
ABR-positive      HIV-positive           ABR-induced    NRTI-induced
ABR-negative      CRIM-negative          ABR-specific   YMSM-specific
ABR-related       ATGL-related           ABR-score      FLIE-score, MELD-score
ABR-associated    CVC-associated         anti-ABR       anti-RET, anti-IFN
ABR-based         MR-based               post-ABR       post-PTA, post-CRT
ABR-containing    MPA-containing         pre-ABR        pre-GCRA, pre-TAVR
ABR-like          LDL-like, BMS-like     non-ABR        non-LDL, non-TCC

Table 2 summarizes the "dominant" ABR expressions detected, along with a few examples. In fact, the analysis of tokens highlighted this issue, which required analysing the whole corpus of clinical studies through a semi-automatic process. More precisely, the automatically detected patterns are presented in descending order of their frequency of appearance, while for each one of them the specific abbreviation from which it stems was recorded. The combination of two or more ABRs with the "and" and "or" operators or their corresponding symbols was excluded from the list. Also, the interpretation of the ABR expressions presented is left for future work.
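The semi-automatic pattern mining can be roughly sketched as follows, under the simplifying assumption that abbreviations are detected as all-capital tokens; the regular expressions and the input list are only illustrative.

# Illustrative sketch: count ABR expression patterns such as "X-positive",
# "anti-X" or "post-X" in a corpus of study texts (hypothetical input list).
import re
from collections import Counter

ABR = r"[A-Z][A-Z0-9]{1,9}"        # crude all-caps abbreviation pattern
SUFFIX = re.compile(rf"\b({ABR})-([a-z]+)\b")      # e.g. HIV-positive
PREFIX = re.compile(rf"\b([a-z]+)-({ABR})\b")      # e.g. anti-RET, post-PTA

def expression_patterns(texts):
    counts = Counter()
    for text in texts:
        for abr, word in SUFFIX.findall(text):
            counts[f"ABR-{word}"] += 1
        for word, abr in PREFIX.findall(text):
            counts[f"{word}-ABR"] += 1
    return counts.most_common()

print(expression_patterns(["HIV-positive subjects", "anti-RET therapy post-PTA"]))
# -> [('ABR-positive', 1), ('anti-ABR', 1), ('post-ABR', 1)]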

e) Expansion selection. In order to correctly detect the missing EXP of an ABR, the possible EXPs should be found and accordingly the appropriate one selected. As already mentioned, 70% of the ABRs inferred by the end user had been specified in another clinical study with the meaning provided. The analysis of the whole corpus of clinical studies indicates that the corresponding abbreviations are highly ambiguous, with the average number of possible EXPs being close to 7.0. However, the analysis of the EXPs provided by the end users, as well as of the possible EXPs detected, reveals that in 87% of the inferred ABR-EXP pairs the EXP is the dominant one, while in almost every case the corresponding EXP has been used in more than one document. It should be noted that the average number of EXPs is reduced to 3.1 if the ones used only in the document provided are ignored. To improve the percentage of correctly detected ABR-EXP pairs, the broader domain in which each expansion is used was further examined, taking into account the MeSH terms assigned to each clinical study. More precisely, for each EXP, the corresponding documents were collected and a graph was created, based on the Tree IDs of the MeSH terms assigned to each one as well as their hierarchy. Then, for each ABR, the Tree IDs based on the automatically detected MeSH terms (e.g., from the study title) were retrieved


and accordingly the corresponding Tree ID graphs were examined to find the best matching EXP. The analysis indicated that only 85% of the EXPs were correctly detected, a little lower than in the previous case. The terms presented in the close vicinity of each EXP were also examined. Following a similar process, the distinct tokens (i.e., their stems) that appeared in the same sentence as an ABR were gathered, ignoring punctuation characters, stop words and numbers. Then, the tokens preceding and following the ABRs without an EXP were used in the annotated corpus of documents for selecting the appropriate EXP. The EXP selected for each ABR was the one for which the sum of the importance (finding c) of the matching tokens was the maximum. The outcome of this analysis indicated that 88% of the inferred ABR-EXP pairs were correctly detected. This approach also proved successful at resolving the meaning of the ABR "AR" (a highly ambiguous ABR with up to 10 different EXPs) to "Allergic rhinitis", which no other of the aforementioned techniques could detect correctly (the dominant EXP for "AR" was "Androgen Receptor").
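The context-based selection step can be sketched as below, assuming that each candidate EXP is associated with the bag of stemmed tokens observed around it in the annotated corpus and that token importances come from expression (1); all names and values are illustrative.

# Illustrative sketch of context-based expansion selection: pick the
# candidate EXP whose recorded context shares the highest total token
# importance with the tokens surrounding the unknown ABR.
def select_expansion(context_tokens, candidates, importance):
    """context_tokens: stemmed tokens around the ABR occurrence.
    candidates: dict mapping each possible EXP to the set of stemmed tokens
    seen in its vicinity in the annotated corpus.
    importance: dict of token importances from expression (1)."""
    def score(exp_tokens):
        return sum(importance.get(t, 0.0) for t in set(context_tokens) & exp_tokens)
    return max(candidates, key=lambda exp: score(candidates[exp]))

# Toy example with made-up importances and contexts for the ABR "AR".
importance = {"nasal": 0.9, "sneez": 0.8, "prostat": 0.9, "patient": 0.1}
candidates = {
    "Allergic rhinitis": {"nasal", "sneez", "patient"},
    "Androgen Receptor": {"prostat", "patient"},
}
print(select_expansion(["nasal", "sneez", "patient"], candidates, importance))
# -> Allergic rhinitis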

5 Further Discussion and Future Steps

The ABR-EXP detection process is highly affected by errors or inconsistencies which may appear in the documents, in both the ABR and the EXP. For instance, the authors may use an ABR, even in the same document, with a different form than the one specified (e.g., use SDI instead of SD1: Study Day 1). Also, the authors may introduce across the document similar EXPs which do not accurately match each other, since they may contain one or more additional punctuation characters (e.g., an additional space or dash) or even a grammatical error. Concerning the ABRs mentioned in each document, they were classified into two broad categories: those specified and those inferred. However, in some cases an ABR is partially defined in a document, e.g. "HBO2: Hyperbaric O2", where the ABR "O2" stands for oxygen (derived from its molecular formula, not mentioned in the document). Additionally, nested abbreviation definitions should be handled carefully, e.g. "N-methyl-D-aspartate (NMDA) receptor (NMDAR)". The meaning of ABRs provided without their EXP can be adequately resolved by selecting the dominant EXP for the corresponding ABR. However, the number of correctly detected ABR-EXP pairs can be increased by also taking into account the text that precedes or follows the ABR. The suggestions made can be further improved if not only the tokens present are taken into account, but also the semantic class to which they belong, including their position relative to the abbreviation. The ABR-annotated corpus of documents, which was an outcome of this work, is available at [13]. However, our intention is for the specified ABRs to be further examined and linked with their corresponding terms from widely used nomenclatures, in order for their meaning to be precisely specified (e.g., grouping different semantically equivalent EXPs). Also, the development of a fully automated system for the analysis of the whole set of clinical studies is planned, with the aim of creating a repository of recorded EXPs for all ABRs, including the context that each one is used within.


6 Conclusion

The performed analysis of the ABR-annotated corpus of clinical studies indicated that ABRs are widely used without their EXP. Specifying the appropriate meaning for each one presupposes the analysis of a larger corpus of documents (not limited to clinical studies) for detecting the possible EXPs and accordingly selecting the appropriate one. Tokens have a significant role in the ABR-EXP detection process, since authors tend to ignore the non-important words in order to reduce the ABRs' length. Also, the expressions in which an ABR may participate, as well as the text that precedes and follows it, have a distinctive role in selecting the appropriate meaning.

Acknowledgements. This work is being supported by the OpenScienceLink project [8] and has been partially funded by the European Commission’s CIP-PSP under contract number 325101. This paper expresses the opinions of the authors and not necessarily those of the European Commission. The European Commission is not liable for any use that may be made of the information contained in this paper.

References

1. Gale, W.A., Church, K.W., Yarowsky, D.: One sense per discourse. In: Proceedings of the Workshop on Speech and Natural Language, HLT '91, pp. 233-237. New York (1992)
2. Schwartz, A.S., Hearst, M.A.: A Simple Algorithm for Identifying Abbreviation Definitions in Biomedical Text. In: Proceedings of PSB, pp. 451-462 (2003)
3. EU Clinical Trials Register, www.clinicaltrialsregister.eu
4. Porter, M.F.: An algorithm for suffix stripping. Program. 40(3), 211-218 (2006)
5. ClinicalTrials.gov, www.clinicaltrials.gov
6. Medical Subject Headings (MeSH), http://www.nlm.nih.gov/mesh/
7. PubMed, http://www.ncbi.nlm.nih.gov/pubmed
8. Karanastasis, E., Andronikou, V., Chondrogiannis, E., Tsatsaronis, G., Eisinger, D., Petrova, A.: The OpenScienceLink architecture for novel services exploiting open access data in the biomedical domain. In: Proceedings of PCI '14, pp. 28:1-28:6. ACM, New York (2014)
9. Xu, Y., Wang, Z., Lei, Y., Zhao, Y., Xue, Y.: MBA: a literature mining system for extracting biomedical abbreviations. BMC Bioinformatics. 9, 10:14 (2009)
10. McCarthy, D., Koeling, R., Weeds, J., Carroll, J.: Finding predominant word senses in untagged text. In: Proceedings of ACL '04, pp. 280-287. Stroudsburg, PA, USA (2004)
11. Stevenson, M., Guo, Y., Amri, A.A., Gaizauskas, R.: Disambiguation of biomedical abbreviations. In: Proceedings of BioNLP '09, pp. 71-79. Boulder, Colorado, USA (2009)
12. McInnes, B.T., Pedersen, T., Carlis, J.: Using UMLS Concept Unique Identifiers (CUIs) for Word Sense Disambiguation in the Biomedical Domain. In: AMIA '07, pp. 533-537 (2007)
13. CT abbreviations-annotated corpus, http://147.102.19.246:8080/AbbrAnnotatedCorpus/
14. Chang, J.T., Schütze, H., Altman, R.B.: Creating an online dictionary of abbreviations from MEDLINE. J Am Med Inform Assoc. 9(6), 612-20 (2002)
15. Pustejovsky, J., Castaño, J., Cochran, B., Kotecki, M., Morrell, M.: Automatic extraction of acronym-meaning pairs from MEDLINE databases. Stud Health Technol Inform. 84(1), 371-5 (2001)
16. Zhou, W., Torvik, V.I., Smalheiser, N.R.: ADAM: another database of abbreviations in MEDLINE. Bioinformatics 22(22), 2813-2818 (2006)
17. Park, Y., Byrd, R.J.: Hybrid Text Mining for Finding Abbreviations and their Definitions. In: Proceedings of the EMNLP '01 conference, pp. 126-133 (2001)

Experiments on Enlarging a Lexical Ontology

Alberto Simões1,2 & José João Almeida2

1 Centro de Estudos Humanísticos, 2 Centro Algoritmi, Universidade do Minho, Braga, Portugal. [email protected]

Abstract. This paper presents two simple experiments performed in order to enlarge the coverage of PULO, a lexical ontology based on and aligned with the Princeton WordNet. The first experiment explores the triangulation of the Galician, Catalan and Castilian wordnets with translation dictionaries from the Apertium project. The second explores Dicionário-Aberto entries, in order to extract synsets from its definitions. Although similar approaches were already applied for different languages, this document aims at documenting their results for the PULO case.

1 Introduction

Recently, a huge effort has been made to boost the development of wordnet clones for different languages. Portuguese is not an exception. There are different initiatives to create lexical ontologies, linked or not with the original Princeton WordNet [9] (WordNet.Pr). Examples of such initiatives are Onto.PT [4], PAPEL [5], TeP [8] or Open WordNet-PT [10]. Along with these, another initiative was born some months ago: the Portuguese Unified Lexical Ontology (PULO) [12]. It aims at integrating different existing resources into a structure aligned with WordNet.Pr. Recently, a joint effort on comparing these projects' history, goals and statuses [7] led some teams in the direction of cooperation. Nevertheless, each project team continues its own initiatives, enriching and enlarging its resources. The same happens with PULO. This document describes two experiments performed with the objective of enlarging the number of variants3. These experiments are, somehow, similar to some of the previous work done in order to bootstrap PULO [12] (as we also triangulated three different wordnets, but using probabilistic translation dictionaries), to some of the approaches used to expand GalNet [3], and to those used to create Onto.PT [4]. Although the idea is not new, the thorough description of the process and its brief evaluation is relevant for future initiatives with other languages. This short article includes two main sections: section 2 describes the experiment approaches and the resources used, while section 3 gives some measures of the quality of the methods' application. Finally, it concludes with a brief discussion of the results and future work.

3 This article will use the term variant to refer to one of the synonyms of a synset.


2 Experiments Description

Before running these experiments, PULO included a total of 18.689 variants, distributed over 17.871 synsets (meaning most synsets include only one variant). Table 1 shows how these variants are distributed by morphological category.

Table 1. Distribution of the 18.689 variants prior to the enlargement experiments.

          Nouns    Adjectives   Verbs   Adverbs   Total
Variants  10.421   3.441        4.283   544       18.689

The next subsections describe the two experiments. The first one is based on the triangulation of the Catalan, Galician and Castilian wordnets using translation dictionaries. The second one explores Dicionário-Aberto [11], an open and free definitions dictionary.

2.1 Experiment I: Triangulating Iberian Wordnets

This first experiment uses the wordnets available through the Multilingual Central Repository [6], and some translation dictionaries obtained from the Apertium [2] project. Given the reduced number of dictionaries including Portuguese, only the Catalan, Galician and Castilian languages were used. Table 2 shows the sizes of these three wordnets.

Table 2. Summary of sizes for the three used wordnets.

                      Nouns    Adjectives   Verbs    Adverbs   Total
Galician    Synsets   18.850   5.092        1.541    349       25.832
            Variants  25.205   8.050        4.145    420       37.820
Catalan     Synsets   36.460   4.148        5.424    1         46.033
            Variants  51.606   7.679        11.577   2         70.864
Castilian   Synsets   26.594   5.180        6.251    677       38.702
            Variants  39.142   6.967        10.829   1.051     57.989

Regarding the translation dictionaries, Table 3 summarizes their sizes. As can be seen, these are quite small dictionaries. This fact was the main reason why the bootstrapping approach [12] used probabilistic translation dictionaries, which have a broader coverage. Also, note that most entries in these dictionaries have only one translation, reducing the translation ambiguity (which is somewhat desired for a machine translation dictionary, but reduces its applicability for other tasks). The algorithm used is quite simple (a brief sketch in code is given after the list). For each synset in the database that includes at least one variant in any of the three languages, it:

1. Creates a multiset S_L that includes all translations obtained by the translation of all variants for language L. Note that different variants can translate to the same word in Portuguese, so the multiset tracks the number of times that word was obtained.


Table 3. Translation dictionaries sizes.

Lang. Pair   Nr. Entries   Max Nr. Trans.   Avg. Nr. Trans.
GL–PT        11.003        4                1.07
CA–PT        6.510         7                1.11
ES–PT        12.742        6                1.07

2. Computes the multiset S = S_GL ∪ S_CA ∪ S_ES. This means that, if a Portuguese word was obtained by translating just one variant for each of the source languages, it would have a multiplicity of three. On the other hand, if three variants of just one language generated a Portuguese word that was not obtained from any of the other languages, its multiplicity would also be three. It was decided not to give extra weight depending on whether the word was obtained from different languages or always from the same language, in order to keep the algorithm simple.
3. Filters the multiset S, removing all Portuguese variants with a multiplicity of just one. To define this cut line, each variant was checked against the current variants in PULO. Figure 1 shows this test. Bars at the left represent variants found in PULO, while bars at the right represent new variants. Given the huge amount of new variants with a multiplicity of 1, it was decided to ignore them (trying to improve accuracy).

Fig. 1. Number of candidate variants already existing in PULO (left bars) against the new candidates (bars at the right), distributed by their multiplicity in multiset S.

4. The bootstrapping approach for PULO used dictionaries obtained from European Portuguese corpora with its old orthography4. The dictionaries from Apertium used, essentially, the Brazilian orthography which, curiously, is now the correct form for European Portuguese.

4 Orthography prior to the 1990 agreement, which was made official in 2008 by the Portuguese Government, and is still being progressively adopted in Portugal.


With that in mind, a simple tool was used to remove variants written in the old orthography and add the respective new orthography in case it was not yet present. This process was performed using the JSpell morphological analyzer [1].
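The sketch announced before the list covers steps 1-3 (the orthography normalisation of step 4 is omitted); the wordnet and dictionary lookup functions are hypothetical placeholders for the actual resources.

# Sketch of steps 1-3: translate the variants of each source-language synset
# into Portuguese, merge the translations into a single multiset, and keep
# the candidates obtained more than once. Lookup functions are placeholders.
from collections import Counter

def candidate_variants(synset_id, variants_for, translate):
    """variants_for(lang, synset_id) -> list of variants in that wordnet;
    translate(lang, word) -> list of Portuguese translations (may be empty)."""
    s = Counter()                                   # multiset S over PT words
    for lang in ("gl", "ca", "es"):
        for variant in variants_for(lang, synset_id):
            s.update(translate(lang, variant))      # step 1: build S_L and merge
    # Step 3: drop Portuguese words obtained only once.
    return {word for word, multiplicity in s.items() if multiplicity > 1}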

This process created a total of 7.229 new variants, and removed 261 existing variants with the old orthography. Table 4 summarizes the distribution of PULO variants by morphological category after this experiment.

Table 4. Distribution of the 25.657 variants after the first enlargement experiment.

          Nouns    Adjectives   Verbs   Adverbs   Total
Variants  14.062   4.825        6.172   598       25.657

2.2 Experiment II: Synset Extraction from Definitions Dictionary

This second experiment was prepared with the expectation of a large amount of false positives. Nevertheless, there was interest in confirming that expectation. The main idea was to use Dicionário-Aberto (DA) [11] definitions to construct synsets. DA is partially encoded in TEI5. DA definitions are stored in def XML elements, with the new line signaling the change of sense6. Although XML should ignore spaces and new lines, this decision was taken during the dictionary encoding process for simplicity. Each sense line can include very different types of information. The most common is a standard definition, explaining the concept. In other cases, there are examples, or see-also references. But there is another kind of definition that is quite interesting for the PULO enlargement process: some lines include a set of synonyms separated by semicolons. Thus, this second experiment finds lines in DA that are only a sequence of terms separated by semicolons. For each of these sequences, the list of synonyms, together with the entry head word, is stored. Exploring the 128.521 entries in DA, 4.842 synsets were found. These synsets have from 3 to 7 synonyms, with an average of 3.14 synonyms per synset. In order to map these synsets to PULO synsets, a simple heuristic was used: find an intersection between the synonyms from the two sources that includes at least two variants. This means that, for a synset ⟨s_1, s_2, s_3⟩ obtained from DA, s_i will be suggested as a candidate if there is a synset S in PULO that contains s_j and s_k, with i ≠ j ≠ k. This process suggested 1.150 additions. Given that this dictionary is quite noisy, and includes a lot of words with the old orthography (prior to the 1945 agreement), these suggestions were not added automatically to PULO.

5 Text Encoding Initiative XML schema, which includes notation to encode different kinds of resources, from simple books to corpora or dictionaries.
6 This distinction is, of course, the responsibility of the original lexicographer.
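The extraction and mapping heuristic just described can be sketched as follows; the TEI parsing is reduced to a plain-text stand-in over the def element contents, and the data structures are assumptions made for illustration.

# Illustrative sketch: extract candidate synsets from definition lines made
# only of semicolon-separated terms, and propose new variants for PULO
# synsets that already share at least two of those synonyms.
import re

TERM_LINE = re.compile(r"^\s*[\w\s-]+(?:\s*;\s*[\w\s-]+)+\s*\.?\s*$")

def extract_synsets(entries):
    """entries: dict head word -> list of sense lines (text of def elements)."""
    synsets = []
    for head, senses in entries.items():
        for line in senses:
            if TERM_LINE.match(line):
                terms = {t.strip() for t in line.rstrip(" .").split(";")}
                synsets.append(terms | {head})
    return synsets

def suggest_variants(da_synset, pulo_synsets):
    """Suggest members of a DA synset for every PULO synset sharing >= 2 variants."""
    suggestions = []
    for synset_id, variants in pulo_synsets.items():
        if len(da_synset & variants) >= 2:
            suggestions.extend((synset_id, v) for v in da_synset - variants)
    return suggestions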


3 Experiments Evaluation

Both evaluations reported here were performed by sampling, given that there is no gold standard that can be used to evaluate these candidates, nor the manpower needed to fully (manually) evaluate all candidates from both experiments. For the second experiment, all suggestions need to be evaluated before being added to PULO. Nevertheless, there has been no time to complete that task yet.

3.1 Experiment I

For the first experiment, 200 of the added variants were chosen randomly. This sample included 101 nouns, 39 adjectives, 2 adverbs and 58 verbs. This evaluation resulted in 152 correct variants, 40 incorrect variants, and 8 variants classified as ambiguous (it was decided not to delete them, but that decision is not consensual). So, this experiment's accuracy is, by sampling7, 76%. Table 5 presents these results distributed by morphological category.

Table 5. Correct, incorrect and ambiguous variants by morphological category for the first experiment.

            Nouns      Adjectives   Verbs      Adverbs    Total
Correct     82 (81%)   27 (69%)     41 (71%)   2 (100%)   152 (76%)
Incorrect   17 (17%)   9 (23%)      14 (24%)   0 (0%)     40 (20%)
Ambiguous   2 (2%)     3 (8%)       3 (5%)     0 (0%)     8 (4%)
Total       101        39           58         2          200

3.2 Experiment II

For the second experiment, 200 of the candidate variants were chosen randomly and manually evaluated. In this test, only 115 variant candidates were marked for acceptance, while 74 were marked as wrong, and 11 as ambiguous. Table 6 shows the distribution of these candidates by morphological category. The accuracy8 of this experiment was 58%.

4 Conclusions

This article reports two experiments on expanding PULO's coverage. Although the methods used are not new, the experiments have shown that they can achieve acceptable accuracy. Even the second method, which used a very noisy and old dictionary (from 1913), could suggest a good set of new variants. Nevertheless, when dealing with semantics, decisions are not consensual, and other researchers would probably accept or reject a different number of entries.

7 Given the obtained accuracy and the lack of human resources for a thorough validation, the authors decided to include the obtained variants without further analysis.
8 Given the low accuracy and the small number of proposed variants, the authors decided to perform a manual validation prior to their incorporation into PULO.


Table 6. Correct, incorrect and ambiguous variants by morphological category for the second experiment.

            Nouns      Adjectives   Verbs      Adverbs   Total
Correct     56 (62%)   28 (56%)     31 (52%)   0         115 (58%)
Incorrect   28 (31%)   19 (38%)     27 (45%)   0         74 (37%)
Ambiguous   6 (7%)     3 (6%)       2 (3%)     0         11 (5%)
Total       90         50           60         0         200

Acknowledgements: Thanks to Nuno Carvalho for the proofreading. This work has been partially supported by FCT - Fundação para a Ciência e Tecnologia within the Project Scope UID/CEC/00319/2013.

References

1. Almeida, J.J., Pinto, U.: Jspell – um módulo para análise léxica genérica de linguagem natural. In: Actas do X Encontro da Associação Portuguesa de Linguística. pp. 1–15. Évora 1994 (1995)
2. Forcada, M.L.: Apertium: traducció automàtica de codi obert per a les llengües romàniques. Linguamática 1(1), 13–23 (May 2009)
3. Gómez Guinovart, X., Clemente, X.M.G., Pereira, A.G., Lorenzo, V.T.: Galnet: WordNet 3.0 do galego. Linguamática 3(1), 61–67 (2011)
4. Gonçalo Oliveira, H., Gomes, P.: Onto.PT: recent developments of a large public domain portuguese wordnet. In: Proceedings of the 7th Global WordNet Conference. pp. 16–22 (2014)
5. Gonçalo Oliveira, H., Santos, D., Gomes, P., Seco, N.: PAPEL: A dictionary-based lexical ontology for Portuguese. In: Proceedings of Computational Processing of the Portuguese Language - 8th International Conference (PROPOR). vol. 5190, pp. 31–40. Springer (2008)
6. Gonzalez-Agirre, A., Laparra, E., Rigau, G.: Multilingual Central Repository version 3.0. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC'12). pp. 2525–2529. ELRA (2012)
7. Gonçalo Oliveira, H., de Paiva, V., Freitas, C., Rademaker, A., Real, L., Simões, A.: As Wordnets do Português. In: Simões, A., Barreiro, A., Santos, D., Sousa-Silva, R., Tagnin, S. (eds.) Linguística, Informática e Tradução: Mundos que se Cruzam, vol. 7, pp. 397–424 (March 2015)
8. Maziero, E.G., Pardo, T.A.S., Felippo, A.D., Dias-da-Silva, B.C.: A Base de Dados Lexical e a Interface Web do TeP 2.0. In: VI Workshop em Tecnologia da Informação e da Linguagem Humana. pp. 390–392 (2008)
9. Miller, G.A.: WordNet: A lexical database for English. Communications of the ACM 38, 39–41 (1995)
10. Rademaker, A., Paiva, V.D., de Melo, G., Coelho, L.M.R., Gatti, M.: OpenWordNet-PT: A Project Report. In: Proceedings of the 7th Global WordNet Conference. pp. 383–390 (2014)
11. Simões, A., Farinha, R.: Dicionário Aberto: um recurso para processamento de linguagem natural. Vice-Versa 16, 159–171 (December 2011)
12. Simões, A., Guinovart, X.G.: Bootstrapping a portuguese wordnet from galician, spanish and english wordnets. Advances in Speech and Language Technologies for Iberian Languages 8854, 239–248 (2014)

Using Unstructured Profile Information for Gender Classification of Portuguese and English Twitter Users

Marco Vicente1,2, João P. Carvalho1,3 and Fernando Batista1,2

1 INESC-ID, Lisboa, Portugal, http://www.l2f.inesc-id.pt
2 ISCTE-IUL - Instituto Universitário de Lisboa, Lisboa, Portugal
3 Instituto Superior Técnico, Universidade de Lisboa, Portugal

Abstract. This paper reports experiments on automatically detecting the gender of Twitter users, based on unstructured information found on their Twitter profile. A set of features previously proposed is evaluated on two datasets of English and Portuguese users, and their performance is assessed using several supervised and unsupervised approaches, including Naive Bayes variants, Logistic Regression, Support Vector Machines, Fuzzy c-Means clustering, and k-means. Results show that the features perform well in each language separately, but the best results were achieved when combining both languages. Supervised approaches reached 97.9% accuracy, but Fuzzy c-Means also proved suitable for this task, achieving 96.4% accuracy.

Key words: Twitter users; gender detection; fuzzy c-Means; supervised methods; unsupervised methods.

1 Introduction

The growth of social networks has produced massive amounts of data. This user-generated information provides clues about users' opinions, daily routines and reactions to events, among others. Twitter, with about 500 million user-generated tweets per day, provides an opportunity for social networking studies [4], and has become the subject of studies seeking to understand public opinion [7]. Unlike in other social networks, a user name is the only required field when creating a Twitter profile. Other relevant information, such as gender or age, is optional. Nevertheless, the user profile includes optional text attributes that can be used. Previous studies support the hypothesis that users tend to choose real names more often than other forms [2] and, in fact, gender information is most of the time provided either wittingly or unwittingly, for example, in the screen name (e.g. "johndoe95" or "marianacruz") or in the user name (e.g. "John Doe the best :)" or "the macho man!!!"). The problem of gender detection in Twitter using the user profile information has rarely been addressed in the literature.


However, a related Natural Language Processing (NLP) problem consists of deciding whether the author of a text is male or female. Such a problem is known as gender detection or classification, and is often addressed [11, 8]. The problem of gender detection on Twitter has been addressed by Rao et al. [16]. That work examined tweets written in English, using Support Vector Machines with character n-gram features and sociolinguistic features like emoticon use or alphabetic character repetitions. They reported an accuracy of 72.3% when combining n-gram features with sociolinguistic features. The state-of-the-art study reported by Burger et al. [6] uses a large multilingual corpus, including approximately 184k users labelled with gender, 3.3 million tweets for training, and 418k tweets for testing. They used SVMs, Naive Bayes and Balanced Winnow2 with word and character n-grams as features. When combining tweet texts with profile information (description, user name and screen name), they achieved 92% accuracy. This paper addresses the task of automatically detecting the user's gender based on the unstructured textual information found on the user's profile, both in the screen name and in the user name. The paper describes a set of features for gender classification proposed in our previous study [17], which rely on the unstructured textual information of the user's profile. The main contributions are twofold: firstly, we assess the performance of the features using several supervised and unsupervised methods for a Portuguese dataset, in addition to the English dataset used in our previous study. Secondly, we show that the proposed features are compatible with both languages, and that results are improved when merging both datasets. We notice that, using unsupervised methods, the increasing amount of data has a positive impact on the results. The features can be used to extend gender-labelled datasets for researchers. This paper is organized as follows: Section 2 characterizes the data, describes the proposed features and describes our golden set of manually labelled data. Section 3 describes the experiments and reports the corresponding results. Section 4 presents the conclusions and prospects for future work.

2 Data and Features

Experiments performed in this paper use an English and a Portuguese dataset of Twitter users. The English dataset was extracted from one month of tweets collected during December 2014, using the Twitter streaming/sample API. The data was restricted to English geolocated tweets, either from the United States or from the United Kingdom, totaling 296506 unique users. The Portuguese dataset is a subset of the data described in Brogueira et al. [5], and corresponds to a database of Portuguese users, restricted to users that tweeted in Portuguese during October 2014 and were geolocated in the Portuguese mainland.

In order to automatically associate names that can be found in the user's profile with the corresponding gender, we have compiled a dictionary of English names and a dictionary of Portuguese names. Both dictionaries contain the gender and the number of occurrences for each name, and focus on names that are exclusively male or female, since unisex names could be classified as either male or female. The English names dictionary contains about 8444 names. It was compiled using the list of the most used baby names from the United States Social Security Administration. The dictionary is currently composed of 3304 male names and 5140 female names. The Portuguese names dictionary contains 1659 names, extracted from [1]. Their work is based on the extraction of names both from official institution lists and from previous corpora. The dictionary is currently composed of 875 male names and 784 female names.

Our experiments use the features proposed in a previous work [17], which are extracted with the dictionary of names described previously. The profile information is normalized for repeated vowels (e.g. "eriiiiiiiiic" → "eric") and "leet speak" (e.g. "3ric" → "eric"). After finding one or more names in the user name or screen name, we extract the applicable features from each name by evaluating elements such as "case", "boundaries", "separation" and "position". The final model uses 192 features. Each element increases the feature granularity. Considering the screen name "jill gaines", three names are extracted: "aine", "ines" and "jill", but only "jill" has both boundary features, since it starts the string and ends with a common boundary. An example feature for such a screen name: a name (jill) can be extracted at the initial position, is separated by a boundary, and, after correcting case, exists in the dictionary, and is male. About 243522 English users (82%) and 15828 Portuguese users (58%) trigger at least one gender feature.

In order to perform the evaluation, we manually labelled a random selection of Portuguese users with gender information and used the existing labelled English dataset [17]. The corresponding gender was assigned by manually analyzing and validating users based on their user name/screen name and their profile picture, and by checking whether associated blogging websites corresponded in gender. All users in our labelled datasets contain at least one sequence that matches a name in our dictionary of names. The English labelled dataset has 748 users: 330 male users and 418 female users. The Portuguese labelled dataset has 716 users: 249 male users and 467 female users. The majority of the users are female, which is consistent with the work of Heil et al. [9], who performed a study of the correlation between name and gender, and estimate that 55% of Twitter users are female.
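To make the normalization and dictionary-matching step concrete, the following Python sketch mimics the idea described above. It is not the authors' implementation: the tiny dictionary, the regular expressions and the helper names are ours, and the actual 192-feature model (case, boundaries, separation, position) is not reproduced.

import re

# Toy gender dictionary; the real ones hold ~8444 English and 1659 Portuguese names.
NAME_GENDER = {"eric": "M", "jill": "F", "mariana": "F", "john": "M"}

# Undo a few common "leet speak" substitutions (assumed set, for illustration only).
LEET = str.maketrans({"3": "e", "1": "i", "0": "o", "4": "a", "5": "s"})

def normalize(text):
    """Lower-case, undo leet speak and collapse repeated vowels."""
    text = text.lower().translate(LEET)
    return re.sub(r"([aeiou])\1+", r"\1", text)

def gender_hints(profile_field):
    """Return (name, gender) pairs for dictionary names found in a profile field."""
    cleaned = normalize(profile_field)
    return [(name, g) for name, g in NAME_GENDER.items() if name in cleaned]

print(normalize("3riiiiic"))          # -> eric
print(gender_hints("jill_gaines95"))  # -> [('jill', 'F')]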

3 Experiments and results

This section describes our experiments applying the proposed features to both datasets, separately and combined, using different supervised and unsupervised approaches. The supervised methods include Multinomial Naive Bayes (MNB) [14], a variant of Naive Bayes, Logistic Regression [12], and Support Vector Machines (SVM) [15, 10]. The unsupervised methods include Fuzzy c-Means clustering (FCM) [3] and k-means [13]. The fuzzy logic toolkit for SciPy3 was used for implementing FCM, and all the other methods were applied through Weka4, a collection of open source machine learning algorithms and tools for data pre-processing and visualization.

3 SciPy Fuzzy Logic Toolkit. https://github.com/scikit-fuzzy/scikit-fuzzy


Table 1. Classification results for supervised and unsupervised methods.

                          |     English      |    Portuguese    | English + Portuguese
                          | Accuracy  kappa  | Accuracy  kappa  | Accuracy  kappa
Logistic Regression       |  93.7%    0.87   |  97.6%    0.95   |  96.3%    0.92
Multinomial Naive Bayes   |  97.2%    0.94   |  98.3%    0.96   |  97.9%    0.96
Support Vector Machines   |  96.4%    0.93   |  97.8%    0.95   |  97.4%    0.95
kMeans clustering         |  67.3%           |  70.1%           |  67.8%
Fuzzy c-Means             |  96.0%           |  94.4%           |  96.4%

While the supervised methods use labelled data to build a model, that is not the case of the unsupervised methods, which group unlabelled data into clusters. For that reason, we will first describe experiments using labelled data only, and then extend the analysis to all the data, restricting those experiments to unsupervised methods only. Experiments using supervised methods use the labelled data for training, with 5-fold cross-validation. Experiments using unsupervised methods use all the data for creating two different clusters; the labelled data was used for validation, and each cluster was assigned to the class with more elements from that cluster. In terms of setup, k-means was set to use the Euclidean distance, centroids were computed as a mean, and the seed was set to 10. In order to use the FCM clustering algorithm, the data was converted into a matrix of binary values, and we used 1000 iterations and the Euclidean distance. All experiments consider binary features.

Results achieved with each of the methods in the task of distinguishing between male and female users are summarized in Table 1. The first 3 rows show the performance for supervised methods. Results in the last two columns were achieved by combining both the English and the Portuguese labelled subsets. MNB achieved the best performance for both languages, and achieves even better performance for the merged subset of users, reaching about 98% accuracy, showing that the datasets can be combined and that the features are compatible with the two languages. The performance achieved suggests that the proposed features are suitable for discriminating the user's gender in both languages. The last two rows of the table summarize the performance for unsupervised methods. FCM obtains the correct gender for about 96.0% of the English users and about 94.4% of the Portuguese users when all the data is used. k-means achieves a much lower performance for both languages. The last column of the table shows the results when English and Portuguese data are combined. With such a dataset, FCM achieves the best results so far, outperforming the individual results obtained for each language.

Our proposed features compare well with the performance achieved by other state-of-the-art research, despite being applied to only about 82% of English users. For example, Burger et al. [6] use the Winnow algorithm with n-grams extracted from the user's full name and obtain 89.1% accuracy for gender detection.

4 Weka version 3-6-8. http://www.cs.waikato.ac.nz/ml/weka



Fig. 1. Impact of the amount of data on the performance, for Portuguese and English.

We have performed additional experiments in order to assess the impact of using increasing amounts of data. Figure 1 shows the impact of the amount of data on the performance of FCM, revealing that it has a positive impact until reaching 50k users. Above that threshold, the accuracy tends to remain stable, which may be due to our relatively restricted set of users.
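For reference, the following sketch shows how such an FCM experiment could be set up with the scikit-fuzzy toolkit mentioned above. The random binary matrix merely stands in for the real 192 binary features, and the exact parameter values are assumptions on our part rather than the paper's configuration (apart from the 1000 iterations and the Euclidean distance, which the text states).

import numpy as np
import skfuzzy as fuzz   # scikit-fuzzy, the fuzzy logic toolkit for SciPy

# Stand-in data: 1000 users x 192 binary features (random placeholder values).
rng = np.random.default_rng(0)
features = rng.integers(0, 2, size=(1000, 192)).astype(float)

# scikit-fuzzy expects the data as (n_features, n_samples), hence the transpose.
cntr, u, u0, d, jm, p, fpc = fuzz.cmeans(
    features.T, c=2, m=2.0, error=0.005, maxiter=1000)

# Hard-assign each user to the cluster with the highest membership degree;
# each cluster is then mapped to the majority gender of its labelled members.
clusters = np.argmax(u, axis=0)
print(clusters[:10], fpc)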

4 Conclusions and future work

We have described an approach to automatically detect the gender of Twitter users, using unstructured profile information. A number of name-related features are evaluated on a dataset of about 244k English users and a dataset of about 16k Portuguese users. Different supervised and unsupervised approaches are used to assess the performance of the proposed features. The proposed features proved to be good at discriminating the user's gender on Twitter, achieving about 97.9% accuracy using a supervised approach, and about 96.4% accuracy using the unsupervised approach based on Fuzzy c-Means, which also proved to be very suitable for this task. Our features proved to be compatible between the English and Portuguese datasets of Twitter users. Experiments show that, by combining the datasets of English and Portuguese users, the performance can be further increased. The performance of Fuzzy c-Means significantly increased when more data was used for learning the clusters. Above 50k users, the performance stabilizes, probably due to the relatively small amount of labelled data. Fuzzy c-Means proved to be an excellent choice for gender detection on Twitter since: i) it does not require labelled data, which is relevant when dealing with Twitter; ii) its performance increases as more data is provided; and iii) it achieves a performance close to (1.5% lower than) the best supervised method.

Future work will encompass the creation of extended labelled datasets in a semi-automatic fashion, based on an automatic annotation provided by our proposed features. Such extended labelled datasets will make it possible to associate the textual content provided by the users with their gender and to create gender models based purely on the text contents. Such models, based on huge amounts of data, can then be adapted and used in a cross-domain scenario.


Acknowledgements. This work was supported by national funds through Fundação para a Ciência e a Tecnologia (FCT) under project PTDC/IVC-ESCT/4919/2012 and funds with reference UID/CEC/50021/2013.

References

1. Baptista, J., Batista, F., Mamede, N.J., Mota, C.: Npro: um novo recurso para o processamento computacional do português. In: XXI Encontro APL (Dec 2005)
2. Bechar-Israeli, H.: From to : Nicknames, play, and identity on Internet Relay Chat. Computer-Mediated Communication 1(2) (1995)
3. Bezdek, J.C., Ehrlich, R., Full, W.: FCM: The fuzzy c-means clustering algorithm. Computers and Geosciences 10(2–3), 191–203 (1984)
4. Brogueira, G., Batista, F., Carvalho, J.P., Moniz, H.: Portuguese geolocated tweets: An overview. In: Proceedings of the International Conference on Information Systems and Design of Communication. pp. 178–179. ISDOC '14, ACM, New York, NY, USA (2014), http://doi.acm.org/10.1145/2618168.2618200
5. Brogueira, G., Batista, F., Carvalho, J.P., Moniz, H.: Expanding a database of Portuguese tweets. In: 3rd Symp. on Languages, Applications and Technologies SLATE'14. OpenAccess Series in Informatics (OASIcs), vol. 38, pp. 275–282 (2014)
6. Burger, J.D., Henderson, J., Kim, G., Zarrella, G.: Discriminating gender on Twitter. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing. pp. 1301–1309. Association for Computational Linguistics (2011)
7. Carvalho, J.P., Pedro, V., Batista, F.: Towards intelligent mining of public social networks' influence in society. In: IFSA World Congress and NAFIPS Annual Meeting (IFSA/NAFIPS). pp. 478–483. Edmonton, Canada (June 2013)
8. Goswami, S., Shishodia, M.: A fuzzy based approach to stylometric analysis of blogger's age and gender. In: Hybrid Intelligent Systems (HIS), 2012 12th International Conference on. pp. 47–51 (Dec 2012)
9. Heil, B., Piskorski, M.: New Twitter research: Men follow men and nobody tweets. Harvard Business Review 1, 2009 (2009)
10. Keerthi, S., Shevade, S., Bhattacharyya, C., Murthy, K.: Improvements to Platt's SMO algorithm for SVM classifier design. Neural Computation 13(3), 637–649 (2001)
11. Koppel, M., Schler, J., Argamon, S.: Computational methods in authorship attribution. Journal of the American Society for Information Science and Technology 60(1), 9–26 (2009)
12. Le Cessie, S., Van Houwelingen, J.C.: Ridge estimators in logistic regression. Applied Statistics pp. 191–201 (1992)
13. MacQueen, J.: Some methods for classification and analysis of multivariate observations (1967), http://projecteuclid.org/euclid.bsmsp/1200512992
14. McCallum, A., Nigam, K., et al.: A comparison of event models for naive Bayes text classification. In: AAAI-98 Workshop on Learning for Text Categorization. vol. 752, pp. 41–48 (1998)
15. Platt, J., et al.: Fast training of support vector machines using sequential minimal optimization. Advances in Kernel Methods: Support Vector Learning 3 (1999)
16. Rao, D., Yarowsky, D., Shreevats, A., Gupta, M.: Classifying latent user attributes in Twitter. In: Proceedings of the 2nd International Workshop on Search and Mining User-Generated Contents. pp. 37–44. ACM (2010)
17. Vicente, M., Batista, F., Carvalho, J.P.: Twitter gender classification using user unstructured information. In: FUZZ-IEEE 2015, IEEE International Conference on Fuzzy Systems. IEEE Xplore, Istanbul, Turkey (Accepted)

The Application of Grammar Inference to Software Language Engineering

Marjan Mernik

University of Maribor, Slovenia [email protected]

Abstract. There are many problems whose solutions take the form of patterns that may be expressed using grammars (e.g., speech recognition, text processing, genetic sequencing, programming language development, etc.). Construction of these grammars is usually carried out by computer scientists working with domain experts. Grammar inference (GI) is the process of learning a grammar from examples, either positive (i.e., the pattern should be recognized by the grammar) and/or negative (i.e., the pattern should not be recognized by the grammar). This talk will present the application of grammar inference to software language engineering, including recovery of domain-specific language (DSL) specifications from example DSL programs and recovery of a meta model from instance models which have evolved independently of the original meta model. Recent advances in semantic inference will be presented as well.

Biography. He received the M.Sc. and Ph.D. degrees in computer science from the University of Maribor in 1994 and 1998, respectively. He is currently Professor of Computer Science at the University of Maribor, Slovenia. He is also Visiting Professor of Computer and Information Sciences at the University of Alabama at Birmingham, and at the University of Novi Sad, Faculty of Technical Sciences, Serbia. His research interests include programming languages, domain-specific (modeling) languages, grammar-based systems, grammatical inference, and evolutionary computations. He is a member of the IEEE, ACM, EAPLS, and the Editor-in-Chief of the Computer Languages, Systems and Structures journal, as well as Associate Editor of the Applied Soft Computing journal.


Combining Processing with Racket

Hugo Correia and António Menezes Leitão

INESC-ID, Instituto Superior Técnico, Universidade de Lisboa
Rua Alves Redol 9, Lisboa, Portugal
{hugo.f.correia,antonio.menezes.leitao}@tecnico.ulisboa.pt

Abstract. Processing is a programming language created to teach programming in a visual context. Despite its success, Processing remains a niche language with limited applicability in the architectural field, as no Computer-Aided Design (CAD) application supports Processing. This work presents an implementation of Processing for the Racket platform that transforms Processing code into semantically equivalent Racket source code. Our Processing implementation is developed as a Racket module language for interoperability with Racket and other module languages of Racket's language ecosystem. Our implementation allows us to take advantage of Rosetta, a Racket library that provides access to several CAD back-ends (e.g. AutoCAD, Rhinoceros, SketchUp). As a result, architects and designers can take advantage of our implementation to use Processing with their favourite CAD application.

Keywords: Processing, Racket, Compilers, Interoperability

1 Introduction

Processing [1] is a programming language and development environment created to teach programming in a visual context. The language has grown over the years, creating a community where users can share their artistic works. Many examples and educational materials are available to newcomers, reducing the effort needed to learn the language. Moreover, Processing offers a wide range of 2D and 3D drawing primitives, as well as an Integrated Development Environment (IDE) that provides tools to programmatically create innovative designs.

Nonetheless, Processing is a niche programming language with limited applicability in the architectural field, as architects depend on traditional heavyweight CAD applications (e.g. AutoCAD, Rhinoceros 3D, etc.) that provide APIs tailored for that specific CAD tool. Unfortunately, no CAD application allows users to write scripts in Processing. As a result, architects that have learnt Processing cannot use the language, or any of the publicly available examples, to program in the context of their favourite CAD tool.

On the other hand, Racket [2] is a descendant of Scheme, which encourages developers to tailor their environment to project-specific needs, offering an ecosystem that allows the creation of new languages and that has direct interoperability with Racket libraries.


For instance, Rosetta [3] is a Generative Design tool built on top of Racket that embraces Racket's philosophy of using different languages. Rosetta allows programmers to generate 2D and 3D geometry in a variety of CAD applications, namely AutoCAD, Rhinoceros 3D, SketchUp, and Revit, using several programming languages, such as JavaScript, AutoLISP, Racket, and Python. Furthermore, Racket offers a pedagogic IDE, DrRacket, which can be adapted to support new module languages of the Racket ecosystem.

Our solution is to implement a source-to-source compiler that translates Processing code to semantically equivalent Racket code, enabling architects to prototype designs using Processing in a CAD tool. Moreover, as Racket encourages developers to use and create different languages within the Racket ecosystem [4], we have developed an interoperability mechanism to access Racket libraries and to combine Processing with scripts written in other languages of the Racket ecosystem, such as Python [5] or Typed Racket [6].

2 Processing

Processing was developed at the MIT Media Lab and was heavily inspired by the Design by Numbers [7] project. The language was created to teach computer science to artists and designers with no previous programming experience. Processing has grown over the years with the support of an academic community, which has written several educational materials demonstrating how programming can be used in the visual arts.

Processing is built on Java, but presents some changes to simplify the language. For instance, in Java, developers have to implement a large set of artefacts to develop simple examples, namely a public class that implements public methods and a static main method. These constructs bring an initial overhead and verbosity that are cumbersome for beginners who want to quickly try out new ideas. To solve this problem, Processing allows users to write simple scripts (i.e. simple sequences of statements) that do not have the verbosity of Java, thus enabling them to quickly create new designs.

The Processing language introduces the notion of a sketch, which is used to organize source code. A sketch can operate in one of two distinct modes: Static or Active. Static mode supports simple Processing scripts, such as simple statements and expressions. Active mode allows users to implement their sketches using more advanced features of the language: if a function or method definition is present, the sketch is considered to be in Active mode. Within each sketch, Processing users can define two functions to aid their design process: setup and draw. On one hand, the setup function is called once when the program starts; in setup the user can define initial environment properties and execute initialization routines needed to create the design. On the other hand, the draw function runs after setup and executes the code to draw the design.


The control flow is simple: first setup is executed, setting up the environment; then draw is called in a loop, rendering the sketch until it is stopped by the user.

Moreover, Processing offers users a set of tools that are specially tailored for visual artists. For instance, 2D and 3D drawing primitives are made available, rendering designs in different 2D and 3D rendering environments. Processing also offers a simple but effective development environment called the PDE (Processing Development Environment), where users can develop their programs using a tabbed editor with IDE services such as syntax highlighting and automatic code formatting.
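The control flow of a sketch can be pictured with the following minimal analogue (ours, in Python, not Processing's actual runtime): setup runs once and draw is then called repeatedly until the sketch is stopped.

import time

def run_sketch(setup, draw, frame_rate=60, frames=3):
    """Illustrative run loop: call setup once, then draw in a loop."""
    setup()                        # initial environment properties, initialization
    for _ in range(frames):        # a real sketch loops until stopped by the user
        draw()                     # render one frame of the design
        time.sleep(1.0 / frame_rate)

run_sketch(lambda: print("setup"), lambda: print("draw"))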

3 Related Work

The following section presents two major systems that influenced our work, and an analysis of their main features.

3.1 Processing.js

Processing.js [8] is a JavaScript implementation of Processing for the Web that enables developers to create scripts in Processing or JavaScript. Using Processing.js, developers can use Processing's approach to design 2D and 3D geometry in an HTML5-compatible browser. Processing.js uses a custom-purpose JavaScript parser that parses both Processing and JavaScript code, translating Processing code to JavaScript while leaving JavaScript code unmodified. Moreover, Processing.js implements Processing's drawing primitives and built-in classes directly in JavaScript, allowing for greater interoperability between both languages, as Processing code is seamlessly integrated with JavaScript. To render Processing scripts in a browser, Processing.js uses the HTML canvas element to provide 2D geometry, and WebGL to implement 3D geometry. Processing.js encourages users to develop their scripts in Processing's development environment, and then render them in a web browser. Additionally, Sketchpad [9] is an alternative online IDE for Processing.js, where users can create and test their design ideas online and share them with the community.

3.2 ProfessorJ

ProfessorJ [10, 11] was developed to be a language extension for DrScheme [12]. ProfessorJ implements a traditional compiler pipeline that starts with a lex and yacc parsing phase, producing an intermediate representation in Scheme. Subsequently, the translated code is analysed, generating target Scheme code by using custom-defined functions and macro transformations. ProfessorJ implements several strategies to map Java code to Scheme. For instance, Java classes are translated into Scheme classes with certain caveats, such as implementing static methods as Scheme procedures or changing Scheme's object creation to appropriately handle Java constructors. Also, Java has multiple namespaces while Scheme has a single namespace, hence name mangling techniques were implemented to correctly support multiple namespaces.


Moreover, Java's built-in primitive types and some classes are directly implemented in Scheme, while the remaining classes are implemented in Java. Strings, Arrays, and Exceptions are mapped directly into Scheme forms. Implementing them in Scheme is possible (with some constraints) due to similarities in both languages which, in turn, allow for a high level of interoperability. Finally, ProfessorJ is fully integrated with DrScheme, providing a development environment that offers syntax highlighting, syntax checking, and error highlighting for Java code.

4 Compilation Process

After a careful analysis, we concluded that an IDE is an important feature to have in a new implementation of the Processing language. Moreover, to achieve better interoperability between the source and target languages, we observe that in these systems the runtime is implemented in the target language. Alternatively, other external Processing libraries could be explored to connect Processing with CAD environments. For instance, OBJExport [13] is a Processing library to export coloured meshes from Processing as OBJ or X3D files. However, with this approach the developer loses the interactivity of programming directly in a CAD application, as users have to import the OBJ file each time the Processing specification is changed, creating a cumbersome workflow.

Our proposed solution was to develop Processing as a new Racket language module, using Rosetta for Processing's visual needs, and integrating Processing with DrRacket's IDE services. We chose Racket, firstly, because it allows for the development of new languages, providing libraries to implement the lexical and syntactic definitions of the Processing language, as well as offering mechanisms to generate semantically equivalent Racket code. Secondly, Racket's capabilities enable us to easily adapt our Processing implementation to work with DrRacket (Racket's educational IDE), providing an IDE to its users. Moreover, after analysing ProfessorJ, we concluded that many parts of its lexical and syntactic definitions and type-checking procedures could be adapted, due to the similarities between Java's and Processing's language definitions. Finally, our implementation allows us to take advantage of Rosetta to augment Processing with capabilities that make the language suitable for architectural work.

Our Processing implementation follows the traditional compiler pipeline approach (illustrated in figure 1), composed of three separate phases, namely parsing, code analysis, and code generation.

Fig. 1: Overall compilation pipeline


4.1 Parsing Phase

The compilation process starts with the parsing phase, which is divided into two main steps. First, Processing source code is read and transformed into tokens. Secondly, tokens are given to an LALR parser, building an abstract syntax tree (AST) of Racket objects which will be analysed in subsequent phases. To implement the lexer and parser specifications, we used Racket's parser-tools [14] library, adapting parts of ProfessorJ's lexer and grammar specification to fit Processing's needs.

4.2 Code Analysis

Following the parsing phase, an analysis of the AST must be made, due to differences between Processing's and Racket's language definitions. For instance, Processing has static type-checking and different namespaces for methods, fields, and classes, while Racket is dynamically typed and has a single namespace. As a result, custom-tailored mechanisms were needed to correctly type-check the AST and support Processing's scoping rules.

Firstly, the AST is traversed, passing scope information to child nodes. When a new definition is created, be it a function, variable, or class, the newly defined binding is added to the current node's scope along with its type information. Each time a new scope is created in Processing, a new custom scope is created to represent it, referring to the current scope as its parent. These mechanisms are needed to implement Processing's scoping and type-checking rules; for example, the return type, arity, and argument types are needed to type-check a function call. Secondly, the type-checking procedure runs over the AST starting at the topmost AST node. As before, it repeatedly calls the type-checker on child nodes until the full AST is traversed, using the bindings previously saved in the current scope to find out the type of each binding. During the type-checking procedure, each node is tested for type correctness, promoting types if necessary. If types do not match, a type error is produced, signalling where the error occurred.
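The scope handling just described can be sketched as follows. This is an illustrative Python analogue of the mechanism, not the actual Racket data structures: each scope stores its own bindings with their type information and keeps a pointer to its parent, and lookups walk the chain outwards.

class Scope:
    """Lexical scope with a parent pointer, used for name and type lookups."""
    def __init__(self, parent=None):
        self.parent = parent
        self.bindings = {}            # name -> type information

    def define(self, name, type_info):
        self.bindings[name] = type_info

    def lookup(self, name):
        scope = self
        while scope is not None:      # walk towards the outermost scope
            if name in scope.bindings:
                return scope.bindings[name]
            scope = scope.parent
        raise TypeError("unbound identifier: " + name)

top_level = Scope()
top_level.define("foo", {"return": "int", "args": ["float", "float"]})
inner = Scope(parent=top_level)
print(inner.lookup("foo"))            # type info needed to type-check a call to foo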

4.3 Code Generation

After the AST is fully analysed and type-checked, semantically equivalent Racket code can be generated. To achieve this, every AST node generates Racket code by using custom-defined macros and functions. Afterwards, Racket will expand the defined macros and load the generated code into Racket's VM. By using macros we can create boilerplate Racket code that can be constantly modified and tested by the developer.

Racket and Processing follow the same evaluation order in their programs, thus most of Processing's statements and expressions are directly mapped into Racket forms. However, other statements such as return, break, or continue need different handling, as they use control-flow jumps.


To implement this behaviour, we used Racket's escape continuations [15] in the form of let/ec.

Furthermore, Processing has multiple namespaces, which required additional effort to translate bindings to Racket's single namespace. To support multiple namespaces in Racket, binding names were mangled with custom tags. For instance, a fn tag is appended to functions, so function foo internally becomes foo-fn. The use of '-' as a separator solves the problem of name clashes with user-defined bindings, as Processing does not allow '-' in names. Also, as Processing has function overloading, we append tags that represent the argument types to the function's name. For instance, the function definition int foo(float x, float y){ ... } would be translated to (define (foo-FF-fn x y) ...).

To correctly support Processing's distinction between Active and Static mode, we used the following strategy. We added a custom check in the parser that signals whether the code is in Active mode, i.e. whether a function or method is defined. In this mode, global statements are restricted, so when generating code for global statements we check whether the code is in Active mode and, if so, signal an error indicating the invalid statement.
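The mangling scheme can be summarised by a small helper. The single-letter type tags follow the examples in the text (F for float, O for Object, I for int); the helper itself and the remaining tags are our own illustration, not the compiler's code.

TYPE_TAGS = {"int": "I", "float": "F", "Object": "O", "String": "S", "boolean": "B"}

def mangle(name, arg_types, namespace="fn"):
    """Build a Racket-safe identifier: '-' is a valid separator because
    Processing identifiers cannot contain it."""
    tags = "".join(TYPE_TAGS[t] for t in arg_types)
    parts = [name] + ([tags] if tags else []) + [namespace]
    return "-".join(parts)

print(mangle("foo", ["float", "float"]))                      # -> foo-FF-fn
print(mangle("echo", ["int", "Object", "float", "float"]))    # -> echo-IOFF-fn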

5 Runtime

Our runtime is implemented directly in Racket, allowing for greater interoperability with Racket libraries, namely Rosetta. However, this presents some important issues. First, as Racket is a dynamically typed language, the type-checker cannot know, at compile time, the types of Racket bindings. To solve this issue, we introduced a new type in the type hierarchy, which the type-checker ignores when type-checking these bindings. Furthermore, as Processing primitives and built-in classes are implemented in Racket, we also have the problem of associating type information with these bindings. Therefore, we created a simple macro that allows us to associate type information with Racket definitions, adding them to the global environment, so that the type-checker can correctly verify whether types are compatible.

Moreover, Processing's drawing paradigm closely resembles OpenGL's traditional push & pop matrix style. To provide rendering capabilities in our system, we use Rosetta, as it provides design abstractions that not only let us generate designs in an OpenGL renderer, but also give us access to several CAD back-ends. Custom interface adjustments are needed to implement Processing's drawing primitives in Racket, as not every Processing primitive maps directly into Rosetta's. Furthermore, Rosetta also enables us to supply Processing developers with drawing primitives that are unavailable in the original Processing environment.
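One way to picture the type-association mechanism is a registry of signatures for runtime bindings together with a wildcard type that the checker simply accepts. The sketch below is a Python analogue of that idea, not the actual Racket macro.

DYNAMIC = "dynamic"      # the extra type in the hierarchy that the checker ignores
SIGNATURES = {}          # binding name -> (argument types, return type)

def declare(name, arg_types, ret_type):
    """Analogue of the macro that attaches type information to a runtime binding."""
    SIGNATURES[name] = (arg_types, ret_type)

def check_call(name, actual_types):
    if name not in SIGNATURES:
        return DYNAMIC                       # unknown runtime binding: skip checking
    expected, ret = SIGNATURES[name]
    for exp, act in zip(expected, actual_types):
        if DYNAMIC not in (exp, act) and exp != act:
            raise TypeError(f"{name}: expected {exp}, got {act}")
    return ret

declare("fullArc", [DYNAMIC] * 5, "void")    # a drawing primitive with relaxed types
print(check_call("fullArc", ["Object", "float", "float", "float", "int"]))  # -> void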


6 Interoperability

One of the advantages of developing a source-to-source compiler is the possibility of combining libraries that are written in different languages. The Racket platform encourages the use and development of different languages to fulfil programmers' needs, offering a set of extension mechanisms that can be applied to many of the language's features. The combination of Racket's language modules [6] and powerful hygienic macro system [16] enables users to extend the base Racket environment with new syntax and semantics that can be easily composed with modules written in different dialects.

To achieve interoperability with Racket, we developed Processing's compilation units as a Racket language module, adding Processing to Racket's language set. Nonetheless, compatibility issues between the languages arise when accessing exported bindings from a Racket module. First, a new require keyword was introduced to specifically import bindings from other modules. This require maps directly to Racket's require form, receiving the location of the module to import. By using Racket's require we have access to all of Racket's require semantics, enabling the programmer to select, exclude, or rename bindings imported from the required module.

Furthermore, Racket and Processing have different naming rules. For instance, foo-bar! is a valid identifier in Racket but not in Processing, thus we cannot reference the foo-bar! function in our Processing code. To solve this issue, we use a translation procedure that takes a Racket identifier and transforms it into a valid Processing identifier. For example, foo-bar! would be translated to fooBarBang. We apply this translation procedure to each binding provided by a required module, making it available to the requiring module. By providing an automatic translation, the developer's effort is reduced, as any Racket module can be quickly used from Processing code. Notwithstanding, as developers may not be satisfied with our automatic translation procedure, they can develop their own custom mappings in a Racket module that adheres to Processing's identifier rules.

Another issue that arises when importing foreign bindings is making them accessible to our custom environment and type-checker, as they are needed during the code analysis phase. To solve this issue, we dynamically load the required module, saving the exported bindings along with their arity. As Racket is dynamically typed, we use a special type for argument and return types that the type-checker skips. As a result, when using bindings with this type, typing errors will only be observed when these bindings are executed at runtime. To illustrate the interoperability mechanism, consider the foo-bar module in figure 2, which provides the foo-bar function, and the Processing code illustrated in figure 3. As illustrated in figure 3, the function checkFoo uses the foo-bar procedure from foo-bar.rkt. Note that our automatic translation procedure has been applied to the bindings provided by the foo-bar.rkt module, so in checkFoo we use the automatically translated fooBar identifier to refer to foo-bar.


#lang racket

(provide foo-bar)
(define (foo-bar foo)
  ...)

Fig. 2: The foo-bar module in Racket

#lang processing

require "foo-bar.rkt";
void checkFoo(String s) {
  println(fooBar(s));
}

Fig. 3: checkFoo in Processing
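The identifier translation applied above can be sketched as follows. The mapping of '!' to 'Bang' follows the foo-bar! → fooBarBang example in the text; the treatment of other characters is an assumption of ours.

SPECIAL = {"!": "Bang", "?": "P"}   # '!' -> 'Bang' as in the text; '?' -> 'P' is assumed

def to_processing_id(racket_id):
    """Translate a Racket identifier into a valid camelCase Processing identifier."""
    for symbol, word in SPECIAL.items():
        racket_id = racket_id.replace(symbol, "-" + word)
    head, *rest = [part for part in racket_id.split("-") if part]
    return head + "".join(part[:1].upper() + part[1:] for part in rest)

print(to_processing_id("foo-bar!"))   # -> fooBarBang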

To understand how this is accomplished, our require uses a custom macro that receives the module's path (i.e. the location of the required module), as well as a list of pairs that map the original bindings of the module into their mangled form. To compute this list, we used Racket's module->exports primitive to obtain the list of exported bindings. However, this information does not suffice, as we need to know the arity of each exported binding. This information is needed to produce a binding (i.e. a mangled binding) compatible with our generated code. Therefore, we analysed each binding reported by module->exports and retrieved its arity using the procedure-arity primitive. This way we can correctly perform the translation of external bindings to valid Processing identifiers and generate bindings that work with our code generation process. Lastly, when generating Racket code, our custom macro expands to Racket's require form, making each mangled binding available in the requiring module.
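The binding-discovery step has a rough Python analogue: dynamically load a module, enumerate its public callables and record each one's arity, so that compatible (mangled) bindings can be generated. This mirrors Racket's module->exports and procedure-arity only loosely and is not part of the implementation described above.

import importlib
import inspect

def exported_bindings(module_name):
    """Map each public callable of a module to its number of parameters."""
    module = importlib.import_module(module_name)
    bindings = {}
    for name, obj in vars(module).items():
        if name.startswith("_") or not callable(obj):
            continue
        try:
            arity = len(inspect.signature(obj).parameters)
        except (TypeError, ValueError):      # some builtins have no signature
            continue
        bindings[name] = arity
    return bindings

print(exported_bindings("json"))             # e.g. {'dump': ..., 'dumps': ..., 'load': ..., ...}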

7 Example

Developing a source-to-source compiler has the advantage of allowing us to explore libraries written in another language. We provide an example of our implementation, showing how Processing code can take advantage of libraries that were previously built for 3D modelling. This is still a work in progress, thus the compilation results are likely to change.

Consider the Processing code presented in figure 4. The mosaics procedure generates a grid of mosaics given the length of each mosaic and the total size of the grid. This function uses echo to generate the interior pattern of each mosaic, progressively generating smaller arcs from each corner of the mosaic. After generating the interior pattern, frame generates the full outer boundary of the grid. This example illustrates the use of two external Racket libraries. First, we require the fib.rkt module to use fib to compute the reducing factor of the arcs' size. This illustrates how we can use simple Racket code in our Processing code. Secondly, we require draw.rkt, which gives us access to fullArc and frame; these functions enable us to generate the arcs and produce the enclosing boundary. This example shows how we can use previously created Racket drawing libraries with our Processing implementation. Moreover, observe the use of the xyz primitive. Rosetta provides custom mechanisms to abstract coordinate systems, namely cartesian (xyz), polar (pol), and cylindrical (cyl), which can be used and combined interchangeably. As a result, we made these abstractions (xyz, pol, and cyl) available in our system, so that users can take advantage of them in their designs. Figure 5 illustrates an execution of the mosaics function in AutoCAD.


require "fib.rkt"; require "draw.rkt"; void echo(int n, Object pos, float ang, float r) { if ( n == 1) { fullArc( pos, r, ang, HALF_PI, 20); } else { fullArc( pos, r / fib(n), ang, HALF_PI, 20); echo(n-1, pos, ang, r); } } void mosaics(float l, int size) { for(int i = 0; i < size; i++) { for (int j = 0; j < size; j++) { echo(10, xyz(i*l, j*l, 0), 0, l); echo(10, xyz(i*l+l, j*l, 0), HALF_PI, l); echo(10, xyz(i*l+l, j*l+l, 0), PI, l); echo(10, xyz(i*l, j*l+l, 0), 3/2 * PI, l); } } frame(xyz(0,0,0), size * l, 20); }

Fig. 4: Processing code to generate mosaics in P2R

Observe the generated Racket code for echo displayed in figure 6. The first point that is immediately visible is that function identifiers are renamed to support multiple namespaces. We can see that the echo identifier is translated to echo-IOFF-fn; these tags indicate the argument types of the function, where I, O, and F represent the types int, Object, and float. Also note that imported bindings such as full-arc use the type O for their arguments, enabling the type-checker to correctly deal with these imported bindings. Functions and macros such as p-div, p-sub, or p-call are used to implement Processing's language primitives. Functions are defined within a let/ec form, to support return semantics in functions. However, let/ec is not always needed and can be removed, for instance, in the case of unnecessary tail returns or when functions have return type void.

8 Conclusion

Implementing Processing in Racket allows us to access libraries written in any language of the Racket ecosystem and to use DrRacket's IDE services, enabling users to choose the appropriate language for the job at hand.


Fig. 5: Mosaics generated in AutoCAD

As a result, taking advantage of Rosetta allows architects to use Processing to generate designs in a CAD tool, thus providing a motivating reason for the architecture community to use our system.

Our approach was to develop the parts of the language that Processing users need most, empowering them to write simple scripts. The implementation follows the common compiler pipeline architecture, generating semantically equivalent Racket code and loading it into Racket's VM. Moreover, we developed an interoperability mechanism to access Racket libraries, therefore making Rosetta's capabilities available in Processing. In the future, our goal is to further develop our existing work and progressively introduce more advanced mechanisms, such as implementing Processing's class system and exception handling. Other possible features would be visual support for REPL interactions or support for live coding.

Acknowledgements This work was partially supported by national funds through Fundação para a Ciência e a Tecnologia (FCT) with reference UID/CEC/50021/2013, and by the Rosetta project under contract PTDC/ATP-AQI/5224/2012.

(p-function (echo-IOFF-fn n p a r)
  (let/ec return
    (p-block
      (p-if (p-eq n 1)
        (p-block
          (p-call fullArc-OOOOO-fn p r ang HALF_PI h))
        (p-block
          (p-call fullArc-OOOOO-fn p (p-div r (p-call fib-O-fn n)) a HALF_PI h)
          (p-call echo-IOFF-fn (p-sub n 1) p a r))))))

Fig. 6: Generated Racket code for echo


References

1. Casey Reas and Ben Fry. Processing: programming for the media arts. AI & SOCIETY, 20(4):526–538, 2006.
2. M. Flatt and R.B. Findler. The Racket guide. http://docs.racket-lang.org/guide/, 2011. Accessed: 2014-05-02.
3. José Lopes and António Leitão. Portable generative design for CAD applications. In Proceedings of the 31st annual conference of the Association for Computer Aided Design in Architecture, pages 196–203, 2011.
4. Matthew Flatt. Creating languages in Racket. Communications of the ACM, 55(1):48–56, 2012.
5. Pedro Palma Ramos and António Menezes Leitão. An implementation of Python for Racket. 7th European Lisp Symposium, page 72, 2014.
6. Sam Tobin-Hochstadt, Vincent St-Amour, Ryan Culpepper, Matthew Flatt, and Matthias Felleisen. Languages as libraries. In Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation, pages 132–141. ACM, 2011.
7. John Maeda. Design by Numbers. MIT Press, Cambridge, MA, USA, 1999.
8. John Resig, Ben Fry, and Casey Reas. Processing.js, 2008.
9. Ari Bader-Natal. Sketchpad. http://sketchpad.cc/, 2011. Accessed: 2015-04-28.
10. Kathryn E. Gray and Matthew Flatt. Compiling Java to PLT Scheme. In Proc. 5th Workshop on Scheme and Functional Programming, pages 53–61, 2004.
11. Kathryn E. Gray and Matthew Flatt. ProfessorJ: a gradual introduction to Java through language levels. In Companion of the 18th annual ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications, pages 170–177. ACM, 2003.
12. Robert Bruce Findler, Cormac Flanagan, Matthew Flatt, Shriram Krishnamurthi, and Matthias Felleisen. DrScheme: A pedagogic programming environment for Scheme. In Proceedings of the 9th International Symposium on Programming Languages: Implementations, Logics, and Programs: Including a Special Track on Declarative Programming Languages in Education, PLILP '97, pages 369–388, London, UK, 1997. Springer-Verlag.
13. Jesse Louis-Rosenberg. OBJExport. http://n-e-r-v-o-u-s.com/tools/obj/, 2013. Accessed: 2015-04-29.
14. Scott Owens. Parser tools: lex and yacc-style parsing. http://docs.racket-lang.org/parser-tools/, 2011. Accessed: 2014-09-22.
15. M. Flatt and R.B. Findler. The Racket guide, chapter 10.3 Continuations. http://docs.racket-lang.org/guide/conts.html?q=continuations, 2011. Accessed: 2014-05-05.
16. Matthew Flatt. Composable and compilable macros: You want it when? SIGPLAN Not., 37(9):72–83, September 2002.


On Extending a Full-Sharing Multithreaded Tabling Design with Batched Scheduling

Miguel Areias and Ricardo Rocha

CRACS & INESC TEC and Faculty of Sciences, University of Porto Rua do Campo Alegre, 1021, 4169-007 Porto, Portugal {miguel-areias,ricroc}@dcc.fc.up.pt

Abstract. Tabling is a technique that overcomes some limitations of traditional Prolog systems in dealing with redundant sub-computations and recursion. When tabling is combined with multithreading, we have the best of both worlds, since we can exploit the combination of higher declarative semantics with higher procedural control. To support this combination, the Yap Prolog system has, at the engine level, multiple designs that vary from a No-Sharing design, where each thread allocates fully private tables, to a Full-Sharing (FS) design, where threads share the complete table space. In this work, we propose an extension to the table space data structures, which we named Private Answer Chaining (PAC), as a way to support batched scheduling evaluation with the FS design. Batched scheduling is one of the most successful tabling scheduling strategies, known to be useful when a tabled logic program requires an eager propagation of answers and/or does not require the complete set of answers to be found. Experimental results show that PAC is a good first approach, since with it the FS design remains quite competitive.

Keywords: Logic Programming, Multithreading, Tabling, Scheduling.

1 Introduction

Tabling [4] is a technique that overcomes some limitations of traditional Prolog systems in dealing with redundant sub-computations and recursion. Tabling consists in storing intermediate answers for subgoals in a proper data structure, called the table space, so that they can be reused when a repeated subgoal appears during the resolution process. Tabling has become a popular and successful technique thanks to the ground-breaking work in the XSB Prolog system and in particular in the SLG-WAM engine [8], the most successful engine of XSB. Implementations of tabling are now widely available in systems like Yap Prolog, B-Prolog, ALS-Prolog, Mercury, Ciao Prolog and, more recently, Picat.

Multithreading in Prolog is the ability to concurrently perform computations, in which each computation runs independently but shares the program clauses. When multithreading is combined with tabling, we have the best of both worlds, since we can exploit the combination of higher procedural control with higher declarative semantics. To the best of our knowledge, XSB [6] and Yap [2] are the only Prolog systems that support the combination of multithreading with tabling.


In this work, we will focus on Yap's implementation, which follows a SWI-Prolog compatible multithreading library [9]. For tabled evaluation, a thread views its tables as private but, at the engine level, Yap has three designs [2], which vary from a No-Sharing (NS) design, where each thread allocates private tables for each new tabled subgoal call, to a Full-Sharing (FS) design, where threads share the complete table space.

The decision about the evaluation flow is determined by the scheduling strategy. Different strategies may have a significant impact on performance, and may lead to a different ordering of solutions to the query goal. Arguably, the two most successful tabling scheduling strategies are local scheduling and batched scheduling [5]. Local scheduling tries to complete subgoals as soon as possible. When new answers are found, they are added to the table space and the evaluation fails. Local scheduling has the advantage of minimizing the size of clusters of dependent subgoals; however, it delays the propagation of answers and requires the complete evaluation of the search space. Batched scheduling favors forward execution first, backtracking next, and consuming answers or completion last. It thus tries to delay the need to move around the search tree by batching the return of answers to repeated subgoals. When new answers are found for a particular tabled subgoal, they are added to the table space and the evaluation continues. Batched scheduling can be a useful strategy in tabled logic programs that require an eager propagation of answers and/or do not require the complete set of answers to be found.

With the FS design, all tables are shared. Thus, since several threads can be inserting answers in the same table, when an answer already exists it is not possible to determine whether the answer is new or repeated for a particular thread without further support. For local scheduling this is not a problem since, for repeated and new answers alike, local scheduling always fails. The problem is with batched scheduling, which requires that only the repeated answers fail. Threads then have to detect, during batched evaluation, whether an answer is new and must be propagated or whether it is repeated and the evaluation should fail. In this work, we propose an extension to the table space data structures, which we named Private Answer Chaining (PAC), as a way to keep track, per thread and subgoal call, of the answers that were already found and propagated. We discuss in detail our proposal for extending the FS design with batched scheduling and we present a performance comparison between local and batched scheduling. Experimental results show that, despite the extra PAC data structures required to support batched scheduling with the FS design, the execution time of the combination is still quite competitive.

The remainder of the paper is organized as follows. First, we briefly introduce some background and related work. Then, we describe our PAC approach and discuss the most important implementation details. Finally, we present experimental results and end by outlining some conclusions.


2 Background

The basic idea behind tabling is straightforward: programs are evaluated by storing answers for tabled subgoals in an appropriate data space, called the table space. Repeated calls to tabled subgoals are not re-evaluated against the program clauses; instead, they are resolved by consuming the answers already stored in their table entries. During this process, as further new answers are found, they are stored in their tables and later returned to all repeated calls.

Figure 1 shows Yap's table space organization. At the entry point we have the table entry data structure. This structure is allocated when a tabled predicate is being compiled, so that a pointer to the table entry can be included in its compiled code. This guarantees that further calls to the predicate will access the table space starting from the same point. Below the table entry, we have the subgoal trie structure. Each different tabled subgoal call to the predicate at hand corresponds to a unique path through the subgoal trie structure, always starting from the table entry, passing by several subgoal trie data units, the subgoal trie nodes, and reaching a leaf data structure, the subgoal frame. The subgoal frame stores additional information about the subgoal and acts like an entry point to the answer trie structure. Each unique path through the answer trie data units, the answer trie nodes, corresponds to a different answer to the entry subgoal.

Fig. 1. Table space organization
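A minimal sketch of the trie idea (ours, not Yap's actual node layout): inserting the tokens of a subgoal call walks, and creates on demand, the unique path for that call, ending at a leaf object that plays the role of the subgoal frame.

class TrieNode:
    def __init__(self):
        self.children = {}        # token -> child trie node
        self.leaf = None          # plays the role of the subgoal frame

def lookup_or_insert(root, tokens):
    """Walk (creating nodes as needed) the unique path for this token sequence."""
    node = root
    for token in tokens:
        node = node.children.setdefault(token, TrieNode())
    if node.leaf is None:
        node.leaf = {"answers": []}   # first occurrence of this subgoal call
    return node.leaf

subgoal_trie = TrieNode()
frame1 = lookup_or_insert(subgoal_trie, ["path", "VAR0", "VAR1"])  # new call
frame2 = lookup_or_insert(subgoal_trie, ["path", "VAR0", "VAR1"])  # repeated call
print(frame1 is frame2)    # True: the repeated call reuses the stored answers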

2.1 Yap’s Multithreaded Tabling Support

In Yap, a thread views its tables as private but, at the engine level, it implements three designs for concurrent tabling support that vary from a No-Sharing (NS) design, where each thread allocates fully private tables, to a Full-Sharing (FS) design, where threads share the complete table space. Figure 2 shows Yap's multithreaded table space organization for the NS and FS designs, where an interface layer abstracts the design being used at the engine level. The figure illustrates the main differences between the two designs for a situation where several threads are evaluating the same tabled subgoal call call_i. When using the NS design, one can observe that the table entry data structure still stores the common information for the predicate (such as the arity or the scheduling strategy), and then each thread t has its own cell Tt inside a bucket array which points to the private data structures.

1 We are considering variant-based tabling [7]. Two tabled subgoals A and B are variants if they can be made identical by variable renaming. For example, p(X,1,Y) and p(Y,1,Z) are variants because both can be transformed into p(VAR0,1,VAR1).


Fig. 2. Yap's multithreaded table space organization for the NS and FS designs

When using the FS design, the subgoal and answer trie structures and part of the subgoal frame information (the subgoal entry data structure in Fig. 2) are shared among all threads. The previous subgoal frame data structure was split in two: the subgoal entry stores common information for the subgoal call (such as the pointer to the shared answer trie structure); the remaining information is kept private to each thread in the subgoal frame data structure.

2.2 Scheduling Strategies

Local scheduling evaluates a tabled logic program in a breadth-first manner. It favors backtracking first, with completion, instead of forward execution, leaving the consumption of answers for last. Local scheduling only allows a Cluster of Dependent Subgoals (CDS) to return answers after a fix-point has been reached [5]. In other words, local scheduling tries to keep a CDS as small as possible, thus creating less complex dependencies between subgoals, which leads to earlier completion of subgoals.

On the other hand, batched scheduling evaluates a tabled logic program in a depth-first manner. It favors forward execution first instead of backtracking, leaving the consumption of answers and completion for last. It thus tries to delay the need to move around the search tree by batching the return of answers. When new answers are found for a particular tabled subgoal, they are added to the table space and the execution continues. In some situations, this results in creating dependencies to older subgoals, therefore enlarging the current CDS and delaying the fix-point that guarantees that all dependent subgoals in a CDS are completely evaluated [8].


Batched scheduling can be a useful strategy in tabled logic programs that require an eager propagation of answers and/or do not require the complete set of answers to be found.

3 Extending Full-Sharing with Batched Scheduling

In this section, we describe our proposal to support the combination of batched scheduling with the FS design. In the original FS design, answer propagation and answer representation are both stored in the answer trie data structure, thus threads are unable to distinguish whether or not they have already propagated an answer stored in the table space. To solve that, we propose an extension to the table space data structures, which we named Private Answer Chaining (PAC), as a way to keep track, per thread and subgoal call, of the answers that were already found and propagated to the thread's repeated calls. Figure 3 illustrates PAC's key idea. In a nutshell, PAC splits answer propagation from answer representation, and allows the first to be privately stored in the subgoal frame data structure of each thread, and the second to be kept publicly shared among threads in the answer trie data structure.

Fig. 3. PAC overview

3.1 Our Approach

The PAC procedure works at the subgoal frame level. The key idea is to extend subgoal frames with an auxiliary private chaining of answers for each subgoal call, in order to keep track of the answers already found for the call. Later, if a thread completes a subgoal's evaluation, i.e., if the subgoal's table is marked as complete, its PAC is made public, so that from that point on all threads can use that chain in complete (read-only) mode. Figure 4 illustrates the new data structures involved in the implementation of our PAC proposal for a situation where different threads are evaluating the same tabled subgoal call call_i. Figure 4(a) shows a situation where two threads, T1 and Tk−2, are sharing the same subgoal entry for a call call_i still under evaluation, i.e., still not yet completed. The current state of the evaluation shows an answer trie with 3 answers found for call_i.



Fig. 4. PAC’s data structures for (a) private and (b) public chaining

For the sake of simplicity, we are omitting the internal answer trie nodes and only showing the leaf nodes LN1, LN2 and LN3 of each answer. With PAC support, the leaf nodes are not chained in the answer trie data structure, as would usually happen. Instead, the chaining process is done privately, and for that we use the subgoal frame structure of each thread. On the subgoal frame structure we added a new field, called Answers, to store the answers found within the execution of the thread. In order to minimize PAC's impact, each answer node in the private chaining has only two fields: (i) an entry pointer, which points to the corresponding leaf node in the answer trie data structure; and (ii) a next pointer to chain the nodes in the private chaining. To maintain good performance, when the number of answer nodes exceeds a certain threshold, we use a hash trie mechanism similar in design to the one presented in [3], but without concurrency support, since this mechanism is private to each thread.
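To make this layout concrete, the following minimal C++ sketch shows how such a two-field chain node and its per-thread check/insert could look. All names are hypothetical and the sketch does not reproduce Yap's actual engine code; in particular, the hash trie variant used beyond the threshold is omitted.

// Hypothetical sketch of a PAC answer-chain node and the per-thread chain
// kept in the subgoal frame. Illustrative only; not Yap's data structures.
struct AnswerTrieNode;               // leaf node in the shared answer trie

struct PacNode {
  AnswerTrieNode *entry;             // (i) pointer to the shared answer trie leaf
  PacNode        *next;              // (ii) link to the next node in the private chain
};

struct PrivateSubgoalFrame {
  PacNode *answers = nullptr;        // head of the thread-private Answers chain

  // Returns the chain node for 'leaf', inserting it if not yet present;
  // 'existed' reports whether the answer had already been chained (and
  // therefore already propagated) by this thread.
  PacNode *check_insert(AnswerTrieNode *leaf, bool &existed) {
    for (PacNode *n = answers; n != nullptr; n = n->next)
      if (n->entry == leaf) { existed = true; return n; }
    existed = false;
    answers = new PacNode{leaf, answers};   // push on the private chain
    return answers;
  }
};

In this sketch the mere presence of a node plays the role of the "answer marked as found" test used below; switching to the hash trie once the chain grows past the threshold would replace the linear scan.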


PAC's data structures in Fig. 4(a) thus represent two different situations. Thread T1 has only found one answer and is using a direct answer chaining to access the leaf node LN1. Thread Tk−2 has already found three answers for call_i and is using the hash trie mechanism within its private chaining. In the hash trie mechanism, the answer nodes are still chained between themselves, so that repeated calls belonging to thread Tk−2 can consume the answers as in the original mechanism. Figure 4(b) shows the state of the subgoal call after completion. When a thread T completes a subgoal call, it frees its private consumer structures, but before doing that, it checks whether another thread has already marked the subgoal as completed. If no other thread has done that, then thread T not only follows its private chaining mechanism, as it would for freeing its private nodes, but also follows the pointers to the answer trie leaf nodes in order to create a chain inside the answer trie. Since this procedure is done inside a critical region, no more than one thread can be doing this chaining process. Thus, in Fig. 4(b), we are showing the situation where the subgoal call call_i is completed and both threads T1 and Tk−2 have already chained the leaf nodes inside the answer trie and removed their private chaining structures.

3.2 Implementation Details

The major difference between local and batched scheduling, at the engine level, is in the tabled new answer operation, where we decide what to do when a new answer is found during the evaluation. This operation checks whether a newly found answer is already in the corresponding answer trie structure and, if not, inserts it. For local scheduling, it then fails and, for batched scheduling, it proceeds with forward execution. Algorithm 1 shows how we have extended this operation to support the FS design with batched scheduling.

Algorithm 1 tabled new answer(answer ANS, subgoal frame SF)
 1: leaf ← check insert answer trie(ANS, SF)
 2: chain ← check insert consumer chain(leaf, SF)
 3: if is answer marked as found(chain) then
 4:    return failure
 5: else {the answer is new}
 6:    mark answer as found(chain)
 7:    if local scheduling mode(SF) then
 8:       return failure
 9:    else {batched scheduling mode}
10:       return proceed

The algorithm receives two arguments: the newly found answer during the evaluation (ANS) and the subgoal frame which corresponds to the call at hand (SF ). The algorithm begins by checking/inserting the given ANS into the answer trie structure, which will return the leaf node for the path representing ANS (line 1). Then, it checks/inserts the given leaf node into the private chaining for the


current thread, which will return the corresponding answer chain node (line 2). Next in line 3, it tests whether the answer chain node already existed in the chain, i.e., if it was inserted or not by the current check/insert operation in order to return failure (line 4), or it proceeds with marking the answer ANS has found (line 6). At the end (lines 7 to 10), it returns failure, if local scheduling is active (line 8), otherwise, batched scheduling is active, and it proceeds by propagating the answer ANS to the current execution environment (line 10).

4 Performance Analysis

We now present experimental results about the usage of PAC in the FS design with batched scheduling. Since without PAC the FS design would not be able to be used with batched scheduling, to put PAC's results in perspective, we also show the results for local scheduling and for the NS design. The environment for our experiments was a machine with a 32-core AMD Opteron(TM) Processor 6274 (2 sockets with 16 cores each) and 32GB of main memory, running the Linux kernel 3.16.7-200.fc20.x86_64 with Yap Prolog 6.3. For the experiments, we used the TabMalloc memory allocator [1] and the set of benchmarks described in [1]. These benchmarks create worst-case scenarios, where we are able to show the lowest bounds of performance that each design might achieve when applied/used in other real-world applications/programs. Table 1 shows the overhead ratios, when comparing against the NS design with 1 thread (running with local scheduling and without TabMalloc), for the NS and FS designs with 1, 8, 16, 24 and 32 threads, using local scheduling (column Local) and batched scheduling (column Batched) strategies with TabMalloc. In order to give a fair weight to each benchmark, the overhead ratio is calculated as follows. We begin by running each benchmark B ten times for each design D with T threads. Then, we calculate the average of those ten runs and use that value (D_BT) to put it in perspective against the base time, which is the average of 10 runs of the NS design with 1 thread (NS_B1). For that, we use the overhead formula O_DBT = D_BT / NS_B1. After calculating all the overheads O_DBT for a certain design D and number of threads T over the several benchmarks B, we calculate the respective minimum, average, maximum and standard deviation overhead ratios (rows Min, Avg, Max and StD in Table 1). By observing Table 1, we can see that batched scheduling always achieves the best minimum overhead ratio in the FS design but, for the average and maximum overhead ratios, the best strategy is always local scheduling. For the average and maximum overhead ratios, the difference between local and batched scheduling in the FS design is slightly higher than in the NS design, which can be read as an indication of the overhead that PAC introduces into the FS design. Recall that whenever an answer is found during the evaluation, PAC requires that threads traverse their private consumer data structures to check if the answer was already found (and propagated).


Table 1. Overhead ratios, when compared with the NS design with 1 thread (running with local scheduling and without TabMalloc) for the NS and FS designs (with TabMalloc) when running 1, 8, 16, 24 and 32 threads with local and batched scheduling (best ratios by row and design for the Minimum, Average and Maximum are in bold)

Threads        NS Local  NS Batched  FS Local  FS Batched
 1   Min          0.53        0.55      1.01        0.95
     Avg          0.78        0.82      1.30        1.46
     Max          1.06        1.05      1.76        2.33
     StD          0.15        0.14      0.22        0.44
 8   Min          0.66        0.63      1.16        0.99
     Avg          0.85        0.88      1.88        1.95
     Max          1.12        1.14      2.82        3.49
     StD          0.13        0.14      0.60        0.79
16   Min          0.85        0.75      1.17        1.06
     Avg          0.98        1.00      1.97        2.08
     Max          1.16        1.31      3.14        3.69
     StD          0.09        0.17      0.65        0.83
24   Min          0.91        0.93      1.16        1.09
     Avg          1.15        1.16      2.06        2.19
     Max          1.72        1.60      3.49        4.08
     StD          0.20        0.21      0.70        0.91
32   Min          1.05        1.04      1.33        1.26
     Avg          1.51        1.49      2.24        2.41
     Max          2.52        2.63      3.71        4.51
     StD          0.45        0.45      0.74        1.02
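The ratios in Table 1 follow the formula O_DBT = D_BT / NS_B1 given above. As a minimal sketch (assuming the per-benchmark averages of the 10 runs are already available, and using the population standard deviation, which the paper does not specify), the per-row statistics could be computed as follows:

#include <algorithm>
#include <cmath>
#include <numeric>
#include <vector>

struct Stats { double min, avg, max, std; };

// design_times[b] is D_BT for benchmark b; base_times[b] is NS_B1 for b.
Stats overhead_stats(const std::vector<double> &design_times,
                     const std::vector<double> &base_times) {
  std::vector<double> o;
  for (std::size_t b = 0; b < design_times.size(); ++b)
    o.push_back(design_times[b] / base_times[b]);             // O_DBT
  if (o.empty()) return {0.0, 0.0, 0.0, 0.0};
  double avg = std::accumulate(o.begin(), o.end(), 0.0) / o.size();
  double var = 0.0;
  for (double x : o) var += (x - avg) * (x - avg);
  return { *std::min_element(o.begin(), o.end()), avg,
           *std::max_element(o.begin(), o.end()), std::sqrt(var / o.size()) };
}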

As we increase the number of threads, for the NS design, both scheduling strategies show very close minimum, average and maximum overhead ratios. For the FS design, the differences are slightly higher. However, for the average overhead ratio, the results of both strategies are quite close, with batched scheduling being around 10% slower than local scheduling for the FS design. In summary, our experimental results show that, on average, the PAC strategy does not seem to have a big impact on performance; however, it still leaves room for further improvements, since the difference between local and batched scheduling is higher in the FS design than in the NS design.

5 Conclusions and Further Work

Local and batched scheduling are arguably two of the most well-known tabling scheduling strategies. The major difference between both is that local scheduling propagates answers only after all answers are found, while batched scheduling propagates answers immediately after they are found. Batched scheduling is a useful strategy in tabled logic programs that require an eager propagation of answers and/or do not require the complete set of answers to be found.


In this work, we have presented the PAC strategy, which is a simple and novel approach for combining the FS design with batched scheduling. PAC splits answer representation from answer propagation, and allows the first to be publicly shared among threads while keeping the second private to each thread. Experimental results in worst-case scenarios showed that, on average, the PAC strategy does not seem to have a big impact on performance; however, it still leaves room for further improvements, especially in the extra structures required to control the propagated answers. Further work will include the usage of time-stamped tries to minimize the search for the propagated answers and new real-world problems that will allow us to improve and consolidate our framework.

Acknowledgments

This work is partially funded by the North Portugal Regional Operational Programme (ON.2 – O Novo Norte) and by the National Strategic Reference Framework (NSRF), through the European Regional Development Fund (ERDF) and the Portuguese Foundation for Science and Technology (FCT), within projects NORTE-07-0124-FEDER-000059 and UID/EEA/50014/2013.

References

1. Areias, M., Rocha, R.: An Efficient and Scalable Memory Allocator for Multithreaded Tabled Evaluation of Logic Programs. In: International Conference on Parallel and Distributed Systems. pp. 636–643. IEEE Computer Society (2012)
2. Areias, M., Rocha, R.: Towards Multi-Threaded Local Tabling Using a Common Table Space. Journal of Theory and Practice of Logic Programming, International Conference on Logic Programming, Special Issue 12(4 & 5), 427–443 (2012)
3. Areias, M., Rocha, R.: A Lock-Free Hash Trie Design for Concurrent Tabled Logic Programs. International Journal of Parallel Programming pp. 1–21 (2015)
4. Chen, W., Warren, D.S.: Tabled Evaluation with Delaying for General Logic Programs. Journal of the ACM 43(1), 20–74 (1996)
5. Freire, J., Swift, T., Warren, D.S.: Beyond Depth-First: Improving Tabled Logic Programs through Alternative Scheduling Strategies. In: International Symposium on Programming Language Implementation and Logic Programming. pp. 243–258. No. 1140 in LNCS, Springer (1996)
6. Marques, R., Swift, T.: Concurrent and Local Evaluation of Normal Programs. In: International Conference on Logic Programming. pp. 206–222. No. 5366 in LNCS, Springer (2008)
7. Ramakrishnan, I.V., Rao, P., Sagonas, K., Swift, T., Warren, D.S.: Efficient Access Mechanisms for Tabled Logic Programs. Journal of Logic Programming 38(1), 31–54 (1999)
8. Sagonas, K., Swift, T.: An Abstract Machine for Tabled Execution of Fixed-Order Stratified Logic Programs. ACM Transactions on Programming Languages and Systems 20(3), 586–634 (1998)
9. Wielemaker, J.: Native Preemptive Threads in SWI-Prolog. In: International Conference on Logic Programming. pp. 331–345. No. 2916 in LNCS, Springer (2003)

An AST-based Tool, Spector, for Plagiarism Detection: The Approach, Functionality, and Implementation

Vítor T. Martins, Pedro Rangel Henriques, and Daniela da Cruz

Universidade do Minho, Departamento de Informática / Centro Algoritmi, 4710-057 Braga, Portugal http://www.di.uminho.pt/eng/ {vtiagovm,pedrorangelhenriques,danieladacruz}@gmail.com

Abstract. We propose a methodology using abstract syntax trees for the detection of plagiarism in source code, within an academic environment. We show the architecture and the decisions that led to our own solution (Spector), after conducting a study of the existing methods and tools. An example is then presented, which goes through and explains each of the algorithm's steps. Finally, conclusions are drawn, noting that such a system, while not the most efficient, produces accurate results.

Keywords: software, plagiarism, detection, comparison, test

1 Introduction

In our previous work [3], we discuss the results of our search for source code plagiarism detection methodologies and tools. We found several candidates but, after testing their accuracy using files modified to hide plagiarism, we saw that most solutions had trouble with some cases. Given the desire to make detections upon programs and our experience in language engineering, we were motivated to use a compilation approach. After some research and discussion, we chose to use Abstract Syntax Trees (ASTs) since, by abstracting source code, we can analyze its functionality. Such tools exist and were said to be accurate [1, 2] but were not available for download or use. To counteract that lack, our contribution is to develop one such tool and make it openly available.

2 Approach

Fig. 1 shows the representation of an AST generated from a source code. We can see that the original source code defines a function (sum) that adds two integers and uses it to print an equation that adds 4 to 7.


Fig. 1. An AST generated from a source code

Knowing that this representation gives us an ordered network of nodes, while retaining the information about their type and contents, it is easy to see that it provides enough information for source code comparison.
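The program behind Fig. 1 is not reproduced in this excerpt; a hypothetical example matching its description (a sum function over two integers, used to print an equation adding 4 to 7) could look as follows, written here in C++ purely for illustration — the language and exact code of the figure are assumptions.

#include <iostream>

// Hypothetical reconstruction of the kind of program described for Fig. 1.
int sum(int a, int b) { return a + b; }

int main() {
  std::cout << "4 + 7 = " << sum(4, 7) << std::endl;
  return 0;
}

An AST for such a program contains, among others, a function-definition node for sum with its parameters and return expression as children, and a call node with the literal arguments 4 and 7 — exactly the kind of typed, ordered structure the comparison works on.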

In order to facilitate the generation of ASTs, we have chosen to use ANTLR¹ [4], which allows us to generate, from grammar files, parsers that can translate source code into an AST automatically.

2.1 Target Characteristics

As previously said, by using an intermediate abstraction, we can easily ignore specific characteristics. So we will focus on those that can be modified without altering the source code functionality, namely Identifiers (the names of classes, variables, etc.), Expression elements (e.g.: x, >, 1), Conditionals (like if and while) and Blocks (a group of statements enclosed between {}). This means we will ignore other characteristics, like comments, as we are focusing on the source code's functionality. For example, if we wanted to compare Identifiers we do not need to check their scope or type; we can simply see if they are used in the same places and have similar behaviors.

1 ANother Tool for Language Recognition


2.2 Architecture

After making our decisions on how our system would work, we gave it the name of Spector (Source insPECTOR). We also produced a diagram (in Fig. 2) that shows the various parts and how they relate.

Fig. 2. A diagram of the interaction between the system's parts

Fig. 2 shows that, given a Grammar G, we can produce a Parser+TreeBuilder TB, using ANTLR. This TB can then be used to translate a Program P into an AST. Given a pair of Programs (let us say P1 and P2), a TB will be used to produce AST1 and AST2. These ASTs will then be delivered to an Inspector which will produce measures and send them to a Presenter which will output the results. To support a new language, we would simply replace G and generate a new TB.

3 Functionality

The functionality is split into 5 algorithms that each produce a measure, along with a final one that calculates the global measure. Given source codes S1 and S2, we first check whether they are exact copies by comparing: the number of nodes, the number of nodes by category², and finally, the contents of every node. When all those comparisons match, we return a measure of 100%, which avoids using the other algorithms. For other cases, we have algorithms for each target (listed in Section 2.1). While these algorithms work in different ways, they follow a base functionality:

1. Build a map per source, associating elements to their occurrences (M1, M2),
2. Calculate the highest number of items between both Ms (A),
3. For each pair of items between M1 and M2,
   (a) If their number of occurrences is similar,
       i. Add the pair to a map (Candidates),
4. Calculate the size of the Candidates map (B),
5. Measure += (B/A) ∗ W1,
6. For each pair in the Candidates map,
   (a) If their number of occurrences by category is similar,
       i. Add the pair to a map (Suspects),
7. Calculate the size of the Suspects map (C),
8. Measure += (C/B) ∗ W2,
9. For each pair in the Suspects map,
   (a) If they have a similar behavior (specific to each algorithm),
       i. Add the pair to a map (Equivalences),
10. Calculate the size of the Equivalences map (D),
11. Measure += (D/C) ∗ W3,
12. For each pair of identifier names in the Equivalences map,
    (a) If their contents are the same,
        i. Add the pair to a map (Copies),
13. Calculate the size of the Copies map (E),
14. Measure += (E/D) ∗ W4,
15. Return Measure.

2 In grammatical terms, these categories are the terminals that were assigned to integers by ANTLR (in a .tokens file).

The Wi variables indicate weight constants and were given values of 0.18, 0.42, 0.38 and 0.02, respectively. We chose these values based on a few tests; however, they should be adjusted through further testing. Of course, each algorithm targets something different, so they have the following differences (a schematic sketch of the shared staged computation is given after these descriptions):

Algorithm that detects Identifiers: The map (M) associates an Identifier name to its Occurrence nodes (IOM); occurrences are similar if their parent nodes have the same category and the (non-identifier) neighbors³ have the same contents.
Algorithm that detects Expression elements: The map (M) associates an Expression element to its Occurrence nodes (EOM); an occurrence is similar to those whose (non-identifier) neighbors' contents are equal.
Algorithm that detects Conditionals: The map (M) associates a Conditional to its Condition node (CCM); a conditional is similar to those whose condition has (non-identifier) nodes with the same contents.
Algorithm that detects Blocks: In this case, two maps are created: one associates a Block node to a Name⁴ (BNM) and the other associates each Block node to all of its Children (BCM). The children are: every node inside the block along with the nodes from called blocks. This is done by checking if nodes are calls to internal methods, in which case the contents of the called block are added.
Main Algorithm: This algorithm calculates a final measure from those produced by the previous algorithms. That final measure is a similarity measure of a pair of Suspects.

3 The other children of this node's parent.
4 This name is the identifier of the parent block; in other words, "mainblock" has a block with "main" as its name.
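The staged structure shared by the four target-specific algorithms (Candidates, Suspects, Equivalences and Copies, weighted by W1 to W4) can be pictured with the following schematic C++ sketch. The names, the predicate parameters and the occurrence-count test are hypothetical simplifications; Spector's actual code is organized differently.

#include <algorithm>
#include <cstddef>
#include <functional>
#include <map>
#include <string>
#include <utility>
#include <vector>

// One occurrence list per element (identifier, expression element, ...).
using OccurrenceMap = std::map<std::string, std::vector<int>>;
using Pair = std::pair<std::string, std::string>;

// Schematic version of the staged measure; the three predicates abstract the
// per-target notions of similarity described above.
double staged_measure(const OccurrenceMap &m1, const OccurrenceMap &m2,
                      const std::function<bool(const Pair &)> &similar_by_category,
                      const std::function<bool(const Pair &)> &similar_behavior,
                      const std::function<bool(const Pair &)> &same_contents,
                      double w1, double w2, double w3, double w4,
                      double threshold) {
  auto close = [&](std::size_t a, std::size_t b) {   // occurrence-count test
    return b >= a * (1.0 - threshold) && b <= a * (1.0 + threshold);
  };
  std::size_t A = std::max(m1.size(), m2.size());                 // step 2
  if (A == 0) return 0.0;
  std::vector<Pair> candidates, suspects, equivalences, copies;
  for (const auto &e1 : m1)                                       // step 3
    for (const auto &e2 : m2)
      if (close(e1.second.size(), e2.second.size()))
        candidates.push_back({e1.first, e2.first});
  double measure = w1 * double(candidates.size()) / A;            // steps 4-5
  for (const auto &p : candidates)                                // step 6
    if (similar_by_category(p)) suspects.push_back(p);
  if (!candidates.empty())
    measure += w2 * double(suspects.size()) / candidates.size();  // steps 7-8
  for (const auto &p : suspects)                                  // step 9
    if (similar_behavior(p)) equivalences.push_back(p);
  if (!suspects.empty())
    measure += w3 * double(equivalences.size()) / suspects.size();   // 10-11
  for (const auto &p : equivalences)                              // step 12
    if (same_contents(p)) copies.push_back(p);
  if (!equivalences.empty())
    measure += w4 * double(copies.size()) / equivalences.size();  // 13-14
  return measure;                                                 // step 15
}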


Let us consider that each algorithm was implemented in a method Method_i, where i is a number from 1 to 5, with Method_1 being the one which determines if two source codes are exact copies. The algorithm works as follows:
1. X = Array with 5 elements,
2. For each method i,
   (a) X[i] = Method_i(S_A, S_B),
3. If X[1] is different from 0,
   (a) Return X[1].
4. Otherwise,
   (a) Calculate the number of Xs from 2 to 5 that are not 0 (A),
   (b) Measure = ((X[2] + X[3] + X[4] + X[5]) / A) ∗ 100,
   (c) Return Measure.
As we can see, the algorithm either returns the X[1] measure (which is either 0% or 100%) or the average computed using the other results (X[2] to X[5]) that were not 0%.

Threshold. Since the algorithms match the number of occurrences when checking if the elements should be added to a Candidates map, the comparisons would be limited to cases with an equal number of occurrences. This led us to add a similarity threshold, which specifies the strictness of the comparisons. As an example: if we were comparing the number of nodes within two blocks and the first had 10 nodes, a threshold of 20% means that the second AST must have between 8 and 12 nodes to be considered similar.
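A minimal illustration of this check (a hypothetical helper, not Spector's code) follows; with count_a = 10 and threshold = 0.20 it accepts count_b values between 8 and 12.

// Returns true when count_b lies within +/- threshold of count_a.
bool within_threshold(int count_a, int count_b, double threshold) {
  double lo = count_a * (1.0 - threshold);
  double hi = count_a * (1.0 + threshold);
  return count_b >= lo && count_b <= hi;
}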

4 Implementation

To keep Spector modular, we split its functionality into two packages: a lang package, which contains a Suspect class that holds the input generated from a source code and, for each language, the Parser+TreeBuilder classes and a Nexus class which interfaces with them; and a spector package, which contains the main classes (Spector, Inspector and Presenter) along with their auxiliary classes (FileHandler and Comparison).

4.1 Features

We list below the main features provided by our tool:
1. Can output summary and/or detailed results,
2. Accepts submissions as groups of files,
3. Works offline,
4. Available as Open Source.
It is important to note that the first two are relevant for any plagiarism detector and the other two are crucial since we want the tool to be available for integration into other systems.


5 Example

Due to the page limit, the example can be viewed at the following website: http://www3.di.uminho.pt/~gepl/Spector/paper/slate15/examples/algorithms.pdf

6 Conclusion

In this paper, we have discussed our approach to building Spector, a tool that detects plagiarism in source code. We have defined its structure and the decisions that were behind it. We have also seen the algorithms that drive the comparisons and how they are used together to produce similarity measures. Seeing as we focused on detecting similarity between source code structures, the resulting tool is a source code similarity detector and is likely to report false positives when faced with smaller code. As future work we have the improvement of the results in terms of information about the associations established, along with the upgrade of the Java grammar to support the latest version of the Java language and the extension with grammars to cope with other languages. The next step will be to perfect the tool's implementation and test it against bigger test cases so that an optimal threshold and weights may be determined.

References

1. Bahtiyar, M.Y.: JClone: Syntax tree based clone detection for Java. Master's thesis, Linnæus University (2010)
2. Cui, B., Li, J., Guo, T., Wang, J., Ma, D.: Code comparison system based on abstract syntax tree. In: Broadband Network and Multimedia Technology (IC-BNMT), 2010 3rd IEEE International Conference on. pp. 668–673 (2010)
3. Martins, V.T., Fonte, D., Henriques, P.R., da Cruz, D.: Plagiarism Detection: A Tool Survey and Comparison. In: Pereira, M.J.V., Leal, J.P., Simões, A. (eds.) 3rd SLATE. OASIcs, vol. 38, pp. 143–158. Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik, Dagstuhl, Germany (2014), http://drops.dagstuhl.de/opus/volltexte/2014/4566
4. Parr, T.J., Quong, R.W.: ANTLR: A predicated-LL(k) parser generator. Software—Practice and Experience 25(7), 789–810 (1995), http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.15.70

Efficient representation of RDF datasets

Jakub Swacha¹, Szymon Grabowski²

1 University of Szczecin, Institute of Information Technology in Management, Mickiewicza 64, 71-101 Szczecin, Poland [email protected] 2 Lodz University of Technology, Institute of Applied Computer Science, Al. Politechniki 11, 90-924 Łódź, Poland [email protected]

Abstract. The constant growth of structured data, often in the form of RDF, demands efficient compression methods to facilitate its storage and transmission. We propose an RDF compression algorithm that produces a succinct representation of RDF datasets. It consists of two stages. The first splits the input triples into multiple streams and applies tailored compaction techniques to each stream. In the second, a general-purpose compression algorithm is applied. We experimentally show on a number of datasets that the proposed algorithm achieves compression ratios significantly better than the RDF compressors known from the literature.

1 Introduction

The Resource Description Framework (RDF) is a family of W3C specifications used for modeling information on the Web. RDF is a vital part of the Semantic Web vision in which ultimately all information on the Internet is machine-processable, shifting the (still dominating) document-centric perspective to the data-centric one. Although still in infancy, large RDF datasets appear and grow at a fast pace, currently containing in total over 25 billion triples (http://www4.wiwiss.fu-berlin.de/lodcloud/). RDF statements have a subject, a predicate (also known as a property) and an object term. The domains of subjects and objects are typically partially shared. Note that an RDF dataset may be perceived as a directed labeled graph, with possibly more than a single label (or labeled edge) between a subject and an object. Some RDF domains have been adopted, for example, in life sciences (e.g., Uniprot), geography (e.g., Geonames), sensor data from weather stations (LinkedSensorData) and open government (e.g., U.S. data.gov, U.S. Census data). Another notable effort is DBpedia, a dataset containing data extracted from Wikipedia, with about 2.6M concepts described by 247M triples. This growth raises challenges and requires a succinct representation of the RDF repositories, with two goals in mind: possibly small compressed size for distribution and exchange, and possibly small representation supporting queries.


Originally (in 1999), an XML syntax for the RDF data model was recommended, but other serialization choices are also in use and even gaining popularity. One of these alternatives is N-Triples (http://www.w3.org/2001/sw/RDFCore/ntriples/), a line-based plain-text format. The outline of this article is as follows. In the next section we give a brief outline of the RDF compression area. Section 3 presents our algorithm, and its experimental results are shown in Section 4. The last section concludes.

2 Related work

RDF compression is a relatively young research topic. Early approaches map RDF to a relational database. This can be achieved basically using one of three possibilities. The simplest and most natural is just to store all triples in a single 3-attribute table [5], an approach known under the name of a triple-store and used in the RDF storage systems Jena, Oracle, Sesame and 3store. Since URIs and literals tend to reoccur, and are relatively long, many such solutions do not store entire strings in the table, substituting them with their shorter versions or mapping them to numerical IDs. Still, in [1] it was shown that triple-stores hardly scale if the number of triples exceeds 50 million.

An alternative, known for faster data access, is property tables. There are two variants of this approach. In the clustered property tables variant, several tables are built and the attributes (columns) in each are properties common to the triples stored in those tables. The left-over triples are stored in a triple table. In a second variant, a property-class table, the type property of subjects is used to cluster similar sets of subjects. Property tables can reduce subject-subject self-joins of the triples tables, yet if a query requires combining data from several tables, they become problematic [19, 1]. Abadi et al. [1] point out other issues with property tables: many NULL values in the tables (since real RDFs are not very structured or complete) and the (surprising) abundance of multi-valued attributes, even some that seemingly should be single-valued (title of a book). The former deteriorates the performance, the latter requires storing multi-valued properties as e.g. lists (together with other attributes in the same table), which complicates query handling.

Yet another RDBMS-based approach, called vertical partitioning, is to have one table per property [1]. In other words, its idea is to group triples by predicate and thus to obtain many 2-attribute tables, one for each predicate value (in many cases the number of used predicates is indeed small, e.g. less than 200). Other works should also be mentioned: the Hexastore [18] and RDF-3X [15] systems create indexes for all six element ordering combinations. RDF-3X additionally applies gap compression in leaves of the underlying B+-tree, to make the indexes more compact. BitMap [3] also applies gap compression, for 2D matrices: SO and OS for each predicate, PO for each subject and PS for each subject.

All the mentioned schemes are not particularly succinct. Much better results have been achieved in two works from the same team.


In [8] a compression scheme without random access to data (although with some random-access-friendly helper structures) was presented, grouping triples with the common subject into adjacency lists, and storing ordered object IDs for each property value bound with that particular subject. The obtained sequences are then encoded statistically (with Huffman encoding as a major component) and the dictionary of strings (for S, P, and O values) is encoded with PPMd³. Finally, in [2] a scheme offering both good compression and fast query handling was proposed. This is based on the k2-tree [4] data structure, a pointerless variant of the well-known quad-tree. In the RDF case, for each property value a single k2-tree is kept, representing a binary matrix of size (|SO| + |S − SO|) × (|SO| + |O − SO|), where SO is the set of common subjects and objects. Note that this matrix is rectangular (rather than square) but, because of the partitioning nature of the k2-tree, along both dimensions 0-cells will be padded, to the nearest power of k.

As the graph structure and the dictionary of strings are completely different components, no wonder that works focusing on a single aspect of RDF compression also exist. Martínez-Prieto et al. [14] consider efficient RDF dictionary compression. They consider several variants, more compression or more access time oriented, and utilize algorithmic techniques like hashing, front coding and FM-indexing, to name a few. Other researchers in this area focus on scalable parallel solutions [17, 6], e.g. using the MapReduce paradigm.

Considering the graph structures, recent years also brought a few novel concepts. Joshi et al. [12] managed to losslessly prune over 50% of the original triples for several popular datasets, using the idea of inferring triples from a set of logical rules derived from the given dataset. To give a toy example, if the dataset contains the triples ⟨A, fatherOf, B⟩, ⟨B, fatherOf, C⟩ and ⟨A, grandfatherOf, C⟩, the last of them may be inferred from the previous two, given an ontology explaining the connection between the relations "fatherOf" and "grandfatherOf". A similar approach was taken by Pan et al. [16], who replace a frequently occurring graph pattern with a generalized triple. To give an example, if the pattern ⟨?x, a, foaf:Person⟩, ⟨?x, a, dbp:Person⟩ appears often (as it actually happens in the DBpedia dataset), a type T may be introduced, along with a rule to expand T to foaf:Person and dbp:Person. In this way, the single triple ⟨?x, a, T⟩ would represent the pattern above. Jiang et al. [11] proposed two algorithms. In one, they label each object and subject in the RDF graph with "type" and reduce the number of nodes by combining those with the same type and related neighborhoods. In the other variant, they contract the graph by removing nodes having only one neighbor after passing the information about the node to remove to its neighbor. Hernández-Illera [10] exploit the so-called predicate families, i.e., possible pairs of predicates and subjects. As in real datasets their number is much less than the combinatorial product of all existing values, there is a clear redundancy which may be removed.

3 http://www.compression.ru/ds/ppmdj1.rar (PPMd, var. J rev. 1, May 10, 2006, by D. Shkarin)


Similarly, there exists a redundancy in pairs of predicates and objects. The proposed algorithm, HDT++, produces archives that are 10–13% smaller than the k2-triples compressor [2] for large datasets. A good survey of graph compression, including RDF compression, was recently published by Maneth and Peternek [13].

3 The proposed algorithm

The proposed algorithm is aimed at semantically lossless compression of RDF datasets. Our only goal was to improve compression efficiency, which would help to distribute large RDF datasets. Therefore, we do not aim at providing the capability of search without decompression; however, most of the solutions presented here could also be applied to a search-capable compression scheme, and investigating this opportunity will be our future work.

3.1 The general approach

The following list sums up the key characteristics of our approach.
1. Separation of semantic and statistical encoding. The RDF-specialized transformations are performed as the first stage, whose output is compressed in the second stage with a general-purpose compression algorithm.
2. Separation of graph and dictionary compression. The graph contains only numeric identifiers, which can be translated to the actual subjects, predicates, and objects using dictionaries. This separation was chosen to reduce the design complexity of the algorithm.
3. Splitting the content. The RDF dataset components that are of a distinct semantic type (e.g., subjects vs. objects) or consist of similar values (e.g., unmatched string fragments vs. match lengths) are put into separate streams. This helps to obtain more skewed distributions and allows patterns to form, which can be exploited by the second-stage algorithm, which compresses the separate streams and outputs a single distributable file.
4. Reordering the content. The aim is to form a pattern that can be effectively encoded. The primary examples are: sorting the triples (so that at least the first triple component forms a non-decreasing sequence, thus becoming extremely prone to delta encoding), putting the triples in (object, subject, predicate) order (so that the highest-entropy component becomes the one to be most effectively compressed after sorting with delta/run-length encoding), and moving the numbers to the end of strings (so that longer matches can be found, as the numbers are often the most randomized part of strings).
5. Effective number encoding. The numbers represented in input formats as text are encoded in binary form, using all bits available in every byte.
6. Exploiting local redundancy. As similar values tend to appear next to each other in respective streams, we apply delta and run-length encoding to remove this redundancy or make it more prone to second-stage compression.

We called our approach Objects-First Representation (OFR), because of the order of triple elements.


3.2 Phases of the algorithm

As noted above, the proposed algorithm consists of two stages: (1) semantic encoding of the RDF dataset content, and (2) statistical encoding and merging of the stage 1 output. As stage 2 simply involves the use of a general-purpose compression algorithm, below we shall only describe the phases of stage 1.

Parsing input RDF dataset. The first phase decodes the RDF dataset from its input representation. For every input line, it first checks its correctness. It is not a strict check; however, errors that would cause failure of the subsequent compression phases are detected. Minor errors (e.g., unescaped UNICODE characters) are corrected; lines containing major errors (e.g., missing URI closing bracket) are put into a separate errorlines output stream. Also, lines that do not contain triples (notably prefix definitions and comments) are put into separate streams. The lines considered correct are parsed into subject, predicate and object parts. The numbers are found and moved to the end of the string, with their original places of occurrence marked with zeros. The language and datatype information (sometimes appended to object literals) is extracted and replaced with a numerical identifier, with the original moved to a respective dictionary. The triple elements are then classified depending on their content type (defined name, URI, or literal), and queried in a respective dictionary (there are nine combinations of element and content types, but only six are valid in RDF, hence there are six main dictionaries; the two additional ones are for languages and datatypes). If an element is not found in a given dictionary, it is added, and has a new unique ID assigned. The three elements' IDs form a triple of numbers, which is appended to a list.
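One of the less conventional steps above is moving numbers to the end of a string while marking their original positions with zeros. A minimal sketch of such a transform is given below; it is hypothetical code, not the authors' implementation, and ignores details such as escaping and the error handling described above.

#include <cctype>
#include <string>
#include <utility>
#include <vector>

// Moves every maximal digit run out of the string, leaving a single '0'
// marker at its original position; returns the rewritten string together
// with the extracted numbers (kept as text to preserve leading zeros).
std::pair<std::string, std::vector<std::string>>
extract_numbers(const std::string &s) {
  std::string rewritten;
  std::vector<std::string> numbers;
  for (std::size_t i = 0; i < s.size(); ) {
    if (std::isdigit(static_cast<unsigned char>(s[i]))) {
      std::size_t j = i;
      while (j < s.size() && std::isdigit(static_cast<unsigned char>(s[j]))) ++j;
      numbers.push_back(s.substr(i, j - i));   // digits go to a number stream
      rewritten.push_back('0');                // marker at the original place
      i = j;
    } else {
      rewritten.push_back(s[i++]);
    }
  }
  return {rewritten, numbers};
}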

Sorting. All the main dictionaries are sorted lexicographically. New IDs are assigned to every dictionary item that (i) reflect the sorted order and (ii) ensure that the IDs for every element type are unique (e.g., no object literal has the same ID as any object URI). The triple list is updated by replacing all the old IDs with the new ones. Then, the triple list is sorted in the increasing order determined by object ID, then subject ID, then predicate ID.

Encoding triples. The object IDs are encoded as run-lengths of series of zeros (denoting a number of triples referring to the same object) or ones (denoting a number of different objects). This is all that is needed for decoding, as the triples are sorted by object ID, therefore the consecutive objects in the triple list can only differ by having an ID increased by one. The subject IDs are encoded differently, depending on whether it is the first encountered subject for a given object (leading subject) or not (consecutive subject), to better exploit the non-decreasing order of the consecutive subjects introduced by sorting. Each but the very first leading subject ID is compared to the preceding one.


If the difference is small, it is encoded using a single byte to the subjects.hi stream; otherwise, a range identifier is encoded using a single byte to the subjects.hi stream, whereas the remaining bits (pointing to the position within the range) are encoded to the subjects.lo stream. The consecutive subject IDs are delta- and run-length-encoded into three streams (subjects.delta for prefixes, subjects.rle for run-length subranges, subjects.lo for remainders). The predicate IDs are simply put to the predicates stream using a minimum possible number of bytes (taking into consideration the maximum predicate ID). Delta and run-length encoding was also tried with the predicates stream, and although it considerably reduced its size, the final effect, after applying second-stage compression, was found to be negative on test datasets.
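A sketch of the object-ID encoding may help to picture it: after sorting, consecutive object IDs differ by 0 (same object) or 1 (next object), and the resulting 0/1 sequence is run-length encoded. The code below is hypothetical and leaves out the byte-level range widths listed in Section 3.3.

#include <cstdint>
#include <vector>

struct Run { std::uint8_t bit; std::uint64_t length; };   // run of 0s or 1s

// Assumes 'object_ids' is the sorted object-ID column of the triple list,
// so consecutive entries differ by exactly 0 or 1.
std::vector<Run> encode_object_ids(const std::vector<std::uint64_t> &object_ids) {
  std::vector<Run> runs;
  for (std::size_t i = 1; i < object_ids.size(); ++i) {
    std::uint8_t bit =
        static_cast<std::uint8_t>(object_ids[i] - object_ids[i - 1]);
    if (!runs.empty() && runs.back().bit == bit)
      ++runs.back().length;        // extend the current run
    else
      runs.push_back({bit, 1});    // start a new run
  }
  return runs;
}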

Encoding dictionaries. Each of the six main dictionaries is encoded in the following steps:

1. A dictionary element is matched to the preceding one. The match length is delta-encoded and put to the matchlens stream (a sketch of this matching is given after this list).
2. The unmatched part (including number markers but not including the numbers themselves) is put to the dict stream.
3. The number buckets (see steps 4 and 5) that were kept for offsets within the unmatched part are closed, with their content flushed into the deltas.hi and deltas.lo streams. Note that this grouping of numbers and delaying their output until a sequence is finished helps patterns to form and increases local redundancy, which is to be exploited by the second-stage compression.
4. The numbers that were encountered in the unmatched part (leading numbers) are separated into prefix (range identifier) and remainder (range position) parts, and put, respectively, into the nums.hi and nums.lo streams. There are special prefix ranges reserved for numbers around 2000 and digits preceded by a single zero (e.g. 01), aimed at encoding months, hours, and popular years in a single byte. Moreover, encountering a number causes a new bucket to be initialized. The bucket will contain numbers encountered at the same offset in subsequent matches (see step 5).
5. The numbers that were encountered in the matched part (consecutive numbers) are delta- and run-length-encoded into a respective bucket of numbers, again with prefix and remainder separated. There are special prefix ranges reserved for 1000 and further powers of 10, so that such incrementals can be encoded in a single byte.
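The per-entry matching in step 1 is essentially front coding against the previous (lexicographically adjacent) entry. The sketch below is hypothetical code, and the interpretation of "delta-encoded" as the difference to the previous match length is our assumption.

#include <algorithm>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

std::size_t common_prefix(const std::string &a, const std::string &b) {
  std::size_t n = std::min(a.size(), b.size()), i = 0;
  while (i < n && a[i] == b[i]) ++i;
  return i;
}

// For each entry of a lexicographically sorted dictionary, produces the
// delta of the match length (for the matchlens stream) and the unmatched
// suffix (for the dict stream).
std::vector<std::pair<long long, std::string>>
front_code(const std::vector<std::string> &sorted_entries) {
  std::vector<std::pair<long long, std::string>> out;
  long long prev_match = 0;
  for (std::size_t i = 0; i < sorted_entries.size(); ++i) {
    long long match = (i == 0) ? 0
        : static_cast<long long>(common_prefix(sorted_entries[i - 1],
                                                sorted_entries[i]));
    out.push_back({match - prev_match, sorted_entries[i].substr(match)});
    prev_match = match;
  }
  return out;
}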

Because of their rather negligible size, the additional dictionaries (languages and datatypes) are passed to the second-stage compressor in their original format. Decompression is a much simpler procedure. First, the additional dictionaries are read. Then, one after another, the main dictionaries are decompressed in three steps: (1) strings (without numbers) are reproduced using the content of the matchlens and dict streams, as well as the additional dictionaries; (2) the strings are analyzed to obtain number bucket sizes; (3) the numbers are decoded and inserted into the places held by markers. Next, the triple element lists are decoded: (1) the object ID list; (2) the subject ID list; (3) the predicate ID list.


Input triples: "o3_5"^^ . "o7_7z"@en .

Dictionaries: Datatypes (1): Languages (1): en

Subject URIs: Items (2): "http://example/s0","" Match lengths (2): 0, 17 Numbers (1): 1 Deltas (1): 1 (as 2 − 1 = 1)

Predicates: Items (1): "http://example/p" Match lengths (1): 0

Object literals: Items (2): "o0^0", "z@0" Match lengths (2): 0, 2 Numbers (1): 3, 5 Deltas (1): 4 (as 7 − 3 = 4), 2 (as 7 − 5 = 2)

Triples: Objects: 0, 1 (as 1 − 0 = 1) Subjects: 0, 1 (as 1 − 0 = 1) Predicates: 0, 0

Fig. 1. An example of triple processing in the OFR algorithm

Then, the lines that did not contain triples are copied to the beginning of the output (decompressed) file. Finally, the original triples are reconstructed by replacing the IDs with the respective dictionary items, and appended at the end of the output file. Note that the order of triples is not preserved, which is not a problem, as the order of lines in the N-Triples format is irrelevant. This is why we called our scheme semantically lossless. Fig. 1 shows a toy example.

3.3 Implementation details

A proof-of-concept implementation of the proposed algorithm has been developed in C++ with the following design decisions:

1. The openhash of Zilong Tan's ulib library⁴ was used to implement the main and additional dictionaries. This solution was found experimentally to handle large dictionaries much faster than the hash map of the Standard Template Library.

4 Z. Tan, ulib. An efficient library for developing high-performance and scalable systems in C and C++, 2012, http://code.google.com/p/ulib/.


2. The Standard Template Library's sort has been used for sorting both the triples and the dictionaries.
3. The object encoder uses the following range widths: 80 for prefixes of run-lengths of zeros, 32 for prefixes of run-lengths of ones, 16 for prefixes of run-lengths of (zero, one) pairs, and 128 for the remainders.
4. The consecutive subject encoder uses the following range widths: 32 for prefixes of run-lengths of zeros, 16 for prefixes of run-lengths of ones, 15 for small numbers (no remainder), 128 for numbers with a remainder one byte long, 64 for numbers with a remainder two bytes long.
5. The leading subject encoder uses ranges dependent on the number of subjects. The width of the range for prefixes is the minimum number that allows to encode the remainder with a static length code using a minimum number of bytes (multiples of 8 bits). The remaining codespace is used for delta coding (half for small numbers, the other half for prefixes of larger numbers, with a remainder one byte long).
6. The match length delta encoder writes them as bytes, using (repetitions of) one special value (255) to encode values larger than 254.
7. The leading numbers encoder uses the following range widths: 112 for small numbers (no remainder), 40 for numbers with a remainder one byte long, 31 for numbers with a remainder two bytes long, 6 for numbers with a remainder three bytes long, 2 for larger numbers, 10 for digits preceded by a single zero, 52 for numbers from the range 1969-2020, and 2 for run-length encoding of leading zeros. Thus, e.g., 2004 can be encoded using just one byte, 10005 using just two bytes (in total), 2000007 just three, and 100000009 just four.
8. The consecutive numbers encoder uses the following range widths: 128 for small numbers (no remainder), 47 for numbers with a remainder one byte long, 31 for numbers with a remainder two bytes long, 6 for numbers with a remainder three bytes long, 17 for prefixes of run-lengths of zeros, 17 for prefixes of run-lengths of other numbers, 5 for the consecutive powers of 10 starting with 1000, and the remaining 5 for other purposes (e.g. larger numbers and marking a change in the number of leading zeros).
9. At most 48 streams are produced at the end of stage one, and passed to the stage-two compressor.
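All the range widths above follow the same prefix-plus-remainder pattern: the first byte selects a range (and therefore how many remainder bytes follow), and the remainder bytes give the position within that range. The sketch below shows the general pattern only; it is hypothetical code and does not reproduce OFR's concrete range tables or special cases.

#include <cstdint>
#include <vector>

// A range reserves 'prefix_codes' values of the leading byte, each followed
// by 'remainder_bytes' bytes, and thus covers prefix_codes * 256^remainder_bytes
// consecutive values. The ranges of one encoder should use at most 256 codes.
struct Range { std::uint16_t prefix_codes; int remainder_bytes; };

bool encode_value(std::uint64_t v, const std::vector<Range> &ranges,
                  std::vector<std::uint8_t> &out) {
  std::uint32_t prefix_base = 0;
  for (const Range &r : ranges) {
    std::uint64_t span =
        static_cast<std::uint64_t>(r.prefix_codes) << (8 * r.remainder_bytes);
    if (v < span) {
      out.push_back(static_cast<std::uint8_t>(
          prefix_base + (v >> (8 * r.remainder_bytes))));      // prefix byte
      for (int b = r.remainder_bytes - 1; b >= 0; --b)         // remainder
        out.push_back(static_cast<std::uint8_t>((v >> (8 * b)) & 0xFF));
      return true;
    }
    v -= span;                        // shift into the next range
    prefix_base += r.prefix_codes;
  }
  return false;  // value outside the configured ranges (escape code omitted)
}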

4 Experimental results

We experimentally tested our implementation of the proposed compression scheme (OFR) on a number of datasets in the N-Triples serialization format. Table 1 presents compression ratios (in per cent) of several compressors. We ran HDT and our algorithm, with two backend compressors: zip -9 (max compression) and LZMA (the default compression algorithm used in the 7z archiver, http://www.7-zip.org), and copied the available results of other prominent RDF compressors from the literature.


More specifically, the JHD is the algorithm by Joshi et al. [12] and its results are taken from the cited work. Table 1 shows that the proposed OFR algorithm achieves clearly the best compression ratios across all test datasets.

Dataset            no. triples  inp. size (MB)  HDT+zip  HDT+7z  best other  OFR+zip  OFR+7z
AEMET-1              1,018,815           139.2     0.55    0.41        0.8c     0.26    0.24
AEMET-2              2,788,429           517.8     0.70    0.30      1.1b,c     0.09    0.08
CN                     137,484            18.8     0.95    0.80       0.78d     0.34    0.22
Events-Wikipedia       359,028            34.2     5.03       —           —     3.43    3.07
Jamendo              1,047,950           151.2     3.44    2.58       3.19d     2.58    2.02
LinkedMbd            6,147,996           891.6     1.88    1.50       1.01a     0.90    0.79
Mix                     93,048            12.3    3.03’   2.65’        4.9b     2.18    1.93
Petrol               3,356,616           508.9     1.83    1.62        2.6c     1.86    1.62

Table 1. Compression ratio comparison. The results are given in per cent of the original dataset size. The results of HDT on the Mix dataset, marked with a prime symbol (’), are over-optimistic, since the compressor missed about 5% of the triples. The column “best other” denotes the best, to our knowledge, compression ratio from other programs reported in the literature. a is from the k2-triples result, copied from [10]. b is from the algorithm RDSZ [9], as reported in [7]. c is from ERI-4k-Nodict [7], as reported in the same work. d is the JHD [12] result, with bzip2 backend compression.

5 Conclusion

We presented an RDF compression scheme surpassing the existing ones in compression ratio. This was achieved thanks to careful parsing of the RDF content, reordering it, distributing it into multiple streams, and encoding each stream using, first, the most adequate specialized techniques, and then, an efficient general-purpose compression algorithm, such as LZMA. The future work will be to develop a query-supporting RDF compressed representation, based on the presented solution.

References

1. D. J. Abadi, A. Marcus, S. Madden, and K. J. Hollenbach. Scalable semantic web data management using vertical partitioning. In Proceedings of the 33rd International Conference on Very Large Data Bases, pages 411–422. ACM, 2007.
2. S. Alvarez-García, N. R. Brisaboa, J. D. Fernández, and M. A. Martínez-Prieto. Compressed k2-triples for full-in-memory RDF engines. In A Renaissance of Information Technology for Sustainability and Global Competitiveness. 17th Americas Conference on Information Systems. Association for Information Systems, 2011.
3. M. Atre, V. Chaoji, M. J. Zaki, and J. A. Hendler. Matrix "bit" loaded: A scalable lightweight join query processor for RDF data. In Proc. of the 19th International Conference on World Wide Web, WWW 2010, Raleigh, North Carolina, USA, April 26-30, 2010, pages 41–50. ACM, 2010.
4. N. Brisaboa, S. Ladra, and G. Navarro. Compact representation of web graphs with extended functionality. Information Systems, 39(1):152–174, 2014.
5. J. Broekstra, A. Kampman, and F. van Harmelen. Sesame: A generic architecture for storing and querying RDF and RDF schema. In The Semantic Web - ISWC 2002, 1st International Semantic Web Conference, Sardinia, Italy, June 9-12, 2002, Proc., volume 2342 of LNCS, pages 54–68. Springer, 2002.
6. L. Cheng, A. Malik, S. Kotoulas, T. E. Ward, and G. Theodoropoulos. Scalable RDF data compression using X10. CoRR, abs/1403.2404, 2014.
7. J. D. Fernández, A. Llaves, and Ó. Corcho. Efficient RDF interchange (ERI) format for RDF data streams. In The Semantic Web - ISWC 2014 - 13th International Semantic Web Conference, Riva del Garda, Italy, October 19-23, 2014. Proceedings, Part II, volume 8797 of Lecture Notes in Computer Science, pages 244–259. Springer, 2014.
8. J. D. Fernández, M. A. Martínez-Prieto, and C. Gutierrez. Compact representation of large RDF data sets for publishing and exchange. In The Semantic Web - ISWC 2010 - 9th International Semantic Web Conference, Revised Selected Papers, Part I, volume 6496 of LNCS, pages 193–208. Springer, 2010.
9. N. Fernández, J. Arias, L. Sánchez, D. Fuentes-Lorenzo, and Ó. Corcho. RDSZ: An approach for lossless RDF stream compression. In The Semantic Web: Trends and Challenges, pages 52–67. Springer, 2014.
10. A. Hernández-Illera, M. A. Martínez-Prieto, and J. D. Fernández. Serializing RDF in compressed space. In Data Compression Conference (DCC), 2015.
11. X. Jiang, X. Zhang, F. Gao, C. Pu, and P. Wang. Graph compression strategies for instance-focused semantic mining. In Linked Data and Knowledge Graph - 7th Chinese Semantic Web Symposium and 2nd Chinese Web Science Conference, Revised Selected Papers, volume 406 of Communications in Computer and Information Science, pages 50–61. Springer, 2013.
12. A. K. Joshi, P. Hitzler, and G. Dong. Logical linked data compression. In The Semantic Web: Semantics and Big Data, 10th International Conference, volume 7882 of LNCS, pages 170–184. Springer, 2013.
13. S. Maneth and F. Peternek. A survey on methods and systems for graph compression. CoRR, abs/1504.00616, 2015.
14. M. A. Martínez-Prieto, J. D. Fernández, and R. Cánovas. Compression of RDF dictionaries. In 27th ACM International Symposium on Applied Computing (SAC'2012) - Track The Semantic Web and Applications (SWA), pages 1841–1848. ACM, 2012.
15. T. Neumann and G. Weikum. The RDF-3X engine for scalable management of RDF data. VLDB Journal, 19(1):91–113, 2010.
16. J. Z. Pan, J. M. Gómez-Pérez, Y. Ren, H. Wu, H. Wang, and M. Zhu. Graph pattern based RDF data compression. In Semantic Technology - 4th Joint International Conference, Revised Selected Papers, volume 8943 of LNCS, pages 239–256. Springer, 2014.
17. J. Urbani, J. Maassen, N. Drost, F. J. Seinstra, and H. E. Bal. Scalable RDF data compression with MapReduce. Concurrency and Computation: Practice and Experience, 25(1):24–39, 2013.
18. C. Weiss, P. Karras, and A. Bernstein. Hexastore: Sextuple indexing for semantic web data management. PVLDB, 1(1):1008–1019, 2008.
19. K. Wilkinson. Jena property table implementation. In SSWS, 2006.

Reducing large semantic graphs to improve semantic relatedness

José Paulo Leal and Teresa Costa

CRACS & INESC-Porto LA, Faculty of Sciences, University of Porto, Porto, Portugal [email protected] [email protected]

Abstract. In previous research the authors developed a family of semantic measures that are adaptable to any semantic graph, being automatically tuned with a set of parameters. The research presented in this paper extends this approach by also tuning the graph. This graph reduction procedure starts with a disconnected graph and incrementally adds edge types, until the quality of the semantic measure cannot be further improved. The validation performed used the three most recent versions of WordNet and, in most cases, this approach improves the quality of the semantic measure.

Keywords: Semantic similarity; Linked Data; Semantic Graph

1 Introduction

This paper is part of ongoing research [12, 13, 6] aiming at the development of a methodology for creating semantic measures taking as source any given semantic graph. This methodology, called SemArachne, does not require any particular knowledge of the semantic graph and is based on the notion of proximity rather than distance. It considers virtually all paths connecting two terms with weights depending on edge types. SemArachne automatically tunes these weights for a given semantic graph. The validation of this process was performed using WordNet 2.1 [8] with the WordSimilarity-353 [9] data set, with results better than those in the literature [13]. WordNet 2.1 has a smaller graph when compared with its recent versions or even other semantic sources, such as DBpedia or Freebase. Not only do the number of nodes and edge types increase, but the number of graph arcs also expands, enabling them to relate semantically a larger number of terms and making graphs not only larger but also denser. Computing proximity in these conditions comes with a price. Since SemArachne considers all the paths, the number of paths to process tends to increase. A rough measure of graph density is the maximum degree of all its nodes. However, it can be misleading, since there may be a special node where all the edge types are applied. The real challenge is then the graph average node degree.


SemArachne computes all paths connecting a pair of terms up to a given length. The node degree is the branching factor for the paths crossing that node. Hence, a high average node degree reduces the efficiency of the SemArachne measure. The alternative explored in this paper to reduce graph density is to reduce the number of edge types while keeping all nodes, thus preserving the potential to relate a larger set of terms. The approach is to incrementally build a subgraph of the original semantic graph. This process starts with a fully disconnected graph containing all the nodes. At each iteration, a new edge type is added until the semantic measure quality stops improving. The result of this process is a subgraph where the semantic quality is maximized. The semantic measure used by SemArachne [12] also had some minor adjustments. The rest of the paper is organized as follows. The next section surveys the state of the art on semantic relatedness. Section 3 summarizes previously published work and Section 4 details the approach followed to measure semantic relatedness in larger graphs. The experimental results and their analysis can be found in Section 5. Finally, Section 6 summarizes what was accomplished so far and identifies opportunities for further research.

2 Related Work

Semantic measures are widely used today to measure the strength of the semantic relationship between terms. This evaluation is based on the analysis of information describing the elements, extracted from semantic sources. There are two different types of semantic sources. The first one is unstructured and semi-structured text, such as plain text or dictionaries. Texts carry evidence of semantic relationships, and it is possible to measure those relationships using simple assumptions regarding the distribution of words. This source type is mainly used by distributional approaches. The second type of semantic sources is more general and includes a large range of computer-understandable resources where the knowledge about elements is explicitly structured and modeled. Semantic measures based on this type of source rely on techniques to take advantage of semantic graphs or higher formal knowledge representations. This source type is mainly used by knowledge-based approaches. Distributional approaches rely on the distributional hypothesis [11], which states that words in a similar context are surrounded by the same words and are likely to be semantically similar. There are several methods following this approach, such as the Spatial/Geometric methods [10], the Set-based methods [5], and the Probabilistic methods [7]. The knowledge-based approaches rely on any form of knowledge representation, namely semantic graphs, since they are structured data from which semantic relationships can be extracted. They consider the properties of the graph, and elements are compared by analysing their interconnections and the semantics of those relationships. Several methods have been defined to compare elements in single and multiple knowledge bases, such as Structural methods [22, 24, 14,


15], Feature-based methods [4, 27, 23] and Shannon's Information Theory methods [16, 21, 19, 20]. Knowledge-based approaches have the advantage of controlling which edge types should be considered when comparing pairs of elements in the graph. They are also easier to implement than distributional methods and have a lower complexity. However, they require a knowledge representation containing all the elements to compare. On the other hand, using large knowledge sources to compare elements is also an issue due to the high computational complexity. There are also hybrid approaches [24, 2, 3, 18] that mix the knowledge-based and the distributional approaches. They take advantage of both texts and knowledge representations to estimate the semantic measure.

3 Previous work

This section summarizes previously published work [12, 13] that is the core of SemArachne and is relevant for the graph reduction process described in the next section. The first subsection details the semantic measure, the following subsection the quality measure, and the last subsection the fine-tuning process.

3.1 Semantic Measure

A semantic graph can be defined as G = (V, E, T, W) where V is the set of nodes, E is the set of edges connecting the graph nodes, T is the set of edge types and W is a mapping of edge types to weight values. Each edge in E is a triplet (u, v, t) where u, v ∈ V and t ∈ T. The set W defines a mapping w : T → ℤ. The bound of the absolute weight values¹ for all edge types is defined by

Ω(G) ≡ max_{ti ∈ T} |w(ti)|

To measure the proximity between a pair of terms it is necessary to build a set of distinct paths that connect them by walking through the graph. A path p of size n ∈ ℕ⁺ is a sequence of unrepeated nodes u0 … un (∀ 0 ≤ i, j ≤ n, ui ≠ uj), linked by typed edges. It must have at least one edge and cannot have loops. A path p is denoted as follows:

p = u0 –t1→ u1 –t2→ u2 … un−1 –tn→ un

The weight of an edge depends on its type. The weight of a path p is the sum of the weights of its edges, ω(p) = w(t1) + w(t2) + … + w(tn). The set of all paths of size n connecting a pair of concepts is defined as follows, and its weight is the sum of the weights of all its sub-paths.

P^n_{u,v} = { u0 –t1→ u1 … un−1 –tn→ un : u = u0 ∧ v = un ∧ ∀ 0 ≤ i, j ≤ n, ui ≠ uj }

¹ This semantic measure accepts negative weights for some types of edges.


The semantic measure is based on the previous definition and also considers the path length. ∆(G) is the degree of each node in each path. The proximity function r is defined by the following formula.

r(u, v) = 1,  if u = v
r(u, v) = (1/Ω(G)) · Σ_{n=1..∞} [ 1/(2^n · n · ∆(G)^n) · Σ_{p ∈ P^n_{u,v}} ω(p) ],  if u ≠ v        (1)

Given a graph with a set of nodes V, where r : V × V → [−1, 1], the proximity function r takes a pair of terms and returns a "percentage" of proximity between them. The proximity of related terms must be close to 1 and the proximity of unrelated terms must be close to −1. This definition of proximity depends on the weights of transitions. Using domain knowledge to define them has proved to be a naïve approach, since an "informed opinion" frequently has no evidence to support it and sometimes is plainly wrong. Also, applying this methodology to a large ontology covering several domains can be hard. To be of practical use, the weights of a proximity-based semantic relatedness measure must be automatically tuned. To achieve this, it is necessary to estimate the quality of a semantic measure for a given set of parameters.
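To make the role of each factor concrete, the following minimal sketch computes this proximity on a toy in-memory graph by enumerating loop-free paths up to a fixed length. It follows equation (1) as reconstructed above; the Graph class, the bidirectional traversal, and the max_len cut-off are illustrative assumptions, not SemArachne's actual implementation.

```python
# Minimal sketch of the proximity measure in equation (1) over a toy graph.
class Graph:
    def __init__(self, edges, weights):
        # edges: list of (u, v, t) triplets; weights: dict mapping edge type t -> int
        # edges are traversed in both directions in this sketch
        self.adj = {}
        for u, v, t in edges:
            self.adj.setdefault(u, []).append((v, t))
            self.adj.setdefault(v, []).append((u, t))
        self.weights = weights

    def omega(self):
        # Omega(G): bound of the absolute weight values over all edge types
        return max(abs(w) for w in self.weights.values())

    def avg_degree(self):
        # Delta(G): average node degree, used to damp the contribution of long paths
        return sum(len(vs) for vs in self.adj.values()) / max(len(self.adj), 1)

def proximity(g, u, v, max_len=4):
    """Approximates r(u, v) by enumerating loop-free paths of up to max_len edges."""
    if u == v:
        return 1.0
    delta = g.avg_degree()
    total = 0.0

    def walk(node, visited, weight_sum, length):
        nonlocal total
        if node == v:
            # contribution of one path of size `length`, as in equation (1)
            total += weight_sum / (2 ** length * length * delta ** length)
            return
        if length == max_len:
            return
        for succ, t in g.adj.get(node, []):
            if succ not in visited:
                walk(succ, visited | {succ}, weight_sum + g.weights[t], length + 1)

    walk(u, {u}, 0.0, 0)
    return total / g.omega()

# toy usage: a tiny taxonomy with a single edge type
g = Graph([("cat", "feline", "isa"), ("feline", "mammal", "isa"), ("dog", "mammal", "isa")],
          {"isa": 2})
print(proximity(g, "cat", "dog"))
```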

3.2 Quality Measure

The purpose of the quality measure is to compute the quality of the semantic measure defined by (1) for a particular set of parameters. In order to simplify and optimize the quality measure, it is necessary to factor the weights out of the semantic measure definition. Thus its quality may be defined as a function of a weight assignment. The first step is to express the semantic measure in terms of the weights of edge types. Consider the set of all edge types T with ♯T = m and the weights of its elements w(t), ∀t ∈ T. The second branch of (1) can be rewritten as follows, where the ci(a, b), i ∈ {1..m}, are the coefficients of each edge type.

r(a, b) = Σ_{n=1..∞} α Σ_{Pj ∈ P} β Σ_{t ∈ Pj} w(t) = Σ_{i=1..m} ci(a, b) · w(ti)

Edge type weights are independent of the arguments of r, but the coefficients that are factored out do depend on these arguments. It is possible to represent both the weights of edges and their coefficients, w = (w(t1), w(t2), . . . , w(tk)) and c(a, b) = (c1(a, b), c2(a, b), . . . , cm(a, b)) respectively, by defining a standard order on the elements of T. This way the previous definition of r may take the weight vector as a parameter, as follows

rw(a, b) = c(a, b) · w

The method commonly used to estimate the quality of a semantic relatedness algorithm is to compare it with a benchmark data set containing pairs of words


and their relatedness. The Spearman's rank order correlation is widely used to make this comparison. Consider a benchmark data set with pairs of words (ai, bi) for 1 ≤ i ≤ k, each with a proximity xi. Given the relatedness function rw : S × S → ℝ, let us define yi = rw(ai, bi). In order to use the Spearman's rank order coefficient, both xi and yi must be converted to the ranks x'i and y'i. The Spearman's rank order coefficient is defined in terms of xi and yi, where the xi are constants from the benchmark data set. To use this coefficient as a quality measure it must be expressed as a function of w. Considering that y = (rw(a1, b1), . . . , rw(an, bn)), then y = Cw, where C is an n × m matrix in which each line contains the coefficients for a pair of concepts and each column contains the coefficients of a single edge type. Vector w is an m × 1 matrix with the weights assigned to each edge type. The product of these matrices is the relatedness measure of a set of concept pairs. Considering ρ(x, y) as the Spearman's rank order correlation of x and y, the quality function q is:

qx(w) = ρ(x, Cw)        (2)

The next step in the SemArachne methodology is to determine a w that maximizes this quality function.
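As an illustration, the quality function can be evaluated directly with an off-the-shelf Spearman implementation. The sketch below is just a transcription of (2) using SciPy; the small arrays C, w and x are made up for the example and are not SemArachne data.

```python
# Sketch of the quality function q_x(w) = rho(x, Cw) of equation (2).
import numpy as np
from scipy.stats import spearmanr

def quality(C, w, x):
    """Spearman correlation between benchmark proximities x and the measure Cw."""
    y = C @ w                       # relatedness value of each benchmark pair
    return spearmanr(x, y).correlation

# toy data: 4 word pairs, 3 edge types (purely illustrative)
C = np.array([[1, 0, 2],
              [0, 3, 1],
              [2, 1, 0],
              [1, 1, 1]], dtype=float)
w = np.array([2.0, -1.0, 1.0])      # candidate weight vector
x = np.array([0.9, 0.2, 0.7, 0.5])  # benchmark relatedness scores
print(quality(C, w, x))
```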

3.3 Fine Tuning Process

Genetic algorithms are a family of computational models that mimic the process of natural selection in the evolution of species. This type of algorithm uses the concepts of variation, differential reproduction and heredity to guide the co-evolution of a set of problem solutions. This algorithm family is frequently used to improve solutions of optimization problems [29]. In SemArachne the candidate solution – the individual – is a vector of weight values. Consider a sequence of weights (the genes), w(t1), w(t2), . . . , w(tk), taking integer values in a certain range, in a standard order of edge types. Two possible solutions are the vectors v = (v1, v2, . . . , vk) and t = (t1, t2, . . . , tn). Using crossover, it is easy to recombine the "genes" of both "parents", resulting in u = (v1, t2, . . . , tn−1, vk). This is a closer representation of the domain than the typical binary one. It can also be processed more efficiently with a large number of weights. In this tuning process the genetic algorithm has only a single kind of mutation: randomly selecting a new value for a given "gene". The fitness function plays a decisive role in the selection of the new generation of individuals. In this case, individuals are the vectors of weight values w, hence the fitness function is in fact the quality function previously defined in (2).
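A minimal sketch of such a tuning loop is given below, reusing the matrix C and benchmark vector x from the previous subsection. Population size, weight range, mutation rate and number of generations are arbitrary illustrative choices, and the function names are hypothetical; this is not SemArachne's implementation.

```python
# Sketch of a genetic algorithm over integer weight vectors whose fitness is q_x.
import random
import numpy as np
from scipy.stats import spearmanr

def fitness(C, w, x):
    return spearmanr(x, C @ np.asarray(w, dtype=float)).correlation

def crossover(parent_a, parent_b):
    # single-point recombination of two weight vectors (the "genes" are weights)
    cut = random.randrange(1, len(parent_a))
    return parent_a[:cut] + parent_b[cut:]

def mutate(w, low=-5, high=5, rate=0.1):
    # the only kind of mutation: pick a new random value for a randomly chosen gene
    return [random.randint(low, high) if random.random() < rate else g for g in w]

def tune(C, x, n_types, generations=200, pop_size=30, low=-5, high=5):
    pop = [[random.randint(low, high) for _ in range(n_types)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda w: fitness(C, w, x), reverse=True)
        elite = pop[: pop_size // 2]
        children = [mutate(crossover(*random.sample(elite, 2)), low, high)
                    for _ in range(pop_size - len(elite))]
        pop = elite + children
    return max(pop, key=lambda w: fitness(C, w, x))

# usage (with C and x as in the previous sketch): best_w = tune(C, x, n_types=C.shape[1])
```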

4 Graph Reduction Procedure

The previous section explained how to tune the weights of a semantic measure by using a genetic algorithm with an appropriate quality function. This section


introduces a procedure for selecting a subgraph of the original semantic source with reduced density, obtained by repeatedly applying that tuning process.

(Figure content: the graph evolves from a fully disconnected graph, through the 1st, 2nd, 3rd and 4th iterations, to the best graph.)

Fig. 1. Semantic graph reduction procedure

Figure 1 depicts the overall strategy. It starts with a fully disconnected graph by omitting all the edges. The small graph on the left in Figure 1 shows the arcs as dotted lines to denote the original connections. When a single property (edge type) is added to this graph a number of paths is created. If the original graph has n property types then one can create n different subgraphs. The quality of these graphs can be measured using the approach described in the last section. The best of these candidates is the selected graph for the first iteration. This process continues until the quality of the candidate graphs cannot be further improved. More formally, consider a semantic graph G = (V,E,T,W ) where V is the set of nodes, E is the set of edges connecting the graph nodes, T is the set of edge types and W is a mapping of edge types to weight values. The initial graph of this incremental algorithm is G0 = (V, ∅, ∅, ∅). This is a totally disconnected graph just containing the nodes from the original graph, i.e. edges, types and weights are all the empty set.

Each iteration builds a new graph G_{k+1} = (V, E_{k+1}, T_{k+1}, W_{k+1}) based on G_k = (V, E_k, T_k, W_k). The new set of types T_{k+1} has all the types in T_k. In fact, several candidate G^i_{k+1} can be considered, depending on the types in T − T_k that are added to T^i_{k+1}. The arcs of E^i_{k+1} are those in E whose type is in T^i_{k+1}. The


general idea is to select the G^i_{k+1} that produces the highest increment in semantic measure quality. This algorithm stops when no candidate is able to improve it. In general, computing the semantic measure quality of G^i_{k+1} is a time consuming task. However, there are some ways to make it more efficient. As shown in Figure 1, if G^i_{k+1} is not a connected graph then the quality measure cannot be computed. This means that for the first iteration many G^i_1 can be trivially discarded. Moreover, if E^i_{k+1} = E_k then the semantic quality measure is the same. This insight can be used to speed up the iterative process. The paths connecting pairs of concepts using arcs in E_{k+1} are basically the same as those using E_k. The new paths must appear on the nodes of previous paths and can only have arcs of types in T_{k+1}. This insight can be used to compute the quality of G^i_{k+1} incrementally, based on the computation for G_k. The generation of the sets T^i_{k+1} is a potential issue. Ideally T^i_{k+1} would have just one element more than T_k. However, this may not always be possible². Consider T^i_1, the candidate sets of types for the first iteration. In most cases they will produce a disconnected graph, hence with a null semantic measure quality. They will only produce a connected graph if the selected type creates a taxonomy. In many cases this involves 2 types of arcs: one linking an instance to a class, another linking a class to its super-class. To deal with this issue the incremental algorithm attempts first to generate T^i_{k+1} such that ♯T^i_{k+1} = ♯T_k + 1, where ♯ stands for set cardinality. If none of these improve the semantic measure quality then it attempts to generate T^i_{k+1} such that ♯T^i_{k+1} = ♯T_k + 2, and so forth.
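The following sketch summarizes this incremental selection loop. It assumes a caller-supplied function evaluate_quality(types) that tunes the weights for the subgraph induced by the given edge types (for instance with the genetic algorithm of Section 3.3) and returns the resulting quality, returning −∞ for disconnected subgraphs; both the function and its contract are assumptions made for illustration, not part of SemArachne's published API.

```python
# Sketch of the incremental graph reduction procedure of Section 4.
from itertools import combinations

def reduce_graph(all_types, evaluate_quality):
    """all_types: set of edge type identifiers of the original graph.
    evaluate_quality(types): assumed to return the tuned Spearman quality of the
    subgraph induced by `types` (and -inf when that subgraph is disconnected)."""
    selected = set()               # T_k: edge types already in the subgraph
    best_quality = float("-inf")
    while True:
        improved = False
        # try to add 1 new type first, then 2, and so forth (see text above)
        for step in range(1, len(all_types) - len(selected) + 1):
            candidates = []
            for extra in combinations(sorted(all_types - selected), step):
                quality = evaluate_quality(selected | set(extra))
                candidates.append((quality, set(extra)))
            if candidates:
                quality, extra = max(candidates, key=lambda c: c[0])
                if quality > best_quality:
                    selected |= extra
                    best_quality = quality
                    improved = True
                    break          # restart with increments of a single type
        if not improved:           # no candidate improves the quality: stop
            return selected, best_quality
```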

5 Validation

The validation of SemArachne was performed using the semantic graphs of different versions of WordNet along with three different data sets. WordNet³ [8] is a widely used lexical knowledge base of English words. It groups nouns, verbs, adjectives and adverbs into synsets – sets of cognitive synonyms – that express distinct concepts. These synsets are linked by conceptual and lexical relationships. The validation process used three different data sets: WordSimilarity-353⁴ [9], Rubenstein & Goodenough [25] (RG65) and Miller & Charles [17] (MC30). Table 1 compares the performance of SemArachne against the state of the art for methods using the same knowledge-based approach. For WordNet 2.1, SemArachne achieves a better result than those in the literature when using the WordSim-353 data set. Using WordNet 3.1 as the semantic graph, SemArachne also produces a better semantic quality than those in the literature. Although the results for WordNet 3.0 are not the best, regardless of the data set used, they are of the same order of magnitude. The quality of the semantic measure produced with graph reduction was validated against several approaches in the literature. An advantage of this

² However, so far this situation has not yet occurred in validation.
³ http://wordnet.princeton.edu/
⁴ http://www.cs.technion.ac.il/~gabr/resources/data/wordsim353/wordsim353.html


Table 1. Spearman correlation of SemArachne compared with literature

Graph                         Data set   Edges selected   SemArachne correlation   Literature correlation   Author
WordNet 2.1 (26 edge types)   MC30       14               0.81                     0.82                     Strube et al [28], 2006
                              RG65        8               0.60                     0.86
                              WS-353     21               0.45                     0.36
WordNet 3.0 (47 edge types)   MC30       16               0.80                     N/A                      Agirre et al [1], 2009
                              RG65        9               0.63                     0.78
                              WS-353     20               0.48                     0.56
WordNet 3.1 (64 edge types)   MC30       14               0.97                     0.87                     Siblini et al [26], 2013
                              RG65        8               0.94                     0.92
                              WS-353     24               0.54                     0.50

methodology is the ability to measure semantic relatedness regardless of the semantic graph used, and to produce comparable results for each semantic graph and data set. It is also scalable, since it handles gradually larger graphs.

6 Conclusion

As semantic graphs evolve they become larger. Since larger graphs relate more terms, this improves their potential as semantic sources for relatedness measures. However, these larger graphs are also a challenge, in particular for semantic measures that consider virtually all paths connecting two nodes, as is the case of SemArachne. The major contribution of this paper is an incremental approach to select a subgraph with a reduced number of edge types (arcs) but with the same number of entities (nodes). This approach starts with a totally disconnected graph, at each iteration adds an arc type that increases the quality of the semantic measure, and stops when no improvement is possible. These contributions were validated with different versions of WordNet, a medium size graph typically used as a semantic source for relatedness measures. Although this is not the kind of large semantic graph to which this approach is targeted, it is convenient for initial tests due to its relatively small size. In the WordNet graph the reduction of properties is not so expressive, since the total number of properties is comparatively small. The obtained subgraphs do not always improve the quality of the SemArachne measure, but produce a result that is similar to, and in most cases better than, the best method described in the literature for that particular graph. The immediate objective of the SemArachne project is to extend the validation presented in this paper to other data sets and, most of all, to other graphs. Massive graphs with very high density, such as Freebase, are bound to create new and interesting challenges. Another important consequence of this graph reduction procedure is that it decouples the original graph from the actual semantic source. Thus SemArachne can be extended to process multiple semantic


graphs (with shared labels) and create a unified semantic measure combining their semantic power.

Acknowledgments. Project "NORTE-07-0124-FEDER-000059" is financed by the North Portugal Regional Operational Programme (ON.2 – O Novo Norte), under the National Strategic Reference Framework (NSRF), through the European Regional Development Fund (ERDF), and by national funds, through the Portuguese funding agency, Fundação para a Ciência e a Tecnologia (FCT).

References

1. Agirre, E., Alfonseca, E., Hall, K., Kravalova, J., Paşca, M., Soroa, A.: A study on similarity and relatedness using distributional and wordnet-based approaches. In: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics. pp. 19–27. Association for Computational Linguistics (2009)
2. Banerjee, S., Pedersen, T.: An adapted Lesk algorithm for word sense disambiguation using WordNet. In: Computational Linguistics and Intelligent Text Processing, pp. 136–145. Springer (2002)
3. Banerjee, S., Pedersen, T.: Extended gloss overlaps as a measure of semantic relatedness. In: IJCAI. vol. 3, pp. 805–810 (2003)
4. Bodenreider, O., Aubry, M., Burgun, A.: Non-lexical approaches to identifying associative relations in the Gene Ontology. In: Pacific Symposium on Biocomputing. p. 91. NIH Public Access (2005)
5. Bollegala, D., Matsuo, Y., Ishizuka, M.: Measuring semantic similarity between words using Web search engines. In: Proceedings of the 16th International Conference on World Wide Web. pp. 757–766 (2007)
6. Costa, T., Leal, J.P.: Challenges in computing semantic relatedness for large semantic graphs. In: Proceedings of the 18th International Database Engineering & Applications Symposium. pp. 376–377. ACM (2014)
7. Dagan, I., Lee, L., Pereira, F.C.: Similarity-based models of word cooccurrence probabilities. Machine Learning 34(1-3), 43–69 (1999)
8. Fellbaum, C.: WordNet. Wiley Online Library (1999)
9. Gabrilovich, E.: The WordSimilarity-353 test collection, http://www.cs.technion.ac.il/~gabr/resources/data/wordsim353/
10. Ganesan, P., Garcia-Molina, H., Widom, J.: Exploiting hierarchical domain structure to compute similarity. ACM Transactions on Information Systems (TOIS) 21(1), 64–93 (2003)
11. Harris, Z.S.: Distributional structure. In: Papers on Syntax, pp. 3–22. Springer (1981)
12. Leal, J.P.: Using proximity to compute semantic relatedness in RDF graphs. Comput. Sci. Inf. Syst. 10(4) (2013)
13. Leal, J.P., Costa, T.: Multiscale parameter tuning of a semantic relatedness algorithm. In: 3rd Symposium on Languages, Applications and Technologies, SLATE. pp. 201–213 (2014)
14. Li, Y., Bandar, Z.A., McLean, D.: An approach for measuring semantic similarity between words using multiple information sources. Knowledge and Data Engineering, IEEE Transactions on 15(4), 871–882 (2003)


15. Li, Y., McLean, D., Bandar, Z.A., O'Shea, J.D., Crockett, K.: Sentence similarity based on semantic nets and corpus statistics. Knowledge and Data Engineering, IEEE Transactions on 18(8), 1138–1150 (2006)
16. Lin, D.: An information-theoretic definition of similarity. In: ICML. vol. 98, pp. 296–304 (1998)
17. Miller, G.A., Charles, W.G.: Contextual correlates of semantic similarity. Language and Cognitive Processes 6(1), 1–28 (1991)
18. Patwardhan, S., Banerjee, S., Pedersen, T.: Using measures of semantic relatedness for word sense disambiguation. In: Computational Linguistics and Intelligent Text Processing, pp. 241–257. Springer (2003)
19. Pirró, G.: A semantic similarity metric combining features and intrinsic information content. Data & Knowledge Engineering 68(11), 1289–1308 (2009)
20. Pirró, G., Euzenat, J.: A feature and information theoretic framework for semantic similarity and relatedness. In: The Semantic Web – ISWC 2010, pp. 615–630. Springer (2010)
21. Pirró, G., Seco, N.: Design, implementation and evaluation of a new semantic similarity metric combining features and intrinsic information content. In: On the Move to Meaningful Internet Systems: OTM 2008, pp. 1271–1288. Springer (2008)
22. Rada, R., Mili, H., Bicknell, E., Blettner, M.: Development and application of a metric on semantic nets. Systems, Man and Cybernetics, IEEE Transactions on 19(1), 17–30 (1989)
23. Ranwez, S., Ranwez, V., Villerd, J., Crampes, M.: Ontological distance measures for information visualisation on conceptual maps. In: On the Move to Meaningful Internet Systems 2006: OTM 2006 Workshops. pp. 1050–1061. Springer (2006)
24. Resnik, P.: Using information content to evaluate semantic similarity in a taxonomy. In: IJCAI. pp. 448–453 (1995)
25. Rubenstein, H., Goodenough, J.B.: Contextual correlates of synonymy. Commun. ACM 8(10), 627–633 (1965)
26. Siblini, R., Kosseim, L.: Using a weighted semantic network for lexical semantic relatedness. In: RANLP. pp. 610–618 (2013)
27. Stojanovic, N., Maedche, A., Staab, S., Studer, R., Sure, Y.: SEAL: a framework for developing SEmantic portALs. In: Proceedings of the 1st International Conference on Knowledge Capture. pp. 155–162. ACM (2001)
28. Strube, M., Ponzetto, S.P.: WikiRelate! Computing semantic relatedness using Wikipedia. In: AAAI. vol. 6, pp. 1419–1424 (2006)
29. Whitley, D.: A genetic algorithm tutorial. Statistics and Computing 4(2), 65–85 (1994)

Enrichment and On-demand Population of Nutritional Ontologies from XML Diets Repository

Vanesa Espín, Manuel Noguera and María V. Hurtado

Departamento de Lenguajes y Sistemas Informáticos, University of Granada E.T.S.I.I.T., c/Daniel Saucedo Aranda s/n 18071, Granada, Spain

{vespin,mnoguera,mhurtado}@ugr.es

Abstract. Semantic Web technologies (SWTs), such as XML and OWL ontologies, are increasingly being used to represent information and knowledge bases in different domains. That is the case of agroalimentary thesauri, where the information sharing and reasoning capabilities of SWTs can be leveraged. However, these capabilities are not indistinctly provided by each SWT. XML, although not considered a SWT in itself, stands at the syntactic level of the Semantic Web stack and is more suitable for efficient information structuring and retrieval in interactive software applications. The OWL language, on the other hand, is more suitable for background reasoning and consistency checking purposes. In this paper, we introduce the design of the information representation and knowledge sharing between an XML diets repository and the OWL knowledge base of NutElCare (a nutritional recommender system). In this design, XSLT transformations play an important role, elevating the syntactic representation of the XML documents to the semantic level of OWL ontologies. Altogether, they configure a system architecture that keeps the system timely responsive through a seamless linkage between XML and OWL representations.

Keywords: OWL ontology, Ontology enrichment, Ontology population, Ontol- ogy reasoning, Semantic recommender systems, XML, XSLT.

1 Introduction

Several reasons have stimulated the rapid increase in the application of Semantic Web Technologies (SWTs) for the representation of information and knowledge in different software-related fields in recent years. Some of these motivations are the ability of reasoning to extract meaningful conclusions from encoded knowledge, and the exchange, linkage and reuse of this knowledge by different systems, processes and applications [13]. XML (EXtensible Markup Language) was designed for describing data in the World Wide Web. Although it is not considered a SWT in itself, it stands at the syntactic level of the Semantic Web stack. It is platform and software independent and allows the representation, storage and exchange of data once a common syntax has been agreed. XML is not suitable for consistency checking or conceptual interrelationships from a semantic standpoint, even in the same domain of knowledge. OWL (Web Ontology Language), currently in its OWL 2 version [14], is a language


based on Description Logics [1]. In contrast to XML, it supports the representation of domain knowledge through classes, properties and instances, to be used in a distributed environment such as the World Wide Web [2]. Ontologies are one of the main components of the Semantic Web. They provide universal semantics, easy knowledge sharing and unambiguous interpretation of concepts by means of formal model-theoretic semantics. Ontologies represented in OWL can make use of the automated reasoning capabilities that Description Logics provide, allowing this reasoning support to infer new knowledge. However, using an OWL ontology to represent a large amount of structured data, of which only a small part is demanded at any given moment for inference purposes, can result in inefficient reasoning. In this case, a repository of XML documents can be designed for the efficient management, storage and representation of the information, which is then retrieved and bound on demand, only when it is required. Ontologies are also used in information retrieval for indexing documents, providing a semantic classification for the information in the documents [9]. Nevertheless, the process of linking XML documents or repositories and OWL ontologies requires some operations, such as enrichment and population of the ontology. Ontology enrichment is the task of extending an existing ontology with additional concepts and semantic relations and placing them at the correct position in the ontology [16]. It can be considered a sub-discipline of ontology learning and is often used for completing an ontology with new information about the same domain [3]. Ontology population is the task of adding new instances or individuals to the ontology, which can later be unpopulated, i.e. removed from the ontology. These operations need mediator technologies for their procedures, such as XSL (EXtensible Stylesheet Language) Transformations, XSLT, which are able to transform XML documents into other formats. NutElCare (Nutrition for Elder Care) [6] is a recommender system which allows elderly people to set up their own healthy diet plans according to their needs due to aging, considering their food preferences as well as possible allergies or contraindications and previous ingestions, i.e., what they have eaten in the past. In the system, diets are represented as XML documents and stored in an XML server repository. These documents are classified, using indexes as instances of concepts in an ontology, so that they are retrieved only when they are required for reasoning in the recommendation processes. In this paper we introduce the design of the information, knowledge and software architectures of NutElCare, focusing on the XML diets representation and on the enrichment of the ontology to manage the model contained in the XML Schema. We also explain the processes of information retrieval using on-demand binding and population, which allow the knowledge base to perform more efficient reasoning over the contents of the diets. In our approach, these processes are based on XSLT transformations from the syntactic level of the XML diet model definition to the semantic level supplied by an OWL diet model definition. The remainder of this paper is organised as follows. Section 2 introduces some work related to ontology enrichment and population. In Section 3, we explain the different NutElCare recommendations and the ontologies contained in the system's knowledge base, outlining the techniques for diet classification and indexation in the nutritional ontology. In Section 4, the representation of diets and the processes of


ontology enrichment and population are explained. Finally, in Section 5, the conclusions of our work are presented.

2 Related Work

Nowadays, researchers are still working on the automatic enrichment of ontologies from different information sources. Often the ontology enrichment operations need some previous preparation of the data by hand, and even some adjustments after the process. Thus, most ontology enrichment approaches are semi-automatic processes where efforts are focused on minimizing the manual work. One widespread approach is the use of generic XSL style sheets to perform XSLT transformations from the source information representation language to the target ontology language, generally OWL. When the information to be imported for enriching an ontology is formatted as XML, the elements of the corresponding XML Schema, contained in an XSD document, are typically used to create the linkage between them. This connection is established through the definition of mapping rules from the structure and elements of the XML Schema to the OWL vocabulary. Some methods to generate OWL model documents from XSD documents have been presented in several works. In [8], the authors propose a method based on a set of predefined mapping rules between XML Schema and OWL, and supply a Java toolkit that implements the mapping process. A similar approach is introduced in [2], but with different mapping rules; in this case, the authors provide an online tool, XML2OWL-XSLT, for transforming uploaded XSD documents. The X2OWL tool [9] improves on previous works by addressing complex cases in mapping processes that arise from the reuse of global types and elements. However, this approach is based on the generation of an OWL ontology from scratch and does not deal with the mapping to an existing ontology, nor with references and imports to external ontologies in the Schema. A very good survey of the current tools that support the generation of OWL representations from XML documents for enrichment purposes is presented in [11]. In our work, the transformations from XML diet documents to the OWL language use the XML Schema documents to keep the syntactic model of the diets in the system and for XML validation purposes. Although the use of the generic tools available online for these transformations, such as [2, 8, 19], seemed promising, we decided to define our own XSLT transformations and mapping rules, since some problems arose when trying these tools with our XML Schema, such as a different OWL syntax, version or species than expected. In other cases, those requirements were met, but the generated document needed some manual adjustments to be opened in the ontology editor, or to fit the expected model. Other problems were the handling of annotations or namespaces when importing different ontologies. Finally, in some cases the mapping rules had to be redefined. In contrast to the enrichment process, the population of an ontology can be fully achieved automatically. However, when the instances to populate the ontology come from heterogeneous sources, this process may become more complex and require different mediation techniques, such as instance matching, validation and grouping [4, 16]. In our work, the automatic population of a retrieved XML diet from the repository is possible once the enrichment process has been performed, using the generated indexes classified in the ontology as instances. A simple XSL style sheet binds each


instantiated element to its corresponding concept added to the ontology. Unlike other strategies for retrieving information in the population process, in our approach this process is only carried out when the instantiation of one diet from the repository is required, and the diet is unpopulated when it is no longer needed for the system operation, enhancing the responsiveness and interactivity of the recommender system.

3 NutElCare

NutElCare is a semantic nutritional recommender system to provide healthy diet plans to the elderly based on their nutritional needs and preferences. In this section, we describe the different types of recommendations carried out by the system and the ontologies that comprise the knowledge base. We also present the classification and indexation of diets in the nutritional ontology.

3.1 Nutritional Recommendations

Recommendations in NutElCare are based on two different techniques:

─ Knowledge-based techniques, which use knowledge about users and items to generate a recommendation by reasoning about which items meet the user's requirements. These techniques are used to obtain a healthy diet that fits the nutritional requirements identified from the user profile.
─ Content-based techniques, where the recommendation process consists of learning from the user's alimentary behaviour and recommending items that are similar to their top rated meals or dishes. The content-based recommendation allows the users to make variations on the selected diet to fit their taste preferences or the availability of ingredients, taking into account their allergenic contraindications and what food has already been ingested during the week, offering alternatives to the original diet plan based on these factors. These recommendations are always nutrient guided, providing alternative suggestions of similar conditions, to continue meeting the initial healthy requirements of the diet. The system learns from the user selections to improve further recommendations with inferred user preferences.

3.2 Knowledge Base

The knowledge base of NutElCare is represented as an OWL ontology resulting from the merge of three different ontologies: NUSPro, the Food Ontology and the Nutritional Ontology. The manual edition and visualization of the ontologies is carried out in Protégé [17], the management of the ontologies performed by the NutElCare system is done through the OWL-API Java library [15], and the reasoner used is Pellet [18]. Next, we briefly describe the aforementioned ontologies and their role in the knowledge base in order to provide recommendations.

─ The Nutritional User Profile Ontology (NUSPro) has been designed as an extension of GUMO (General User Model Ontology) [12] for representing users in nutritional domains.


─ The Food Ontology has been obtained from the Food Products branch of the Agrovoc FAO Thesaurus [5] of the United Nations. It has been extended with nutritional properties of food, new food classifications and new food instances.
─ The Nutritional Ontology establishes the concepts related to nutritional restrictions and requirements for user profiles. The central concept of the ontology is the Diet class, whose descendant concepts – or subclasses – are used to classify the different diets existing in the system. This classification has been previously agreed with the nutritional experts that supervise this project. When a new diet is introduced in the system through the user interface, it must be classified according to the taxonomy already established in the ontology. This is achieved through a simple XSLT transformation, using the Java XML library, from the Diets OWL class to a JSP form page, which users fill in to classify the diet. Once the form is filled in, a URL with a new diet Id is generated and inserted into the ontology as a subclass of the terms selected by the user. This Id is an index from the ontology to the document in the XML repository. Fig. 1 outlines this process.

Fig. 1. Diets classification and indexation from the XML diets repository through XSLT.

Having the diets indexed in the ontology, the system is able to perform a knowledge-based recommendation of a diet which fits the user profile, but it is not able to personalize it with the preferences of the specific user, i.e., it cannot achieve the content-based recommendation required to allow the selection of alternatives over the food items contained in the diet. Consequently, the representation of the diets must likewise be handled by the nutritional ontology. The next section explains the representation of diets in the NutElCare system architecture and how this representation supports the content-based recommendation reasoning.


4 Diets Model Design and Information Retrieval

In order to perform recommendations over the contents of one diet, those contents must be represented in the ontology in the form of concepts, relations and individuals. The representation of a diet comprises the representation of each daily ingestion, which in turn contains from three to five intakes (at least breakfast, lunch and dinner), and each intake holds several dishes with several meals, food items and preparations. However, maintaining all this information in the ontology over time, to be used only a small part of that time, can affect the system efficiency. Hence, the data representation of diets was carried out through XML documents based on a previously agreed syntactic structure and terminology defined in an XML Schema, which constitutes the syntactic model of the diets located in the XML repository.

4.1 Ontology Enrichment

The first step to allow the system to reason over the contents of one diet is ontology enrichment, for transferring the diet model of the XML Schema into the nutritional ontology. In this way, we build a semantic model of the diet from the syntactic one. The ontology enrichment process followed in our approach is depicted in Fig. 2. In it, an XSD document with the syntactic model of the XML diets is obtained. Next, the mapping rules between XSD and OWL have to be established and applied through XSLT transformations, generating an OWL model of the diet. Finally, the nutritional ontology is enriched by adding the generated OWL diet model at the corresponding target concept.

Fig. 2. Ontology enrichment process from the XML repository of diets.

The mapping between the XML Schema nodes and the OWL concepts is established through an XSLT transformation using an XSL style sheet, following the rules summarized in Table 1. Note that these rules are defined only for our diet model and some of them differ from those defined in other works.

XSD Nodes                                                      OWL Concepts
xsd:element, with nested elements or at least one attribute   owl:Class + owl:ObjectProperty
other xsd:element                                              owl:ObjectProperty
named xs:complexType                                           owl:Class
named xs:simpleType                                            owl:DatatypeProperty
xs:minOccurs, xs:maxOccurs                                     owl:minCardinality, owl:maxCardinality
xs:choice                                                      owl:unionOf
xs:sequence                                                    owl:intersectionOf

Table 1. Mapping rules between our XSD diet model and the OWL diet model.
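To illustrate the spirit of these rules, the sketch below walks an XSD document with Python's standard ElementTree and emits simplified OWL declarations according to Table 1. It is only a rough approximation of the idea; the actual system applies the rules through XSLT, and the simplified detection of nested content, as well as the "has"-prefixed property names, are assumptions made for this example.

```python
# Rough sketch of applying the Table 1 mapping rules to an XSD document.
import xml.etree.ElementTree as ET

XS = "{http://www.w3.org/2001/XMLSchema}"

def map_schema(xsd_path):
    triples = []
    root = ET.parse(xsd_path).getroot()
    for elem in root.iter(XS + "element"):
        name = elem.get("name")
        if name is None:                      # skip ref="..." elements
            continue
        has_children = elem.find(f"./{XS}complexType") is not None
        has_attributes = elem.find(f".//{XS}attribute") is not None
        if has_children or has_attributes:
            # xsd:element with nested elements or attributes -> owl:Class + owl:ObjectProperty
            triples.append((name, "rdf:type", "owl:Class"))
            triples.append(("has" + name, "rdf:type", "owl:ObjectProperty"))
        else:
            # other xsd:element -> owl:ObjectProperty
            triples.append(("has" + name, "rdf:type", "owl:ObjectProperty"))
    for ct in root.iter(XS + "complexType"):
        if ct.get("name"):                    # named xs:complexType -> owl:Class
            triples.append((ct.get("name"), "rdf:type", "owl:Class"))
    for st in root.iter(XS + "simpleType"):
        if st.get("name"):                    # named xs:simpleType -> owl:DatatypeProperty
            triples.append((st.get("name"), "rdf:type", "owl:DatatypeProperty"))
    return triples

# usage: print(map_schema("diet.xsd"))   # "diet.xsd" is a hypothetical file name
```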

Fig. 3. Graphical model of a diet from its XML Schema generated with XSDDiagram1.

The annotation of diets to be stored in the repository is made through the NutElCare user interface, using a different XSL style sheet, and is parsed against the XML Schema automatically, which minimizes the introduction of annotation errors. A graphical representation of the XML Schema model of diets is displayed in Fig. 3. It is important to note that, in the figure, FoodItem references an individual from the food ontology and is linked to the ontology through the attribute ref="nutelcare:FoodItem" and its corresponding namespace. This way, the food items of the diets are linked to the food ontology concepts. An example of the transformation is the result of applying the mapping rules to the Meal node, obtaining the following OWL Meal model:

1 http://regis.cosnier.free.fr/?page=XSDDiagram


Once the ontology is enriched with the new OWL diet model concepts and relations, it is able to support reasoning with the contents of an XML diet. In the next section, the ontology population and subsequent reasoning processes are introduced.

4.2 On-demand Population and Reasoning

When the information about a diet in an XML document is required by the recommender system, this information is automatically loaded into the ontology until it is no longer necessary. This population process consists in the instantiation of the OWL diet model with the contents of the diet selected by the knowledge-based recommendation. This operation is carried out at runtime using the Java OWL-API, maintaining the diet instantiation in memory throughout the recommendation stage and releasing it from memory when it is no longer needed. In the unpopulation process of the ontology, a copy of the XML diet with the user's food variations is stored in the repository for subsequent retrievals, generating a different historical copy weekly. Reasoning over the ontology allows the system to offer nutrient-guided variations over the food that the instantiated diet contains, in order to personalize diets by adjusting them to user needs and preferences. This is achieved through semantic similarity metrics over the individuals of the food ontology and the application of the general and nutritional restrictions, taking into account the food already ingested by the user in the same week. The process of reasoning over the food ontology using semantic similarity in NutElCare is explained in [7]. For instance, if a user decides to change the meal "Grilled Salmon", the system calculates the possible variations and offers a healthy list of alternatives. This calculation is computed on the basis of the just-mentioned similarity. Let us assume that the user selects "Grilled Tuna" instead. Then, the system checks the future ingestions of the diet, performing new reasoning to check whether it needs to adjust the diet plan for the remaining days of the week, to keep meeting the nutritional requirements. Reasoning over the ontology with the instantiation of diets allows many other knowledge inferences, for instance for monitoring purposes. One example is the calculation of the daily nutritional properties, such as the total calories consumed in one day, checking also whether this value fits the user's nutritional daily requirements and sending notifications if it detects lacks or excesses.
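The population/unpopulation cycle can be pictured schematically as follows, with the ontology reduced to a plain in-memory structure. The real system performs these steps through the Java OWL-API and Pellet, so every class and method name below is illustrative only.

```python
# Schematic view of the on-demand population and unpopulation of diet contents.
import xml.etree.ElementTree as ET

class Ontology:
    def __init__(self):
        self.individuals = {}                  # class name -> set of individual ids

    def populate(self, cls, individual):
        self.individuals.setdefault(cls, set()).add(individual)

    def unpopulate(self, cls):
        self.individuals.pop(cls, None)        # release diet instances after use

def load_diet(ontology, diet_xml):
    """Instantiate the (simplified) diet model with the contents of one XML diet."""
    root = ET.fromstring(diet_xml)
    for meal in root.iter("Meal"):
        ontology.populate("Meal", meal.get("name"))
    return ontology

onto = Ontology()
load_diet(onto, "<Diet><Meal name='Grilled Salmon'/><Meal name='Rice'/></Diet>")
# ... content-based recommendation reasoning would happen here ...
onto.unpopulate("Meal")                        # diet no longer needed in memory
```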

5 Conclusions

OWL and XML can work together to further exploit each other's benefits and overcome their weaknesses. The use of an OWL ontology to represent a large amount of structured data, of which only a small part is demanded at a required


moment, can lead to inefficient responsiveness for end users. In order to lighten the ontology of unnecessary data in the reasoning process, a repository of XML documents can be used to store this information and retrieve a document on demand, only when it is required. NutElCare is a semantic nutritional recommender system whose recommendations are accomplished through several reasoning processes over the ontologies of its knowledge base. For the purpose of keeping the reasoning involved efficient, an XML diets repository is used for storing the diets information. When a new diet is incorporated into the system through the system's user interface, it is automatically annotated in an XML document and classified in the ontology for indexing in further retrievals. In this work, we have explained the information representation, indexation and retrieval of XML diets in our recommender system for their use in nutritional recommendations. We have introduced the concepts of ontology enrichment and population and the main motivations for their use in this project. We have presented the process by which our nutritional ontology is enriched with the concepts of the XML Schema diet model, through an XSLT transformation to the OWL diet model. In this process the XSLT transformation and mapping rules were designed from scratch because none of the available tools fulfilled our requirements. The ontology is populated on demand with the contents of a single diet, being unpopulated when it is no longer needed, and the personalized diets are stored as new XML documents in the repository. The configuration of the NutElCare architecture, connecting this XML diet repository through a seamless and lightweight linkage to the ontologies of the knowledge base, keeps the system timely responsive.

Acknowledgements. This work was partially funded by the Innovation Office from the Andalusian Government under project TIN-6600 Virtra-el and by the 'Programa de Fortalecimiento de I+D+i' de la Universidad de Granada 2014-15.

References

1. Baader, F., Horrocks, I., & Sattler, U. Description logics as ontology languages for the semantic web. In Mechanizing Mathematical Reasoning, pp. 228-248. Springer Berlin Heidelberg (2005).
2. Bohring, H., & Auer, S. Mapping XML to OWL Ontologies. In Leipziger Informatik-Tage, 72, pp. 147-156 (2005).
3. Bühmann, L., & Lehmann, J. Universal OWL axiom enrichment for large knowledge bases. In Knowledge Engineering and Knowledge Management, pp. 57-71. Springer Berlin Heidelberg (2012).
4. Buitelaar, P., & Cimiano, P. (Eds.). Ontology learning and population: bridging the gap between text and knowledge (Vol. 167). IOS Press (2008).
5. Caracciolo, C., Stellato, A., Morshed, A., Johannsen, G., Rajbhandari, S., Jaques, Y., & Keizer, J. The Agrovoc linked dataset. Semantic Web, 4(3), pp. 341-348 (2013).
6. Espín, V., Hurtado, M. V., & Noguera, M. Towards Holistic Support of Active Aging through Cognitive Stimulation, Exercise and Assisted Nutrition. In Ambient Assisted Living and Daily Activities, pp. 312-319. Springer International Publishing (2014).


7. Espín, V., Hurtado, M. V., Noguera, M., & Benghazi, K. Semantic-Based Recommendation of Nutrition Diets for the Elderly from Agroalimentary Thesauri. In Flexible Query Answering Systems, pp. 471-482. Springer Berlin Heidelberg (2013).
8. Ferdinand, M., Zirpins, C., & Trastour, D. Lifting XML schema to OWL. In Web Engineering, pp. 354-358. Springer Berlin Heidelberg (2004).
9. Fernández, M., Cantador, I., López, V., Vallet, D., Castells, P., & Motta, E. Semantically enhanced Information Retrieval: an ontology-based approach. In Web Semantics: Science, Services and Agents on the World Wide Web, 9(4), pp. 434-452 (2011).
10. Ghawi, R., & Cullot, N. Building Ontologies from XML Data Sources. In DEXA Workshops, pp. 480-484 (2009).
11. Hacherouf, M., Bahloul, S. N., & Cruz, C. Transforming XML documents to OWL ontologies: A survey. In Journal of Information Science, 41(2), pp. 242-259 (2015).
12. Heckmann, D., Schwartz, T., Brandherm, B., Schmitz, M., & von Wilamowitz-Moellendorff, M. Gumo – the general user model ontology. In User Modeling 2005, pp. 428-432. Springer Berlin Heidelberg (2005).
13. Hitzler, P., Krötzsch, M., & Rudolph, S. Foundations of semantic web technologies. CRC Press (2011).
14. Hitzler, P., Krötzsch, M., Parsia, B., Patel-Schneider, P. F., & Rudolph, S. OWL 2 web ontology language primer. W3C recommendation, 27(1), 123 (2009).
15. Horridge, M., & Bechhofer, S. The OWL API: A Java API for OWL ontologies. In Semantic Web, 2(1), pp. 11-21 (2011).
16. Petasis, G., Karkaletsis, V., Paliouras, G., Krithara, A., & Zavitsanos, E. Ontology population and enrichment: State of the art. In Knowledge-driven multimedia information extraction and ontology evolution, pp. 134-166. Springer-Verlag (2011).
17. Protégé, OWL Ontology Editor, Protégé 4.1. http://protege.stanford.edu.
18. Sirin, E., Parsia, B., Grau, B. C., Kalyanpur, A., & Katz, Y. Pellet: A practical OWL-DL reasoner. Web Semantics: Science, Services and Agents on the World Wide Web, 5(2), pp. 51-53 (2007).
19. XSD2OWL. http://rhizomik.net/html/redefer/#XSD2OWL.

Browsing the Parse Space

Daniel Rodríguez-Cerezo, José-Luis Sierra

Fac. Informática, Universidad Complutense de Madrid {drcerezo,jlsierra}@fdi.ucm.es

Abstract. Ambiguous context-free grammars can generate many (even infinitely many) parse trees for each input sentence. We will refer to all these parse trees as the parse space of the sentence. Thus, in many settings (computational linguistics, education in compiler construction, etc.) the need arises for browsing this parse space (i.e., for examining different trees in a systematic and ordered way). In this paper we describe a browsing approach that works for arbitrary (even infinitely ambiguous) grammars. The approach, which is based on the well-known Earley's algorithm, sorts the parse space according to the structural complexity of the parse trees, lets users inspect a particular tree, and then jump to the previous and/or the next tree. This approach has been implemented in EvDebugger, an educational system for the learning of the attribute grammar formalism.

Keywords - Parsing, Earley algorithm, Parse Trees, Grammar Debugging Technique

1 Introduction

Nowadays context-free grammars are keystone artifacts for specifying the syntax of both natural and artificial languages [10]. Using context-free grammars, language engineers can describe the structural concepts of a language, establishing in this way the basic skeleton on which to base all the subsequent processing activities. In addition, they can use standard grammar analysis algorithms to check desired properties of the proposed grammar (e.g., whether the grammar is proper – i.e., whether all the symbols are accessible from the initial symbol as well as productive) [1]. Unfortunately, due to their expressive power, many interesting properties become undecidable [4]. One of these properties is ambiguity, i.e., whether the grammar is able to impose several (more than one) alternative structures (i.e., parse trees) on a sentence. Ambiguity is a typical phenomenon to avoid when modeling artificial languages. For this purpose, ambiguity sources must be clearly detected and fixed. These sources can be due to some formalization leak (in this case, they can be solved by changing the grammar), or they can be rooted in the primary conception of the language itself (in this case, it is the language conception that must be changed). A way of avoiding ambiguity is to circumscribe the class of allowable grammars to one for which unambiguity can be ensured (e.g., LL or LR grammars). However, the membership tests for these classes provide little information on the ambiguity sources (on the contrary, they identify sources of non-determinism in the associated parsing algorithms, which may or may not be related to ambiguity). Thus, when analyzing ambiguity, language engineers can benefit from a more empirical approach, by selecting representative sentences and by


examining the parse spaces of these sentences (i.e., the sets of the sentences' parse trees – see Fig. 1)¹. This approach can be particularly valuable for educational purposes, by letting students of compiler construction courses visualize the ambiguity phenomenon and identify the potential sources of ambiguity². In addition, the ability to visualize and inspect ambiguity can also be very valuable in natural language settings, where ambiguity is not a phenomenon to avoid but an intrinsic feature of natural languages [9].

(a)  L → L I | I
     I → i | λ

(Figure content for (b): several of the parse trees that this grammar assigns to the sentence iii.)

Fig. 1. (a) An ambiguous context-free grammar; (b) parse trees associated with the sentence iii with respect to the grammar in (a)

In order to support the aforementioned empirical approach, in this paper we describe a strategy for systematically browsing the parse space of a sentence according to an arbitrary (even infinitely ambiguous³) context-free grammar. The strategy uses the well-known Earley's algorithm [11] to recognize the input sentence, and then it exploits the Earley parse list produced by the algorithm to lazily enumerate all the possible trees. In this enumeration, simpler trees are generated first. Generated trees are backed up to let users move to previous trees. This strategy has been successfully implemented in EvDebugger [13], an educational system focused on the attribute grammar formalism, in order to let students browse the parse space of the input sentence and examine the attribute evaluation process on each visited tree. The rest of the paper is organized as follows: Section 2 describes how parse trees can be constructed from Earley's parse lists. Section 3 describes the browsing engine. Section 4 outlines some related work. Finally, Section 5 presents some conclusions and outlines some lines of future work.

¹ This grammar corresponds to a real case concerning a typical misconception among compiler construction students at UCM: the aim was to model optional sequences of items (e.g., instructions). A common trend among students was to make the items optional instead of the whole sequence.
² For instance, the trees in Fig. 1(b) make apparent how the source of ambiguity is the aforementioned misconception of making individual items optional instead of the sequence itself.
³ For instance, the grammar in Fig. 1(a) is infinitely ambiguous, since the parse space in Fig. 1(b) for the sentence iii contains infinitely many trees. Dealing with this kind of grammars rules out naive approaches, such as generating a list with all the possible parse trees as a previous step to browsing.


2 Constructing Parse Trees from Earley’s Parse Lists

This section presents the concepts of Earley's parsing relevant for our work. The foundations of Earley's algorithm are reviewed in subsection 2.1. Subsection 2.2 shows how parse trees can be retrieved from Earley's parse lists.

init:      <1, 1, S'→•S> is an item
predictor: from <i, j, A→α•Bβ> and a rule B→γ, derive <i, i, B→•γ>
scanner:   from <i, j, A→α•aβ>, if w[i] = a, derive <i+1, j, A→αa•β>
completer: from <i, j, B→γ•> and <j, k, A→α•Bβ>, derive <i, k, A→αB•β>

Fig. 2. Earley's recognition calculus (by <i, j, A→α•β> we denote an item in the item set, by A→α a syntax rule, by S the grammar initial symbol, and by S' the initial symbol of the augmented grammar)

2.1 Earley’s Recognizer

Earley's algorithm can be thought of as a way of computing, for an input sentence w, a set of items of the form <i, j, A→α•β>, where i and j are natural numbers and A→αβ is a syntax rule⁴. The intended meaning of these items is: (i) the input fragment w[j .. i−1] can be derived from α, (ii) it may be followed by another fragment y that can be derived from β, and thus (iii) w[j .. i−1]y may be derived from A. The calculus in Fig. 2 characterizes the Earley item set associated with a sentence w with respect to a grammar. Earley's algorithm implements this recognition calculus by grouping items by their first component, and by disposing these groups in a parse list. There will be |w|+1 groups, in such a way that w is accepted by the algorithm when the group |w|+1 contains an item of the form <|w|+1, 1, S'→S•> (S'→S, where S is the grammar's initial symbol and S' is a new fresh initial symbol, is a convenience rule added to facilitate acceptance recognition as well as the collection of parse trees). In addition, when constructing the parse list, it is possible to add to each item information on the rules in Fig. 2 that generated it (in the case of ambiguous grammars, an item could be generated by more than one rule in the recognition calculus). This information will subsequently be useful for recovering parse trees. As an example, Fig. 3 shows the Earley parse list corresponding to the grammar and the sentence in Fig. 1.

4 In Earley's original work lookahead symbols were also added to items, although it was later shown that they do not substantially affect the algorithm's performance.
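To make the item-set construction concrete, the following Java sketch (illustrative only, with assumed names; it is not the implementation used in EvDebugger) closes an item set under the init, predictor, scanner and completer rules of Fig. 2. Terminals are single lower-case characters and non-terminals upper-case strings, as in the example grammar; for brevity, the per-item record of generating rules on which the browsing engine relies is omitted.

import java.util.*;

final class EarleySketch {

    record Rule(String lhs, String[] rhs) {}

    // Item <i, j, A -> alpha . beta>: rhs[0..dot) derives w[j..i-1] (1-based positions, as in the paper).
    record Item(int i, int j, Rule rule, int dot) {
        boolean complete() { return dot == rule.rhs().length; }
        String next()      { return complete() ? null : rule.rhs()[dot]; }
    }

    static Set<Item> recognize(List<Rule> grammar, String w, String start) {
        Rule augmented = new Rule(start + "'", new String[]{start});   // convenience rule S' -> S
        Set<Item> items = new LinkedHashSet<>();
        items.add(new Item(1, 1, augmented, 0));                       // init
        boolean changed = true;
        while (changed) {                                              // naive closure until fixpoint
            changed = false;
            for (Item it : List.copyOf(items)) {
                if (!it.complete() && Character.isUpperCase(it.next().charAt(0))) {
                    for (Rule r : grammar)                             // predictor
                        if (r.lhs().equals(it.next()))
                            changed |= items.add(new Item(it.i(), it.i(), r, 0));
                } else if (!it.complete() && it.i() <= w.length()
                           && w.charAt(it.i() - 1) == it.next().charAt(0)) {
                    changed |= items.add(                              // scanner
                        new Item(it.i() + 1, it.j(), it.rule(), it.dot() + 1));
                } else if (it.complete()) {
                    for (Item other : List.copyOf(items))              // completer
                        if (other.i() == it.j() && !other.complete()
                            && other.next().equals(it.rule().lhs()))
                            changed |= items.add(
                                new Item(it.i(), other.j(), other.rule(), other.dot() + 1));
                }
            }
        }
        return items;  // acceptance: the set contains <|w|+1, 1, S' -> S .>
    }
}

For the grammar of Fig. 1(a) and the sentence iii, this closure should reproduce the items listed in Fig. 3 (minus the generating-rule annotations).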


Group 1: (1) <1,1,L'→•L> i        (2) <1,1,L→•LI> p(1)      (3) <1,1,L→•I> p(1)
         (4) <1,1,I→•i> p(3)      (5) <1,1,I→•> p(3)        (6) <1,1,L→I•> c(5,3)
         (7) <1,1,L→L•I> c(6,2) c(8,2)   (8) <1,1,L→LI•> c(6,7)   (9) <1,1,L'→L•> c(8,1)
Group 2: (10) <2,1,I→i•> s(4)     (11) <2,1,L→I•> c(10,3)   (12) <2,1,L→L•I> c(10,2) c(15,2)
         (13) <2,2,I→•i> p(12)    (14) <2,2,I→•> p(12)      (15) <2,1,L→LI•> c(10,7) c(14,12)
         (16) <2,1,L'→L•> c(15,1)
Group 3: (17) <3,2,I→i•> s(13)    (18) <3,1,L→LI•> c(17,12) c(21,19)   (19) <3,1,L→L•I> c(18,2)
         (20) <3,3,I→•i> p(19)    (21) <3,3,I→•> p(19)      (22) <3,1,L'→L•> c(18,1)
Group 4: (23) <4,3,I→i•> s(20)    (24) <4,1,L→LI•> c(23,19) c(27,25)   (25) <4,1,L→L•I> c(24,2)
         (26) <4,4,I→•i> p(25)    (27) <4,4,I→•> p(25)      (28) <4,1,L'→L•> c(24,1)

Fig. 3. Earley parse list for the sentence iii and the grammar in Fig. 1(a). Each item has a unique number assigned, as well as the set of rules generating it (i stands for the init rule, p for the predictor rule, c for the completer and s for the scanner; the items involved in the application of each rule are indicated as arguments; e.g., c(23,19) stands for the application of the completer rule on items 23 and 19)

2.2 The Tree Construction Calculus

Earley's parse list contains enough information for recovering any parse tree for the input sentence. Indeed, the recovering process can be characterized by a tree construction calculus like the one shown in Fig. 4. This calculus models how to associate subtree sequences with Earley items. For this purpose, it comprises judgements of the form i ⊢ τ, with i an Earley item and τ a sequence of parse subtrees. The calculus itself can be derived in a straightforward way by examining the scanner and completer rules in the recognition calculus. The tree construction calculus plays a primary role in our browsing approach. Indeed, proof trees in this calculus (tree construction proof trees) show how parse trees can be effectively constructed. Fig. 5 shows an example of a tree construction proof tree for the sentence iii and the grammar in Fig. 1. Notice that tree construction proof trees can be built in two steps:

• First, rules are applied in a top-down way to determine the judgement antecedents in the proof tree (i.e., the i part in a judgement i ⊢ τ). We will call such a skeleton of a tree construction proof tree a tree construction plan. Notice that, by adding information about the application of the completer and scanner rules to the Earley items (see Fig. 3), the Earley parse lists encode all the possible tree construction plans in a compact way.
• Then, the judgement consequents (i.e., the τ part) are synthesized in a bottom-up way.


end:        <i,i,A→•α> ⊢ []    (α ≠ λ)

end-λ:      <i,i,A→•> ⊢ [λ]

scanner*:   if <i,j,A→α•aβ> ⊢ τ and w[i] = a, then <i+1,j,A→αa•β> ⊢ τ a

completer*: if <i,j,B→γ•> ⊢ τ0 and <j,k,A→α•Bβ> ⊢ τ1, then <i,k,A→αB•β> ⊢ τ1 B⟨τ0⟩

Fig. 4. Tree construction calculus
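As a concrete instance (a worked check added here for illustration, not part of the original figure), consider items 4, 10, 3 and 11 of Fig. 3. From <1,1,I→•i> ⊢ [] (by the end rule) and w[1] = i, scanner* yields <2,1,I→i•> ⊢ [i]; combining this with <1,1,L→•I> ⊢ [] (again by the end rule), completer* yields <2,1,L→I•> ⊢ [I⟨i⟩], i.e., a single subtree rooted at I whose only child is the terminal i.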

Fig. 5. Example of a tree construction proof tree for the sentence iii and the grammar in Fig. 1 (rule names are omitted for the sake of simplicity; they can be inferred from the number of child nodes; sequences of subtrees are represented as sequences of node numbers in the final parse tree).

3 The Browsing Engine

The browsing engine lets users explore the parse spaces for arbitrary context-free grammars and input sentences. For this purpose, the engine organizes each parse space as a lazily-generated doubly-linked list of parse trees ordered by number of nodes. Based on the tree construction calculus described in the previous section, the engine maintains a set of partial tree construction plans. When it completes one of these plans, it uses such a plan to synthesize the corresponding parse tree. The next subsections describe the details of this engine. The architecture and the browsing strategy are described in subsection 3.1. Subsection 3.2 describes how the engine completes the next tree construction plan. Finally, subsection 3.3 describes how it synthesizes parse trees from tree construction plans.


3.1 Browser Architecture and Browsing Strategy

The browsing engine maintains two different buffers (see Fig.6):

• A Tree Buffer. This buffer keeps a doubly-linked list with the parse trees already generated by the browsing process.
• A Tree Construction Plan (TCP) Buffer. This buffer indexes the partially-generated tree construction plans. It is initialized with the index element corresponding to the item <|w|+1,1,S'→S•>.

In addition, the engine maintains a cursor pointing to the currently visited tree. When this cursor goes beyond the end of the tree buffer, the engine operates on the TCP buffer in order to complete the tree construction plan for the next parse tree. In order to ensure, on the one hand, the completeness of the generation strategy and, on the other hand, a parse tree as simple as possible, this plan will add as few nodes as possible to the resulting parse tree. Once the tree construction plan is available, the engine synthesizes the associated parse tree.

Fig. 6. Architecture of the browsing engine (the tree buffer holds the already synthesized parse trees; the TCP buffer holds the partial tree construction plans).
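The cursor-driven, lazy organization just described can be sketched in Java as follows (hypothetical names and simplified structure; the actual EvDebugger implementation is not shown in the paper):

import java.util.*;
import java.util.function.Supplier;

final class ParseSpaceBrowser<T> {
    private final List<T> treeBuffer = new ArrayList<>();  // trees already generated, in order of increasing size
    private final Supplier<T> nextTree;                    // completes the next plan and synthesizes its tree
    private int cursor = -1;                               // currently visited tree

    ParseSpaceBrowser(Supplier<T> nextTree) { this.nextTree = nextTree; }

    T next() {
        cursor++;
        if (cursor == treeBuffer.size()) {                 // beyond the last generated tree
            T tree = nextTree.get();                       // complete the smallest pending plan, then synthesize
            if (tree == null) { cursor--; return null; }   // parse space exhausted (finitely ambiguous case)
            treeBuffer.add(tree);
        }
        return treeBuffer.get(cursor);
    }

    T previous() {                                         // previously generated trees are kept in the buffer
        return cursor <= 0 ? null : treeBuffer.get(--cursor);
    }
}

The supplier passed to the constructor encapsulates the TCP buffer operations described in subsections 3.2 and 3.3.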

3.2 Getting the Next Tree Construction Plan

The generation of the next tree construction plan is performed using a breadth-first generation strategy. For this purpose, the TCP buffer contains index elements of the form <n, Os, Ls>, where:
• n is the number of nodes added by the partial tree construction plan to the final parse tree.
• Os is the set of yet unexpanded Earley items in this plan.
• Ls is the set of nodes generated by the end or by the end-λ rule.

These elements are ordered by the number of nodes in increasing order. Thus, in order to get the next tree construction plan:

nextTCP:
  found := false;
  do
    ¬found →
      <n, Os, Ls> := pop(TCP);
      if
        Os = ∅ → found := true;
      [] Os ≠ ∅ →
        θ⟨Rs⟩ := pop(Os);
        if
          Rs = ⊥ →
            if
              θ = <_,_,A→•α> → newNode := new Init(parent: θ);
                               insert(TCP, <n, Os, Ls ∪ {newNode}>);
            [] θ = <_,_,A→•> → newNode := new Init-λ(parent: θ);
                               insert(TCP, <n+1, Os, Ls ∪ {newNode}>);
            end if
        [] Rs ≠ ⊥ →
          foreach R in Rs do
            if
              R = scanner(σ) →
                newRuleNode := new Scanner*(parent: θ);
                newItemNode := new Item(item: σ, parent: newRuleNode);
                insert(TCP, <n+1, Os ∪ {newItemNode}, Ls>);
            [] R = completer(σ,ω) →
                newRuleNode  := new Completer*(parent: θ);
                newItemNode1 := new Item(item: σ, parent: newRuleNode);
                newItemNode2 := new Item(item: ω, parent: newRuleNode);
                insert(TCP, <n+1, Os ∪ {newItemNode1, newItemNode2}, Ls>);
            end if
          end foreach
        end if
      end if
  [] found → return <n, Os, Ls>
  end do

Fig.7. Pseudo-code for getting the next complete parse tree construction plan.

• The engine extracts the first element of the TCP buffer.
• If Os is empty, it stops (the extracted element indexes the desired tree construction plan).
• Otherwise, it extracts one item from Os, applies all the possible tree construction rules on it (this step is sped up by the information on the generating rules stored with the Earley items), constructs new index elements with the results, and inserts these elements in the TCP buffer (these elements are inserted in the appropriate positions to keep the buffer ordered by the number of nodes).

This behavior is detailed by the pseudo-code of Fig. 7. Notice that partial tree construction plans are represented with the arcs reversed (i.e., from children to parents).
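The ordered TCP buffer itself can be sketched as a priority queue (an illustrative sketch with assumed types; the actual data structures are not detailed in the paper):

import java.util.*;
import java.util.function.Function;

final class TcpBufferSketch {
    // Index element <n, Os, Ls>: n nodes added so far, Os unexpanded Earley items, Ls end/end-λ leaves.
    record IndexElement(int n, Deque<Object> openItems, List<Object> leaves) {}

    private final PriorityQueue<IndexElement> buffer =
            new PriorityQueue<>(Comparator.comparingInt(IndexElement::n));

    void insert(IndexElement e) { buffer.add(e); }

    // Pops elements until one with no unexpanded items is found; expanding a popped
    // element (following its scanner/completer back-pointers, as in Fig. 7) is
    // delegated to the caller through the expand function.
    IndexElement nextComplete(Function<IndexElement, List<IndexElement>> expand) {
        while (!buffer.isEmpty()) {
            IndexElement e = buffer.poll();
            if (e.openItems().isEmpty()) return e;   // a complete tree construction plan
            buffer.addAll(expand.apply(e));          // reinsert the expansions, keeping the order by n
        }
        return null;  // only reachable when the parse space is finite and exhausted
    }
}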

3.3 Synthesizing the Parse Tree

As indicated in section 3.1, once the tree construction plan is available, it is possible to synthesize the corresponding parse tree. Since the browsing engine represents plans with reversed arcs, this synthesis can be meaningfully addressed as a data-driven attribute evaluation process (see, for instance, [12]). For this purpose:

• Each completer* rule node has a pointer to its first visited antecedent.
• To compute the subtree sequence of the parent of one of these nodes (the rule consequent), this pointer must be set when the node is visited.

Tree synthesis for <n, Os, Ls>:
  PendingNodes := [];
  foreach R in Ls do
    if
      R = init(θ)   → θ.τ := [];  PendingNodes := PendingNodes ++ [θ];
    [] R = init-λ(θ) → θ.τ := [λ]; PendingNodes := PendingNodes ++ [θ];
    end if
  end foreach;
  Node := pop(PendingNodes);
  do
    Node.parent = ⊥ → return Node.τ;
  [] Node.parent ≠ ⊥ →
      if
        Node.parent = scanner*(θ) →
          let Node = <_,_,A→αa•β> in θ.τ := Node.τ ++ a end let;
          PendingNodes := PendingNodes ++ [θ];
      [] Node.parent = completer*(θ) →
          if
            Node.parent.theOtherArg = ⊥ → Node.parent.theOtherArg := Node;
          [] Node.parent.theOtherArg ≠ ⊥ →
              if
                Node.parent.theOtherArg = <_,_,B→γ•> →
                  let Node = <_,_,A→α•Bβ> in
                    θ.τ := Node.τ ++ B⟨Node.parent.theOtherArg.τ⟩
                  end let
              [] Node.parent.theOtherArg = <_,_,A→α•Bβ> →
                  let Node = <_,_,B→γ•> in
                    θ.τ := Node.parent.theOtherArg.τ ++ B⟨Node.τ⟩
                  end let
              end if;
              Node.parent.theOtherArg := ⊥;
              PendingNodes := PendingNodes ++ [θ];
          end if
      end if;
      Node := pop(PendingNodes);
  end do

Fig. 8. Pseudo-code for the synthesis of parse trees from tree construction plans.

The process itself is initiated by the Ls elements in the plan index (the leaves of the plan) returned by the procedure described in the previous subsection, and it ends when the plan's root is reached. Since plans can share structure, the process also takes care of resetting the first-visited-antecedent pointers of completer* rule nodes once those nodes are fired. The strategy itself is formalized by the pseudo-code of Fig. 8.

4 Related Work

The typical way of dealing with parse spaces is by constructing parse forests (i.e., compact representations of all the possible parse trees of a sentence) [7]. The original work of Earley [11] suggested a way of building parse forests from the references associated with the items in the parse lists. However, Tomita noticed in [17] that Earley's method could be incorrect. Tomita's parsing method, as well as its successors in the GLR parsing branch, incorporates parse forest construction as an essential feature, although, in general, GLR methods fail to deal with arbitrary context-free grammars. The works in [14][15] show how to adapt the Earley parsing style to produce parse forests of the input sentences in a correct way. Contrary to these approaches, ours does not attempt to build compact representations of parse spaces, but to browse these spaces. The browsing strategy presented in this paper could be adapted to work on parse forests instead of on parse lists without substantial modifications. Nevertheless, this could introduce an unnecessary intermediate step, so we find our solution, based on the tree construction calculus, more natural and straightforward. One of the potential applications of our approach is as a tool for helping language engineers to understand grammar ambiguity. Despite its undecidable nature, ambiguity analysis is amenable to being addressed by approximation strategies. In [6] parse forests are analyzed for detecting typical causes of ambiguity. In [18] a non-exhaustive breadth-first search strategy on the sentences generated by the grammar is used. The work in [8] uses regular approximations of the grammars under study to turn the problem of ambiguity detection into a decidable one. In [5] a usability analysis of the techniques involved in ambiguity detection is carried out. While all these techniques are oriented to automating ambiguity detection and/or diagnosis, our approach is more agnostic and user-centered, providing users with a tool that can be used for browsing the


parse space, and which can be useful to diagnose the possible causes of ambiguity of a given grammar construction. Finally, our approach can be useful in educational settings, in order to help students better appreciate and understand the ambiguity phenomenon. In this regard, it differs from other visualization-based comprehension approaches to parsing (e.g., [2][3]), since these approaches usually focus on visualizing/animating the construction of single parse trees, instead of on whole parse spaces. A preliminary version of our approach was implemented in PAG (Prototyping with Attribute Grammars) [16]. However, this preliminary implementation failed to work with infinitely ambiguous grammars. As mentioned above, the approach in its current form has been recently implemented in the EvDebugger system [13].

5 Conclusions and Future Work

In this paper, we have presented a strategy that allows the navigation of the parse space imposed by a grammar on a sentence. The strategy is built on the well-known Earley's algorithm and works for arbitrary (even infinitely ambiguous) context-free grammars. The approach sorts parse trees by structural complexity (the simpler ones first), and it is based on a breadth-first construction of the proof trees for a parse tree construction calculus derived from the Earley parsing calculus. This strategy has been implemented in EvDebugger, an educational IDE for language processor generation based on attribute grammars enriched with a visual debugger, in order to provide the capabilities needed to deal with the ambiguity of the underlying context-free grammars. Currently we are planning to carry out an empirical evaluation of the approach with students of a compiler construction course, in order to assess its usability as well as its efficacy as a tool for comprehending the ambiguity phenomenon and for diagnosing causes of ambiguity. We are also planning to carry out an in-depth analysis of efficiency criteria concerning the browsing approach and to explore alternative construction strategies (e.g., using iterative deepening in connection with the tree construction calculus).

Acknowledgements

This work has been partially supported by the BBVA Foundation (research grant HUM14_251), by Santander-UCM GR3/14 (group number 962022) and by the grant EDU/3445/201.

References

1. Aho, A. V., & Ullman, J. D. The theory of parsing, translation, and compiling, Vol I: Parsing. Prentice-Hall, Inc. (1972).
2. Almeida-Martínez, F. J., Urquiza-Fuentes, J., & Velázquez-Iturbide, J. Visualization of syntax trees for language processing courses. Journal of Universal Computer Science, 15(7), 1546-1561. (2009).
3. Almeida-Martínez, F. J., Urquiza-Fuentes, J., & Velázquez-Iturbide, J. Á. VAST: Visualization of abstract syntax trees within language processors courses. In Proceedings of the 4th ACM symposium on Software visualization, pp. 209-210. ACM. (2008).
4. Bar-Hillel, Y., Perles, M., & Shamir, E. On formal properties of simple phrase structure grammars. STUF-Language Typology and Universals, 14(1-4), 143-172. (1961).


5. Basten, H. J. The usability of ambiguity detection methods for context-free grammars. Electronic Notes in Theoretical Computer Science, 238(5), 35-46. (2009).
6. Basten, H. J., & Vinju, J. J. Parse forest diagnostics with Dr. Ambiguity. In Software Language Engineering, pp. 283-302. Springer Berlin Heidelberg. (2012).
7. Billot, S., & Lang, B. The structure of shared forests in ambiguous parsing. In Proceedings of the 27th annual meeting on Association for Computational Linguistics, pp. 143-151. Association for Computational Linguistics. (1989).
8. Brabrand, C., Giegerich, R., & Møller, A. Analyzing ambiguity of context-free grammars. Science of Computer Programming, 75(3), 176-191. (2010).
9. Clark, A., Fox, C., & Lappin, S. (Eds.). The handbook of computational linguistics and natural language processing. John Wiley & Sons. (2013).
10. Grune, D., & Jacobs, C. J. H. Parsing Techniques, a Practical Guide, 2nd Edition. Monographs in Computer Science. Springer (2007).
11. Earley, J. An efficient context-free parsing algorithm. Communications of the ACM, 13(2), 94-102. (1970).
12. Kennedy, K., & Ramanathan, J. A Deterministic Attribute Grammar Evaluator Based on Dynamic Sequencing. ACM Transactions on Programming Languages and Systems, Vol. 1, No. 1, 142-160. (1979).
13. Rodriguez-Cerezo, D., Henriques, P. R., & Sierra, J. L. Attribute grammars made easier: EvDebugger, a visual debugger for attribute grammars. In Computers in Education (SIIE), 2014 International Symposium on, pp. 23-28. IEEE. (2014).
14. Scott, E. SPPF-style parsing from Earley recognisers. Electronic Notes in Theoretical Computer Science, 203(2), 53-67. (2008).
15. Scott, E., & Johnstone, A. Recognition is not parsing—SPPF-style parsing from cubic recognisers. Science of Computer Programming, 75(1), 55-70. (2010).
16. Sierra, J. L., Fernández-Pampillón, A. M., & Fernández-Valmayor, A. An environment for supporting active learning in courses on language processing. ACM SIGCSE Bulletin, 40(3), 128-132. (2008).
17. Tomita, M. Efficient Parsing for Natural Language. Kluwer Academic, Boston. (1986).
18. Vasudevan, N., & Tratt, L. Detecting ambiguity in programming language grammars. In Software Language Engineering, pp. 157-176. Springer International Publishing. (2013).

Assessing Attribute Grammars' Quality: metrics and a tool

João Cruz1, Pedro Rangel Henriques1 and Daniela da Cruz2

1 Dpt. Informática/Centro Algoritmi, Universidade do Minho, Braga, Portugal   2 Centro Algoritmi, IPCA, Barcelos, Portugal

Abstract. The definition of metrics and their evaluation process is an activity intrinsic to every engineering branch, and it has to do with the need to reason quantitatively about the quality of the developed products. Years ago, software engineers working on the field of formal languages and grammars came up with the idea of measuring grammars. However, not much progress has been made in this direction; there is a clear lack of tools to automate the computation of grammar metrics. In this paper we introduce a tool, GQE, aimed at evaluating a new set of simple metrics for attribute grammars (AGs) in order to help in the assessment of AGs' quality.

1 Introduction

Grammar Engineering [7, 1, 8, 4] is a field in software engineering that involves the application of well-studied software techniques and methods to grammars, just as they are applied to other software products. Such techniques include version control, static analysis, unit testing, software metrics and evolution, and refactoring, among others. Through their application in today's process of developing and maintaining large grammars, better results can be achieved in terms of quality, increasing their efficiency and usability. The objective of this paper is to introduce a software tool, GQE (Grammar Quality Evaluator), that helps in the assessment of grammar quality by performing the automatic evaluation of a large set of metrics. Based on the metrics computed, any grammar engineer will easily be able to reason about the quality of his grammar and to improve it. Although there exist some tools similar to the one described here, such as SynQ by Power and Malloy [9] and the gMetrics tools [3], the proposed GQE system is different because it is extended to deal with AGs (not only CFGs). Despite using similar procedures, it will produce different results because new metrics will be considered for the assessment (notice that, besides the traditional size metrics, we contribute a new set of style and lexicographical metrics). In this paper, before introducing the tool (Section 4), we discuss our proposal for the factors that define the quality of an AG and the characteristics that impact them (Section 2), and introduce the metrics that we intend to evaluate, aimed at quantifying that quality (Section 3).


2 Grammar Quality

As a grammar is a two-fold formalism, used both to define (generate) a language and to guide the recognition of that language, Henriques proposed in [5] a set of factors that shall be considered to assess grammar quality: (while language generator) usability of the grammar as a tool for sentence derivation: ease of understanding (learning), ease of derivation (writing), and ease of maintenance; (while program generator) efficiency of the grammar as a tool for deriving language processors, considering both the efficient parsing of the language sentences (obviously the main concern) and the efficient generation of the language processors.

Usability in general measures the level of satisfaction of the user following the grammar to use/understand the language. The ease of understanding is related to: the identifiers chosen for the non-terminal and terminal symbols and for the attributes; the use of unit productions; the length of the productions' right-hand side (RHS); the notation employed to write the grammar rules (pure or extended BNF); and the type of recursion used in the derivation rules (right or left, direct or indirect recursion). Concerning derivation, its ease depends on: the number of non-terminals and keywords; the number of productions; the use of a consistent notation for the productions as well as a regular recursive schema; and the use of clear identifiers. Regarding maintenance, besides all the factors exposed above, two more elements are important: modularity (a monolithic version is different from one based on imported components) and complexity, as it reflects the way symbols depend on each other.

The efficiency in recognition is measured in terms of: parsing time; and the size and complexity of the parsing tables. The efficiency in the automatic generation of the processor is measured in terms of: the generation time; and the size of the intermediate data structures used for storing and transforming the grammar. The efficiency of the generated processor (the parser), or of the generation process, is affected by factors external to the grammar (like the methods, techniques and algorithms used), but it also depends on the size of the grammar and on its writing style.

After proposing the factors that determine the quality of a grammar, it is crucial to identify the grammar characteristics that have an impact on those factors. For the sake of space, we just sum up our research by listing the grammar characteristics (able to be measured) which we strongly believe have a direct influence on grammar quality: the Identifiers of Symbols or Attributes; the number of Symbols or Attributes, of Productions3 and Unit Productions; the length of the RHS4; the Notation and the Recursion schema used to write the Productions; the Attribute Types and the simplicity of the Semantic Operators; the number of Semantic Rules (attribute evaluation rules, contextual conditions, and translation rules); the Attributive schema (purely synthesized, or mixed (inherited and synthesized));

3 Or Derivation Rules. 4 Right-Hand Side.


the Syntactic/Semantic Complexity (Symbol/Attribute Dependencies); and the Modularity.

3 Grammar Metrics

Once a list of characteristics5 that should be taken into account to appraise the quality of a grammar has been identified, and considering the proposals by other authors cited in the Introduction, we defined a set of parameters (metrics) that can be measured in an objective and systematic way. Below we just list (without rigorous definitions6) the metrics identified so far that are evaluated by GQE, separating those that are extracted from the CFG (concerned with the syntax) from those that are related to the semantics, extracted from the AG.

3.1 CFG Metrics

Assuming G is a well-formed7 Context-Free Grammar, and SDG is the respective Symbol Dependency Graph, we define below the metrics to assess the quality of G, dividing them into 3 groups:
– Size Metrics:
  • (SM1) Grammar size
  • (SM2) Grammar syntax complexity
  • (SM3) Parser size
– Style Metrics:
  • (FM1) form of Recursion
  • (FM2) type of Recursion
  • (FM3) notation
– Lexicographical Metrics:
  • (LM1) clear identifiers for terminal and non-terminal symbols
  • (LM2) clear reserved-words and signs from the language defined by G
  • (LM3) flexibility of terminal-classes
  • (LM4) comment types

Notice that the evaluation of the lexicographic metrics described above (as well as those that will be introduced in the next subsection for attribute identifiers) relies upon the notion of Identifier Derivation from a Concept name. We say that an identifier8 derives from a multi-term concept name (for instance, the identifier DestAddrLst derives from the concept name Destination Address List) if, after applying to it the traditional techniques for identifier splitting and expansion (for a complete definition and details on that topic, please see [2]), one can get a correct concept name in the domain of application of the grammar under analysis.

5 That, to the best of our knowledge, is novel.
6 Please refer to [6] for details.
7 For each N there exists at least one derivation rule with that symbol on the LHS, and there are no unreachable N.
8 For a Terminal, a Non-Terminal, an Attribute or an Attributive Operation.
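As a rough illustration of the size metrics (a simplified sketch with an assumed production representation; GQE itself computes them over ANTLR 4.5 grammars, as explained in Section 4), the following Java method counts terminals, non-terminals, productions, unit productions and the average RHS length:

import java.util.*;

final class SizeMetricsSketch {

    record Production(String lhs, List<String> rhs) {}   // non-terminals are upper-case strings here

    static Map<String, Number> compute(List<Production> productions) {
        Set<String> nonTerminals = new HashSet<>();
        Set<String> terminals = new HashSet<>();
        int unitProductions = 0;
        int rhsLengthSum = 0;
        for (Production p : productions) {
            nonTerminals.add(p.lhs());
            rhsLengthSum += p.rhs().size();
            for (String s : p.rhs())
                (Character.isUpperCase(s.charAt(0)) ? nonTerminals : terminals).add(s);
            // a unit production has a single non-terminal on its right-hand side
            if (p.rhs().size() == 1 && Character.isUpperCase(p.rhs().get(0).charAt(0)))
                unitProductions++;
        }
        Map<String, Number> metrics = new LinkedHashMap<>();
        metrics.put("#T", terminals.size());
        metrics.put("#N", nonTerminals.size());
        metrics.put("#P", productions.size());
        metrics.put("#PU", unitProductions);
        metrics.put("&RHS", productions.isEmpty() ? 0 : (double) rhsLengthSum / productions.size());
        return metrics;
    }
}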


3.2 AG Metrics

Assuming AG is a well-formed Attribute Grammar, and keeping all the metrics introduced before for the assessment of the underlying Context-Free Grammar, we present below the 3 groups of metrics to appraise the quality of an AG:
– Size Metrics:
  • (ASM1) Attribute Grammar size
  • (ASM2) Grammar semantic complexity
– Style Metrics:
  • (AFM1) attributes complexity
  • (AFM2) complexity of the attributive operations
  • (AFM3) evaluation scheme for writing CRs
  • (AFM4) semantic restriction scheme for writing CCs
  • (AFM5) translation scheme for writing TRs
  • (AFM6) style of the language used to write the attributive operations
  • (AFM7) language specificity to write the attributive operations
– Lexicographical Metrics:
  • (ALM1) clear identifiers for attributes
  • (ALM2) clear identifiers for attributive operators

4 A Tool for Metric Evaluation

GQE (Grammar Quality Evaluator) is an attribute grammar compiler (processor) written in Java and generated by AnTLR from the meta-grammar originally designed by Sam Harwell and Terence Parr, afterwards extended by us with the attributes and semantic rules necessary to perform the computation of the size, style and lexicographic metrics we need. The tool accepts as input CF grammars and attribute grammars in the AnTLR version 4.5 format and, as output, returns a list of evaluated grammar metrics. In the future we aim at producing a more elaborate output, providing a grammar quality report, as discussed below. Figure 1 illustrates the output of our tool for a Lisp language AG. On account of strong space constraints it is impossible to show the other test cases we have worked on; other details on GQE, input and output formats, results, etc., can be seen at http://www.di.uminho.pt/ gepl/GQE. Concerning metric evaluation, we can say that: most of the size metrics are evaluated by direct measuring, with the exception of syntax and semantic complexity, which are calculated by building both the Local Dependency and the Symbol Dependency Graphs; the computation of style metrics requires a higher degree of complexity because it is necessary to work on the internal data structures used to represent the production set in order to detect definition patterns; regarding the lexicographic metrics, some external Natural Language Processing tools, such as IdSplitter and WordNet (among others), were used to analyze the clarity of identifiers. Some of the metrics can, at first insight, appear to be useless for grammars written in the AnTLR format, but in the future, when GQE is adapted to accept other kinds of grammar formats, such metrics will give useful information regarding grammar quality.


CFG metrics:
  Size Metrics: #T=4 | #N=3 | #P=6 | #PU=0 | #R=2 | &RHS=1,3 | &RHS-Max=3 | &Alt=2,0 | &Alt-Max=3 | #Mod=0 | FanIn=2,7 | FanOut=1,1 | #RD=7 | &TabLL=15 | &AD-LR=10 | &TabsLR=50;30
  Form Metrics: Rec.Form=FMixedRec | Rec.Type=RecR | Notation=Pure-BNF
  Lex. Metrics: Clear Id=5/5 | Kw/Signs=? | TV Flexi=? | ComType=0

AG metrics:
  Size Metrics: #A=17 | #AI=8 | #AS=9 | #CR=23 | #CC=? | #TR=? | FanIn=0,8 | FanOut=1,2
  Form Metrics: #AComplx=5/17 | #OComplx= | CRScheme=? | CCScheme=? | TRScheme=? | Lg.Style=imperativ | Lg.Speci=standard
  Lex. Metrics: #IdAComp=17/17 | #IdOComp=?

Fig. 1. Metrics results, obtained by GQE, for a CF Grammar and an Attribute Grammar that describe the Lisp language.

5 Conclusion

We have introduced GQE, a tool to support the user in assessing Attribute Grammars. The motivation to start this research project was given in the Introduction; it mainly arose from the need to assess the quality of software products and specifications using quantitative measurements. GQE computes a set of fine-grained metrics, built upon the traditional metrics for CFGs and extended with new metrics that consider the writing style (syntactic recursive schemas, attribute evaluation schemas, notation, etc.) and also the understandability inherent to the identifiers. The tool (developed in Java with the help of AnTLR)


reads any AnTLR grammar (that is, any AG written in the AnTLR metalanguage) and outputs the value of each of the metrics under consideration. The user (most likely a Grammar Engineer) will analyze the values provided and will be able to come up with an assessment. He will then easily be able to transform his original grammar and submit the new one for re-evaluation to understand the eventual improvements. We strongly believe that this approach will help the user in designing equivalent grammar versions (i.e., different grammars that generate precisely the same language) with different quality levels according to different perspectives (enhancing the grammar's usability or its efficiency). Instead of producing a final figure quantifying (or grading) the grammar's quality, we intend to create a database (a kind of CBR system) to collect grammars, the metric values computed, and the experts' gradings, in order to use machine learning algorithms to infer the grading in future cases. This will enable us to provide a more elaborate report on the grammar quality and to suggest a set of possible transformations to improve the grammar (according to different perspectives). As future work we also plan to relate the grammar to the generated language in order to understand to what extent it is also possible to discuss language quality.

References

1. Alves, T.L., Visser, J.: A case study in grammar engineering. In: Software Language Engineering, First International Conference, SLE 2008, Toulouse, France, September 29-30, 2008. Revised Selected Papers. pp. 285–304 (2008)
2. Carvalho, N.R., Almeida, J.J., Henriques, P.R., Pereira, M.J.V.: From source code identifiers to natural language terms. Journal of Systems and Software 100, 117–128 (2015), http://dx.doi.org/10.1016/j.jss.2014.10.013
3. Crepinsek, M., Kosar, T., Mernik, M., Cervelle, J., Forax, R., Roussel, G.: On automata and language based grammar metrics. Comput. Sci. Inf. Syst. 7(2), 309–329 (2010), http://dx.doi.org/10.2298/CSIS1002309C
4. Erbach, G.: Tools for grammar engineering. In: Proceedings of the Third Conference on Applied Natural Language Processing. pp. 243–244. ANLC '92, Association for Computational Linguistics, Stroudsburg, PA, USA (1992)
5. Henriques, P.R.: Brincando às Linguagens com Rigor: Engenharia Gramatical. Tech. rep., Dep. de Informática, E. Engenharia da Universidade do Minho (Oct 2011), habilitation monography presented and discussed in a public session held in April 2012.
6. João Cruz: An Attribute Grammar based System to assess Grammars Quality (PreThesis) (2015)
7. Klint, P., Lämmel, R., Verhoef, C.: Toward an engineering discipline for grammarware. ACM Trans. Softw. Eng. Methodol. 14(3), 331–380 (Jul 2005), http://doi.acm.org/10.1145/1072997.1073000
8. Lämmel, R.: Grammar testing. In: Fundamental Approaches to Software Engineering, 4th Int. Conference, FASE 2001, Held as Part of the Joint European Conferences on Theory and Practice of Software, ETAPS 2001, Genova, Italy, April 2-6, 2001, Proceedings. pp. 201–216 (2001), http://dx.doi.org/10.1007/3-540-45314-8_15
9. Power, J.F., Malloy, B.A.: A metrics suite for grammar-based software. Journal of Software Maintenance 16(6), 405–426 (2004), http://dx.doi.org/10.1002/smr.293

A Syntax-Directed Model Transformation Framework based on Attribute Grammars

Antonio Sarasa-Cabezuelo, José-Luis Sierra

Facultad de Informática. Universidad Complutense de Madrid. {asarasa, jlsierra}@fdi.ucm.es

Abstract. Model transformation is a key aspect of model-driven software development because it enables the automatic derivation of different interpretations of a system model. In many scenarios (e.g., the design of domain-specific languages), models usually have an implicit but identifiable primary tree-like syntactic structure, on which additional secondary relationships are imposed to yield the final model graphs. Therefore, in these scenarios it seems natural to address the processing of these models on the basis of their underlying syntactic structure. For this purpose, we have developed AGT, an experimental transformation framework based on attribute grammars, which takes full advantage of the underlying syntactic structure of source models. For models in which this structure is clearly identifiable, the approach could prove more natural and easier to use and maintain than other, more conventional model transformation approaches (e.g., those based on more standard model transformation languages).

Keywords: Attribute grammar, model-driven development, model transformation

1 Introduction

A key aspect of model-driven software development is model transformation, i.e., the translation of models conforming to a particular meta-model (the source meta-model) into models conforming to another meta-model (the target meta-model). This paper focuses on formal grammars for model transformation, specifically attribute grammars [3]. For this purpose it describes a Java framework for model transformations called AGT (Attribute Grammar Transformer), which enables specifications based on attribute grammars to describe model-to-model transformations in a declarative style. Contrary to the main trend in the grammar-based approach, the use of graph grammars [2], attribute grammars enable syntax-directed transformation processes organized around the tree-like primary structure of the model, while graph grammars adopt a template-based approach in which subgraphs are matched and transformations are applied on the matched subgraphs. Therefore, attribute grammars can be more appropriate for models with a well-distinguished hierarchical structure (e.g., models arising in the modelling of domain-specific languages). In addition, contrary to other works on using attribute grammars for model transformation, like [1],


the proposal described in this paper is not based on canonical textual encodings of the models; instead, it adapts the attribute grammar formalism to work directly on object networks (and, therefore, on the in-memory representation of models). By doing so, the hierarchical primary structure of models is fully exploited to guide the transformation process in a syntax-directed fashion. The rest of the paper is structured as follows. Section 2 gives an overview of AGT. Section 3 describes AGTL, the transformation specification language included in AGT. Finally, section 4 outlines some conclusions and lines of future work.

2 The AGT framework

AGT (Attribute Grammar Transformer) is a model transformation framework based on attribute grammars, implemented in and tightly integrated with the Java language. Indeed, the framework is oriented to transform (or, more generally, to process) models encoded as Java object networks (in this way, meta-models are mirrored on sets of Java classes). This framework specializes Java by providing the following main components: (i) AGTL (Attribute Grammar Transformation Language), a declarative specification language for attribute grammar-based model transformations, (ii) the AGT transformer, a transformation engine that implements the operational semantics of AGTL, and (iii) the AGT runtime, a set of Java utility classes that can be used during transformation. Transformations according to AGT actually integrate two well-differentiated parts: (i) on the one hand, a syntax-directed specification provided as an AGTL attribute grammar, and (ii) on the other hand, additional Java code implementing the specific machinery required to carry out the transformation. The connection between the two parts is given by a semantic class, which implements the semantic functions used in the AGTL specification in terms of the additional Java machinery provided. This organization is analogous to that followed in other syntax-directed approaches to information processing developed in our group (e.g., XLOP for XML processing [5] or JLOP [4] for JSON processing).

3 AGTL: The AGT Specification Language

The core component of AGT is AGTL (Attribute Grammar Transformation Language), a specification language for attribute grammar-based model transformations. This section summarizes the AGTL syntax (subsection 3.1) and the AGTL operational semantics (subsection 3.2), and it gives a small example of using AGTL (subsection 3.3).

3.1 AGTL Syntax

A specification in AGTL is composed of (see Fig 1): (i) a declaration of the non-terminal symbols and their associated attributes, and (ii) a specification of the attribute grammar rules. Each rule includes a context-free part connecting non-terminals with classes of the source model, and semantic equations specifying how to compute the required attributes in the rule context.



Fig 1. Textual syntax of the AGTL language

Each non-terminal has a single associated rule in the AGTL grammar. In other words, in AGTL there is a one-to-one correspondence between non-terminals and the rules having these non-terminals in their LHSs. In addition, AGTL distinguishes between three different types of rules:

• Class rules. Rules applicable to objects of a given class. A class rule specifies the class of objects to which the rule can be applied, a field specification (see below) and, in addition to the set of first-visit equations, a set of cut equations.
• Null rules. Rules applicable to null values.
• Bridge rules. Rules used to forward the processing of an object to a more specific rule depending on its class. Thus, these rules are a simple yet effective way of dealing with inheritance in the source model. The forwarding is carried out by providing a non-terminal name.

Concerning field specifications in class rules, each field specification makes it possible to select the rule to be applied in order to analyze a field of the matched object. The rule to apply is selected by providing a non-terminal name. Concerning semantic equations, the most representative AGTL rules include two different types of equations:

• First-visit equations. These equations are used the first time the rule is applied to an object.
• Cut equations. These equations are used when the rule is applied to an already-analyzed object.

This organization of semantic equations into first-visit and cut packages is necessary to enable the application of AGTL to arbitrary object graphs. Concerning the semantic expressions used in semantic equations, AGTL allows two types of basic expressions: (i) references to attributes in the rule, and (ii) field value extraction expressions, which make it possible to query the values of fields in the objects to which the rules are applied (the operator @ is used on field names). In this way, it becomes apparent that semantic attributes in AGTL do not correspond to class fields,



but represent placeholders in which to store transformation results, as is usual with attribute grammars. Finally, compound expressions are formed by applying semantic functions to simpler expressions.

3.2 AGTL Operational Semantics

With respect to operational semantics, AGTL follows a syntax-directed translation model. More particularly, the transformation is organized in two different stages:

• During the first stage, the analysis stage, syntax rules are applied in order to make the syntactic structure of the source model explicit. This structure is represented as a parse tree that covers the objects of the source model. The nodes of this tree are labelled with non-terminals and they are linked with the corresponding objects in the source model. Its arcs are labelled with field names of those objects.
• During the second stage, the semantic equations are used to find the values of the attributes for the nodes of the parse tree. As usual, the order in which these values are obtained must be consistent with a topological order of the corresponding dependency graph.

In AGT these operational semantics are implemented by the aforementioned AGT transformer.
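A possible in-memory shape for the structure built during the analysis stage is sketched below (assumed, simplified types; the paper does not detail AGT's internal classes):

import java.util.*;

final class AnalysisNode {
    final String nonTerminal;            // non-terminal labelling the node
    final Object sourceObject;           // object of the source model covered by the node
    final Map<String, AnalysisNode> children = new LinkedHashMap<>();  // arcs labelled by field names
    final Map<String, Object> attributes = new HashMap<>();            // attribute values, filled in the second stage

    AnalysisNode(String nonTerminal, Object sourceObject) {
        this.nonTerminal = nonTerminal;
        this.sourceObject = sourceObject;
    }
}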

3.3 An Example

In order to illustrate the use of AGTL, we will address the AGTL specification of the classic example of a simple mapping of class models into relational database schemata. Fig 2 shows the structure of the source and target meta-models.

Fig 2. (a) Source metamodel; (b) Target metamodel

The transformation will map each class in the class model into a table in the relational schema. The table name will match the class name. Column names will match the attribute names in the classes. The types of the columns will depend on the types of the attributes: the type of a column will be the type of the attribute if it is a primitive



type (boolean, integer, char), or "fk " (foreign key) followed by the class name if the type of the attribute is another class.

Fig 3. Example of transformation in AGTL (the specification declares non-terminals with the inherited attribute itables and the synthesized attributes tables, name, cols and col, and uses semantic functions such as mkEmptySetOfTables, mkTables, addTable, mkTable, addCol, mkEmptyListOfCols, mkCol and mkForeignKey)

Fig 3 shows the transformation specified in AGTL. In the specification, the inherited attribute itables is intended to contain the set of tables already constructed, and the synthesized attribute tables the overall set of tables. The class com.agt.core.Array is a utility class of the AGT runtime that is useful for dealing with collections (the AGT transformer automatically wraps collections using this utility class). Objects of this class exhibit two fields (first: the first element of the collection; butfirst: a collection made of the rest of the elements). The implementation of the semantic functions used must be provided to the AGT transformer as a plain Java class (the semantic class of this transformation). In addition, it is also necessary to indicate to this transformer: (i) a root object in the source model, and (ii) a root non-terminal in the grammar. Then, the transformation will proceed according to the AGTL operational semantics outlined in the previous subsection.
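As an illustration of what such a semantic class might look like (a hypothetical class with assumed types and signatures; only the semantic function names mentioned in the example specification are taken from the paper), a minimal sketch is:

import java.util.*;

public class Class2RelationalSemantics {

    // Assumed, simplified target meta-model types for this sketch.
    public record Column(String name, String type) {}
    public record Table(String name, List<Column> columns) {}

    public Set<Table> mkEmptySetOfTables()                  { return new LinkedHashSet<>(); }
    public Set<Table> mkTables(Set<Table> tables)           { return tables; }
    public Set<Table> addTable(Set<Table> tables, Table t)  { tables.add(t); return tables; }
    public Table mkTable(String name, List<Column> cols)    { return new Table(name, cols); }
    public List<Column> mkEmptyListOfCols()                 { return new ArrayList<>(); }
    public List<Column> addCol(List<Column> cols, Column c) { cols.add(c); return cols; }
    public Column mkCol(String name, String type)           { return new Column(name, type); }
    public String mkForeignKey(String className)            { return "fk " + className; }
}

Under this sketch, a class Person with a primitive attribute age of type integer and a structured attribute address typed with the class Address would be mapped to a table Person with columns (age, integer) and (address, fk Address).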



4 Conclusions and future work

This paper has presented a model transformation framework based on attribute grammars which is highly integrated in Java. The aim of the framework is similar to that of the approach presented in [1], but differs in that the attribute grammars operate on object networks instead of on textual encodings of the source models. This makes it possible to exploit the primary hierarchical structure of models in a more natural way. Indeed, the approach can be particularly well-suited for source models with well-distinguished primary tree-like structures (like those arising, for instance, when modeling domain-specific languages). We are currently working on the static typing of AGTL specifications. As future lines of work we envision the integration of AGT with standard metamodeling proposals (e.g., MOF or Ecore), the creation of an IDE for AGT, the efficiency analysis of the AGT transformer, and an in-depth empirical evaluation of our proposal. This evaluation will include a comparison with other proposals on a representative workbench of examples, as well as an empirical study concerning usability and efficacy with developers.

Acknowledgements

We would like to thank Juan-Pablo Gracia-Benitez for contributing to a preliminary implementation of the framework. This work has been partially supported by the BBVA Foundation (research grant HUM14_251) and by Santander-UCM GR3/14 (group number 962022).

References

1. Dehayni, M.; Féraud, L. An Approach of Model Transformation Based on Attribute Grammars. Object-Oriented Information Systems. Lecture Notes in Computer Science 2817, 412-423. 2003
2. Grzegorz Rozenberg, Ed., Handbook of graph grammars and computing by graph transformation: volume I. foundations. World Scientific Publishing Co., 1997.
3. Paakki, J. Attribute Grammar Paradigms – A High-Level Methodology in Language Implementation. ACM Computing Surveys, 27(2), 196-255. 1995
4. Sarasa-Cabezuelo, A.; Sierra, J.L. Grammar-Driven Development of JSON Processing Applications. FedCSIS 2013, 1545-1552. 2013.
5. Sarasa-Cabezuelo, A.; Sierra, J.L. The grammatical approach: A syntax-directed declarative specification method for XML processing tasks. Computer Standards & Interfaces, 35(1), 114-131. 2013
