I Iberian Sltech 2009

Total Page:16

File Type:pdf, Size:1020Kb

I Iberian Sltech 2009 I Iberian SLTech 2009 Proceedings of the I Joint SIG-IL/Microsoft Workshop on Speech and Language Technologies for Iberian Languages Porto Salvo, Portugal, September 3-4, 2009 Edited by: António Teixeira, Miguel Sales Dias & Daniela Braga Published by: Designeed ISBN: 978-989-96278-1-9 Portuguese National Library Number: 298538/09 Preface The ISCA Special Interest Group on Iberian Languages (SIG-IL) board, pursuing the aims of organizing conferences, schools and workshops, promoting industry/university collaboration and offering a forum to discuss opportunities in research and industry applications in the field of Speech and Language Technology, decided to organize a new event, I Iberian SLTech - I Joint SIG-IL/Microsoft Workshop on Speech and Language Technologies for Iberian Languages. A practical way of making possible the event in 2009 was to join efforts with Microsoft Language Development Center (MLDC), in Portugal, associating the SIG-IL event to a new edition of their past Workshops (I Microsoft Workshop on Speech Technology - Building bridges between industry and academia, Porto Salvo, May 2, 2007 and Propor 2008 Special Session: Applications of Portuguese Speech and Language Technologies, September 10, 2008, Curia, Portugal). The event also had the support of Red Temática en Tecnologías del Habla (RTTH). The main objective for this new event is to create a forum for the exchange of ideas and promote collaboration in the fields of Speech and Language Technologies for all the institutions who work on Iberian Languages. It also our goal to continue this event in the following years and that it may be held in the different places of the SIG-IL geography. The Organization is honoured to receive Alex Acero as keynote speaker who will bring us the following topic: “Building accurate and user-friendly speech sys- tems”. Alex Acero is the Research Area Manager of the Speech Group in Microsoft Research, in Redmond (USA) and is one of the world leadings researchers in Speech Technology, directing an organization with 70 researchers in audio, speech, multi- media, communication, natural language, and information retrieval. He is also an affiliate Professor of Electrical Engineering at the University of Washington, Seat- tle. Dr. Acero is author of the books “Acoustical and Environmental Robustness in Automatic Speech Recognition” (Kluwer, 1993) and “Spoken Language Processing” (Prentice Hall, 2001). Our Scientific Committee selected the following contributions for presentation: 21 posters (3 from students), 4 recent PhDs, 2 demos and 8 groups and projects. The posters covered 6 areas: systems and applications; resources and tools; speech recog- nition; speech synthesis; gender, speaker and language recognition and language processing. These contributions were edited in online, CD and paper proceedings. We accepted 18 contributions with authors from Portugal, 15 from Spain, 3 from Ger- many, 1 from USA, 1 from Cuba and 1 from Brazil. We would like to send our special thanks to all the Chairs and Scientific Com- mittee members that helped us in the proposals’ revision and conference organization in such a tight schedule, to our keynote speaker for bringing us such an interesting topic and to all authors for submitting the most recent advances of their work on Iberian languages speech and language processing. We expect that this is the first edition of a growing annual event that aims to attract more researchers in all countries that work on the latest advances on speech and language processing for the Iberian Languages. The Iberian SL Tech 2009 Organizers, António Teixeira, SIG-IL Chair, IEETA Daniela Braga, Microsoft Committees General Chairs António Teixeira, SIG-IL Chair, Universidade de Aveiro/IEETA, Portugal Miguel Dias, Microsoft, Portugal Daniela Braga, Microsoft, Portugal Local Organization Committee Daniela Braga, Microsoft, Portugal Francisco Pires, Microsoft, Portugal Bruno Reis Bechtlufft, Microsoft, Portugal Demos Chair Rubén San-Segundo, SIG-IL ISCA Liaison, Universidad Politécnica de Madrid, Spain Program Chair Aldebaro Klautau, SIG-IL Secretary, Universidade Federal do Pará, Brazil Juan Arturo Nolazco Flores, SIG-IL Vice-Chair, Tecnológico de Monterrey, Mexico Carmen García Mateo, University of Vigo, Spain Presentations and Panels Chair António Teixeira, SIG-IL Chair, Universidade de Aveiro/IEETA, Portugal Daniela Braga, Microsoft, Portugal Scientific Committee Abel Herrera Camacho, FI-UNAM, México Alberto Abad, INESC-ID Lisboa, Portugal Alberto Simões, Universidade do Minho, Portugal Aldebaro Klautau, Universidade Federal do Pará, Brazil Alex Acero, Microsoft Research, USA Alexander Gelbuck, CIC - IPN, México Alfonso Ortega, Universidad de Zaragoza, Spain Alvaro Iriarte, Universidade do Minho, Portugal Amália Andrade, CLUL/Universidade de Lisboa, Portugal Andreia Rauber, Universidade do Minho, Portugal Antonia Marti Antonín, Universidad de Barcelona, Spain António Bonafonte, Universitat Politècnica de Catalunya, Spain António Branco, FCUL, Portugal António Serralheiro, L2F INESC-ID and Academia Militar, Portugal António Teixeira, IEETA/Universidade de Aveiro, Portugal Ascensión Gallardo, Universidad Carlos III de Madrid, Spain Belinda Maia, FLUL, Portugal Carlos Meneses, ISEL, Portugal Carlos Teixeira, FCUL, Portugal Céu Viana, FLUL, Portugal Ciro Martins, IEETA/Universidade de Aveiro, Portugal Daniela Braga, MLDC/Microsoft, Portugal Diana Santos, SINTEF, Norway Doroteo Torres, Universidad Autónoma de Madrid, Spain Encarna Segarra, Universidad Politécnica de Valencia, Spain Eva Navas, Universidad del País Vasco, Spain Fábio Violaro, Universidade Estadual de Campinas - UNICAMP, Brazil Fernando Gil Resende Jr., Universidade Federal do Rio de Janeiro, Brazil Fernando Perdigão, Universidade de Coimbra, Portugal Francisco Campillo, Universidade de Vigo, Spain Francisco Vaz, IEETA/Universidade de Aveiro, Portugal Frank Seide, Microsoft Research Asia - Speech Group Hugo Meinedo, INESC-ID Lisboa, Portugal Inmaculada Hernaez Rioja, Universidad del País Vasco, Spain Isabel Trancoso, INESC-ID/IST, Portugal João Veloso, Faculdade de Letras da Universidade do Porto, Portugal Jorge Baptista, Universidade do Algarve, Portugal José Manuel Pardo, Universidad Politécnica de Madrid, Spain José Ramón Calvo de Lara, CENATAV, Cuba José Teixeira, Universidade do Minho, Portugal Juan Manuel Montero, Universidad Politécnica de Madrid, Spain Juan Nolazco Flores, Tecnológico de Monterrey, Mexico Luís Caldas Oliveira, INESC-ID/IST, Portugal Luis Hernandez, Universidad Politécnica de Madrid, Spain Luis Villaseñor Pineda, INAOE, Mexico Manuel Montes y Gómez, INAOE, Mexico Maria Aldina Marques, Universidade do Minho, Portugal Maria Helena Mira Mateus, ILTEC, Portugal Mário Silva, FCUL, Portugal Nestor Yoma, Universidad de Chile, Chile Nuno Mamede, INESC-ID/IST, Portugal Paula Carvalho, FCUL, Portugal Paulo Quaresma, Universidade de Évora, Portugal Plínio Barbosa, Universidade Estadual de Campinas (UNICAMP), Brazil Ranniery Maia, NICT, Japan Ricardo de Córdoba, Universidad Politécnica de Madrid, Spain Rubén San Segundo, Universidad Politécnica de Madrid, Spain Sérgio Paulo, INESC-ID Lisboa, Portugal Thomas Pellegrini, INESC-ID Lisboa, Portugal Vera Strube de Lima, Pontifícia Universidade Católica do Rio Grande do Sul, Brazil Violeta Quental, Pontifícia Universidade Católica do Rio de Janeiro, Brazil Xavier Gómez Guinovart, Universidade de Vigo, Spain Xosé Ramón Freixeiro Mato, Universidade da Coruña, Spain Keynote Speaker Building accurate and user-friendly speech systems 3 Alex Acero Microsoft, USA Selected Posters Systems & Applications A task-independent stochastic dialog manager for the EDECAN project 9 Francisco Torres Goterris Departamento de Sistemas Informáticos y Computación Universidad Politécnica de Valencia, Spain Terminology extraction from English-Portuguese and English-Galician parallel cor- pora based on probabilistic translation dictionaries and bilingual syntactic patterns 13 Alberto Simões Department of Computer Science Universidade do Minho, Portugal Xavier Gómez Guinovart Department of Translation and Linguistics Universidade de Vigo, Spain A Hierarchical Architecture for Audio Segmentation in a Broadcast News Task 17 Mateu Aguilo, Taras Butko, Andrey Temko, Climent Nadeu Department of Signal Theory and Communications, TALP Research Center Universitat Politècnica de Catalunya, Spain Browsing Multilingual Making-Ofs 21 Carlos Teixeira LASIGE, University of Lisbon, Portugal Ana Respício OR Center, DI, University of Lisbon, Portugal Catarina Ribeiro LASIGE, University of Lisbon, Portugal Resources & Tools A Catalan Broadcast Conversational Speech Database 27 Henrik Schulz, José A. R. Fonollosa Department of Signal Theory and Communications Technical University of Catalunya (UPC), Spain An XML Resource Definition for Spoken Document Retrieval 31 Luis Javier Rodríguez-Fuentes, Germán Bordel, Arantza Casillas Mikel Penagarikano & Amparo Varona Grupo de Trabajo en Tecnologías Software (GTTS) Universidad del País Vasco, Spain CORPOR System: Corpora of the Portuguese Language as spoken in São Paulo 35 Zilda Zapparoli Universidade de São Paulo (USP), Brazil Machine Translation of the Penn Treebank to Spanish 39 Martha Alicia Rocha Departamento de Sistemas y Computación Instituto Tecnológico de León, México Joan Andreu Sánchez Instituto Tecnológico de Informática Universidad Politécnica de Valencia, Spain Adapting the Unisyn Lexicon to Portuguese: Preliminary issues in the development of LUPo 43 Simone Ashby, José Pedro Ferreira, Sílvia
Recommended publications
  • The Race of Sound: Listening, Timbre, and Vocality in African American Music
    UCLA Recent Work Title The Race of Sound: Listening, Timbre, and Vocality in African American Music Permalink https://escholarship.org/uc/item/9sn4k8dr ISBN 9780822372646 Author Eidsheim, Nina Sun Publication Date 2018-01-11 License https://creativecommons.org/licenses/by-nc-nd/4.0/ 4.0 Peer reviewed eScholarship.org Powered by the California Digital Library University of California The Race of Sound Refiguring American Music A series edited by Ronald Radano, Josh Kun, and Nina Sun Eidsheim Charles McGovern, contributing editor The Race of Sound Listening, Timbre, and Vocality in African American Music Nina Sun Eidsheim Duke University Press Durham and London 2019 © 2019 Nina Sun Eidsheim All rights reserved Printed in the United States of America on acid-free paper ∞ Designed by Courtney Leigh Baker and typeset in Garamond Premier Pro by Copperline Book Services Library of Congress Cataloging-in-Publication Data Title: The race of sound : listening, timbre, and vocality in African American music / Nina Sun Eidsheim. Description: Durham : Duke University Press, 2018. | Series: Refiguring American music | Includes bibliographical references and index. Identifiers:lccn 2018022952 (print) | lccn 2018035119 (ebook) | isbn 9780822372646 (ebook) | isbn 9780822368564 (hardcover : alk. paper) | isbn 9780822368687 (pbk. : alk. paper) Subjects: lcsh: African Americans—Music—Social aspects. | Music and race—United States. | Voice culture—Social aspects— United States. | Tone color (Music)—Social aspects—United States. | Music—Social aspects—United States. | Singing—Social aspects— United States. | Anderson, Marian, 1897–1993. | Holiday, Billie, 1915–1959. | Scott, Jimmy, 1925–2014. | Vocaloid (Computer file) Classification:lcc ml3917.u6 (ebook) | lcc ml3917.u6 e35 2018 (print) | ddc 781.2/308996073—dc23 lc record available at https://lccn.loc.gov/2018022952 Cover art: Nick Cave, Soundsuit, 2017.
    [Show full text]
  • The Race of Sound Refiguring American Music a Series Edited by Ronald Radano, Josh Kun, and Nina Sun Eidsheim Charles Mcgovern, Contributing Editor the Race of Sound
    The Race of Sound Refiguring American Music A series edited by Ronald Radano, Josh Kun, and Nina Sun Eidsheim Charles McGovern, contributing editor The Race of Sound Listening, Timbre, and Vocality in African American Music Nina Sun Eidsheim Duke University Press Durham and London 2019 © 2019 Nina Sun Eidsheim All rights reserved Printed in the United States of America on acid-free paper ∞ Designed by Courtney Leigh Baker and typeset in Garamond Premier Pro by Copperline Book Services Library of Congress Cataloging-in-Publication Data Title: The race of sound : listening, timbre, and vocality in African American music / Nina Sun Eidsheim. Description: Durham : Duke University Press, 2018. | Series: Refiguring American music | Includes bibliographical references and index. Identifiers:lccn 2018022952 (print) | lccn 2018035119 (ebook) | isbn 9780822372646 (ebook) | isbn 9780822368564 (hardcover : alk. paper) | isbn 9780822368687 (pbk. : alk. paper) Subjects: lcsh: African Americans—Music—Social aspects. | Music and race—United States. | Voice culture—Social aspects— United States. | Tone color (Music)—Social aspects—United States. | Music—Social aspects—United States. | Singing—Social aspects— United States. | Anderson, Marian, 1897–1993. | Holiday, Billie, 1915–1959. | Scott, Jimmy, 1925–2014. | Vocaloid (Computer file) Classification:lcc ml3917.u6 (ebook) | lcc ml3917.u6 e35 2018 (print) | ddc 781.2/308996073—dc23 lc record available at https://lccn.loc.gov/2018022952 Cover art: Nick Cave, Soundsuit, 2017. © Nick Cave. Photo by James Prinz Photography. Courtesy of the artist and Jack Shainman Gallery, New York. This title is freely available in an open access edition thanks to generous support from the ucla Library. This book is published under the Creative Commons Attribution-NonCommercial- NoDerivs 3.0 United States (cc by-nc-nd 3.0 us) License, available at https://creativecommons.org/licenses/by-nc-nd/3.0/us/.
    [Show full text]
  • Indian Language Screen Readers and Syllable Based Festival Text-To-Speech Synthesis System
    Indian Language Screen Readers and Syllable Based Festival Text-to-Speech Synthesis System Anila Susan Kurian, Badri Narayan, Nagarajan Madasamy, Ashwin Bellur, Raghava Krishnan, Kasthuri G., Vinodh M.V., Hema A. Murthy IIT-Madras, India {anila,badri,nagarajan,ashwin,raghav,kasthuri,vinodh}@lantana.tenet.res.in [email protected] Kishore Prahallad IIIT-Hyderabad, India [email protected] Abstract on others to access common information that oth- ers take for granted, such as newspapers, bank state- This paper describes the integration of com- ments, and scholastic transcripts. Assistive tech- monly used screen readers, namely, NVDA nologies (AT), enable physically challenged persons [[NVDA 2011]] and ORCA [[ORCA 2011]] with Text to Speech (TTS) systems for Indian lan- to become part of the mainstream in the society. guages. A participatory design approach was A screen reader is an assistive technology poten- followed in the development of the integrated tially useful to people who are visually challenged, system to ensure that the expectations of vi- visually impaired, illiterate or learning disabled, sually challenged people are met. Given that to use/access standard computer software, such as India is a multilingual country (22 official lan- Word Processors, Spreadsheets, Email and the Inter- guages), a uniform framework for an inte- net. grated text-to-speech synthesis systems with screen readers across six Indian languages are Over the last three years, Indian Institute of Tech- developed, which can be easily extended to nology, Madras (IIT Madras) [[Training for VC, other languages as well. Since Indian lan- IITM 2008 ]], has been conducting a training pro- guages are syllable centred, syllable-based gramme for visually challenged people, to enable concatenative speech synthesizers are built.
    [Show full text]