Jaramillo Zhingre, José Ubertino.Pdf

Total Page:16

File Type:pdf, Size:1020Kb

Jaramillo Zhingre, José Ubertino.Pdf UNIVERSIDAD NACIONAL DE LOJA Área de la Energía, las Industrias y los Recursos Naturales no Renovables Carrera de Ingeniería en Sistemas TEMA: “CONSTRUCCIÓN DE UN SISTEMA DE DIÁLOGO COMPLETO, EN LENGUA ESPAÑOLA, PARA LA BIBLIOTECA BRAILLE DEL HONORABLE CONSEJO PROVINCIAL DE LOJA, EL CUAL SE ENCARGARA DE PROPORCIONAR INFORMACIÓN BIBLIOGRÁFICA A LAS ESTUDIANTES NO VIDENTES” Tesis previa a la obtención del Título de Ingeniero en Sistemas Autor: José Ubertino Jaramillo Zhingre. Director: Ing. Wilson Vélez LOJA - ECUADOR 2011 UNIVERSIDAD NACIONAL DE LOJA Sistema Automático de Información Bibliográfica CERTIFICACIÓN Ing. Wilson Vélez DOCENTE DE LA CARRERA DE INGENIERÍA EN SISTEMAS DE LA UNIVERSIDAD NACIONAL DE LOJA, DIRECTOR DE TESIS C E R T I F I C A: Que el Señor José Ubertino Jaramillo Zhingre, realizo el trabajo de investigación titulado “CONSTRUCCIÓN DE UN SISTEMA DE DIÁLOGO COMPLETO, EN LENGUA ESPAÑOLA, PARA LA BIBLIOTECA BRAILLE DEL HONORABLE CONSEJO PROVINCIAL DE LOJA, EL CUAL SE ENCARGARA DE PROPORCIONAR INFORMACIÓN BIBLIOGRÁFICA A LAS ESTUDIANTES NO VIDENTES” bajo mi dirección y asesoramiento, mismo que fue revisado, enmendado y corregido minuciosamente. En virtud que la Tesis reúne, a satisfacción, las cualidades de fondo y forma exigidas para un trabajo de este nivel, autorizo su presentación, sustentación y defensa ante el tribunal respectivo. Loja, Septiembre del 2011 …………………………………………. Ing. Wilson Vélez DIRECTOR DE TESIS II UNIVERSIDAD NACIONAL DE LOJA Sistema Automático de Información Bibliográfica AUTORÍA Las ideas conceptos y opiniones que se exponen en el trabajo de investigación que se presenta, son de absoluta responsabilidad del autor, excepto las que se encuentran debidamente citadas. …………………………………. José Ubertino Jaramillo Zhingre III UNIVERSIDAD NACIONAL DE LOJA Sistema Automático de Información Bibliográfica AGRADECIMIENTO Agradezco primeramente a Dios por haberme permitido llegar hasta donde estoy ya que Él es la fortaleza y la luz que me guía en los senderos de la vida, ya que sin Él no soy nada. A la Universidad Nacional de Loja, Centro Educativo donde se me permitió consolidar mi formación académica, gratitud profunda a la Carrera de Ingeniería en Sistemas, por los conocimientos en ella recibidos. Agradecimientos especiales a la Lic. Sandra Carrera, por sus consejos y colaboración incondicional en la realización de las pruebas del Sistema en la Biblioteca Braille Parlante del Honorable Consejo Provincial de Loja A mis familiares que estuvieron ahí cuando más los necesite, amigos, compañeros y todas las personas que de una u otra manera ha contribuido en el desarrollo de este gran sueño. Gracias a todos Uds. IV UNIVERSIDAD NACIONAL DE LOJA Sistema Automático de Información Bibliográfica DEDICATORIA. A mis padres, que en todo momento de la vida me brindaron su apoyo incondicional y me enseñaron que en este mundo todo se puede alcanzar mientras seas constante y no te des por vencido jamás, a mis hermanas, amigos y todos los que estuvieron presentes cuando más los necesite. José V UNIVERSIDAD NACIONAL DE LOJA Sistema Automático de Información Bibliográfica CESIÓN DE DERECHOS José Ubertino Jaramillo Zhingre, autor de la tesis denominada “CONSTRUCCIÓN DE UN SISTEMA DE DIÁLOGO COMPLETO, EN LENGUA ESPAÑOLA, PARA LA BIBLIOTECA BRAILLE DEL HONORABLE CONSEJO PROVINCIAL DE LOJA, EL CUAL SE ENCARGARA DE PROPORCIONAR INFORMACIÓN BIBLIOGRÁFICA A LAS ESTUDIANTES NO VIDENTES”, cedo los derechos correspondientes a la mencionada tesis a favor de la Universidad Nacional de Loja, para que ésta institución haga uso en lo que creyere conveniente. Jose Ubertino Jaramillo Zhingre VI UNIVERSIDAD NACIONAL DE LOJA Sistema Automático de Información Bibliográfica RESUMEN Los sistemas de diálogo son programas informáticos que se diseñan con la finalidad de emular a un ser humano en un diálogo oral con otra persona. Actualmente, diversas empresas e instituciones usan estos sistemas para proporcionar información y otros servicios de forma automática. En la presente tesis, titulada “CONSTRUCCIÓN DE UN SISTEMA DE DIÁLOGO COMPLETO, EN LENGUA ESPAÑOLA, PARA LA BIBLIOTECA BRAILLE DEL HONORABLE CONSEJO PROVINCIAL DE LOJA, EL CUAL SE ENCARGARA DE PROPORCIONAR INFORMACIÓN BIBLIOGRÁFICA A LAS ESTUDIANTES NO VIDENTES”, se expone el estado del arte en el área de los sistemas de diálogo y se presenta el trabajo realizado en el diseño e implementación de los módulos de un sistema de diálogo determinado. La tesis aporta el desarrollo de un sistema de dialogo con entrada y salida en lengua española y para una tarea de Dominio semántico restringido. Dicho sistema está constituido por los módulos de comprensión del Lenguaje, gestor del dialogo y de generación de respuestas en lenguaje natural. Dado el objetivo central de la tesis, el desarrollo del módulo gestor de diálogo ha sido el principal trabajo ya que es la base fundamental de todo sistema de dialogo. La aplicación está orientada al sector de bibliotecas Braille para personas no videntes, la misma que permitirá principalmente poder consultar a un estudiante no vidente sobre algún tema en particular, dicho tema dependerá de si se encuentra o no el libro ingresado en la Base de datos, además permitirá la administración de libros permitiendo agregar, modificar o actualizar su contenido facilitando así la usabilidad del sistema ya que no se quedara limitado a un dominio inicial sino que este ira incrementando su dominio de acuerdo a lo existente en su base de datos, además cuenta con un módulo generador de estadísticas sobre qué temas son mas consultados, que materias son más frecuentes y el número de consultas que se generan permitiendo así la administración y adquisición correcta y adecuada de libros de acuerdo a lo que requieren los estudiantes. Como parte fundamental para el desarrollo de este proceso de investigación, se empezó con la determinación del tema, planeamiento de los objetivos y la selección de una metodología la cual permitió la planificación de procedimientos que se llevaron a cabo mediante métodos, técnicas, análisis, diseño, programación y validación del software. La metodología de desarrollo de software seleccionada para el presente proyecto fue ICONIX, la misma que hace un fuerte uso del UML lo que permitirá realizar un mejor diseño de la aplicación. VII UNIVERSIDAD NACIONAL DE LOJA Sistema Automático de Información Bibliográfica ABSTRACT Dialogue systems are computer programs that are designed in order to emulate a human being in a spoken dialogue with another person. Currently, several companies and institutions use these systems to provide information and other services automatically. In this thesis, entitled "CONSTRUCTION OF A COMPLETE SYSTEM OF DIALOGUE IN SPANISH LANGUAGE, BRAILLE LIBRARY FOR THE PROVINCIAL COUNCIL OF THE HONORABLE of Loja, which is responsible BIBLIOGRAPHIC INFORMATION PROVIDED TO STUDENTS Blind," outlines the state of art in the area of dialogue systems and presents the work done in the design and implementation of the modules of a given dialog. The thesis provides the development of a system with input and output dialogue in Spanish and for a task restricted semantic domain. This system consists of modules Language understanding, dialogue manager and response generation in natural language. Given the central aim of the thesis, the development of dialogue management module has been the main work and that is the foundation of any system of dialogue. The sector-oriented application is Braille library for the blind, the same that will mainly be able to consult a blind student on a particular issue, this issue will depend on whether or not the book is entered into the database, management will also allow books to add, modify or update the content thus facilitating the usability of the system and not be limited to an initial domain but this will increment its domain according to your existing database, also has with a statistics generator module which issues are top lists, which are more common materials and the number of queries that are generated allowing acquisition management and proper and adequate books according to what students need. As a key part in developing this research process began with the determination of the issue, planning objectives and the selection of a methodology which allowed for the planning of procedures performed using methods, techniques, analysis, design programming and software validation. The software development methodology selected for this project was ICONIX, it makes heavy use of UML, which allow for a better design of the application. VIII UNIVERSIDAD NACIONAL DE LOJA Sistema Automático de Información Bibliográfica INDICE GENERAL CERTIFICACIÓN ..................................................................................................... II AUTORÍA ............................................................................................................... III AGRADECIMIENTO .............................................................................................. IV DEDICATORIA. ...................................................................................................... V CESIÓN DE DERECHOS ...................................................................................... VI RESUMEN ............................................................................................................ VII ABSTRACT ......................................................................................................... VIII INDICE GENERAL ...............................................................................................
Recommended publications
  • THE DEVELOPMENT of ACCENTED ENGLISH SYNTHETIC VOICES By
    THE DEVELOPMENT OF ACCENTED ENGLISH SYNTHETIC VOICES by PROMISE TSHEPISO MALATJI DISSERTATION Submitted in fulfilment of the requirements for the degree of MASTER OF SCIENCE in COMPUTER SCIENCE in the FACULTY OF SCIENCE AND AGRICULTURE (School of Mathematical and Computer Sciences) at the UNIVERSITY OF LIMPOPO SUPERVISOR: Mr MJD Manamela CO-SUPERVISOR: Dr TI Modipa 2019 DEDICATION In memory of my grandparents, Cecilia Khumalo and Alfred Mashele, who always believed in me! ii DECLARATION I declare that THE DEVELOPMENT OF ACCENTED ENGLISH SYNTHETIC VOICES is my own work and that all the sources that I have used or quoted have been indicated and acknowledged by means of complete references and that this work has not been submitted before for any other degree at any other institution. ______________________ ___________ Signature Date iii ACKNOWLEDGEMENTS I want to recognise the following people for their individual contributions to this dissertation: • My brother, Mr B.I. Khumalo and the whole family for the unconditional love, support and understanding. • A distinct thank you to both my supervisors, Mr M.J.D. Manamela and Dr T.I. Modipa, for their guidance, motivation, and support. • The Telkom Centre of Excellence for Speech Technology for providing the resources and support to make this study a success. • My colleagues in Department of Computer Science, Messrs V.R. Baloyi and L.M. Kola, for always motivating me. • A special thank you to Mr T.J. Sefara for taking his time to participate in the study. • The six Computer Science undergraduate students who sacrificed their precious time to participate in data collection.
    [Show full text]
  • Estudios De I+D+I
    ESTUDIOS DE I+D+I Número 51 Proyecto SIRAU. Servicio de gestión de información remota para las actividades de la vida diaria adaptable a usuario Autor/es: Catalá Mallofré, Andreu Filiación: Universidad Politécnica de Cataluña Contacto: Fecha: 2006 Para citar este documento: CATALÁ MALLOFRÉ, Andreu (Convocatoria 2006). “Proyecto SIRAU. Servicio de gestión de información remota para las actividades de la vida diaria adaptable a usuario”. Madrid. Estudios de I+D+I, nº 51. [Fecha de publicación: 03/05/2010]. <http://www.imsersomayores.csic.es/documentos/documentos/imserso-estudiosidi-51.pdf> Una iniciativa del IMSERSO y del CSIC © 2003 Portal Mayores http://www.imsersomayores.csic.es Resumen Este proyecto se enmarca dentro de una de las líneas de investigación del Centro de Estudios Tecnológicos para Personas con Dependencia (CETDP – UPC) de la Universidad Politécnica de Cataluña que se dedica a desarrollar soluciones tecnológicas para mejorar la calidad de vida de las personas con discapacidad. Se pretende aprovechar el gran avance que representan las nuevas tecnologías de identificación con radiofrecuencia (RFID), para su aplicación como sistema de apoyo a personas con déficit de distinta índole. En principio estaba pensado para personas con discapacidad visual, pero su uso es fácilmente extensible a personas con problemas de comprensión y memoria, o cualquier tipo de déficit cognitivo. La idea consiste en ofrecer la posibilidad de reconocer electrónicamente los objetos de la vida diaria, y que un sistema pueda presentar la información asociada mediante un canal verbal. Consta de un terminal portátil equipado con un trasmisor de corto alcance. Cuando el usuario acerca este terminal a un objeto o viceversa, lo identifica y ofrece información complementaria mediante un mensaje oral.
    [Show full text]
  • Text to Speech Synthesis for Bangla Language
    I.J. Information Engineering and Electronic Business, 2019, 2, 1-9 Published Online March 2019 in MECS (http://www.mecs-press.org/) DOI: 10.5815/ijieeb.2019.02.01 Text to Speech Synthesis for Bangla Language Khandaker Mamun Ahmed Department of Computer Science and Engineering, BRAC University, Dhaka, Bangladesh Email: [email protected] Prianka Mandal Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh Email: [email protected] B M Mainul Hossain Institute of Information Technology, University of Dhaka, Dhaka, Bangladesh Email: [email protected] Received: 13 September 2018; Accepted: 14 December 2018; Published: 08 March 2019 Abstract—Text-to-speech (TTS) synthesis is a rapidly A TTS system converts natural language text into growing field of research. Speech synthesis systems are speech and then, a computer system able to read text applicable to several areas such as robotics, education and aloud. A speech synthesizer converts written text to a embedded systems. The implementation of such TTS phonemic representation and then converts the phonemic system increases the correctness and efficiency of an representation to waveforms which can be output as application. Though Bangla is the seventh most spoken sound. language all over the world, uses of TTS system in There are several ways to create synthesized speech. applications are difficult to find for Bangla language Among them, concatenative synthesis and formant because of lacking simplicity and lightweightness in TTS synthesis are very popular. Concatenative synthesis is systems. Therefore, in this paper, we propose a simple based on concatenating pre-recorded speech of phonemes, and lightweight TTS system for Bangla language.
    [Show full text]
  • Design and Implementation of Text to Speech Conversion for Visually Impaired People
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Covenant University Repository International Journal of Applied Information Systems (IJAIS) – ISSN : 2249-0868 Foundation of Computer Science FCS, New York, USA Volume 7– No. 2, April 2014 – www.ijais.org Design and Implementation of Text To Speech Conversion for Visually Impaired People Itunuoluwa Isewon* Jelili Oyelade Olufunke Oladipupo Department of Computer and Department of Computer and Department of Computer and Information Sciences Information Sciences Information Sciences Covenant University Covenant University Covenant University PMB 1023, Ota, Nigeria PMB 1023, Ota, Nigeria PMB 1023, Ota, Nigeria * Corresponding Author ABSTRACT A Text-to-speech synthesizer is an application that converts text into spoken word, by analyzing and processing the text using Natural Language Processing (NLP) and then using Digital Signal Processing (DSP) technology to convert this processed text into synthesized speech representation of the text. Here, we developed a useful text-to-speech synthesizer in the form of a simple application that converts inputted text into synthesized speech and reads out to the user which can then be saved as an mp3.file. The development of a text to Figure 1: A simple but general functional diagram of a speech synthesizer will be of great help to people with visual TTS system. [2] impairment and make making through large volume of text easier. 2. OVERVIEW OF SPEECH SYNTHESIS Speech synthesis can be described as artificial production of Keywords human speech [3]. A computer system used for this purpose is Text-to-speech synthesis, Natural Language Processing, called a speech synthesizer, and can be implemented in Digital Signal Processing software or hardware.
    [Show full text]
  • Voice Synthesizer Application Android
    Voice synthesizer application android Continue The Download Now link sends you to the Windows Store, where you can continue the download process. You need to have an active Microsoft account to download the app. This download may not be available in some countries. Results 1 - 10 of 603 Prev 1 2 3 4 5 Next See also: Speech Synthesis Receming Device is an artificial production of human speech. The computer system used for this purpose is called a speech computer or speech synthesizer, and can be implemented in software or hardware. The text-to-speech system (TTS) converts the usual text of language into speech; other systems display symbolic linguistic representations, such as phonetic transcriptions in speech. Synthesized speech can be created by concatenating fragments of recorded speech that are stored in the database. Systems vary in size of stored speech blocks; The system that stores phones or diphones provides the greatest range of outputs, but may not have clarity. For specific domain use, storing whole words or suggestions allows for high-quality output. In addition, the synthesizer may include a vocal tract model and other characteristics of the human voice to create a fully synthetic voice output. The quality of the speech synthesizer is judged by its similarity to the human voice and its ability to be understood clearly. The clear text to speech program allows people with visual impairments or reading disabilities to listen to written words on their home computer. Many computer operating systems have included speech synthesizers since the early 1990s. A review of the typical TTS Automatic Announcement System synthetic voice announces the arriving train to Sweden.
    [Show full text]
  • Towards Expressive Speech Synthesis in English on a Robotic Platform
    PAGE 130 Towards Expressive Speech Synthesis in English on a Robotic Platform Sigrid Roehling, Bruce MacDonald, Catherine Watson Department of Electrical and Computer Engineering University of Auckland, New Zealand s.roehling, b.macdonald, [email protected] Abstract Affect influences speech, not only in the words we choose, but in the way we say them. This pa- per reviews the research on vocal correlates in the expression of affect and examines the ability of currently available major text-to-speech (TTS) systems to synthesize expressive speech for an emotional robot guide. Speech features discussed include pitch, duration, loudness, spectral structure, and voice quality. TTS systems are examined as to their ability to control the fea- tures needed for synthesizing expressive speech: pitch, duration, loudness, and voice quality. The OpenMARY system is recommended since it provides the highest amount of control over speech production as well as the ability to work with a sophisticated intonation model. Open- MARY is being actively developed, is supported on our current Linux platform, and provides timing information for talking heads such as our current robot face. 1. Introduction explicitly stated otherwise the research is concerned with the English language. Affect influences speech, not only in the words we choose, but in the way we say them. These vocal nonverbal cues are important in human speech as they communicate 2.1. Pitch information about the speaker’s state or attitude more effi- ciently than the verbal content (Eide, Aaron, Bakis, Hamza, Pitch contour seems to be one of the clearest indica- Picheny, and Pitrelli 2004).
    [Show full text]
  • Guía De Accesibilidad De Oracle Solaris 11 Desktop • Octubre De 2012 Contenido
    Guía de accesibilidad de Oracle® Solaris 11 Desktop Referencia: E36637–01 Octubre de 2012 Copyright © 2011, 2012, Oracle y/o sus filiales. Todos los derechos reservados. Este software y la documentación relacionada están sujetos a un contrato de licencia que incluye restricciones de uso y revelación, y se encuentran protegidos por la legislación sobre la propiedad intelectual. A menos que figure explícitamente en el contrato de licencia o esté permitido por la ley, no se podrá utilizar, copiar, reproducir, traducir, emitir, modificar, conceder licencias, transmitir, distribuir, exhibir, representar, publicar ni mostrar ninguna parte,de ninguna forma, por ningún medio. Queda prohibida la ingeniería inversa, desensamblaje o descompilación de este software, excepto en la medida en que sean necesarios para conseguir interoperabilidad según lo especificado por la legislación aplicable. La información contenida en este documento puede someterse a modificaciones sin previo aviso y no se garantiza que se encuentre exenta de errores. Sidetecta algún error, le agradeceremos que nos lo comunique por escrito. Si este software o la documentación relacionada se entrega al Gobierno de EE.UU. o a cualquier entidad que adquiera licencias en nombre del Gobierno de EE.UU. se aplicará la siguiente disposición: U.S. GOVERNMENT END USERS: Oracle programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, delivered to U.S. Government end users are "commercial computer software" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental regulations. As such, use, duplication, disclosure, modification, and adaptation of the programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, shall be subject to license terms and license restrictions applicable to the programs.
    [Show full text]
  • Instituto De Computação Universidade Estadual De Campinas
    INSTITUTO DE COMPUTAÇÃO UNIVERSIDADE ESTADUAL DE CAMPINAS A Systematic Literature Review on Awareness of Others in Accessible Collaborative RIAs Leonelo D. A. Almeida M. Cecília C. Baranauskas Technical Report - IC-12-26 - Relatório Técnico December - 2012 - Dezembro The contents of this report are the sole responsibility of the authors. O conteúdo do presente relatório é de única responsabilidade dos autores. A Systematic Literature Review on Awareness of Others in Accessible Collaborative RIAs Leonelo D. A. Almeida, M. Cecília C. Baranauskas Institute of Computing, University of Campinas (UNICAMP) Albert Einstein Av., 1251, 13083-970, Campinas-SP, Brazil {leonelo.almeida, cecilia}@ic.unicamp.br Abstract. The more robust and dynamic aspects of Web 2.0 applications (also named Rich Internet Applications, RIAs) stimulate the participation and collaboration among people while interacting with such shared interaction spaces. An evident consequence (e.g. Facebook, Instagran, and Twitter) is the increasing influence of RIAs on other media channels as TV and newspapers. However, the current state-of-art of Web 2.0 does not provide equitative opportunities of interaction for people. Accessibility in RIAs is still a challenging objective. Also, for aspects as awareness of others on RIAs that provided collaboration features the development of accessible mechanisms is not restricted to semantic markup but it also involves data structures, politeness, load of data, and other characteristics. This technical report presents a Systematic Literature Review process designed for investigating the aspect of awareness of others in accessible collaborative RIAs; it also reports included and excluded studies and the data collected from the reviewed studies. Keywords: Web 2.0, Accessibility, Systematic Literature Review.
    [Show full text]
  • Virtual Storytelling: Emotions for the Narrator
    Virtual Storytelling: Emotions for the narrator Master's thesis, August 2007 H.A. Buurman Committee Faculty of Human Media Interaction, dr. M. Theune Department of Electrical Engineering, dr. ir. H.J.A. op den Akker Mathematics & Computer Science, dr. R.J.F. Ordelman University of Twente ii iii Preface During my time here as a student in the Computer Science department, the ¯eld of language appealed to me more than all the other ¯elds available. For my internship, I worked on an as- signment involving speech recognition, so when a graduation project concerning speech generation became available I applied for it. This seemed like a nice complementation of my experience with speech recognition. Though the project took me in previously unknown ¯elds of psychol- ogy, linguistics, and (unfortunately) statistics, I feel a lot more at home working with- and on Text-to-Speech applications. Along the way, lots of people supported me, motivated me and helped me by participating in experiments. I would like to thank these people, starting with MariÄetTheune, who kept providing me with constructive feedback and never stopped motivating me. Also I would like to thank the rest of my graduation committee: Rieks op den Akker and Roeland Ordelman, who despite their busy schedule found time now and then to provide alternative insights and support. My family also deserves my gratitude for their continued motivation and interest. Lastly I would like to thank all the people who helped me with the experiments. Without you, this would have been a lot more di±cult. Herbert Buurman iv Samenvatting De ontwikkeling van het virtueel vertellen van verhalen staat nooit stil.
    [Show full text]
  • Gnuspeech Tract Manual 0.9
    Gnuspeech TRAcT Manual 0.9 TRAcT: the Gnuspeech Tube Resonance Access Tool: a means of investigating and understanding the basic Gnuspeech vocal tract model David R. Hill, University of Calgary TRAcT and the \tube" model to which it allows access was originally known as \Synthesizer" and developed by Leonard Manzara on the NeXT computer. The name has been changed because \Synthesiser" might be taken to imply it synthesises speech, which it doesn't. It is a \low-level" sound synthesiser that models the human vocal tract, requiring the \high-level" synthesiser control inherent in Gnuspeech/Monet to speak. TRAcT provides direct interactive access to the human vocal tract model parameters and shapes as a means of exploring its sound generation capabilities, as needed for speech database creation. i (GnuSpeech TRAcT Manual Version 0.9) David R. Hill, PEng, FBCS ([email protected] or drh@firethorne.com) © 2004, 2015 David R. Hill. All rights reserved. This document is publicly available under the terms of a Free Software Foundation \Free Doc- umentation Licence" See see http://www.gnu.org/copyleft/fdl.html for the licence terms. This page and the acknowledgements section are the invariant sections. ii SUMMARY The \Tube Resonance Model" (TRM, or \tube", or \waveguide", or transmission-line) forms the acoustic basis of the Gnupeech articulatory text-to-speech system and provides a linguistics research tool. It emulates the human vocal apparatus and allows \postures" to be imposed on vocal tract model, and energy to be injected representing voicing, whispering, sibilant noises and \breathy" noise. The \Distinctive Region Model" (DRM) control system that allows simple specification of the postures, and accessed by TRAcT 1is based on research by CGM Fant and his colleagues at the Stockholm Royal Institute of Technology Speech Technology Laboratory (Fant & Pauli 1974), by Ren´eCarr´eand his colleagues at T´el´ecomParis (Carr´e,Chennoukh & Mrayati 1992), and further developed by our original research team at Trillium Sound Research and the U of Calgary).
    [Show full text]
  • Generating the Voice of the Interactive Virtual Assistant Adriana Stan and Beáta Lőrincz
    Chapter Generating the Voice of the Interactive Virtual Assistant Adriana Stan and Beáta Lőrincz Abstract This chapter introduces an overview of the current approaches for generating spoken content using text-to-speech synthesis (TTS) systems, and thus the voice of an Interactive Virtual Assistant (IVA). The overview builds upon the issues which make spoken content generation a non-trivial task, and introduces the two main components of a TTS system: text processing and acoustic modelling. It then focuses on providing the reader with the minimally required scientific details of the terminology and methods involved in speech synthesis, yet with sufficient knowl- edge so as to be able to make the initial decisions regarding the choice of technology for the vocal identity of the IVA. The speech synthesis methodologies’ description begins with the basic, easy to run, low-requirement rule-based synthesis, and ends up within the state-of-the-art deep learning landscape. To bring this extremely complex and extensive research field closer to commercial deployment, an exten- sive indexing of the readily and freely available resources and tools required to build a TTS system is provided. Quality evaluation methods and open research problems are, as well, highlighted at end of the chapter. Keywords: text-to-speech synthesis, text processing, deep learning, interactive virtual assistant 1. Introduction Generating the voice of an interactive virtual assistant (IVA) is performed by the so called text-to-speech synthesis (TTS) systems. A TTS system takes raw text as input and converts it into an acoustic signal or waveform, through a series of intermediate steps.
    [Show full text]
  • Appendix VI Samples of Final Year Projects with Marking Sheets
    University of St Andrews School of Computer Science Appendix VI Samples of Final Year Projects with Marking Sheets Automated Class Questionnaires – Acquire Author: Gareth Edwards University of St Andrews 24th April 2003 1 Abstract This document discusses the Acquire system designed to facilitate on-line submissions of module reviews by students enrolled at the University of St Andrews; specifically the members of the School of Computer Science. The current system requires students to fill in and submit paper based forms which are read by an optical system. This system has proved unreliable so the purpose of this project was to create a new computerised system prototype to investigate whether replacing the existing system with a web-based form submission system was viable and to discover any advantages and disadvantages of such a system. Declaration I declare that the material submitted for assessment is my own work except where credit is explicitly given to others by citation or acknowledgement. This work was performed during the current academic year except where otherwise stated. The main text of this project report is 14004 words long, including project specification and plan. In submitting this project report to the University of St Andrews, I give permission for it to be made available for use in accordance with the regulations of the University Library. I also give permission for the title and abstract to be published and for copies of the report to made and supplied at cost to any bona fide library or research worker, and to be made available on the World Wide Web.
    [Show full text]