A Speech Annotation and Transcription Tool
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
A Snack Implementation and Tcl/Tk Interface to the Fundamental Frequency Variation Spectrum Algorithm
A Snack Implementation and Tcl/Tk Interface to the Fundamental Frequency Variation Spectrum Algorithm Kornel Laskowski 1, Jens Edlund 2 1 Language Technologies Institute, Carnegie Mellon University, Pittsburgh PA, USA 2 KTH Speech, Music and Hearing, Stockholm, Sweden [email protected], [email protected] Abstract Intonation is an important aspect of vocal production, used for a variety of communicative needs. Its modeling is therefore crucial in many speech understanding systems, particularly those requiring inference of speaker intent in real-time. However, the estimation of pitch, traditionally the first step in intonation modeling, is computationally inconvenient in such scenarios. This is because it is often, and most optimally, achieved only after speech segmentation and recognition. A consequence is that earlier speech processing components, in today’s state-of-the-art systems, lack intonation awareness by fiat; it is not known to what extent this circumscribes their performance. In the current work, we present a freely available implementation of an alternative to pitch estimation, namely the computation of the fundamental frequency variation (FFV) spectrum, which can be easily employed at any level within a speech processing system. It is our hope that the implementation we describe aid in the understanding of this novel acoustic feature space, and that it facilitate its inclusion, as desired, in the front-end routines of speech recognition, dialog act recognition, and speaker recognition systems. 1. Introduction presenting a steep learning curve for researchers who may wish to use it in their work. Intonation is an important aspect of vocal production, used for a variety of communicative needs. -
2 Game Programming with Python, Lua, and Ruby by Tom Gutschmidt
Game Programming with Python, Lua, and Ruby By Tom Gutschmidt Publisher: Premier Press Pub Date: 2003 ISBN: 1-59200-079-7 Pages: 472 LRN Dedication Acknowledgments About the Author Letter from the Series Editor Introduction What's in This Book? Why Learn Another Language? What's on the CD-ROM? Part ONE: Introducing High-Level Languages Chapter 1. High-Level Language Overview High-Level Language Roots How Programming Languages Work Low-Level Languages Today's High-Level Languages The Pros of High-Level Languages Cons of High-Level Languages A Brief History of Structured Programming Introducing Python Introducing Lua Introducing Ruby Summary Questions and Answers Exercises Chapter 2. Python, Lua, and Ruby Language Features Syntactical Similarities of Python, Lua, and Ruby Hello World Samples Summary Questions and Answers Exercises Part TWO: Programming with Python Chapter 3. Getting Started with Python Python Executables Python Debuggers Python Language Structure Creating a Simple User Interface in Python A Simple GUI with Tkinter Memory, Performance, and Speed Summary Questions and Answers 2 Exercises Chapter 4. Getting Specific with Python Games The Pygame Library Python Graphics Sound in Python Networking in Python Putting It All Together Summary Questions and Answers Exercises Chapter 5. The Python Game Community Engines Graphics Commercial Games Beyond Python Summary Question and Answer Exercises Part THREE: Programming with Lua Chapter 6. Programming with Lua Lua Executables and Debuggers Language Structure Memory, Performance, and Speed Summary Questions and Answers Exercises Chapter 7. Getting Specific with Games in Lua LuaSDL Gravity: A Lua SDL Game The Lua C API Summary Questions and Answers Exercises Chapter 8. -
A Snack Implementation and Tcl/Tk Interface to the Fundamental Frequency Variation Spectrum Algorithm
A Snack Implementation and Tcl/Tk Interface to the Fundamental Frequency Variation Spectrum Algorithm Kornel Laskowski 1, Jens Edlund 2 1 Language Technologies Institute, Carnegie Mellon University, Pittsburgh PA, USA 2 KTH Speech, Music and Hearing, Stockholm, Sweden [email protected], [email protected] Abstract Intonation is an important aspect of vocal production, used for a variety of communicative needs. Its modeling is therefore crucial in many speech understanding systems, particularly those requiring inference of speaker intent in real-time. However, the estimation of pitch, traditionally the first step in intonation modeling, is computationally inconvenient in such scenarios. This is because it is often, and most optimally, achieved only after speech segmentation and recognition. A consequence is that earlier speech processing components, in today’s state-of-the-art systems, lack intonation awareness by fiat; it is not known to what extent this circumscribes their performance. In the current work, we present a freely available implementation of an alternative to pitch estimation, namely the computation of the fundamental frequency variation (FFV) spectrum, which can be easily employed at any level within a speech processing system. It is our hope that the implementation we describe aid in the understanding of this novel acoustic feature space, and that it facilitate its inclusion, as desired, in the front-end routines of speech recognition, dialog act recognition, and speaker recognition systems. 1. Introduction presenting a steep learning curve for researchers who may wish to use it in their work. Intonation is an important aspect of vocal production, used for a variety of communicative needs. -
Survey on Swedish Language Resources
Survey on Swedish Language Resources Kjell Elenius1 Eva Forsbom2 Beáta Megyesi2 1Speech, Music and Hearing School of Computer Science and Communication, KTH Sweden 2Department of Linguistics and Philology Uppsala University Sweden 2008-03-14 Survey on Swedish Language Resources ii Elenius, Forsbom and Megyesi Abstract Language resources, such as lexicons, databases, dictionaries, corpora, and tools to create and process these resources are necessary components in human language technology and natural language applications. In this survey, we describe the inventory process and the results of existing language resources for Swedish, and the need for Swedish language resources to be used in research and real-world applications in language technology as well as in linguistic research. The survey is based on an investigation sent to industry and academia, institutions and organizations, to experts involved in the development of Swedish language resources in Sweden, the Nordic countries and world-wide. This study is a result of the project called “An Infrastructure for Swedish language technology” supported by the Swedish Research Council´s Committee for Research Infrastructures 2007 - 2008. iii Survey on Swedish Language Resources Table of Contents 1. Introduction............................................................................................................................................1 2. Method ...................................................................................................................................................1