A Snack Implementation and Tcl/Tk Interface to the Fundamental Frequency Variation Spectrum Algorithm
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
A Snack Implementation and Tcl/Tk Interface to the Fundamental Frequency Variation Spectrum Algorithm
A Snack Implementation and Tcl/Tk Interface to the Fundamental Frequency Variation Spectrum Algorithm Kornel Laskowski 1, Jens Edlund 2 1 Language Technologies Institute, Carnegie Mellon University, Pittsburgh PA, USA 2 KTH Speech, Music and Hearing, Stockholm, Sweden [email protected], [email protected] Abstract Intonation is an important aspect of vocal production, used for a variety of communicative needs. Its modeling is therefore crucial in many speech understanding systems, particularly those requiring inference of speaker intent in real-time. However, the estimation of pitch, traditionally the first step in intonation modeling, is computationally inconvenient in such scenarios. This is because it is often, and most optimally, achieved only after speech segmentation and recognition. A consequence is that earlier speech processing components, in today’s state-of-the-art systems, lack intonation awareness by fiat; it is not known to what extent this circumscribes their performance. In the current work, we present a freely available implementation of an alternative to pitch estimation, namely the computation of the fundamental frequency variation (FFV) spectrum, which can be easily employed at any level within a speech processing system. It is our hope that the implementation we describe aid in the understanding of this novel acoustic feature space, and that it facilitate its inclusion, as desired, in the front-end routines of speech recognition, dialog act recognition, and speaker recognition systems. 1. Introduction presenting a steep learning curve for researchers who may wish to use it in their work. Intonation is an important aspect of vocal production, used for a variety of communicative needs. -
A Speech Annotation and Transcription Tool
XTrans: a speech annotation and transcription tool Meghan Lammie Glenn, Stephanie M. Strassel, Haejoong Lee Linguistic Data Consortium, University of Pennsylvania, Philadelphia, Pennsylvania {mlglenn,strassel,haejoong}@ldc.upenn.edu Abstract 2. Motivation We present XTrans, a multi-platform, multilingual, multi- Since its first transcription project in 1995 LDC has utilized a channel transcription application designed and developed by number of different annotation tools and approaches. Early on Linguistic Data Consortium. XTrans provides new and LDC developed customized user interfaces built around efficient solutions to many common challenges encountered existing commercial software like Entropic's ESPS/waves+ during the manual transcription process of a wide variety of and emacs. Over time user-friendly multi-featured tools like audio genres, such as supporting multiple audio channels in a Transcriber [1] became more widely available and were meeting recording or right-to-left text directionality for incorporated into LDC's annotation pipeline. To meet data and languages like Arabic. To facilitate accurate transcription, annotation needs for specific evaluation programs, LDC also XTrans incorporates a number of quality control functions, developed customized transcription tools within the and provides a user-friendly mechanism for transcribing Annotation Graph Toolkit (AGTK) framework [2], [3] overlapping speech. This paper will describe the motivation to including TableTrans, a spreadsheet-style audio annotation develop a new transcription tool, and will give an overview of tool, and MultiTrans, a multi-channel transcribing tool [4]. XTrans functionality. The EARS and GALE programs introduced a new Index Terms: linguistic resources, speech transcription, constellation of challenges: manual transcription of thousands transcription tool, speech annotation, linguistic annotation of hours of broadcast and telephone speech, in multiple languages, produced on a very tight schedule. -
2 Game Programming with Python, Lua, and Ruby by Tom Gutschmidt
Game Programming with Python, Lua, and Ruby By Tom Gutschmidt Publisher: Premier Press Pub Date: 2003 ISBN: 1-59200-079-7 Pages: 472 LRN Dedication Acknowledgments About the Author Letter from the Series Editor Introduction What's in This Book? Why Learn Another Language? What's on the CD-ROM? Part ONE: Introducing High-Level Languages Chapter 1. High-Level Language Overview High-Level Language Roots How Programming Languages Work Low-Level Languages Today's High-Level Languages The Pros of High-Level Languages Cons of High-Level Languages A Brief History of Structured Programming Introducing Python Introducing Lua Introducing Ruby Summary Questions and Answers Exercises Chapter 2. Python, Lua, and Ruby Language Features Syntactical Similarities of Python, Lua, and Ruby Hello World Samples Summary Questions and Answers Exercises Part TWO: Programming with Python Chapter 3. Getting Started with Python Python Executables Python Debuggers Python Language Structure Creating a Simple User Interface in Python A Simple GUI with Tkinter Memory, Performance, and Speed Summary Questions and Answers 2 Exercises Chapter 4. Getting Specific with Python Games The Pygame Library Python Graphics Sound in Python Networking in Python Putting It All Together Summary Questions and Answers Exercises Chapter 5. The Python Game Community Engines Graphics Commercial Games Beyond Python Summary Question and Answer Exercises Part THREE: Programming with Lua Chapter 6. Programming with Lua Lua Executables and Debuggers Language Structure Memory, Performance, and Speed Summary Questions and Answers Exercises Chapter 7. Getting Specific with Games in Lua LuaSDL Gravity: A Lua SDL Game The Lua C API Summary Questions and Answers Exercises Chapter 8. -
Survey on Swedish Language Resources
Survey on Swedish Language Resources Kjell Elenius1 Eva Forsbom2 Beáta Megyesi2 1Speech, Music and Hearing School of Computer Science and Communication, KTH Sweden 2Department of Linguistics and Philology Uppsala University Sweden 2008-03-14 Survey on Swedish Language Resources ii Elenius, Forsbom and Megyesi Abstract Language resources, such as lexicons, databases, dictionaries, corpora, and tools to create and process these resources are necessary components in human language technology and natural language applications. In this survey, we describe the inventory process and the results of existing language resources for Swedish, and the need for Swedish language resources to be used in research and real-world applications in language technology as well as in linguistic research. The survey is based on an investigation sent to industry and academia, institutions and organizations, to experts involved in the development of Swedish language resources in Sweden, the Nordic countries and world-wide. This study is a result of the project called “An Infrastructure for Swedish language technology” supported by the Swedish Research Council´s Committee for Research Infrastructures 2007 - 2008. iii Survey on Swedish Language Resources Table of Contents 1. Introduction............................................................................................................................................1 2. Method ...................................................................................................................................................1