Making a World of Difference
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Semi-Continuous Hidden Markov Models for Speech Recognition
SEMI-CONTINUOUS HIDDEN MARKOV MODELS FOR SPEECH RECOGNITION Xuedong Huang Thesis submitted for the degree of Doctor of Philosophy University of Edinburgh 1989 DECLARATION This thesis, unless explicitly noted, was composed entirely by myself, and reports original work of my own execution. Xuedong Huang', B.Sc.' M)$c. 2 B.Sc. in Computer Science, Hunan University, 1982 2 M.Sc. in Computer Science and Technology, Tsinghua University, 1984 -1- ABSTRACT Hidden Markov models, which can be based on either discrete output probability distributions or continuous mixture output density functions, have been demonstrated as one of the most powerful statistical tools available for automatic speech recognition. In this thesis, a semi-continuous hidden Markov model, which is a very general model including both discrete and continuous mixture hidden Markov models as its special forms, is proposed. It is a model in which vector quantisation, the discrete hidden Markov model, and the continuous mixture hidden Markov model are unified. Based on the assumption that each vector quantisation codeword can be represented by a continuous probability density function, the semi-continuous output probability is then a combination of discrete model-dependent weighting coefficients with these continuous codebook probability density functions. In comparison to the conventional continuous mixture hidden Markov model, the semi-continuous hidden Markov model can maintain the modelling ability of large-mixture probability density functions. In addition, the number of free parameters and the computational complexity can be reduced because all of the probability density functions are tied together in the codebook. The semi- continuous hidden Markov model thus provides a good solution to the conflict between detailed acoustic modelling and insufficient training data. -
The Barbara Johnson Reader a John Hope Franklin Center Book the Barbara Johnson Reader the Surprise of Otherness
The Barbara Johnson Reader A John Hope Franklin Center Book The Barbara Johnson Reader The Surprise of Otherness Barbara Johnson edited by melissa feuerstein bill johnson gonzález lili porten keja valens With an Introduction by judith butler and an Afterword by shoshana felman Duke University Press Durham and London 2014 © 2014 Duke University Press Afterword © 2014 Shoshana Felman All rights reserved Printed in the United States of America on acid- free paper ∞ Designed by April Leidig Typeset in Minion Pro by Westchester Publishing Services Library of Congress Cataloging-in-Publication Data The Barbara Johnson reader : the surprise of otherness / edited by Melissa Feuerstein, Bill Johnson Gonzalez, Lili Porten, and Keja Valens, with an introduction by Judith Butler and an afterword by Shoshana Felman. pages cm “A John Hope Franklin Center Book.” Includes bibliographical references and index. isbn 978-0-8223-5419-2 (pbk : alk. paper) isbn 978-0-8223-5403-1 (cloth : alk. paper) 1. Johnson, Barbara, 1947–2009. 2. Feminist literary criticism. I. Feuerstein, Melissa. II. Johnson Gonzalez, Bill, 1970– iii. Porten, Lili. IV. Valens, Keja, 1972– pn98.w64b37 2014 801.'95092—dc23 2013045003 Contents Ac know ledg ments vii Editors’ Preface xi Personhood and Other Objects: The Figural Dispute with Philosophy by Judith Butler xvii Barbara Johnson by Barbara Johnson xxvii part i | Reading Theory as Literature, Literature as Theory 1 The Critical Diff erence: BartheS/BalZac 3 2 Translator’s Introduction to Dissemination (abridged) 14 3 Poetry and Syntax: -
Weekender July-2021 Web.Pdf
JUL-AUG 2021 FULL SPEED AHEAD SUMMER SIPPERS TIME IS RUNNING OUT Takuya Haneda hopes to paddle his New and true Japanese liqueurs that Japan’s slapdash love affair with SDGs canoe to gold at Tokyo 2020 will hold the Tokyo heat at bay is anything but sustainable — 2 — Weekender_August_Master_002.indd 2 2018/07/24 11:41 CONTENTS RADAR IN-DEPTH TRAVEL THIS MONTH’S HEAD TURNERS COFFEE-BREAK READS WHERE TO GO 8 AREA GUIDE: TOKYO’S LUMBERYARD 20 BALLET IN THE TIME OF CORONA 46 SHONAI SHRINE Once Tokyo’s center for all things wood, How National Ballet of Japan’s new artistic Embark on small adventures on the grounds Shin-Kiba now brightens up at night as a director Miyako Yoshida stays on her toes. of a former Yamagata castle. popular entertainment hub. 23 KYOTO IN SYMPHONY 50 SPORTS PARADISE 10 STYLE: ODE TO THE BIRDS Composer Marios Joannou Elia orchestrates a Traditional and modern sports have deep Photographer and filmmaker Boa Campbell masterpiece of sound dedicated to Kyoto. roots in Kyushu’s Miyazaki Prefecture. takes us on a flight through Tokyo’s world of modern fashion. 26 A DAY IN THE LIFE OF A SHAMISEN 52 KYUSHU BY TRAIN MAKER Escape to these iconic locales with 16 SHOP JAPAN: BEST OF THE BEACH The designer of Tokyo 2020’s official shamisen assistance from the JR Kyushu Rail Pass. These Japan-made summer accessories spreads the word about the native instrument. help anyone stay cool and look stylish on their next seaside excursion. 30 ON IMPULSE GUIDE Special effects makeup artist extraordinnaire CULTURE ROUNDUP 18 TRENDS: 5 JAPANESE LIQUEURS Amazing Jiro takes TW inside his twisted world. -
Acoustics, Speech, and Signal Processing, 1995
MICROSOFT WINDOWS HIGHLY INTELLIGENT SPEECH RECOGNIZER: WHISPER Xuedong Huang, Alex Acero, Fil Alleva, Mei-Yuh Hwang, Li Jiang and Milind Mahajan Microsoft Corporation One Microsoft Way Redmond, WA 98052, USA ABSTRACT independent continuous speech recognition for typical Windows command and control applications. In this paper, we will Since January 1993, we have been working to refine and extend selectively describe several strategies we used in Whis,per to tackle Sphinx-I1 technologies in order to develop practical speech efficiency and usability problems for command and control recognition at Microsoft. The result of that work has been the applications. Whisper (Windows Highly Intelligent Speech Recognizer). Whisper represents significantly improved recognition efficiency, usability, and accuracy, when compared with the Sphinx-I1 2. EFFICIENCY ISSUES system. In addition Whisper offers speech input capabilities for Microsoft Windows@ and can be scaled to meet different PC We have dramatically improved Whisper’s computational and platform configurations. It provides features such as continuous memory requirements. In comparison with Sphinx-11 (under the speech recognition, speaker-independence, on-line adaptation, same accuracy constraints), the necessary RAM was ireduced by a noise robustness, dynamic vocabularies and grammars. For typical factor of 20, and the speed was improved by a factor of 5. Windows@ Command-and-Control applications (less than 1,000 Efficiency issues are largely related to the size of models and search architecture [3], which is closely related to data structures words), Whisper provides a software only solution on PCs and algorithm design as well as appropriate acoustic and language equipped with a 486DX, 4MB of memory, and a standard sound modeling technologies. -
Cyborg Art: an Explorative and Critical Inquiry Into Corporeal Human-Technology Convergence
http://waikato.researchgateway.ac.nz/ Research Commons at the University of Waikato Copyright Statement: The digital copy of this thesis is protected by the Copyright Act 1994 (New Zealand). The thesis may be consulted by you, provided you comply with the provisions of the Act and the following conditions of use: Any use you make of these documents or images must be for research or private study purposes only, and you may not make them available to any other person. Authors control the copyright of their thesis. You will recognise the author’s right to be identified as the author of the thesis, and due acknowledgement will be made to the author where appropriate. You will obtain the author’s permission before publishing any material from the thesis. Cyborg Art: An Explorative and Critical Inquiry into Corporeal Human-Technology Convergence A thesis submitted in partial fulfilment of the requirements for the degree of Doctor of Philosophy at the University of Waikato, by Elizabeth Margaretha Borst Faculty of Arts and Social Sciences © Elizabeth M. Borst, 2009 All rights reserved. This work may not be reproduced without permission of the author. University of Waikato, 2009 i ii Abstract This thesis introduces and examines the undervalued concept of corporeal human- technology interface art, or ‘cyborg art’, which describes literal, figural and metaphorical representations of increasing body and technology integration. The transforming (post)human being is therefore the focus; who we are today, and who or what we may become as humanity increasingly interfaces with technology. Theoretical analysis of cyborg imagery centres on the science fiction domain, in particular film and television, as opposed to art. -
EFFICIENT CEPSTRAL NORMALIZATION for ROBUST SPEECH RECOGNITION Fu-Hua Liu, Richard M
EFFICIENT CEPSTRAL NORMALIZATION FOR ROBUST SPEECH RECOGNITION Fu-Hua Liu, Richard M. Stern, Xuedong Huang, Alejandro Acero Department of Electrical and Computer Engineering School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213 ABSTRACT algorithm, FCDCN, which provides an additive environ- mental compensation to cepstral vectors, but in an environ- In this paper we describe and compare the performance of a series ment-specific fashion [5]. MFCDCN is less of cepstrum-based procedures that enable the CMU SPHINX-II computationally complex than the earlier CDCN algorithm, speech recognition system to maintain a high level of recognition and more accurate than the related SDCN and BSDCN accuracy over a wide variety of acoustical environments. We algorithms [6], and it does not require domain-specific describe the MFCDCN algorithm, an environment-independent training to new acoustical environments. In this paper we extension of the efficient SDCN and FCDCN algorithms devel- describe the performance of MFCDCN and related algo- oped previously. We compare the performance of these algorithms rithms, and we compare it to the popular RASTA approach with the very simple RASTA and cepstral mean normalization to robustness. procedures, describing the performance of these algorithms in the context of the 1992 DARPA CSR evaluation using secondary microphones, and in the DARPA stress-test evaluation. 2. EFFICIENT CEPSTRUM-BASED COMPENSATION TECHNIQUES 1. INTRODUCTION In this section we describe several of the cepstral normal- ization techniques we have developed to compensate The need for speech recognition systems and spoken lan- simultaneously for additive noise and linear filtering. Most guage systems to be robust with respect to their acoustical of these algorithms are completely data-driven, as the com- environment has become more widely appreciated in recent pensation parameters are determined by comparisons years (e.g. -
Media Translations of Fantasy Worlds
A World of Difference: Media Translations of Fantasy Worlds Rachel Louise Mizsei Ward Doctor of Philosophy University of East Anglia School of Film, Television and Media Studies May 2013 This copy of the thesis has been supplied on condition that anyone who consults it is understood to recognise that its copyright rests with the author and that use of any information derived there-from must be in accordance with current UK Copyright Law. In addition, any quotation or extract must include full attribution. Rachel Mizsei Ward University of East Anglia Abstract The modern consumer has access to a massively complex entertainment world. Many of the products available reveal a visible movement of popular fantasy worlds between different media. This transmedia process creates a strong link between film, merchandising and games; with all of these mediums borrowing from each other. This borrowing takes various forms, from licensed adaptations to unofficial copying of ideas, settings and characters as well as exploiting the different aesthetics and techniques of different media. Much of the scholarship on transmedia concentrates on storytelling, where a single overarching narrative unfolds over several different media. This thesis will move away from storytelling to consider how culture producers borrow the aesthetics, narratives and fantasy worlds from other sources, including computer games. This borrowing happens because it enables them to use transmedia functionality to gain market share from an already established audience who have a vested interested in, and enthusiasm for, an established world. Most of this borrowing happens around specific genres – especially fantasy, science fiction and horror. These genres are particularly wide-ranging and emphasise the possibilities of worldbuilding, making then good sources for multi-media franchises. -
Merry Christmas from Our Family to Yours
The Westfield— Serving the Town Since 1890 — Leader USPS 680020 Published OUR 109th YEAR – ISSUE NO. 51-99 Periodical – Postage Paid at Westfield, N.J. Thursday, December 23, 1999 Every Thursday 232-4407 FIFTY CENTS Menorah Near Train Station Draws Citizen, Council Debate By PAUL J. PEYTON elected officials should have in place policy.” Specially Written for The Westfield Leader when considering the largesse of our Mayor Jardim read a prepared let- The recent placement of a menorah community.” ter which he had written but not sent on municipal property near the town’s He was critical of Mayor Jardim’s to Mr. Hertell in response to the Christmas tree, in front of the north decision to accept the menorah and former school board member’s com- side train station, has prompted con- have Department of Public Works ments. cerns from a former elected official staff install it “without a courtesy He said Mr. Hertell had a “pro- in town and an angered response consultation” with members of the found misunderstanding” of his role from some members of the Town council. has mayor. Mayor Jardim said he has Council. He said this action “sets a dubious the authority to respond to requests Keith S. Hertell, who served on the and dangerous precedent, which may by residents and to direct town staff Westfield Board of Education from or may not reflect the best interests of in this regard. 1995 to 1998, publicly read a letter the citizens of Westfield.” Mayor Jardim stated that the coun- he sent to Mayor Thomas C. -
Subphonetic Modeling for Speech Recognition Mei-Yuh Hwang Xuedong Huang
Subphonetic Modeling for Speech Recognition Mei-Yuh Hwang Xuedong Huang School of Computer Science Carnegie Mellon University Pittsburgh, Pennsylvania 15213 ABSTRACT Each clustered output distribution is denoted as a senone. In How to capture important acoustic clues and estimate essen- this way, senones can be shared across different models as tial parameters reliably is one of the central issues in speech illustrated in Figure 1. recognition, since we will never have sufficient training data to model various acoustic-phonetic phenomena. Success- ful examples include subword models with many smoothing techniques. In comparison with subword models, subphonetic modeling may provide a finer level of details. We propose to model subphonetic events with Markov states and treat the state in phonetic hidden Markov models as our basic sub- phonetic unit -- senone. A word model is a concatenation of state-dependent senones and senones can be shared across different word models. Senones not only allow parameter sharing, but also enable pronunciation optimization and new word learning, where the phonetic baseform is replaced by the senonic baseform. In this paper, we report preliminary subphonetic modeling results, which not only significantly re- duced the word error rate for speaker-independent continuous speech recognition but also demonstrated a novel application senone codebook for new word learning. Figure 1: Creation of the senone codebook. 1 INTRODUCTION For large-vocabulary speech recognition, we will never have The advantages of senones include better parameter sharing sufficient training data to model all the various acoustic- and improved pronunciation optimization. After clustering, phonetic phenomena. How to capture important acoustic different states in different models may share the same senone clues and estimate essential parameters reliably is one of if they exhibit acoustic similarity. -
Challenges in Adopting Speech Recognition
! BY Li Deng and Xuedong Huang Although progress has been impressive, there are still several hurdles that speech recognition technology must clear before ubiquitous adoption can be realized. R&D in spontaneous and free-flowing speech style is critical to its success. CHALLENGES IN ADOPTING Speech Recognition n recent years, significant progress has been made in advancing speech recognition technology, making speech an effective modality in both telephony and multimodal human-machine interaction. Speech recognition systems have been built and deployed for numerous applications. The technology is not only improving at a steady pace, but is also becoming increasingly usable and useful. However, Ispeech recognition technology has not been widely accepted in our society. The current use of speech recognition by enterprises and consumers reflects only a tip of the iceberg of the full power that the technology could potentially offer [3]. To realize such a potential, the industry has yet to bridge the gap between what people want from speech recognition, typically in a multimodal environment, and what the technology can deliver. To make the mainstream use of speech recognition a reality, the industry must deliver robust and high-recognition accuracy close to human-like COMMUNICATIONS OF THE ACM January 2004/Vol. 47, No. 1 69 performance. To this end, the industry and research communities must collectively address and overcome both technical and business challenges to uncovering the full potential of speech technology in multimodal and intelligent human-machine communication. Speech recognition technology has made dramatic progress over the past 30 years, triggered by advances in computing devices and architectures, success of algo- rithm development, and availability of large quantities of data. -
Japan and the Indigenous People of Karafuto
ON THE FRONTIERS OF HISTORY RETHINKING EAST ASIAN BORDERS ON THE FRONTIERS OF HISTORY RETHINKING EAST ASIAN BORDERS TESSA MORRIS-SUZUKI GLOBAL THINKERS SERIES Published by ANU Press The Australian National University Acton ACT 2601, Australia Email: [email protected] Available to download for free at press.anu.edu.au ISBN (print): 9781760463694 ISBN (online): 9781760463700 WorldCat (print): 1182556687 WorldCat (online): 1182556433 DOI: 10.22459/OFH.2020 This title is published under a Creative Commons Attribution-NonCommercial- NoDerivatives 4.0 International (CC BY-NC-ND 4.0). The full licence terms are available at creativecommons.org/licenses/by-nc-nd/4.0/legalcode Cover design and layout by ANU Press. Cover image from Mamiya Rinzo’s ‘Tōdatsu Kikō’. This edition © 2020 ANU Press CONTENTS List of Illustrations . vii Introduction . 1 1 . Anti-Area Studies Revisited . 7 2 . Mapping Time and Space . 25 3 . ‘Tartary’ in the Reshaping of Historical Thought . 47 4 . Unthinking Civilisation: An Imbricated History of the Okhotsk Region . 71 5 . The Telescope and the Tinderbox: Rediscovering La Pérouse in the North Pacific . 107 6 . Lines in the Snow: The Making of the Russo–Japanese Frontier . 139 7 . Indigeneity and Modernity in Colonial Karafuto . 165 8 . Japan and its Region: From Tartary to the Emergence of the New Area Studies . 195 Concluding Thoughts: On the Value of Small Histories . 233 LIST OF ILLUSTRATIONS Map 1.1. The Okhotsk region ...............................6 Figure 2.1. Japanese map and accompanying image of peoples of the world, 1671. 26 Figure 2.2. Fifteenth-century map of the Iberian Peninsula ........28 Figure 2.3. -
The Computer and the Brain, and the Potential of the Other 95%
The Computer and the Brain, and the Potential of the Other 95% Tai Sing Lee Carnegie Mellon University July 15, 2015. By some accounts, the formation of the Computer Science Department 50 years ago was in large part motivated by Herb Simon and colleagues’ agenda of using computers to simulate and understand human intelligence and decision-making. The founding fathers of the field of computer science in fact cared deeply about understanding the brain. Alan Turing and a number of mathematicians shared beers with biologists like Horace Barlow in their informal dining Ratio Club in Cambridge to discuss the principles of cybernetics and the neural basis of intelligence. Computer science and neuroscience had been more intimately coupled than most practitioners of either field might realize nowadays. The last paper John Von Neumann wrote, on his deathbed, was his contemplation on the connection between the Computer and the Brain. Computer science and neuroscience have made tremendous strides in the last 50 years. Thanks to the visionaries who founded and lead CSD and later the School of Computer Science, Carnegie Mellon has been at the forefront of many innovations in artificial intelligence. Allen Newell and Herb Simon’s Logic Theorist in 1955 was the first AI program. It proved 38 of the 52 well-known theorems and even discovered a more elegant new proof on its own! Hans Berliner’s BKG Backgammon defeated the reigning world champion at the time in 1979. Feng-Hsiung Hsu’s Deep Thoughts achieved grand-master level in 1988 when he was a third year Ph.D.