Speech Standards: Lessons Learnt Paolo Baggia
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Just Another Perl Hack Neil Bowers1 Canon Research Centre Europe
Weblint: Just Another Perl Hack Neil Bowers1 Canon Research Centre Europe Abstract Weblint is a utility for checking the syntax and style of HTML pages. It was inspired by lint [15], which performs a similar function for C and C++ programmers. Weblint does not aspire to be a strict SGML validator, but to provide helpful comments for humans. The importance of quality assurance for web sites is introduced, and one particular area, validation of HTML, is described in more detail. The bulk of the paper is devoted to weblint: what it is, how it is used, and the design and implementation of the current development version. 1. Introduction The conclusion opens with a summary of the information and opinions given in this paper. A Web sites are becoming an increasingly critical part of selection of the lessons learned over the last four years how many companies do business. For many companies is given, followed by plans for the future, and related web sites are their business. It is therefore critical that ideas. owners of web sites perform regular testing and analysis, to ensure quality of service. 2. Web Site Quality Assurance There are many different checks and analyses which The following are some of the questions you should be you can run on a site. For example, how usable is your asking yourself if you have a web presence. I have site when accessed via a modem? An incomplete list of limited the list to those points which are relevant to similar analyses are given at the start of Section 2. -
HTML5 and the Open Web Platform
HTML5 and the Open Web Platform Stuttgart 28 May 2013 Dave Raggett <[email protected]> The Open Web Platform What is the W3C? ● International community where Members, a full-time staff and the public collaborate to develop Web standards ● Led by Web inventor Tim Berners-Lee and CEO Jeff Jaffe ● Hosted by MIT, ERCIM, Keio and Beihang ● Community Groups open to all at no fee ● Business Groups get more staff support ● Technical Working Groups ● Develop specs into W3C Recommendations ● Participants from W3C Members and invited experts ● W3C Patent process for royalty free specifications 3 Who's involved ● W3C has 377 Members as of 11 May 2013 ● To name just a few ● ACCESS, Adobe, Akamai, Apple, Baidu, BBC, Blackberry (RIM), BT, Canon, Deutsche Telekom, eBay, Facebook, France Telecom, Fujitsu, Google, Hitachi, HP, Huawei, IBM, Intel, LG, Microsoft, Mozilla, NASA, NEC, NTT DoCoMo, Nuance, Opera Software, Oracle, Panasonic, Samsung, Siemens, Sony, Telefonica, Tencent, Vodafone, Yandex, … ● Full list at ● http://www.w3.org/Consortium/Member/List 4 The Open Web Platform 5 Open Web Platform ● Communicate with HTTP, Web Sockets, XML and JSON ● Markup with HTML5 ● Style sheets with CSS ● Rich graphics ● JPEG, PNG, GIF ● Canvas and SVG ● Audio and Video ● Scripting with JavaScript ● Expanding range of APIs ● Designed for the World's languages ● Accessibility with support for assistive technology 6 Hosted and Packaged Apps ● Hosted Web apps can be directly loaded from a website ● Packaged Web apps can be locally installed on a device and run without the need for access to a web server ● Zipped file containing all the necessary resources ● Manifest file with app meta-data – Old work on XML based manifests (Web Widgets) – New work on JSON based manifests ● http://w3c.github.io/manifest/ ● Pointer to app's cache manifest ● List of required features and permissions needed to run correctly ● Runtime and security model for web apps ● Privileged apps must be signed by installation origin's private key 7 HTML5 Markup ● Extensive range of features ● Structural, e.g. -
Downloaded (Just Like an EXE File) and Executed from Local File System
HTML - Wikipedia, the free encyclopedia http://en.wikipedia.org/wiki/HTML HTML From Wikipedia, the free encyclopedia HyperText Markup Language ( HTML ) is the main HTML markup language for displaying web pages and other (HyperText Markup Language) information that can be displayed in a web browser. Filename .html, .htm HTML is written in the form of HTML elements extension consisting of tags enclosed in angle brackets (like <html> ), within the web page content. HTML tags most Internet text/html commonly come in pairs like <h1> and </h1> , although media type some tags, known as empty elements , are unpaired, for Type code TEXT example <img> . The first tag in a pair is the start tag , the second tag is the end tag (they are also called opening Uniform Type public.html tags and closing tags ). In between these tags web Identifier designers can add text, tags, comments and other types of Developed by World Wide Web Consortium & text-based content. WHATWG The purpose of a web browser is to read HTML Type of Markup language documents and compose them into visible or audible web format pages. The browser does not display the HTML tags, but Extended SGML uses the tags to interpret the content of the page. from HTML elements form the building blocks of all websites. Extended to XHTML HTML allows images and objects to be embedded and Standard(s) can be used to create interactive forms. It provides a ISO/IEC 15445 means to create structured documents by denoting W3C HTML 4.01 structural semantics for text such as headings, paragraphs, (http://www.w3.org/TR/1999 lists, links, quotes and other items. -
Multilingual Aspects in Speech and Multimodal Interfaces
Multilingual Aspects in Speech and Multimodal Interfaces Paolo Baggia Director of International Standards 1 Outline Loquendo Today Do we need multilingual applications? Voice is different from text? Current Solutions – a Tour: Speech Interface Framework Today Voice Applications Speech Recognition Grammars Speech Prompts Pronunciation Lexicons Discussion Points 2 Company Profile . Privately held company (fully owned by Telecom Italia), founded in 2001 as spin-off from Telecom Italia Labs, capitalizing on 30yrs experience and expertise in voice processing. Global Company, leader in Europe and South America for award-winning, high quality voice technologies (synthesis, recognition, authentication and identification) available in 30 languages and 71 voices. Multilingual, proprietary technologies protected Munich over 100 patents worldwide London . Financially robust, break-even reached in 2004, revenues and earnings growing year on year Paris . Offices in New York. Headquarters in Torino, Madrid local representative sales offices in Rome, Torino New York Madrid, Paris, London, Munich Rome . Flexible: About 100 employees, plus a vibrant ecosystem of local freelancers. 3 International Awards Market leader-Best Speech Engine Speech Industry Award 2007, 2008, 2009, 2010 2010 Speech Technology Excellence Award CIS Magazine 2008 Frost & Sullivan European Telematics and Infotainment Emerging Company of the Year Award Loquendo MRCP Server: Winner of 2008 IP Contact Center Technology Pioneer Award Best Innovation in Automotive Speech Synthesis Prize AVIOS- SpeechTEK West 2007 Best Innovation in Expressive Speech Synthesis Prize AVIOS- SpeechTEK West 2006 Best Innovation in Multi-Lingual Speech Synthesis Prize AVIOS- SpeechTEK West 2005 4 Do We Need Multilingual Applications? Yes, because … . We live in a Multicultural World . Movement of students/professionals, migration, tourism . -
Rečové Interaktívne Komunikačné Systémy
Rečové interaktívne komunikačné systémy Matúš Pleva, Stanislav Ondáš, Jozef Juhár, Ján Staš, Daniel Hládek, Martin Lojka, Peter Viszlay Ing. Matúš Pleva, PhD. Katedra elektroniky a multimediálnych telekomunikácií Fakulta elektrotechniky a informatiky Technická univerzita v Košiciach Letná 9, 04200 Košice [email protected] Táto učebnica vznikla s podporou Ministerstvo školstva, vedy, výskumu a športu SR v rámci projektu KEGA 055TUKE-04/2016. c Košice 2017 Názov: Rečové interaktívne komunikačné systémy Autori: Ing. Matúš Pleva, PhD., Ing. Stanislav Ondáš, PhD., prof. Ing. Jozef Juhár, CSc., Ing. Ján Staš, PhD., Ing. Daniel Hládek, PhD., Ing. Martin Lojka, PhD., Ing. Peter Viszlay, PhD. Vydal: Technická univerzita v Košiciach Vydanie: prvé Všetky práva vyhradené. Rukopis neprešiel jazykovou úpravou. ISBN 978-80-553-2661-0 Obsah Zoznam obrázkov ix Zoznam tabuliek xii 1 Úvod 14 1.1 Rečové dialógové systémy . 16 1.2 Multimodálne interaktívne systémy . 19 1.3 Aplikácie rečových interaktívnych komunikačných systémov . 19 2 Multimodalita a mobilita v interaktívnych systémoch s rečo- vým rozhraním 27 2.1 Multimodalita . 27 2.2 Mobilita . 30 2.3 Rečový dialógový systém pre mobilné zariadenia s podporou multimodality . 31 2.3.1 Univerzálne riešenia pre mobilné terminály . 32 2.3.2 Projekt MOBILTEL . 35 3 Parametrizácia rečových a audio signálov 40 3.1 Predspracovanie . 40 3.1.1 Preemfáza . 40 3.1.2 Segmentácia . 41 3.1.3 Váhovanie oknovou funkciou . 41 3.2 Spracovanie rečového signálu v spektrálnej oblasti . 41 3.2.1 Lineárna predikčná analýza . 43 3.2.2 Percepčná Lineárna Predikčná analýza . 43 3.2.3 RASTA metóda . 43 3.2.4 MVDR analýza . -
Orchestration Server Developer's Guide
Orchestration Server Developer's Guide SCXML Language Reference 10/2/2021 Contents • 1 SCXML Language Reference • 1.1 Usage • 1.2 Syntax and Semantics • 1.3 Extensions and Deviations • 1.4 SCXML Elements • 1.5 Event Extensions • 2 Logging and Metrics • 2.1 Supported URI Schemes • 2.2 Supported Profiles • 2.3 Examples Orchestration Server Developer's Guide 2 SCXML Language Reference SCXML Language Reference Click here to view the organization and contents of the SCXML Language Reference. SCXML stands for State Chart XML: State Machine Notation for Control Abstraction. Orchestration Server utilizes an internally developed SCXML engine which is based on, and supports the specifications outlined in the W3C Working Draft 7 May 2009 [1]. There are, however, certain changes and/or notable differences (see Extensions and Deviations) between the W3C specification and the implementation in Orchestration Server which are described on this page. Only the ECMAScript profile is supported (the minimal and XPath profiles are not supported). Usage SCXML documents are a means of defining control behaviour through the design of state machines. The SCXML engine supports a variety of control flow elements as well as methods to manipulate and send/receive data, thus enabling users to create complex mechanisms. For example, Orchestration, which is a consumer of the SCXML engine, uses SCXML documents to execute strategies. A simple strategy may involve defining different call routing behaviours depending on the incoming caller, however, the SCXML engine facilitates a wide variety of applications beyond simple call routing. Authoring SCXML documents can be done using any text editor or by leveraging the Genesys Composer tool. -
A Framework for Intelligent Voice-Enabled E-Education Systems
A FRAMEWORK FOR INTELLIGENT VOICE-ENABLED E-EDUCATION SYSTEMS BY AZETA, Agbon Ambrose (CUGP050134) A THESIS SUBMITTED TO THE DEPARTMENT OF COMPUTER AND INFORMATION SCIENCES, SCHOOL OF NATURAL AND APPLIED SCIENCES, COLLEGE OF SCIENCE AND TECHNOLOGY, COVENANT UNIVERSITY, OTA, OGUN STATE NIGERIA, IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE AWARD OF DOCTOR OF PHILOSOPHY IN COMPUTER SCIENCE MARCH, 2012 CERTIFICATION This is to certify that this thesis is an original research work undertaken by Ambrose Agbon Azeta with matriculation number CUGP050134 under our supervision and approved by: Professor Charles Korede Ayo --------------------------- Supervisor Signature and Date Dr. Aderemi Aaron Anthony Atayero --------------------------- Co- Supervisor Signature and Date Professor Charles Korede Ayo --------------------------- Head of Department Signature and Date --------------------------- External Examiner Signature and Date ii DECLARATION It is hereby declared that this research was undertaken by Ambrose Agbon Azeta. The thesis is based on his original study in the department of Computer and Information Sciences, College of Science and Technology, Covenant University, Ota, under the supervision of Prof. C. K. Ayo and Dr. A. A. Atayero. Ideas and views of this research work are products of the original research undertaken by Ambrose Agbon Azeta and the views of other researchers have been duly expressed and acknowledged. Professor Charles Korede Ayo --------------------------- Supervisor Signature and Date Dr. Aderemi Aaron Anthony Atayero --------------------------- Co- Supervisor Signature and Date iii ACKNOWLEDGMENTS I wish to express my thanks to God almighty, the author of life and provider of wisdom, understanding and knowledge for seeing me through my Doctoral research program. My high appreciation goes to the Chancellor, Dr. David Oyedepo and members of the Board of Regent of Covenant University for catching the vision and mission of Covenant University. -
A Declarative Approach Based on Xforms
Helsinki University of Technology Publications in Telecommunications Software and Multimedia Teknillisen korkeakoulun tietoliikenneohjelmistojen ja multimedian julkaisuja Espoo 2006 TML-A16 WEB USER INTERACTION - A DECLARATIVE APPROACH BASED ON XFORMS Mikko Honkala Dissertation for the degree of Doctor of Science in Technology to be presented with due permission of the Department of Computer Science and Engineering, for pub- lic examination and debate in Auditorium T2 at Helsinki University of Technology (Espoo, Finland) on the 12th of January, 2007, at 12 noon. Helsinki University of Technology Department of Computer Science and Engineering Telecommunications Software and Multimedia Laboratory Teknillinen korkeakoulu Tietotekniikan osasto Tietoliikenneohjelmistojen ja multimedian laboratorio Distribution: Helsinki University of Technology Telecommunications Software and Multimedia Laboratory P.O.Box 5400 FIN-02015 HUT Tel. +358-9-451 2870 Fax. +358-9-451 5014 c Mikko Honkala ISBN-13 978-951-22-8565-5 ISBN-10 951-22-8565-7 ISSN 1456-7911 ISBN-13 978-951-22-8566-2 (PDF) ISBN-10 951-22-8566-5 (PDF) ISSN 1455 9722 (PDF) URL: http://lib.tkk.fi/Diss/ Otamedia Oy Espoo 2006 ABSTRACT Author Mikko Honkala Title Web User Interaction - a Declarative Approach Based on XForms Published Doctoral thesis, Helsinki University of Technology, 2006 Keywords XML, User Interfaces, User Interaction, XForms, UIDL, XHTML This thesis studies next-generation web user interaction definition languages, as well as browser software architectures. The motivation comes from new end-user requirements for web applications: demand for higher interaction, adaptation for mobile and multimodal usage, and rich multimedia content. At the same time, there is a requirement for non- programmers to be able to author, customize, and maintain web user interfaces. -
A State-Transfer-Based Open Framework for Internet of Things Service Composition
Doctoral Dissertation Academic Year 2018 A State-Transfer-based Open Framework for Internet of Things Service Composition Keio University Graduate School of Media Design Ruowei Xiao A Doctoral Dissertation submitted to Keio University Graduate School of Media Design in partial fulfillment of the requirements for the degree of Ph.D of Media Design Ruowei Xiao Thesis Advisor: Associate Professor Kazunori Sugiura (Principal Advisor) Professor Akira Kato (Co-advisor) Professor Keiko Okawa (Co-advisor) Thesis Committee: Professor Akira Kato (Principal Advisor) Professor Keiko Okawa (Member) Professor Kai Kunze (Member) Senior Assistant Professor Takeshi Sakurada (Member) Abstract of Doctoral Dissertation of Academic Year 2018 A State-Transfer-based Open Framework for Internet of Things Service Composition Category: Science / Engineering Summary Current Internet-of-Things (IoT) applications are built upon multiple architec- tures, standards and platforms, whose heterogeneity leads to domain specific tech- nology solutions that cannot interoperate with each other. It generates a growing need to develop and experiment with technology solutions that break and bridge the barriers. This research introduces an open IoT development framework that offers gen- eral, platform-agnostic development interfaces, and process. It allows IoT re- searchers and developers to (re-)use and integrate a wider range of IoT and Web services. A Finite State Machine (FSM) model was adopted to provide a uniform service representation as well as an entry point for swift and flexible service com- position under Distributed Service Architecture (DSA). Leveraging this open IoT service composition framework, value-added, cross-domain IoT applications and business logic can be developed, deployed, and managed in an on-the-fly manner. -
PDF Brochure
STAFF SPONSORS *John Toole Executive Director and CEO William B. Pickett, Co-Founder, Historian Dennis Paustenbach, President and CEO, ChemRisk CommerceNet The Computer History Museum Dr. Pickett is a senior professor of history at Rose-Hulman Kirkland and Ellis *Dennis Paustenbach Institute of Technology. He has been a Fulbright professor in Christian Taylor President and CEO, ChemRisk Japan and is the author of numerous books and articles on Rose-Hulman Ventures *Donald Kennedy American political and diplomatic history and the history of Editor in Chief, Science technology. He has taught courses about the historical WEB HISTORY CENTER FOUNDING MEMBERS President Emeritus, Stanford University impact of the Web, and in 2004 he initiated and was co-chair Institutions *David Kirsch of a conference called “The World Wide Web at Ten: The • Center for History and New Media, Director, Dotcom Digital Archive Dream and the Reality” commemorating the 10th George Mason University University of Maryland anniversary of the commercial Web. • CommerceNet • The Computer History Museum *Dave Raggett Marc Weber, Co-Founder, • Digibarn Web Pioneer—HTML Architect Collections and Communications Specialist • International World Wide Web Conference Committee *Christian Taylor Mr. Weber is an award-winning journalist and technology • The Internet Archive Legal and IP Expert writer, and co-founder of Arcady Press. He was the first person • Rose-Hulman Institute of Technology Kirkland and Ellis to research the Web's origins as a historical topic, starting in • Stanford University Libraries *Rob Kusel 1995. He co-founded the World Wide Web History Project in History of Science and Technology Collections Fundraising Advisor • University of Maryland Essex Drake 1996 with the assistance of Sir Tim Berners-Lee and many Dot-Com Archive and Business Plan Archive other Web pioneers. -
Download As A
Client-Side State-based Control for Multimodal User Interfaces David Junger <[email protected]> Supervised by Simon Dobnik and Torbjörn Lager of the University of Göteborg Abstract This thesis explores the appeal and (ease of) use of an executable State Chart language on the client-side to design and control multimodal Web applications. A client-side JavaScript implementation of said language, created by the author, is then discussed and explained. Features of the language and the implementation are illustrated by constructing, step-by-step, a small multimodal Web application. Table of Contents Acknowledgments Introduction 1. More and more modality components 1.1 Multimodal Output 1.2 Multimodal Input 2. Application Control 2.1 Multimodal Architecture and Interfaces 2.2 Internal application logic 2.3 Harel State Charts 2.4 SCXML 2.5 Benefits of decentralized control 3. JSSCxml 3.1 Web page and browser integration 3.2 Implementation details 3.3 Client-side extras 3.4 Performance 4. Demonstration 4.1 The event-stream 4.2 The user interface 4.3 Implementing an output modality component 4.4 Where is the “mute” button? 5. The big picture 5.1 Related work 5.2 Original contribution 6. Future Work References Acknowledgments I learned about SCXML (the aforementioned State Chart language) thanks to Torbjörn Lager, who taught the Speech and Dialogue course in my first year in the Master in Language Technology. Blame him for getting me into this. Implementing SCXML was quite a big project for me. I would never have considered starting it on my own, as I have a record of getting tired of long projects and I was afraid that would happen again. -
Speechtek 2008 Final Program
FINAL PROGRAM AUGUST 18–20, 2008 NEW YORK MARRIOTT MARQUIS NEW YORK CITY SPEECH IN THE MAINSTREAM WWW. SPEECHTEK. COM KEYNOTE SPEAKERS w Analytics w Mobile Devices Ray Kurzweil author of The Age of Spiritual Machines and w Natural Language w Multimodal The Age of Intelligent Machines w Security w Video and Speech w Voice User Interfaces w Testing and Tuning Lior Arussy President, Strativity Group author of Excellence Every Day Gold Sponsors: Kevin Mitnick author, world-famous (former) hacker, & Silver Sponsor: Media Sponsors: security consultant Organized and produced by Welcome to SpeechTEK 2008 Speech in the Mainstream Last year we recognized that the speech industry was at a tipping Conference Chairs point, or the point at which change becomes unstoppable. To James A. Larson illustrate this point, we featured Malcolm Gladwell, author of the VP, Larson Technical Services best-selling book, The Tipping Point, as our keynote speaker. Not surprisingly, a lot has happened since the industry tipped: We’re Susan L. Hura seeing speech technology in economy-class cars and advertised on television to millions of households, popular video games with VP, Product Support Solutions more robust voice commands, and retail shelves stocked with affordable, speech-enabled aftermarket navigation systems. It’s Director, SpeechTEK clear that speech is in the mainstream—the focus of this year’s Program Planning conference. David Myron, Editorial Director, Speech technology has become mainstream for organizations CRM and Speech Technology magazines seeking operational efficiencies through self-service. For enterprises, speech enables employees to reset passwords, sign up for benefits, and find information on company policies and procedures.