An Efficient, Streamable Text Format for Multimedia Captions and Subtitles Dick C.A

Total Page:16

File Type:pdf, Size:1020Kb

An Efficient, Streamable Text Format for Multimedia Captions and Subtitles Dick C.A An Efficient, Streamable Text Format for Multimedia Captions and Subtitles Dick C.A. Bulterman, Jack Jansen, Pablo Cesar, Samuel Cruz-Lara To cite this version: Dick C.A. Bulterman, Jack Jansen, Pablo Cesar, Samuel Cruz-Lara. An Efficient, Streamable Text Format for Multimedia Captions and Subtitles. ACM Symposium on Document Engineering - DocEng 2007, Aug 2007, Winnipeg, Canada. inria-00192467 HAL Id: inria-00192467 https://hal.inria.fr/inria-00192467 Submitted on 28 Nov 2007 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. An Efficient, Streamable Text Format for Multimedia Captions and Subtitles Dick C.A. Bulterman, A.J. Jansen, Pablo Cesar and Samuel Cruz-Lara CWI: Centrum voor Wiskunde en Informatica INRIA Lorraine Kruislaan 413 Laboratoire Loria BP 239 1098 SJ Amsterdam F-54506 Vandoeuvre Les Nancy cedex The Netherlands France {Dick.Bulterman, Jack.Jansen, P.S.Cesar}@cwi.nl [email protected] ABSTRACT 1. INTRODUCTION In spite of the high profile of media types such as video, XML multimedia languages integrate a collection of audio, audio and images, many multimedia presentations rely graphics, image, text, and video media items into a single extensively on text content. Text can be used for incidental presentation. Since most media items are by nature very labels, or as subtitles or captions that accompany other large — they consist of sampled data that can run into the media objects. In a multimedia document, text content is not gigabytes — it has been common wisdom in multimedia only constrained by the need to support presentation styles document design to develop container formats that refer to and layout, it is also constrained by the temporal context of media objects by reference. A good example of such a lan- the presentation. This involves intra-text and extra-text tim- guage is SMIL [4], which does not contain any actual media ing synchronization with other media objects. This paper data but consists of scheduling and layout directives that describes a new timed-text representation language that is control the presentation of a set of external (to the docu- intended to be embedded in a non-text host language. Our ment) media object URIs. format, which we call aText (for the Ambulant Text Format), balances the need for text styling with the requirement for While the clean separation of content from document struc- an efficient representation that can be easily parsed and ture is a valid abstract concept, it often brings with it an scheduled at runtime. aText, which can also be streamed, is increased authoring burden. Instead of maintaining (or gen- defined as an embeddable text format for use within declar- erating) a single document, several documents need to be ative XML languages. The paper presents a discussion of the defined: one for the integrating structure and one each for requirements for the format, a description of the format and the media objects. It also brings media access problems — a comparison with other existing and emerging text formats. especially in mobile environments — where multiple trans- We also provide examples for aText when embedded within fer sessions need to be initiated to fetch a few bytes of con- the SMIL and MLIF languages and discuss our implementa- tent. While this is defendable for large, typically binary tion experiences of aText with the Ambulant Player. media objects, it is less appropriate for text data. Categories and Subject Descriptors One approach for increasing authoring and processing effi- D.3.2 [Language Classifications]: Specialized application ciency is to integrate an existing text format into the XML languages; I.7.2 [Document and Text Processing]: Docu- container language. While many public and proprietary text ment Preparation—Languages and systems. formats exist, they cannot be simply imported into a multi- General Terms media container format because most are extremely feature rich (meaning that they need a complex text processing Performance, Design, Experimentation, Standardization, engine to manage text styling and layout) and because they Languages. were designed to be timing agnostic (meaning that there is Keywords no existing syntax for describing intra-text timing, nor extra- text synchronization with other objects). Even dedicated lan- Timed text, SMIL, DFXP, RealText, streaming text, Ambu- guages for describing timed text (such as RealText [14], lant. DFXP [1], HTML+TIME [10] or MPEG4-Part 17 [7]) often are based on temporal models that clash with the host timing model used in the media presentation. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that This paper presents a new language for integrating text in an copies are not made or distributed for profit or commercial (processing) efficient and streamable manner into XML- advantage and that copies bear this notice and the full citation on based multimedia languages. We call this language the the first page. To copy otherwise, or republish, to post on servers or Ambulant Text Format, or aText. Rather than being designed to redistribute to lists, requires prior specific permission and/or a to be a stand-alone text container format, aText was designed fee. to be an embedded timed text description format that could Conference '00, Month 1-2, 2000, City, State. extend existing XML languages such as SMIL and MLIF [5]. Copyright 200x ACM 1-58113-000-0/00/0000…$5.00. DocEng ’07 Submission Page 1 of 10 The principal contribution of aText is that a balance has been presented in parallel with the text. It may also require achieved between a text format that meets the stylistic needs style cues for speaker identification. This class will of a set of common multimedia applications classes and one require light formatting (alignment, line breaks) and that is easy to parse, schedule and render on a wide range of tight temporal coupling with other objects in the presen- multimedia implementation platforms (including low-pow- tation. Historically, captions and subtitles have been ered mobile devices and low-end set-top boxes). The format geared for use by the hearing impaired. has been implemented and integrated into the Ambulant 4. Foreign-language subtitles: this class is similar to the cap- open media player [3]. tions/subtitles described in class #3, except that audio cues and speaker identification are typically not pre- This paper is structured as follows. Section 2 provides a sented; there is an assumption that the foreign-language summary of the principal uses of a temporal text format in reader can see and hear the speakers and cues. multimedia documents. Section 3 surveys existing support for timed text, both within document containers and as 5. Time-constrained moving text: this class requires support stand-alone timed text languages. Section 4 describes the for marquee-style crawling or credits-style rolling text. aText format in detail, including its design goals, temporal The content may or may not be directly related to other characteristics and the styling facilities available in the lan- content in the presentation (such as an independent guage. We also discuss the use of aText as an external con- news crawl under an unrelated video). Such moving text tainer format. Section 5 provides an overview of our current needs to have a rate associated with it that is not related implementation status and the performance issues encoun- to its internal content. It may also need to loop based on tered when implementing aText within the Ambulant player. external (to the text) factors. We conclude with our expectations for further development 6. Inter-object triggered text: this class requires the specifica- and standardization of aText. tion of text fragments that are either triggered by exter- nal media (such as a particular frame in a video) or that 2. APPLICATIONS CLASSES FOR TEXT trigger external objects (such as the sound of a gong IN MULTIMEDIA DOCUMENTS when a certain word appears in the text). The level of granularity may be at the word or phrase level, rather The functionality proposed for aText is based on a set of gen- than at the text object level. eral use cases that we have found to be common when authoring multimedia presentations. 7. Conditional timed text: this class requires support for the conditional specification of text into a rendering area. We preface this discussion with the observation that, unlike The conditions may depend on the state of variables in languages such as XHTML [12], text often plays an inciden- the integrating document, and may be dynamically eval- tal role in multimedia presentations. In text-centric presenta- uated during the presentation of the text. tions, the flow of text in a document will dominate layout 8. Static block text: this class requires support for relatively and styling concerns. In multimedia documents, the tempo- large amounts of formatted text. The text block may con- ral nature of continuous media object (and the presentation sist of informative information that is conditionally pre- sequence of objects such as graphics elements and images) sented during a presentation, such as help text or content define the presentation structure. When text is present, it descriptions. normally plays a subordinate role. As we will see, aText was designed to provide substantial support for the first seven classes and reasonable support The subordinate role of text does not mean that it is unim- for the eight class of text.
Recommended publications
  • Guidelines for the Preservation of Video Recordings IASA-TC 06
    Technical Committee Standards, Recommended Practices, and Strategies Guidelines for the Preservation of Video Recordings IASA-TC 06 Part B. Video Signal, Preservation Concepts, and Target Formats From IASA-TC 06, Edition 1 Revised version, 2019 B-1 Revised version, 2019 Guidelines for the Preservation of Video Recordings Table of Contents B.1 The Video Signal and Bitstreams: Format and Features B-6 B.1.1 Conventional video carriers and formatting B-6 B.1.1.1 Conventional video carriers and the video signal B-6 Sidebar: the noun video B-6 B.1.1.2 Conventional carriers compared to file-based video B-6 B.1.1.3 Broadcast standards and the formatting of video recordings B-7 B.1.2 Analogue video unpacked, part one: key features and variants B-8 B.1.2.1 Illusion of motion from a stream of still images B-9 B.1.2.2 Sound data is carried in parallel with picture data B-9 B.1.2.3 Picture data consists of sets of horizontal scan lines B-10 B.1.2.4 Horizontal lines of picture data may be interlaced B-11 B.1.2.5 Movies on film can be recorded as video B-11 B.1.2.6 Timing: video signal elements must be synchronized (RS-170) B-12 B.1.2.7 Range of picture brightnesses and blanking “brightness” B-14 B.1.3 Analogue video unpacked, part two: key features and variants continued B-16 B.1.3.1 Colour encoding for video on conventional carriers B-16 B.1.3.1.1 Composite video B-17 B.1.3.1.2 S-video B-18 B.1.3.1.3 Colour-difference component video B-18 Sidebar: colour and tonal specifications for digital video and related matters B-20 B.1.3.2 Ancillary data B-22
    [Show full text]
  • Bibliography of Erik Wilde
    dretbiblio dretbiblio Erik Wilde's Bibliography References [1] AFIPS Fall Joint Computer Conference, San Francisco, California, December 1968. [2] Seventeenth IEEE Conference on Computer Communication Networks, Washington, D.C., 1978. [3] ACM SIGACT-SIGMOD Symposium on Principles of Database Systems, Los Angeles, Cal- ifornia, March 1982. ACM Press. [4] First Conference on Computer-Supported Cooperative Work, 1986. [5] 1987 ACM Conference on Hypertext, Chapel Hill, North Carolina, November 1987. ACM Press. [6] 18th IEEE International Symposium on Fault-Tolerant Computing, Tokyo, Japan, 1988. IEEE Computer Society Press. [7] Conference on Computer-Supported Cooperative Work, Portland, Oregon, 1988. ACM Press. [8] Conference on Office Information Systems, Palo Alto, California, March 1988. [9] 1989 ACM Conference on Hypertext, Pittsburgh, Pennsylvania, November 1989. ACM Press. [10] UNIX | The Legend Evolves. Summer 1990 UKUUG Conference, Buntingford, UK, 1990. UKUUG. [11] Fourth ACM Symposium on User Interface Software and Technology, Hilton Head, South Carolina, November 1991. [12] GLOBECOM'91 Conference, Phoenix, Arizona, 1991. IEEE Computer Society Press. [13] IEEE INFOCOM '91 Conference on Computer Communications, Bal Harbour, Florida, 1991. IEEE Computer Society Press. [14] IEEE International Conference on Communications, Denver, Colorado, June 1991. [15] International Workshop on CSCW, Berlin, Germany, April 1991. [16] Third ACM Conference on Hypertext, San Antonio, Texas, December 1991. ACM Press. [17] 11th Symposium on Reliable Distributed Systems, Houston, Texas, 1992. IEEE Computer Society Press. [18] 3rd Joint European Networking Conference, Innsbruck, Austria, May 1992. [19] Fourth ACM Conference on Hypertext, Milano, Italy, November 1992. ACM Press. [20] GLOBECOM'92 Conference, Orlando, Florida, December 1992. IEEE Computer Society Press. http://github.com/dret/biblio (August 29, 2018) 1 dretbiblio [21] IEEE INFOCOM '92 Conference on Computer Communications, Florence, Italy, 1992.
    [Show full text]
  • Introduction to Closed Captions
    TECHNICAL PAPER Introduction to Closed Captions By Glenn Eguchi Senior Computer Scientist April 2015 © 2015 Adobe Systems Incorporated. All rights reserved. If this whitepaper is distributed with software that includes an end user agreement, this guide, as well as the software described in it, is furnished under license and may be used or copied only in accordance with the terms of such license. Except as permitted by any such license, no part of this guide may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, recording, or otherwise, without the prior written permission of Adobe Systems Incorporated. Please note that the content in this guide is protected under copyright law even if it is not distributed with software that includes an end user license agreement. The content of this guide is furnished for informational use only, is subject to change without notice, and should not be construed as a commitment by Adobe Systems Incorporated. Adobe Systems Incorporated assumes no responsibility or liability for any errors or inaccuracies that may appear in the informational content contained in this guide. This article is intended for US audiences only. Any references to company names in sample templates are for demonstration purposes only and are not intended to refer to any actual organization. Adobe and the Adobe logo, and Adobe Primetime are either registered trademarks or trademarks of Adobe Systems Incorporated in the United States and/or other countries. Adobe Systems Incorporated, 345 Park Avenue, San Jose, California 95110, USA. Notice to U.S. Government End Users.
    [Show full text]
  • On Organisational Involvement and Collaboration in W3C Standards Through Editorship Jonas Gamalielsson* and Björn Lundell
    Gamalielsson and Lundell Journal of Internet Services Journal of Internet Services and Applications (2017) 8:5 DOI 10.1186/s13174-017-0056-1 and Applications RESEARCH Open Access On organisational involvement and collaboration in W3C standards through editorship Jonas Gamalielsson* and Björn Lundell Abstract Over time, a number of open standards have been developed and implemented in software for addressing a number of challenges, such as lock-in, interoperability and longevity of software systems and associated digital artefacts. An understanding of organisational involvement and collaboration in standardisation is important for informing any future policy and organisational decisions concerning involvement in standardisation. The overarching goal of the study is to establish how organisations contribute to open standards development through editorship. Specifically, the focus is on open standards development in the World Wide Web Consortium (W3C). Through an analysis of editorship for all W3C recommendations we contribute novel findings concerning organisational involvement and collaboration, and highlight contributions from different types of organisations and countries. We make five principal contributions. First, we establish an overall characterisation of organisational involvement in W3C standardisation. Second, we report on organisational involvement in W3C standardisation over time. Third, we establish how different organisations, organisation types, and countries are involved in W3C technologies. Fourth, we report on organisational
    [Show full text]
  • SMIL 2.0 — Interactive Multimedia on the Web Lloyd Rutledge
    SMIL 2.0 — Interactive Multimedia on the Web Lloyd Rutledge Multimedia and Human-Computer Interaction Group CWI, Amsterdam, The Netherlands W3C SYMM working group Lynda Hardman, Jacco van Ossenbruggen: CWI Dick Bulterman, Jack Jansen, Sjoerd Mullender: Oratrix W3C SYMM working group http://www.cwi.nl/~media/SMIL/Tutorial/{SMIL-4hr.html} Includes material from the upcoming book "SMIL 2.0 — Interactive Multimedia on the Web" Copyright © 2002, Lloyd Rutledge Synchronized Multimedia Integration Language (SMIL) Main Points Pronounced smile Multimedia for the Web — for multimedia what HTML is for hypertext Integration format for presentable mono-medium formats Structure SMIL 1.0 — W3C Recommendation on 15th June 1998 SMIL 2.0 "meta-language" W3C Recommendation on 7th August 2001 SMIL 2.0 family formats SMIL Profile and SMIL Basic released with SMIL 2.0 SMIL 2.0 family format XHTML+SMIL comes after SMIL 2.0 Main Themes Powerful timing and synchronization Adaptive to users and systems Models a flexible but consistent presentation and user interface SMIL Isn't Flash — Flash is mono-medium animation on steriods MPEG-{4 / 7 / 21} — MPEG looks at content and coding, and player architecture and a whole lot more, but is more media centric than web centric D-HTML — D-HTML uses scripted definitions of local behaviors, without a notion of the presentation's context SMIL 2.0 Profiles What is a Profile? A language for which a browser can be built A combination of modules from the SMIL 2.0 "meta-language" Possibly non-SMIL constructs with SMIL constructs
    [Show full text]
  • Rich Web Applications – Client Standards
    Colorado Software Summit: October 22 – 27, 2006 © Copyright 2006, Kevin E. Kelly Rich Web Applications – Client Standards Kevin E. Kelly [email protected] Part of the “Web 2.0” picture Kevin E. Kelly – Rich Web Applications – Client Standards Slide 1 Colorado Software Summit: October 22 – 27, 2006 © Copyright 2006, Kevin E. Kelly Client Standards in the Real World Kevin E. Kelly – Rich Web Applications – Client Standards Slide 2 Colorado Software Summit: October 22 – 27, 2006 © Copyright 2006, Kevin E. Kelly Agenda Introduction W3C Rich Web Activity Compound Documents WICD Issues Demo The Backplane Q&A Kevin E. Kelly – Rich Web Applications – Client Standards Slide 3 Colorado Software Summit: October 22 – 27, 2006 © Copyright 2006, Kevin E. Kelly Introduction Work ➢ Co-op'ed at IBM ➢ US Air Force ➢ Automotive Embedded Startup ➢ Rational Software ➢ Acquired by IBM Standards ➢ Chair Compound Document Format Working Group, was also in the XForms Working Group ➢ Hypertext Coordination Working Group member ➢ HL7 Advisory Council Rep to the W3C Kevin E. Kelly – Rich Web Applications – Client Standards Slide 4 Colorado Software Summit: October 22 – 27, 2006 © Copyright 2006, Kevin E. Kelly W3C Domains Architecture Domain ➢ DOM, XML, Internationalization, Web Services, URI Technology and Society Domain ➢ Patent Policy, Privacy, Semantic Web Ubiquitous Web Domain ➢ Device Independence, Mobile Web, Multimodal, Voice Browser Web Accessibility Initiative ➢ Web Accessibility, International Web Accessibility Interaction Domain ➢ Next Slide Kevin E. Kelly – Rich Web Applications – Client Standards Slide 5 Colorado Software Summit: October 22 – 27, 2006 © Copyright 2006, Kevin E. Kelly W3C Interaction Domain Graphics Activity Style Activity ➢ SVG ➢ CSS ➢ WebCGM Synchronized Multimedia Activity HTML Activity ➢ SyMM (SMIL) ➢ HTML/XHTML ➢ Timed Text ➢ Hypertext Coordination Group XForms Activity Math Activity ➢ XForms ➢ Math (MathML) Rich Web Client Activity ➢ Next Slide Kevin E.
    [Show full text]
  • Timed Text Flip for Vantage Automated Subtitle and Captioning Conversion Integrated Into the Vantage Media Processing Platform
    Vantage Timed Text Flip Product Sheet Timed Text Flip Timed Text Flip for Vantage Automated Subtitle and Captioning Conversion Integrated into the Vantage Media Processing Platform Timed Text Flip is a sophisticated Introduction subtitle and captioning service that As video professionals are faced with increased accessibility requirements, a runs on the Vantage Media complete transcoding and delivery solution should include caption and Processing Platform, providing a subtitle delivery capabilities. Timed Text Flip integrates seamlessly into unique interface to read, write, and Vantage and is easy to use, with an intuitive interface that brings professional adjust subtitle and caption data. subtitling and captioning tools into existing Vantage workflows. Unlike simple caption filters, The Timed Text Flip service allows users to manipulate the timecode of caption data to match media files that are being processed in the same workflow. API to submit files for processing As part of Vantage, the Timed Text Flip caption and subtitle engine can be driven via REST API. Users can submit caption and subtitle files, monitor job progress, select custom workflows, and receive metrics via the Vantage API. Vantage Cloud Port Subtitling and Captioning Done Right Capable Timed Text Flip allows Vantage users to add high quality subtitles and captions to their content to meet accessibility requirements. Timed Text Flip is The Power of Vantage. designed specifically to meet strict internet and TV broadcast mandates that The Simplicity of SaaS. require captioning and subtitle delivery. The service recognizes and automati- As your workflows evolve and cally processes subtitle and captioning input files from service companies and move into the Cloud, customers.
    [Show full text]
  • Providing Captions Fro Flash-Based Streaming Video
    Providing Captions for Flash-Based Streaming Video By Kevin Jones If you have ever wanted to watch captioned streaming videos on the Internet, then you probably know how difficult they are to find. Outside of using Google’s advanced video search capability to search for captioned videos on the google.com domain, there are very few methods for reliably locating captioned videos on the Internet. While the W3C Accessibility Guidelines (WCAG) maintain that all videos on the Internet should provide both a synchronized text alternative and audio description for accessibility purposes, the reality is that very few actually do. There is growing awareness, however, of the many benefits of providing captions for streaming videos. In particular, captioning: Improves clarity and comprehension of the sounds and dialogue, particularly when poor audio, heavy accents, background noises, and other such media elements are present. Makes your video (and advertising) accessible to over 28 million Americans who are deaf or hard of hearing. Creates a complete text transcript of the video, which can be indexed by search engines to provide far more accurate search results than keyword tagging alone. Enables translation into multiple languages. Helps to meet regulatory compliance measures for governmental and educational institutions (e.g., “Section 508”). Allows individuals to follow along with the video dialogue even when they do not have access to sound on their PC or mobile device (e.g., in a noisy area with no headphones or when using devices with faulty or missing sound cards or drivers). Promotes literacy for children and adults by strengthening reading speed, comprehension, spelling, and grammar skills.
    [Show full text]
  • Promedia™ Carbon Carbon Server, Carbon Agent VERSION 3.20
    ProMedia™ Carbon Carbon Server, Carbon Agent VERSION 3.20 Installation and User Guide Disclaimer Harmonic reserves the right to alter the equipment specifications and descriptions in this publication without prior notice. No part of this publication shall be deemed to be part of any contract or warranty unless specifically incorporated by reference into such contract or warranty. The information contained herein is merely descriptive in nature, and does not constitute a binding offer for sale of the product described herein. Harmonic assumes no responsibility or liability arising from the use of the products described herein, except as expressly agreed to in writing by Harmonic. The use and purchase of this product do not convey a license under any patent rights, copyrights, trademark rights, or any intellectual property rights of Harmonic. Nothing hereunder constitutes a representation or warranty that using any products in the manner described herein will not infringe any patents of third parties. Trademark Acknowledgments Harmonic and all Harmonic product names are trademarks of Harmonic All other trademarks are the property of their respective owners. The software described in this document is furnished under a license agreement or nondisclosure agreement. The software may be used or copied only in accordance with the terms of those agreements. © 20120122 Harmonic Inc. All rirightsghts reserved. Documentation Conventions This manual uses some special symbols and fonts to ccallall your attention to important informationinformation.. The following symbols appear throughouthroughoutt this manual: CAUTION: The Caution symbol calls your attention to information that, if ignored, can adversely affect the performance of your Harmonic product, or that ccanan make a procedure needlessly difficult.
    [Show full text]
  • Web Standards.Pdf
    BOOKS FOR PROFESSIONALS BY PROFESSIONALS® Sikos, Ph.D. RELATED Web Standards Web Standards: Mastering HTML5, CSS3, and XML gives you a deep understand- ing of how web standards can be applied to improve your website. You will also find solutions to some of the most common website problems. You will learn how to create fully standards-compliant websites and provide search engine-optimized Web documents with faster download times, accurate rendering, lower development costs, and easy maintenance. Web Standards: Mastering HTML5, CSS3, and XML describes how you can make the most of web standards, through technology discussions as well as practical sam- ple code. As a web developer, you’ll have seen problems with inconsistent appearance and behavior of the same site in different browsers. Web standards can and should be used to completely eliminate these problems. With Web Standards, you’ll learn how to: • Hand code valid markup, styles, and news feeds • Provide meaningful semantics and machine-readable metadata • Restrict markup to semantics and provide reliable layout • Achieve full standards compliance Web standardization is not a sacrifice! By using this book, we can create and maintain a better, well-formed Web for everyone. CSS3, and XML CSS3, Mastering HTML5, US $49.99 Shelve in Web Development/General User level: Intermediate–Advanced SOURCE CODE ONLINE www.apress.com www.it-ebooks.info For your convenience Apress has placed some of the front matter material after the index. Please use the Bookmarks and Contents at a Glance links to access them. www.it-ebooks.info Contents at a Glance About the Author................................................................................................
    [Show full text]
  • Analysis and Design of a Subtitling System for Ambient Intelligence Environments
    Analysis and Design of a Subtitling System for Ambient Intelligence Environments Aitor Rodriguez-Alsina, Guillermo Talavera, Pilar Orero, Jordi Carrabina Qc-2090D. Escola d’Enginyeries. Campus UAB 08193 Bellaterra, Spain {Aitor.Rodriguez, Guillermo.Talavera, Pilar.Orero, Jordi.Carrabina}@uab.cat Abstract — The development of ubiquitous applications for enabling adaptation to the user (or application) accessibility ambient intelligence environments needs to also take into requirements. Most subtitles distributed on the Internet are account some usability and accessibility issues in order to described in text files that follow the SubRip (.SRT) format, ensure a proper user experience and to overcome the existing considered "perhaps the most basic of all subtitle content access barriers. A proper access to video subtitles, for formats"[4]. Figure 1. shows an example of an .SRT file. instance, is not always available due to the technical limitations of traditional video packaging, transmission and presentation. Each subtitle entry consists of the subtitle number, the time New Web standards enable more featured applications with the subtitle should appear on the screen, the subtitle itself, better multi-platform definition, so they are suitable for and a blank line to indicate the end. building ubiquitous applications for ambient intelligence 1 environments. This work presents a video subtitling system 00:00:20,000 --> 00:00:24,400 that enables the customization and adaptation of subtitles. The Altocumulus clouds occur between six thousand benefits of Web applications compared with device-specific native applications for building the solution as well as its 2 current platform support are analyzed. Finally, three different 00:00:24,600 --> 00:00:27,800 application use cases are presented.
    [Show full text]
  • Advances in W3C Web Graphics Standards Max Froumentin - World Wide Web Consortium SIGGRAPH 2003 31 July 2003
    Advances in W3C Web graphics standards Max Froumentin - World Wide Web Consortium SIGGRAPH 2003 31 July 2003 ● Part I ● XHTML2 / XForms ● SVG ● CSS ● SVG 1.0/1.1 ● Multimodal ● SVG Mobile Profiles ● * Part II ● SVG 1.2 ● Component integration ● SVG Print ● Device integration ● Multimedia ● Device Independence ● Multimedia: SMIL ● Multimodal Interaction ● Multimedia: Timed-Text ● Fine ● Multimedia: Timed-Text Requirements Part I - Specification Roundup - Integration What's new. Advances in W3C Web graphics standards - 1 of 19 SVG Advances in W3C Web graphics standards - 2 of 19 SVG 1.0/1.1 Recommendations since Sep 2001 and Jan 2003 Adoption ● Renderers: Adobe plug-in, Batik, Corel Smart Graphics, etc. ● Editors: XStudio ● Exchange format: Maya 5, Invisio@@ ● SVG Conference Advances in W3C Web graphics standards - 3 of 19 SVG Mobile Profiles Recommendation since Jan 03 ● SVG Tiny ● SVG Basic ● Adoption Advances in W3C Web graphics standards - 4 of 19 SVG 1.2 Second working draft released Apr2003 ● adds: rendering XML ● adds: flowing text ● adds: audio/video Advances in W3C Web graphics standards - 5 of 19 SVG Print Authoring Guidelines, not a profile Requirements document published Feb 2003 ● No animation or scripting ● Color reproduction ● Page layout ● Multiple part (fonts, images) aggregation Advances in W3C Web graphics standards - 6 of 19 Multimedia Advances in W3C Web graphics standards - 7 of 19 Multimedia: SMIL SMIL 2 Recommendation since Aug 2001 ● Adoption: animation markup included in SVG ● 3GPP has defined a profile of SMIL for MMS messages ● Implementations: Qi mobile browser, X-Smiles, RealOne, IE, etc. Advances in W3C Web graphics standards - 8 of 19 Multimedia: Timed-Text New Working Group ● movie subtitling ● captioning for accessibility ● scrolling news items ● karaoke! ● Credits Currently, several incompatible formats exist, causing interoperability problems when used within SMIL.
    [Show full text]