Improving Retrieval Accuracy Main Content Extraction HTML Web

Total Page:16

File Type:pdf, Size:1020Kb

Improving Retrieval Accuracy Main Content Extraction HTML Web Improving Retrieval Accuracy in Main Content Extraction from HTML Web Documents Von der Fakult¨atf¨ur Mathematik und Informatik der Universit¨atLeipzig angenommene DISSERTATION zur Erlangung des akademischen Grades DOCTOR rerum naturalium (Dr. rer. nat.) im Fachgebiet Informatik vorgelegt von Master Applied Math. Hadi Mohammadzadeh geboren am 12.10.1969 in Mashhad, Iran Die Annahme der Dissertation wurde empfohlen von: 1. Professor Dr. Jinan Fiaidhi (Ontario, Kanada) 2. Professor Dr. Gerhard Heyer (Leipzig) Die Verleihung des akademischen Grades erfolgt mit Bestehen der Verteidigung am 27.11.2013 mit dem Gesamtpr¨adikat magna cum laude . Amtierender Dekan: Professor Dr. Hans-Bert Rademacher Gutachter: Professor Dr. Jinan Fiaidhi Professor Dr. Gerhard Heyer Tag der Promotion: 27. November 2013 I would like to dedicate this thesis to my loving parents ii Abstract The rapid growth of text based information on the World Wide Web and various applications making use of this data motivates the need for efficient and effective methods to identify and separate the \main content" from the additional con- tent items, such as navigation menus, advertisements, design elements or legal disclaimers. Firstly, in this thesis, we study, develop, and evaluate R2L, DANA, DANAg, and AdDANAg, a family of novel algorithms for extracting the main content of web documents. The main concept behind R2L, which also provided the initial idea and motivation for the other three algorithms, is to use well particularities of Right-to-Left languages for obtaining the main content of web pages. As the English character set and the Right-to-Left character set are encoded in different intervals of the Unicode character set, we can efficiently distinguish the Right-to- Left characters from the English ones in an HTML file. This enables the R2L approach to recognize areas of the HTML file with a high density of Right-to- Left characters and a low density of characters from the English character set. Having recognized these areas, R2L can successfully separate only the Right-to- Left characters. The first extension of the R2L, DANA, improves effectiveness of the baseline algorithm by employing an HTML parser in a post processing phase of R2L for extracting the main content from areas with a high density of Right-to-Left characters. DANAg is the second extension of the R2L and generalizes the idea of R2L to render it language independent. AdDANAg, the third extension of R2L, integrates a new preprocessing step to normalize the hyperlink tags. The presented approaches are analyzed under the aspects of efficiency and effectiveness. We compare them to several established main content extraction algorithms and show that we extend the state-of-the-art in terms of both, efficiency and effectiveness. Secondly, automatically extracting the headline of web articles has many ap- plications. We develop and evaluate a content-based and language-independent approach, TitleFinder, for unsupervised extraction of the headline of web articles. The proposed method achieves high performance in terms of effectiveness and ef- ficiency and outperforms approaches operating on structural and visual features. iv Zusammenfassung Das rasante Wachstum von textbasierten Informationen im World Wide Web und die Vielfalt der Anwendungen, die diese Daten nutzen, macht es notwendig, effiziente und effektive Methoden zu entwickeln, die den Hauptinhalt identifizieren und von den zus¨atzlichen Inhaltsobjekten wie z.B. Navigations-Men¨us,Anzeigen, Design-Elementen oder Haftungsausschl¨ussentrennen. Zun¨achst untersuchen, entwickeln und evaluieren wir in dieser Arbeit R2L, DANA, DANAg und AdDANAg, eine Familie von neuartigen Algorithmen zum Extrahieren des Inhalts von Web-Dokumenten. Das grundlegende Konzept hin- ter R2L, das auch zur Entwicklung der drei weiteren Algorithmen f¨uhrte,nutzt die Besonderheiten der Rechts-nach-links-Sprachen aus, um den Hauptinhalt von Webseiten zu extrahieren. Da der lateinische Zeichensatz und die Rechts-nach-links-Zeichens¨atzedurch ver- schiedene Abschnitte des Unicode-Zeichensatzes kodiert werden, lassen sich die Rechts-nach-links-Zeichen leicht von den lateinischen Zeichen in einer HTML- Datei unterscheiden. Das erlaubt dem R2L-Ansatz, Bereiche mit einer hohen Dichte von Rechts-nach-links-Zeichen und wenigen lateinischen Zeichen aus einer HTML-Datei zu erkennen. Aus diesen Bereichen kann dann R2L die Rechts-nach- links-Zeichen extrahieren. Die erste Erweiterung, DANA, verbessert die Wirk- samkeit des Baseline-Algorithmus durch die Verwendung eines HTML-Parsers in der Nachbearbeitungsphase des R2L-Algorithmus, um den Inhalt aus Bereichen mit einer hohen Dichte von Rechts-nach-links-Zeichen zu extrahieren. DANAg erweitert den Ansatz des R2L-Algorithmus, so dass eine Sprachunabh¨angigkeit erreicht wird. Die dritte Erweiterung, AdDANAg, integriert eine neue Vorver- arbeitungsschritte, um u.a. die Weblinks zu normalisieren. Die vorgestellten Ans¨atzewerden in Bezug auf Effizienz und Effektivit¨atanalysiert. Im Vergleich mit mehreren etablierten Hauptinhalt-Extraktions-Algorithmen zeigen wir, dass sie in diesen Punkten ¨uberlegen sind. Dar¨uber hinaus findet die Extraktion der Uberschriften¨ aus Web-Artikeln vielf¨altigeAnwendungen. Hierzu entwickeln wir mit TitleFinder einen sich nur auf den Textinhalt beziehenden und sprachabh¨angigenAnsatz. Das vorgestellte Verfahren ist in Bezug auf Effektivit¨atund Effizienz besser als bekannte Ans¨atze, die auf strukturellen und visuellen Eigenschaften der HTML-Datei beruhen. vi Acknowledgements Like all research, the work described in this dissertation was not conducted in isolation. It is impossible to thank everyone who has in some way contributed to it and equally impossible to thank individuals for all of their contributions but I would like to thank those people that gave me the opportunity to work and enjoy during my PhD journey. I am indebted for technical insights, encouragement and friendship to many members of the Institute of Applied Information Processing at the University of Ulm, as well as those I had the privilege of working with in the Department of Natural Language Processing at the University of Leibzig. This thesis would not have been possible without the support and the insights of Prof. Dr. Franz Schweiggert. I'm pleased for the given chance to reach this point of my career. I am most grateful to Prof. Dr. Gholamreza Nakhaeizadeh. It has been a big pleasure working with him. I have to thank him for all the support, the suggestions, the patience, and every kind of help that he gave to me, and for the person I became at this point. And of course, for the time spent out of work and his continues suggestions. Grateful acknowledgment is made to my supervisor Prof. Dr. Gerhard Heyer for all his help, guidance, and support. I sincerely would like to thank Prof. Dr Jinan Fiaidhi for being the reviewer of this thesis. She gave me useful comments. Thanks also go to my friends and colleagues at the University of Ulm, in particular Dr. Andreas Borchert, Armin Rutzen, Wolfgang Kaifler, Rene Just, Steffen Kram and Michaela Weiss. I would like to show my gratitude to Dr. Thomas Gottron for his advice, collaboration, and the opportunity to work and publish some papers together. I want to thank one of my best friends Dr. Norbert Heidenbluth for helping me to prepare figures and diagrams. A special thank is dedicated to Mrs Roswitha Jelinek and Mrs Gisela Richter for them support and efforts. At the end, I appreciate Dr. Ljubow Rutzen- Loesevitz for proof-reading this thesis. viii Publications The algorithms, for main content extraction form HTML web documnets, and experimental results introduced in this thesis have been published in six conference and workshop proceedings in addition to one book chapter, as follows: • Hadi Mohammadzadeh, Franz Schweiggert, and Gholamreza Nakhaeizadeh. Using UTF-8 to Extract Main Content of Right to Left Language Web Pages. In: ICSOFT 2011 - Proceedings of the 6th International Conference on Soft- ware and Data Technologies, Seville, Spain, 2011, volume 1, pp. 243 { 249, SciTePress. • Hadi Mohammadzadeh, Thomas Gottron, Franz Schweiggert, and Gholam- reza Nakhaeizadeh. A Fast and Accurate Approach for Main Content Ex- traction based on Character Encoding. In: TIR'11: Proceedings of the 8th International Workshop on Text-based Information Retrieval(DEXA'11), Toulouse, France, 2011, pp. 167 { 171, IEEE Computer Society. • Hadi Mohammadzadeh, Thomas Gottron, Franz Schweiggert, and Gholam- reza Nakhaeizadeh. Extracting the Main Content of Web Documents Based on a Naive Smoothing Method. In: KDIR'11: International Conference on Knowledge Discovery and Information Retrieval, Paris, France, 2011, pp. 470 { 475, SciTePress. • Hadi Mohammadzadeh, Thomas Gottron, Franz Schweiggert, and Gholam- reza Nakhaeizadeh. Extracting the Main Content of Web Documents based on Character Encoding and a Naive Smoothing Method. In: Software and Data Technologies, CCIS Series, Springer, 2013, pp. 217 { 236, Springer- Verlag Berlin Heidelberg. • Hadi Mohammadzadeh, Thomas Gottron, Franz Schweiggert, and Gholam- reza Nakhaeizadeh. The impact of source code normalization on main con- tent extraction. In: WEBIST'12: 8th International Conference on Web In- formation Systems and Technologies, Porto, Portugal, 2011, pp. 677 { 682, SciTePress. • Hadi Mohammadzadeh, Omid Paknia, Franz Schweiggert, and Thomas Got- tron. Revealing trends based on defined queries in biological publications using cosine similarity. In: BIOKDD'12: 23th International Workshop on
Recommended publications
  • Your Guide to Youview+ from BT
    Keep this guide somewhere safe You might need it from time to time. All Rights Reserved. Your guide to YouView+ from BT Quick start guide inside The Hunger Games: Catching Fire © 2013, Artwork Fire The Hunger Games: Catching & Supplementary Entertainment TM & © 2014 Lions Gate Materials Inc. The Hunger Games: Catching Fire Available now on BT Box Office Top 10 tips Welcome to YouView from BT Once programmed, you can use your YouView remote In Search, you’ll only see suggestions until you press . You’ll soon be able to sit back and enjoy the shows you love. to control your TV. See page 45. If you can’t see what you’re looking for, press to see everything that matches your search. But first things first. To get set up, just follow the few simple steps starting Try these shortcut buttons on your remote: over the page. It’s easy and shouldn’t take more than half an hour. With the YouView mobile app, you can see what’s on Find any programme available on YouView. and set recordings on the move. Then, you can learn all about YouView and how it’ll help you take control Takes you back to where you were or of your TV in ‘Using YouView’ starting on page 21. back a level in the menus. Go to youview.com/mobileapp to find out more. Takes you back to live TV or out of a High definition Freeview channels are separate from Need some help? No problem – give us a call on 0800 111 4567, go to player menu.
    [Show full text]
  • In This Edition of ENG 'NF
    No.8 - Spring 1982 Please circula te The Quarterly for BBC Engineering StaH In this edition of ENG 'NF ENHANCED CEEFAX ON AIR Page I EDITORIAL Page 2 TRANSMITIERS OPENING AND CLOSING Page 2 ELECTRONIC STILLS STORE Page 3 TV WAVEFORM ANALYSER Page 4 NEWSTUDIOS FOR BH Page 4 AUTOMATIC TESTING The BBC has made the first public Blue (RGB) samples, occupying some Page 5 broadcast of high quality still pictures 1.2 M bytes of storage space. This by means of the UK Telctext system. represents an active picture size of 702 EXTENDED PAL The broadcast high-quality pictures and by 576 pixels, or picture elements, to Page 5, 6, 7 . other enhancements were demonstrated fIll the television screen with the high- to a technical committee of the quality picture. For the experimental DIGITAL TRANSCODER European Broadcasting Union (EBU) on transmission, a montage of the startling Page 8 February lIth, and at a meeting of the pictures of Saturn taken from the Institution of Electrical Engineers (lEE) Voyager 2 spacecraft was used to NCl RE-ENTERS SERVICE on March 8th. The UK teletext system demonstrate the capability of the Page 9 has, for many years now, represented an system. The picture was fed from a efficient and rugged way of trans- conventional 35 mm slide scanner into 4000 VPR 2'S mitting, receiving and decoding data for the digital picture store. After sampling Page 10 display on a television receiver. None of at 13.5 MHz it was fed to a the efficiency or ruggedness is lost in microcomputer and data generator that RADIO MAINTENANCE the transmission of the enhancements.
    [Show full text]
  • Proceedings of the World Summit on Television for Children. Final Report.(2Nd, London, England, March 9-13, 1998)
    DOCUMENT RESUME ED 433 083 PS 027 309 AUTHOR Clarke, Genevieve, Ed. TITLE Proceedings of the World Summit on Television for Children. Final Report.(2nd, London, England, March 9-13, 1998). INSTITUTION Children's Film and Television Foundation, Herts (England). PUB DATE 1998-00-00 NOTE 127p. AVAILABLE FROM Children's Film and Television Foundation, Elstree Studios, Borehamwood, Herts WD6 1JG, United Kingdom; Tel: 44(0)181-953-0844; e-mail: [email protected] PUB TYPE Collected Works - Proceedings (021) EDRS PRICE MF01/PC06 Plus Postage. DESCRIPTORS Children; *Childrens Television; Computer Uses in Education; Foreign Countries; Mass Media Role; *Mass Media Use; *Programming (Broadcast); *Television; *Television Viewing ABSTRACT This report summarizes the presentations and events of the Second World Summit on Television for Children, to which over 180 speakers from 50 countries contributed, with additional delegates speaking in conference sessions and social events. The report includes the following sections:(1) production, including presentations on the child audience, family programs, the preschool audience, children's television role in human rights education, teen programs, and television by kids;(2) politics, including sessions on the v-chip in the United States, the political context for children's television, news, schools television, the use of research, boundaries of children's television, and minority-language television; (3) finance, focusing on children's television as a business;(4) new media, including presentations on computers, interactivity, the Internet, globalization, and multimedia bedrooms; and (5) the future, focusing on anticipation of events by the time of the next World Summit in 2001 and summarizing impressions from the current summit.
    [Show full text]
  • BBC 1 - Bright New World
    ---- -- ,,- -.-.-- - .- , - -,-.- -~ .... - --- , -- ., ., No.20- Spring1985 The Quarterly for BBC Engineering Staff Please Circulate BBC 1 - Bright New World Recently many viewers will have Descrip tion by a spotlight above tlte viewer. noticed a new look to BBC 1. For The symbol generated by the many years the channel logo has equipment is a rotating image of Design been the familiar 'rotating world', the world with a caption displayed Graphic Design at Television generated from a mechanical beneath. The image is larger than Centre undertook the artistic model. Originally this was pro- its predecessor, and there is no design of the symbol. Clearly this duced by a remotely controlled reflecting mirror, but the detail had to be done in very close con- camera, affectionately known as and accuracy are much greater. junction with the engineers in Nod dy, which was also used for the old mechanical clock and fault captions. Most of Noddy's functions have been replaced by electronic generators, but the World was still being produced by a caption scanner, followed by a colour synthesiser and PAL coder. This required regular maintenance, and alignment of the video pro- cessing to produce consistent resul ts. On February 18th a new Symbol was introduced. The generating equipment is all electronic, using the latest digital techniq ues. Some digitally generated pictures suffer from an effect known as 'aliassing'; this is most noticable on sloping lines and circles as small steps, which show the graphic being made up of discrete lines. The new equipment is fully 'anti-aliassed' on both the logo and the captions, which sub- stantially improves the overall The caption is customised for each Designs Department.
    [Show full text]
  • Lessons from Discarded Computer Architectures
    Lessons from Discarded Computer Architectures Andrew E. Fluck University of Tasmania Locked Bag 1307, Launceston, Tasmania, 7250, Australia [email protected] Abstract. The BBC microcomputer was one of several nationally produced which were superseded by the International Business Machines (IBM) Personal Computer (PC). This reflected the results of both an international market com- petition and rivalry between different US processor manufacturers. Along with the hardware, valuable software and supporting educational ideologies were discarded. As we make choices about technological innovation, to what degree are we selecting potential efficacy or responding to marketing hype? Keywords: BBC microcomputer, IBM Personal Computer, Apple II, computer hardware, operating systems. 1 Introduction The BBC microcomputer was an 8-bit machine based on the Motorola 6502 proces- sor. It made a huge impact in British schools, putting predecessors into the shade of its colour graphics. Its successor, the Archimedes was almost as successful, but ran into the juggernaut of the IBM PC – and the rest is history. Almost. 2 The BBC Microcomputer - A Withered Branch In 1979-80 the British Broadcasting Company (BBC) started the BBC Computer Lit- eracy Project and put out a tender for a microcomputer to accompany the television series The Computer Programme. Acorn was a firm started by two former Sinclair employees, marketing director Chris Curry and researcher Hermann Hauser. Their firm won the tender in April 1981 and released the BBC Microcomputer later that year. The large keyboard unit connected to a conventional television, which became the screen for the computer. Based on the 8-bit Motorola 6502 processor, the initial model had 16k bytes of RAM, and cost GBP 299.
    [Show full text]
  • The Case of the BBC and ETV Muluken Asegidew a Thesis
    Media Coverage of China-Africa Relations: the Case of the BBC and ETV Muluken Asegidew A Thesis Submitted to the Graduate School of Journalism and Communication Presented in Partial Fulfillment of the Requirements for the Degree of Master of Arts in Journalism and Communication Addis Ababa University Addis Ababa, Ethiopia June, 2013 Media Coverage of China- Africa Relations: the Case of the BBC and ETV Muluken Asegidew Advisor: Abdissa Zerai (PhD.) Signed by the Examining Committee: ____________________________ ___________ ______________ Advisor Signature Date ____________________________ ____________ ______________ Examiner Signature Date ____________________________ _____________ ______________ Examiner Signature Date _____________________________ _____________ _______________ Chair of Department or Graduate Program Coordinator Signature Date Media Coverage of China- Africa Relations: the Case of the BBC and ETV Muluken Asegidew Addis Ababa University, 2013 ABSTRACT The main aim of this study was examining the media coverage of China- Africa relations in light of Framing and Orientalism theories. The study focused on the 21st century China- Africa relations, when China has become one of the world’s powerful nations and Africa has started building a new name. It also pointed out emphasized and deemphasized issues and commonly utilized media sources. To this end, a qualitative method, (particularly) textual analytic method was used. The BBC and ETV were selected purposefully and their online version of the news stories published from Jan. 2011 to Feb. 30, 2013, were collected from the websites, www.com/news and www.erta.gov.et, respectively. The analysis shows that the BBC and ETV online news stories used South-South co-operation frame, non-interference frame, and mutual benefit frame in articulating China-Africa relations.
    [Show full text]
  • BT TV Z4 User Guide V4
    TV user guide Hello Use this guide to see what’s what with your new BT TV. Here’s what’s in this guide Need some help? Grab your remote and get going 2 Where to start 12 The main menu 4 Help with set-up 13 Watching live TV 6 Help with Freeview 13 Pausing live TV 8 Help with extra channels 14 Searching for programmes 9 How to control your TV using your BT TV remote 15 On Demand 10 Get to know your TV Box 16 BT TV app 11 Parental Controls and PINs 17 Accessibility tools 18 Hello 1 Grab your remote and get going Meet your new best friend – your BT TV remote control. Use it to wander around, see the sights, pick what you’d like to watch. Switch your box on or off To find out how to also control your TV using your remote, have a look at page 15. Want to control your TV using your BT TV remote? Sometimes it’s hard enough keeping track of one remote, let alone two. Why not just use one? To work your TV using your BT TV remote, see page 15. 2 Grab your remote and get going Explore the BT Player You’ll find a huge library of on-demand entertainment, sport, films, music and kids’ shows. Open the main menu It’s the home of BT TV and where you go to get on-demand programmes, players, apps and settings. See page 4 See more about a programme For when you want more information about the show.
    [Show full text]
  • The Discourse of Broadcast News
    The Discourse of Broadcast News In this timely and important study, Martin Montgomery unpicks the inside workings of what must still be considered the dominant news medium: broadcast news. Drawing principally on linguistics, but multidisciplinary in its scope, The Discourse of Broadcast News demonstrates that news pro- grammes are as much about showing as telling, as much about ordinary bystanders as about experts, and as much about personal testimony as calling politicians to account. Using close analysis of the discourse of television and radio news, the book reveals how important conventions for presenting news are changing, with significant consequences for the ways audiences understand its truthfulness. Fully illustrated with examples and including a detailed examination of the high-profile case of ex-BBC journalist Andrew Gilligan, The Discourse of Broadcast News provides a comprehensive study that will challenge our current assumptions about the news. The Discourse of Broadcast News will be a key resource for anyone researching the news, whether they be students of language and linguistics, media studies, or communication studies. Martin Montgomery is Director of the Scottish Centre for Journalism Studies and Reader in Literary Linguistics at the University of Strathclyde. The Discourse of Broadcast News A linguistic approach Martin Montgomery First published 2007 by Routledge 2 Park Square, Milton Park, Abingdon, Oxon OX14 4RN Simultaneously published in the USA and Canada by Routledge 270 Madison Ave, New York, NY10016 This edition published in the Taylor & Francis e-Library, 2007. “To purchase your own copy of this or any of Taylor & Francis or Routledge’s collection of thousands of eBooks please go to www.eBookstore.tandf.co.uk.” Routledge is an imprint of the Taylor & Francis Group, an informa business © 2007 Martin Montgomery All rights reserved.
    [Show full text]
  • BBC Television Service/BBC One[Edit] Pre-1969[Edit] Main Article: BBC One Pre-1969 Idents
    BBC Television Service/BBC One[edit] Pre-1969[edit] Main article: BBC One Pre-1969 Idents The first ident, "Television Symbol", "Angel's Wings, or the "Bat's Wings" (1953–1960) The first incarnation of the globe, introduced in 1963 (1963–1964) The original BBC Television Service was launched on 2 November 1936 and was taken off the air at the outbreak of war in September 1939, returning in June 1945. As the only public television service in Britain, and initially in the world, there was no need for a station ident in the early days. However, with the imminent arrival of commercial television in Britain, 2 December Commented [B1]: This is important because it gives the full date of the arrival of the first ever TV ident. 1953 saw the arrival of the first ident, nicknamed the "Bat's Wings". This was an elaborate mechanical contraption constructed by Abram Games, which featured a tiny spinning globe in the centre, surrounded by two spinning "eyes", with lightning flashes to either side. Unlike later idents, this was filmed, rather than live. The model was temperamental, and broke down shortly Commented [B2]: Shows how the filming of idents have changed over time. after it was filmed.[1] By the early 1960s the "Bat's Wings" had been superseded by the "BBC tv" logo within a circle, beneath which would appear a map of Britain split into the BBC's broadcast regions. The channel's most famous emblem, the globe, appeared in its first guise on 30 September 1963. The first such ident featured the continuity announcer speaking over a rotating globe while a "BBC tv" caption would appear with the announcement, "This is BBC Television" being made.
    [Show full text]
  • Youview Response to the BBC Trust November 2013
    YouView Response to the BBC Trust November 2013 Executive summary YouView welcomes the opportunity to respond to the BBC Trust review of YouView. YouView exists to enable the evolution of free to air television in the UK. It is an open platform for content providers, and consumers can connect a YouView device via any UK Internet service provider. Devices are available from retailers for a one off cost or from ISPs as part of a phone, broadband and TV package. YouView is different from most other TV platforms because it does not own or retail content, produce devices or enter into contracts with the end consumer. Instead, it enables these relationships on behalf of its partners, which include content owners, ISPs, device manufacturers and retailers. YouView is currently the UK’s fastest growing TV platform, largely as result of the range of content it offers, the fact that it is free at the point of use (unless a consumer decides to pay for additional services), its ease of use and set up and the fact that the two ISP shareholders are subsidising the cost of equipment. Retail sales, though currently small, are an important element of the YouView story and already less than 18 months after launch boxes are readily available for £229 and in some cases less. YouView intends to develop its retail presence further by encouraging manufacturers to bring a range of devices to the market. We believe that YouView has had a major beneficial impact on the UK digital TV market providing the core free to air DTT audience with an easy upgrade path from Freeview.
    [Show full text]
  • Success Story Extraordinary TV for Everyone: How Youview Used IP
    Success Story Extraordinary TV for Everyone: How YouView Used IP Intelligence to Enable ISP Set-top Box Features YouView, a joint venture between BBC, ITV, Channel 4, Five, BT, Talk Talk and Arqiva, is the UK’s on demand service delivering over 70 live free-to-air digital TV and radio channels as well as seven day catch up from the UK’s public service broadcast players. A range of quality content is also available via on demand players from Netflix, UKTV Play, NOW TV, Sky Store, Quest OD from Discovery, milkshake!, S4C and STV for viewers in Scotland. YouView has record, pause and rewind TV functionality as well as fully integrated search and discover features. A key challenge for YouView in setting up YouView required a highly accurate YouView set-top boxes are offered as its pay content distribution was identifying solution that is regularly and automatically part of broadband subscription bundles set-top boxes being installed for the first updated as the device base grows, ISPs from TalkTalk, BT and Plusnet. It is also time on BT, TalkTalk or Plusnet ISP lines; IP ranges change, and as homes migrate available to buy subscription-free from serving the relevant on-screen branding, between ISPs. all major retailers and many independent packaged services and features provided electrical stores including John Lewis, by those ISPs; and – on a daily basis – As Sunit Kotchea, YouView’s Operations Currys, Argos, Tesco, Amazon and ensuring each set-top box continued to Director, explains, “From the moment a Richer Sounds. accurately surface relevant ISP features.
    [Show full text]
  • Cmf-Yearbook-2014-Pdf.Pdf
    1 The Children’s Media Yearbook is a publication of The Children’s Media Foundation Director, Greg Childs Administrator, Jacqui Wells The Children’s Media Foundation P.O. Box 56614 London W13 0XS [email protected] First published 2014 © Lynn Whitaker for editorial material and selection © Individual authors and contributors for their contributions All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, without the prior permission in writing of The Children’s Media Foundation, or as expressly permitted by law, or under terms agreed with the appropriate reprographics rights organisation. You must not circulate this book in any other binding or cover. ISBN 978-0-9575518-2-4 Paperback ISBN 978-0-9575518-3-1 E-book Book design by Craig Taylor Cover illustration by Stuart Harrison Previous page illustration by CBeebies/Beakus The Children’s Media Yearbook 2014 Edited by Lynn Whitaker THE CHILDREN’S MEDIA FOUNDATION INTRODUCTION 1. Editorial Lynn Whitaker 8 2. The Children’s Media Foundation: Year Two Anna Home 12 3. A Founder Patron’s View Brian Jameson 15 INDUSTRY NEWS AND VIEWS 4 BBC Children’s Joe Godwin, Kay Benbow and Cheryl Taylor 20 5 Disney Channels UK Louise Bucknole 27 6 In at the Deep End: Multi-tasking at Beakus Animation Studio Steve Smith 31 7 Writing for Kids Danny Stack 36 8 LIMA UK: The Good News About Licensing Kelvyn Gardner 39 9 The British Toy and Hobby Association: Natasha Crookes 42 10 Gender Skewing in Children’s
    [Show full text]