Evaluation of the SVM Based Multi-Fonts Kanji Character Recognition Method for Early-Modern Japanese Printed Books

Total Page:16

File Type:pdf, Size:1020Kb

Evaluation of the SVM Based Multi-Fonts Kanji Character Recognition Method for Early-Modern Japanese Printed Books Evaluation of the SVM based Multi-Fonts Kanji Character Recognition Method for Early-Modern Japanese Printed Books Manami Fukuo1, Yurie Enomoto1†, Naoko Yoshii1, Masami Takata1, Tsukasa Kimesawa2, and Kazuki Joe 1, 1 Dept. of Advanced Information & Computer Sciences, Nara Women’s University, Nara, Japan 2 Digital Library Division, National Diet Library, Kyoto, Japan Abstract - The national diet library in Japan provides a web The information including titles and author names of the based digital archive for early-modern printed books by books in the digital library is given as text data while main image. To make better use of the digital archive, the book body is image data. There are no functions for generating text images should be converted to text data. In this paper, we data from image data. Thus full-text search is not supported evaluate the SVM based multi-fonts Kanji character yet. To make early-modern printed and valuable books data recognition method for early-modern Japanese printed books. more accessible, their main body should be given as text data, Using several sets of Kanji characters clipped from different too. As described above, the number of the target books is so publishers’ books, we obtain the recognition rate of more than large that auto conversion is required. 92% for 256 kinds of Kanji characters. It proves our recognition method, which uses the PDC (Peripheral If the conversion targets were general text images, they would Direction Contributivity) feature of given Kanji character have been converted into text data easily with some software images for learning and recognizing with an SVM, is effective of optical character recognition (OCR). However, most of for the recognition of multi-fonts Kanji character for early- early-modern printed books contain old Japanese characters modern Japanese printed books. that are not used now. Moreover, the fonts used in the early- modern printed books vary by publisher and year of Keywords: character recognition; SVM; digital archiving publication. Existing OCR software cannot recognize right characters under these conditions. 1 Introduction The national diet library (NDL) [1] in Japan keeps about To solve these problems, we proposed a multi-fonts Japanese 390,000 books dating from the Meiji and Taisyo era (1868- character recognition method for early-modern printed books 1926). The books cover a broad range including philosophies, [4]. However, we selected just ten kinds of (Japanese) Kanji literatures, histories, technologies, natural sciences, etc. Most characters, which are commonly and frequently used, for the of them are out of print and valuable materials in scholarly. recognition experiment. Thus, the effectivity of the proposed method was not clearly shown. In this paper, we extend the kinds of Kanji characters for several recognition experiments Generally, books in libraries read in hand have too many risks to validate our method. To perform several recognition of aging or wearing, and loss by man-made source to open to experiments, we need enormous character images with various the public. To solve the problem, the NDL started a project font sets from early-modern books by various publishers. called “The Digital Library from the Meiji Era” [2,3]. In the Therefore we need to perform automatic character clipping project, early-modern printed books are recorded on from images. Then, we extract the features of character microfiches page by page. The microfiches are converted into images using a feature extraction method for handwritten digital images and opened at the project Web site. Converting Kanji character recognition because of the wide variety of the books into digital images enabled the contents of the font-sets and hard noises. Finally, the Support Vector valuable books to be opened to the public while the original Machine (SVM), which is one of promising recognition books are preserved in good condition. The NDL provides methods, is used for the recognition of the feature vectors. about 148,000 volumes with 101,000 titles in the digital library from their collection. Users can see the digital images of books whenever or wherever with the Internet connection. The rest of this paper is organized as follows. In section II, we present the overview of our multi-fonts Kanji character recognition method. The evaluation method to validate the effectivity of our method is explained in section III. In section † Currently works for Fujitsu Limited. IV, we describe evaluation experiments and discuss the based method. However, there are 6,349 kinds of Kanji experimental results. characters, which Japanese Standards Association selects as the Japanese character code [6]. In addition, the structure of 2 The Recognition Method kanji characters is hierarchical: There are a lot of similar Kanji structures such as radical indices. In this section, we give the overview of our multi-fonts Kanji character recognition method presented in [4]. In this paper, we extend the kinds of Kanji character up to 256 For the character recognition of early-modern printed books, kinds for several recognition experiments and validate the we think we should not use the typical OCR processes effectivity of our method. In [4], all character images for the because each publisher might use different font sets in those recognition experiment were clipped from early-modern days. The flow of our recognition method is shown below. books by hand. Because of the large number of the kinds of Kanji characters for this experiment, we also implement a) An image containing a character is given as training data. automatic Kanji character clipping for early-modern books. b) Several pre-processes such as binarization, normalization 3 Automatic Kanji Character Clipping and noise reduction are applied to the given image. The flow of automatic Kanji character clipping to extract c) The PDC (Peripheral Direction Contributivity) feature each character image for the recognition experiments is shown [5] is extracted from the pre-processed image, and each below. feature vector for the given image is labeled with the category according to the kind of the character included 1) A page of the book images is given as an object image in the given image. and divided into right and left parts. d) A dictionary is generated by an SVM with the training 2) Several pre-processes such as binarization, noise data extracted in c). The dictionary is used for the reduction and affine transform are applied to each recognition of test data. divided half part of the object. Giving character images at a), several pre-processes are 3) Layout analysis is applied to each part. applied to each image. Any pixel in the images must be black or white when PDC features are extracted from the images. 4) Character strokes are clipped within the layout analysis Thus binarization is applied to the images. In normalization, results. margins are removed from the images so that each character size is equivalent to be scaled. Noise reduction should be also 5) Labeling is performed for each clipped character stroke applied since printing and archiving quality of early-modern so that each label is mapped to a character image. printed books is mostly poor. Without noise reduction, noises of images would be recognized as character strokes to be Several pre-processes are executed on each part image. extracted as unsuitable PDC features. Binalization is applied to the image since the distribution of the vertically projected pixel values is used for extracting PDC feature vectors are calculated with category labels at c), character domains. Noises of the image might be recognized and used for the learning phase of SVM at d). A training data as a false character domain. Then the noise reduction by a for SVM learning is a set of PDC feature vectors and its label median filter is performed. Furthermore, distortion correction while a test data for SVM classification is a set of PDC by an affine transformation is also applied since the distortion feature vectors without label. All character images are by capturing has a great influence to layout analysis. converted to training or test data by processes a) to c). Half of Therefore, layout analysis would be performed more correctly the whole data are randomly selected from each category as with noise reduction and distortion correction. The threshold training data, and the others are used for test data. An SVM is calculated with the distribution of the vertically projected learns separated hyper-planes in the PDC feature vector space pixel values, and layout analysis is executed. The auto- with the training data. The trained SVM can classify test data correlation function by Sondhi [7] is applied to the according to the separated hyper-planes. distribution of the vertically projected pixel values to calculate stroke spacing. Finally labeling is performed to each In our previous research [4], the SVM recognition character stroke, which is clipped based on the stroke spacing. experiments were performed for ten kinds of Kanji characters. Each labeled part of connected black dots is recognized as a Furthermore we compared the experimental result of SVM character domain to clip a character from the original book with a neural network (NN). The experimental result of SVM images. was the recognition rate of 97.8% while the NN was 77.6%. We confirmed that our SVM based method was more suitable In this paper, we use the character images clipped by the for learning and classification of PDC features than the NN automatic character clipping. Figure 1 shows some examples of clipped character images. Each character image is clipped Table 1 The list of nine early-modern works from nine early-modern books with various publication ages and publishers so that we get character images with various Book Title Author Publisher Publication font sets. It means we get nine kinds of PDC feature vector Number Year sets for each Kanji character.
Recommended publications
  • Commercial E-Books of Japanese Language : an Approach to E-Book Collection Development
    Deep Blue Deep Blue https://deepblue.lib.umich.edu/documents Research Collections Library (University of Michigan Library) 2020-03-16 Commercial E-books of Japanese language : an approach to E-book collection development Yokota-Carter, Keiko https://hdl.handle.net/2027.42/166309 Downloaded from Deep Blue, University of Michigan's institutional repository Commercial E-books of Japanese language an approach to E-book collection development March 16, 2020 (canceled) NCC Next Generation Japanese Studies Librarian Workshop Cambridge, MA, USA Keiko Yokota-Carter Japanese Studies Librarian, University of Michigan Graduate Library ● Why E-book format? ● Types of E-book providers In this presentation ● E-book platforms EBSCO Kinokuniya Maruzen ● Comparisons of two platforms The presentation aims to share ● Collection development strategy information and some examples only among librarians. ● Build professional relationship with It does not support any representatives commercial company product. Why E-book format? Increase Accesss, Diversity, Equity, and Inclusion for Japanese Studies E-resources can increase accessibility to Japanese language texts for visually impaired users. Screen Reader reads up the texts of E-books. Support digital scholarship Data science Types of E-book providers 1. Newspaper database including E-journals and E-books ● KIKUZO II bijuaru for libraries – Asahi shinbun database ○ AERA, Shukan Asahi ● Nikkei Telecom 21 All Contents version ○ Magazines published by Nikkei – Nikkei Business, etc ○ E-books published by Nikkei Types of E-book providers 2. Japan Knowledge database – E-books Statistics books E-dictionaries E-journals images, sounds, maps E-book platforms – Japanese language E-books available for libraries outside Japan EBSCO - contact EBSCO representative at your institution ● Japanese language books supplied by ● English translation of Japanese NetLibrary until December, 2017 books; ● 5,100 titles added since January, 2018 Literature ● 10,940 titles available as of Feb.
    [Show full text]
  • The Book of Abstract
    JADH 2018 “Leveraging Open Data” September 9-11, 2018 Hitotsubashi Hall, Tokyo https://conf2018.jadh.org Proceedings of the 8th Conference of Japanese Association for Digital Humanities Co-hosted by: Center for Open Data in the Humanities, Joint Support-Center for Data Science Research, Research Organization of Information and Systems Hosted by: JADH2018 Organizing Committee under the auspices of the Japanese Association for Digital Humanities TEI 2018 “TEI as a Global Language” September 9-13, 2018 Hitotsubashi Hall, Tokyo https://tei2018.dhii.asia Book of Abstracts The 18th Annual TEI Conference Hosted by: Center for Evolving Humanities, Graduate School of and Members’ Meeting Humanities and Sociology, The University of Tokyo Joint Keynote Session JADH and TEI Joint Keynote Session The NIJL Database of Pre-modern Japanese Works .................................................. iv Robert Campbell Amsterdam 4D: Navigating the History of Urban Creativity through Space and Time .......................................................................................................................................... v Julia Noordegraaf Creating Collections of Social Relevance ................................................................... vii Susan Schreibman iii Joint Keynote Session The NIJL Database of Pre-modern Japanese Works Robert Campbell1 Abstract NIJL (the National Insitute of Japanese Literature) is currently engaged in digitizing, tagging and developing new ways to search the uniquely rich heritage of pre-modern (prior to
    [Show full text]
  • System for Adaptive Learning of Japanese Based on Language Data
    Masaryk University Faculty of Informatics System for adaptive learning of Japanese based on language data Bachelor’s Thesis Alexander Macinský Brno, Spring 2020 Masaryk University Faculty of Informatics System for adaptive learning of Japanese based on language data Bachelor’s Thesis Alexander Macinský Brno, Spring 2020 This is where a copy of the official signed thesis assignment and a copy ofthe Statement of an Author is located in the printed version of the document. Declaration Hereby I declare that this paper is my original authorial work, which I have worked out on my own. All sources, references, and literature used or excerpted during elaboration of this work are properly cited and listed in complete reference to the due source. Alexander Macinský Advisor: doc. RNDr. Aleš Horák, Ph.D. i Acknowledgements This way I would like to thank my advisor for directing me in the process of creation of this thesis. Also, my thanks go to all the respon- dents who were willing to take part in the testing process and helped me evaluate the project. ii Abstract Japanese language learners need to deal with a substantial amount of repetitive mental work. One of the solutions is to create a web browser application to simplify the process. The inspiration comes from other systems with similar functionality, a summary of these is provided. The created application tries to take the best from the existing solutions, as well as implement some original ideas. The result is a dictionary viewer, Japanese text reading aiding tool, flashcards editor and a tool for learning with flashcards all in one application.
    [Show full text]
  • Agency for Cultural Affairs/Ebooks Project Subject Works to Be Distributed Online
    Attachment-2 Agency for Cultural Affairs/eBooks Project Subject Works to be Distributed Online Seven works to be distributed online in the first phase of project from Friday, 1 February 2013 (1) “Edo Murasaki Picture Book” written by Tokusoshi Naniwa in 1765 with illustrations by Toyonobu Ishikawa The picture book has a high user access rate among National Diet Library digital archives. The work composed in three volumes depicts the lifestyle and culture of women in the mid-Edo Period, including women’s fashion and the moral values of women during 18th century Japan. Valuable illustrations by Toyonobu Ishikawa are also included. (2) ”Heiji Monogatari” Picture Scroll (first scroll: Sanjoden yakiuchi no maki) copied by Naiki Sumiyoshi in 1798 A precious piece among classical works with a high user access rate among the National Diet Library digital archives. Under the project, the digitalized first picture scroll depicts a war story on details from the Heiji Rebellion (Heiji no ran). The electronic version makes full use of the characteristics of picture scrolls, enabling the user to continuously scroll through the entire piece as if he/she were looking at the actual work. (3) Kazutoshi Ueda’s translation of Grimm’s Fairy Tale “Okami” (werewolf), published in 1889 by Yoshikawa Hanshichi Publishers The first Grimm’s Fairy Tale translated into Japanese in the middle of the Meiji era. The translation is a precious piece showing Japan’s embrace of foreign culture. Picture books in the 19th century have a unique sense of humor, written in a blend of Japanese and Western style as seen in animals wearing kimonos.
    [Show full text]
  • HYBRID W-ZERO3 Maniac
    HYBRID W-ZERO3 Maniac Makoto Ichikawa 12th Edition 0 HYBRID W-ZERO3 とうまくつきあうコツ ■ 電子メールのトラブルの回避 (1) 待受け時はキーロックの励行 (終話/電源)キーはキーロックしていない状態でインターネットの通信 切断の機能も割り当てられているため、誤ってメールの受信を中断してト ラブルの原因となります。このため、待受け時はキーロックを励行します。 (2) メールサーバーの容量は 15MB に設定変更 ウィルコムのプロバイダサービス PRIN のメールサーバーの初期設定の 容量は 1MB のため、メールサーバーの容量を 15MB に設定変更します。 (3) バージョンアップとメールのリトライ回数の設定 10 月 8 日の再販前の HYBRID W-ZERO3 は本体ソフトウェアのアッ プデートをします。そして「E メール自動受信機能」の「失敗したらリト ライする」を 2 回に設定します。 ■ パワーマネージメントの詳細設定 画面が暗くなった直後、画面をタップすれば作業中のウィンドウへ復帰 するようにパワーマネージメントの節電の設定を 2 分にします。 ■ 節電設定 W-CDMA(3G)は OFF を基本とし、また、初期設定が ON となって いる赤外線通信(ビーム)を OFF にします。 ■ 取扱説明書のダウンロード 同梱される取扱説明書以外に本機を使いこなす上で重要な取扱説明書が シャープとウィルコムの Web サイトにあり、必ずダウンロードします。 ■ ダイヤルキーのスライドは(通話)キー 電話の着信時にダイヤルキーをスライドして出すのは(通話)キーをプ ッシュしたのと同じで、そのまま、話し始めることができます。 1 目次 はじめに 7 1. スマートフォン 8 1.1 スマートフォンとは 8 1.2 通信機器の遍歴 10 (1) 携帯電話、PHS 10 (2) Windows CE機 11 (3) W-ZERO3シリーズ 13 2. 電話 19 (1) 電話の設定 19 (2) 電話に関する注意点 19 3. HYBRID W-ZERO3 の設定 21 (1) 取扱説明書のダウンロード 21 (2) microSD™ カード、U-SIM の取付 22 (3) ソフトウェアのアップデート 23 (4) PC やクラウドサービスを用いたデータの同期 24 (5) 文字入力 25 (6) 英文字入力 [ケータイ Shoin の設定] 25 (7) 待ち受け画面 26 (8) カーソルキー(Xcrawl)の設定 27 (9) スタート画面の操作性向上 28 (10) タッチパネルの入力禁止(ロック) 28 (11) [Today]のテーマと着信音の変更 29 (12) 節電設定 30 (13) セキュリティ 32 (14) オプション類の整備 33 2 4. アプリケーション 37 4.1 アプリケーションの種類 37 4.2 PIM 機能 46 (1) Outlook(予定表、連絡先、仕事) 46 (2) 時間調整、アラーム 47 (3) カメラ機能の応用(PDF 化、記号認識、OCR 機能) 48 (4) ノート 51 (5) PIM データのバックアップ 52 4.3 電子メール 53 (1) ライトメール 53 (2) メール(Outlook) 54 (3) Windows Live/Messenger 64 (4) SNS (mixi、Facebook、Twitter) 66 4.4 ブラウジング環境 67 (1) Opera Mobile 10 67 (2) Opera Mobile
    [Show full text]
  • An Attempt at Crowd-Sourced Transcription in Japan Although At
    An Attempt at Crowd-sourced Transcription in Japan Although at least a decade has passed since emergence of the digital environment as an infrastructure, aside from a small handful of exceptional projects (such as the Aozora-Bunko,1 BCCWJ,2 the SAT project,3 and so on) Japanese textual materials have not yet been encoded, having only been digitized to the stage of electronic facsimile. Most digital facsimiles still await further treatment from someone who takes a particular interest in them, but even in this case, the method is often nothing more than a search for rough metadata. OCR is efficient in transcribing recent publications, but it doesn’t work well in the case of typesetting in the out-of-copyright publications, which, mainly being published before World War II, contain many traditional forms of Chinese and Japanese characters, and are composed in old typefaces. In addition, it is generally too difficult to distribute Japanese TEI files. Under this circumstance, it is necessary to transcribe such resources in order to search and analyze Japanese materials as texts, which have been accumulated in the modern era when Japanese publishers started printing using typography. Digital facsimiles of such texts, numbering more than 400,000, are already available on the web site of the National Diet Library (NDL) of Japan. The NDL has already been attempting to transcribe them experimentally using OCR because a partial revision of the copyright law has now allowed the NDL use OCR, even though OCR is prohibited except for private use in Japan. But these attempts have not been sufficiently effective due to the complexity of the Chinese ideographs contained in the type.
    [Show full text]
  • The Current State of E-Books and Digital Libraries of Japan
    https://doi.org/10.48009/4_iis_2015_48-57 Issues in Information Systems Volume 16, Issue IV, pp. 48-57, 2015 THE CURRENT STATE OF E-BOOKS AND DIGITAL LIBRARIES OF JAPAN Hisaki Goto, Kaichi International University, [email protected] Ben Kim, Seattle University, [email protected] Kohji Inagaki, Seattle University, [email protected] ABSTRACT Japan is one of the major countries in terms of the number of books and e-books published in the world. In this paper we will discuss the history of e-books and digital libraries, the advantages/disadvantages of e-books, and the current state of the publishing industry and digital libraries of Japan. Finally, we will discuss how to make e-books and digital libraries more accessible and available in Japan. Keywords: E-Books, E-Libraries, Publishers, Japan INTRODUCTION In the United States, e-books are rapidly replacing paper books and fundamentally changing the culture of reading and accessing information. Japan is one of the top countries in terms of the number of books and e-books published in the world. In Japan, even though electronic journals have been available for scholars and researchers for more than a decade, e-books only recently became widely available to the general public starting in 2012 An electronic book (aka e-book, eBook, e-Book, ebook, digital book, or even e-edition) is a book-length publication in a digital form, consisting of text and images, readable on computers or other electronic devices [6]. This definition also includes a scanned form of a physical book. A delivery of a scanned version without proper copyrights violates copyright laws.
    [Show full text]
  • How Junbungaku Affects the Akutagawa Prize and Japan's
    EDITORS’ INTENTIONS AND AUTHORS’ DESIRES: HOW JUNBUNGAKU AFFECTS THE AKUTAGAWA PRIZE AND JAPAN’S COMMERCIAL LITERARY WORLD by Masumi Abe El-Khoury B.A., The University of British Columbia, 2005 A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF ARTS in THE FACULTY OF GRADUATE STUDIES (Asian Studies) THE UNIVERSITY OF BRITISH COLUMBIA (Vancouver) December 2011 © Masumi Abe El-Khoury, 2011 Abstract In this thesis I explore the current literary culture of Japan by examining the commercialization and politicization of junbungaku, “pure” literature. In particular, I focus on the most prominent award for new authors, the Akutagawa Prize, which is widely acknowledged as authoritative. My intention is to shed some useful light on the role of publishing company editors as the masterminds of the publishing industry. Chapter One provides an overview of issues surrounding junbungaku and taish! bungaku (“mass-oriented literature”). At present, junbungaku is defined in opposition to taish! bungaku, but ambiguities and boundary issues remain. This survey will enable us to identify the situations where the notion of junbungaku is defended as authoritative and how its relationship with the Akutagawa Prize increases its legitimacy. Chapter Two examines the origin and history of junbungaku, and discusses how the notion has changed over time. I also address questions such as what junbungaku is and how it can be defined, and uncover how junbungaku came under question as the Akutagawa Prize became more successful and began to overshadow junbungaku itself. The ultimate purpose of the Prize is to sell books and magazines; this affects not only literature but to some extent Japanese society as a whole.
    [Show full text]
  • A Study on Implementing of Culture, Religion and Time-Awareness to Machine Ethics Algorithms
    DOCTORAL THESIS A Study on Implementing of Culture, Religion and Time-Awareness to Machine Ethics Algorithms by Jagna Nieuważny Advised by: Fumito Masui Michal Ptaszynski KITAMI INSTITUTE OF TECHNOLOGY GRADUATE SCHOOL OF ENGINEERING March 2021 ii Acknowledgements This work would be not be possible without the support that I received from many people. I would like to reserve this page to thank them. I want to thank my supervisors, Professor Fumito Masui and Professor Michal Ptaszynski, for their time, kindness, understanding, guidance and feedback through- out this project. The Kitami Institute of Technology gracefully agreed to exempt me from fees for my doctoral course, making it easier for me to concentrate on research. My patient husband Karol has always been ready to give me encouragement, advice and invaluable support in all things technical and my family kept nagging me to get a PhD until I finally did. I am grateful to each and everyone of you. Jagna Nieuważny March 2021 iii ABSTRACT Recent rapid developments in the field of Artificial Intelligence (AI), especially those regarding the development and implementation of artificial neural networks, surface the questions previously considered to arise much later in the future and were so far only addressed in philosophy, or its lighter derivatives, such as science fiction literature. One of such questions regards the limitations of Artificial Intelligence systems to fully grasp the importance and necessity in human lives of such subjective, spiritual and motivational phenomena as religious experience or a moral outlook. In this dissertation, I present the results of my research devoted to applying Artificial Intelligence to quantitative analysis of factors influencing the ethical outlook of Japanese people.
    [Show full text]
  • 9 Ebook for Everyone!!
    CONTENTS Rediscovering publishing in the age of the network 3 What will change with EPUB3? 9 Ebook for everyone!! Wish for reading of persons with reading disabilities 15 Bricklaying ten thousand public files under the blue sky 17 Original Edition (Japanese) first published on July 7, 2011 English Edition first published on October 20, 2011 Cover Design: Koga Hirano Editorial Design: Marui-Kobunsha Corporation The Terms and Conditions in this brochure may be changed without prior notice. * T-Time, .book, .press, Crochet are registered trademarks of Voyager Japan, Inc. * Other product, company and service names may be trademarks or registered trademarks of their respective owners. Voyager Japan, Inc. 5-41-14 Jingumae, Shibuya-ku, Tokyo, Japan http://www.voyager.co.jp tel. +81-3-5467-7070 fax. +81-3-5467-7080 Rediscovering publishing in the age of the network President, Voyager Japan, Inc. Masaaki HAGINO but it was nevertheless one demanded by those people ❖ Roots I refuse to forget firmly dedicated to bringing new books into the world. Even in those unformed early years, the faith that a Even now I sometimes stop and wonder what led me into flourishing print culture would one day come into being electronic publishing. Certainly I did not start out buoyed by was already undeniably there. any grand dreams, or great confidence. Rather, I had been searching for an escape from conventional media and its We humans value communication by language and ruthless realities when I came across digital technology, desire to manipulate it to the utmost of our skills waiting for me there in the uncharted wilds into which I had precisely because it is the means that comes most easily entered.
    [Show full text]
  • Cărţi Electronice
    Ghidul autorului de cărţi electronice Nicolae Sfetcu Publicat de Nicolae Sfetcu Copyright 2015 Nicolae Sfetcu Prima ediţie PREVIZUALIZARE CARTE Ai scris o carte. Foarte frumos. Dar nu ai scris-o pentru a o ţine pentru tine. Trebuie să o publici. Problema e că editurile percep preţuri foarte mari pentru buzunarele unora dintre autori, şi aceasta în mod obiectiv, datorită costurilor mari de tipărire. Dar ţi-ai permis acest lux, şi acum ai o carte tipărită, cu tine ca autor. Poţi să păstrezi sutele sau miile de volume în biblioteca personală şi să o faci cadou prietenilor la diverse ocazii, sau să încerci să o distribui în diverse librării (care librării, că nu prea mai sunt?!). Ai reuşit să o distribui, cartea e plasată frumos în rafturi. În timp, se aşează praful pe coperţi. După un timp, cartea este retrasă. Mai rămâne să o foloseşti ca decor în casă, sau suport pentru diverse obiecte casnice. Dar, spre norocul tău, există o soluţie alternativă, mult mai ieftină şi mai eficientă în vînzarea cărţii tale: cărţile electronice! Piaţa cărţilor electronice a evoluat rapid în ultimii ani, şi o mulţime de companii au intrat pe această piaţă, ca edituri, librării online, sau pentru promovarea cărţilor electronice. Cărţi O carte este un set de foi scrise, imprimate, ilustrate, sau goale, compuse din cerneală, hârtie, pergament, sau alte materiale, de obicei prinse împreuna pe una din laturi. O singură foaie dintr-o carte se numeşte foaie, şi fiecare parte a unei for se numeşte o pagină. Un set de pagini cu text sau ilustrate, produse în format electronic este cunoscut sub numele de carte electronică.
    [Show full text]
  • E-Books: Developments and Policy Considerations”, OECD Digital Economy Papers, No
    Please cite this paper as: OECD (2012-10-29), “E-books: Developments and Policy Considerations”, OECD Digital Economy Papers, No. 208, OECD Publishing, Paris. http://dx.doi.org/10.1787/5k912zxg5svh-en OECD Digital Economy Papers No. 208 E-books: Developments and Policy Considerations OECD Unclassified DSTI/ICCP/IE(2011)11/FINAL Organisation de Coopération et de Développement Économiques Organisation for Economic Co-operation and Development 29-Oct-2012 ___________________________________________________________________________________________ English - Or. English DIRECTORATE FOR SCIENCE, TECHNOLOGY AND INDUSTRY COMMITTEE FOR INFORMATION, COMPUTER AND COMMUNICATIONS POLICY Unclassified DSTI/ICCP/IE(2011)11/FINAL Working Party on the Information Economy EBOOKS: DEVELOPMENTS AND POLICY CONSIDERATIONS English - Or. English JT03329763 Complete document available on OLIS in its original format This document and any map included herein are without prejudice to the status of or sovereignty over any territory, to the delimitation of international frontiers and boundaries and to the name of any territory, city or area. DSTI/ICCP/IE(2011)11/FINAL FOREWORD This report was presented to the Working Party on the Information Economy (WPIE) in December 2011 and was declassified by the Committee for Information, Computer and Communications Policy (ICCP) in March 2012. The report was prepared by Louisa Gosling (Making Sense Ltd.) with input from Christopher Hemmerlein and Taylor Reynolds of the OECD's Directorate for Science, Technology and Industry. This report is part of the ICCP’s work on digital content. Other studies in the series include online news, public-sector information, film and video, user-created content, mobile content, online computer games, music and scientific publishing. For more information, please visit www.oecd.org/sti/information- economy or contact [email protected].
    [Show full text]