Automatisierte Layoutformatierung Für Printmedien Mit CSS (PDF)

Ruprecht-Karls-Universität Neuphilologische Fakultät Institut für Computerlinguistik Bachelorarbeit 23.03.2017 Automatisierte Layoutformatierung für Printmedien mit CSS „CSS paged media module level 3“ in der Anwendung im Vgl. zu XSL-FO Erstbetreuer: Prof. Dr. Andreas Witt Zweitbetreuer: Manuel Montero Pineda Zweitgutachterin: Prof. Dr. Katja Markert Svenja Lohse (7. Semester) HF: Computerlinguistik (75%) Römerstr. 131 NF: Bildungswissenschaft (25%) 69126 Heidelberg Tel: 0162 8620613 Email: [email protected] Abstract Books and other printed media are still a favorite way to save and exchange knowledge and thoughts. These contents have to get enriched with meta data to be logically, usefully and clearly stored and representated. Through syntactical and semantical connections linguistic meta data enables specific analysis, extraction and processing amounts of data. Also the layout of texts, informations etc. works with these meta data and is important for understanding the content. Cross media publishing is interested in designing layouts for different output media but with the same or few programming languages. This thesis explores empirically the work ability of CSS level 3 (CSS3), especially CSS paged media module level 3, for creation of print layouts in practice in contrast to XSL-FO. How is it working and what is CSS3 missing to get a more professional result? Data base is MARC-XML from the „Deutsche Nationalbibliothek” datashop, which is processed by XSLT to a HTML5 document. On this CSS3 properties, grouped by typographical concepts, are tried out rendered by Antenna House Formatter. The subject combination of computer science (XML technologies), publishing (layout, cross media publishing) and linguistics (enrichment of content with markup languages) makes further research and developement of automatical CSS layout processing for printed media interesting. The study comes to the conclusion that CSS3 has a lot of things to improve before managing professional print layouts like XSL-FO does. But simple documents and simple, mean formatting requests are executed easily and fastly. The person developing and controlling CSS stylesheets doesn’t have to be a professional layout designer to get decent output. More browser compatibility would be helpful so more user could test scenarios (like „digital first” process, from website to print) and give feedback. This could be used for evaluation and advancement of CSS3. In conclusion, CSS3 is improvement-needy but also upgradeable and suitable for future of professional print layout creation. 2 Inhaltsverzeichnis Abkürzungsverzeichnis ................................................................................................................. 5 Typografische Konventionen ........................................................................................................ 7 1 Einleitung .............................................................................................................................. 8 1.1 Ausgangssituation und Herausforderung ...................................................................... 8 1.2 Ziel und Vorgehen ....................................................................................................... 10 1.3 Motivation für die Computerlinguistik ........................................................................ 10 2 Daten und Vorprozess ......................................................................................................... 11 2.1 Metadaten in XML ...................................................................................................... 11 2.2 Ausgangsformat MARC-XML und andere linguistische Metadatenformate .............. 12 2.2.1 MARC21 und MARC-XML .................................................................................. 12 2.2.2 Dublin Core ......................................................................................................... 14 2.2.3 IMDI .................................................................................................................... 15 2.2.4 TEI ....................................................................................................................... 15 2.2.5 CMDI ................................................................................................................... 16 2.3 HTML5 als Zielformat ................................................................................................ 17 2.4 Überführung/Transformation der Ausgangsdaten mit XSLT ...................................... 18 3 Überblick XSL-FO und CSS ............................................................................................... 19 3.1 XSL-FO ....................................................................................................................... 19 3.2 CSS(3) ......................................................................................................................... 20 3.3 Vergleichbarkeit .......................................................................................................... 22 3.4 Formatiererwahl .......................................................................................................... 23 4 Umsetzung typografischer Konzepte .................................................................................. 24 4.1 Grundstruktur .............................................................................................................. 25 4.1.1 XSL-FO ................................................................................................................ 25 4.1.2 CSS [CSS3PAGE] ............................................................................................... 26 4.1.3 Fazit Abschnitt ..................................................................................................... 28 4.2 Seitenformatierung ...................................................................................................... 28 4.2.1 Mehrspaltensatz ................................................................................................... 28 4.2.2 Umbrucheinstellungen ......................................................................................... 30 4.2.3 Seitenvorlagen ..................................................................................................... 35 4.2.4 Selektoren ............................................................................................................ 38 4.3 Inhaltstext .................................................................................................................... 39 4.3.1 Absatz- und Zeilenformatierung – Blöcke, Inline und Inline-Blöcke ................. 40 4.3.2 Wort- und Zeichenformatierung .......................................................................... 45 3 4.4 Weitere Inhaltselemente .............................................................................................. 51 4.4.1 Listen ................................................................................................................... 51 4.4.2 Tabellen ............................................................................................................... 54 4.4.3 Abbildungen ........................................................................................................ 57 4.4.4 Zwischenfazit ...................................................................................................... 61 4.5 Automatisch generierte Elemente................................................................................ 62 4.5.1 Fußnoten .............................................................................................................. 62 4.5.2 Marginalien ......................................................................................................... 65 4.5.3 Verweise .............................................................................................................. 74 4.5.4 Exkurs: Lesezeichen ............................................................................................ 82 4.5.5 Vorbereitung Druck ............................................................................................. 83 5 Lösungsmöglichkeiten und Ausblicke ................................................................................ 85 5.1 Vorteile und Schranken von CSS ................................................................................ 85 5.2 Bestehende Erweiterungsmöglichkeiten ..................................................................... 86 5.3 Ausblick ...................................................................................................................... 86 6 Schlussfolgerung/Bilanz ...................................................................................................... 87 7 Quellen ................................................................................................................................ 89 7.1 Printwerke ................................................................................................................... 89 7.2 Internetquellen ............................................................................................................. 89 7.3 Abbildungsquellen....................................................................................................... 95 8 Anhang ................................................................................................................................ 96 8.1 Codes ..........................................................................................................................

Automatisierte Layoutformatierung Für Printmedien Mit CSS (PDF)

Publishing Pdfs from DITA

Xmlmind XSL Utility - Online Help

Markup UK 2021 Proceedings

XSL-FO by Dave Pawson Publisher

DITA Open Toolkit 2.0 This Document Describes the DITA Open Toolkit Project—What the Project Is, and How to Use the Site

Xmlmind DITA Converter Manual

Ntent Copyright © 2011 Anthony Self

Oxygen XML Author Eclipse Plugin 17.1

DITA Open Toolkit 1.8.4 This Document Describes the DITA Open Toolkit Project—What the Project Is, and How to Use the Site

XML Prague 2016

Antenna House Regression Testing System User Guide V1.5

Conference Proceedings