Kubla Khan

Kubla Khan

Partnering to Publish : XML: It’s Not Rocket Science (but ….. The more you know the more you know you don’t know) SSP/ARL Seminar November 2010 Washington DC Philippa J. Benson, Ph.D. Director Strategic & Business Development The Charlesworth Group Who, me? 1984 – 1998 • MA and PH.D related to professional writing and research in literacy in science • Head of Writing Center at Intercultural Communication Center at CMU • Invited “Foreign Expert” in Technical Communication at key university in China (Shangahi Jiao Tong U.) • Faculty at University of Pittsburgh overseeing WAC program, worked with librarians developing instructional materials 1998 - 2009 • Taught writing at Georgetown U., FAES, in mainland China (on publishing) • Started information resources departments and launched publications at CI and TWS, both 501c3 organizations 2009 – Charlesworth, Director of Strategic and Business Development The Charlesworth Group Family firm, founded in 1928. Global Publishing Services • Journal & book typesetting (XML) • Marketing, rights, licensing, liaison (China) • Publishing services, language polishing Offices in United Kingdom United States Mainland China Many thanks to….people who partner to solve problems • Steve Laverick, CWG • Gurvinder Batra, Kiwi Technologies • Dan O’Brien, American Chemical Society • Devorah Bloom, Data Conversion Lab • JATS-Con, NLM Journal Article Tag Suite Conference Why are we here? • Learn about existing partnerships • Better understand the role that you can play in providing publishing services for your organization • Improve the quality of your publications through selective acquisition and editorial development. • Find out how to leverage resources within your institution • Cost-effectively market and promote your published products Why back to basics? Publarian? Liblisher? DTD RELAX NG ….blah, blah, blah, blah… XML allows your content to be created in one workflow, at one cost, for all readers …in print We can format your materials to be read….. … online + …. on the go Ancient Electronic Publishing • PDF • HTML • Proprietary platforms HTML • There is a heading. • The heading is followed by two paragraphs. • The 2nd paragraph has three line breaks. <h1> Kubla Khan </h1> <p> or a Vision in a Dream. A Fragment </p> <p>In Xanadu did Kubla Khan a pleasure dome decree <br> Where Alf the sacred river ran <br> Through caverns measureless to man <br> Down to a sunless sea.</p> XML tags are about content • This data is a poem <poem> • The title of the poem is “Kubla Khan” <poemtitle><firstname cent=“13th id=“p001”>Kubla</ • Kubla is the first name of someone who lived firstname><honourarytitle rid=“p001”> Khan</ honourarytitle></poemtitle> in the 13th century <author><surname>Coleridge</surname></author> • Khan is an honourary title which applies <stanza> directly to Kubla (and not to any other person <line>In<placename type=“mispronounced” location=“North mentioned, such as Coleridge) China” chinese=“Shang-Tu”>Xanadu</ • The surname of the poem’s author is placename>did<firstname cent=“13th”> Kubla</ Coleridge firstname><honourarytitle> Khan</honourarytitle> a pleasure dome decree</line> • The poem has one stanza which consists of <line>Where<river type=“sacred” id=“r001” four lines location=“China”> Alf</river> the sacred river • Xanadu is the mispronounced name of a place ran</line> in North China which the Chinese themselves <line>Through caverns measureless to man</ call “Shang-Tu” line> • Alf is the name of a river in China <line><direction rid=“r001”>Down</direction> to a sunless sea.</line></stanza> • Alf is a sacred river • The direction that Alf ran was “down” XML tags are about content Poem <poem> <poemtitle><firstname cent=“13th” id=“p001”>Kubla</ firstname><honourarytitle “Kubla” = a first name Kubla Khan Alf flows down rid=“p001”> Khan</honourarytitle></ poemtitle> by Coleridge Alf is a sacred <author><surname>Coleridge</surname></ Title = “Kubla Khan” river in China author> In Xanadu did Kubla Khan a pleasure <stanza> Kubla = person in dome decree Xanadu is often th <line>In<placename type=“mispronounced” 13 century Where Alf the sacred river ran mispounounced as location=“North China” Through caverns measureless to man “Shang-Tu chinese=“Shang-Tu”>Xanadu</ Down to a sunless sea. placename>did<firstname “Khan” = Xanadu is in cent=“13th”> Kubla</ honorary title North China firstname><honourarytitle> Khan</ honourarytitle> a pleasure dome decree</line> <line>Where<river type=“sacred” id=“r001” Author surname Xanadu is location=“China”> Alf</river> the Poem has Is Coleridge a the name of sacred river ran</line> one stanza a place <line>Through caverns measureless to man</ • line> <line><direction rid=“r001”>Down</ direction> to a sunless sea.</line></ stanza> One possible workflow Document Word (.doc) file converted is styled (.rtf) Title Author Front matter Symbols References ….and other regularly formatted “Raw” elements Word Document .rA (styled) One possible workflow Document Document converted converted (.rtf) (XML) Styled (.rtf) file Is converted to <poem><poemtitle><firstname cent=“13th” id=“p001”>Kubla</firstname><honourarytitle XML rid=“p001”>Khan</honourarytitle></ poemtitle><author><surname>Coleridge</ surname></author><stanza><line>In<placename type=“mispronounced” location=“North China” chinese=“Shang-Tu”>Xanadu</placename>did <firstname cent=“13th”>Kubla</firstname> <honourarytitle> Khan</honourarytitle> a pleasure dome decree</line><line>Where<river type=“sacred” id=“r001” location=“China”> Alf</ river> the sacred river ran</line><line> Through caverns measureless to man</line> <line> <direction rid=“r001”>Down</direction> to a sunless sea.</line></stanza> One possible workflow Document Document Customized output converted converted Print & web PDF, HTML (.rtf) (XML) <poem><poemtitle><firstname cent=“13th” id=“p001”>Kubla</firstname><honourarytitle rid=“p001”>Khan</honourarytitle></ poemtitle><author><surname>Coleridge</ surname></author><stanza><line>In<placename type=“mispronounced” location=“North China” chinese=“Shang-Tu”>Xanadu</placename>did <firstname cent=“13th”>Kubla</firstname> <honourarytitle> Khan</honourarytitle> a pleasure dome decree</line><line>Where<river type=“sacred” id=“r001” location=“China”> Alf</ river> the sacred river ran</line><line> Through caverns measureless to man</line> <line> <direction rid=“r001”>Down</direction> to a sunless sea.</line></stanza> Another possible workflow Customized output Document Document converted converted from XML: HTML, (.rtf) (XML) ePubs, apps <poem><poemtitle><firstname cent=“13th” id=“p001”>Kubla</firstname><honourarytitle rid=“p001”>Khan</honourarytitle></ poemtitle><author><surname>Coleridge</ surname></author><stanza><line>In<placename type=“mispronounced” location=“North China” chinese=“Shang-Tu”>Xanadu</placename>did <firstname cent=“13th”>Kubla</firstname> <honourarytitle> Khan</honourarytitle> a pleasure dome decree</line><line>Where<river type=“sacred” id=“r001” location=“China”> Alf</ river> the sacred river ran</line><line> Through caverns measureless to man</line> <line> <direction rid=“r001”>Down</direction> to a sunless sea.</line></stanza> XML tags are linked to DTDs XML Acronyms of the • Is governed by industry standards and Day rules (e.g., NLM DTD), for archiving (e.g., NLM = National PMC), and tagging (e.g., metadata) Library • Allows automatic inclusion of standard of Medicine STM electronic features such as DOIs. DTD = Data Tag Definitions • Allows for full or selected automated PMC = Pub Med services including composition, design, Central and/or proofing DOI = Digital • Can be driven by templates or tailored to Object users to have full control of output Identifier BUT….. The National Library of Medicine (NLM) DTD Although the NLM DTD is the gold standard in STM publishing it still has multiple flavors Journal Article Tag Suite (JATS) makes up the NLM DTD Created to enable an archive to capture as many of the Archiving and structural and semantic components of existing printed and Interchange tagged journal material as conveniently as possible, with no Tag Set effort made to model any particular sequence or textual format Journal Optimized for the archives that wish to regularize and Publishing control their content, not to accept the sequence and Tag Set arrangement presented to them by any particular publisher Article Designed for authoring new journal articles, where Authoring regularization and control of content is important Tag Set NCBI Book Written specifically to describe volumes for the NCBI online Tag Set libraries The DTD spectrum Ar<cle Authoring DTD Convert Create Archive and Interchange DTD Journal Publishing DTD Archive intellectual The Journal Publishing set is too restrictive content or Article file?? for some, and not prescriptive enough to be a good Authoring model. ACS: A mental model Tagset Module Module Schema (DTD, XSD, etc.) Tag definition A Tag definition B Tag definition C Module Module Tag definition D Schema (DTD, XSD, etc.) Module Module Module Module Tag definition dependencies ACS: Tags, tags, and more tags As-Is Extended Reduced Customized Built From Informed By Public Superset of Subset of Combo of Substantial Only the design version is public tagset the public Extensions + changes: philosophy of used without is used schema is Reductions renamed tags, public tagset is changes or used altered used modification hierarchies, etc <xyz> <xyz> <xyz> <xyz> <abc> <abc> <a/> <a/> <a/> <a/> <a> <aa/> <b/> <b/> <b/> <b/> <b/> <bb/> </xyz> <c/> </xyz> <c/> </a> <cc/> </xyz>

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    34 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us