
ENG287 The Digital Text Prof. Adam Hammond The limitations of plain text Implicit vs. explicit markup Implicit Markup ↣ The genre and structure of a text is given by typographical cues. Titles, footnotes, quotations, marginal glosses, etc. are marked by type size, use of bold, italics, different typefaces, white space, etc. ↣ Implicit markup is governed by conventions. You need to already be familiar with typographic conventions for indicating genre and structure to be able to make sense of them. ↣ Since these conventions are fluid and ambiguous, they are not suitable for machine processing. Implicit vs. Explicit Markup Explicit Markup ↣ An explicit markup language describes features of texts so that these features can be understood by machines as well as humans. ↣ It facilitates and promotes a structural view of documents. Implicit vs. Explicit Markup GOWER To sing a song that old was sung From ashes ancient Gower is come, Assuming man’s infirmities To glad your ear and please your eyes. The First Four Lines of Shakespeare’s Pericles, Prince of Tyre, Implicit Markup <speech> <speaker>Gower</speaker> <stanza> <line>To sing a song that old was sung</line> <line>From ashes ancient Gower is come,</line> <line>Assuming man’s infirmities</line> <line>To glad your ear and please your eyes.</line> … </stanza> </speech> The First Four Lines of Shakespeare’s Pericles, Prince of Tyre, Explicit Markup <speech> <speaker>Gower</speaker> <stanza> <line>To sing a song that old was sung</line> <line>From ashes ancient Gower is come,</line> <line>Assuming man’s infirmities</line> <line>To glad your ear and please your eyes.</line> … </stanza> </speech> The First Four Lines of Shakespeare’s Pericles, Prince of Tyre, Explicit Markup <speech> <speaker>Gower</speaker> <stanza> <line>To sing a song that old was sung</line> <line>From ashes ancient Gower is come,</line> <line>Assuming man’s infirmities</line> <line>To glad your ear and please your eyes.</line> … </stanza> </speech> The First Four Lines of Shakespeare’s Pericles, Prince of Tyre, Explicit Markup <speech> <speaker>Gower</speaker> <stanza> <line>To sing a song that old was sung</line> <line>From ashes ancient Gower is come,</line> <line>Assuming man’s infirmities</line> <line>To glad your ear and please your eyes.</line> … </stanza> </speech> The First Four Lines of Shakespeare’s Pericles, Prince of Tyre, Explicit Markup character speech speaker’s name verse stanzas individual lines of verse Ordered Hierarchy of Content Objects (OHCO) Present in Explicit Markup <speech> <speaker>Gower</speaker> <stanza> <line>To sing a song that old was sung</line> <line>From ashes ancient Gower is come,</line> <line>Assuming man’s infirmities</line> <line>To glad your ear and please your eyes.</line> … </stanza> </speech> The First Four Lines of Shakespeare’s Pericles, Prince of Tyre, Imaginary XML Markup <sp> <speaker>Gower</speaker> <lg type="stanza"> <l>To sing a song that old was sung</l> <l>From ashes ancient Gower is come,</l> <l>Assuming man’s infirmities</l> <l>To glad your ear and please your eyes.</l> … </lg> </sp> The First Four Lines of Shakespeare’s Pericles, Prince of Tyre, TEI Explicit Markup <sp><speaker>Gower</speaker><lg type="stanza”><l>To sing a song that old was sung</l><l>From ashes ancient Gower is come,</l><l>Assuming man’s infirmities</l><l>To glad your ear and please your eyes.</l>…</lg></sp> This is Harder for Us to Understand, but It’s All the Same to a Computer Tag (n): A label attached to someone or something for the purpose of identification or to give other information. Tag (v): Attach a label to. Some Definitions Metadata (n. pl) A set of data that describes and gives information about other data. Tagging Adds “Metadata” Tag (the Game) Element content <tag>You’re it</tag> Start tag End tag An XML Element Data Element content <tag>You’re it</tag> Start tag End tag Metadata An XML Element Element content <em>You’re it</em> Start tag End tag An HTML Element Element content <title>Hamlet</title> Start tag End tag A TEI Element A TEI File (“Open” format) A Microsoft Word File, “Under the Hood” (“Closed”/Proprietary format) HTML tags cater to “typographical man”: they are mostly concerned with how to display given information. TEI tags classify and store information about texts, for the purpose of processing that data in some way. Yesterday, my roommate Ruby stormed in and declared, “I’ve had it with you. I’m moving out.” Apparently he was “frustrated” by the fact that I hadn’t done dishes in a week, and that I had lost his Arrested Development DVDs in a pile of filth. “C’est la vie,” I said. “Some people don’t like cleaning. It is your misfortune to live with one.” A Sample Text [. .] <p>Yesterday, my roommate Ruby stormed in and declared, “I’ve had it with you. I’m moving out.”</p> <p>Apparently he was “frustrated” by the fact that I hadn’t done dishes in a week, and that I had lost his <em>Arrested Development</em> DVDs in a pile of filth.</p> <p>“<em>C’est la vie</em>,” I said. “Some people don’t like cleaning. It is your misfortune to live with one.”</p> [. .] The Sample Text Tagged in HTML [. .] <p><date when="2018-02-28">Yesterday</date>, my roommate <persName>Ruby</persName> stormed in and declared, <said>I’ve had it with you. I’m moving out</said>.</p> <p>Apparently he was <soCalled>frustrated </soCalled> by the fact that I hadn’t done dishes in a week, and that I had lost his <bibl><title>Arrested Development</title></bibl> DVDs in a <place><placeName>pile of filth </placeName><location><geo>43.668264, -79.399889 </geo></location></place>.</p> <p><said><foreign xml:lang="fr">C’est la vie</foreign></said>, I said. <said>Some people don’t like cleaning. It is your misfortune to live with one.</said></p> [. .] The Sample Text Tagged in TEI “Tags” Are Often Invisible from the Surface TEI Tags Can Become Display-Oriented HTML By a Process of Conversion… Value Attribute <foreign xml:lang="fr">C’est la vie</foreign> Start tag Element content End tag A TEI Element with an Attribute and Attribute Value Data Value Attribute <foreign xml:lang="fr">C’est la vie</foreign> Start tag Element content End tag Metadata A TEI Element with an Attribute and Attribute Value Value Attribute Click <a href="http://bit.ly/hg77">here</a> Start tag End tag Element content An HTML Element with an Attribute and Attribute Value <p> <said> <foreign> </foreign> </said> <said> </said> </p> XML Nesting: Properly Nested Elements a TEI Document (“Well-Formed”) Russian Nesting Dolls, a.k.a Matryoshki <p> <said> <foreign> </said> </foreign> <said> </p> </said> XML Nesting: Improperly Nested Elements (Not “Well-Formed”) <p> <said> <foreign> </said> </foreign> <said> </p> </said> XML Nesting: Improperly Nested Elements (Not “Well-Formed”) <p> <said> <foreign> Machines</said> are unforgiving, </foreign> literal<said> readers. If you break <p> their</said> rules, even slightly, they understand nothing. XML Nesting: Improperly Nested Elements <sp> <speaker>Gower</speaker> <lg type="stanza"> <l>To sing a song that old was sung</l> <l>From ashes ancient Gower is come,</l> <l>Assuming man’s infirmities</l> <l>To glad your ear and please your eyes.</l> … </lg> </sp> If You Had Access to Every Shakespearean Play Marked Up Like This, What Would You Be Able to Do With It? [. .] <p><date when="2013-02-26">Yesterday</date>, my roommate <persName>Ruby</persName> stormed in and declared, <said>I’ve had it with you. I’m moving out</said>.</p> <p>Apparently he was <soCalled>frustrated </soCalled> by the fact that I hadn’t done dishes in a week, and that I had lost his <bibl><title>Arrested Development</title></bibl> DVDs in a <place><placeName>pile of filth</placeName><location><geo>43.668264, - 79.399889</geo></location></place>.</p> <p><said><foreign xml:lang="fr">C’est la vie</foreign></said>, I said. <said>Some people don’t like cleaning. It is your misfortune to live with one.</said></p> [. .] If Every Novel Every Written Was Marked Up In This Way, What Could You Do With It? Now for our class project, which is all about ”narrative levels” We are participating in the first “Shared Task in the Digital Humanities”: Systematic Analysis of Narrative Texts through Annotation (SANTA) sharedtasksinthedh.github.io The goal of this project is to come up with a way of TAGGING frame narratives, and then to use this tagging to teach machines how to recognize frame narratives. Why frame narratives? Because they should be reasonably easy to tag, and just about everything else to do with narrative is incredibly hard. Alas, though: even frame narratives are pretty challenging when you think about them. FRAME NARRATIVE. Any time a character in a story begins to tell a story of his or her own, creating a narrative within a narrative, or a tale within a tale, you have a “frame narrative.” NARRATIVE: The representation of a STORY (an event or series of events). In written fiction, narratives are told by NARRATORS. GENETTE’S DOODLE: CLASSIC EXAMPLE: HAMLET HAMLET THE MOUSE- TRAP CLASSIC EXAMPLE: HAMLET A B CLASSIC EXAMPLE: HAMLET A B TWO NARRATIVES TWO NARRATIVE LEVELS (A+B) THOUSAND AND ONE NIGHTS A B B B C THOUSAND AND ONE NIGHTS A B B B C FIVE NARRATIVES THREE NARRATIVE LEVELS (A+B+C) SHAKY TOTS (1st folio) CHRISTOPHER SLY TAMING OF THE SHREW SHAKY TOTS (1st folio) A B HANKY TOTS “I” DOUGLAS GOVERNESS HANKY TOTS A B C James’s TOTS is a great example of why explicitly tagging frame narratives is hard.
Details
-
File Typepdf
-
Upload Time-
-
Content LanguagesEnglish
-
Upload UserAnonymous/Not logged-in
-
File Pages110 Page
-
File Size-