<<

Between the Panels: A Metadata Schema for Webcomics

Erin Donohue

Melanie Feinberg INF 384C: Organizing Information Spring 2014 Webcomics: A Descriptive Schema

Purpose and Audience:

This schema is designed to facilitate access to the oftentimes chaotic world of webcomics in a systematic and organized way. I have been reading webcomics for over a decade, and the only way I could find new was through word of mouth or by following links on the sites of comics I already read. While there have been a few attempts at creating a centralized listing of webcomics, these collections consist only of comic titles and artist names, devoid of information about the comics’ actual content. There is no way for users to figure out if they might like a comic or not, except by visiting the site of every comic and exploring its archive of posts.

I wanted a more systematic, robust way to find comics I might enjoy, so I created a schema that could be used in a catalog of webcomics. This schema presents, at a glance, the most relevant information that webcomic fans might want to know when searching for new comics. In addition to basic information like the comic’s title and artist, this schema includes information about the comic’s content and style—to give readers an idea of what to expect from a comic without having to browse individual comic . The attributes are specifically designed to make browsing lots of comics quick and easy.

This schema could eventually be utilized in a centralized comics database and could be used to generate recommendations using mood, art style, common themes, and other attributes. This schema is primarily intended for users who have some experience with webcomics and who are seeking new comics to read. However, the terminology in the schema is deliberately non­technical and accessible, so these entity descriptions can be used even by comic novices looking to explore the medium for the first .

Scope and Definitions:

A webcomic is a comic published on a . The term is used here to refer to both individual comic strips and the overall website upon which the comics are hosted. For the purposes of this metadata schema, to be considered a “webcomic,” a comic must consist of at least comic posted on a website (or section of a website) specifically devoted to that comic. The comic should be created with the primary purpose of being put online (as opposed to in print media).

What counts as a webcomic: * A website consisting of digitally­created drawings and text * A website consisting of handmade drawings scanned and uploaded to the web * A website devoted to a comic that was updated once and never again * A comic hosted on a distinct section of a larger website not devoted to comics * A comic that was first published online and later appeared in a or in addition to continuous online postings What doesn’t count as a webcomic: * A comic originally created for a newspaper or magazine that has subsequently been put online (e.g., Calvin and Hobbes, , The Far Side) * that predominantly feature non­comic content that publish occasional comics, not under a separate formal title or website section distinct from the text (e.g., A recipe blog with a single comic post about cooking) * Reblogs or shares of other comics (not the poster’s own work) on Tumblr, , or other Attribute Descriptions To apply this metadata schema to a webcomic, assign values to the attributes described on the following pages. Title and Artist are mandatory attributes and must have a value, even if that value is “[unknown].” All other attributes in this schema are optional.

Title The proper name of the comic Mandatory: Yes Repeatable: Yes Refinements: Alternative Title: Any form of the title used as a substitute or alternative to the formal title of the comic Input Guidelines: ● The title can be found on the comic’s website, often prominently featured in the header image. ● Titles should be formatted as they are on the comic site (e.g., xkcd should remain all lowercase). Checking outside resources, such as Wikipedia or Google can be useful in clarifying the established capitalization convention. ● Do not omit leading articles (e.g., “A” or “The”) from the title. ● If the comic is popularly known by another title that differs from its formal title, use the Alternative Title attribute. This attribute should not be used for (e.g., for Saturday Morning Breakfast Cereal, do not put SMBC as an alternative title). ● If the comic has no title, put [untitled] in the title field.

Artist The creator(s) and/or current producers of the comic Mandatory: Yes Repeatable: Yes Refinements: Writer: The person responsible for the written content of the comic Illustrator: The person responsible for the visual art (drawings, photos) of the comic Input Guidelines: ● Artist names should be written in Last Name, First Name format. If the artist uses only one name, list that name. ● If the artist is an organization, company, or collective, list the organization’s name, omitting leading articles like “A” or “The.” ● If the artist uses an alias or pen name, list that name according to the format above. If the artist’s given name is known, put it in parentheses after the alias/pen name. ● If the comic is written and illustrated by different people and the roles are clearly delineated (e.g., specified on the comic’s “About” page), the two child attributes, Writer and Illustrator, should be used instead of the Artist attribute. ● For comics made by multiple people (without a clearly defined writer/illustrator role), separate the artists’ names with a semicolon or use multiple instances of the Artist attribute. ● If the artist is unknown or anonymous, put “[unknown]” in the Artist attribute.

Description An overview of the content of the comic Mandatory: No Repeatable: Yes Input Guidelines: ● This attribute is intended to give a brief, general overview of the comic’s subject matter, ideally in the artist’s own words. It can be most commonly found in the “About” page or the header of a comic’s website. ● Because the value for this attribute comes directly from the artist’s website, it may be quirky or not particularly illuminating about the comic’s actual content. Though superficially uninformative, this text can still give potential readers an idea of the comic’s tone or mood, as well as the artist’s attitude. ● Cite the source URL for all material in brackets after the description. ● If the “About” page is not at all informative (e.g., It just says “Hi, this is a comic.”), if the “About” page consists only of information about the artist and not about the comic specifically, or if no clear “About” page exists, check outside resources such as Google or Wikipedia for a description. If these resources do not provide an abstract on the comic’s subject matter, click through the comic’s archive and try to synthesize the comic’s common topics, tone, and visual style into a brief description.

URL The website address where the comic can be found Mandatory: No Repeatable: Yes Input guidelines: ● Format the URL in the following way: http://www.example.com ● If a comic has multiple URLs or is hosted in multiple places, list each site in a separate URL field. ● If the site upon which a comic was hosted is no longer available, put the last known URL and “[deprecated]” after it. ● Take pains to find the original source of the webcomic. For instance, if using material posted on social media (especially Tumblr), ensure that the site you list is the site hosted by the comic’s actual artist or organization, and the material is not merely being reposted/reblogged from an outside source. ● If a secondary source (e.g., a Tumblr page) is the only remaining source for the webcomic, and the original site has been taken down, note the original URL in the manner described above and use a second URL field to list the secondary source URL. Make a note that this is not the original source of the material. Date of creation The date the comic was first published online Mandatory: No Repeatable: No Input Guidelines: ● The date can be formatted as MM/DD/YYYY, MM/YYYY, or YYYY, depending upon the amount of information known ● This information can commonly be found in the “About” page or by navigating to the earliest post in the comic’s archive. Wikipedia is a good secondary source for more popular comics. ● If the creation date is unknown, indexers may put “[unknown]” as a value.

Date of conclusion The publication date of the last comic Mandatory: No Repeatable: No Refinements: Last updated: In the case of a comic that has not formally concluded, the date of the last published comic Input Guidelines: ● The date can be formatted as MM/DD/YYYY, MM/YYYY, or YYYY, depending upon the amount of information known ● This information can commonly be found in the “About” page or by navigating to the latest post in the comic’s archive. Wikipedia is a good secondary source for more popular comics. ● If the comic is still being published, indexers may either exclude this attribute or put “N/A” as a value. ● Often comics will stop being published without a formal conclusion or message from the author. If the comic has not been updated for over a year from the date of the last comic post, use the refined attribute “Last updated” instead of “Date of conclusion.” Put the date of the last posted comic in this field. Continue to monitor the comic for updates and delete or “N/A” this attribute as appropriate. ● If the conclusion date is unknown but the comic has definitely concluded, indexers may put “[unknown]” as a value. Update schedule The general frequency with which new comics are posted on the site Mandatory: No Repeatable: Yes Input Guidelines: ● This information can sometimes be obtained in the “About” or “FAQ” sections of a site. If not available, check through the comic’s more recent archive to get a sense of the typical frequency of updates. ● Acceptable values include: Daily; Weekly; Monday, Wednesday, Friday; Monthly; Irregular ● For comics without a set update schedule, use the “Irregular” value

Narrative The structure of the comic’s storyline Mandatory: No Repeatable: No Input Guidelines: ● Acceptable values: Standalone, , Variable, Other ● To complete this field, look at the comics both individually and in aggregate. If each comic’s narrative arc is self­contained, use “Standalone.” If the comic is serialized with plots that develop across many strips, use “Serial.” ● If the site frequently features comics that fall into both categories, use “Variable.” ● Occasional deviations (e.g., a standalone narrative comic in a typically serialized strip) do not require listing an additional Narrative attribute. ● If none of the previous classifications apply, use “Other.

Format The general structure or layout of the comic Mandatory: No Repeatable: Yes Input Guidelines: ● Acceptable values include: Single , 2 panels, 4 panels, Multipanel, Variable ● Occasional deviations (e.g., a single panel comic in a typically 4­panel strip) do not require listing an additional Format attribute. ● If there is no standard or typical format, use the “Variable” value. Mood The general feeling or atmosphere created in the comic Mandatory: No Repeatable: Yes Input Guidelines: ● The mood is a subjective judgment on the feeling the artist creates in the comic through setting, art style, characters, dialogue, etc. ● If the mood is not clearly articulated in the comic’s “About” page, skimming several of the comics from different time periods in its history can be useful in discerning it. ● Consider the comic as a whole, not merely individual posts. ● There is no official controlled vocabulary for this attribute, but strive for consistency with existing records. For example, if “Surreal” is already used in several existing records, use “Surreal” in new applicable records, instead of “Weird” or “Bizarre.” ● Acceptable values include: Light, Dark, Humorous, Dramatic, Cheerful, Ominous ● Assign as many values as the comic’s mood warrants. These can be separated using a comma between values or by using a separate Mood attribute for each value.

Art style The visual presentation of the comic’s drawings or photographs Mandatory: No Repeatable: Yes Input Guidelines: ● The art style is a subjective judgment on the qualities of the comic’s illustrations. It is best ascertained by skimming several posts from different time periods in the comic’s history ● Consider the art of comic as a whole, not merely individual posts. ● There is no official controlled vocabulary for this attribute. It can, but does not have to, use technical visual art terminology. Terms should be clear and familiar to a wide audience that may not have a visual art background. ● Art thesauri are a useful starting point for generating values. E.g., http://www.macmillandictionary.com/us/thesaurus­category/british/Words­used­to­descri be­works­of­art­and­pictures ● Strive for consistency with existing records. For example, if “Unrefined” is already used in several existing records, use “Unrefined” in new applicable records, instead of close synonyms like “Rough” or “Gritty.” ● Sample acceptable values include: Minimalist, Intricate, Surreal, Dark, Light, Futuristic, Street Art, Abstract ● Assign as many values as the comic’s visual style warrants. These can be separated using a comma between values or by using a separate Art style attribute for each value. Common themes Typical subjects addressed in the comic Mandatory: No Repeatable: Yes Input Guidelines: ● The values for this attribute can be ascertained by browsing the comic’s archive or, in the case of more popular comics, by searching Wikipedia and Google ● A theme is “common” if it appears in a significant number of the posts on the comic’s site. This attribute should not be used to catalog the subject matter of every single comic. ● Assign as many values as the comic’s subject matter warrants. These can be separated using a comma between values or by using a separate Common Themes attribute for each value.

Characters The types/species of characters prominently featured in the comic Mandatory: No Repeatable: Yes Input Guidelines: ● This attribute is particularly useful in comics that prominently feature non­human characters (e.g., animals, vampires, robots) ● “Humans” should not include zombies, superheroes, cyborgs, mutants, or other variants. These are distinct values. ● In comics featuring animal characters, include the types of animals in parentheses after the initial “Animals” value. E.g., Animals (Bear, Cow, Duck).

Profanity The level of swearing typically used in the comic Mandatory: No Repeatable: No Input Guidelines: ● Values for this attribute should be entered on a scale of None to High ○ None: There are no curse words in the comic. ○ Low: An occasional curse word; the overwhelming majority of the comic has no profanity and a casual visitor to the site’s archive would probably not find a comic with “adult language.” ○ Moderate: Curse words appear fairly often in the comic. In a set of ten comics, 3 or 4 might have a curse word. A casual visitor to the site’s archive would be likely to find a few comics with “adult language.” ○ High: Curse words appear regularly in the comic. In a set of ten comics, at least five of them feature curse words. A casual visitor to the site’s archive would be hard pressed to find a comic without “adult language.” Explicit sexual content The level of full or partial nudity, sexual references, or sexual acts typically featured in the comic Mandatory: No Repeatable: No Input Guidelines: ● Values for this attribute should be entered on a scale of None to High ○ None: There is no explicit sexual content in the comic. ○ Low: Occasional nudity or sexual references; the overwhelming majority of the comic has no explicit sexual content and a casual visitor to the site’s archive would probably not find a comic featuring it. ○ Moderate: Explicit sexual content appears fairly often in the comic. In a set of ten comics, 3 or 4 might have nudity or sexual references/acts. A casual visitor to the site’s archive would be likely to find a few comics with explicit sexual material. ○ High: Explicit sexual content appears regularly in the comic. In a set of ten comics, at least five of them feature nudity or sexual references/acts. A casual visitor to the site’s archive would be hard pressed to find a comic without explicit sexual material.

Education level The general intellectual level of the comic’s content. How much education a person would typically need to understand and enjoy the comic’s subject matter. Mandatory: No Repeatable: Yes Input Guidelines: ● This attribute should be populated with the following controlled vocabulary: ○ Elementary School: Readers with an elementary­school­level education can understand and enjoy the comic. Words are fairly simple and subjects are uncomplex. ○ Middle School: Readers with a middle­school­level education can understand and enjoy the comic. Words are slightly more complicated; subject matter may be slightly darker but still understandable by a younger audience. ○ High School: Readers with a high school education can understand and enjoy the comic. Language is on par with typical adult reading level and themes and subjects are suitable for young adults. ○ College: Readers with a college education can understand and enjoy the comic. Language and themes are understood by a highly educated general audience. ○ Graduate School: Readers with a graduate education can understand and enjoy the comic. Frequently includes highly specialized subject knowledge and terms. ○ Einstein: You need to be a genius to understand and enjoy this comic. Alternate media Any other formats (besides on an Internet website) in which large portions of the comic are available Mandatory: No Repeatable: Yes Input Guidelines: ● This attribute should be used only for media in which the comic as a whole (or substantial collections of it) appear. Individual comic strips appearing in other media should not be noted here. ● This attribute should not be applied to comic merchandise, such as t­shirts or posters. ● This information is often advertised on the comic’s site (e.g., book promotions) and can also be found via Google and Wikipedia. Wikipedia’s List of Webcomics in Print is a helpful guide: http://en.wikipedia.org/wiki/List_of_webcomics_in_print ● Acceptable values include: Book, PDF, Newspaper, iPad app Examples The following examples can provide guidance for how to utilize the schema’s attributes. There are three central cases, A Softer World, Hark! A Vagrant, and xkcd, that represent typical manifestations of webcomics. There are also two border cases, Hyperbole and a Half and Nawlz. The Hyperbole and a Half case is an example of how to handle a comic/blog hybrid. The Nawlz case exemplifies how to handle interactive webcomics.

Central Cases:

Title A Softer World

Writer Comeau, Joey

Illustrator Horne, Emily

Description A Softer World is a comic that was created by Emily Horne and Joey Comeau so that people would recognize them as important artistic geniuses. Sometimes the "comic" is sad or harsh. It should be noted that this is in the tradition of George Simenon's 'romans durs' (or 'hard novels') and not in the lesser traditions of comics like or anything else not French. [Source: http://www.asofterworld.com/about.php ]

URL http://www.asofterworld.com/

Date of creation 02/07/2003

Date of conclusion N/A

Update schedule Daily

Narrative Standalone

Format 3 panels

Mood Dark, Humorous, Philosophical, Surreal

Art style Dark, Obscure, Minimalist

Common themes Death, Philosophy, Love, Relationships

Characters Humans

Profanity Moderate

Explicit sexual Low content Education level High School

Alternate media Book, Newspaper, PDF

Title Hark! A Vagrant

Artist Beaton, Kate

Description Hark! A Vagrant is a webcomic created by Kate Beaton. Having previously earned a bachelor’s degree in history, much of the humor in Beaton’s comics comes from comedic depictions of historical figures and scenes, although she does occasionally post about more contemporary subjects, such as comics and Nancy Drew books. [Source: http://knowyourmeme.com/memes/sites/hark­a­vagrant ]

URL http://harkavagrant.com/index.php

Date of creation 2007

Date of conclusion N/A

Update schedule Irregular

Narrative Standalone

Format Multipanel

Mood Humorous, Fun, Intellectual

Art style Unrefined

Common themes History, Literature, Feminism

Characters Humans, Superheroes, Animals (Pony)

Profanity Low

Explicit sexual Low content

Education level College

Alternate media Book Title xkcd

Artist Munroe, Randall

Description A webcomic of romance, sarcasm, math, and language. [Source: http://xkcd.com ]

URL http://xkcd.com/

Date of creation 09/2005

Date of conclusion N/A

Update schedule Monday, Wednesday, Friday

Narrative Standalone

Format Irregular

Mood Humorous, Scientific, Intellectual, Philosophical

Art style Light, Unrefined, Minimalist

Common themes Science, Math, Computers, Philosophy, Relationships

Characters Humans

Profanity Low

Explicit sexual None content

Education level College, Graduate School

Alternate media Book Border Cases:

Title Hyperbole and a Half

Artist Brosh, Allie

Description Hyperbole and a Half is a combination of web comic and blog written by Allie Brosh. The comic is drawn in Paintbrush and is intentionally artistically crude. It is a retelling of her life, and includes stories from her childhood as well as the challenges she faces as an adult. Brosh has also expanded the comic into a series of web in a similar style, which have been popular [Source: http://en.wikipedia.org/wiki/Hyperbole_and_a_Half ]

URL http://hyperboleandahalf.blogspot.com/

Date of creation 07/2009

Date of conclusion N/A

Update schedule Irregular

Narrative Standalone

Format Panels interspersed with blog text

Mood Humorous, Absurdist, Childlike

Art style Unrefined, Childlike, Colorful

Common themes Slice of life, Childhood, Adulthood, ADHD, Dogs

Characters Humans, Animals (Dogs)

Profanity Low

Explicit sexual None content

Education level Middle School

Alternate media Book Title Nawlz

Artist Sutu (Campbell, Stu)

Description Nawlz is a 14 episode cyberpunk adventure series that combines , interactivity, music and text to create a never before seen digital panoramic comic format. The story follows Harley Chambers as he kicks through the futuristic City of Nawlz, engaging in overlaying virtual realities, mind­bending drugs and other strange techno­cultures. Nawlz is full of hidden dimensions that require a tap, tilt and swipe to explore. [Source: http://www.nawlz.com/hq/about/ ]

URL http://www.nawlz.com/

Date of creation 2008

Date of conclusion N/A

Update schedule Irregular

Narrative Serial

Format Variable

Mood Dark, Serious

Art style Dark, Surreal, Street Art, Interactive

Common themes Cyberpunk, Technology, Drugs, Dystopia

Characters Humans, Giants, Animals (Pufferfish, Octopus)

Profanity Moderate

Explicit sexual None content

Education level High School

Alternate media iPad app