Cultivating Capacity, Creating Change
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Distant Reading, Computational Stylistics, and Corpus Linguistics the Critical Theory of Digital Humanities for Literature Subject Librarians David D
CHAPTER FOUR Distant Reading, Computational Stylistics, and Corpus Linguistics The Critical Theory of Digital Humanities for Literature Subject Librarians David D. Oberhelman FOR LITERATURE librarians who frequently assist or even collaborate with faculty, researchers, IT professionals, and students on digital humanities (DH) projects, understanding some of the tacit or explicit literary theoretical assumptions involved in the practice of DH can help them better serve their constituencies and also equip them to serve as a bridge between the DH community and the more traditional practitioners of literary criticism who may not fully embrace, or may even oppose, the scientific leanings of their DH colleagues.* I will therefore provide a brief overview of the theory * Literary scholars’ opposition to the use of science and technology is not new, as Helle Porsdam has observed by looking at the current debates over DH in the academy in terms of the academic debate in Great Britain in the 1950s and 1960s between the chemist and novelist C. P. Snow and the literary critic F. R. Leavis over the “two cultures” of science and the humanities (Helle Porsdam, “Too Much ‘Digital,’ Too Little ‘Humanities’? An Attempt to Explain Why Humanities Scholars Are Reluctant Converts to Digital Humanities,” Arcadia project report, 2011, DSpace@Cambridge, www.dspace.cam.ac.uk/handle/1810/244642). 54 DISTANT READING behind the technique of DH in the case of literature—the use of “distant reading” as opposed to “close reading” of literary texts as well as the use of computational linguistics, stylistics, and corpora studies—to help literature subject librarians grasp some of the implications of DH for the literary critical tradition and learn how DH practitioners approach literary texts in ways that are fundamentally different from those employed by many other critics.† Armed with this knowledge, subject librarians may be able to play a role in integrating DH into the traditional study of literature. -
KB-Driven Information Extraction to Enable Distant Reading of Museum Collections Giovanni A
KB-Driven Information Extraction to Enable Distant Reading of Museum Collections Giovanni A. Cignoni, Progetto HMR, [email protected] Enrico Meloni, Progetto HMR, [email protected] Short abstract (same as the one submitted in ConfTool) Distant reading is based on the possibility to count data, to graph and to map them, to visualize the re- lations which are inside the data, to enable new reading perspectives by the use of technologies. By contrast, close reading accesses information by staying at a very fine detail level. Sometimes, we are bound to close reading because technologies, while available, are not applied. Collections preserved in the museums represent a relevant part of our cultural heritage. Cataloguing is the traditional way used to document a collection. Traditionally, catalogues are lists of records: for each piece in the collection there is a record which holds information on the object. Thus, a catalogue is just a table, a very simple and flat data structure. Each collection has its own catalogue and, even if standards exists, they are often not applied. The actual availability and usability of information in collections is impaired by the need of access- ing each catalogue individually and browse it as a list of records. This can be considered a situation of mandatory close reading. The paper discusses how it will be possible to enable distant reading of collections. Our proposed solution is based on a knowledge base and on KB-driven information extraction. As a case study, we refer to a particular domain of cultural heritage: history of information science and technology (HIST). -
To Download a PDF of Our April, 2019 Edition
H PUBLISHED IN NORTHEAST PORTLAND SINCE 1984 H STAR PUBLISHING INC. STAR THE HOLLYWOOD Great ideas for Mother's Day & Easter! PAGES 12-13 H SERVING NORTHEAST AND NORTH METROPOLITANNEWS PORTLAND NEIGHBORHOODS H APRIL 2019 H VOLUME 36, NUMBER 10 H StarH HSTAR COMMUNITY NEWS WALKING WHILE BLACK FEELS UNSAFE IN CITY Walking while Black in Portland can feel treacherous, according to data analysis and public outreach by transportation planners who are developing a city pedestrian plan. Residents in Southwest and outer eastside neighborhoods have the most gaps in sidewalks and they also have inadequate BY JANET GOETZE lighting compared to close- [email protected] in areas, the planners said. However, the planners wanted to learn more because they received a relatively low percentage of survey comments from low-income Expoloring leafy Kerns areas and neighborhoods with sizable minority residents. They teamed with community organizations to develop two focus groups. Among other things, they learned that Black residents say they must be extra careful when Neighborhood by foot crossing streets or using crosswalks because they may be targets of racial slurs by drivers or other pedestrians. One woman who wanted to walk for exercise said BLOCK BY BLOCK SAVORING FLAVORS OF KERNS, SERVED UP BY FRANKS-A-LOT'S BILLY GOUCHER. PAGE 15 she never ventures out after 5 p.m. because she doesn’t feel safe. Another said she pays her son’s car insurance “because it is safer for him to drive than to be exposed.” In addition to increasing lighting in underserved communities, planners said in a summary document, their “toolbox” will include partnering with other agencies and city bureaus “to advance the well-being and personal security of vulnerable communities and continuing to research racial bias and driving behavior.” The “toolbox” information and focus group summary are in the project website www.pedpdx.com. -
Padgett’ S Turn to Violence the Clues That This Seemingly Ordi- Olds High School
SMILE AND FIGHT Improv star Parker stares down cancer — SEE LIFELIFE,, BB11 PortlandTHURSDAY, JUNE 19, 2014 • TWICE CHOSEN THE NATION’S BESTT NNONDAILYONDAILY PAPERTribune • PORTLANDTRIBUNE.COMPORTLANDTRIBUNE.COM • PUBLISHEDPUBLISHED TUESDAYTUESD AND THURSDAY O, Death City takes swing at spare reviving golf game me till Offi cials look to Colwood acquisition to get sport out of hole we talk By J ENNIFER ANDERSON The Tribune Three months after the city’s aquisition of the Colwood National Golf Club about it in Northeast Portland, the city is looking “The game to use it to at- needs a ■ tract a young- Death Cafes er, more di- boost. verse popula- We need tion of golfers tackle the taboo topic to save the fu- a way to ture of the bring other game. people — to help folks make “Most golf- ers are like me more — the majority gender most of their lives of us are gray- haired and diversity, amie always thought she’d like to be male,” says more buried under an oak tree in her rural John Zoller, di- cultural Clackamas County backyard. Nearing rector of the Jdeath after a long bout with cancer Portland Pub- diversity, last fall, she had Eric, her partner, contact lic Golf pro- especially Elizabeth Fournier, who specializes in do-it- gram for more yourself funerals through Cornerstone than 25 years. the young Funeral Services. “The game kids.” Fournier came out for a visit. She told the needs a boost. — J ohn Z oller, couple that over the coming weeks they would We need a way Portland Public talk about what type of to bring other Golf program funeral Jamie wanted, people — more director STORY BY and that there were gender diversi- questions and details ty, more cul- PETER KORN almost nobody consid- tural diversity, ers ahead of time. -
IXA-Pipe) and Parsing (Alpino)
Details from a distance? CLIN26 Amsterdam A Dutch pipeline for event detection Chantal van Son, Marieke van Erp, Antske Fokkens, Paul Huygen, Ruben Izquierdo Bevia & Piek Vossen CLOSE READING DISTANT READING NewsReader & BiographyNet Apply the detailed analyses typically associated with close (or at least non-distant) reading to large amounts of textual data ✤ NewsReader: (financial) news data ✤ Day-by-day processing of news; reconstructing a coherent story ✤ Four languages: English, Spanish, Italian, Dutch ✤ BiographyNet: biographical data ✤ Answering historical questions with digitalized data from Biography Portal of the Netherlands (www.biografischportaal.nl) using computational methods http://www.newsreader-project.eu/ http://www.biographynet.nl/ Event extraction and representation 1. Intra-document processing: Dutch NLP pipeline generating NAF files ✤ pipeline includes tokenization, parsing, SRL, NERC, etc. ✤ NLP Annotation Format (NAF) is a multi-layered annotation format for representing linguistic annotations in complex NLP architectures 2. Cross-document processing: event coreference to convert NAF to RDF representation 3. Store the NAF and RDF in a KnowledgeStore Event extraction and representation 1. Intra-document processing: Dutch NLP pipeline generating NAF files ✤ pipeline includes tokenization, parsing, SRL, NERC, etc. ✤ NLP Annotation Format (NAF) is a multi-layered annotation format for representing linguistic annotations in complex NLP architectures 2. Cross-document processing: event coreference to convert NAF to RDF 3. -
Guide to Local Love 2020
2020 GUIDE TO LOCAL LOVE Spread the love to our generous sponsors and community partners! ROSE CITY SPIRITS PDX HOMEGROWN BEECH ST. PARLOR COLIBRI This bar built in an old home features antique furnishings, crafty Custom floral design studio and plant shop specializing in composed cocktails, light and late-night gourmet fare, and djs every night. arrangements for individuals, businesses, and occasions alongside tropical plants and pottery. CARDINAL CLUB ELECTRIC LETTUCE East Coast style cozy tavern serving seasonal cocktails, yummy food, and music on vinyl. Electric Lettuce is a recreational dispensary with multiple locations throughout Oregon, offering a righteous selection of smoking acces- sories, cannabis edibles and topicals, concentrates and extracts, and ELK COVE VINEYARD & PIKE ROAD WINES flower. Willamette Valley winery crushing it since 1974 to create estate grown Pinot Noir, Pinot Gris, and Pinot Blanc. SECRET AARDVARK TRADING COMPANY The best little hot sauce in Portland. Founded in 2004 in farmer’s NEW DEAL DISTILLERY markets and now featured in over 300 eateries and grocery stores from here to NYC. High-quality, craft distilled artisan vodkas, gins, liqueurs, whiskies, and rums – made locally in small batches. Featuring a Taproom and PORTLAND NURSERY Distillers Workshop in SE Portland. Two locations supplying a vast selection of garden accessories and PRODUCE ROW CAFE design services plus native plants, veggies, and flowers. Located in the heart of the SE industrial district, offering 24 taps, VOODOO DOUGHNUT craft cocktails, a spacious year round heated patio, private and You know it, you love it. Pink boxes full of good things. semi-private event space, Sunday brunch with live DJs, and monthly dance parties. -
2016 Portland Hotel Guide
Portland Hotel Guide HouseSpecial 2016 housespecial.com Airport 12 420 NE 9th Ave. 10 2 11 8 7 4 3 6 1 5 9 North WELCOME TO PORTLAND Here are some hotel suggestions for your stay. Hopefully, this will give you a little taste of the city and make your decision a bit easier. We know you’re going to love Portland — we sure do. 1 The Nines HOUSESPECIAL RATE HOTELS 2 Ace Hotel The Nines .....................................................................page 3 3 Hotel Lucia Ace Hotel .....................................................................page 4 4 Hotel deLuxe Hotel Lucia ...................................................................page 5 Hotel deLuxe ................................................................page 6 5 Hotel Monaco Hotel Monaco...............................................................page 7 Sentinel Hotel ..............................................................page 8 6 Sentinel Hotel Hotel Vintage ...............................................................page 9 7 Hotel Vintage Hotel Eastlund..............................................................page 10 8 Benson Hotel 9 The Heathman Hotel STANDARD RATE HOTELS Benson Hotel ...............................................................page 11 10 Jupiter Hotel The Heathman .............................................................page 12 11 The Westin Jupiter Hotel ................................................................page 13 The Westin ...................................................................page 14 12 Hotel -
ISSN: 1647-0818 Lingua
lingua Volume 12, Número 2 (2020) ISSN: 1647-0818 lingua Volume 12, N´umero 2 { 2020 Linguamatica´ ISSN: 1647{0818 Editores Alberto Sim~oes Jos´eJo~aoAlmeida Xavier G´omezGuinovart Conteúdo Artigos de Investiga¸c~ao Adapta¸c~aoLexical Autom´atica em Textos Informativos do Portugu^esBrasileiro para o Ensino Fundamental Nathan Siegle Hartmann & Sandra Maria Alu´ısio .................3 Avaliando entidades mencionadas na cole¸c~ao ELTeC-por Diana Santos, Eckhard Bick & Marcin Wlodek ................... 29 Avalia¸c~aode recursos computacionais para o portugu^es Matilde Gon¸calves, Lu´ısaCoheur, Jorge Baptista & Ana Mineiro ........ 51 Projetos, Apresentam-se! Aplicaci´onde WordNet e de word embeddings no desenvolvemento de proto- tipos para a xeraci´onautom´atica da lingua Mar´ıaJos´eDom´ınguezV´azquez ........................... 71 Editorial Ainda h´apouco public´avamosa primeira edi¸c~aoda Linguam´atica e, subitamente, eis-nos a comemorar uma d´uziade anos. A todos os que colaboraram durante todos estes anos, tenham sido leitores, autores ou revisores, a todos o nosso muito obrigado. N~ao´ef´acilmanter um projeto destes durante tantos anos, sem qualquer financi- amento. Todo este sucesso ´egra¸cas a trabalho volunt´ario de todos, o que nos permite ter artigos de qualidade publicados e acess´ıveisgratuitamente a toda a comunidade cient´ıfica. Nestes doze anos muitos foram os que nos acompanharam, nomeadamente na comiss~aocient´ıfica. Alguns dos primeiros membros, convidados em 2008, continuam ativamente a participar neste projeto, realizando revis~oesapuradas. Outros, por via do seu percurso acad´emico e pessoal, j´an~aocolaboram connosco. Como sinal de agradecimento temos mantido os seus nomes na publica¸c~ao. -
VIANA: Visual Interactive Annotation of Argumentation
VIANA: Visual Interactive Annotation of Argumentation Fabian Sperrle* Rita Sevastjanova∗ Rebecca Kehlbeck∗ Mennatallah El-Assady∗ Figure 1: VIANA is a system for interactive annotation of argumentation. It offers five different analysis layers, each represented by a different view and tailored to a specific task. The layers are connected with semantic transitions. With increasing progress of the analysis, users can abstract away from the text representation and seamlessly transition towards distant reading interfaces. ABSTRACT are actively developing techniques for the extraction of argumen- Argumentation Mining addresses the challenging tasks of identi- tative fragments of text and the relations between them [41]. To fying boundaries of argumentative text fragments and extracting develop and train these complex, tailored systems, experts rely on their relationships. Fully automated solutions do not reach satis- large corpora of annotated gold-standard training data. However, factory accuracy due to their insufficient incorporation of seman- these training corpora are difficult and expensive to produce as they tics and domain knowledge. Therefore, experts currently rely on extensively rely on the fine-grained manual annotation of argumen- time-consuming manual annotations. In this paper, we present a tative structures. An additional barrier to unifying and streamlining visual analytics system that augments the manual annotation pro- this annotation process and, in turn, the generation of gold-standard cess by automatically suggesting which text fragments to annotate corpora is the subjectivity of the task. A reported agreement with next. The accuracy of those suggestions is improved over time by Cohen’s k [11] of 0:610 [64] between human annotators is consid- incorporating linguistic knowledge and language modeling to learn ered “substantial” [38] and is the state-of-the-art in the field. -
Enhancing Domain-Specific Entity Linking in DH
heterogeneous spectrum of Digital Humanities re- search. • Black Box and reproducibility. As Hasibi et Enhancing Domain-Specific al. (2016) recently remarked, the TagMe Entity Linking in DH RESTful API remains a black box, as it is im- possible to check whether it corresponds to Federico Nanni the system described in the original paper. [email protected] Not knowing the reliability of the system University of Mannheim, Germany limits its use for distant reading analyses, i.e. quantitative studies that go beyond text ex- Yang Zhao ploration. [email protected] • Language Versions. Currently, TagMe is University of Mannheim, Germany only available in English, German and Italian but does not support other widespread lan- Simone Paolo Ponzetto guages such as Chinese, Arabic, Spanish, and [email protected] French, which are essential for enhancing its University of Mannheim, Germany use in the DH community. • Infrequent Updates. TagMe has been ini- Laura Dietz tially created on the English 2009 version of [email protected] Wikipedia and it has been updated only University of New Hampshire, United States of America twice (summer 2012, summer 2016). Imag- ine a setting where a scholar intends to ana- lyze a collection of mainstream news on the Middle East: before the most recent update For the purpose of information retrieval and text the system was not able to detect mentions exploration, digital humanities (DH) scholars have ex- of “Al-Nursa Front”, the former Syrian amined the potential of methods such as keyphrase ex- branch of al-Qaeda. traction (Hasan and Ng, 2014) and named entity • Wikipedia as Knowledge Base. -
Prolegomena to the Automated Analysis of a Bilingual Poetry Corpus, with Particular Reference to an Annotated Edition of “The Cantos” of Ezra Pound
University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations 2016 Prolegomena To The Automated Analysis Of A Bilingual Poetry Corpus, With Particular Reference To An Annotated Edition Of “the Cantos” Of Ezra Pound Robin Seguy University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/edissertations Part of the English Language and Literature Commons Recommended Citation Seguy, Robin, "Prolegomena To The Automated Analysis Of A Bilingual Poetry Corpus, With Particular Reference To An Annotated Edition Of “the Cantos” Of Ezra Pound" (2016). Publicly Accessible Penn Dissertations. 2576. https://repository.upenn.edu/edissertations/2576 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/edissertations/2576 For more information, please contact [email protected]. Prolegomena To The Automated Analysis Of A Bilingual Poetry Corpus, With Particular Reference To An Annotated Edition Of “the Cantos” Of Ezra Pound Abstract Standing at the intersection of a theoretical investigation into the possibilities of applying the tools and methods of automated analysis to a large plurilingual poetry corpus and of a set of observables gleaned along the creation of a digitally annotated edition of The Cantos of Ezra Pound — a robust test-case for the TEI — the present dissertation can be read under different guises. One of them, for instance, would be that of a comedy, divina commedia or com�dia de Deus, in which the computer plays — Leibnizian harmonics! — the part of supreme intellect. A: The selva oscura is that of newly born “Digital Humanities” — burgeoning yet obscured already by two dominant paradigms. On the one hand, the constructivism inherited from poststructuralist theory; on the other, a na�ve return to the most trivial kind of linguistic realism. -
Where Close and Distant Readings Meet
Where Close and Distant Readings stress on mixed methods, as we are combining linguistic and literary criteria of selecting style markers to dis- Meet: Text Clustering Methods in criminate between blog genres (Leech & Short 2007,57-58). Literary Analysis of Weblog Genres Current research in the field of computational literary genre stylistics focuses on Most Frequent Words (e.g. Schöch Maciej Maryl and Pielström 2014; Jannidis & Lauer 2014) or functional [email protected] linguistic categories (or both) (e.g. Allison et. al 2011). Institute of Literary Research of the Polish Academy of Yet, this study applies similar methods to emergent and Sciences, Poland uncategorised forms of writing. The quantitative methods are incorporated into the qualitative research workflow in Maciej Piasecki order to create a productive feedback loop. [email protected] Wrocław University of Technology Corpus Ksenia Młynarczyk The corpus of blogs was collected with the use of [email protected] BlogReader - an extension of a corpus gathering system Wrocław University of Technology developed in CLARIN-PL (Oleksy et al. 2014) on a basis of open components: jusText andOnion (Pomikálek, 2011). From the initial set analysed by Maryl et al., 250 blogs were Problem: towards a non-topical classification of selected for processing as being long enough and included weblog genres clean text (comments were omitted). We intentionally left out blogs with exceptionally large or small amount of text The existing typologies of weblog genres - both popular in order to balance the sample. The selected subcorpus and academic - are based on the blog topic, e.g.