Humanities Computing: Essential Problems, Experimental Practice1

Lit&Ling 17/1 103-125 FINAL 23/7/02 3:41 pm Page 103 Humanities Computing: Essential Problems, Experimental Practice1 Willard McCarty King’s College London, London, UK For John Burrows, experimenter Abstract The application of computing to the disciplines of the humanities has two principal outcomes: useful results for the field of application and failures com- pletely to demonstrate what is known. These failures, an inevitable feature of modelling, point to the key question for humanities computing, how we know what we know, and so to the beginning of its own scholarly enquiry. This, I argue, proceeds along three branches, the algorithmic, the metatextual, and the representational. Examining the first of these here I argue for research toward an open-ended, interoperable set of primitives based on previous work in the field and designed for the emerging digital library environment. To set the stage for their further development I argue that the field as a whole does not wait on a theoretical formulation of what humanists do, rather should look to the tradition of experimental knowledge-making as this has been illuminated in recent years by historians, philosophers, and sociologists of science. Correspondence: Willard McCarty, 1 Introduction Centre for Computing in the Although computing has been applied to research in the arts and letters Humanities, King’s College London, 2 Strand, London WC2R 2LS, UK for more than 50 years, systematic effort to understand the consequences E-mail: and implications from an interdisciplinary perspective is quite recent. [email protected] Especially within the last decade, the growth of academic centres for humanities computing in the UK, Scandinavia, continental Europe, 1 This paper was first given at North America, and Australia has stimulated and supported inter- the humanities computing national debate on the nature, function and institutional role of the new seminar of the Computer scholarly field.3 Science and English Initiative, The following is a contribution to that debate. I begin with the central University College Dublin question, what happens when we apply computing to the humanities? I (March 2000), then as a special lecture at Stanford respond by exploring a provisional research agenda for the field and University and the University locating it within the tradition of experimental knowledge-making. In of Georgia (April 2000). particular I argue that work in the history, philosophy, and sociology of My thanks go to Susan science provides a robust basis on which to conceptualize the epistem- Schreibman, Michael Keller, ology that frames this agenda. I advocate learning from the application of © Oxford University Press 2002 Literary and Linguistic Computing, Vol. 17, No. 1, 2002 103 Lit&Ling 17/1 103-125 FINAL 23/7/02 3:41 pm Page 104 Willard McCarty humanistic and sociological methods to the natural sciences in our William Kretzschmar, and continuing effort to understand the impact of a scientific device on the Nelson Hilton for the humanities. opportunities to work on it in public; to Charles Bigelow, John Bradley, John Burrows, 2 What Happens Harry Collins, Vanessa Davies, In operational terms, when humanities research is computerized the Greg Dening, Susan Hockey, source materials become data—that is, computable information—and John Lavagnino, Tito Orlandi, the research methods resolve into some combination of software and Wilhelm Ott, Harold Short, Patricia Wright, and others I markup.4 What happens intellectually is neither solely computational cannot (mea culpa) recall for nor autonomously human but a combination or interaction of both—a help along the way. thinking with, and against, the computer. The translation of source 2 See the bibliographies by materials into data involves two fundamental steps: a choice of what to Lancashire (1991) and Adamo consider to be the data and a perceptual shift required to see these (1994); for text analysis, see materials as data. also the bibliography in In making this choice one rules out not just what one regards as McCarty (1993). irrelevant—the illustration on the front cover? the text of the colo- 3 For an accounting and phon?—but also those features of the source that are too difficult or taxonomy of humanities wholly impossible now, or perhaps ever, to compute. Such exclusion computing centres, see raises interesting questions. The perceptual shift goes both ways: one can McCarty and Kirschenbaum (2000); for recent public compare the source before conversion—with all its implicit contexts and debate, see the essays listed on the perceiver’s tacit knowledge—with the denuded corpus of data. The my homepage at http: difference illuminates what has not survived translation; it raises the //ilex.cc.kcl.ac.uk/wlm/; question, why not? Selecting or designing software for the task in mind is Unsworth (2000a), the essays likewise based on the never wholly successful attempt to map the aims of referenced there and others by the study onto what software can do. Insertion of those unambiguous the participants; those by Tito declarative statements (i.e. markup) normally involves the same creative Orlandi, listed on his conflict between computation and cognition: that which can be formu- homepage at http: lated in the rigorously consistent and explicit terms of the metalanguage //rmcisadu.let.uniroma1.it/ is very different from thoughts about the object of study, whether these ~orlandi/; and the many are inchoate in the mind or worked out in a ‘natural language’ argument. discussions over the last few years on the Humanist It is the difference, the gap between human knowledge and mechanical Discussion Group (1997–). demonstration, that counts. Humanities computing lives and deals in 4 For the prevalent sense of data that gap. as specifically computable, see Another way of construing what happens is in terms of modelling, by the OED s.v. 1d. Here markup which I mean ‘the manipulation of symbol structures so as to bring them, is taken to mean unambiguous more or less closely, into parallel with [a] pre-established nonsymbolic declarative metalanguage system, as when we grasp how dams work by developing a theory of inserted into or otherwise hydraulics or constructing a flow chart’.5 In terms of humanities com- associated with primary puting, modelling is an iterative process of constructing and developing linguistic data to identify items something like a computational ‘knowledge representation’ as this is for processing that software defined in computer science.6 In fact we might say that a model is a cannot otherwise reliably treat. manipulable knowledge representation. I prefer to use the present par- 5 Geertz, whose words I have ticipial modelling because it emphasizes the provisional, contingent nature quoted, usefully distinguishes of a continuing activity. Although representations, of knowledge or of this sense of the word, a model anything else, can be equally provisional, the present participle usefully ‘of’ something, from a model prevents us from mistaking what we do as a past participial result, some ‘for’ it, ‘the manipulation of representation of what has been reached, understood, and so is known. the nonsymbolic systems in 104 Literary and Linguistic Computing, Vol. 17, No. 1, 2002 Lit&Ling 17/1 103-125 FINAL 23/7/02 3:41 pm Page 105 Humanities Computing: Essential Problems, Experimental Practice terms of the relationships Because machinery is so often directly involved, a ‘model’ in my sense expressed in the symbolic, as is chiefly a physical device, something one runs, tinkers with, obtains when we construct a dam results from, although the activity can of course take place in the head, as according to the specifications a thought-experiment. In either case, as it is a surrogate for or simu- implied in an hydraulic theory lacrum of the thing studied a model cannot be true. Rather it is a prag- or the conclusions drawn from matic ‘work of fiction’ (Cartwright, 1983, p. 153) designed to reach a a flow chart’ (Geertz, 1973, truth that is otherwise difficult or impossible to reach, or which one can p. 93). As I have noted intuit but not demonstrate. Often it will of necessity be quite crude— elsewhere, the literature on hence the term ‘tinkertoy modelling’ in physics7—but its verisimilitude is modelling is extensive but only a condition of its usefulness, not the point of the exercise. A com- bewildering in its lack of consensus; the usage most pletely successful model—one which performed exactly as expected— companionable with would only demonstrate that the knowledge it instantiated was trivial (in humanities computing is the mathematician’s sense); it would be of no interest as a model to found in experimental physics, research. It might be of interest as a process that generates useful results where it is applied to the for some other purpose, but properly speaking a model teaches us when construction of equipment or it fails to correspond to what we expect of it. The model can fail in two software thus to model ways, either by not working despite perfect implementation of what we otherwise unreachable know how to specify, or by working when we think it should not. Such things and events (McCarty, failures drive research forward iteratively by suggesting improvements or 1994, pp. 278–80). On pointing to the need for further investigation.8 modelling in the sciences, Examples from the sciences will illustrate my point. In one biorobotics see Groenewold (1961), Leatherdale (1974), Redhead project researchers have modelled the walking and climbing behaviour of (1980), and Cartwright the Blaberus discoidalis cockroach, with benefits both to biology and 9 (1983); in the humanities, robotics. The model, Robot III, is well known because of its relative see Frye (1991). success, but as one of the researchers has said, its failures (e.g. to climb an 6 See Unsworth (2001) for a obstacle of a kind the insect would easily scale) are much more inter- definition and detailed esting because they tell us what we do not know.10 In another project discussion of the term.

Humanities Computing: Essential Problems, Experimental Practice1

The Repah Report Without Appendices

Litterær Kompetanse

Defining Digital Humanities

Research Infrastructures in the Digital Humanities

Συστήνοντας Τα Digital Humanities (DH): Γενεαλογίες, Ιστορίες Και Προοπτικές

Repah: a User Requirements Analysis for Portals in the Arts and Humanities

Social Informatics As a Concept: Widening the Discourse

Bibliography

1 (14) Transposing the Humanities, Information Technology And

Jill Walker Rettberg

From “Informatica Umanistica” to Digital Humanities and Return

Library, Information and Cultural Heritage Management TEXTBOOK