<<

COGNITIVE PROCESSES OF AND DIGIT

IDENTIFICATION

by Teresa Marie Schubert

A dissertation submitted to Johns Hopkins University in conformity with the requirements for the degree of Doctor of Philosophy

Baltimore, Maryland January, 2015

© Teresa Marie Schubert 2015 All Rights Reserved

ABSTRACT

Letter identification, the process of determining letter identities and their positions from visual stimulus, has a long research history. Debate remains concerning the levels of representation involved, as well as the manner in which letter position is coded: letters must be specified at particular positions. Digit identification, a cognitive process with arguably parallel goals to letter identification, has been less studied. A central question concerns overlap between letter and digit identification processes: Given the similarities between digits and letters as visual symbols, do they share a single identification process or does processing differ based on stimulus category? This thesis combines two investigations, concerning the letter position code employed in identification, and the extent of overlap of digit and letter processing, including variables determining letter and digit identification speed, accuracy, and error patterns. To address the first topic, I elicit letter perseveration errors in a identification task, seeking evidence for the type of position code which can successfully predict both letter position specificity and flexibility in recognition. Consistent with prior results from other paradigms, conclusions from this study suggest a key role for representing letters based on their distance from both the beginning and end of the word. Addressing the topic, I explore the ability of character frequency, visual similarity, and visual complexity of letters and digits to predict a categorical distinction in identification performance. In behavioral experiments with unimpaired readers, I elicit identification errors for mixed letter and digit strings and same/different judgment response to pairs of characters, providing a rich empirical dataset on which to further test the influence of these variables.

The results have implications for our knowledge of letter and digit processing, and are

ii consistent with a shared identification system in which both character types are identified. Together these investigations inform cognitive processes of letter and digit identification, which is the first stage in the reading process. Reading is a skill which is remarkably automatic, yet not innate, and the study of reading introduces broad questions about the nature of cognitive and neural processes that evolved to solve other problems in visual cognition.

Thesis Committee

Michael McCloskey (primary advisor), Cognitive Science

Brenda Rapp, Cognitive Science

Colin Wilson, Cognitive Science

Howard Egeth, Psychological and Brain Sciences

Marina Bedny, Psychological and Brain Sciences

iii

ACKNOWLEDGMENTS

The completion of this dissertation would not have been possible without the support of numerous people; I mention only the most salient here.

I would like to thank Mike and Brenda, who never stopped asking hard questions and pointing out the ugly facts to ruin my beautiful theories. I could not have asked for better mentors. Thanks also to my other committee members, Colin, Marina, and

Howard, for insightful questions about my research and productive discussions of big picture issues.

Thanks to my parents, Margaret and Peter, and my sister, Christina, for always believing in me (and teaching me how to make chocolate cake along the way).

Thanks to Darko, Laura, and Megan for beautiful friendships near and far.

Thanks to my officemates, past and present (but especially past, but especially present), for keeping me sane with coffee, chocolate, and snowflake breaks. All remaining stupid questions will be sent via email. Thanks to Bob, Dan, and Chris for always being on-call to play board games, often with very little notice.

And lastly, thank you to my husband, Carter, for his unwavering faith in me through everything.

I would like to dedicate this dissertation to my brother Joe, whose life inspired me in countless ways.

iv

TABLE OF CONTENTS

ABSTRACT ...... ii

ACKNOWLEDGMENTS ...... iv

TABLE OF CONTENTS ......

LIST OF TABLES ...... x

LIST OF FIGURES ...... xi

CHAPTER 1: INTRODUCTION ...... 1

Overview of the dissertation ...... 3

CHAPTER 2: LETTER IDENTIFICATION ...... 5

Goals of letter identification ...... 7

Invariance over visual forms ...... 8

Representing position ...... 10

Invariance over position ...... 10

Theories of letter identification ...... 12

Bigram-Based Theories of Letter Identification ...... 13

Letter-Based Theories of Letter Identification ...... 17

Evidence for stored letter-form representations ...... 21

Evidence from unimpaired participants ...... 23

Evidence from neuropsychological studies ...... 26

Evidence from LHD ...... 27

Computational modeling ...... 33

Revised Caramazza & Hillis theory of letter identification ...... 35

v

Position representation in letter identification ...... 36

Letter string similarity ...... 37

Empirical evidence for flexible position coding ...... 39

Reconciling graded both-edges position coding with previous results ...... 50

Summary: Letter identification theory under consideration ...... 52

CHAPTER 3: DIGIT IDENTIFICATION AND RELATIONSHIP TO LETTER

IDENTIFICATION ...... 54

Digit identification ...... 54

Theories of digit identification ...... 54

Shared letter and digit identification ...... 57

Separate letter and digit identification ...... 59

Shared identification systems ...... 64

McCloskey and Schubert (2014) ...... 65

Review of evidence for shared system ...... 68

Reconciling differential digit and letter accuracy with shared identification ...... 77

Digits comprise a smaller set ...... 78

Differential role of and/or the right hemisphere ...... 79

Within-set visual similarity ...... 80

Character frequency differences ...... 82

Summary of hypotheses ...... 84

Overview of Studies 1-3 ...... 84

vi

CHAPTER 4: EXPLORING HYPOTHESES FOR LETTER AND DIGIT

ACCURACY DISCREPANCY (STUDY 1) ...... 86

Methods ...... 88

Frequency ...... 88

Visual similarity ...... 89

Visual complexity ...... 92

Discriminability ...... 92

Results...... 93

Frequency ...... 93

Visual similarity ...... 94

Discriminability ...... 96

Visual complexity ...... 97

Discussion ...... 100

Frequency ...... 100

Visual similarity ...... 100

Discriminability ...... 102

Visual complexity ...... 103

Moving beyond set-wise analyses ...... 104

CHAPTER 5: LETTER AND DIGIT IDENTIFICATION STUDIES ...... 106

Letter identification: Confusions and response ...... 106

Inter-letter similarity ...... 107

Visual complexity ...... 108

Letter frequency ...... 109

vii

Letter and Digit Identification Accuracy (Study 2) ...... 112

Methods ...... 113

Long Duration Experiment Results ...... 118

Main Experiment Results ...... 118

Discussion ...... 125

Response Times in Letter and Digit Same/Different Judgments (Study 3)...... 130

Methods ...... 131

Results ...... 134

Discussion ...... 143

General Discussion ...... 150

Differences between Study 2 and Study 3 ...... 151

Shared identification systems ...... 152

CHAPTER 6: POSITION IN LETTER IDENTIFICATION (STUDY 4) ...... 154

Methods ...... 156

Long Duration Experiment ...... 157

Accuracy and Perseveration Analyses ...... 158

Perseveration Position Analyses ...... 159

Long Duration Experiment Results ...... 163

Main Experiment Results ...... 164

Perseveration and position analyses ...... 165

Syllable-based schemes ...... 172

Discussion ...... 174

Conclusions about the level of position representation ...... 178

viii

CHAPTER 7: GENERAL DISCUSSION ...... 181

Summary of experiments ...... 181

Study 1 ...... 181

Study 2 ...... 182

Study 3 ...... 183

Study 4 ...... 184

Conclusions...... 186

Future directions ...... 187

APPENDIX A: CHARACTER FEATURE SET ...... 192

APPENDIX : CHARACTER CONFUSION ...... 198

APPENDIX : SAME/DIFFERENT RESPONSE TIMES ...... 201

BIBLIOGRAPHY ...... 205

VITA...... 218

ix

LIST OF TABLES

Table 2.1: LHD‘ performance on Same/Different tasks ...... 33

Table 4.1: Perimetric complexity comparisons...... 98

Table 4.2: Number of pixels comparisons ...... 99

Table 5.1: Predictor variables for by-character error rate and Same trials ...... 115

Table 5.2: Predictor variables for substitution errors and Different trial...... 117

Table 5.3: By-character dependent measures...... 120

Table 5.4: Significant predictor variables in regressions of substitution data ...... 124

Table 5.5: Significant predictor variables in regressions of Different pair RT data ...... 140

Table 6.1: Results of non-syllabic position schemes ...... 166

Table 6.2: Results of adding schemes to Narrowly-graded Both-edges scheme ...... 169

Table 6.3: Results of adding Narrowly-graded Both-Edges scheme to other schemes...171

Table 6.4: Results of syllabic scheme analyses...... 173

Table A1: Features for uppercase letters and digits...... 193

Table A2: Rules for feature overlap ...... 195

Table A3: Feature overlap matrix ...... 196

Table B1: Behavioral confusion matrix from Study 2 ...... 199

Table C1: SameAll and Different response times from Study 3 ...... 202

Table C2: SameFirst RT from Study 3 ...... 204

x

LIST OF FIGURES

Figure 2.1: Simplified schematic of letter identification and reading processes ...... 6

Figure 2.2: Abstract letter identities as a common code ...... 7

Figure 2.3: Revised theory of letter identification ...... 35

Figure 3.1: Shared theory of letter and digit identification ...... 58

Figure 3.2: Separate systems theory of letter and digit identification ...... 62

Figure 4.1: The four used in Study 1 ...... 90

Figure 5.1: RT dendrogram computed from Different trials ...... 136

xi

CHAPTER 1: INTRODUCTION

Though reading is a skill which has to be explicitly taught, literate adults are astonishingly fast, needing as little as 50 milliseconds of exposure to accurately read a word in a normal context (Rayner, Inhoff, Morrison, Slowiaczek, & Bertera,

1981). In addition to this extreme speed, readers can also recognize 26 letters

(most with a distinct upper- and lowercase form) across a wide variety of visual inputs, including countless typewritten fonts, styles, and variation in size, position, and color. Even novel forms of letters, such as those written in unusual fonts or a stranger‘s handwriting, are recognized with relatively little difficulty. The input to reading begins at the retina, and the first step of reading is letter identification. This process culminates with activation of a single letter identity. Letter identities are abstract entities, corresponding not to a physical stimulus but to a concept of identity which is shared across many stimuli: ‗A‘, ‗A‘, ‗A‘, ‗a‘, and ‗a‘ are all instances of the [A] identity.

Research on letter identification explores the representations and processes which allow identities to be selected. In particular, what are the intermediate representations which allow identification to occur? Multiple letters (forming a word or string) are often identified in parallel, adding a need to represent not only the identity but also the position of each element. The ability to accurately represent position allows the CAT and

ACT to be distinct lexical items, though they are distinguished only by the relative positions of their constituent letter identities. Therefore, in considering the computation of a letter identity one must also consider the manner in which position is coded to maintain the order of identities in the input.

1

Like letters, digits also occur as elements of combinatorial strings, and the digits must be identified before further processing of the strings can occur. Again, the position of digits is vital, distinguishing 84 from 48. Given these similarities between letters and digits—and others to be explored in later sections—an interesting question concerns the extent to which letter and digit identification processes are overlapping or identical.

Conceptual processing of digits differs from letters in that individual digits have meaning while letters1 do not; at present it is unclear whether this high-level difference influences the character identification processes. Much prior work has focused on letter identification, leaving open a number of parallel questions about the identification of digits.

In this dissertation I examine both letter and digit identification. I consider identification processes that begin with visual input of an alphanumeric stimulus, and conclude with specification of the identities of the letters and/or digits present in the stimulus along with their positions. In addition to exploring the representations and processes involved in these computations, I evaluate the extent to which they can be considered as a single shared identification system.

As an example of more general object identification processes, the study of letter and digit identification has the advantages of a relatively circumscribed set of two- dimensional stimuli and a small set of identities: 26 Latin letters and 10 Arabic numerals.

Though the processing of letter and digit stimuli is strongly tied to processing and is often thought to be distinct from identification of non-symbolic visual objects, principles uncovered in letter and digit identification may apply more broadly. The ultimate goal of the study of letter identification and reading is a theory that is capable of

1 With the possible exception of the one-letter words ―I‖ and ―a.‖

2 accounting for the success readers achieve across such a wide variety of inputs. Such a theory would suggest precise diagnoses and treatments for acquired dyslexias in letter identification (‗peripheral dyslexias‘), and combined with an account of how the identification system is acquired in children it would provide the same for developmental dyslexias. Additionally, a complete theory of the process could be formalized into a computer model which would be capable of human-like identification performance, and would have utility in industry and clinical contexts.

Overview of the dissertation

The work in this dissertation involves data from an individual with acquired dyslexia and unimpaired adult participants; the latter experiments serve as follow-up studies to extend and confirm the neuropsychological findings. First I review relevant research on letter and digit identification, respectively, and their overlap. Chapter 2 addresses letter identification, reviewing the literature of letter identification theories and computational models as well as work published in Schubert and McCloskey (2013) to a revised theory of letter identification which will be the basis for the remainder of work in the dissertation. In Chapter 3 I review the small literature addressing digit identification, and consider two proposals for letter and digit identification: separate identification systems, and a single shared system. In favor of the latter view I review recent evidence published by McCloskey and Schubert (2014) as well as the broader literature. Chapter 4 contains analyses of the properties (.., visual similarity) of letter and digit stimuli, considering the two categories as sets (Study 1).

Chapter 5 describes two studies conducted with unimpaired participants exploring the dynamics of the shared systems hypothesis of letter and digit identification (Studies 2 and

3

3) through a delayed typing character identification task and a same/different judgment task on pairs of characters. Chapter 6 returns to letter identification specifically to consider the nature of position representation with a study of perseveration errors elicited from unimpaired participants (Study 4). Chapter 7 concludes the dissertation with a general discussion, conclusions, and directions for future research.

4

CHAPTER 2: LETTER IDENTIFICATION

Letter identification operates over visual information and produces as an output one or more letter identities corresponding to the input. Though some disagreement exists on this issue, a majority of researchers agree that (for alphabetic systems) letters are a vital unit of abstract orthographic structure for word recognition and reading

(see Frost, 2012, and commentary; also Grainger, 2008, for review).When multiple letters are processed simultaneously, the position as well as the identity of incoming stimuli needs to be represented. Once identified, the representations of the letters and their positions serve as input to central reading processes which accomplish access to semantic and/or phonological information about the string. Figure 2.1 presents a simplified schematic of the components of the letter identification and reading systems; this schematic will be enriched and revised throughout the dissertation in with both past research and new findings.

5

Figure 2.1: Simplified schematic of letter identification and reading processes.

After letter identification, the central reading processes consist of lexical and sub- lexical processes, which deal with whole words and sub-word parts, respectively. These will not be considered here aside from the orthographic , which is the first processing stage on the lexical route (left side of Figure 2.1) and the long term memory store of orthographic words. The orthographic lexicon contains information about which letters, and in what order, comprise the words that an individual knows. The combined processes of letter identification and accessing the orthographic lexicon are sometimes referred to as ‗word recognition‘ rather than reading; the latter entails access to semantics and/or phonology.

Abstract letter identities may also be mapped to their corresponding letter name

(e.g., /bi/ for [B]), one of many possible written forms (e.g., lowercase handwritten b ), or another form of output such typing. These mappings are depicted in Figure 2.2. The

6 assumption is that abstract letter identities serve as a common code for converting between different types of information, including visual (as letter identification) and motor information (as writing or speaking letter names; Besner, Coltheart, & Davelaar,

1984; Brunsdon, Coltheart, & Nickels, 2006; Hillis & Caramazza, 1995; Margolin &

Goodman-Schulman, 1984; Rothlein & Rapp, 2014; Tainturier & Rapp, 2003). This view is not universal; some researchers assume that the abstract orthographic units which are the end product of letter identification are reading-specific (e.g., Behrmann, Plaut, &

Nelson, 1998; Dehaene, 2009). Though letter identification is the focus of this dissertation, the path to various modes of output is relevant for interpreting results of varying orthographic tasks (letter naming, copying, spelling, etc.).

Figure 2.2: Depiction of abstract letter identities as a common code for multiple types of representations for output and further processing.

Goals of letter identification

There are a number of goals that letter identification needs to accomplish. Three major ones will be discussed here: identity invariance over visual forms, encoding of

7 letter identity position, and identity invariance over positions. These describe the qualities of a sufficient letter identification process.

Invariance over visual forms

Identity invariance over visual forms refers to the need to recognize stimuli referring to the same letter identity (e.g., [A]) when they appears in different visual forms. A letter can appear in a number of different ways by varying the case (a, A), style

(a, a), and even material (e.g., A in pebbles on a beach). Despite the varying visual appearance of these stimuli, they all correspond to a single identity; the identification process must be able to handle this variation in input and still activate an invariant identity.

Unusual visual forms

Different proposals exist for how very unusual letter forms are recognized. One approach, as suggested by Hofstadter, is that the representations of letter shapes used in identification may be quite abstract (Hofstadter & McGraw, 1995). For example, describing lowercase q as a vertical line and a truncated ellipse could obscure the fact that q and q also correspond to forms of q. Rather, a more abstract description such as a

‘ (term intended to give more leeway: straight or curved, vertical or slanted) on a

‗loop‘ (i.e., closed region) would capture these more unusual examples.

Though it is true that the perceptual system will identify these unusual letter exemplars, it is not clear that the same identification computations occur for unusual forms as for the more standard forms with which we have extensive daily experience. A number of results have indicated that letter identification is slowed by novel and/or mixed

8 fonts and speeded by experience with a previously-unfamiliar and the use of consistent fonts (Gauthier, Wong, Hayward, & Cheung, 2006; Sanocki, 1987, 1988;

Sanocki & Dyson, 2012; Walker, 2008). Sanocki suggested that assistance in identifying the letters of an unusual font comes from seeing multiple letters in the font, allowing the perceptual system to ‗tune‘ to the systematic alterations present in the font (1987). This suggestion is well taken if you consider this unusual letter in isolation: q versus in a same-font context: queen, where its identity is obvious (font: Ergonome ghijklmnopqrst). By hypothesis, a basic letter form and the transformations applied to it to derive the surface forms of the letters are both used by the identification system to recover the basic shape of each letter. Further, this procedure takes advantage of the systematicities of shape within a font, and the same procedure is used for each letter within a given font. This approach assumes that basic letter shapes are represented, perhaps as a collection of

(somewhat-abstract) letter features, and provides an account for how variability in visual appearance is handled to extract the basic shape, from which the identity is then derived.

Though the question of how unusual letter stimuli are identified is fascinating, the bulk of reading and word recognition research has focused on letter identification of fairly standard characters with a few basic shapes (i.e., script/print forms, and upper/lowercase). These are the forms which provide the bulk of our daily experience with letters. The suggestion by Sanocki that font-tuning adds on to standard character recognition processes highlights the need to first understand and detail these underlying standard processes, and then consider what additions or alterations allow them to contend with the full range of letters experience in the wild.

9

Representing position

Encoding of letter position becomes relevant when more than one letter is present in the stimulus, such as in a letter string or word. Various schemes have been proposed for representing letter positions. These include a left- or beginning-based scheme—often the default position scheme intended by authors when a specific one is not noted—in which letter positions are considered based on their position from the leftmost/ letter. Under this scheme, CAT would contain a [C] in position 1, [A] in position 2, and

[T] in position 3. Conversely, a right- or end-based scheme counts positions from the rightmost/final letter, with [C] in position 3, [A] in position 2, and [T] in position 1.

Midpoint schemes consider positions from the center of the letter string (e.g., [C] in position -1). By contrast to these schemes, which have extrinsic anchor points, other position schemes assign letter positions based on the context of letter identities in the stimulus. These include bigram and trigram schemes, which will be described in some detail when considering existing proposals for letter identification systems. The method of representing position will determine how positions align in different strings. The position assigned to a particular letter may vary based on string , syllable structure

(for words, pseudowords), and the context of surrounding letters.

Invariance over position

Identity invariance over positions must also be accomplished by the letter identification system. This fact is particularly transparent given the fact that letters in the

Latin (c.., ) appear in the same visual form regardless of their position in the stimulus. (The exceptions are of the initial letter in proper

10 names and in sentence-initial position. However, the majority of our experience with words is with all letters in lowercase.). A g at position 1 refers to the same abstract identity as a g at position 3; both should map onto the same identity and have the same letter name. Likewise, a G at position 1 refers to the same identity as a g at position 3

(position invariance combined with visual form invariance).

Davis and Bowers (2006) refer to two problems in letter identification which are both subsumed by this concept of identity invariance over position. The first is the

‗alignment problem,‘ which is demonstrated with compound words (Davis & Bowers,

2006, p. 549). For example, to know that CAT is contained in TOMCAT the letter sequence CAT must be represented in a similar fashion despite the differing position of the letters within the word (though whether and how the positions differ depends on the manner of representing letter position). The second problem is the ‗dispersion problem,‘ referring to the fact that the reading system seems to extract generalizations about single letters (e.g., their corresponding (s)); Davis and Bowers argue that some of this information does not depend on the position of the letter within a string (2006, p. 550). A clear example of this type of information seems to be the letter name, which is clearly invariant across letter positions. Correspondences between letters and are another example, though these are often position- or context-specific (e.g., C  /s/ before

I, but C  /k/ before A).

These three basic goals define the desired end-state of letter identification: a string of abstract letter identities and corresponding positions. This outlines the computational problem of letter identification: to compute from visual input an abstract letter identity,

11 along with its position.2 Any sufficient theory of letter identification must include mechanisms that produce this output, given the known variation in letter stimuli. For the purposes of this dissertation, I will use the phrase ―letter identification‖ to include these three goals. That is, identification must additionally compute position in order to be a successful and complete identification process.

Theories of letter identification

Various competing accounts attempt to describe the mental process of letter identification (e.g., Brunsdon, Coltheart, & Nickels, 2006; Caramazza & Hillis, 1990a;

Davis & Bowers, 2006; Davis, 2010; Dehaene, Cohen, Sigman, & Vinckier, 2005;

Gomez, Ratcliff, & Perea, 2008; Grainger & Heuven, 2003; Norris & Kinoshita,

2012; Whitney, Bertrand, & Grainger, 2011). Theories3 of letter identification generally consider as input the small line segments and curves (i.e., features) which form visual letter stimuli, and describe the computations which produce orthographic representations for input to word and nonword reading processes (i.e., orthographic lexicon). Many theories are built from the foundation of the McClelland & Rumelhart (1981) Interaction

Activation Model (IAM), which accomplishes letter identification in the narrow sense of

2 In addition to letter identities and positions, in certain contexts letter case must be represented, for example to distinguish between the ‗PET‘ and the ‗pet.‘ The case of the letters needs to be represented up to the level of orthographic entries, so that these word forms can map onto distinct semantic entries. Therefore, a truly complete theory of letter identification must include representations of letter case. Further discussion on this and a proposals for letter case representation can be found in Peressotti, Cubelli, and Job (2003) and Schubert & McCloskey (2013). 3 Some authors use the terms ‗theory‘ and ‗model‘ interchangeably, though a distinction can be drawn between a theory, a set of verbal assumptions about representations and processes, and a model, a computer program which attempts to implement this processing. I will use the term used by a given author to refer to his/her own work, regardless of whether the theory is instantiated as a computational model. Implemented computational models will be explicitly noted as such or as a ‗simulation‘. In practice, predictions are drawn both from formalized simulation work and from the stated commitments of a non- or less-formal theory. However, in the case of simulations, some aspects of the simulation may not be intended as theoretical commitments but rather were required implementational details.

12 activating letter identities, but does not deal with the issue of position representation.

Two broad classes of theory can be distinguished: bigram-based, and letter-based. This classification rests on the most abstract orthographic representation of letters described by the theory, which is generally the unit which accesses the orthographic lexicon for word recognition.

Bigram-Based Theories of Letter Identification

A bigram representation combines information about letter identity and letter position (Dehaene et al., 2005; Grainger & Van Heuven, 2003b; Whitney, 2001a).

Bigrams represent two letters: For example, the bigram [CA] indicates the presence of a

[C] identity, [A] identity, and their relative position: [C] precedes [A]. So-called closed bigrams represent only adjacent letter positions while open bigrams represent non- adjacent positions, preserving relative ordering. For example, the closed bigram [CA] is found in the stimulus CAT, and the open bigram [C*A] (and closed bigram [CR]) is found in the stimulus CRATE. Theorists who posit open-bigram representations generally indicate a maximum number of intervening characters (or character spaces) which can separate the two elements of an open bigram. This limits the number of representations necessary to specify a stimulus, and open-bigram representations also include closed bigrams which are effectively open bigrams with zero intervening elements (e.g., when limited to two intervening elements CRATE is represented by [CR]

[C*A] and [C*T] but not [C*E]).

There are three influential bigram-based theories of letter identification: the Bi- modal Interactive Activation Model (BIAM, Grainger, Granier, Farioli, Van Assche, & van Heuven, 2006; Grainger & Holcomb, 2009a; Grainger & Van Heuven, 2003; Madec,

13

Rey, Dufau, Klein, & Grainger, 2011), Local Combination Detector model (LCD,

Dehaene et al., 2005; Dehaene, 2009; Vinckier et al., 2007) and SERIalization Of Letters model (SERIOL2, Whitney & Marton, 2013). Of these, the first and third have been implemented in whole or part as computational models.

The BIAM consists of two routes of orthographic coding, one of which contains bigram representations (Grainger & Holcomb, 2009; Grainger & Van Heuven, 2003b;

Grainger, Granier, et al., 2006; Holcomb & Grainger, 2006; Madec et al., 2011;

Schoonbaert & Grainger, 2004). The bigram route has received the most work and scrutiny within the theory. It includes three levels of representation: features, letters, and open bigrams. Features and letters are represented in retinotopic space. Letter detectors in the ―alphabetic array‖ are duplicated horizontally across near-fixation positions and identify abstract letters across a small range of sizes, all fonts and both cases (Grainger,

2008; Grainger & Van Heuven, 2003b). A store of open bigrams receives input from the alphabetic array, representing letter identities and their relative positions. Independently, the alphabetic array also provides input to a sequential representation of , for activation of their corresponding phonemes (on the other processing route). Grainger and

Van Heuven (2003b) implemented the alphabetic array, open bigrams, and orthographic word units in a computational model; they successfully simulated the results of published priming studies to support the feasibility of open-bigram representations in word recognition.

The bulk of work on this model has focused on the level of open bigram representations. The earliest descriptions of the theory did not include any principled limit on the number of letters allowed to intervene between the two letters of a bigram

14

(though it was limited to 2 for the purposes of simulation); this was later modified to specify that two or fewer letters can intervene (Schoonbaert & Grainger, 2004). This specification allows the model to generate predictions about similarity of letter strings, which will vary depending on the length of the string.

In recent papers Grainger and colleagues refer to simply the ‗open bigram‘ model or ‗overlap open bigram‘ model (Grainger, 2008; Grainger, Granier, et al., 2006). The latter refers specifically to a version of the model in which the alphabetic array is redefined from a strict slot-based mechanism that receives input from only one position in the visual field to instead receive input from and represent letters present across a small number of nearby positions. The strength of a letter in each slot of the array depends on its distance from the center of that slot‘s receptive field. With multiple letters active in each slot, the bigram units at the next level will end up representing not only closed bigrams but also open bigrams with a small number of intervening letters. For example, if C and A are both active in slot1 and C, A, and T active in slot2, the resulting activate bigrams will be CC, CA, AC, AA, AT, (ordered combinations of slot1 and slot2 letters); note that this includes some bigrams not actually present in the stimulus, allowing this theory to predict transposition priming (to be discussed later). The graded slot-based alphabetic array results in a slightly more principled way to limit open bigrams to two intervening letters: the cut-off results from the assumed width of the receptive fields of the alphabetic array.

The Local Component Detector (LCD) theory of visual word recognition, though similar in some respects to the BIAM, is characterized by strong ties to the neurophysiology of the ventral visual stream (Dehaene, 2009; Dehaene et al., 2005;

15

Vinckier et al., 2007). This theory proposes a hierarchy of neuronal responses building up to the abstract representation of words in the Visual Word Form Area (VWFA) in the left fusiform gyrus (Dehaene, 2009). The representations at each level, or the ‗coded units,‘ correspond to the size and properties of receptive fields, a construct that carries through the model even to the level of small words and morphemes (Dehaene et al.,

2005). The earliest stages detect simple visual features, such as oriented line segments.

The next stage of the model represents the shapes of letters as combinations of line segments, and contains separate sets of neurons coding for the lower and uppercase forms of letters. To achieve case invariance, units at the following level receive input from the neurons coding for the lower and uppercase form of a single letter, thereby computing and representing abstract letter identity. The abstract letter detectors are repeated across the visual field; though these units achieve identity invariance across visual forms they do not achieve invariance across position (Dehaene et al., 2005). This bank of detectors is very similar to the alphabetic array of the BIAM in that retinal position is encoded and letters are identified in parallel at multiple positions. The next level of representation is open bigrams, which are also duplicated across the visual field. Bigrams represent abstract (case- and style-free) letter identities, along with information of which letter of the bigram is to the left of (preceding) the other, within a given region of the visual field.

By contrast to the open bigrams in the BIAM, these representations are not location invariant.

The third bigram model: SERIalization Of Letters (SERIOL2), is an alteration of a previous model, Sequential Encoding Regulated by Inputs to Oscillations within Letter units, or SERIOL (Whitney, 2001a, 2001b; Whitney & Berndt, 1999; Whitney & Marton,

16

2013). Both models rely heavily on principles of neural computation, modeling letter recognition in a neurally-plausible manner. The lowest level of representation in

SERIOL2 consists of visual features. These provide input into abstract letter detectors which correspond to particular retinal positions. The subsequent level represents letters in a location invariant manner, with input from the position-specific abstract letter detectors. Finally, open bigrams represent letters and their relative positions, including representation of word-edges in open bigrams bordering a word boundary. These representations are quite similar to those posited by Grainger and colleagues, making largely overlapping though non-identical predictions of orthographic similarity (Lupker,

Zhang, Perry, & Davis, 2014; Whitney et al., 2012).

Letter-Based Theories of Letter Identification

Letter-based theories of letter identification include the Spatial Coding Model

(SCM, Davis & Bowers, 2006; Davis, 2010), Overlap model (Gomez et al., 2008), Noisy

Channel model (Norris & Kinoshita, 2012; Norris, Kinoshita, & van Casteren, 2010), and two unnamed theories developed by Brunsdon and colleagues (2006), and Caramazza and

Hillis (1990a, 1990b).

The first three of these are computational models of letter identification, and differ in the manner in which they solve the problem of position representation in identification. The latter two contain a more enriched description of the levels of representations involved in letter identification, but do not consistently specify the types of position representation used.

The Overlap model (Gomez et al., 2008) deals exclusively with position coding and includes only a single level of letter representations. The model includes uncertainty

17 about letter identity as well as letter position: The elements of a stimulus initially have imprecise coding along both dimensions but the precision improves with the exposure duration of the stimulus. In the limit, with unlimited viewing time, the Overlap model posits that letters have fully specified positions, with no overlap into adjacent positions.

In order to deal with strings of different , each string is aligned by its two ends and scaled to a set width, resulting in a type of position coding which encodes distance from the ends as well as the midpoint of the string (Gomez et al., 2008).

The Spatial Coding Model (Davis, 2010; Davis & Bowers, 2006) encodes letter identities and positions, as well as implementing an orthographic lexicon and the process of word recognition. Processing begins with letter features, which map onto letter units and then onto orthographic word units. The spatial coding aspect is the position representation: Each letter identity has a corresponding position gradient which defines a distribution over ordinal positions. The distributed nature of the representation implies a noisy or uncertain coding of letter position; the relative pattern of position across the letters encodes their relative ordering. More uncertainty (a wider distribution of position values) is used for longer words, and additional units mark the beginning and end letters of the string.

The Noisy Channel model of Norris and Kinoshita (2012; 2010) builds from the premise that the same computations are used in letter identification as in visual object perception more generally. The basic units of computation are features, letters, and then words. Norris and Kinoshita conceive of visual perception as a process of recovering distal stimulus information passed through the ―noisy channel‖ of the visual system. To this end, letter identification processes attempt to account for the types of degradation the

18 visual system can produce, including perturbations of evidence for the presence of letters, as well as their identities and positions. Over time the model collects additional evidence from the stimulus, refining the representations for the letters present. Like the Overlap

Model, the Noisy Channel model achieves a precise coding of letter identity and position given sufficient presentation time/evidence.

The two remaining theories of letter identification include commitments to three levels of representation, including one level intervening between visual features and letter identities. Brunsdon, Coltheart, & Nickels (2006) detail a theory of reading and spelling as suggested and supported by extensive study of a child with developmental dyslexia.

They posit that letter features map onto a font-free letter representation. The font-free representations are stored representations, and do not encode the visual features that differentiate different forms of a letter of the same case (e.g., e and e), but do encode case differences (i.e., e is represented separately from E). This is followed by a case-free letter representation, or abstract letter identity. One caveat of the font-free representations posited is that support for these representations came directly from the observed patterns of impaired and spared performance in a developmental dyslexic individual, without independent support from cases of acquired dyslexia or normal readers. As study of a developmental dyslexic individual may not reveal a damaged adult reading system but rather an abnormally-developed system, this claim would benefit from additional converging evidence.

In several papers, Caramazza and Hillis (1990a, 1990b) describe a three-stage reading theory: visual features, letter shapes, and abstract letter identities. The intermediate level represents the shapes of the letters in the stimulus. The term ‗character

19 shape‘ adopted by Schubert & McCloskey (2013), is a more neutral term for this level which captures the fact that any visual stimulus consisting of letter-like features will have a representation at this level, including pseudoletters. The features of the previous level have been combined into units corresponding to the letters (presumably on the basis of visual characteristics of the stimulus such as proximity). The character shape is a constructed representation: character shapes are stored as a collection of activated abstract orthographic features for a given stimulus. These features are more abstract than those represented at the visual feature level, with some increased tolerance for changes in size and position of the feature. This level can be considered to contain a case-specific and font-specific representation of the letters in the stimulus, without knowledge of letter identities. Abstract letter identities are represented at the third and highest level of the theory.

Caramazza, Hillis, and colleagues (Caramazza & Hillis, 1990a, 1990b; Hillis &

Caramazza, 1995; Hillis, Rapp, Benzing, & Caramazza, 1998; Rapp & Caramazza, 1991) specify that at each level of their theory, the position of elements is represented according to a particular frame of reference. The levels progress from stimulus-driven to abstract, with each progressive level sensitive to fewer differences in the visual stimulus. Visual features are represented relative to the observer in a viewer-based frame of reference. As a consequence, the contents of this level are updated often, with each change in fixation or head position. Position at the character shape level is represented by reference to the stimulus itself, which is considered to have a right and left side and a top and bottom side. These spatial properties of the stimulus are preserved in the position representation.

However, representations at this level are invariant to the stimulus‘ location relative to

20 the viewer. At the abstract letter level, the position of letter identities is represented according to their canonical ordering, rather than any relation derived from the visual stimulus itself. Caramazza and Hillis (1990a, 1990b) further specified that letters occupy positions with respect to the midpoint of the word. At this level all aspects of the form of the stimulus have been discarded: The same word presented in different locations, case, font, or orientation has identical representations.

This theory has been extended and found extensive support in neuropsychological case studies (e.g., Hillis & Caramazza, 1995; Hillis, Rapp, Benzing, & Caramazza, 1998;

Rapp & Caramazza, 1991; Subbiah & Caramazza, 2000). Patients have been reported with deficits to each of the three hypothesized levels, with viewer-based, stimulus-based, and word-based deficits respectively. Schubert and McCloskey (2013) proposed a number of additions and modifications to the Hillis and Caramazza theory, adding an intermediate allograph level between the character shape and identity levels, an explicit representation of letter case, and exploring the properties of allograph representations. In the following section I review evidence for stored letter representations preceding abstract letter identities in the literature, as well as other evidence leading to the revised theoretical account.

Evidence for stored letter-form representations

As discussed above, the letter identification process solves the difficult problem of identifying a small set of letters from a wide variety of inputs. A literate adult can recognize letters across numerous printed fonts and across changes in size, color, orientation, and more. At its core, this process is one of categorization: Inputs are sorted into one of 26 categories of letter. In addition to varying in visual properties such as size

21 and orientation, which are transformations of a basic letter shape, letters also vary in case and style. These variations are often arbitrarily related to each other. For example, g and

G are not highly visually similar, yet they represent the same letter identity. Other letter shapes which are visually related, such as C and G, do not correspond to the same identity. Mapping directly from the various letter inputs (more accurately, from representations of their character shapes as visual features) to letter identities is a hard computational problem due to these considerations (e.g., Finkbeiner & Coltheart, 2009).

This difficult categorization process can be simplified through the use of sub- categories: allographs. An allograph is a stored representation of a letter: a particular letter shape defined by a case (i.e., upper/lower) and a style (i.e., print/script), such as a, a, and A. Allographs represent the limited set of basic shapes of letters, and abstract over other visual properties of the stimulus. Thus, the visually distinct forms g and G map onto separate allographs; however, stimuli varying only in font but depicting the same letter shape/style (e.g., g and g), will activate the same allograph. Font differences are those which are systematically related to the letter shape (e.g., to create the font Courier New, serifs are added to the letters in predictable locations; to create bold type the line weights are increased), while style differences such as a and a reflect a difference in visual shape that is to some degree arbitrary.

In the following sections I present empirical evidence in support of allograph representations in letter identification. As noted previously, most explicit theories/models of letter identity reviewed here do not include a level of letter representation intervening between features and letter representations. However, many researchers—sometimes implicitly—consider letter identification to include a structural

22 description of the stimulus as a precursor to activation of letter identities, analogous to structural descriptions in general object recognition. However, the term and concept of

‗structural description‘ does not specify whether the representations are stored or computed for the stimulus, so it is unclear to what extent these representations are intended to have the same properties as allographs: stored representations of letter shape.

Evidence from unimpaired participants

Few studies have explicitly explored stored font-free/style-specific case-specific letter representations. Behavioral priming experiments have been one important source of evidence. In their 1995 paper, Arguin and Bub found that in a letter decision task on single character targets, only identical primes (e.g., - target: A - A) and not cross- case primes (e.g., a – A) had a facilitatory effect on response time. By contrast, when the task was to name a single letter target, they found robust cross-case priming effects.

They took these results to imply the presence of two types of stored letter representations: case-specific letter codes, which support letter decision responses; and abstract codes divorced from letter shape, which support letter naming. This distinction parallels the distinction between allographs and abstract letter identities, though it does not specify style-specific allograph forms (only case-specific). Petit and Grainger (2002) reported very similar results: In a masked priming letter decision task they found that same-case primes provide more facilitation than cross-case primes, while no such difference was found in a masked priming letter naming task. Similar conclusions were drawn by Norris and Kinoshita (2008) from a cross-case same-different matching task with single letters.

On each trial a probe was presented (e.g., a), followed by a masked prime (e.g., A), and a target letter (e.g., A). The task was to determine whether the target and probe were the

23 same or different. In Experiment 3 of their paper, they manipulated the task so that the decision was based either on identity matching (e.g., a is the same as a and A) or case- specific form matching (e.g., a is the same as a but different from A), as well as manipulating the visual similarity between prime and target (e.g., a and A are dissimilar, c and C are similar). Results indicated that visual similarity only affected response times in the case-specific matching condition; this task was sensitive to visual similarity at a case-specific letter level, suggesting that it depended on case-specific letter representations.

Case-specific letter frequency effects

Further behavioral evidence from unimpaired subjects comes from a letter- decision study by New and Grainger (2011). Participants distinguished between letters and pseudoletters (re-combinations of letter parts). New and Grainger correlated reaction time in the letter decision task on a by-letter basis with the written frequency of that letter

(in French). Frequency correlated with RT both in a single-character letter decision task and a character-in-string (e.g., XXCXX for C) letter decision task. Highest correlations were found using frequency counts for case-specific forms in initial word position, and token frequency (letter frequencies weighted by the frequency of the containing words) rather than type frequency. These frequency-dependent results suggest that the letter identification system stores case-specific letter representations and tracks their frequency.

The authors noted that prior studies of letter frequency effects have had variable outcomes; New and Grainger‘s use of a letter-decision task (which can easily be performed on the basis of activation at the level of stored visual forms, see discussion in

24

‗Evidence from neuropsychological studies‘ section) may have boosted their ability to find case-specific letter frequency effects.

Jones and Mewhort (2004) compiled case-specific letter frequency counts (token frequency) for English letters, digits, and typographical symbols from a large corpus (~14 million words) of New York Times articles. In a speeded single letter naming experiment with uppercase stimuli they found adjusted naming time to correlate with uppercase letter frequency but not lowercase frequency. The adjustment was subtracting out the time taken in cued naming of each letter (after a long visual presentation); this presumably results in values which reflect the visual letter identification processes rather than letter name production (output) processes, and it is therefore sensible that this measure is sensitive to case-specific letter frequency.

These studies provide evidence for the use of case-specific letter representations, though they do not speak to the existence of stored representations of forms within a case

(e.g., a and a). Walker and Hinkley (2003) reported behavioral evidence from normal subjects in a memory study indicating that font-free, case-specific visual letter representations can be used to bind color to shape. They found that accuracy did not diminish between study and test for within-case but cross-font variations in letter shape (e.g., b vs. b, [not actual stimuli]). However, when there was a case change between study and test (within font, e.g., b vs. B), accuracy diminished. They took these results to indicate that the color is bound to a font-free letter shape representation which is case-specific. Whether these representations are also style-specific (rather than font) within a case was unexamined (13 of the 14 fonts they used were print fonts, and results were not analyzed by font).

25

Turning to neuroimaging and neurorecording methods, Petit, Midgley, Holcomb, and Grainger (2006) found event-related potentials for letter stimuli consistent with the computation of case-specific representations; however they did not manipulate font in their study. Another ERP study from the same group found an early time window in which signals for the same word written in different fonts diverged; these results accord with a style-specific representation early in processing (Chauncey, Holcomb, & Grainger,

2008). With fMRI, Gauthier and colleagues (2000) found evidence for sensitivity to font differences among single letters in a right fusiform region. These results are consistent with representations of case- and style-specific letter forms in letter identification, with some indication (from ERP time courses) that these are computed early in processing.

Evidence from neuropsychological studies

Some acquired dyslexic individuals exhibit a preserved ability to distinguish upright letters from among rotated or reflected letters or to distinguish letters of the from letters of other and pseudoletters, despite an impaired ability to access abstract letter identities (e.g., Brunsdon et al., 2006; Chanoine, Teixeira Ferreira,

Demonet, Nespoulous, & Poncet, 1998; Dalmás & Dansilio, 2000; Miozzo & Caramazza,

1998; Mycroft, Behrmann, & Kay, 2009; Rapp & Caramazza, 1989; Schubert &

McCloskey, 2013; Volpato, Bencini, Meneghello, Piron, & Semenza, 2012). In the intact reading system, without assuming allograph representations, the ability to recognize letters from non-letter or transformed-letter distractors could be accomplished by accessing abstract letter identities: Letters will activate their associated identity, while non-letters will not activate any single identity. In acquired dyslexia this may not be possible if abstract letter identity representations have been lost or are not fully

26 accessible. Patients who are unable to access abstract letter identities but show a preserved ability to recognize letters from non-letter distractors could be activating stored visual forms of letters (i.e., allographs) in order to distinguish real letters (have stored forms) from non-letters (no stored form). Following this logic, the existence of individuals who have an impaired ability to activate abstract letter identities but an intact ability to recognize letter forms as known or unknown provide support for a level of stored allograph representations used in identification. A specific case of this pattern of impaired and spared performance was found in individual LHD, with acquired dyslexia.

Evidence from LHD

Despite the preceding empirical observations, to my knowledge no researchers have explicitly integrated both a constructed level of character shape and a stored level allograph representations into a theory of letter identification. We tested an individual with acquired dyslexia, LHD, on a number of tasks to provide further evidence for allographs and investigate their properties (these results and others are reported in

Schubert & McCloskey, 2013). LHD displayed the characteristic described above: limited ability to activate abstract letter identities, but intact ability to distinguish between letters and non-letter forms. Through this study we expanded the theory of Caramazza and Hillis, making explicit the delineation between constructed and stored letter representations by including both character shape and allograph representations.

LHD, a right-handed woman, was 69 years old when the study began in 2010.

She holds a Master‘s degree and was an avid reader prior to her neurological damage: a ruptured posterior cerebral artery aneurysm in February 2007. Hemorrhage and surgery resulted in a large medial and ventral left hemisphere lesion. Brain imaging revealed

27 extensive damage stretching from the occipital pole to the temporal pole, following the medial portion of the hemisphere, and affecting striate and ventral extrastriate cortex, with involvement of the hippocampal region, parahippocampal gyrus, lingual gyrus, and fusiform gyrus.

Letter Identification Deficit

Evaluating LHD‘s performance with respect to the Caramazza and Hillis letter identification theory revealed that she is able to activate character shapes but unable to accurately activate abstract letter identities. Testing with naming from oral spelling revealed that central reading processes are wholly intact (Schubert & McCloskey, submitted). Once abstract letter identities are activated LHD processes them normally, including the abilities to produce their corresponding letter names and written forms.

LHD is impaired on tasks which require access to abstract letter identities from visual input, including letter naming, copy transcoding, cross-case same/different decisions, and auditory word generation from written letter cues (see Figure 2.2).

Pseudoletter copying confirmed that she correctly represents character shapes (and can reproduce them accurately). Given the location of this deficit in the cognitive architecture of letter identification, it provided an opportunity to investigate letter representations preceding letter identities. A series of letter decision experiments provided evidence in favor of allographs mediating letter identification.

Letter Decision Experiments

Two sets of pseudoletters were utilized across three letter decision experiments.

The pseudoletters were constructed from letters, with letter parts altered through deletion,

28 addition, and/or movement, and with the constraint that a pseudoletter could not be a simple translation, reflection, or rotation of a real letter. The first set contains 13 pseudoletters based on the uppercase letters of Arial font. The second set contains 26 pseudoletters based on the lowercase letters of the Segoe UI font. Both base fonts and therefore all pseudoletters are sans . The experiments were conducted with

4-character strings and required a letter/non-letter decision, either to each character or to the entire string. The results were extremely consistent: LHD was excellent at distinguishing letter from pseudoletter stimuli. Across the three experiments, she made only one error in letter decision, misclassifying a single pseudoletter character as a letter.

The stimuli from one experiment were presented for LHD to name the letters and she was characteristically impaired on this task, with a statistically significant difference between her performance on letter naming and letter decision.

The result of the letter decision tasks is quite clear: LHD is perfectly able to distinguish letters from pseudoletters despite her impairment at naming letters. This result cannot be explained by the existing theoretical framework of character shape representations and abstract letter identities: Character shape representations cannot underlie letter decision because they represent both letters and pseudoletters without distinction. In the unimpaired system, activation at the level of abstract letter identities could support letter decision but LHD has a demonstrated inability to activate abstract letter identities normally. If LHD‘s letter decision were based on the activation at the abstract letter identity level, we would have expected some errors due to pseudoletters causing erroneous activation of a letter identity, just as letters cause erroneous activation of other letter identities (resulting in misidentifications).

29

Consequently, LHD‘s intact performance on letter decision provides additional evidence for a level of allographs which follow character shape representations and precede abstract letter identities. Allographs are learned representations of the visual forms of letters; pseudoletters do not have a representation at this level. Given her demonstrated ability to activate allographs normally in the service of letter decision, we concluded that LHD‘s deficit is in activating letter identities from this level. This updated localization of her deficit allowed us to investigate the properties of allograph representations, as described in the following section.

Allograph Properties

In a series of experiments with LHD we investigated the properties of allographs.

The locus of LHD‘s deficit provided a unique opportunity to determine what is represented at the allograph level; the deficit spares this level and affects only the mapping to the subsequent level. We utilized same/different tasks which required a decision as to whether two letter strings contained the same letters (see Rapp &

Caramazza, 1989 for similar logic and tasks). The logic in these experiments was as follows: When the decision for a given set of stimuli only required accessing allographs,

LHD should be accurate. However, if the decision required accessing abstract letter identities, LHD should be impaired. We determined the properties of allographs by submitting various stimuli to LHD in this way. Four same/different tasks were conducted, investigating case, style, size, and font. The basic procedure used for all tasks was the presentation of a pair of three-letter strings to LHD; the pair could be of two strings that differed in the property under investigation (e.g., Cross-Case) or two strings that had the same property value (e.g., Same-Case).

30

Same-Case/Cross-Case Same/Different Task. This task utilized the following

Courier New letters: b, , f, g, h, , , m, n, q, r, t, and y, in upper and lower case. LHD performed with 100% accuracy on Same-Case pairs (e.g., mfh-mfh) but showed a statistically significant impairment on Cross-Case pairs (e.g., mfh-MFH). Correct performance on the Cross-Case trials depends on accessing an abstract (case-free) representation of letter identity; one must determine that the distinct shapes f and F are both instances of the same letter identity. The accuracy difference across conditions thus represents a decrement in performance when judgments must be made on the basis of abstract letter identity.

Same-Style/Cross-Style Same/Different Task. Courier New was used as the print style and Vladimir Script as the script (i.e., ) style; the letters a, d, e, f, g, h, i, j, k, l, p, q, r, s, t, x, y, z were used. LHD performed nearly perfectly (106/107 correct) on

Same-Style trials, but was impaired on Cross-Style trials (96/108 correct, 89%). This result suggests that making a decision about the Cross-Style pairs required access to abstract letter identities and that at least some of the letter forms used are represented distinctly at the allograph level.

Same-Size/Cross-Size Same/Different Task. All stimuli in this task were in

Courier New font, size 36 or 52. The letters used were: a, b, d, f, h, k, n, p, q, r, s, u, w, x, y, and z. LHD‘s accuracy was extremely high in both conditions: Cross-Size accuracy was 98% and Same-Size accuracy was 99%. This result suggests that the differences between letters across size did not activate distinct allographs, allowing for LHD to complete the task without need for activating abstract letter identities.

31

Same-Font/Cross-Font Same/Different Task. The fonts used in this task were

Courier New and Consolas; unlike the fonts selected for the Cross-Style task these were chosen because they have similar letter forms and differ primarily in serif presence/absence. The same sixteen letter identities from the size task were used, in lowercase. LHD‘s performance was perfect in both conditions of the task. As in the

Cross-Size task, this result implies that the two font variations activate the same allographs rather than distinct ones. This finding is particularly striking when considering the serifs and visual complexity of Courier New: Fluent readers know that these are irrelevant visual details yet they are comparable in size to features which differentiate letter identities (e.g., the horizontal serif at the bottom of r is the same size as the crossbar distinguishing c from e).

To quantify the observation that the serifs and other visual differences between the two fonts are quite large, I conducted a post-hoc analysis of the visual stimuli, computing the pixel overlap between the two fonts. Examining first the Consolas letters, for 5 of the 16 stimuli the most visually similar stimulus (i.e., with the highest pixel overlap) among the Courier New stimuli was not the same-identity letter. Rather, the

Consolas a is most similar to Courier s, Consolas b to Courier h, and Consolas p, q, and r to Courier n. Considering Courier New, there are three stimuli which are not most similar to their same-identity counterpart in Consolas: Courier s is most similar to

Consolas a, and Courier k and r are most similar to Consolas x. This simple analysis demonstrates that reliance on low-level visual features representing the stimulus precisely as it appears on the retina would not support perfect performance on the same/difference task, suggesting that some visual information has been discarded in the identification

32 process. This result supports the conclusion that allograph representations are somewhat abstract, as well as the intuition that high similarity among letter stimuli is not always consistent with matching letter identity.

The results of these four tasks, as summarized in Table 2.1, provided evidence for the properties of allograph representations. We concluded specifically that allographs represent the distinct shapes of letters: print and script forms, as well as upper and lower case. Additionally, the Cross-Font and Cross-Size tasks confirmed that allographs abstract away from irrelevant visual details and size of the stimulus.

Table 2.1: LHD‘s performance on Same/Different tasks Case Style Size Font

Within property Intact Intact Intact Intact

Across property Impaired Impaired Intact Intact

Computational modeling

Future studies should seek additional evidence for the role of allograph representations in letter identification. One source of evidence for the existence of this level could be computational modeling, providing a formalized version of the categorization problem of letter identification. With such a model one could seek evidence for intermediate representations which abstract over certain visual differences in the input. However, it may be difficult to fully simulate even the somewhat limited variability of letter stimuli readers commonly encounter, a vital aspect of the computational problem. Though previous computational models of letter identification

33 have been successful without an allograph level, such models rarely provide letter inputs which vary in case or style (Chang, Furber, & Welbourne, 2012; Finkbeiner & Coltheart,

2009). One step towards modeling the full range of case and font differences was taken by Chang et al., (2012): the inputs to their simulation are in both cases, multiple fonts, and multiple sizes. However, the fonts they present (e.g., Arial, Bookman, Courier New,

Times New Roman, , 5x7 dot matrix), while common, hardly address the visual differences present in letter stimuli (e.g., none of the fonts employ the single-story lowercase a, none are italicized or script/cursive). Furthermore, the generalization performance of their model to new fonts (90%, Simulation 2, p. 2777) is hardly human- like.

Turning from the Latin alphabet to other scripts, a recent paper by Lake,

Salakhutdinov, & Tenenbaum, (2013) describes a hierarchical Bayesian model of letter- shape in terms of motor strokes. A basic inventory of motor strokes was learned from a large dataset of movies of letter productions. After the inventory has been learned, the model can correctly determine from single exemplar of a letter whether a second stimulus is an exemplar of the same letter or not. This performance suggests that the model is using visual information which is relevant to letter identity differences and discarding irrelevant differences between exemplars of the same letter. However, this study did not attempt to contend with different allographs of the same identity; each letter had only a single form (with variation on this form introduced by the humans producing in the letter) to be matched. In summary, no existing simulation can be considered to fully contend with the computational problem of letter identity invariance across visual forms.

34

Revised Caramazza & Hillis theory of letter identification

The results from letter decision tasks with LHD, combined with evidence from other neuropsychological studies and unimpaired subjects, strongly suggest the presence of an allograph level of representation in letter identification. Therefore, in Schubert and

McCloskey (2013), we proposed a revised version of the Caramazza and Hillis theory which involves an allograph level subsequent to character shape representations. This revised theory is depicted in Figure 2.3. As described above, allographs represent the distinct visual forms of letters. Activation at this level can support a decision about whether a stimulus is a letter or non-letter character, but it does not provide any information about letter identity.

Figure 2.3: Depiction of the revised theory of letter identification proposed by Schubert &

35

McCloskey (2013).

These assumptions about the representations used in letter identification answer how letter identification accomplishes goal 1: Identity invariance over visual forms. To accomplish goals 2 and 3 requires an account of how positions are represented in letter identification, which is the topic of the following section.

Position representation in letter identification

Just as different levels of representation in identification employ different units

(e.g., allographs, abstract letter identities), these levels may utilize distinct position codes.

The simplest manner of representing letter position at any level is slot-based coding (also called channel-based), in which each letter occupies one of a number of discrete positions. For example, the Interactive Activation Model (McClelland & Rumelhart,

1981) contained just four positions (slots), and was only capable of identifying the letters in four-letter words. This trivially solves the problem of alignment because each letter is unambiguously positioned into a single slot, and slot-based coding is sufficient for a toy dataset. Obviously, however, this is not a complete model of the letter identification system as we are able to identify letters in strings of varying lengths, and the system for representing letter position must therefore be considerably richer.

In addition to those reviewed above in the context of letter identification theories, a number of coding schemes have been used for representing the positions of letters during identification. A complete list includes position schemes based on the letter identities present in a string, ‗letter context‘ schemes, and schemes which do not refer to the identities of other letters in the string (‗extrinsic anchor‘ schemes, terminology from

36

Fischer-Baum, McCloskey, & Rapp, 2010). Letter context schemes include open-bigram schemes (Dehaene et al., 2005; Grainger & Van Heuven, 2003a; Whitney, 2001a;

Whitney & Marton, 2013) and trigram schemes (Plaut, McClelland, Seidenberg, &

Patterson, 1996; Seidenberg & McClelland, 1989). Extrinsic anchor schemes include those representing position with reference to the midpoint of the letter string (e.g.,

Caramazza & Hillis, 1990a, 1990b; Gomez, Ratcliff, & Perea, 2008), the first/leftmost letter (Coltheart, Rastle, Perry, Langdon, & Ziegler, 2001; Davis, 2010; Plaut, 1999), the last/rightmost letter (Friedmann, Kerbel, & Shvimer, 2010), the closer edge of the string

(first or last, Jacobs, Ziegler, Rey, & Grainger, 1998), and both edges of the string

(Fischer-Baum, Charny, & McCloskey, 2011; McCloskey, Fischer-Baum, & Schubert,

2013). Position schemes based on orthographic syllables (which have some dependence on the letter identities present) have been used by Zorzi, Houghton, & Butterworth,

(1998), Harm et al., (1999) and Perry, Ziegler, & Zorzi, (2010). These authors have employed them as the structuring principle of inputs to computational models of reading: the letters/graphemes are assigned to a specific role (i.e., onset, nucleus/, coda) within a syllable. These schemes differ in why they have been proposed (e.g., coding inputs in a neural network, as a suggestion of the code used in human cognition); what they have in common is that all have been used or proposed for the purpose of representing letter position.

Letter string similarity

In addition to providing a full description of the representations in the identification process, the position codes in letter identification also determine the similarity between letter strings. The slot-based coding of IAM creates a situation in

37 which strings are similar to each other if they activate the same letters in the same slots

(e.g., SNOT and SPOT are similar because they contain the same letters in 3 of 4 slots).

Metrics of orthographic similarity attempt to describe how letter strings are related and provide a basis for exploring effects of orthographic similarity. One such metric is Coltheart‘s N (Coltheart, Davelaar, Jonasson, & Besner, 1977), which is an index of the number of orthographic neighbors (i.e., similar lexical entries) of a word.

This metric only considers two strings as neighbors if they are related by a single letter substitution; all neighbors are the same length. Furthermore, Coltheart‘s N is binary in that two words either are or are not neighbors; the N-value for a given word is the number of other words which are its neighbors. While this metric has had a great deal of success in accounting for experimental findings thought to be tied to orthographic similarity, like the IAM it limits similarity to items of the same length. A more recent metric of orthographic similarity, OLD20, begins to capture both the intuition that the two strings CAT and CART are highly similar, and empirical data suggesting that these strings are confusable (Yarkoni, Balota, & Yap, 2008). This metric considers all strings to have some similarity to all other strings, with similarity determined by the Levenshtein distance from one string to another, allowing for letter deletions and additions as well as substitutions (based on the edit-distance metric originally proposed in Levenshtein,

1966). OLD20 can be used to determine orthographic similarity for experimental purposes, and out-performs Coltheart‘s N in accounting for empirical effects thought to be due to orthographic similarity (Yarkoni et al., 2008). The gold standard of an orthographic similarity metric will be derived directly from knowledge of letter position coding in word recognition, describing the similarity of strings based on their cognitive

38 similarity. That OLD20 provides an improvement over Coltheart‘s N in accounting for behavioral word recognition data suggests that the true similarity between letter strings is non-zero for strings of different length, further suggesting that strict slot-based coding is not a sufficient model of letter position coding.

Empirical evidence for flexible position coding

Peressotti and Grainger (1995) investigated letter position coding with consonant string stimuli. Use of consonant strings rather than words was an attempt to parse out bottom-up position coding, involved in identifying any letter stimulus regardless of its lexicality, from the combined effects of bottom-up and top-down position coding which may occur with word stimuli (top-down effects in the form of feedback from lexical processing). Using a primed letter decision task with 3-letter strings (task: determine whether the target contains either all letters or letters and a typographical symbol), they found evidence for both position-specific and position-independent letter priming, with earlier effects of position-specific priming. Given the short length in letters of their stimuli and the corresponding fact that the positional displacement of letter identities was small, they suggest that the findings can be accounted for by position-specific letter detectors with some ‗cross-talk‘ between nearby positions, with greater cross-talk between adjacent than non-adjacent positions (Peressotti & Grainger, 1995). They do not specify how position should be considered (e.g., from the left, from the midpoint of the string); it is irrelevant to their particular results as all of the strings were the same length.

They further suggest that leaky position-specific letter detectors feed activation to position-independent letter detectors. This approach would later be abandoned by

Grainger and colleagues in favor of an open-bigram position scheme, but it provides a

39 sufficient model of the position effects obtained with 3-letter strings. In addition to the conclusions of this study, a growing literature of masked priming studies has provided evidence that letter identification cannot employ strict slot-based position coding.

Masked priming with lexical decision

A large number of priming studies has addressed the issue of position representation in letter identification. The core paradigm involves a masked prime stimulus followed by a target stimulus requiring a lexical decision. Researchers compare reaction time to the target when it is preceded by an experimental prime, with some hypothesized orthographic similarity to the target, to when it is preceded by a prime, with less or no similarity to the target. Generally, primes and targets are presented in different cases to avoid low-level visual overlap (pixels/features) between primes and targets as a possible source of any facilitation. Three main prime-types are commonly used: transposed letter (TL), subset, and superset. The latter two are also known as relative-position priming, as letters of the target are present in the primes in different absolute positions but the same relative positions4.

Transposition priming has been the most popular paradigm for investigating position representations in letter identification. TL priming refers to a facilitatory effect on lexical decision times when primes consist of the target word with two central letters transposed, (e.g., prime: talbe target: TABLE) (e.g., Forster, Davis, Schoknecht, & Carter,

1987; Perea & Lupker, 2003, 2004; Schoonbaert & Grainger, 2004). There is disagreement in the literature whether these effects should be compared to the effect of

4 Naturally, the definition of ‗same position‘ depends on the positional scheme that is considered. When unspecified, researchers may be referring to a left/beginning-based scheme.

40 baseline primes consisting of unrelated letters (e.g., prime: hofhi target: TABLE) or to primes that contain replaced letters in the TL positions (e.g., prime: tafhe target: TABLE).

The use of the latter, so-called orthographic control primes, is a more stringent and perhaps appropriate control, as it accounts for the effect of other overlapping letters in the prime, leaving only any facilitation caused by the two transposed letters themselves

(Lupker, Perea, & Davis, 2008; Perea & Lupker, 2003). Regardless, consensus has been reached that primes involving transpositions of interior letters of the target lead to facilitation relative to both completely unrelated and replaced-letter primes. The effect of transpositions involving the initial and final letters of the target rather than interior letters are equivocal, and may depend on target word length (Guerrera & Forster, 2008; Perea &

Lupker, 2003; Schoonbaert & Grainger, 2004).

In subset priming, again with masked lexical decision, primes contain a subset of the letters of the target. For example, the target APRICOT may be preceded by the prime apct; this experimental condition is compared to an unrelated condition in which

APRICOT is preceded by a prime such as egrl, which does not have any letters in common with the target. Results with this prime-type reveal that it facilitates reaction times to the target word relative to unrelated primes (e.g., Grainger, Granier, et al., 2006;

Humphreys, Evett, & Quinlan, 1990; Peressotti & Grainger, 1999; Schoonbaert &

Grainger, 2004; Stinchcombe, Lupker, & Davis, 2012; Van Assche & Grainger, 2006).

The presence of that shift the subset letters into the same ‗absolute position‘

(e.g., ap--c-t) do not alter the priming effect (Grainger, Granier, et al., 2006; Peressotti &

Grainger, 1999).

41

Superset priming refers to facilitation of processing of a target word with a prime which consists of all letters of the target plus additions. For example, jusmtice primes the target JUSTICE, relative to a completely unrelated prime, and the size of the priming effect decreases with additional letters added (Van Assche & Grainger, 2006; Welvaert,

Farioli, & Grainger, 2008). Recent results have suggested that though superset priming is more effective than unrelated baseline primes, it is less effective than TL priming, which is not predicted by open bigram theories (Lupker et al., 2014).

The major conclusion from these masked priming studies can be summarized as follows: Letter position coding is not strictly slot-based, rather some level of flexibility is present. Strict slot-based position coding does not predict any transposition priming, as the transposed letter prime is no more similar to the target than a replaced-letter prime; any change from the target will produce a non-match in that position. Consider the example of tafhe: In a slot-based representation the t, a, and e overlap with the representation of TABLE, and this is the same amount (and locations) of overlap for talbe.

Many researchers conclude in favor of open bigram-based models of letter identification on the basis of these priming data. Transposed and relative-position priming argue against closed-bigram schemes, which would code a subset/superset prime and the target with fairly dissimilar representations (e.g., grdn [GR], [RD], [DN] and GARDEN [GA]

[AR] [RD] [DE] [] share only one bigram), and therefore do not predict a large facilitatory priming effect. However, the transposition and relative priming effects do not solely support open-bigram representations; the evidence is also consistent with other types of non-strict-slot position coding. One example is the graded both-edges coding scheme, which will be described in detail in a subsequent section.

42

These findings inform position representations but do not specify the level at which these position representations apply. Given the evidence presented for an allograph level in addition to character shape and abstract letter identity levels, and the likelihood that these levels could represent the position of their elements in different ways, it becomes obvious that determining the position representation(s) in letter identification needs to be specific for a particular level(s) of representation. The question of what levels give rise to the masked priming effects will be considered next.

Level of priming effects. These masked priming studies—which form the bulk of the evidence on letter position—have been conducted with unimpaired subjects, so it is more difficult to know at what particular level of representation the effects arise.

Processing of the stimuli is occurring at all levels of the identification system, though the experimental conditions may target or manipulate a particular level or levels. The implications of priming results on letter string similarity depend on the level at which the effects arise.

The TL and relative priming effects are often thought to arise at a level which encodes the position of abstract letter identities; prime and target stimuli are generally presented in different cases to rule out featural or letter shape priming. Transposition and relative position priming thereby constrain theories about position coding of abstract letter identities: Position must be represented in a manner consistent with similar representations for transposed and subset primes compared to intact target words. Lupker and colleagues (2008) found that letter frequency affects transposition priming, which suggests that the effect arises at least in part at a stored letter level of representation, such as the allograph or abstract letter identity levels. However, while the use of cross-case

43 stimuli rules out a locus of effect prior to abstract letter identities, additional influence may be derived from later levels such as lexical entries.

Many researchers assume that masked priming with a lexical decision task reflects similarity between the prime string and the word target, involving pre-activation or partial activation of the lexical entry for the target. This is consistent with a general failure to find a significant priming effect on nonword targets, as nonwords cannot benefit from pre-activation in the lexicon. Despite parallel manipulations on the nonword targets as the word targets and an equal number of trials (due to the lexical decision task), very few studies have reported priming for nonwords. Some results of nonword target priming are found in Perea and Carreiras (2008), Perea and Lupker (2003), and

Schoonbaert and Grainger (2004), however the majority of masked priming studies report no significant effects for nonword targets. Kinoshita and Norris (2009) suggested that rather than arising at a single level of representation, the priming manipulations may reflect processing of multiple levels. In this case, some priming at a prelexical level (e.g., abstract letter identities) would result in slight facilitation of word and nonword targets.

Priming in lexical access would result in additional priming for word targets, resulting in stronger and more robust facilitation for words than nonwords.

Several authors have addressed the question of whether these orthographic priming effects could be instead explained by phonological similarity between primes and targets. Researchers have determined that transposition and relative position priming effects do not seem to mirror phonological similarity (Grainger, Kiyonaga, & Holcomb,

2006; Perea & Carreiras, 2006, 2008), suggesting that they arise at a purely orthographic

44 level. However, this does not differentiate between a locus in letter identification and one in orthographic lexical access.

Same/different matching

Garcia-Orza and colleagues (2010) also reported transposition priming in a same- different task (same case) for pseudowords and consonant strings, suggesting that flexible coding is at play for letters in non-word contexts. Norris and colleagues have recently applied a similar same-different paradigm with cross-case pairs (Kinoshita & Norris,

2009; Norris & Kinoshita, 2008; Norris et al., 2010). In this task participants are first presented with a probe item (e.g., TABLE) followed by a masked prime (e.g., talbe) and a target item (e.g., table). The task is to determine whether the probe and target are the same word; these two stimuli are always in different cases. In this example, the prime has a transposed letter manipulation relative to the target. In their 2009 paper, Kinoshita and Norris report manipulations pointing to the level of abstract letter identities as responsible for the priming effects in the cross-case same-different matching task. This conclusion is supported by equivalent priming for word and nonword targets, a lack of word frequency effect, a lack of letter visual similarity effect, and a lack of effect of pseudohomophone primes: these results point to a prelexical, abstract orthographic locus

(Kinoshita & Norris, 2009). These are null effects (lack of difference between two conditions), which weakens the claim somewhat. Research by Fischer-Baum and colleagues has provided some converging evidence to back up the conclusion that the position representation of abstract letter identities treats adjacent letter positions as similar.

45

Illusory word paradigm

Fischer-Baum, Charny, and McCloskey (2011) conducted a study with unimpaired individuals providing further evidence about position representations in reading. They used the illusory word paradigm (Davis & Bowers, 2004), in which two items (words or pseudowords) are simultaneously presented and the participant is subsequently cued to recall one of the items. Participants often err by reporting some combination of the cued item and its distractor, such as reporting ―TUNE‖ when cued to recall TUBE in the presence of TANK. This is a migration error of the N in TANK into

TUBE. Previous research indicated that letters migrate into the same or adjacent position in the cued item as they occupied in the distractor (Davis & Bowers, 2004). In order to distinguish between various position representations, which make differing predictions about what positions are considered the same and adjacent, Fischer-Baum and colleagues manipulated the distractor item, varying the position of a letter to-be-migrated. For example, in TANGLE the N is in the third position from the beginning of the word, while in THRONE the N is in the second position from the end of the word. These were both be used as distractors for TUBE. They recorded how often a critical migration error (e.g., intruding the N from the distractor) occurred in the presence of beginning-based distractors, end-based distractors, and midpoint-based distractors (e.g., TRANCE). Note that the distractor items were of a different length than the cued items, allowing for different position schemes to make differing predictions.

The results of two experiments—one with typed responses and one with spoken responses—converged to reveal that migration errors most often occurred in the presence of beginning-based and end-based distractors. In Fischer-Baum et al.‘s results,

46 migrations occurred not only from exact positions but also adjacent positions, suggesting a ‗graded both-edges‘ coding of letter position. Some terminology will aid in discussing the results of this and future studies: beginning-based positions are denoted by B+X (e.g.,

B+1 ‗beginning plus one‘), while end-based positions are denoted by E-X (e.g., E-2 ‗end minus two‘). For example, the response TUNE occurred in the presence of TANGLE (an exact B+3 match) and also, less often, TRANCE (B+4 is adjacent to B+3). In other words, immediately-adjacent positions are more similar to each other than positions at a greater distance, therefore producing some migration errors of letters into adjacent positions. The authors took these results as evidence for graded both-edges coding of letter position in reading. However, as with the priming studies described above, the paradigm and use of unimpaired participants does not allow a particular level of representation to be delineated. Their evidence for the existence of a both-edges graded representation of letter position must apply to some representations in word recognition, yet the particular level or levels are not evident from these results.

Dyslexia

Friedmann, Kerbel, and Shvimer (2010) reported data from developmental dyslexic individuals who made transposition errors between words when reading word pairs. They found that in cases when the two words of the pair have different lengths

(and therefore different position schemes make differing predictions), their errors maintain relative position counting from the final (in Hebrew, i.e., leftmost) letter. They suggest that these errors arise at the level of abstract letter identities but do not rule out prior levels (e.g., allographs) as sources.

47

McCloskey, Fischer-Baum, and Schubert (2013) analyzed a large corpus of reading data from the acquired dyslexic individual LHD (described previously) to draw conclusions about position representation in letter identification. Because LHD has a deficit localized to a particular level of letter identification—the mapping between allographs and abstract letter identities—her errors reflect the position representation of these levels. Therefore, the conclusions McCloskey and colleagues drew about position representations on the basis of these errors are specific to the allograph and ALI levels

(for further discussion of this point refer to McCloskey et al., 2013).

LHD‘s primary error type in reading is letter substitution. Of these errors, perseveration errors—a letter from a previous response occurring as an error in the current response—form a high proportion. Analyses determined that such errors occur more often than expected by chance, and are therefore thought to be causally related to their presence in a previous response (the ‗source‘). Given this relationship, the authors examined the position of the intrusion and source letter, asking whether the source and intrusion are in the same position in their respective responses. This requires defining position, which was done according to a large number of position schemes in order to contrast their predictions. The schemes tested include those based on a letter‘s distance from the beginning, end, and/or midpoint of the word; orthographic syllable structure; and open bigrams within the response. Overwhelmingly, the position scheme considering distance from the beginning and end of the word was most successful in accounting for the position of the source and intrusion; when a letter perseverated from one response to another, it maintained its both-edges position within the response.

Alternative position representations, including open-bigram, syllable-based, midpoint-

48 based, and beginning- or end-based representations alone were less successful in accounting for the positions of the perseverations errors. Adding the immediately adjacent positions (‗narrowly-graded‘ both-edges scheme) improved the ability of the scheme to account for the perseverations, and including second-adjacent positions

(‗broadly-graded‘) improved it further.

McCloskey et al. (2013) concluded that the positions of allographs and ALIs are represented with respect to the beginning and end of the string, and that the position representation is graded such that adjacent positions are more similar than distant ones.

Specifically, it was determined that immediately-adjacent positions are highly similar, and second-adjacent positions are less similar. This similarity is the result of letters being represented not only at the correct, precise position within the string (e.g., T in CATCH in the B+3/E-3 positions), but also represented less strongly at immediately-adjacent positions (e.g., T weakly in B+2/E-4 and B+4/E-2) and to an even lesser degree at second-adjacent positions (e.g., T more weakly still in B+1/E-5 and B+5/E-1). This type of position coding can be considered a leaky or ‗sloppy‘ slot-coding scheme: Letters are represented most strongly at a single position but there is some spillover of activation into nearby positions.

Another aspect of some letter position theories is the assumption that letter position is initially coded coarsely (i.e., graded across positions), and longer duration stimulus presentations improve and fine-tune the representation into a precise position code. The Overlap model and Noisy Channel Model both include such assumptions. The data from LHD involves unlimited stimulus duration; under these conditions by hypothesis a normal reader would come to represent the stimulus letter positions

49 precisely. It is unclear at this point how Gomez, Norris, and Kinoshita would account for

LHD‘s data: Is the result of a deficit in activating abstract letter identities a position code which cannot be fine-tuned, thereby explaining the success of the both-edges graded scheme under conditions of unlimited stimulus duration?

The conclusions of McCloskey and colleagues are similar to those found by

Fischer-Baum et al. (2011), but additionally specify the levels of representation at which graded both-edges coding is used: the allograph and abstract letter identity levels.

Position of allographs and abstract letter identities is coded by approximate distance in letters from the beginning and end of the word (as canonically-represented, see

Caramazza & Hillis, 1990a, 1990b; Schubert & McCloskey, 2013). Furthermore,

McCloskey et al. reported evidence in favor of broad grading of both-edges position, while Fischer-Baum and colleagues only tested a narrowly graded version.

Reconciling graded both-edges position coding with previous results

How do the results of McCloskey and colleagues (2013) compare to results from other studies suggesting that letter position coding is flexible? Studies which found evidence against strict slot-based coding and instead in favor of similar representations of adjacent letter positions are consistent with the graded quality of the graded both-edges position representation. The core finding of many studies—similar representations for adjacent letter positions—is consistent with any slot-based scheme which includes some grading or sloppiness in the position representation. However, the broadly-graded both edges scheme has some particular advantages in accounting for existing data beyond this basic finding.

50

Here I consider the three main results of masked priming lexical decision studies: transposition priming, subset priming, and superset priming; and the results of cross-case same-different matching: transposition priming for words and nonwords. All of these effects are cross-case (prime in one case, target in the other), suggesting that they must take place at the ALI level or beyond. Assuming for now that the priming effects arise at the abstract letter identity level, graded both-edges coding could account for the results.

Transposition priming with adjacent transpositions is easily accounted for by the narrowly-graded scheme, which states that adjacent positions are highly similar to each other. Non-adjacent priming is less often found and produces weaker results, consistent with the suggestion in the broadly-graded scheme that second-adjacent positions, though similar, are less similar than immediately-adjacent positions.

Subset primes which involve omission of one intervening letter (e.g., grdn for

GARDEN) can likewise be accounted for with the broadly-graded scheme. Subsets with two omitted letters between letters present in the prime could potentially be explained by an even more broadly-graded scheme. Superset priming can also be accounted for by the graded both-edges position scheme: supersets formed by adding one or two letters between adjacent letters are consistent with the broadly-graded scheme. The finding by

Welvaert and colleagues (2008, see also Van Assche & Grainger, 2006) that superset priming effects are graded based on the number of added letters accords, as previously, with the suggestion that immediately-adjacent positions are highly similar, and second- adjacent positions are less similar. The similarity of the string ABCXD to its base string

ABCD is greater than ABCXXD to the base string. In beginning-based position, the

ABC elements are in the same position in all three strings, while in end-based position

51 the C in ABCXD is one position away from the C in the base string, and the C in

ABCXXD is two positions away from the C in ABCD, with the result that ABCXXD is less similar than ABCXD.

Another empirical finding which is easily accounted for by the graded both-edges position scheme is that longer word targets generally show greater transposition priming than short word targets (Forster et al., 1987; Perea & Carreiras, 2008; Perea & Lupker,

2003). This result has often been interpreted as resulting from stricter position coding for short words than long, with varying numbers of post hoc assumptions to derive that prediction. In McCloskey et al. (2013) the authors concluded that the graded both-edges scheme codes letter positions with differential strength according to their position relative to the anchor point (beginning/end of the word). Specifically, the beginning-based coding was found to be stronger for letters in the first half of the word, and the end-based coding stronger for letters in the latter half of the word, with a gradient of strength from the anchor towards the midpoint. Therefore longer words will have weaker position coding for the central letters (which are the letters generally manipulated in transposition priming) than short words, by virtue of the increased distance of central letters from the word-ends. This entails that short letter strings differing by a single central letter will be more similar to each other than long letter strings differing by a single central letter, as the representation of the central letter is weaker and contributes less to the similarity in the long strings.

Summary: Letter identification theory under consideration

The experiments in this dissertation build from the revised Caramazza and Hillis letter identification theory (Figure 2.3). This theory posits four levels of processing in

52 letter identification: visual features, character shapes, allographs, and abstract letter identities. Visual features are represented retinotopically, and consist of oriented edges and other similar elements of the stimulus. From the active features, character shapes are computed to represent each letter in the stimulus as a group of more abstract features, represented with respect to their relative position in the stimulus. Subsequently these representations activate allographs, which are stored representations corresponding to the learned basic shapes of letters. Evidence has suggested that allographs are represented according to the canonical beginning and end of the presented word/string (see Schubert

& McCloskey, 2013) using graded both-edges position coding. Allographs activate corresponding abstract letter identities at the next level, which are representations of letter identities divorced from any visual properties. Abstract letter identities are also represented with respect to the beginning and end of the word/string in a graded both- edges manner. These theoretical commitments will be utilized throughout the remainder of the dissertation in considering digit identification as well as forming the basis for the experiments conducted.

53

CHAPTER 3: DIGIT IDENTIFICATION AND RELATIONSHIP TO

LETTER IDENTIFICATION

Digit identification

The goals of digit identification largely parallel those of letter identification. Identity invariance over visual forms, though it may be less pervasive in digits than in letters (e.g., digits do not have upper and lowercase forms), is still present in a small number of distinct shapes (e.g., 4, 4 ) for some digits, in addition to ubiquitous visual differences across font and handwriting styles (e.g., 4, 4). Encoding of digit position is vital to digit reading. Unlike in word-reading contexts, when an ordering error may be recognized if it leads to a nonword, an ordering error in digit reading may not be immediately obvious, and is therefore potentially more costly. Digit identity invariance over position is equivalent to that in letter identification. Though the semantic interpretation of a digit depends on its position (e.g., compare the meaning of the 5 in 529 and 15), the digit identities (0, 1, 2, 3, 4, 5, 6, 7, 8, 9) are represented identically regardless of position in a string. In summary, the goals and challenges of digit identification are very similar to those of letter identification, though differences arise in conceptual/semantic processing of numbers (numerical processing). Analogously to letter identification, digit identification is the process of determining the digit identities and positions in a stimulus.

Theories of digit identification

By contrast to the large field of work in letter identification, little work has focused on digit identification. A few authors have proposed specific theories of digit identification, while a handful of others have simply noted some key considerations for the process. First I consider whether the letter identification theories described in

54 previous sections have relevance to digit identification. Of these theories, Gomez‘s overlap Model, the Spatial Coding Model, and the Local Combination Detectors model are all specific to letter processing. By contrast, SERIOL2, the theory of Brunsdon et al., and the Hillis and Caramazza theory include lower levels (i.e., feature representations, also font-free representations for Brunsdon) which represent both letter and digit shapes

(Brunsdon et al., 2006; Caramazza & Hillis, 1990a; Whitney & Marton, 2013). On the other end of the spectrum from the letter-only theories, the Noisy Channel model posits identification processes for both letters and digits (as instances of general object recognition; Norris & Kinoshita, 2012).

In a recent adaptation of the open bigram model, Grainger and Hannagan (2014) revised the conception of the alphabetic detectors to ‗character detectors‘ which index the presence of a particular character identity (letter or digit) at a particular location in the visual field. However, they continue to posit that only letter identities participate in bigram representations at the following level, while digit identities are coded in a more strict manner without positional uncertainty or flexibility (Grainger & Hannagan, 2014).

This places the open bigram model in the same class as SERIOL2 and the other models which posit some low-level overlap of letter and digit representations but later divergence of processing.

In their 1998 paper describing a case of acquired dyslexia, Miozzo and

Caramazza propose a theory of letter and digit identification. Their identification theory is somewhat similar to that of Caramazza & Hillis (1990a), though without character shape representations. They posit that digit stimuli activate stored structural descriptions

(i.e., allographs), in parallel with letters. Subsequently, digits directly access semantics,

55 bypassing abstract identity representations and therefore diverging from letter identification processes.

Cohen and Dehaene (1991) describe a theory of digit processing, focusing on the later semantic aspects. This theory involves representation of a constructed visual number form, by analogy to the stimulus-based character shape level of Caramazza and

Hillis. Cohen and Dehaene posit that this representation provides access to semantics for calculation and comparison tasks and to digit names, but do not address any overlap with letter identification processes (Cohen & Dehaene, 1991).

In a number of papers, Polk and Farah (1994, 1995, 1998) suggest that due to differences in experience with letter and digit stimuli, letter and digit identification processes are likely to exist in distinct neural substrates. They suggest that letters and digits rarely co-occur in experience while members of the same category often co-occur, and these two factors combine to pressure the neural system to develop distinct neural representations. This conclusion was supported by simulation work: Distinct units developed for digits and letters when the inputs consisted of pure digit and pure letter stimuli (Polk & Farah, 1995). However, a cognitive account of letter and digit processing has not been described by these authors.

One important consideration for a digit identification system is the extent to which it overlaps with (shares levels of representation and computations) or is separate from, the letter identification system. Of the systems already described, some include fully separate letter and digit processing (e.g., Overlap model, LCD model), some posit early overlap of letter and digit processing and later divergence into separate systems

(e.g., Caramazza & Hillis, open bigram model, SERIOL2), while Kinoshita and Norris

56 posit a completely shared system. In a system with purely separate processes for letter and digit identification, activation and representations in one system would not affect activation or representations in the other system. Here I describe the assumptions and commitments of separate and shared letter and digit identification systems. To make the exercise more concrete I use the revised Caramazza and Hillis letter identification system described in Chapter 2 (and Schubert & McCloskey, 2013), considering what separate and shared versions of this system would entail.

Shared letter and digit identification

The primary characteristic of a fully shared (or ‗overlapping‘) identification system is that there is no distinction between the processing of letter and digit stimuli.

Information about digit/letter category would have no impact on the identification process, and representations of both types of characters would undergo the same computations5. The four levels of representation discussed for letters would also be used for digits. At the first two levels, visual feature and character shape representations for letters and digits would share overlapping representations. Allograph and abstract letter identity representations are specific to a given character form or identity, entailing that letters and digits would activate distinct representations at these levels, but connections from the character shape level would be present to both letter and digit allographs based on visual similarity. For example, activation of the feature ―curve opened left‖ (e.g., as part of the representation for a 3) at the character shape level would activate both the 3

5 To account for the fact that readers can report the category (letter or digit) of a given stimulus, we could posit a representation of stimulus category which receives input from one of the identification levels, similar to the suggestion that one output from the allograph level is an explicit representation of letter case.

57 allograph and also the 5, B, P, and R allographs. Explicit or implicit knowledge about the category of a given character would not alter processing within the system. A schematic of this system is shown in Figure 3.1. This type of system is akin to the Noisy Channel model: an identification system which does not distinguish at any stage between the two character types.

Figure 3.1: Shared theory of letter and digit identification. Adapted from McCloskey and Schubert (2014).

The ramifications of a shared identification system are fairly straightforward: The dynamics of the system would not differ for letters and digits and therefore the other active representations (and any errors) would not be restricted by category. Errors and slowed response times in identification would result from strong competitors from either

58 character set. To the extent that visual similarity, frequency, and other variables have been shown to influence the letter identification system they would also influence digit identification. Initial support for a shared system of the type just described would be evidence for digit allographs and abstract digit identities, in parallel with evidence for these representations for letters. Results that identification errors were independent of the category of the target stimulus would provide further support; effects of variables such as visual overlap would be found independently of categorical differences.

Separate letter and digit identification

By contrast to the shared systems hypothesis—predicated on character identification without regard to character category—the separate systems hypothesis is built on the premise that letter and digit identification processes differ. Multiple versions of separate systems hypotheses can be developed which follow this principle. For example, one might posit different levels of representation involved in letter and digit identification (e.g., abstract letter identities but no abstract digit identities, as suggested by Miozzo & Caramazza, 1998, or no digit allographs due to the lack of upper- and lower-case forms). Separate systems hypotheses of this type would be support by evidence for the various posited levels of representation specific to each character set.

Another possible meaning of separate letter and digit identification systems could be that while the computations used for both character types could be the same, they could take place in different neural substrates. Some authors suggest that distinct neural substrates would naturally arise due to differing demands and contexts of letter and digit identification (e.g., Polk & Farah, 1998). The question of neural underpinnings of the cognitive identification system(s) is not the topic of this particular investigation.

59

A more subtle version of the separate systems hypotheses is one in which the same levels of representation are employed for both characters but processing differs by character type (e.g., processing within each level). To explore this type of hypothesis I will again use the letter identification theory discussed by Schubert and McCloskey

(2013) with the assumption that letters and digits are involved in the same representational levels.

This theory states that the visual features used for letters also represent other similar visual objects and therefore can be assumed to represent visual features of digits.

To the extent that letters and digits both contain abstract visual features of the type represented at the character shape level, processing for letters and digits would continue to be shared at the character shape level. However, visual differences between the stimuli

(e.g., if a particular visual feature only occurs in digits, never in letters) could lead to non- overlapping featural representations6. In this instance, stimuli with digit-features would activate these within the character shape level, while stimuli with letter-features would activate these within the character shape level, resulting in distinct representations of the two character types at this level. Regardless of the separation of character shape representations, representations of letter shapes would map onto letter allographs, followed by abstract letter identities. Similarly, digit shapes would activate digit allographs and abstract digit identities, but these representations (recall that both are stored in long term memory) would be separate from the letter representations.

Activation of an allograph (letter or digit) would inform the identification system of the

6 The feature set used in letter identification is a topic of current research and it is unclear whether letters and digits consist of distinct or overlapping sets of features. To a reasonable approximation they appear to involve the same basic inventory of abstract feature (e.g., slanted and orthogonal lines, curves), but this remains an open question.

60 category of the stimulus (and may also underlie the reader‘s ability to report the same).

Furthermore, and integral to the separate systems, processing at this point diverges dependent on the stimulus category.

In one version of such a system, an inhibitory mechanism might be in place once an allograph is fully active which would reduce activation of any allographs of the other category. (Similar effects would result if any mechanism were in place to restrict the output to the selected category, or if the cross-category inhibition described here began before any allograph was fully active.) Consequently, an identification error arising after the category determination has been made would result in activation of a same-category competitor. A separate systems account of this type is presented in Figure 3.2.

61

Figure 3.2. Separate systems theory of letter and digit identification. Both types of characters undergo the same computational steps but once the category of a stimulus has been determined (at the allograph level) representations corresponding to the other category are inhibited, limiting processing to a single category.

Without a mechanism restricting output to a single category, potential competitors (and therefore error responses) could come from either category. A subset of digit and letter identities would have some activation (particular identities dependent on the stimulus as well as other factors such as residual activation from the past and resting activation levels), and in the event that the target is not selected, the strongest competitor from either category might be selected. However, a system that operates in this way does

62 not follow the separate systems principle of processing determined by the character category: No differences in the identification processes result from presentation of a letter versus digit stimulus. This system is not an alternative to the shared system, it is rather a version of a shared system, following the principle that character category does not impact the identification process.

The hypothesis described previously—with only one identification system active at a time—involves processing dependent upon character category and entails separate systems. Assuming that the correct system is activated, an error (due to the target identity not being the most active identity) would take the form of another identity of the same category, and cross-category errors (e.g., miss-identifying 8 as B) would never occur. The resulting identity will reflect high similarity (e.g., visual feature overlap) or a robust competitor (e.g., higher frequency) to the target. It is possible that representations in the other system had activation prior to being inhibited, but these would not be selected. For example, if abstract letter identities had some residual activation from previous trials but the current stimulus activated a digit allograph, the letter system would be inhibited and the residual letter activation would not impact processing. If an error were to arise earlier in processing such that the incorrect identification system were activated and the target identification system inhibited, the resulting identity would be from the incorrect category. By hypothesis a cross-category error would reflect similarity

(e.g., visual feature overlap) between the target and a representation in the other system which resulted in the wrong system becoming active, and the resulting error would therefore be similar to the target in this way. Due to the presence of within-category errors arising from when the correct system (but not the correct target identity) is

63 selected, similarity computed without regard to category would not perfectly predict the errors produced by these separate systems. Similarity within a category, (e.g., digits with other digits) will closely parallel the within-category errors, while similarity across categories will be less successful. Furthermore, perseveration errors (in which a representation with residual activation from a previous identification is erroneously selected again) arising due to residual activation at a level after the systems separate would not be cross-category if the correct category is selected on the current trial; if a letter identity maintains activation but the letter system is inhibited this activation will not be revealed as a perseveration error.

These observations suggest sources of evidence which would distinguish separate systems from a shared system. Another consequence of separate processing systems is that acquired disorders of letter and digit identification would not be expected to associate. Deficits in letter and digit identification would occur independently and only overlap when the deficit is localized at the shared visual feature level.

The two hypotheses of shared and separate identification systems represent endpoints on a continuum of possibilities for interactions between letter and digit identification within the letter theory under consideration. Research reviewed in the following sections is generally consistent with the shared identification system described here.

Shared identification systems

In their 2014 paper, McCloskey and Schubert proposed the theory of shared digit and letter identification described above. The evidence leading to the theory‘s

64 development will be described in some detail in this section, followed by a review of evidence from other authors, largely consistent with the shared system architecture.

McCloskey and Schubert (2014)

Evidence was collected from LHD (see case history details in Chapter 2:

Evidence for stored letter-form representations) which favors the shared systems hypothesis. In addition to a letter identification deficit, LHD also has a digit identification deficit which results in digit substitution errors (e.g., 148459 → 148479).

Three main results supported the proposal of a shared identification system. First, we found evidence for digit allographs, similar to letter allographs. Second, error patterns by position (i.e., serial position functions) in letter and digit strings were very similar, substitutions were the predominant error type, and perseveration errors were present for both character types. Third, in mixed letter/digit strings, identification errors were not constrained by stimulus type: Letters and digits freely substituted for each other.

Digit allographs

By analogy to the letter decision tasks described in Chapter 2, we also administered digit decision tasks to LHD. Ten pseudodigit characters were created from the digits 0-9 (Segoe UI font, sans-serif). Six-character strings consisting of a mix of digits and these pseudodigits were presented to LHD and she made a digit/non-digit decision on each character. Her accuracy was 100% in this task; these results led us to conclude that her ability to distinguish digit from non-digit characters is intact.

By analogy to the letter decision task summarized above, the conclusion from

LHD‘s excellent performance on this task is that digit allographs underlie her ability to

65 make pseudodigit/digit decisions. Further, we concluded that LHD‘s digit identification impairment arises in the mapping from digit allographs to abstract (digit) identities, thereby accounting for her ability to make digit decisions but her impairment in identifying digits. As these conclusions closely mirror our conclusions about both the letter identification system (presence of allographs) and LHD‘s letter identification deficit

(mapping from allographs to ALIs), the most parsimonious conclusion is that letters and digits are identified in a single shared system affected by a single deficit.

Similarity of error patterns for letters and digits

In digit-string identification tasks LHD‘s errors closely resembled those she made in letter-string identification tasks. Three aspects of her errors contribute: error types, particular error-prone positions, and presence of perseverations. LHD‘s errors in digit string identification were almost exclusively substitutions, just as her errors in letter string identification were almost exclusively substitutions. The serial position functions for digit and letter strings both showed a pattern of highest accuracy at the initial position and lowest accuracy at the final position, with intermediate accuracy for intermediate positions in both types of strings. LHD also displayed an overall effect of character type

(digit accuracy higher than letter accuracy); this result is commonly reported and will be discussed in the following section. The final aspect of LHD‘s digit identification performance which paralleled her letter identification performance was the presence of perseveration errors in both. At rates exceeding chance levels, errors in digit identification contained digits which were also present in prior responses at levels higher than chance. This was true also in letter identification, providing further evidence that the identification errors for both types of stimuli arise from a single deficit.

66

Cross-category substitutions

The third major source of evidence suggesting shared digit and letter identification systems comes from character naming with mixed letter and digit strings

(e.g., G4MH32). LHD was to name each character in the strings and she often misnamed digits as letters and vice-versa. In fact, the category of her errors was independent of the category of the stimuli and therefore these errors suggest that her responses come from a system where letters and digits are both represented7. This is perhaps the strongest positive evidence that LHD‘s errors arise from a common process in which both letters and digits are represented without distinction. Interestingly, LHD was able to accurately categorize characters as letters or digits, even in the same contexts in which she made cross-category errors. This result is in line with the suggestion made earlier that knowledge of letter/digit category must be made available in a shared identification system. Based on the locus of LHD‘s identification deficit, we suggest that explicit category information is made available at the allograph level, but that this knowledge does not affect processing of character identity: Letter and digit stimuli are identified without regard to their category.

Conclusions

On the basis of these four results, McCloskey and Schubert (2014) concluded that letters and digits are identified by a single process, without regard to character category.

Versions of separate identification systems which do not posit digit allographs and

7 We are quite confident that the errors were introduced in the identification process rather than in later stages of processing (e.g., retrieving the phonological forms of letter/digit names) based on evidence from oral spelling (production of letter names) and a dice-face naming task (production of digit names). See McCloskey & Schubert (2014) for more details.

67 abstract digit identities would be unable to explain the results presented. Versions of the separate system account which posit digit allographs and digit identities would perform better, but would need to account for the total lack of category-preservation in LHD‘s errors.

Accordingly, McCloskey and Schubert (2014) expanded the letter identification theory of Schubert and McCloskey (2013) to include digit representations. The first two levels of letter identification, visual feature and character shape processing, were already assumed to represent digit stimuli in addition to letters, just as these levels represent pseudoletters and other related shapes. We concluded that there are digit allographs and that selection of letter and digit identities are not separate processes; therefore we proposed a combined level of digit and letter allographs followed by a combined level of digit and letter abstract identities. In the following sections I review evidence from unimpaired individuals as well as further cognitive neuropsychological evidence, largely consistent with shared letter and digit identification processes.

Review of evidence for shared system

Behavioral evidence from unimpaired participants

Numerous authors have demonstrated parallels between letter and digit identification, often by contrast to identification processes of non-alphanumeric shapes

(e.g., pseudoletters, symbols). Serial position functions for response times to letter and digit strings parallel each other and pattern differently from functions for symbols not generally seen in text contexts (e.g., ,; Hammond & Green, 1982). More recently,

García-Orza, Perea, and Muñoz (2010) investigated masked transposition priming in a same/different task with pseudowords, consonant strings, digit strings, typographical

68 symbol strings (e.g., &%$!), and pseudoletter strings. They found evidence for transposed character priming in the pseudoword, consonant string, digit string, and symbol string conditions, but not the novel pseudoletters. They concluded that the position of known characters is encoded in a relative rather than strict manner, and that this similarity suggests use of the same mechanism to represent position, consistent with a shared identification system. Strings of pseudoletters seemed to be represented in a stricter manner which is distinct from that used for known characters. Using an unmasked same/different task on consonant, digit, and typographical symbol (e.g., %, £) strings,

Duñabeitia and colleagues (2012) revealed a facilitation of reaction times when one string had two characters transposed (e.g., NDTF – NTDF), relative to when the string had two characters replaced (NDTF – NSBF), regardless of string type, though some early differences in concurrent ERP recordings suggested differences between letter strings and other string types.

In some contexts letters and digits may be processed distinctly from both familiar symbols and unfamiliar characters. Tydgat and Grainger (2009) compared the serial position effects in the identification of strings of letters, digits, and typographical symbols. They found a W-shaped accuracy function for letter and digit strings, but an inverted U for symbols. Results for symbol strings differed from letter and digit strings in both a two-alternative forced-choice task and a partial report task, suggesting that the result is robust to task and may reflect the underlying identification processes for these stimuli. With lateralized string presentation, Chanceaux and Grainger (2012) found an accuracy advantage when presenting letters and digits in the leftmost position in the left visual field, but no such advantage for symbols or Greek letters. One exception to the

69 common finding of analogous patterns for letter and digit processing is a study by Mason

(1982) which found different serial position functions for letter and digit RT in a character detection task with 5-character strings. It is unclear why these results differ from those revealed by Hammond and Green (1982) in a similar task, but the majority of research of this type suggests that letter and digit strings are processed in a similar manner.

Results which suggest that known stimuli (digits, letters, typographical symbols) are identified distinctly from novel stimuli (Greek letters) can be accounted for within the shared identification theory. Novel stimuli would activate visual features common to known stimuli, but processing would diverge at or prior to the allograph level (by definition, novel stimuli will not have stored forms), with the potential to lead to different serial position functions. Moreover, the consistency of letter and digit serial position functions is consistent with a unified identification system for these characters.

A few recent studies have demonstrated that replacing letters with visually-similar digits (e.g., 4 for A) is nearly as effective as an identity prime in facilitating the identification of word and pseudoword targets, but that the reverse substitution (i.e., letters for digits, S for 5) creates an ineffective prime for digit strings (Perea, Dun, &

Carreiras, 2008; Perea, Duñabeitia, Pollatsek, & Carreiras, 2009). Though Perea and colleagues interpreted these results as due to a letter-specific feedback mechanisms which regularize the shape of the digit stimuli, recent evidence from Kinoshita and Lagoutaris

(2010) suggests that feedback may be not involved. Rather, Kinoshita and Lagoutaris‘ experiments with pseudowords, digit strings, and letter strings favor an interpretation in which letter and digit stimuli activate visually similar items both within and across

70 categories. They demonstrated that the difference between priming in digit string and pseudoword contexts depends on working memory capacity (i.e., pseudowords are easier to maintain than digit strings), and found comparable priming results when using more closely-matched target stimuli: digit strings and letter strings. These results are highly consistent with a unified letter and digit processing stream, in which letter and digit stimuli are able to activate both letter and digit identities, and do so according to visual similarity.

Kinoshita and colleagues (2013) conducted a further study of masked priming with letters of the prime replaced by letters and digits. They manipulated the visual similarity of the substituting characters, with visually similar letters and digits and visually dissimilar letters and digits (e.g., substituting for A, similar: H, dissimilar: D, similar: 4, dissimilar: 6; similarity determined by experimenters; parity of similarity levels across category verified by 2AFC identification accuracy). They found strong priming when letters were replaced by a visually similar digit or visually similar letter, though the letter priming effect was weaker and did not differ from the priming found with visually dissimilar letters. The ability of digits to prime letters which they resemble suggests that letters and digits are processed in a shared identification system. The authors suggest that differences between the facilitation produced by the letter and digit prime conditions may be due to the use of the lexical decision task, which biases processing towards letter interpretations of the stimuli. Digit stimuli which are visually- similar to letters are assumed to activate those letters, while letter stimuli are assumed to activate (or equivalently, provide evidence for) only their corresponding letter identities, not those for other letters; resulting in no difference between visually-similar and

71 visually-dissimilar letter primes because neither activate the target letter. Given their explanation of this finding as resulting from the task used, it would be informative to consider a similar manipulation with a different task, such as the masked cross-case same/different task, which does not require lexical activation. This would serve to test whether there is an asymmetry in the effects of visually similarity on letter and digit processing in a shared system, as Kinoshita et al.‘s results suggest.

The preceding results all refer to character strings; single character identification has been studied more rarely. Single letter and digit identification was studied by

Starrfelt and Behrmann (2011) in a single character identification task. Participants were asked to name a single briefly-presented masked character drawn from the digits 0-9 and letters A-J (uppercase only). At all durations which did not result in performance at ceiling, average digit accuracy was superior to average letter accuracy (Starrfelt &

Behrmann, 2011). While these results suggest that digits are identified more quickly, it does not bear on whether the identification processes are shared with letters. The implications of the reported difference in identification accuracy will be considered in more detail in subsequent sections.

Neural evidence from unimpaired participants

Evidence for or against the shared systems hypothesis is lacking in the neuroimaging and neurorecording literatures. Relatively few studies have presented both letter and digit stimuli, and evidence for preferential digit or letter responses is extremely mixed. fMRI studies have often found no neural preference for digit over letter stimuli in ventral regions at the group level, consistent with a lack of neural specificity for digit versus letter recognition (James, James, Jobard, Wong, & Gauthier, 2005; Polk & Farah,

72

1998; Polk et al., 2002; Price & Ansari, 2011; Reinke, Fernandes, Schwindt, O‘Craven,

& Grady, 2008). However, as Shum et al. (2013) noted, ventral temporal cortex is prone to susceptibility artifacts in fMRI; it is unclear whether null fMRI effects in these regions reflect a lack of power due to low signal or a lack of preference for digit stimuli. Despite this, some researchers have reported regions of the right ventral visual stream which show a preferential response to digit strings relative to letter strings (Carreiras, Quiñones,

Hernández-Cabrera, & Duñabeitia, 2014; Park, Chiang, Brannon, & Woldorff, 2014;

Park, Hebrank, Polk, & Park, 2011; Polk et al., 2002; relative to words: Reinke et al.,

2008), and the complementary pattern of preference for letters over digits has been found in the left hemisphere (single characters: Cantlon, Pinel, Dehaene, & Pelphrey, 2011; single characters and strings: James et al., 2005; strings: Carreiras et al., 2014; Park, Park,

& Polk, 2012; Polk et al., 2002). These results suggest that digit processing may be predominate or stronger in the right hemisphere, relative to letter processing in the left hemisphere.

Evidence from electrophysiology fails to clarify the status of digit and letter neurotopography. Studies have reported preferential digit responses at both left and right inferior temporal electrode sites; nearby sites have been found to respond similarly to digits and letters (Allison et al., 2004; Roux et al., 2008). Recently, Shum and colleagues

(2013) reported detailed results with single digit as well as letter, pseudo-letter, pseudo- digit, and word stimuli. They discovered seven electrode sites (in 4 subjects, of 7 total studied) which showed a preference for digits over all other stimuli. These sites were in the inferior temporal gyrus or fusiform gyrus; five of the seven were in the right hemisphere (3 subjects), the final two (1 subject) were in the left. Additional sites which

73 responded to digits greater than other characters (letters and pseudo-characters) or digits greater than words (including number words) were found in the inferior temporal/fusiform region of both hemispheres. Though suggestive of preferential digit processing at some sites, these results also imply variability in location and preferentiality of putative digit-selective regions across subjects, which may also contribute to the difficulty of identifying such areas in fMRI group studies. Recently, using ERP,

Duñabeitia et al. (2012) found differential effects of transposed characters in letter strings as compared to symbol strings, with some suggestion of a letter-like pattern for digit strings. Park and colleagues (2014) reported early (~175ms) differences in waveforms for letter and digit strings. Over left hemisphere sites, letters showed higher signal than digits and the pattern was reversed in the right hemisphere, and data from single letters and single digits revealed the same pattern. These results are consistent with the fMRI findings of a reversed effect for letters relative to digits in the right and left hemispheres.

In summary, studies have reported preferential digit processing in both the right and left hemispheres (and sometimes bilaterally in the same participant), as well as null results. Importantly, the neural evidence reviewed demonstrates voxel or electrode-site preferences for particular stimuli rather than exclusivity of processing in these regions.

Univariate analyses indicate a relative increase in neural activity in one experimental condition, but generally the conditions activate largely overlapping areas. Additionally, the evidence does not specify the cognitive processing stage(s) which underlie the neural responses. In skilled readers such as the adult participants in these studies, digit and letter recognition as well as subsequent processing of magnitude/semantics is likely automatic. In the one experiment which might have probed pre-recognition processing

74 by presenting digit and letter stimuli extremely briefly (50ms, forward and backward masked by #) only null effects were reported for the comparison between the character categories (Price & Ansari, 2011). Thus the extant neural evidence does not adjudicate between shared and separate letter and digit identification processes, though other types of neural evidence or more sophisticated analysis techniques (e.g., representational similarity analysis) may do so in the future.

Evidence from cognitive neuropsychology

Further evidence for the overlap of letter and digit identification processes comes from the neuropsychological literature. In an identification task (similar to that employed by Tydgat and Grainger, 2009), Ziegler and colleagues (2010) found that a group of developmental dyslexic children displayed parallel serial position functions for letter and digit strings, distinct from the function for symbol strings. Additionally, the developmental dyslexia case reported by Brunsdon and colleagues (2006) demonstrated parallel deficits in letter and digit identification. These results suggest that in at least some cases of developmental dyslexia, letter and digit processing may be jointly impaired, consistent with shared identification systems.

In cases of acquired dyslexia, letters are generally affected equally or more severely than digits (e.g., Cohen & Dehaene, 1995; Greenblatt, 1973; Grossi, Fragassi,

Orsini, De Falco, & Sepe, 1984; Ingles & Eskes, 2008; Larsen, Baynes, & Swick, 2004;

McCloskey & Schubert, 2014; Perri, Bartolomeo, & Silveri, 1996). This was the conclusion of Starrfelt and Behrmann (2011) in a large review of pure alexia (90 cases):

They reported that these individuals generally have an impairment in both single letter and single digit processing, though letters tend to be more impaired. Holender and

75

Peereman (1987) reviewed a selection of cases of individuals with left hemisphere damage (not limited to pure alexia) and found that many had deficits in single digit identification in addition to their language deficits. These reviews considered only single character identification, but it can generally be assumed that a deficit in single character identification will also impair identification of characters in strings.

Ingles and Eskes (Ingles & Eskes, 2008) compared letter and digit identification performance of one individual with acquired dyslexia to five control participants with brain damage not affecting reading. All of these participants completed an attentional blink task requiring identification of two target letters or digits at varying stimulus onset asynchronies (SOAs). The control subjects were at ceiling for identifying both letter and digit targets, except for the second target at the briefest SOAs for which they showed a digit advantage. The acquired dyslexic individual was at ceiling for identifying digits as the first target, but had lower accuracy than controls on letters as the first target and both letters and digits as target two. This study provides additional evidence for impairment of letter identification over digit identification in acquired disorders as well as further control data suggesting that digits are identified more rapidly and accurately than letters.

In addition to the general observation that both letter and digit identification are affected in many individuals, there have been no reported neuropsychological cases of an identification impairment for digit stimuli that did not also affect letters (McCloskey &

Schubert, 2014; Polk et al., 2002; Starrfelt & Behrmann, 2011). One puzzling exception is the case reported by Cipolotti, Butterworth, & Denes, (1991) of an individual who, subsequent to a stroke, was unable to identify any letters, or to perform letter decision tasks, but was able to identify and perform digit decision exclusively for the digits 1, 2, 3,

76 and 4. Numerical processing and calculation skills were extremely impaired, though some relative sparing was found for the numbers 1-4. The interpretation of this pattern is unclear; in particular the selective sparing of some digits with complete obliteration of others may suggest a different mode of processing than the usual digit identification system. It is quite common for impairments of numerical processing (i.e., of number semantics, calculation, comparison etc.) to be reported without concomitant difficulty with letters but these impairments do not reflect a deficit in digit identification.

Three individuals with acquired deficits have displayed an extremely similar pattern of impairment for letter and digit stimuli (Katz & Sevush, 1989; McCloskey &

Schubert, 2014; Patterson & Wilson, 1990). In all cases the impairment for letters and digits has been position-specific, affecting the same string positions in letter strings/words and digit strings. These cases provide strong evidence in favor of overlapping digit and letter identification processes, with specific and parallel patterns of impairment for both stimulus classes, as discussed above with respect to LHD.

In summary, though evidence from neuroimaging and neurorecording is equivocal, the vast majority of behavioral studies with unimpaired and impaired readers are consistent with the proposal of shared letter and digit identification processes. I will maintain the theoretical position that these processes are shared and explore the implications of this assumption in the remainder of this dissertation.

Reconciling differential digit and letter accuracy with shared identification

Though evidence suggests that digit and letter identification processes are shared, the fact remains that digit identification performance generally exceeds letter identification. As reviewed above, both unimpaired individuals and individuals with

77 acquired deficits tend to show better performance for digit than letter identification. This pattern might suggest that letters and digits are identified in separate systems which can be independently damaged (with the letter system more prone to damage than the digit system). However, the pattern is not inconsistent with a shared identification system if an explanation can be provided for how the pattern arises within the system. In this section I consider the general pattern of superior average accuracy (or lower RT) on the set of digits compared to the set of letters: This is the basic pattern which has been reported, though accuracy of individual letters may exceed that of some digits (e.g., McCloskey &

Schubert, 2014). Four hypotheses have been proposed in the literature to explain this core empirical phenomenon.

Digits comprise a smaller set

One explanation for the superiority of digit identification is that there is a smaller set of digit identities, thereby facilitating the process of activating the target digit. A few authors have suggested this possibility, though the exact mechanism for the facilitation has not been described (Cohen & Dehaene, 1995; Ingles & Eskes, 2008; Polk & Farah,

1998; Polk et al., 2002). Starrfelt and Behrmann (2011) made this explanation more explicit, suggesting that it results simply from the higher chance rate of correctly guessing one of ten possible digits rather than one of twenty-six possible letters.

However, this hypothesis presupposes that the reader (or her/his identification system) has knowledge that the target is a digit and accordingly is not considering any letter identities. This hypothesis is not compatible with evidence suggesting that character identification is not constrained by knowledge of the category of the stimulus. It may be true in some particular contexts that readers are only considering letter or digit targets;

78 however this situation is not the only situation in which the accuracy discrepancy between letters and digits is found.8

Differential role of semantics and/or the right hemisphere

Digits and letters differ in their semantic content: Single digits are logographic symbols which convey a meaning, while letters convey meaning only by virtue of their combination into larger units9. Some authors have suggested that this difference might involve a more direct or stronger connection between digit identification and conceptual processing than between letter identification and conceptual processing (Cohen &

Dehaene, 1995; Ingles & Eskes, 2008; Starrfelt & Behrmann, 2011). This could entail stronger feedback connections supporting digit identification, leading to more accurate or faster identification performance. Some authors specifically suggest that digits have stronger representations in the right hemisphere than letters do and cross-hemispheric connections therefore preferentially support digit identification (Cohen & Dehaene,

1995). In a review of the neuropsychological literature on letter and digit naming,

Holender and Peereman (1987) suggest that the right hemisphere stores some semantic representations for digits but not linguistic stimuli such as letters and words. They posit that the right hemisphere may provide support for digit naming when the left-hemisphere is damaged. This hypothesis is difficult to test empirically with behavioral data; the role and strength of feedback from semantics and/or the right hemisphere are currently unknown.

8 In particular, Starrfelt and Behrmann reported higher accuracy for digits than lowercase letters from unimpaired readers in a task with digit and letter trials randomly intermixed (Starrfelt & Behrmann, 2011). In this context it is non-obvious that participants would be considering only same-category competitors. 9 Exceptions are the letters A and I, which can be interpreted as one-letter words depending on context.

79

Within-set visual similarity

Differential degrees of visual similarity among the sets of digits and letters has been proposed as another explanation for why digit identification is superior to letter identification. Cohen and Dehaene (1995) first suggested this possibility: If letters are more visually similar to other letters than digits are to other digits it could be easier to determine the identity of a digit from its competitors than a letter from its competitors

(see also Polk et al., 2002).

Starrfelt and Behrmann (2011) expanded upon Cohen and Dehaene‘s original explanation for the digit-letter discrepancy and sought to test it empirically. They first computed the mean pairwise pixel overlap for lowercase letters and digits in two common fonts. Using this measure, they computed the number of competitors for letter and digit stimuli independently, by considering the pairs of same-category stimuli with an overlap value higher than a set threshold. At all thresholds they considered, digits had the same number or fewer competitors than letters, therefore digits have higher discriminability (i.e., lower confusability) than letters. (At the lowest threshold this is necessarily true because there are fewer digits than letters). This result suggests that digits are easier to pick out from the set of digits than letters from the set of letters.

However, the explanation also requires the assumption that digit and letter identities are only being selected from among the set of same-category identities: digits among digits and letters among letters, which does not accord with the shared systems hypothesis under consideration. Considered in the shared systems framework, a high degree of overlap of letters with any other characters (in terms of visual features at the character shape level) would entail a large number of active character shapes and subsequently

80 allographs and abstract letter identities, which could slow the selection of the single target letter. If there is less overlap between digit features and features of other characters, fewer other representations will be activated concurrently and competition will be less; these conclusions would require comparison of visual overlap with the entire set of characters as competitors.

An open question concerns visual similarity across the combined set of letters and digits. Under the assumptions of the shared character identification system, letters and digits compete with all other characters, not just those of the same category.

Additionally, pixel overlap measures of visual similarity have the disadvantage of being dependent on the font of the stimuli. Pixel overlap computed for different fonts, though they may show some consistency, also reveals different patterns of visual similarity

(recall the post-hoc analyses of Consolas and Courier New described in Chapter 2). The sensitivity of this measure to low-level visual attributes (e.g., serifs) may not accurately reflect the level(s) of representation at which letters and digits are perceptually similar.

Certainly, the digit-letter accuracy difference seems to hold across the (unknown) range of fonts used in all studies reporting it, leading to the presumption that it may arise from level of representation(s) which are not sensitive to cross-font visual variations (or that the variation is font appearances is small). Visual similarity measures which deal instead with somewhat abstract letter features (e.g., horizontal bar, curve opened left), thought to be represented at the character shape level, may provide a more motivated basis on which to visually compare the set of letters and digits.

81

Character frequency differences

Ingles and Eskes (Ingles & Eskes, 2008) suggested that differences in written frequency of occurrence for digits and letters might explain the performance discrepancy.

However, no data was brought to bear on this hypothesis. To provide an explanation for the superior performance on digits as a set, the average written frequency for digits would need to differ from that for letters. Presumably identification performance would be facilitated by a higher frequency of occurrence: if digits were more frequent than letters.

Though letters are overall more frequent than digits in written material, it is possible that a frequency analysis would reveal higher frequency for some digits than some letters, or that a comparison of upper case letters and digits would obtain in the required direction.

Assuming that the empirical data to be explained concerns both cases, case-specific frequency measures should be used. It is unknown whether the neuropsychological literature surveyed by Starrfelt and Behrmann (2011) and Holender and Peereman (1987) employed uppercase or lowercase letters; a mix of both is likely.

Effects of character frequency would suggest that the activation of allograph and/or abstract character identity representations are modulated by character frequency.

Effects of case-specific frequency suggest localization to the case-specific allograph representations while case-collapsed frequency effects suggest localization to character identity representations (note that case-specific letter frequency effects have been reported, see Chapter 2: Evidence for stored letter-form representations). Such an effect could be produced if frequency were encoded in the resting activation of the representations, such that more frequent items would have a higher resting activation and therefore be faster to activate. Another similar alternative would be that more frequent

82 items have a lower threshold to be fully activated. This type of explanation is the same as that used to explain word frequency effects in word recognition and reading, with differential resting activation or thresholds of lexical entries based on word frequency.

A small number of studies have suggested that digit and letter frequency may be encoded in the identification system. It has been observed in studies of synesthesia that the colors reported for letters and digits correlate with character frequency. Specifically,

Beeli, Esslen, and Jäncke (2007) reported significant correlations between the saturation of synesthetic colors for each character with their respective frequency, and Smilek,

Carriere, Dixon and Merikle (2007) reported significant correlations between luminance and character frequency. However, both of these studies considered the letter and digit sets separately; it is not known whether the relationship holds across the combined set of characters.

Evidence for character-frequency effects across the collapsed letter and digit character sets would provide evidence for frequency tracking for both types of characters.

To my knowledge no study has considered the effect of frequency on a by-character level across combined letter and digit stimuli in any context. Such an effect would be consistent with the shared systems hypothesis as well as providing at least a partial explanation for accuracy and RT discrepancy between the two character categories: rather than a categorical effect of letter or digit status, character accuracy/RT would correlate with character frequency. Two tests of the hypothesis are wanting: an assessment of set-wise character frequencies (i.e., Are digits more frequent than letters?) as well as a by-character analysis comparing character identification performance to character frequency.

83

Summary of hypotheses

Four major hypotheses have been proposed to explain the observed discrepancy between digit and letter identification. However, none of the hypotheses has thus far provided a satisfactory answer. Some hypotheses require unsupported assumptions, such as that the category of a character is used to constrain the identification processes. Others are simply not backed up with sufficient data to explain the phenomenon. In Study 1 I explore the validity of the visual similarity and character frequency accounts. As discussed above, the impact of the set of digit stimuli being smaller and possible differential influence of feedback from semantics are difficult to test and I do not undertake to do so here.

Critically, these hypotheses were proposed to explain a difference in identification performance across the entire set of digit and letter characters, what I refer to as categorical or set-wise versions of the hypotheses. However, considering set-wise accuracy or RT may obscure differences in accuracy across individual characters. The phenomenon of higher digit than letter accuracy may only be true for the mean accuracy, not for all digits compared to all letters; this level of performance detail has not been reported. In the case of LHD‘s identification performance her average accuracy for digits exceeded her average accuracy for digits, but there was overlap in the accuracy for individual digit and letter characters (McCloskey & Schubert, 2014).

Overview of Studies 1-3

I will consider whether the pre-conditions exist for frequency and visual similarity hypotheses to explain the categorical distinction reported between letters and digits, the

84 set wise versions of these hypotheses. These analyses do not require collecting identification data: The hypotheses require that digits are more frequent than letters and that digits have lower visual confusability/complexity than letters, which can be tested on the basis of character frequency metrics and visual properties alone. Of these variables, frequency and complexity are a measure of a single character‘s properties, while similarity is defined as a relationship between two or more characters. Any of these variables could help explain the accuracy/RT discrepancy between the two sets of stimuli if it is correlated/confounded with the letter-digit categorical distinction. Even if the effects of the variable arise within a shared systems architecture, an apparent categorical distinction between the two character types could arise in this way. Revealing such an effect(s) would reduce tension between the reported performance discrepancy and an underlying shared identification system. The comparisons of letters and digits on these three variables will comprise Study 1.

Examining the ability of specific variables to produce differences in the set wise processing speed/accuracy of characters also introduces a broader question about character identification. This question concerns the variables which determine accuracy and speed of character identification more generally. For example, if it is found that A is identified more rapidly than Z, can this be explained by the higher relative frequency of

A than Z? A second related question concerns the confusions among characters: When a stimulus character is not correctly identified, why do particular responses occur as errors?

These two questions will be addressed through parallel analyses of Study 2 (accuracy) and Study 3 (response time) data. I also explore the predictions of shared and separate identification systems for the results of these studies.

85

CHAPTER 4: EXPLORING HYPOTHESES FOR LETTER AND

DIGIT ACCURACY DISCREPANCY (STUDY 1)

Analyses in this study consider whether character frequency and/or visual similarity are viable hypotheses for explaining the observed categorical difference in identification performance for letters and digits. The hypotheses involve assumptions that letters and digits differ in particular ways (e.g., in their frequency), and further that the differences underlie differences in letter and digit identification performance. The analyses of this study consider the precondition for the hypotheses: Are there systematic differences between letters and digits in frequency or visual properties? Support for the hypothesis that a particular variable (e.g., frequency) is responsible for the accuracy discrepancy requires not only the precondition to be true (e.g., letters are less frequent than digits) but also evidence that this difference affects identification performance. The analyses in this study serve as preliminary analyses to the investigations of by-character effects in Studies 2 and 3 (in Chapter 5).

Metrics for each character were used to compare the characteristics of the letter and digit sets, including frequency, visual overlap, discriminability, and visual complexity. Of these, the first and final are properties of a particular character, which are computed independently of the remainder of the character set. Visual overlap and discriminability necessarily refer to the relationship between a given character and a larger set of characters.

Visual overlap and discriminability differ depending on the set of characters considered as the comparison set. In this study these measures were computed over two comparison sets: the entire set of letters and digits (all characters), as well as separately

86 within letters and digits. For letters, this means the measures were computed across all characters and across the set of letters; for digits they were computed across all characters and across the set of digits.

In the case of feature overlap, use of these two comparison classes has different theoretical implications. As indicated in Chapter 3, the shared systems theory of identification posits that during identification of letters and digits no distinction is drawn between the two character sets; characters of both types can gain activation and compete with each other. By contrast, separate systems hypotheses posit that letters and digits are identified in somewhat independent and distinct processes after the visual feature level

(NB: Not to be confused with the character shape level, where abstract letter/digit features, such as those in the feature set developed here, are represented) in which they are only competing with representations of same-set characters. Previous work comparing visual properties of letters and digits (discriminability based on pixel overlap) was conducted comparing a given character only to its same-set competitors, an implicit adoption of an extreme separate systems hypothesis, which assumes that even at the level of representation of pixels, representations of letter and digits do not compete with each other. Analyses of both all-character and same-category visual overlap were conducted.

At the level of feature overlap, the distinction between all-character and same-category competitors follow the assumptions of the two architectures discussed. At the level of pixel overlap the distinction is much less meaningful, as it is difficult to imagine any system which could differentiate letter stimuli from digit stimuli based on pixels alone.

87

Methods

Throughout this study, comparisons were conducted using nonparametric statistics (Spearman‘s rank-order correlation and Mann-Whitney U-test) and two-tailed significance testing. These methods are suited to the unequal variances and small/unequal sample sizes present in the data. The measures computed here are retained for use in by- character analyses in Studies 2 and 3 (Chapter 5).

Frequency

The first analysis compared published metrics of letter and digit frequency, determining whether the set of digits differ in frequency from the set of letters. Of existing character frequency metrics (e.g., Benford, 1938; Jones & Mewhort, 2004;

Mayzner & Tresselt, 1965; Solso & King, 1976), Jones and Mewhort‘s is the only to contain uppercase and lowercase letters as well as digits. These authors computed digit and case-specific letter frequencies from five corpora: New York Times archives, the Brown corpus (source of the Kučera & Francis word frequency counts; 1967), a set of randomly-selected webpages, an discussion forum, and an online encyclopedia.

They concluded that the edited corpora—New York Times and Brown—were most consistent with commonly-encountered text and recommended use of the former.

The average frequency for letters (each case separately) and digits were compared using the published New York Times counts. To be consistent with the categorical version of the character frequency hypothesis, digits would need to be more frequent than one or both cases of letters.

88

Visual similarity

The second analysis compared metrics of visual similarity for the sets of upper case10 letters and digits. Two measures of visual similarity were used: pixel overlap of images of letter and digits, and feature overlap. The average visual overlap among the set of digits was compared to the set of letters, for both visual similarity measures.

Pixel Overlap

The pixel overlap metric was computed in MATLAB2013a (Mathworks) by computing the overlap of black pixels in images of single letter and digit stimuli.

Characters were in 36pt font and centered (within 300x300pixels of white background) for maximal overlap. This was conducted for four fonts: Arial, Consolas, Courier New, and (sans serif, sans serif, serifed, serifed, respectively). Consolas and Courier New are fixed width fonts, while Times New Roman and Arial are proportional fonts. Times New Roman is the font used for body text in this dissertation.

Arial is a common fonts for text displayed on a computer screen, including in popular applications such as Gmail. All four fonts are displayed in Figure 4.1.

10 I considered exclusively uppercase letters in this and subsequent analyses. The reasons are to limit the characters to present as stimuli in the experiments of Studies 2 and 3, and to give a larger possibility of finding visual overlap and frequency effects for the letter and digit sets. Upper case letters occupy the same region of space on a line as digits—unlike lowercase letters which may occupy a smaller region (e.g., a, x), or extend below the baseline (e.g., j, g)—increasing the measured overlap between letters and digits centered within the standard character frame.

89

Figure 4.1: The four fonts used in this study: Consolas (sans serif, fixed-width), Arial (sans serif, proportional), Courier New (serifed, fixed-width), Times New Roman (serifed, proportional), respectively.

Feature Overlap

The feature set developed here was meant to approximate the type of abstract feature representations hypothesized to be present at the character shape level. These features include lines and curves, which are assumed to be invariant to character size and position in the visual field but specific to relative size (i.e., one feature can be larger than another) and position within a character (e.g., feature ‗horizontal line of length 1 at top of character‘ is distinguished from ‗horizontal line of length 1 at bottom of character‘). The feature overlap metric considered an inventory of letter and digit parts, along with their positions within a letter-based frame. Pelli and colleagues (2006) concluded that features, smaller than entire letter-shapes, are used in letter identification and further that features are independently activated and utilized by the identification system.

Specifically, they concluded that the identification of a letter depends on the activation of each component feature. This result suggests that representation of letter features is implicated in letter identification. Guidance for the feature set developed here came from previous proposals (Boles & Clifford, 1989; Briggs & Hocevar, 1975; Changizi &

Shimojo, 2005; Fiset et al., 2008; Pelli et al., 2006; Petit & Grainger, 2002). Features of

90 the same type but of different lengths and positions are considered to have some overlap.

This is meant to capture the assumption that there is similarity in the representation of features that differ only in these parameters.

The variation of letter and digit visual forms complicates describing the forms in a straightforward manner; a veridical description of one character stimulus may not be general enough to capture that same character in other fonts or forms. With this visual feature set I sought to capture the features of the standard fonts of uppercase print letters, without accounting for the full range of variability. The font used in the behavioral experiments of Studies 2 and 3 (Consolas) served as a model for the letter and digit shapes11.

Generally letters have been split into features at points of discontinuity in the visual stimulus (e.g., Changizi & Shimojo, 2005). In the interest of using a small set of atomic feature units to describe many letters, some letters were split into smaller segments in the absence of a discontinuity in the line. For example, U was split into two straight segments and one curve, whereas Changizi treats it as single stroke. This division of the character captures that U contains vertical segments which are quite similar to those in H, D, etc., which would be obscured if U were treated as a single feature. This decision is also supported by Pelli et al., (2006) who failed to find evidence for whole-letter templates/features in letter identification. A full listing of the features, overlap metric, and resulting overlap values for all characters are in Appendix A.

11 The only exception is the zero character, which in Consolas is a crossed-zero (0). This is a particularly unusual form of this character and therefore the standard form (0) was described in the feature set.

91

Visual complexity

Visual complexity was calculated three ways: First as the number of visual features and second as the number of black pixels in a given font. In addition, following the suggestion of Pelli et al., (2006) and Ziegler, Pech-Georgel, Dufau, & Grainger,

(2010), perimetric complexity (defined as the square of the perimeter of the ink area, divided by total ink area) was computed for each character. Perimetric complexity was computed over the same image files used for pixel overlap calculations. The method described in Pelli et al. (2006) was implemented in MATLAB. Complexity as computed by all three methods was compared for letter and digit sets.

Discriminability

Starrfelt and Behrmann (2011) developed a measure of discriminability for letters and digits which considers the highest competitors for a given character (i.e., the other characters with the highest pixel overlap with that character). Here I considered a simplified notion of discriminability, defined by the overlap value of the highest competitor for each character. The set of competitors was considered separately as either all other characters, consistent with the shared identification system, or the other characters of the same set (as computed by Starrfelt & Behrmann). The pixel and feature overlap measures were both used. For example, the highest pixel competitor for A

(Consolas) considered out of all the characters is 4, while A‘s highest competitor among the Consolas letters is R. Considering features, the highest competitor for A is A across all characters and also within just the letters.

92

Results

Frequency

Comparing lowercase letters to digits confirmed that on the whole, lowercase letters are more frequent than digits (Mann-Whitney U = 41.0, N1 = 10, N2 = 26, p < .01).

The average frequency of lowercase letters (2.37 million) exceeds that of digits (283,418) by nearly an order of magnitude. The most frequent digit (0) is significantly less frequent than the most frequent lowercase letter (e; .546 million vs. 7.74 million). These two comparisons are inconsistent with the hypothesis that better performance for digits than lowercase letters could arise from digits‘ higher frequency. The very large range of frequencies across the lowercase letters (54,221 to 7.74 million, q to e) suggests that it is important to consider the frequency of the particular subset of letter stimuli when the full alphabet is not used. Considering the relative ranking of lowercase letters and digits, all of the digits are ranked below 21 of the letters; there is some interleaving of the letter and digit frequency values at the lower end of the frequency ranks.

The discrepancy between frequencies of uppercase letters and digits is much smaller and in the opposite direction: The average frequency of uppercase letters

(135,677) is lower than that of digits (283,418). Comparing these sets indicates that digits are more frequent than uppercase letters (U = 44.0, N1 = 10, N2 = 26, p < .01).

Unlike with lowercase letters, the four most frequent characters among the combined set of uppercase letters and digits are digits (0, 1, 5, 2); half of the letters are less frequent than all of the digits. As with lowercase letters, when a subset of letters is used (e.g.,

93

Starrfelt & Behrmann, 2011)12 the properties of those particular characters determine whether sets of letters and digits differ.

Cross-case frequency (i.e., the average of lower- and upper-case frequencies) differs from both lowercase (U = 230.0, N1 = 26, N2 = 26, p < .05) and uppercase frequency (U = 41.0, N1 = 26, N2 = 26, p < .001). Given the numerically higher frequencies of lowercase letters, their average with uppercase letters results in cross-case frequency higher than digit frequency (U = 49.0, N1 = 10, N2 = 26, p < .01).

In summary, counts of uppercase, lowercase and cross-case frequency differ significantly from digit frequency, but the latter two in the reverse direction of the frequency hypothesis. The comparison between uppercase letters and digits revealed that digits are of higher frequency than letters. However, with all three comparisons there was considerable intermixing of the frequencies of the two character types.

Visual similarity

Pixel overlap

As it is directly computed from the visual appearance of the letters, pixel overlap varies by font. For example, of the four fonts used here (Arial, Consolas, Courier New,

Times New Roman), cross-correlation ranged from .19 (Consolas with Times New

Roman) to .99 (Courier New with Times New Roman).

Pixel overlap was calculated for every pairwise combination of characters for each font separately. Two measures were then computed: average overlap between a

12 The letter stimuli used by Starrfelt & Behrmann (A-J) include some high frequency letters (A, B, C) as well as a low frequency letter (J), with the effect that the average for this letter set is very similar to that found for the entire alphabet, though the minimum is much higher in the subset. Like comparisons to the full alphabet, the frequencies of the digits are higher than the letters (U = 20.0, N1 = 10, N2 = 10, p < .01).

94 given character and all other characters, and average overlap between a given character and only the other characters of that same type (letters with letters, digits with digits).

Considering first the relationships among the full set of characters, there are significant differences between letters and digits for only one of the fonts: Courier New (letter average: .35; digit average: .29; U = 66.0, N1 = 10, N2 = 26, p < .05). In Arial, Consolas, and Times New Roman there is no difference (Arial: p > .4; Consolas: p > .4, Times New

Roman: p >.1). Comparing pixel overlap computed only within category reveals differences for 2 of 4 fonts; in Arial and Consolas (the sans-serif fonts) the letters and digits differ (ps < .05) while in Times New Roman and Courier New they do not (ps >

.5). Both of these differences are in the reverse direction required for a pixel overlap explanation of higher digit accuracy: Digits in Arial and Consolas have higher average overlap than letters.

Feature overlap

Unlike pixel overlap, feature overlap involves more abstract visual properties which do not vary by font (within a style; ‗uppercase print‘ is considered here). As above, overlap of letters and digits was compared in two instances: including all characters, and including only within-set characters. All analyses comparing letters and digits on feature overlap failed to reach significance, regardless of whether the overlap included all characters or just within-set characters (ps > .1). Feature overlap for letters is

.13 across all characters and .14 within letters only; for digits the average feature overlap is .11 across all characters and .12 within digits only. The lower overlap values for feature than pixel overlap reflect the fact the any feature can have pixel overlap with any

95 other feature in the same location, while only features of the same class have any overlap with each other.

Discriminability

Pixel overlap

I calculated the competitor with the highest overlap value for each character, from the full set of characters and separately from the set of within-category characters, for each of the four fonts. The set of maximum values for the letters was then compared to that for the digits. Considering pixel overlap across all characters, letters and digits do not differ in the value of their highest competitor in three of the fonts considered (ps >

.1). However, they do differ in Times New Roman: Average maximum overlap for letters is .68 and for digits is .61 (U = 71.5, N1 = 10, N2 = 26, p < .05). Note that this discriminability measure reflects the overlap of the most visually similar competitor, which is independent of the number of competitors. Considering pixel overlap within set only, letters with letters and digits with digits, three of four fonts show an advantage for digits such that digits have lower highest-competitors than letters (ps < .05). The exception is in the font Arial, in which letters and digits do not differ (p > .5).

Feature Overlap

The same discriminability measure (i.e., overlap value of highest competitor) was also calculated from feature overlap. Highest competitor was computed among all the characters as well as from within-set characters. Letters and digits do not differ in discriminability computed across all characters (U = 116.5, N1 = 10, N2 = 26, p > .6).

However, letters and digits do differ in discriminability computed within set (U = 74.0,

96

N1 = 10, N2 = 26, p < .05). On average, the highest competitor for digits has an overlap value of .29, while for letters it is .37; letters have stronger within-set competitors than digits.

Visual complexity

Number of features

The number of features for each character was counted from the feature set described above. The average number of features for letters is 2.65; the average number of features for digits is 2.3. The number of features for letters and digits was compared using a Mann-Whitney test; no significant difference was found (U = 97.0, N1 = 10, N2 =

26, p > .2). The feature set was designed to capture abstract features of letters (e.g., does not included serifs) and therefore does not differ by font (e.g., H in all four fonts is described by two vertical lines and one horizontal line).

Perimetric complexity and number of pixels

Use of different fonts results in different values for number of pixels and perimetric complexity and therefore the outcome of comparisons of these variables depend on the font used. Tables 4.1 and 4.2 detail the comparisons for each font.

In Consolas, the perimetric complexity and number of pixels of letter and digit stimuli do not differ (ps > .1). In Arial, perimetric complexity does not differ between letters and digits (p > .3) while number of pixels does differ (p < .05). Courier New and

Times New Roman display significant differences in both perimetric complexity and number of pixels between letters and digits (all ps < .05). In both fonts, digits have fewer

97 pixels and lower perimetric complexity than letters; this is numerically true also for the sans-serif fonts.

Table 4.1: Perimetric complexity comparisons Font Font type Digit average Letter average Mann-Whitney U p-value All Digits v. All Letters†

CONSOLAS Sans serif 63.4 68.4 109.0 .475 0123456789

ARIAL Sans serif 67.0 74.7 101.0 .320 0123456789

COURIER NEW Serifed 116.8 148.6 23.0 < .001* 0123456789

TIMES NEW ROMAN Serifed 76.2 107.2 22.0 < .001* 0123456789

†N1 = 10, N2 = 26; *Significant at p < .05 level, 2-tailed

98

Table 4.2: Number of pixels comparisons Font Font type Digit average Letter average Mann-Whitney U p-value All Digits v. All Letters†

CONSOLAS Sans serif 1173.6 1224.8 111.5 .520 0123456789

ARIAL Sans serif 1330.5 1608.6 70.5 .034* 0123456789

COURIER NEW Serifed 622.9 738.0 70.5 .034* 0123456789

TIMES NEW ROMAN Serifed 895.5 1251.9 44.0 .002* 0123456789

†N1 = 10, N2 = 26; *Significant at p < .05 level, 2-tailed

These three measures of complexity are intercorrelated. Number of pixels correlates with perimetric complexity (Spearman‘s ρs range from .76 to .88, all ps <

.001). Across three of the fonts tested, number of features correlates positively with perimetric complexity (Spearman‘s ρs from .25 (Arial, ns) to .45, all significant ps < .05).

On the whole, number of pixels does not correlate with number of features (ps > .1) except with the font Courier New (Spearman‘s ρ = .35, p < .05). The positive correlation between number of features and number of pixels in Courier New may be due to the large serifs in this font; the number of serifs (and therefore the number of pixels) generally increases as the number of features increases.

99

Discussion

The analyses conducted in this study do not indicate systematic differences between letters and digits. Broadly, the four variables considered were frequency of occurrence, visual overlap, visual discriminability, and visual complexity. The latter three were computed for both pixel- and feature-based metrics, which are assumed to index different levels of processing in the identification system. Of these variables, visual overlap and discriminability depends on the nature of visual relationships among the characters, while frequency and complexity are properties of a single character. I briefly review the results of each analysis, discussing the conclusions that can be drawn from each.

Frequency

The analyses of character frequency revealed opposite patterns depending on letter case. Considered as a set, lowercase letters are more frequent than digits while uppercase letters are less frequent than digits. However, in both comparisons there is a large amount of intermixing of character frequencies such that some digits are more frequent than some letters and some digits are less frequent than some letters. Overall, the lack of systematic difference in character frequencies is inconsistent with the postulation of frequency as the cause of the performance discrepancy between digits and letters.

Visual similarity

Visual similarity and discriminability comparisons were conducted with all characters and within-set characters as separate comparison sets. Results from these two

100 analyses differ in their ability to account for differences arising within a shared or separate identification system: all characters are potential competitors in a shared system while only within-set characters compete in separate systems.

Pixel and Feature Overlap

The pixel overlap comparisons are consistent with the observation that pixel overlap measures vary widely across font. Comparisons between pixel overlap of letters and digits to all other characters were nonsignificant in three of four fonts, but significant in Courier New. On the other hand, when pixel overlap was computed relative only to within-set characters, only Arial and Consolas letters and digits differed from each other, but in the opposite direction. These results present a very inconsistent picture of any letter-digit differences and do not suggest a role for this type of visual similarity in explaining letter/digit identification performance.

The results of the feature-based visual similarity analyses are straightforward: No differences were found between letters and digits in their feature overlap. The feature set was designed to include features which are abstract, i.e., invariant to properties such as size, absolute position, and line thickness, and reflect features at the character shape level. To the extent that this attempt was successful, a lack of difference between letters and digits on overlap at this level suggests that at the character shape level the set of digits and the set of letters differ only in terms of their connections to higher representations (i.e., allographs, abstract identities), not featural representations. The conclusions from pixel and feature overlap measures are inconsistent with any ability of these variables to explain letter/digit accuracy discrepancies.

101

Discriminability

Discriminability is a measure of the strength of the highest competitor for a given character. As with visual similarity, it was computed using two different overlap measures: pixel overlap and feature overlap, and for each measure one analysis considered all other characters as possible competitors, while a second analysis only considered characters from within the same set (letters for letters and digits for digits) as competitors.

When considering competitors across the entire set of characters, differences between letters and digits were only found for Times New Roman. On the other hand, three of four fonts tested showed a difference in within-set discriminability for letters and digits: Letters have higher competitors within the set of letters than digits do within the set of digits (exception: Arial font). In their 2011 paper, Starrfelt and Behrmann conducted a discriminability analysis using pixel overlap of Times New Roman and

Arial, finding that digits have more high-valued within-set competitors than letters do.

Their result is consistent with the result presented here of differences between digit and letter discriminability among within-set competitors.

Discriminability based on feature overlap revealed a similar outcome: No differences when considering competitors across all characters, but digits have lower- valued highest-competitors within-set than letters do. The consistency of this result with the general pattern obtained with three fonts in pixel-overlap suggests that it may be a general property of the visual forms of letters and digits.

As discussed in the introduction, positing a variable such as discriminability as the cause of the digit/letter performance discrepancy also involves a commitment to a

102 particular architecture. The finding that discriminability largely only differs when considering within-category rather than all-character would be consistent with separate identification systems but not shared systems. However, given that the result of higher digit than letter within-category discriminability only obtained at the pixel overlap level, it is not obvious what relevance this result may have for processing. It is unclear how a system might differentiate letters and digits based on pixel representations, before any contact has been made to more abstract features (it is conceivable that distinct sets of features could represent letters and digits, see Footnote 6, p. 59) or to stored representations.

Visual complexity

Visual complexity was calculated in three ways: as number of features in each character, number of pixels in each character, and perimetric complexity. Considering features, the result is very straightforward: Letters and digits do not differ in the number of features that comprise them. For pixels and perimetric complexity, results varied by font; significant differences were found with two of four fonts. For both Courier New and Times New Roman, letters have higher average perimetric complexity and number of pixels than digits. These differences parallel the font types: Sans-serif fonts show no difference while serifed fonts do (see Tables 4.1 and 4.2 for examples of letters and digits in each font). In serifed fonts, the serifs on the letters are more prominent and frequent than on digits (e.g., 6, 9 have no serifs in either font), which would reduce the number of pixels and perimetric complexity relative to the serifed letter stimuli. In addition, Arial showed a significant difference between letters and digits in number of pixels.

103

Pelli and colleagues (2006) reported that the log of average perimetric complexity for an entire alphabet (e.g., Latin letters, Chinese characters, Hebrew letters) correlated highly with the log of identification efficiency (a measure of accuracy across varying levels of visual noise): More complex alphabets were harder to identify. This observation is in the same direction as the one observed here with the serifed fonts: Digits, which show an identification advantage, were shown to be less complex than letters. However, while Pelli‘s result was robust to a large number of alphabets and font styles—they included both serifed and sans-serif fonts—the present results were only obtained with serifed fonts. The relative frequency of observing letters and digits in serifed and sans- serifed fonts is not known, but to a reasonable approximation most text read on a screen uses sans-serif fonts (with the exemption of digital versions of newspapers/magazines) while printed material tends to use serifed fonts. Furthermore, the studies reporting better digit than letter performance likely used a mix of fonts. The lack of difference found with sans-serifed fonts does not contribute to a uniform explanation for the finding that digits are identified more rapidly and accurately than letters.

Moving beyond set-wise analyses

A number of the results in this study provided partial or suggestive evidence for a role of character frequency or visual properties in facilitating digit over letter identification, but consistent results were not found across analyses and some results would be consistent only with separate identification systems. Stronger conclusions could be drawn by comparing by-character performance to by-character measures of frequency, visual similarity, and complexity (which were considered only collapsed across sets in the current study). The ability of these variables to predict identification

104 accuracy/RT would provide a more robust explanation of the digit and letter performance discrepancy, and allow for examination of whether the setwise discrepancy is an accurate portrayal of the relative difficulty of characters. This is the topic of Studies 2 and 3, in

Chapter 5.

105

CHAPTER 5: LETTER AND DIGIT IDENTIFICATION STUDIES

In this chapter I consider the ability of the variables discussed in Study 2 to account for empirical data: character identification errors and response times. Through numerous studies have published confusion matrices and accuracy/RT measures for individual letters, digit confusions have rarely been reported. The tasks used in the current two studies—identification with a delayed typing response, and a same/different judgment with character pairs—are both assumed to require representing the character(s) through the character shape level, and either require (delayed typing) or may automatically involve activation of abstract character identities (same/different).

Analyses of the results of these tasks will explore the nature of identification processing for letters and digits and the degree to which the variables considered above are relevant predictors of identification accuracy and speed. In addition, the relevance of the findings to the existence of shared versus separate identification systems will be explored.

Letter identification: Confusions and response time

The vast majority of previous research considering performance on individual characters has addressed letters and letter identification, without considering digits. Prior suggestions for performance differences among letters have been in terms of three variables: visual similarity to other letters, visual perceivability or complexity, and written frequency of occurrence.

106

Inter-letter similarity

Inter-letter visual similarity has often been investigated through confusability matrices: errors produced in letter naming under conditions that induce low accuracy

(e.g., brief presentation, masking, peripheral presentation, low contrast, for recent reviews see: Mueller & Weidemann, 2012; Simpson, Mousikou, Montoya, & Defior,

2012). Of studies presenting theses error matrices, many do not go beyond presenting the matrix and comparing it to those collected by previous researchers, though it is common practice to perform a clustering or similar analysis to reveal the structure of the confusion data (Fisher & Monty, 1969; Gilmore, Hersh, Caramazza, & Griffin, 1979; Loomis,

1982; Phillips, Johnson, & Browne, 1983; Simpson, Mousikou, Montoya, & Defior,

2012; Townsend, 1971). Confusions are assumed to reveal similarity: if any two letters

X and Y have highly similar representations this will result in identification errors of X for Y and vice-versa. Some authors have used a confusability matrix to derive a feature set which could account for the observed confusions, or have attempted to predict confusions using a feature set (e.g., Gervais, Harvey, & Roberts, 1984; Geyer & DeWald,

1973). Such feature sets often include such features as horizontal lines, vertical lines, angles, and curves (e.g., Briggs & Hocevar, 1975; Gervais et al., 1984; Geyer & DeWald,

1973; Wiley, Wilson & Rapp, in prep.) These analyses have also been carried out on response time data collected from same/different tasks on pairs of letters (Courrieu,

Farioli, & Grainger, 2004; Podgorny & Garner, 1979, WIley, Wilson & Rapp, in prep.).

107

Visual complexity

The majority of attempts to characterize letter identification errors have focused on inter-letter similarity. However, additional insight comes from analyzing performance for each letter independently, in addition to confusions between letters. A number of studies have attempted to explain the ―perceivability‖ or ―‖ of particular letters, particular fonts, or particular alphabets (e.g., Mueller & Weidemann, 2012; Pelli et al.,

2006; Roethlein, 1912; van der Heijden, Malhas, & van der Roovaart, 1984).

Perceivability is meant to index the difficulty in identifying a particular letter, independently of inter-letter similarity (as a visual metric this is necessarily specific to a given letter form, e.g., a and a may differ in perceivability). In a recent paper, Mueller and Weidemann (2012) presented evidence suggesting that there is a role in identification for perceivability after accounting for inter-letter similarity. They additionally found that perceivability is influenced by a masking character in the identification task. Other studies have used a measure of complexity—operationalized by the number of pixels in the letter strokes, or perimeter normalized by pixel area—to account for perceivability, and found negative correlations between letter complexity and letter identification accuracy (Appelman & Mayzner, 1982; Pelli et al., 2006). However, it is unclear whether complexity is independent of inter-letter similarity. These studies suggest that the visual properties of a single letter contribute to identification performance, with some evidence that it accounts for an independent source of variance from inter-letter similarity.

108

Letter frequency

Beyond visual variables, multiple authors have reported frequency effects on identification performance. Frequency of occurrence is a property of a single letter, though relative or summed frequency of two or more letters may also be relevant to identification performance. Studies employing case-specific letter frequency metrics were reviewed in a previous section (Chapter 2: ‗Evidence for stored letter forms‘).

Some researchers additionally investigated effects of case-collapsed letter frequency on letter identification. In a review of 58 studies of letter naming and same/different judgments, Appelman and Mayzner (1981) found evidence for a weak correlation between by-letter accuracy for lowercase letters and frequency, but no correlation between by-letter accuracy for uppercase letters and frequency. In studies collecting response time to uppercase letters, these authors found a significant correlation between by-letter reaction time and letter frequency. In a letter search task involving the detection of a given letter target in a string of five letters, Pitchford, Ledgeway, and Masterson

(2008) found that by-letter correct response time correlated with letter frequency. These results, and those reviewed previously, provide broad support for the notion that letter frequency is a relevant variable in letter identification.

The studies reviewed here address inter-letter similarity and confusions, as well as metrics of a given letter. They suggest that frequency and visual properties (similarity, complexity) are relevant to the letter identification process, and have provided proposals for the visual features used in identification. The shared system theory would be strengthened by an understanding of the variables which are relevant to identification performance for both types of characters. No study to date has considered these variables

109 for the full set of characters: letters and digits. Only one study to my knowledge has collected identification data for letters and digits (excluding some early experiments, which have peculiarities of stimuli such as dot matrix fonts or fonts designed for radar screens), but this study was limited by separately analyzing the data for letters and digits, and not including all of the letters (Starrfelt & Behrmann, 2011). Additionally, they did not report by-character accuracy or response time for post-hoc analysis. By collecting identification accuracy and response time for all letters and digits I will gain a rich data set to analyze for influences of character frequency, visual complexity, and visual similarity, and inform the nature of character identification processing.

Predictions of shared and separate identification systems

Separate systems

In the version of separate digit and letter identification systems considered here, processing diverges at the allograph level, at which point information is available about the letter/digit category of the stimulus. Within such a system, described in detail in

Chapter 3, two broad predictions can be derived for the results of Studies 2 and 3. With respect to identification errors (Study 1), separate systems predict that any errors which arise after the allograph level will be within-category errors. This is due to the inhibitory

(or similar) mechanism which reduces activation of other-category representations in the other system, only allowing representations of the stimulus category to compete for selection. Errors arising earlier in the processing system could result in cross-category errors due to the incorrect system being selected (the inhibitory mechanism would in that case be erroneously inhibiting same-category representations). In addition to predicting

110 the category of the errors with respect to the category of the stimulus, the separation of processing also impacts the type of similarity that is expected to obtain among the characters. In particular, character accuracy (Study 2) as well as response times (Study 3) will tend to be predicted by visual similarity which is computed from same-category representations (digits‘ similarity to other digits, letters‘ similarity to other letter) rather than similarity computed from all representations (all characters‘ similarity to all other characters). In both Studies two visual similarity independent variables are tested, one that indexes within-category similarity (consistent with errors arising within a single system) and one that indexes across-category/all-character similarity.

Shared system

Predictions can also be drawn from the shared identification system. They are often opposing to those of the separate system, though one should note that processing in both systems (shared and separate) is shared up through the allograph level. The shared identification system places no constraints on the category of an error made; any stimulus will activate representations from both categories and no restriction is made during processing to limit representations to the category of the stimulus. Additionally, by contrast to the separate systems, similarity across all characters is relevant to predicting errors and response times. This entails that the variables which index similarity across all characters (not restricted by category) should be more successful than those which index only within-category similarity. The particular variables which make this distinction will be highlighted in each study.

111

Letter and Digit Identification Accuracy (Study 2)

In this study, I created a behavioral confusability matrix for letter and digit stimuli, in mixed-category strings in a delayed typing paradigm. Intermixing the two stimulus types allows for examination of letter and digit identification without top-down knowledge that the stimuli belong to a single category, which could influence responding.

Strings were presented for brief durations to induce identification errors (which are rare or nonexistent for single characters of standard size and contrast, even with limited exposure). The choice to limit exposure duration rather than mask or distort the characters was to not alter the visual forms of the letters, particularly since the mask or distortion used may not uniformly affect all characters (Fiset et al., 2009; Mueller &

Weidemann, 2012).

The confusability method provides multiple subsets of data: identification accuracy for each character (from the correct identification responses), and the set (and rates) of characters reported in error for each target character (from the identification errors). These correspond to the diagonal and off-diagonal cells of a target character-by- response character confusion matrix. The first set of data was used to determine the extent to which character frequency, visual complexity, and visual similarity predict character accuracy. The error responses were analyzed to determine whether the same variables (frequency, complexity, and visual similarity) affect the character given in error. Regression was employed to determine the relative importance of these variables in explaining the data.

There are numerous published behavioral confusability matrices for letter stimuli, but no study to my knowledge has examined cross-category confusability for letters and

112 digits. With increasing interest in digit identification and overlap between letter and digit processing, such a matrix not only provides useful data for direct analysis but also a tool for other researchers in designing and matching letter and digit stimulus sets.

Methods

Mixed letter and digit strings (e.g., R9XM) were presented briefly and subjects asked to report the stimulus after offset. As indicated by pilot testing, 4-character strings provide a balance between collecting as much data as possible per trial (long strings) with constraints on working memory (short strings). All letters except O and all digits except

0 were used (these characters are not reliably distinguishable in mixed contexts). In the experiment 442 strings were presented: each of the 34 characters appearing 13 times in each position. In 34 practice trials (split into 3 short blocks) participants saw strings with the same characteristics, one character appearing once in each position.

The fixed-width font Consolas was used. One feature of this font, as indicated by

Study 1 results, is that it does not introduce any categorical visual differences between letter and digit characters. In visual similarity (average pixel overlap), discriminability

(maximum pixel overlap), and visual complexity (perimetric complexity and number of pixels), the sets of Consolas letters and digits do not differ from each other. This property of the font is beneficial for investigating effects of visual variables on by- character dependent measures because it means that any effects of these variables are not confounded with the categorical distinction.

Stimulus duration was brief to induce errors; pilot testing revealed that central presentation with a duration of 30-50ms is sufficient to induce errors on strings of this length. Duration was calibrated individually for each participant through practice trials to

113 obtain an approximately 60% accuracy rate. The trial structure was as follows: self- paced display to begin trial (‗. . .‘ terminated by Spacebar), fixation for 200ms, blank screen for 100ms, stimulus for set duration, untimed response period (terminated by Enter key), and 500ms of blank screen. Twenty-seven individuals participated in this experiment, which lasted about forty minutes including a brief mandatory break at the halfway point.

A long duration version of this experiment was run with four participants to investigate possible effects of working memory on the errors. The long duration experiment included the same stimuli as the main experiment but stimulus duration was

1500ms to insure that participants had plenty of time to accurately identify all of the characters presented. All other experimental parameters were the same. Errors due to the memory demand of remembering the strings from the stimulus offset until the completion of the typing response would be expected to also affect the long duration experiment; a high accuracy rate would indicate that the working memory load is minimal and does not contribute errors during the main (brief duration) experiment.

Analyses

Analyses explored error rate by character, proportion of times each character was given as an error, modal error response to each character, and substitution error matrices.

Simple correlations and multiple linear regressions were used to assess the relationship between independent variables and the dependent measures. Simple correlations were run in MATLAB using corrcoef. Regressions were run with the stepwisefit function, which evaluates the current model (beginning with zero predictors entered) with respect to all variables and sequentially adds/removes predictors to improve model fit. All

114 models included a constant term. Before model fitting began, all variables (dependent and independents) were individually normalized by Z-scoring; the resulting model coefficients are β-values which can be compared across models. A p-value of .05 was considered significant.

By-character analyses. The modal error response given for each character was compared to the target character to determine whether errors and responses differ in frequency, number of pixels, number of features, perimetric complexity, visual overlap

(pixel and feature, over all characters or just within-set), or visual discriminability (pixel and feature, over all characters or just within-set).

The error rate for each character and the proportion of times each character was given as an error were correlated and regressed with the predictor variables listed in

Table 5.1. Many of these predictors are derived from measures described in Study 1: frequency, perimetric complexity, visual similarity (average overlap), and visual discriminability (maximum overlap). For the latter two variables, two versions of each were computed which reflect the visual relationship between characters either across all other characters, or within a given character category (letter or digit). The all-character versions are most consistent with a shared identification system, while the within- category versions are consistent with separate identification systems.

Table 5.1: Predictor variables for by-character error rate.

Predictor Variable Description Binary category 0 for digits; 1 for letters Number of features* Number of features in character Number of pixels* Number of pixels in character Perimetric complexity* Perimetric complexity of character Character frequency* Character frequency of character

115

Average pixel overlap: All Char.* Average pixel overlap of character with all other characters Average pixel overlap: Within Categ.* Average pixel overlap of character with other same-category characters (letters with letters, digits with digits) Maximum pixel overlap: All Char.* Maximum pixel overlap between character and most-similar other character. Maximum pixel overlap: Within Categ.* Maximum pixel overlap between character and most-similar same- category character. Average feature overlap: All Char.* Average feature overlap of character with all other characters Average feature overlap: Within Categ.* Average feature overlap of character with other same-category characters (letters with letters, digits with digits) Maximum feature overlap: All Char.* Maximum feature overlap between character and most-similar other character. Maximum feature overlap: Within Categ.* Maximum feature overlap between character and most-similar same- category character.

*Derived from measures described in Study 1

Substitution matrix analyses. Each cell of the substitution matrix (character target by response character, 34x34) contains the raw number of times the two characters indexing that cell were confused, in either direction; this matrix is symmetrical and only one triangle was submitted to analysis. Fourteen predictor variables were created (listed in Table 5.2). These variables included ones derived from the measures of frequency, visual similarity, and complexity used in Study 1. Additional variables were designed to represent possible sources of knowledge the participants may have used (either intentionally or not) in responding, including character names, letter/digit category status,

116 and distance between letters and digits in alphabetic/numeric order. In addition to analyses of all the substitutions, separate analyses considered subsets: Digit-for-digit errors, Letter-for-letter errors, and Mixed (Letter-for-digit and Digit-for-letter) errors.

Table 5.2: Predictor variables for substitution errors.

Predictor Variable Description Binary category 1 if characters of the pair belong to same category; 0 if characters of the pair belong to different categories Pixel overlap* Pixel overlap of characters of the pair Total number of pixels* Summed number of pixels in pair Feature overlap Feature overlap of characters of the pair Total number of features* Summed number of features in pair Total perimetric complexity* Summed perimetric complexity of pair Perimetric complexity Difference between perimetric complexities of the difference* pair Frequency difference* Difference between character frequencies of the pair Minimum frequency Smaller of the two character frequencies Maximum frequency Larger of the two character frequencies Total frequency Summed character frequencies of the pair Name similarity Overlap of phonetic features of constituent phonemes in character names of pair Normalized distance Alphabetic distance between two letters in pair; numeric distance between two digits in pair, normalized within category s.t. max distance = 1. Set to 2 for all cross-category pairs.

*Derived from measures described in Study 1

117

Long Duration Experiment Results

Accuracy in the long duration (1500ms) experiment was extremely high (mean:

96%, range: 93-99%), suggesting that the use of working memory does not cause errors in this task. The near-ceiling performance in this experiment is consistent with conclusions that visual working memory has a four-object capacity (Vogel, Woodman, &

Luck, 2001). Additionally, the main experiment had a shorter lag between stimulus onset and responding due to the shorter (≤50ms) stimulus duration which would further reduce the demand on working memory.

Main Experiment Results

Data from three participants were excluded from analysis due to very high or very low accuracy (accuracies of 95%, 94%, and 16%). Extremely high accuracy will contribute very few errors to the analysis, while the quality of the low accuracy data is questionable. Data from twenty-four participants (10,608 total trials) were retained.

Average accuracy was 74%, with a range of 37% to 89%.

Character accuracy differs across positions (χ2 (1, N=42432) = 73.0, p < .001).

Highest accuracy was found at the first position from the left (98%), followed by the second position, fourth, and finally third position (96.4%). This pattern of high accuracy on leftmost and rightmost positions is characteristic of character string identification and may be a due to a combination of factors including visual acuity, crowding, and the left- to-right reading direction (e.g., Hammond & Green, 1982; Tydgat & Grainger, 2009).

118

By-character error rates

Letters were more accurate than digits: 5.5% error rate for letters and 6.8% error rate for digits (χ2 (1, N=42432) = 27.1, p < .001). The five most accurate characters were all letters (L, P, H, S, C), while the five least accurate characters were a mix of both types

(I, 1, T, 8, Y), see Table 5.3. Comparisons of the category of response by category of target indicated that response category was independent of target category. Of error responses to letters, 69% (1179/1704) were other letters and 31% (547/1704) were digits.

Of error responses to digits, 72% (547/764) were letters and 28% (215/764) were digits.

These rates do not differ based on target category (χ2 (1, N=2468) = 1.69, p = .19). The rates of letter and digit responses approximate the relative prevalence of digit and letter targets in the study (26.5% of target characters were digits), but differ from the expected proportion if the response category was based only on category frequency in the stimulus set (χ2 (1, N=2468) = 12.2, p < .01).

The proportion of times each character was given as an error and the error rate on each character (Table 5.3) were submitted to correlation and multiple regression analyses with the predictor variables described in Table 5.1. No significant results were found with either dependent measure. These measures had very low variability on a by- character level, which likely contributed to the difficulty in predicting them.

119

Table 5.3: By-character dependent measures. Each character was presented as a stimulus 1248 times.

Character Error rate Proportion Modal error response Modal response prop. (Number) of errors to target (col 1) of errors 1 0.06 (75) 0.06 I 0.87 2 0.02 (30) 0.02 3 0.12 3 0.02 (29) 0.02 2 0.15 4 0.03 (32) 0.04 A 0.16 5 0.02 (30) 0.04 S 0.39 6 0.03 (32) 0.03 G 0.36 7 0.01 (16) 0.04 Y 0.13 8 0.04 (47) 0.04 B 0.19 9 0.03 (38) 0.01 P 0.19 A 0.01 (17) 0.02 4 0.31 B 0.03 (42) 0.02 8 0.38 C 0.01 (14) 0.02 G 0.12 D 0.02 (26) 0.02 B 0.14 E 0.04 (46) 0.03 F 0.36 F 0.01 (18) 0.05 E 0.24 G 0.04 (45) 0.04 6 0.19 H 0.01 (12) 0.01 † † I 0.08 (104) 0.06 1 0.74 J 0.03 (34) 0.03 7 0.20 K 0.02 (25) 0.03 X 0.22 L 0.01 (9) 0.01 I 0.20 M 0.02 (23) 0.02 N 0.32 N 0.02 (21) 0.02 M 0.34 O -- 0.04 6 0.14 P 0.01 (10) 0.03 G 0.24 Q 0.02 (24) 0.04 P 0.30 R 0.02 (22) 0.03 5 0.43 S 0.01 (13) 0.02 I 0.22 T 0.04 (48) 0.03 V 0.21 U 0.03 (40) 0.02 Y 0.16 V 0.02 (23) 0.03 M 0.29 W 0.01 (17) 0.03 W 0.20 X 0.02 (29) 0.02 V 0.11 Y 0.04 (46) 0.02 X 0.16 Z 0.03 (34) 0.04 I 0.87

†H was excluded because three characters tied for modal error response: N, V, Z

120

By-character modal error responses

For each target character, the modal error response character was extracted (see

Table 5.3). (The target H was excluded from this analysis because three characters tied for its modal error response: N, V, and Z.) The modal error response given to both letter and digit targets was generally a letter (letter responses to letters: 12/24, 71%; letter responses to digits: 7/9, 78%), consistent with the pattern in the full set of substitution errors.

The modal error responses were compared to their target characters on 12 variables, all listed in Table 5.1 (excluded: binary category). For each variable, the value for the target was subtracted from the value for the error. The number of positive and negative results was tallied for each variable and submitted to a sign test. None of these variables differed reliably between the target and modal error response: None met the critical value of 23 positive or negative outcomes out of the 33 observations (2-tailed sign test). The maximum number of positive or negative results for any variable was 19 (also nonsignificant with a 1-tailed test). The outcome of this analysis suggests that though for some characters the modal error response forms a high proportion of errors to that target, the influence of variables such as visual overlap, complexity, and frequency is better investigated across the full range of error responses.

Substitution matrix

Five substitution errors were not included in the matrix: four responses of the letter O (to J, Q, 1, and 9), and one response of the digit 0 (to 9); neither was ever presented as a target. A total of 2466 substitution errors were included, of 2792 total errors in the experiment. The entire matrix is reproduced in Appendix B.

121

First, the raw counts of the substitutions for each character pair were correlated with the variables listed in Table 5.2. Only two significant correlations resulted: a positive correlation (r = .16) with feature overlap, and a positive correlation (r = .28) with pixel overlap. Both of these correlations indicate an increase in errors on pairs which have high visual overlap. Multiple regression run with all thirteen predictor variables13 produced a significant model including pixel overlap (β = 3.62), total pixels (β = -1.38), and name similarity (β = -.95; model F(557,3) = 19.88, p < .05). These variables were entered into the stepwise regression in the order listed, indicating that each subsequent variable accounted for additional sources of variance; 8% of the variance could be accounted for by only the first predictor, 9% with the first two, and 10% with the full model. See Table 5.4 for a summary of these results.

By confusion type. The substitution data were also split into three subsets: Letter- for-letter errors, Digit-for-digit errors, and Mixed errors (Letter-for-digit and Digit-for- letter). Of the total errors, 1179 were Letter-for-letter errors, 215 were Digit-for-digit, and 1072 were Mixed errors. The substitution counts for these three subsets were each correlated with twelve predictor variables from Table 5.2 (excluding binary category match, also excluding normalized distance for the Mixed set).

Letter-for-letter errors correlated with feature overlap (r = .23), pixel overlap (r =

.33), normalized distance (r = -.13), difference in perimetric complexity (r = -.12), difference in letter frequencies (r = -.19), maximum frequency (r = -.26) and summed frequency (r = -.23). The visual overlap (pixel and feature) variables correlated

13 A stepwise regression with only feature overlap and pixel overlap as possible predictor variables (the two variables significantly correlated with the dependent measure) also produced a significant model including both variables (F(558,2) = 26.59, p < .05, R2 = 9%). This model was created by first adding feature overlap (accounting for 8% of the variance, final model β = 2.97) and then pixel overlap (final model β = 1.17).

122 negatively, indicating more errors between pairs with higher visual overlap. The negative correlation with normalized distance suggests that letters which are closer together in the alphabet have higher confusion rates. A high difference in perimetric complexity indicates that letters which differ greatly in complexity were less often confused. Letter pairs with a larger frequency difference were also less often confused. The higher the frequency of the more frequent letter, the less often confusions occurred. And finally, the higher the combined frequencies of the two letters, the less often confusions occurred.

Stepwise multiple linear regression was run to reveal which of these variables could jointly predict the confusability pattern; only three variables were entered into the final model: pixel overlap, maximum frequency, and feature overlap (F(296,3)=21.90, p <

.05). These variables were added to the model in the order listed, with beta values of

1.25, -1.13, and .75 in the final model, together explaining 18% of the variance in the

Letter-for-letter data. (Models leading to the final model accounted for 11% and 16% of the variance, respectively.)

The Digit-for-digit confusions were subjected to the same correlation and regression analyses. Significant correlations were found with pixel overlap (r = .36), normalized distance (r = -.50), and total number of pixels (r = .43). The positive correlation with pixel overlap indexes more confusions between letter pairs which have high pixel overlap. As with the Letter-for-letter data, increased distance, this time numeric distance, resulted in lower rates of confusion. And finally, higher total number of pixels of two digits, a measure of visual complexity, resulted in more confusions.

Stepwise multiple linear regression resulted in a model including normalized distance (β

= -2.78) and total number of pixels (β = 2.44; model F(33,2) = 13.43, p < .05). This

123 model accounts for 45% of the variance in the Digit-for-digit data, and was built from a model including just normalized distance (R2 = .27).

The Mixed-category confusion data were correlated with feature overlap (r = .20), pixel overlap (r = .33), name similarity (r = -.14), minimum frequency (r = .14), and summed frequency (r = .12). The positive correlations with feature and pixel overlap indicate higher error rates on character pairs with high visual overlap. High character name similarity was associated with lower error rates. The rarer the frequency of the more-rare character, the lower the confusability of the pair. And finally, as total frequency of the two characters increased, so did the number of confusions. Stepwise multiple linear regression resulted in a model including only pixel overlap and character name similarity (F(222,2) = 16.40, p < .05), accounting for 13% of the variance. Pixel overlap was added first, and accounted for 11% of the variance alone.

Table 5.4: Significant predictor variables in regressions of substitution data (Beta values) Predictor All Letter-letter Digit-digit Mixed Variable Substitutions Substitutions Substitutions Substitutions

Pixel overlap 3.62 1.25 5.78

Total number -1.38 2.44 of pixels

Feature overlap .75

Maximum -1.13 frequency Name -.95 -2.65 similarity Normalized -2.78 distance

124

Total model R2 .10 .18 .27 .13

Discussion

Analyses of by-character error rate, proportion of times each character was given as an error, and modal error responses to each character did not reveal any significant influences of the variables considered. In addition to a variable representing the category of the character (letter or digit), the independent variables tested were measures of visual complexity: number of features, number of pixels, and perimetric complexity; character frequency; visual overlap: feature and pixel, computed across all character and within same-character pairs; and visual discriminability: feature and pixel, computed across all character and within same-character pairs. The failure of these variables to correlate with or predict the three dependent measures suggests either that none of these variables are relevant to the identification process, or more likely, that there was not sufficient variability in the dependent measures used here. Analyses in Study 3 will consider these variables‘ abilities to predict response times in same/different judgments to character pairs; higher variability in RT across characters may result in reliable relationships.

The two primary positive results of this study concern the independence of response and target category, and variables predictive of the substitution matrix. The failure of substitution errors to maintain the category (letter/digit) of the target character was a dependable result. -squared analysis revealed that target and response category were independent in the overall substitution data, and a consistent pattern was also found considering only the modal error response to each character. Rather than matching the

125 category of the response, the substitution error categories were approximately consistent with the overall relative frequencies of letters and digits in the experiment, though not perfectly aligned with them.

Independence of response category with respect to target category in character identification is consistent with a shared character identification system. In such a system, letter and digit stimuli are processed without regard to their category designation and errors take the form of other active character identities. By contrast, separate identification systems would involve separation of letter and digit processing, with the result that some identification errors (those arising after a determination has been made about the category of the stimulus) would be constrained to be the same category as the target. The pattern of independence of target and error category in identification was also reported by McCloskey and Schubert (2014) in their study of acquired dyslexic individual LHD, and formed a part of our conclusion in favor of shared system processing. The lack of restriction of the error category by stimulus category implies the presence of shared identification system, or alternatively, separate identification systems combined with all errors having arisen prior to the category determination for the stimulus was made.

Associations were found between character confusions and independent variables which measure visual similarity, visual complexity, character name similarity, alphabetic/numeric distance, and character frequency. The first analyses considered the full set of substitutions: letter and digit error responses to both letter and digit targets.

This set was predicted by a combination of pixel overlap, total number of pixels, and character name similarity. A combination of increased pixel overlap between two

126 characters, fewer total pixels, and dissimilar character names were associated with higher rates of character confusions. Pixel overlap and total number of pixels are properties of low-level visual representations, such as those at the viewer-centered visual feature level.

The significant contribution of number of pixels beyond pixel overlap suggests an independent influence of complexity (or ‗perceivability‘) on character identification, consistent with similar reports by Mueller & Weidemann (2012) for letters. Character name similarity is computed based on character name representations, accessed from abstract character identities (by analogy to abstract letter identities, refer to Figure 2.2 and related discussion). The negative relationship between character name similarity and confusion rates is in a -intuitive direction, but may indicate that participants took particular care to distinguish identities with similar-sounding names, leading to errors on those with dissimilar names14. Overall, the regression model indicates that both low- and high-level variables influenced character identification errors.

Subsequent analyses considered the set of Letter-for-letter substitutions, which were reliably predicted by a combination of pixel overlap, feature overlap, and maximum frequency. As pixel overlap and feature overlap of a pair increased and the frequency of the more-frequent letter decreased, the number of confusions increased. These variables reflect the impact of visual similarity at low (pixel) and more-abstract (feature) levels, as well as a benefit in processing for high-frequency letters. These conclusions are consistent with a large literature of research on letter confusion matrices (reviewed in the introduction to this chapter). The success of the feature overlap measure suggests that this measure, designed to explain character shapes through a small set of abstract visual

14 This variable added only weakly to the overall explanatory capacity of the model (raising the R2 from .092 to .097).

127 features, is a sufficiently accurate model of visual similarity within the identification system to correlate with identification performance.

Digit-for-digit substitutions were well-predicted (45% of the variance) by a combination of numeric distance and number of pixels. The influence of numeric distance implies involvement of semantic representations of number, which are more similar for close numbers (see also the numeric distance effect, e.g., Moyer & Landauer,

1967). The equivalent variable for letters, alphabetic distance, did not predict the Letter- for-letter confusions, suggesting that numeric distance is a more influential factor in digit identification, perhaps indicating that position of letters in the alphabet is not automatically accessed in identification (Krumhansl & Thomas, 1977). An increase in the number of pixels also increased confusions, which is a property of low-level visual representations. Together these two variables, one strictly visual and one semantic, had a large influence on digit-for-digit confusions.

Mixed character substitutions (Letter-for-digit and Digit-for-letter; 43% of the overall substitutions) were predicted by pixel overlap and name similarity. These variables overlap with those found to predict the Letter-for-letter substitutions (pixel overlap) and the full set of substitutions (pixel overlap, name similarity). Identical variables predictive of Letter-for-letter, Digit-for-digit, and Mixed confusions would have provided strong evidence in favor of the shared systems hypothesis; the variables affecting character identification would not differ depending on the character type. The present findings are not of identical predictors, but there are consistencies (e.g., the role of pixel overlap) which are suggestive of shared systems processing. Furthermore, one major difference, that Digit-for-digit confusions were predicted by normalized distance

128 while Letter-for-letter confusions were not, reflects a post-identification, semantic impact on the responses. This result is not inconsistent with shared identification systems for letters and digits.

With the results of regressions predicting the substitution matrices in mind, an earlier conclusion can be fine-tuned. Previously, the result that response category was independent of target category was discussed as either consistent with the shared system hypothesis, or separate systems with the added requirement that all of the errors arise prior to the determination of target category. However, regressions revealed the importance of high-level variables (e.g., numeric distance, character names) which are accessed only after character identity has been determined. The influences of these representations on the confusability matrix suggest that all of the errors did not arise prior to the determination of target category; some errors were affected by similarity at higher levels. This finding somewhat strengthens the conclusion in favor of a shared identification system for letters and digits. However, analyses did not test for the presence of particular errors which arose after the determination of target category and whether these particular errors did or did not maintain the stimulus category. Rather, the result indicated that some errors arose at a high level and that overall the error did not preserve stimulus category; it is possible that other errors arose prior to the hypothesized split into letter and digit processes and did maintain category but their influence was not evident in the overall error pattern. Additional stimulus manipulations or analyses could search for the locus of particular errors through use of mixed case letter stimuli and/or perseveration errors; perseverations of a letter identity but not its case would indicate involvement of abstract letter identities, accessed only after the split. The presence of

129 such errors and demonstration that they do not maintain the category of the stimulus could strengthen the argument for a single system. However, additional theoretical specifications would be required to describe the interaction between perseverations and current trial activations and their effect on the determination of stimulus category.

Beyond character identification accuracy

Though the substitution matrix provided a rich source of data for analysis, analyses of by-character error rates in this study failed to reveal any impact of the independent variables. In Study 3 I continue to investigate the impact of visual similarity, discriminability, complexity, frequency, and higher-level variables on character identification, through data acquired in a same/different judgment task. Study 3 builds on the results of this study, providing both a second source of data relevant to character identification, and potentially a more sensitive measure of performance in response times. Unlike Study 2, which relied on errors to populate the character similarity matrix, the same/different task ensures that every cell in the matrix contains a value: the reaction time associated with the response to that pair of characters.

Response Times in Letter and Digit Same/Different Judgments (Study 3)

In this study I collected response times for all pairwise combinations of uppercase letters and digits through a speeded same/different judgment. Though similar in many ways to the studies by Courrieu and colleagues (2004) and Podgorny & Garner (1979), the present study has multiple advantages: use of a same/different task requiring a response on every trial (c.f. go/no-go for different pairs), unmasked presentation, simultaneous presentation of both items of the pair, and inclusion of digits.

130

As in Study 2, multiple data sets were derived from a single task. The first set of data is the response times on same trials (e.g., ‗D D‘, ‗4 4‘), which provides a measure of the relative difficulty of determining and indicating that two identical characters are the same. The second set of data concerns the different trials (e.g., ‗D 4‘). These responses provide a measure of the similarity of the characters presented on that trial. Higher response times, and lower accuracy, on these trials correspond to higher similarity/confusability of the two elements of the pair.

Methods

For a complete picture of the similarity of letter and digit stimuli one would use all uppercase letters, lowercase letters, and digits, presenting all pairwise combinations.

However, the resulting 6084 combinations are impractical to test empirically. Instead I restricted the stimuli, as in Study 2, to 25 uppercase letters and 9 digits (excluding 0 and

O) resulting in 561 different pairs. Along with these different trials, an approximately equal number of same trials were presented (17 presentations of each of the 34 same- pairs: 578 trials). Of the 561 different pairs, 225 presented one letter and one digit, 300 presented two letters, and 36 presented two digits. The font Consolas was used to display the stimuli.

On each trial, the two characters of the pair were presented horizontally, centered on fixation. Participants were instructed to respond as quickly as possible whether the two characters were the same or different, using designated keys on the keyboard.

Stimulus presentation timed out after 1500ms; pilot data indicated that participants responded well within 1000ms. The ordering of the stimuli in the pairs and the hands used for the response keys were counterbalanced across participants. A short practice

131 block (40 trials) using symbol stimuli (e.g., %, $, #) preceded the main experiment and served to familiarize the participants with the procedure and response keys. The entire experiment, including practice and a brief mandatory break halfway through the main experiment took about 45 minutes.

Data analysis

Data cleaning proceeded in two steps on individual participants‘ RT data:

Removal of outliers (trials with RT less than 2.5SD below or more than 2.5SD above that participant‘s mean RT), and normalization to the mean by condition (dividing each trial

RT by the participant‘s mean RT for that condition—same or different--calculated without outliers). Cleaned data were averaged across participants and the resulting 34x34 matrix (character 1 of the pair by character 2 of the pair) was symmetrized by averaging the matrix with its transpose. The diagonal of this matrix is the normalized RT to same trials; the upper and lower triangular portions are normalized RT to different trials. Only one triangle was used for correlation and regression analysis as the other triangle is identical due to the symmetrization procedure.

Accuracy data were also analyzed; extracted as a 34x34 matrix, averaged across participants and symmetrized. The diagonal of the matrix gives the accuracy for the same trials, while the off-diagonal cells are accuracy for the different trials; only one triangle was used for analyses of the different accuracies.

To visualize the similarity structure of the Different trial RT data, a clustering analysis was conducted and plotted as a dendrogram. The process for creating the dendrogram involved computing the pairwise Euclidean distance between each of the data points, multidimensional scaling of the distances, and creation of a hierarchical

132 binary cluster tree (using the MATLAB functions pdist, linkage (‗ward‘), and dendrogram, respectively).

Correlation and multiple linear regression analyses were conducted separately for same and different trials. These determined which of the predictor variables (see Tables

5.1 and 5.2) were predictive of the behavioral confusions, and whether they made independent contributions.

Different pairs. The same predictors used in Study 2 for the substitution matrix were used for the Different trial RT and accuracy data. There are thirteen in total: binary category match, pixel overlap, total number of pixels, feature overlap, total number of features, total perimetric complexity, perimetric complexity difference, frequency difference, minimum frequency, maximum frequency, total frequency, name similarity, and normalized distance. These are described in Table 5.2.

In a second analysis, different responses to cross-category pairs (Mixed), within- digit pairs (Digit-digit), and within-letter pairs (Letter-letter) will be considered separately to determine whether the same factors are predictive of RT across these categories of different pair.

Same pairs. Analyses of the same pairs used the single character predictor variables described in Study 2 (see Table 5.1). These variables are: binary category, number of features, number of pixels, perimetric complexity, character frequency, average pixel overlap across all characters, average pixel overlap within category, maximum pixel overlap across all characters, maximum pixel overlap with category, average feature overlap across all characters, average feature overlap within category,

133 maximum feature overlap across all characters, and maximum feature overlap within category.

In addition to analyzing the average data to all same pairs, a second measure of same pair performance was calculated based only on the first presentation of each same pair to each participant. This dataset is referred to as SameFirst to distinguish it from the

SameAll dataset. Parallel analyses were conducted for SameFirst as for SameAll using the predictor variables described in Table 5.1.

Results

Comparing Same to Different Trials

Same and different trials differ in accuracy: Different pair accuracy was higher

(92% correct) than same pair accuracy (89%, U = 6577, N1 = 34, N2 = 561, p < .01).

Comparing RT for same and different pairs (using data normalized by average correct trial RT regardless of condition) revealed that same trials were responded to more quickly than different trials (Same: .99, Different: 1.02; U = 4778, N1 = 34, N2 = 561, p < .001).

The RT matrix for Different, SameAll, and SameFirst data are reproduced in Appendix

C.

Different trials: RT

Dendrogram. The dendrogram computed from the RT values for different trials is displayed in Figure 5.1. Characters with high similarity are grouped into (unordered) clusters, with the height of the vertical branch connecting two clusters indexing their similarity. Order of the clusters along the horizontal axis is arbitrary and depends only on the constraint that branches are not allowed to cross for display purposes. One immediate

134 observation from the dendrogram is that letters and digits are highly intermixed, with most digits having a letter as the other character in their immediate cluster (6-G, 8-B, 5-S,

9-Q, 1-I-T, 4-A, 7-Z). Furthermore, visual similarity appears to be a major organizing factor of the clusters, as can been seen not only in the lowest level clusters but also higher-order groupings containing characters which appear to share categories of features such as curves (leftmost cluster: 6, G, 8, B, R, D, P, U) or the presence of two vertical or near-vertical segments (rightmost cluster: H, W, M, N). These observations suggest that the RT data are indexing similarity which is based in part in visual similarity; particular types of visual similarity are explored in the following sections.

135

Figure 5.1: RT dendrogram computed from Different trials. X-axis labels are shown in Consolas, the display font used in the experiment. Quantitative analyses. Pairwise correlations were performed between the predictor variables in Table 5.2 and the dependent variable of Different trial response

136 time. Five significant correlations were revealed: binary category match, feature overlap, pixel overlap, normalized distance, and minimum frequency (all ps < .05). Binary category match was positively correlated (r = .12) with RT, consistent with the result reported above that Mixed pairs were responded to faster than Letter-letter/Digit-digit pairs. Feature and pixel overlap were both positively correlated (r = .21) with RT, indicating that higher feature and pixel overlap between the characters in a pair are both associated with slower responding. Normalized alphabetic/numeric distance was negatively correlated (r = -.13) with RT, indicating a time-cost associated with letters/digits which are close together in the alphabet or numerically close. The final significant correlation was between the minimum (smaller) frequency of the two characters of the pair, which was positively associated (r = .12) with RT: The presence of a rare character resulted in faster responding.

Stepwise multiple linear regression was conducted with all of the Table 5.2 variables included as possible predictors. The resulting best model included four predictors and accounted for 18% of the variance in the RT data (F(4, 556)=29.46, p <

.05, R2 = .18). The predictors included are pixel overlap (β = .01), normalized distance (β

= -.0007), frequency sum (β = .005), and feature overlap (β = .004), which were added to the model in that order. The intermediate models achieved R2 values of .14, .16, .17, and

.18, suggesting that the addition of each subsequent predictor increased the ability of the model to account for the data. See Table 5.5 for a summary of these and subsequent results.

By trial type. There are three types of different trials: Letter-letter pairs, Digit- digit pairs, and Mixed (digit-letter/letter-digit pairs). The average RTs for these pair

137 types are 1.00 for Letter-letter pairs, 1.01 for Digit-digit pairs, and .99 for Mixed pairs; the former two do not differ while both are slower than the Mixed pairs (LL compared to

Mixed: U = 28691, N1 = 300, N2 = 225, p < .05; DD compared to Mixed: U = 3001, N1 =

36, N2 = 225, p < .05). This result suggests that the presence of a character from the two different categories facilitated responding ―Different‖ to the pair.

Correlations of Digit-digit trials with the predictor variables (excluding binary category match) revealed no significant correlations, likely due to the small size of this subset of the data (only 36 trials). Correlations of Letter-letter trials revealed significant correlations with feature overlap, pixel overlap, total features, total perimetric complexity, minimum frequency, and summed frequency; all were positive. The relationships with feature and pixel overlap indicated slower responding associated with increased visual overlap (feature: r = .23, pixel: r =.34). The relationship with total perimetric complexity (r = .14) and total number of features (r = .14) indicated slower responding associated with increased letter complexity. The positive correlation with minimum frequency (r = .14) indicated an increase in response time for higher frequency letters, while the positive correlation with summed frequency (r = .12) indicated an increase in response time as the overall frequency of the letters of the pair increased.

Correlations with the Mixed trials were largely consistent with the correlations found with the Letter-letter trials: feature overlap, pixel overlap, difference in perimetric complexity, and minimum frequency were all significant correlates. The relationship between Mixed trial RT and the two types of frequency was positive (feature r = .21, pixel r = .42). Difference in perimetric complexity correlated negatively (r = -.15), indicating a decrease in response time as the difference in complexity of the two

138 characters of the pair increased. Minimum frequency was positively correlated: As the frequency of the less-frequent character of the pair increased so did response time.

Stepwise multiple linear regressions were run on the Letter-letter and Mixed datasets, with the same variables used for correlations (normalized distance excluded for

Mixed). A significant model emerged for Letter-letter trials which included pixel overlap, summed frequency, and feature overlap (F(296,3) = 17.57, p < .001). Beta values were .01 for pixel overlap, .005 for summed frequency, and .005 for feature overlap. These variables were added to the regression model in the order listed; R2 values for the subsequent models were .12, .14, and .15 for the final model. The final model for the Mixed data included only two predictors: pixel overlap and feature overlap

(F(222,2) = 27.56, p < .001). This model accounted for 20% of the variance, with a beta value of .13 for pixel overlap and .05 for feature overlap. The interim model included only pixel overlap, accounting for 18% of the variance in the data.

139

Table 5.5: Significant predictor variables in regressions of Different pair RT data (Beta values) Predictor Variable All different RT Letter-letter RT Mixed pairs RT

Pixel overlap .01 .01 .13

Feature overlap .004 .005 .05

Total frequency .005 .005

Normalized -.0007 distance

Total model R2 .18 .15 .20

Different trials: Accuracy

Pairwise correlations were carried out between the predictor variables and

Different trial accuracy. Four significant correlations were revealed: binary category match, feature overlap, pixel overlap, and normalized distance (all ps < .05). Negative correlations were found with feature overlap (r = -.16) and pixel overlap (r = -.25); higher levels of visual overlap were associated with lower accuracy. Binary category match was also negatively correlated with accuracy (r = -.14): Same category pairs resulted in lower accuracy than cross-category pairs. Finally, normalized distance was positively correlated with Different trial accuracy (r = .14); increased alphabetic/numeric distance between characters of a pair resulted in increased accuracy.

Stepwise multiple linear regression was conducted for all of the predictor variables listed in Table 5.2 with the dependent variable of Different trial accuracy. The resulting model included three predictors, accounting in total for 9% of the variance

140

(F(3,557) = 18.35, p < .05). The included predictors were pixel overlap (β = -.01), binary category match (β = -.005), and feature overlap (β = -.004); which were included in the model in that order. The addition of each predictor increased R2 of the model from .07 to

.08 to the final .09 value.

By trial type. Comparisons between Letter-letter, Digit-digit, and Mixed pair types on accuracy indicated that while the two same-category pair types did not differ

(Letter-letter: 91%, Digit-digit: 92%), Mixed trials were more accurate than Letter-letter trials (Mixed: 92%, versus LL: U = 28442, N1 = 300, N2 = 225, p < .05). Parallel to the analyses on RT, correlation and regression analyses were performed on accuracy of

Letter-letter, Digit-digit, and Mixed pairs. As with RT, Digit-digit trials did not correlate significantly with any of the predictor variables. Letter-letter trials correlated negatively with feature overlap (r = -.14), pixel overlap (r = -.22), total perimetric complexity (r =

.12), and summed frequency (r = -.13). Mixed trials correlated with only two variables: feature overlap (r = -.13) and pixel overlap (r = -.35).

Stepwise multiple linear regressions were run on the Letter-letter and Mixed datasets. The resulting model for the Letter-letter trials included pixel overlap (β = -.01) and maximum frequency (β = -.006) and accounted for just 7% of the variance (F(297,2)

= 11.20, p < .05). For the Mixed trials, only pixel overlap (β = -.01) was included in the model (F(223, 1) = 30.78, p < .05, R2 = .12).

Same trials

Accuracy did not differ between SameAll (89%) and SameFirst (89%) datasets.

However, RT did differ: SameFirst trials were slower than SameAll (1.02 versus 1.00 normalized RT, Mann-Whitney U = 270, N1 = 34, N2 = 34, p < .001).

141

SameAll. Letters and digits do not differ in SameAll RT (both 1.00) or accuracy

(both 89%). Correlations between all of the predictor variables in Table 5.1 and the

SameAll RTs resulted in no significant correlations. Correlations between the predictor variables and the SameAll accuracies revealed a single positive correlation with character frequency (r = .36, p < .05). This small correlation indicates the tendency of characters with higher frequencies to result in higher average accuracy on Same trials.

A stepwise multiple linear regression conducted on SameAll RT values did not result in a significant model; in addition to predictors not making significant individual contribution to explaining the data, a combination of predictors was also unable to do so.

The parallel analysis conducted instead on SameAll accuracies resulted in a model containing only character frequency, which accounted for 12% of the variance in the data

(F(1, 32) = 4.86, p < .05, R2 = .12).

SameFirst. Considering only the first presentation of each same pair, letters were responded to more quickly than digits (digits: 1.04, letters: 1.01; U = 46, N1 = 34, N2 =

34, p < .01). Pairwise correlations between SameFirst RT and the predictor variables listed in Table 5.1 resulted in two significant correlations: binary category and average feature overlap computed over all characters. The relationship between RT and binary category was negative (r = -.47, p < .05), consistent with the result that digit characters resulted in slower RT than letter characters. The relationship between RT and average feature overlap (all characters) was also negative (r = -.39, p < .05), indicating that participants were slower to indicate that two identical characters were the same when that character has a high average featural overlap with other characters. Parallel correlations

142 between SameFirst accuracy and the predictor variables resulted in no significant correlations.

Multiple linear regression stepwise analysis on SameFirst RT data resulted in a model containing two predictors: binary category (β = -.02) and average feature overlap computed over all characters (β = -.19), the same variables which were independently correlated with the dependent measure (F(2,31) = 7.14, p < .05). Together the variables account for 32% of the variance in SameFirst RT. Parallel analyses on SameFirst accuracy data resulted in no significant model; no combination of the predictor variables provided a significant explanation of the dependent variable.

Discussion

Overall, attempts to explain the results for different trials were more successful than for same trials. However, insights were gained in both instances about the variables which influence speed and accuracy of responding in a same/different paradigm with letters and digits. Overall, different trials were found to be more accurate while same trials were responded to more quickly (faster same than different RT was also reported by

Podgorny & Garner, 1979; both patterns found by Wiley et al., in prep.).

Different trials

The dendrogram created from RT data for the different trials suggested intermixing of response times to letter and digit stimuli and visual similarity as a strong organizing factor. These observations were borne out in regression analyses, which revealed that pixel and feature overlap between the two characters of the pair were predictive of response times. The effect of visual overlap was to slow response time, a

143 common finding with letter stimuli in similar paradigms (e.g., Courrieu et al., 2004;

Podgorny & Garner, 1979; Wiley et al., in prep.). The stepwise regression procedure is informative as to the relevance of the two types of visual overlap tested. In all models including both visual overlap variables (Different trial RT- All trials, Letter-letter different trial RT, Mixed different trial RT), pixel overlap was added to the model first, followed by feature overlap. Though pixel overlap made a larger contribution (verified by the beta values, which were consistently larger for pixel than feature overlap), feature overlap made a significant independent contribution to explaining the variance in different trial RT. This finding is particularly striking in light of the fact that pixel and feature overlap are significantly intercorrelated (r = .25, p < .001), and supports the feature set as an approximation of abstract features involved in character identification.

In particular, that the feature set contributes to predicting not only Letter-letter different trials but also the full set of all Different trials (and Mixed alone) suggests that the features and overlap relationships are relevant for both letter and digit characters. To my knowledge the feature set developed here is the first to explicitly describe both letter and digit characters (though existing feature sets could be expanded to describe digits).

Multiple variables indexing character frequency were included in the regression models (and are highly inter-correlated); summed pair frequency was found to be predictive of different trial RT. As the total frequency of the two characters increased it slowed responding, indicating a speed advantage for low frequency characters. This effect arose despite the local frequency statistics of the experiment, in which every character was presented equally frequently. As reported in Study 1 (Chapter 4), character frequency for digits exceeds that for uppercase letters; the predictive effect of character

144 frequency is not simply a result of this relationship because Digit-digit pairs were not found to differ in RT from Letter-letter pairs.

Mixed and same-category different trials were found to differ in RT and accordingly a significant correlation was found between binary category match (i.e., whether the two characters of a pair were from the same category) and different trial RT.

However binary category match was not found to be a significant predictor in the stepwise regression. This outcome appears to suggest that the effect of binary category match on RT was better accounted for through a combination of other variables, specifically feature overlap, pixel overlap, normalized distance, and summed character frequency (the significant predictors in the regression model). However, normalized distance is highly correlated with binary category match due to the design of the variable:

Distance is only meaningful for within-category pairs, and the mixed pairs were assigned the same arbitrary value. The result is that normalized distance also indexes binary category match. To investigate this relationship, a stepwise multiple linear regression was run excluding the normalized distance predictor; the resulting model includes binary category match (also pixel overlap, feature overlap, and frequency minimum; model

F(556,4) = 29.27, p < .05, R2 = .18). A stepwise regression which excluded both normalized distance and binary category match also resulted in a significant model, though the explained variance is reduced relative to one including either binary category or normalized distance (F(557,3) = 35.06, p < .05, R2 = .16). These investigations indicate that the RT advantage for mixed-category pairs relative to same-category pairs was fully explained by predictors which did not explicitly encode these two trial types. It is unclear at this point whether the finding reflects the impact of one or more untested

145 variable(s) which co-vary with category membership, or a categorical effect as assumed by the binary category match predictor.

There was less variability in accuracy than response time in this experiment

(average performance was over 90% correct). However, some associations were still revealed between the different trial accuracy data and predictor variables. In a regression model, accuracy was predicted by pixel and feature overlap as well as binary category match; higher visual overlap led to lower accuracy and pairs of characters from the same category led to lower accuracy. The visual overlap result is consistent with that found for

RT; characters which are visually similar led to both slower RT and as well as lower accuracy. Binary category information made an additional contribution beyond the other included variables in accounting for variation in accuracy, though the overall variance accounted for in the accuracy model (9%) was much lower than that in the RT model

(18%).

Subsets of the different trials were analyzed based on the type of different pair:

Letter-letter, Digit-digit, or Mixed. Analyses of Digit-digit trials revealed no significant correlations, likely due to the small number of trials in this condition. Analyses of the RT

Letter-letter and Mixed trials both resulted in significant regression models including pixel and feature overlap, with summed pair frequency making an additional contribution to explaining the Letter-letter data. For accuracy, models predicting Letter-letter and

Mixed data included pixel overlap but not feature overlap, as well as maximum frequency for Letter-letter pairs. That the by-type analyses revealed a subset of the variables found to predict overall Different trial performance suggests that the same variables affect performance regardless of the particular categories of characters presented on each trial,

146 consistent with a single system, sensitive to certain variables (e.g., visual overlap) in processing both letter and digit characters.

Regression models predicting different trial performance included binary category information, and overall mixed pair performance was faster and marginally more accurate than same-category pair performance. Though the number of mixed, Letter-letter, and

Digit-digit trials were not equal, it is not straightforward how Mixed trial performance would have been boosted by trial counts: Mixed trials were approximately 40% of the different trials, Letter-letter: 53%, and Digit-digit: 6%. Perhaps more likely is the influence of explicit knowledge of character category within the identification system.

Investigations reported in McCloskey and Schubert (2014) pointed to the existence of an explicit representation of digit/letter category, an output from the allograph level of representation. We concluded in that study that the category representation does not influence identification (i.e., does not bias selection of an identity of the same category as the stimulus), but it is possible that the information was used in the same/different task.

Importantly, knowledge of the category of the two stimulus characters is relevant to the same/different judgment: Any trial on which the two characters were from different categories was a Different trial.

Same trials

Because same trials were repeated throughout the experiment, which may have lessened the effects of various influences and facilitated processing due to repetition, same trials were analyzed both as a whole and including only each participant‘s first trial for each same pair. Speed on same trials increased when considering all trials compared to just the SameFirst trials (accuracy did not differ). Considering SameAll trials, letters

147 and digits did not differ in accuracy or RT, and no correlations were found between RT and any of the predictor variables. SameAll accuracy was found to be predicted by character frequency, with lower frequency resulting in lower accuracy; this variable alone accounted for 12% of the variance in SameAll response time.

SameFirst response times were predicted by a combination of binary category

(letters v. digits) and average feature overlap between the presented character and all other characters. The inclusion of the former predictor is consistent with the observation that RT to letters exceeded that to digits. This pattern is the reverse of that commonly- reported; one possible explanation for the slower digit responding could be that digits were less frequent in the experiment: Same digit trials were only 26% of the Same trials.

Accordingly, responding may have been facilitated for letter trials because the representations will have been on average more recently and more often accessed. This phenomenon remains to be tested with other paradigms to determine whether it reflects a property of the task or of the identification system more generally.

The success of the average feature overlap computed over all characters is suggestive of a shared letter and digit identification system. Other competing predictors indexed average feature overlap between the presented character and other within- category characters (e.g., digit characters with other digits) as well as the same measures computed with pixel rather than feature overlap. The success of the average feature overlap predictor indicates two properties of the underlying processing. First, this predictor is computed with the assumption that all characters compete with each other; average overlap is calculated with all character pairs rather than just same-category pairs.

This assumption is consistent with shared identification systems, in which letters and

148 digits compete with each other (independently of the category to which they belong).

Second, the success of a feature rather than pixel overlap predictor suggests that the relevant visual similarity was at a fairly abstract level at which character features are represented (such as the character shape level), providing evidence that these features are involved in both letter and digit representations.

Conclusions

Broadly, the results of this study indicated that both same and different responses to pairs of letter and digit characters were affected by visual overlap and character frequency. The results of Study 3 are largely consistent with a shared identification system, suggesting that one of the first two possibilities is likely; but it remains to be confirmed.

Furthermore, the results of this study have implications for interpretation of the same/different task. This task could be accomplished purely on the basis of low-level representations (i.e., the viewer-based visual feature level) without consultation of information about abstract character shape, stored form, or identity. However, the results suggested that representations at multiple levels of the identification system impacted response times and accuracies. The effects of visual similarity were not limited to pixel overlap; feature overlap was a significant predictor of both same and different pairs. This effect suggests that though not necessary for the task, information about the visual features present in the stimuli impacted performance, influencing the perceived similarity of the pairs. Furthermore, effects of character frequency imply use of a stored level of representation: allograph and/or abstract character identity representations. The type of frequency employed here was case-specific, which is most consistent with activation of

149 case-specific allograph representations. If explicit category information was used, this implies activation beyond the allograph level. Alphabetic and numeric distance was also found to predict accuracy and RT for different trials; this information is likely to be linked to character identity representations (or allograph representations) and indicates some impact of high-level semantic knowledge about the characters on performance.

Recent work comparing same/different judgments by naïve and experienced readers of the Arabic alphabet found that both low-level (e.g., visual) and higher-level (e.g., letter names, motor production patterns) information impacts performance (Wiley et al., in prep.). The results of the current study are consistent with this task involving a range of representations in the identification system.

General Discussion

The analyses of Study 2 and Study 3 data were largely parallel, addressing the same underlying questions of how accuracy and speed of identification are influenced by variables relevant to the character identification system. Identification error patterns were considered to reveal the other active representations during processing of a given character and the similarities between these representations. In terms of response time, character pairs with high response time are assumed to be more similar, i.e., to have high relative overlap between the character representations.

The results of Studies 2 and 3 suggest roles of visual similarity (both pixel and feature), visual complexity, character frequency, character name similarity, and numeric distance in character identification performance. Visual complexity (number of pixels) and pixel overlap are assumed to reflect overlap and similarity of representations at viewer-based feature levels. Feature overlap reflects a higher level of visual similarity in

150 terms of somewhat abstract features represented at the character shape level. Character frequency effects would arise at allograph or abstract identity levels, where stored representations can differ in resting activation or threshold based on the frequency with which that representation has been activated in the past (i.e., character has been experienced and correctly represented). Name similarity and numeric distance are both measures of similarity at levels beyond character identification; this information can be accessed through character identity representations. Though effects of visual similarity, complexity, and frequency have been reported in letter identification studies, these studies are the first to investigate their effects on digit identification.

Differences between Study 2 and Study 3

A few key differences can be noted between the findings of Study 2 and Study 3.

First, while by-character independent variables were largely unsuccessful at predicting by-character error rates in Study 2, robust relationships were discovered between these variables and Same trial performance in Study 3. The positive results of Study 3 suggest that the variables are not irrelevant to character identification but rather that the by- character dependent measures collected in Study 2 were not sufficiently variable to reveal significant associations. Future studies eliciting by-character error rates should aim to include higher and more variable values to increase the chance of successful prediction.

A second difference between the two studies concerns the role of feature overlap. This variable was only found to predict one subset of the confusion data in Study 2 (Letter-for- letter substitutions), while it was a robust predictor which accounted for independent sources of variance in Study 3 data. The success of the feature overlap predictor in Study

3 suggests that the feature set approximates abstract featural representations in the

151 character identification system, and furthermore that the method for computing overlap between the features of two characters results in comparable similarities to similarity relationships between character shape representations.

Shared identification systems

A number of results across both studies contribute to a growing body of evidence consistent with shared identification systems for letter and digit characters. In Study 2, the category (letter/digit) of error responses was found to be independent of the category of the target character. This result suggests that the errors arose within an identification system in which processing of letter and digit stimuli was not computationally distinct: a shared identification system. However, this conclusion would be strengthened by direct evidence suggesting that all of the errors (or particular ones which cross category) did not arise at early processing levels prior to any letter/digit processing split.

Two additional findings from Study 3 support the shared systems theory. First, average feature overlap, computed over all characters, was a reliable predictor of RT on same trials. The relevance of this finding is that the predictor is consistent with the assumptions of a shared system. In a shared system all characters compete with all other characters, without regard to letter/digit category, and therefore the relevant visual similarity is computed with respect to all characters. Another feature overlap variable, which was unsuccessful in predicting RT, was computed over only within-category characters, consistent with the assumption of separate systems that competition at some levels is limited to same-category characters. Second, subsets of the different trials in

Study 3 coming from Letter-letter and Mixed (Letter-digit and Digit-letter) trials were predicted by the same variables. This result is also consistent with the shared system

152 assumption that performance on all characters will be sensitive to the same variables, regardless of their category.

A note with regards to the shared systems theory is that it does not preclude the presence of knowledge of digit/letter category membership. In fact, the ability of readers to report the category of a stimulus suggests that this is information which is extracted through the identification process. The claim of the shared systems hypothesis is that category membership does not affect the process of determining the identity of a stimulus. However, category membership can have other impacts on performance. One possible example is the facilitation observed in Study 3 for Mixed category different pairs; one source for this facilitation could be explicit knowledge of stimulus category impacting the same/different judgment.

Stepwise regression analyses such as those used in Study 2 and 3 should be interpreted with some caution as the particular ordering of variables entered may not reflect the most optimum variables predictive of the dependent variable but rather a sufficient set. There was a high degree of intercorrelation among the independent variables used here (particularly between versions of the same variable, such as summed and max frequency, which are based on the same underlying frequency values), and strong conclusions cannot be drawn about the relative importance of these intercorrelated variables on the basis of the results. For that reason conclusions refer to the importance of visual similarity and frequency generally, rather than a specific measure (e.g., summed pair frequency) which was revealed to be a successful predictor in the model found.

153

CHAPTER 6: POSITION IN LETTER IDENTIFICATION (STUDY 4)

This study considers position representation in identification, specifically letter identification during word recognition. Studies utilizing masked priming of lexical decision and cross-case same/different paradigms, among others, have suggested that letter position is coded in a flexible manner. More specifically, using evidence from normal participants in an illusory word paradigm, Fischer-Baum and colleagues (2011) provided evidence for graded both-edges coding of letter position. Through analysis of errors of an individual with acquired dyslexia, McCloskey and colleagues (2013) suggested that graded both-edges position is implicated in the coding of letter allographs and abstract letter identities. However, both of these studies have limitations. First, the illusory word paradigm depends upon setting up the opportunity for migration errors and therefore requires careful design of stimulus items to test particular hypotheses about letter position. This resulted in a small number of position schemes tested in that study.

The latter study, which involved analysis of perseveration errors in a large corpus of unselected reading errors made by an individual with acquired dyslexia (LHD), tested a larger number of position schemes, an improvement over the previous study. They concluded in favor of a graded both-edges scheme used to code position of allograph and abstract letter identity representations. However, the conclusions of the LHD study would be strengthened by converging analyses of letter identification in unimpaired participants. In Study 4 I aimed to address both of the limitations of previous studies eliciting a large dataset of identification errors, from unimpaired participants, on which a wide range of position schemes could be tested, including the commonly-posited open- bigram scheme.

154

According to the assumptions of cascading activation in letter identification, multiple representations are active during letter identification. Under normal circumstances (e.g., sufficient size and contrast of visual letter stimuli) in the unimpaired identification system the activation for the stimulus letter is strong, preventing the other active representations from being selected as the winning identity. By inducing errors in the normal system one reveals the other representations which were active concurrently, though they are normally not selected. To elicit errors from the undamaged identification system one can provide weak input for the target letters by displaying word stimuli briefly and away from fixation (Weinzierl, Kerkhoff, van Eimeren, Keller, & Stenneken,

2012). This technique results in letter substitutions and insertions, as well as smaller numbers of deletions and transpositions, as seen in Study 2. To the extent that these errors are perseveration errors, they reflect the residual activation of letters from previous trials, and can be analyzed for the position maintained by these errors. These errors reveal the underlying position coding used by the identification system: Due to the position-specificity of letter representations, letters will intrude as errors in the same position in which they first appeared (in a previous response). Analyses seek the position code which can explain the position occupied by the perseverated letters with respect to their position on a previous trial. Under the assumption of position-specific representations in identification, residual activation will occur for a representation at a particular position, and this representation will then re-appear at the same position in a future response.

155

Methods

Twenty participants were asked to report (by typing) briefly-shown word and pseudoword stimuli. A fixation cross was presented in the center of the screen and on each trial a single stimulus appeared just to the left or right of fixation. Pilot testing indicated that left visual field presentation resulted in lower accuracy, while a fairly balanced proportion of stimuli in the two visual fields ensured that participants did not shift their focus from the central fixation and thereby inflate accuracy on the fixated side.

Therefore, sixty percent of stimuli were presented in the left visual field, the remainder in the right.

Stimuli consisted of 600 words and 120 pronounceable pseudowords. Length varied between 4 and 8 letters, with equal numbers of items of each length (120 words and 24 pseudowords per length). The font Courier New was used and all stimuli were presented in lower case.

Pilot testing indicated that stimulus durations of less than 70ms were sufficient to produce a high proportion of errors. Stimuli were divided into ―short‖ (4-6 letters) and

―long‖ (7-8 letters) with different durations set for these two categories. Stimulus duration was calibrated for each participant through 3 practice blocks of 10 trials each, aiming to produce an accuracy of 60% correct.

After stimulus offset, participants typed in their response. They were told that they would see some words and some stimuli that were not words, and encouraged to type in whatever letters they saw regardless of whether they formed a word (the pseudowords were included to encourage this). Responses were displayed on the screen

156 as they were entered and participants were allowed to use the Backspace key to correct typos or change their response; the response period was untimed.

The sequence of events in each trial was as follows: fixation cross for 200ms, stimulus for duration set by individual‘s practice performance, response period for unlimited duration (terminated by the Enter key), blank screen for 500ms, and self-paced

ITI (screen displaying ―…‖, terminated by Spacebar). A brief mandatory break split the experiment into two halves; the entire experiment including practice trials took approximately 50 minutes.

Long Duration Experiment

It is possible that errors could be introduced through the typing response; these would not reflect the operation of the identification system. Though the population of undergraduate research participants has extensive reading and typing expertise, typos may be particularly rampant when participants seek to complete a tedious experimental task as quickly as possible. Typing errors could take a different form than the identification errors (e.g., typing errors may be non-perseverative while identification errors contain perseverations) and act as noise in the analyses, reducing the possibility of revealing a true effect in the data. Alternatively, typing errors could be of the same form as the identification errors (e.g., both may be perseverative) and artificially inflate the observed effect(s). The simplest solution is to attempt to collect the cleanest data possible. Precautions in the main experiment consisted of: displaying the response on the screen while it was entered, allowing the participant to make corrections before submitting the response, and unlimited time for responding.

157

In addition, to independently investigate the prevalence of typing errors, a separate experiment was run with three participants. Stimulus items were a randomly- selected subset of the items used in the main experiment (4-8 letters in length). Testing consisted of 360 trials: 59 pseudowords and 301 words. Stimuli were presented for 300 and 350ms (for short and long stimuli, respectively); all other presentation parameters and the trial structure were the same as the main experiment. Identification accuracy in the long duration condition should be at ceiling; any errors introduced can be assumed to be typing errors. If error rates are small in this condition it is fair to assume that typing errors make a negligible contribution to the letter errors in the brief presentation condition.

Accuracy and Perseveration Analyses

Data were combined across participants to increase the corpus of errors for analysis. However, chance analyses were conducted using trials from the participant‘s data being tested. Initial analyses considered error rates for word and pseudoword stimuli. Single letter errors were classified by an automated program (i.e., substitution, insertion, omission, transposition, other error). Errors consisting of the repetition of an entire response within a participant‘s data were excluded, as these could artificially inflate the measures of letter perseveration errors. Substitution and insertion errors are intrusion errors, which are possible perseveration errors. Perseveration analyses, as in

McCloskey et al. (2013), considered these intrusion errors and computed the number of times the same letter occurs in the immediately preceding trial (‗I-1‘ for Intrusion trial minus one) as a potential source for the perseveration. The proportion of times the letter was present in a randomly selected trial (from outside the vicinity of the intrusion trial)

158 was also calculated, as an estimate of the likelihood of a letter recurring in a response by chance. In the event that the intruded letter was not found in I-1, the analysis (number of times the letter occurs in the source trial and chance estimate) was repeated for trial I-2, and so on until a potential source trial could be found or a limit of 10 trials prior to the intrusion trial was reached (i.e., through I-10). The number of contiguous trials for which there were intrusion sources present above chance rates was termed the perseveration window. The perseveration window was then used in analyses of perseveration position.

The intrusions in the data set were matched with one or more possible source trial(s): Trials within the perseveration window containing the intruded letter. The intrusions are potential perseveration errors, now matched with their potential sources, but it is not possible to know which particular intrusions are perseverations (i.e., caused by presence of the letter in the source response). However, the number of true perseverations was calculated from the number of intrusion-source pairs and the chance rate of intruded letters in a prior response (see Appendix A of McCloskey et al., 2013 for details). The number of true perseverations was then used to derive the position matches as a proportion of the number of true perseverations for position schemes, which allows comparisons among position schemes (see below). Throughout the results section, reported p-values were derived from Monte Carlo chance analyses with 1,000 iterations.

Perseveration Position Analyses

Analyses determined which position scheme(s) best predicted the position occupied by the intrusion errors with respect to their source letters. The analysis asked whether the intruded letter in the intrusion trial and that same letter in the source trial were in the same position, according to the particular position scheme under

159 consideration. The number of actual position matches (where the letter was in the same position) was compared to the number of chance position matches. Chance position matches were calculated using randomly selected responses as the source trial, these responses were required to be the same length as the source response, contain the intruded letter, and be located outside the immediate context of the perseveration trial in the experiment. In other words, the position matches were compared to an estimation of the position matches expected if the position of the intruded letter is unrelated to the position of the source letter.

Position Schemes

Broadly, the position schemes considered were: beginning-based, end-based, both-edges, midpoint, preceding-letter and following-letter (i.e., closed-bigram), trigram, open-bigram with preceding- and following-letter, and syllabic. These schemes have all been previously proposed as manners of representing letter position in reading and/or spelling; see Chapter 2 for review. Three both-edges schemes were considered: a simple combination of beginning-based and end-based schemes, the beginning and end schemes plus immediately-adjacent positions (narrowly-graded both-edges), and finally adding second-adjacent positions (broadly-graded both-edges). Three midpoint schemes were considered, which differed in how central letters were assigned to positions: Scheme

Midpoint1 assigns characters flanking the center of the string to positions -1 and 1, with no character in position 0; Midpoint 2 assigns the character to the left of center position 0 and the character to the right of center position 1; Midpoint 3 assigns the character to the right of center position 0 and the character to the left of center position -1. Closed bigram and open bigram schemes were considered with and without explicit inclusion of word

160 boundaries in the position representations. Additionally, open-bigram schemes were considered which included one or two possible intervening letters.

Orthographic syllabification remains a topic of debate in the literature and therefore the syllabic schemes were tested on two syllabifications of the data which maximized the letters in either coda or onset positions, (CMax and OMax, respectively).

The syllabification algorithms labeled each syllable within the response with a syllable number and delineated the role (onset, nucleus, coda, with the possibility of multiple letters per role) held by each letter within a syllable. For example, the response

NEBULSH was coded as having two syllables: NE.BULSH (OMax) or NEB.ULSH

(CMax). Considering the CMax syllabification by way of example, syllable one15 (NEB) would contain Onset1-Nucleus1-Coda1 roles and syllable two (ULSH) would contain

Nucleus1-Coda1-Coda2-Coda3 roles. Three syllabic schemes were tested in the position analysis, varying in the manner in which they considered within-syllable positions. The first scheme, Specific Roles, assigned position matches based on specific syllabic roles

(e.g., Coda2 position matches Coda2 position, HAND = CARD), the second scheme,

Graded Roles, assigned matches on basic syllabic role (i.e., any coda position matching any other coda position in the same syllable), and the third scheme, Graded Adjacent

Roles, considered letters to match in syllable position if they occupied the same basic role in the same or adjacent syllable within the response. Syllabic schemes were tested using two position analyses: the retrospective method (described in the previous section) and the prospective method, which is more appropriate to the syllabic schemes (described below).

15 This is a simplification for explanation of the OMax/CMax distinction. In the analysis each syllable was assigned position based on counting from the beginning of the response (NEB = syllable E+1) as well from the end of the response (NEB = syllable B-1).

161

The retrospective method gives the syllabic schemes an advantage in position matches because position within syllables depends on letter identity. The identity of a letter partially determines its syllable role (i.e., are nuclei, consonants are onsets or codas), and therefore searching for a particular letter identity in a source trial is not independent from ascertaining whether that letter‘s position matches its position as an intrusion. For example, in searching for a source for the intruded U in the response loup, the analysis will consider the previous response, nebulsh, which also contains a U. The retrospective position analysis considers whether these two Us are in the same position, and they are both in a nucleus position, resulting in a match for the Graded Roles scheme.

However this match is misleading: U will necessarily be in a nucleus position in both the intrusion and source responses due to its vowel status, resulting in an inflated number of position matches for syllabic schemes.

To avoid this problem with syllabic scheme analyses, McCloskey and colleagues

(2013) developed an alternative position analysis method which does not search previous trials for position matches but instead synthesizes intrusion errors and compares these to the observed data. The prospective method uses the position of each source letter to generate potential intrusion responses with that letter in the same position. For a given stimulus, the generated response will contain an intrusion of the source letter at the same position at which it appeared in the source trial, through either substitution of the letter in that position or insertion of the source letter (if the position was unoccupied in the stimulus). The generated responses are then compared to the observed intrusion responses in the data set and any matching responses counted as position matches for that scheme. Chance position estimates in the prospective method were calculated using

162 randomly selected trials from outside the immediate context of the source trial, generating predicted errors, and comparing these to the observed responses. Unlike the retrospective method, the prospective method is limited to perseveration responses which have only a single intrusion error because of the response generation process: it is unclear how responses containing more than one error should be generated (using what order of intrusions).

Result reporting is divided into two sections: evaluation of non-syllabic position schemes (using the retrospective method), and evaluation of syllabic position schemes

(using the retrospective method and the prospective method). Retrospective analyses of non-syllabic schemes first considered beginning-based and end-based position schemes, followed by a both-edges scheme, as these schemes have been extremely successful in previous work. Subsequently the graded both-edges schemes were tested. Other schemes were then compared to this scheme. All analyses which compare schemes used the residuals method described in McCloskey et al., (2013), which uses one scheme as the base scheme, and considers the ability of the additional scheme(s) to correctly predict the position of perseveration errors whose position is not predicted by the base scheme.

Long Duration Experiment Results

In the long duration experiment, accuracy was 99% for words and 86% for pseudowords. Accuracy on short pseudowords approached ceiling (93%) while performance on longer pseudowords was less accurate (75%). Perseveration analyses revealed that letters did not intrude into later trials at rates higher than expected by

163 chance (p > .4).16 This result suggests that typing errors do not surface as perseveration errors in this task. Errors due to typing therefore only added noise to the perseveration analysis, making it more difficult to reveal any perseverations from true misidentifications in the brief duration experiment.

Main Experiment Results

Accuracy was 49.7% overall (14.0% for pseudowords, 56.8% for words), and remained essentially constant over the experiment (50% correct in the first quarter, 51% in Q2, 48% in Q3, 49% in Q4). For word stimuli, 83.6% of all responses were words, and of errors, 62.4% were other words. Of these incorrect word responses, 54.4% were more frequent than the stimulus word. For pseudoword stimuli, 52.8% of errors were words. These results suggest a slight bias to respond with a word, which was stronger when the stimulus was a word, suggesting that the participants may have had implicit or explicit knowledge of the lexicality of the stimulus even when they were not sure of all of the letter identities present.

There were 13,951 intrusion errors overall in the twenty participants‘ data. Error types were very similar for the word and pseudoword stimuli: Errors were mainly substitutions (words: 34% of errors, PWs: 38%), followed by deletions (words: 17%,

PWs: 24%), insertions (words: 9%, PWs: 8%), and transpositions (words: 4%, PWs: 5%).

16 For comparison, the perseveration analysis was run on the first 360 trials from three participants (randomly selected) from the main experiment. This analysis revealed 1066 intrusion errors, with significant perseverations present (p< .05 at I-1).

164

Perseveration and position analyses

Perseverated letters were found with sources up to six trials prior to the intrusion trial (I-6; p < .0001). The number of true perseverations was estimated at 1708 over the perseveration window.

Position analyses first considered the ability of end-based and beginning-based position schemes to account for the position of the perseveration errors. Beginning- based and end-based positions schemes correctly matched the position of the perseveration errors (both ps < .0001), matching .442 and .376 of the true perseverations, respectively. Allowing the end-based scheme to match additional perseverations not matched by the beginning-based scheme, and allowing the beginning-based scheme to match perseverations not matched by the end-based scheme both significantly increase the position matches accounted for by the schemes (ps < .001). This both edges scheme matches 68.0% of the true perseveration errors in the corpus. Adding immediately- adjacent positions to the both-edges (beginning plus end) scheme—i.e., the narrowly- graded both-edges scheme—is a significant improvement (p < .002), and matches 95.1% of true perseverations. All of the other schemes tested were also able to account for significant proportions of the true perseverations, as shown in Table 6.1.

The broadly-graded both-edges scheme is also successful (p < .009) and matches a near-perfect 98.2% of true perseverations. However, this scheme is considering nearly 4 positions as matching positions in the source trial and is therefore at an advantage relative to other schemes, which generally predict only one position. For a more conservative comparison, the narrowly-graded both-edges scheme (immediately-adjacent only) was used as the baseline to which other position schemes were compared.

165

Table 6.1: Results of non-syllabic position schemes Proportion of position

Scheme matches

Beginning-based .442**

End-based .376**

Both-edges (Beg + End) .680**

Narrowly-graded both-edges .951**

Broadly-graded both-edges .982**

Midpoint1 (… -1| 1 …) .334**

Midpoint2 (…-1 0 | 1…) .297**

Midpoint3 (…-1 | 0 1…) .273**

.343** Closed Bigram† [AB] .406**

.410** Closed Bigram including word boundaries .429**

.441** Open Bigram-One [AXB] .465**

.038** Open Bigram-Two [AXYB] .045

** Open Bigram-One including word .673 boundaries .561

166

Proportion of position

Scheme matches

** Open Bigram-Two including word .139 boundaries .110**

Trigram [ABC] .166**

Trigram including word boundaries .170**

*Significant at p < .05. **Significant at p < .001 †Two versions of bigram schemes were tested: considering the left letter as context and the right letter as context. Listed in this order in the table.

Comparisons were made between the narrowly-graded both edges scheme and the other schemes by allowing the other scheme to predict the position of perseverations left unaccounted for by the both-edges scheme. Results are presented in Table 6.2; other than the broadly-graded both-edges scheme only three schemes improved the narrowly-graded both-edges scheme. The reported p-values refer to the ability of the additional scheme to account for perseverations which are not matched by the narrowly-graded both edges scheme; nonsignificant p-values indicate a scheme does not match a significant number of the perseverations unaccounted for by the base scheme.

Three schemes (not including syllabic schemes) successfully accounted for perseverations unaccounted for by the narrowly-graded both edges scheme. Of these, two are midpoint schemes, differing in how positions from the midpoint are assigned to the center characters of an even-length response. These schemes were successful in accounting for true perseverations beyond the narrowly-graded both edges scheme. The midpoint schemes were therefore compared to the broadly-graded both edges scheme.

The Midpoint1 scheme continues to account for seven perseverations beyond the

167 broadly-graded both edges scheme (p < .05), taking the number of true perseverations accounted for to 1528 (proportion of .986), from 1521 (proportion of .982) for the broadly-graded both edges scheme. However, the Midpoint2 scheme did not account for any perseverations beyond the broadly-graded both edges scheme (p > .6). The third scheme with significantly improved the both-edges narrowly graded scheme was the open-bigram scheme considering the left letter as context and word boundaries. When this scheme was added to the broadly-graded both edges scheme, it did not match any additional perseveration errors (p > .2).

168

Table 6.2: Results of adding schemes to Narrowly-graded Both-edges scheme

Scheme added to Narrowly- Proportion of position P-value of graded Both-edges matches added by scheme adding scheme

Midpoint1 0.013* .035

Midpoint2 0.011* .024

Midpoint3 0.011 .081

Closed Bigram† -0.0091 .797

0.0172 .105

Closed Bigram with word -0.00751 .816 boundaries 0.0151 .115

Open Bigram-One 0.0111 .352

0.0113 .297

Open Bigram-Two 0.0248 .206

0.0279 .115

Open Bigram-One with word 0.0111 .386 boundaries 0.0132 .310

Open Bigram-Two with word 0.0428* .013 boundaries 0.0407 .096

Closed Trigram 0.0095 .183

Closed Trigram with word 0.0057 .186 boundaries *Significant at p < .05. †Two versions of bigram schemes: left letter as context, right letter as context, in that order in the table. 1Negative change in proportion of position matches resulted from an increased number of chance matches and an increased number of observed matches

169

Table 6.3 presents values for the reverse comparison: The ability of the narrowly- graded both-edges scheme to match perseverations when added to another scheme. In general, adding the narrowly-graded both-edges scheme to another (non-syllabic) schemes significantly improved added perseveration position matches. However, four exceptions exist, schemes to which the addition of the both-edges scheme did not cause a significant improvement: Open-bigram with one and two intervening characters, both with and without word boundaries (i.e., marking of letters at edge of the string as such).

The four schemes are all open-bigram schemes calculated based on the letter to the left of the intrusion as context, rather than the letter to the right. An analysis was conducted to determine whether adding the broadly-graded both-edges scheme would significantly improve these four open-bigram schemes. For three of the schemes (open bigram-one, open bigram-one with word boundaries, open bigram-two with word boundaries), the addition of the broadly-graded both-edges scheme contributed significant position matches beyond the base scheme (ps < .05); for the fourth scheme the broadly-graded scheme contributed only marginally to accounting for the remaining perseveration errors

(p < .08).

170

Table 6.3: Results of adding Narrowly-graded Both-Edges scheme to other schemes

Proportion of position matches added P-value of adding Narrowly- Base scheme by Narrowly-graded Both-edges graded Both-edges

Midpoint1 0.630* < .001

Midpoint2 0.665* < .001

Midpoint3 0.690* < .001

Closed Bigram† 0.576* 0.007

0.523* 0.012

Closed Bigram with 0.523* 0.009 word boundaries 0.489* 0.001

Open Bigram-One 0.451 0.085

0.453* 0.044

Open Bigram-Two 0.373 0.150

0.439* 0.030

Open Bigram-One 0.279 0.171 with word boundaries 0.332* 0.006

Open Bigram-Two 0.112 0.128 with word boundaries 0.180* 0.024

Closed Trigram 0.795* 0.006

Closed Trigram with word 0.764* 0.001 boundaries *Significant at p < .05. †Two versions of bigram schemes: left letter as context, right letter as context, in that order in the table.

171

Syllable-based schemes

Analyses of syllable-based schemes were conducted using two different analysis methods, neither of which is ideal for testing syllabic schemes on this particular data set.

The retrospective method, used for all non-syllabic schemes, is not appropriate because it gives the syllabic schemes an advantage for position matches. The prospective method, which is appropriate for testing the contribution of syllabic schemes, suffers from the requirement that only responses containing a single intrusion can be included combined with the fact that the current data set has very few responses containing only one intrusion error. Only 1394 potential perseveration-source response pairs could be included in this restricted analysis, while the retrospective method analyzed 15534 pairs, a much larger and more robust set of data. Despite these disadvantages, the syllabic schemes were tested using both schemes and both results presented here, but they should be interpreted with caution.

Using the retrospective method, all three syllabic schemes were highly successful in accounting for position matches (see Table 6.4). As with the non-syllabic schemes, these schemes were then compared to the both-edges narrowly-graded scheme. With the

CMax syllabification, only the graded adjacent roles scheme accounted for any perseverations beyond those accounted for by the both-edges narrowly-graded scheme (p

< .01), and this scheme also accounted for perseverations beyond the both-edges broadly- graded scheme (p < .01). With the OMax syllabification, all three of the schemes were successful at matching positions beyond the narrowly-graded both-edges scheme (ps <

.05) as well as beyond the broadly-graded both-edges scheme (ps < .01).

172

Table 6.4: Results of syllabic scheme analyses

CMax OMax

Proportion of Proportion of position matches position matches Retrospective Specific Roles .771* .803*

Graded Roles .878* .878*

Graded Adjacent .994* .979* Roles Prospective Specific Roles 1.10*† 1.09*†

Graded Roles .902 .528

Graded Adjacent 1.23† .827 Roles

*Significant at p < .05. †Proportion exceeding 1.0 suggests that insufficient data was included in the analysis for accurate estimates.

With the prospective method, only the specific roles scheme was successful at accounting for perseveration positions (with both syllabification schemes). Comparisons to the both-edges narrowly-graded scheme were not conducted as this scheme was unsuccessful at accounting for the position of perseverations in the restricted corpus

(added ps > .2 when adding end to beginning scheme or vice-versa to create both-edges scheme). Instead, comparisons were made to the successful beginning-based scheme (p <

.05). With both syllabifications only the specific roles scheme improved the beginning- based scheme (added p < .05, adding 21 and 34 true perseveration matches under CMax and OMax, respectively). Using the prospective method to calculate the proportion of true perseveration matches for the narrowly-graded both-edges scheme gives an estimate of .80 (no difference between CMax and OMax syllabification); the proportion of

173 perseverations matched by the specific roles scheme numerically exceeds this value

(CMax: 1.10, OMax: 1.09).

The results of the prospective analysis are plagued by the small amount of data which could be analyzed using this method. Estimates of the proportion of true perseverations which exceed 1.0 indicate that the estimates of chance in the analysis are unreliable. Therefore, the results of this analysis method should be interpreted with caution and strong conclusions cannot be drawn.

Discussion

The analyses in this study were divided into three parts: retrospective analyses of non-syllabic schemes, retrospective analyses of syllabic schemes, and prospective analyses of syllabic schemes. Each of these results will be discussed in turn followed by general observations and discussion of the particular level(s) at which the perseveration errors may have arisen.

The vast majority of the non-syllabic schemes considered were able to account for the position of significant numbers of the perseveration errors in the dataset.

Comparisons were made between the both-edges narrowly-graded scheme and the other schemes to determine the relative success of these schemes. The both-edges narrowly- graded scheme was used as a base scheme, with the other schemes given the opportunity to account for any perseveration errors not accounted for by the base. This analysis revealed only three schemes which could significantly improve on the narrowly-graded both-edges scheme. When the both-edges broadly-graded scheme was used as the base scheme instead, only one of these schemes remained able to add position matches: a midpoint scheme which matched seven additional perseverations (base scheme matched

174

1528). Though this scheme significantly improved on the both-edges scheme, the contribution of this scheme is extremely small.

In addition to considering the contribution of other schemes on top of the narrowly-graded both-edges scheme, I also considered the ability of the both-edges scheme to improve on the other schemes. If the narrowly-graded both-edges scheme approximates the true positions of perseveration errors in the data, and to the extent that it is making distinct predictions from the other schemes, adding this scheme to other schemes should improve their ability to account for the perseveration positions. Four open-bigram schemes were not significantly improved by the addition of the narrowly- graded both-edges scheme. Of these, one scheme—the open-bigram scheme with up to two intervening letters and no marking of word edges—was only marginally improved by the addition of the broadly-graded both-edges scheme (p < .08). On the whole, adding the graded both-edges schemes to other schemes significantly improved the ability to account for the position of perseveration errors in the dataset.

Syllabic analyses were analyzed using two distinct methods: the retrospective method used for the non-syllabic schemes, and a prospective method. Of these, the retrospective method is not wholly appropriate for testing syllabic schemes because the syllabification of responses depends in part on the identity of the letters, resulting in an over-estimation of syllabic position matches. As expected, use of this method results in extremely high proportions of true perseverations accounted for by the syllabic schemes, exceeding the both-edges narrowly- and broadly-graded schemes. When the syllabic schemes were tested using the more appropriate retrospective method, the Specific

Syllabic Roles scheme was the only scheme which could account for significant numbers

175 of the true perseverations. However, the retrospective analysis is limited to responses containing only a single intrusion error, which vastly reduced the amount of available data and resulted in unreliable chance estimates for all of the retrospective analyses.

Comparisons of the Specific Syllabic Roles scheme to the narrowly-graded Both-edges scheme indicated that the syllabic scheme was more successful, though this result should be interpreted with caution due to insufficient size of the restricted data set.

The primary result of this experiment is that perseverations made by unimpaired readers tended to maintain graded both-edges position. This result leads to the conclusion that graded both-edges position is used to represent letter position during word recognition: representations are activated at particular positions defined by distance from the beginning and end of the string and residual activation of these representations causes them to intrude into later trials. This accords with previous results pointing to use of the both edges scheme in other word recognition and reading paradigms (Fischer-

Baum et al., 2011; McCloskey et al., 2013). A small contribution beyond the both-edges scheme was made by a midpoint scheme (assigning characters flanking the center of the string to positions -1 and 1, with no character in position 0). This empirical demonstration of the utility of both-edges position coding builds upon previous work and is consistent with a large corpus of masked priming and other behavioral results suggesting that letters in adjacent and near-adjacent positions have similar representations.

However, the conclusion that graded both-edges position is the most successful position scheme is tempered by the fact that syllabic schemes could not be adequately tested using this dataset. Previous authors‘ use of the prospective method with sufficient

176 data revealed that the narrowly-graded both-edges scheme was more successful than syllabic schemes. Whether those results contrast with the current results due to the differing data sources (individual with acquired dyslexia versus unimpaired readers) or the small size of the data set is unclear; future research should elicit responses containing single intrusion errors from unimpaired participants for a more robust analysis of the syllabic schemes.

Open bigram position schemes—frequently posited in models of letter identification and word recognition—made some contributions to accounting for the positions of the perseverations in this study. They generally performed inferiorly to the both-edges scheme. However, when adding the both-edges graded schemes to open- bigram schemes considering the left-letter as context, only the broadly-graded scheme was able to significantly improve these open-bigram schemes. Both the open-bigram schemes and the broadly-graded both-edges scheme consider a large number of positions in the source response as matches due to their graded nature; a more robust test of these schemes would be conducted on responses containing more letters than the 4-8 included in this study.

Source of perseveration errors

On a more basic level, this experiment has demonstrated that perseveration errors can be elicited from unimpaired participants performing a word recognition task. This is consistent with assumptions that letter identification occurs in a cascaded system in which multiple representations are active at any given time and compete for selection.

Considering the two underlying causes of perseveration errors discussed by Fischer-

Baum and Rapp (2012), the errors observed in this study likely reflect a failure to activate

177 representations on the current trial, rather than a failure to inhibit those active from previous trials. As discussed previously, the experimental paradigm was designed to limit visual information about the stimulus, which presumably leads to weak activation throughout the identification system due to weak and/or noisy inputs. This allowed representations which retain residual activity from previous trials to be successful competitors in the identification process, resulting in perseveration errors. Therefore, the results of this experiment suggest that it is a normal feature of the letter identification system for letters to maintain residual activation in time. A question for future work concerns the nature of this residual activation: Does it remain for a certain amount of time or for a certain number of following uses of the identification system, or some interaction of these two? Does a representation which was more strongly activated retain activation for longer than one which was more weakly activated? Residual activation is likely due to an underlying decay process, and these questions address the nature of that process within the broader dynamics of the identification system.

Conclusions about the level of position representation

Within the letter identification theory under discussion, the errors elicited in this task could have arisen at any of the five levels of representation involved in word recognition (visual feature, character shape, allograph, abstract letter identity, orthographic lexicon). The present task was to type the stimulus after it disappeared, which is similar to a same-case delayed-copy task with the additional requirement of translating the visual information to key press motor commands, a process which is made possible by processing at least through the level of abstract letter identities. As with the masked priming lexical decision studies (i.e., transposition and relative position priming),

178 it is unclear whether a single level gives rise to the effect or if it results from the combination of multiple levels of processing. However, some observations help to delineate the likely sources of the errors.

One relevant aspect of the paradigm is that stimuli were presented at different viewer-based positions (to the left and to the right of fixation); perseveration responses with sources in the other position would suggest that the perseverations arose at a more- abstract level than the viewer-based visual feature level. To test this possibility I conducted perseveration and position analyses limited to perseverations occurring in trials in which the previous stimulus was in the opposite visual field; sources in this analysis were restricted to the opposite visual field as the current trial stimulus. This analysis revealed significant numbers of perseveration errors (p < .0001). Position analyses revealed that position was maintained according to the both-edges narrowly- graded scheme (p < .0001, .98 of true perseverations accounted for). Perseverations occurred despite the presentation of stimuli at different positions with respect to fixation, ruling out perseverations at the viewer-based feature level, which would not consider two stimuli on opposite sides of fixation to have any matching positions.

It is also likely, based on the higher accuracy found for word than pseudoword stimuli, that lexical representations were used in this task, giving a boost to the ability to recall word stimuli over pseudowords (a similar suggestion is made by Kinoshita &

Lagoutaris, 2010 to explain easier recall of pseudowords than nonwords). However, given that the analysis depends on errors rather than correct trials, and many of the responses are pseudowords or nonwords, it is unlikely that the errors arose solely in the orthographic lexicon. In summary, though a clear delineation of the level at which the

179 errors arise cannot be reached, it is likely that the perseveration errors under consideration resulted from activation at the character shape, abstract letter identity and/or lexical levels. In turn, this suggests that graded both-edges position is used to code letter positions at one of more of these levels.

In summary, the results of this study suggest that graded both-edges coding of letter position is employed in word recognition. Future work should pay particular attention to potential roles of midpoint, open-bigram, and specific role syllabic schemes, which were close competitors to the both-edges scheme across various analyses. These results accord with previous results from Fischer-Baum and colleagues, as well as providing a conclusion consistent with extensive evidence in favor of flexible position coding in letter identification.

180

CHAPTER 7: GENERAL DISCUSSION

In this dissertation I explored the cognitive processes of letter and digit identification. This is the process of translating from visual letter and digit stimuli into abstract representations of character identities and positions. In Chapter 2 I reviewed extensive past research on letter identification and the position code(s) involved. Chapter

3 focused on digit identification, a topic which has seen little research, and the relationship between letter and digit identification. I reviewed research leading to the proposal of a shared letter and digit identification system, as well as outlining the commitments of this theoretical position. Chapters 4 through 6 detailed behavioral studies with unimpaired participants, summarized in the following section.

Summary of experiments

Study 1

Study 1 involved comparisons of the properties of lowercase letters, uppercase letters, and digits, with the intent to determine whether various hypotheses for the digit identification advantage are viable. The particular hypotheses are that digits are faster and easier to identify due to higher digit than letter frequency, or lower average/maximum digit overlap than letter overlap. Visual overlap was tested as pixel overlap and feature overlap, through a feature set and overlap rules I developed. Overall, the results of Study 1 indicated that the variables of character frequency, visual overlap, discriminability, and complexity do not differ consistently between the classes of letter and digit stimuli. Visual overlap, discriminability and complexity were all computed using both pixel and feature metrics to provide a more general answer than using either

181 type of metric alone. For all variables tested there was considerable intermixing of values for the two character categories, and in some cases (e.g., frequency for lowercase letters) a difference between the categories was found in the reverse direction needed for that variable to explain higher digit than letter accuracy. In some cases a difference arose between similarity measures computed with just same-category characters as competitors and those computed treating all characters as competitors. The shared system architecture assumes that all characters compete with all other characters during identification; if this theory is correct the relevant comparison class is to all other characters rather than only same-category characters. A number of the results in this study provided partial or suggestive evidence for a role of character frequency or visual properties in facilitating digit over letter identification, but overall a consistent picture did not emerge.

This study was meant to evaluate the preconditions of two classes of hypotheses explaining the letter/digit performance discrepancy which has been reported in the literature. However, the results did not suggest that frequency or visual properties underlie this consistent empirical phenomenon. Rather, further consideration should be given to the other two hypotheses discussed in Chapter 3: fewer digit identities, and feedback from semantics and/or the right hemisphere. One or more of these hypotheses may combine with the small effects found in Study 1; their influences on the identification system remain to be tested in the future.

Study 2

In Study 2 I elicited errors in the identification of letters and digits in mixed strings. Unimpaired participants viewed these strings briefly and then responded by

182 typing their response after stimulus offset. Most errors were substitution errors and analyses considered the patterns of confusions among characters. One primary finding was the independence of letter/digit category: response characters did not preserve the category of their target character. Analyses of the full matrix (target character by response character) of substitution errors revealed influences of visual similarity, visual complexity, character frequency, similarity of character names, and numeric distance.

Though numerous previous studies have investigated visual similarity and complexity effects on letter identification, this is the first to demonstrate their effects on digit identification as well. The shared influence of some of these variables on Letter-for- letter, Digit-for-digit, and Mixed category responses is consistent with the shared letter and digit identification systems. Further, the lack of preservation of letter/digit category in error responses, combined with the conclusion that some of the errors arise from late and post-identification stages of processing are suggestive of a shared identification system as described in Chapter 3.

Study 3

Study 3 analyses tested the same independent variables tested in Study 2, but with dependent measures from a same/different judgment task on pairs of characters. Both response time and accuracy were analyzed, to Same and Different trials. Results suggested that both pixel and feature overlap made independent contributions to explaining the RT on Different trials; the success of the feature overlap metric beyond that of pixel overlap supports the feature overlap metric as an approximation of visual similarity in the identification system. Frequency, normalized distance

(alphabetic/numeric), and the match of mismatch of the pair on category also contributed

183 to explaining the response time. Overall, mixed-category trials were responded to more quickly than same-category trials, suggesting some impact of category knowledge.

Analyses of Same trials further validated the feature metric as well as providing additional evidence in favor of a shared identification system. A successful predictor involved feature overlap between a particular character and all other characters (rather than other within-category characters); this is consistent with the assumption of the shared system that all characters compete with each other. Other conclusions of the study concern the nature of the same/different task, which showed influence of not only character identification but also higher-order knowledge.

The results of Study 2 and Study 3 did not follow the commonly-reported pattern of higher digit than letter accuracy. This suggests that though the finding is particularly pervasive on a set-wise level (comparing average digit performance to average letter performance); the performance on individual characters from both sets is determined by a combination of factors and does not cleanly split along the category boundary. By- character investigations of other paradigms and participant groups will reveal whether any observed digit-letter distinction is a full separation of the two sets or a misleading portrait of intermixed performance. Further consideration of underlying sources of this performance are also warranted, as described above.

Study 4

In Study 4 I investigated position representation in letter identification. A large body of research has addressed this topic in recent years, with differing conclusions about the position code(s) used in letter identification. One main area of convergence has been that the letter position code is flexible, with high similarity between letter representations

184 in adjacent positions. Study 4 introduced a novel source of data for this question: perseveration errors elicited from unimpaired readers in a brief-duration delayed-typing task. Perseveration errors reflect residual activation of letters from previous trials; due to the position-specific nature of the identification process these representations persist at particular positions. When they appear as perseveration errors, one can ask what definition of position best describes the letter in both its original position and as a perseveration error: these should be in the same position according to the position code used in the representations.

The corpus of perseveration errors collected from the participants was analyzed to determine what position scheme best described the position maintained by the perseverated letters. The results favor a position scheme which considers letter positions based on distance from the beginning and end of the letter string, with graded rather than absolute position. Letters are considered to be most strongly represented at their precise position (e.g., Beginning+3) but also weakly represented at immediately-adjacent

(Beginning+2) and second-adjacent positions (Beginning+1). This broadly-graded both- edges position scheme was highly successful in accounting for the position of the perseveration errors with respect to their sources. Other schemes were also successful, in particular open-bigram schemes considering the letter to the left of the perseverated letter as context; future work should continue to compare these schemes. Syllabic position codes were also tested, in which letter positions are assigned based on delineation of orthographic syllables within the responses. These schemes had some success in predicting the perseveration positions, but appropriate analyses for comparing these

185 schemes to the both-edges scheme were underpowered in this study and require additional data with fewer errors per response.

Though this study provided support for graded both-edges position coding in word recognition, it is unclear at precisely which level(s) of representation this code is used. Prior work studying the graded both-edges scheme has suggested that it is employed at allograph and abstract letter identity levels. The perseveration errors analyzed in Study 4 may have arisen through residual activation at a combination of these levels and through a combination of factors; future work should aim to pinpoint the mechanism through which they arise and the particular level or levels of representation at which both-edges position coding are used.

Conclusions

I have focused on two subtopics in letter and digit identification: the overlap of letter and digit processing, and the nature of position coding specifically in word recognition. The results of Studies 2 and 3 are consistent with a single shared system for letter and digit identification. In the shared identification system, no distinction is made between the computations carried out for letters and digits, and representations of both category types compete for activation. This conclusion remains to be further tested and expanded through future work addressing the dynamics of activation within such a system.

The second avenue of investigation specifically considered position coding of letters, explored through a word recognition task. Support was found for graded both- edges coding of letter position at work within the letter identification system. A broad topic for future research is the integration of the shared systems theory with the large

186 literature on letter position representation, and the extent to which findings of flexible position coding are unique to letters (e.g., applying at lexical levels) or can be extended to digit position representation within identification.

Future directions

Specific questions for future research have been noted throughout this thesis; additional topics are further delineated in this section. One broad topic concerns the visual similarity and other relationships between lowercase letters and digits. Studies 2 and 3 only utilized uppercase letters, but the full picture of letter/digit similarity includes both cases. Confusion matrices for lowercase letters and digits (and reaction times to pairs) are expected to differ from those obtained for uppercase letters and digits due to differences in visual similarity (e.g., 4 is visually similar to A but less similar to a).

Allographic variation in letters, such as g and g, also differ in visual appearance and therefore visual similarity. A recent study by Simpson, Mousikou, Montoya, and Defior,

(2012) collected similarity ratings for a wide range of letters in both cases and included different allographs of the same letter, and found differences in subject similarity between the two allographs of lowercase a, providing further evidence for the importance of considering allographic variation. Along with these questions comes the possibility to refine the feature set developed here, in order to approximate as closely as possible the character features used in identification. With the existing empirical datasets from Study

2 and 3 one could alter the feature set and test whether the resulting overlap measure was more or less able to account for character confusability and same/different response time.

Some of the substitution and addition errors in Study 2 were perseveration errors

(as is unsurprising given the similarity between Study 2 and Study 4 methods). These

187 errors were not analyzed as such in the analyses presented here, but an interesting question about the dynamics of character identification concerns the relationship between residual activation (leading to perseverations) and current-trial activation (e.g., from visual similarity). Schubert and McCloskey (2013) found that perseverations produced by LHD in reading were often visually-similar to the target letters, suggesting that the two sources of activation combine to jointly influence error patterns. Preliminary analyses on this topic could compare the characteristics of substitution errors which are likely to be perseverations to others which are unlikely perseverative. Perseverations also might explain why the observed pattern of response categories in Study 2 only approximated but did not match the frequencies of the categories in the stimuli; perhaps residual activation boosted particular responses and altered the overall pattern.

Another aspect of the data (from Studies 2 and 3) to consider is the presence of asymmetries in the confusion matrix and RT matrix. Both were symmetrized for analysis, but analysis of the asymmetries may reveal properties of the character identification system. In Study 2, asymmetries show instances in which character X was more likely to be given as a response to character Y than character Y was to be given as a response to character X. Particular properties of characters (e.g., frequency, complexity) may make them more likely to be given as responses than others, which would cause these asymmetries despite the symmetrical nature of inter-character similarity (e.g., visual overlap).

The analyses conducted in Study 2 and Study 4 only begin to scratch the surface of the rich dataset of errors elicited in the delayed-typing character identification task.

Only substitution errors were analyzed in Study 2 and substitution and addition errors in

188

Study 4, but deletion and transposition errors were also present. Multiple letter position schemes provide predictions for the prevalence of different types of transposition errors due to the assumption of a positional gradient across letter positions (e.g., Davis, 2010;

Fischer-Baum, Charny, & McCloskey, 2011; Gomez, Ratcliff, & Perea, 2008;

McCloskey, Fischer-Baum, & Schubert, 2013; Norris & Kinoshita, 2012) These schemes predict that immediately adjacent transpositions will be more likely than second-adjacent transpositions. This question has recently been addressed in studies of individuals with developmental dyslexia, another source of relevant evidence (e.g., Friedmann &

Rahamim, 2007; Kohnen, Nickels, Castles, Friedmann, & McArthur, 2012).

In Study 4 a small lexical bias was present, such that participants sometimes reported a word when a pseudoword was presented. Though this effect is numerically small, it is likely to exceed the chance rate of reporting a word through random letter substitutions, transpositions, etc. in pseudowords. If this effect is reliable, and to the extent that the word-errors also contain position-preserving perseveration errors, it may suggest interactions between the orthographic lexicon and the identification of letter identities at previous levels. For example, when multiple lexical entries are activated by an incoming string (either word or pseudoword), feedback is thought to boost the activation of letter identities present in these words. This top-down effect likely combines with the demonstrated effect of residual activation of letter identities. If any of the residually-active letter identities are present in the active lexical entries, these particular entries would in turn be boosted. By hypothesis, these dynamics would result in a bias to select (and respond with) with a word which is both orthographically related to (i.e., contains letters of) the current stimulus and orthographically related to previous stimuli.

189

This prediction remains to be tested, through existing Study 4 data and/or the collection of additional relevant data.

Future work could also consider the underlying mechanism of perseveration errors, asking whether residual activation remains over a number of trials/intervening words read, or an amount of time. Analyses of data from a subset of participants in Study

4 revealed a surprisingly small incidence of perseveration errors in the first half of the experiment (in the first 360 trials; see Footnote 16, p. 158) relative to the large number present in the full dataset. This may relate to a successive buildup of residual activation throughout the identification system. These questions can be addressed through careful control of experiment ITI and duration as well as the number of intervening trials. For example, trials eliciting errors (e.g., stimulus presented briefly) could be interspersed with trials with ceiling performance to examine the resulting perseveration errors.

Digit identification has only recently become a topic of study despite a long history of research on mathematical and numerical abilities. Similarities between the sets of letters and digits have led me to consider both in this work, but much of the existing body of knowledge about letters and letter identification remains to be considered for digits and digit identification. This work was only a start to this enterprise. Future work will consider questions of the position representation of digits, including consideration of any differences that may arise due to the nature and later processing of the digit stimuli

(strings of random digits versus a six-digit number). Feedback from semantic representations may interact with digit position representation and identification, just as feedback from lexical entries may interact with the activation of letter identities.

Furthermore, the nature of digit allographs, posited by McCloskey & Schubert (2014 and

190

Miozzo & Caramazza (1998) has not been investigated: what allographic variation is present among digit forms (e.g., 4 and 4)? Neuropsychological study such as the tasks conducted with patients HR and LHD may provide a source of evidence for this question

(Rapp & Caramazza, 1989; Schubert & McCloskey, 2013).

The inquiries described above are only a few of the possible future research directions stemming from the current research. As an automatic but not innate skill with which modern humans have extensive practice, character identification is a fascinating topic for addressing questions about recycling of cognitive and neural processes that evolved to accomplish non-reading tasks (such as identifying objects or faces). Though much work has been done on this topic, many open research avenues remain

191

APPENDIX A: CHARACTER FEATURE SET

The feature classes used are Orthogonal, Slant, and Curve. These describe the large categories of visual features found in uppercase letters. Within the Orthogonal class are Horizontal and Vertical features; within the Slant class are Slant-left and Slant-right features, and within the Curve class are Curve-facing-up, Curve-facing-down, Curve- facing-left, Curve-facing-right, and Curve-Closed (loop) features. Each letter is considered within a rectangle in letter-space, which has a vertical height of Full and a horizontal width of Half (Half is also used as the size specification for a feature which extends half the height in the vertical dimension). Slant lengths are approximate; a slant extending from the top to bottom of letter-space gets a length of Full even though it is physically longer than an Orthogonal feature extending from the top to bottom. Curves have a size parameter indicating the length in degrees covered by the curve.

To specify positions in letter-space, the FullxHalf grid is subdivided into 4 sections, with the vertices of the grid labeled by their position in the horizontal dimension

(Top, Middle, Bottom) and in the vertical dimension (Left, Center, Right). These vertices are abbreviated by their first letter, and each feature is defined by two positions approximating the two ends of the feature. The system is arbitrarily set such that Position

2 is always downward and/or to the right of Position 1. For example, the Slant left feature in A is described by Position 1 of Center-Middle (CM) and Position 2 of Right-

Bottom (RB).

192

Table A1: Features for uppercase letters and digits

Character Feature class Feature Size Position 1 Position 2 A Slant Slant left Full CM RB Slant Slant right Full CM LB Orthogonal Horizontal Half LM RM B Orthogonal Vertical Full LT LB Curve Curve facing left 180 LT LM Curve Curve facing left 180 LM LB C Curve Curve facing right 180 RT RB D Orthogonal Vertical Full LT LB Curve Curve facing left 180 LT LB

E Orthogonal Vertical Full LT LB Orthogonal Horizontal Half LT RT Orthogonal Horizontal Half LM RM Orthogonal Horizontal Half LB RB F Orthogonal Vertical Full LT LB Orthogonal Horizontal Half LT RT Orthogonal Horizontal Half LM RM G Orthogonal Vertical Half RM RB Orthogonal Horizontal Half CM RM Curve Curve facing right 180 RT RB H Orthogonal Vertical Full LT LB Orthogonal Vertical Full RT RB Orthogonal Horizontal Half LM RM I Orthogonal Vertical Full CT CB Orthogonal Horizontal Half LT RT Orthogonal Horizontal Half LB RB J Orthogonal Vertical Full RT RB Orthogonal Horizontal Half LT RT Curve Curve facing up 180 LM RM K Orthogonal Vertical Full LT LB Slant Slant left Half RT LM Slant Slant right Half LM RB L Orthogonal Vertical Full LT LB Orthogonal Horizontal Half LB RB

M Slant Slant left Full LT LB Slant Slant left Half LT CM Slant Slant right Half RT CM Slant Slant right Full RT RB N Orthogonal Vertical Full LT LB Orthogonal Vertical Full RT RB

193

Character Feature class Feature Size Position 1 Position 2 Slant Slant left Full LT RB O Curve Curve closed 360 CT CB P Orthogonal Vertical Full LT LB Curve Curve facing left 180 LT LM

Q Curve Curve closed 360 CT CB Curve Curve right 90 CB CR

R Orthogonal Vertical Full LT LB Curve Curve facing left 180 LT LM Slant Slant left Half LM RB S Curve Curve facing left 180 RT CM Curve Curve facing right 180 CM LB

T Orthogonal Vertical Full CT CB Orthogonal Horizontal Half LT RT

U Orthogonal Vertical Full LT LB Orthogonal Vertical Full RT RB Curve Curve facing up 180 LB RB V Slant Slant left Full LT CB Slant Slant right Full RT CB

W Slant Slant left Full LT LB Slant Slant left Half CM RB Slant Slant right Half CM LB Slant Slant right Full RT RB X Slant Slant left Full LT RB Slant Slant right Full RT LB

Y Orthogonal Vertical Half CM CB Slant Slant left Half LT CM Slant Slant right Half RT CM Z Orthogonal Horizontal Half LT RT Orthogonal Horizontal Half LB RB Slant Slant right Full RT LB 0 Curve Curve closed 360 CT CB 1 Slant Slant right Half LM RT Orthogonal Vertical Full CT CB Orthogonal Horizontal Half LB RB 2 Slant Slant right Full RT LB Orthogonal Horizontal Half LB RB Curve Curve down 180 LM RM 3 Curve Curve left 180 LT LM Curve Curve left 180 LM LB

4 Slant Slant right Full CT LM Orthogonal Vertical Full RT RB

194

Character Feature class Feature Size Position 1 Position 2 Orthogonal Horizontal Half LM RM 5 Orthogonal Vertical Half LT LM Orthogonal Horizontal Half LT RT Curve Curve left 180 LM LB 6 Curve Curve closed 360 CM CM Curve Curve right 90 LM RT

7 Orthogonal Horizontal Half LT RT Slant Slant right Full RT LB

8 Curve Curve closed 360 CT CT Curve Curve closed 360 CM CM

9 Curve Curve closed 360 CT CT Curve Curve left 90 RM LB

To generate the overlap matrix for all characters, the rules in Table A.2 were followed for each feature in every pairwise combination of characters. The overlap values for a character-pair where then summed and divided by 3 times the product of the number of features in each of the two characters.

Table A2: Rules for computing feature overlap

Overlap type Units of overlap assigned Feature class only .5 Feature class and feature 1 Feature class, feature, and feature size 2 Feature class, feature, feature size, and position 3 (maximum)

195

Table A3: Feature overlap matrix (lower triangle only)

0 1 2 3 4 5 6 7 8 9 A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 .06 .17 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 .17 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 .26 .17 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 .06 .15 .11 .19 .15 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 .17 0 .06 .17 0 .06 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 7 0 .19 .28 0 .25 .19 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 .33 0 .06 .17 0 .06 .5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 9 .17 0 .06 .25 0 .08 .29 0 .5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 A 0 .15 .17 0 .22 .09 0 .17 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 B .11 .09 .06 .56 .09 .24 .11 .03 .11 .17 .02 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 C .17 0 .06 .17 0 .06 .25 0 .17 .17 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 D .08 .14 .06 .33 .14 .19 .08 .04 .08 .13 .03 .39 .08 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 E 0 .31 .21 0 .31 .28 0 .31 0 0 .21 .13 0 .19 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 F 0 .28 .17 0 .31 .28 0 .31 0 0 .2 .15 0 .22 .47 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 G .06 .15 .11 .06 .15 .2 .08 .14 .06 .06 .07 .09 .33 .11 .13 .13 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 H 0 .28 .11 0 .35 .2 0 .17 0 0 .15 .2 0 .31 .43 .46 .2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 I 0 .35 .2 0 .28 .28 0 .31 0 0 .17 .11 0 .17 .51 .48 .24 .39 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 J .06 .19 .11 .06 .22 .2 .06 .19 .06 .06 .09 .13 .06 .17 .31 .31 .17 .31 .31 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 K 0 .19 .11 0 .19 .06 0 .08 0 0 .17 .11 0 .17 .13 .15 .06 .2 .11 .09 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 L 0 .33 .19 0 .28 .22 0 .21 0 0 .14 .19 0 .29 .5 .47 .22 .47 .47 .28 .19 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 M 0 .11 .14 0 .11 0 0 .17 0 0 .22 0 0 0 0 0 0 0 0 0 .22 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 N 0 .2 .06 0 .24 .11 0 .08 0 0 .13 .19 0 .28 .22 .26 .11 .41 .22 .22 .24 .28 .11 0 0 0 0 0 0 0 0 0 0 0 0 0 O 1 0 .06 .17 0 .06 .17 0 .33 .17 0 .11 .17 .08 0 0 .06 0 0 .06 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 P .08 .14 .08 .42 .14 .19 .08 .04 .08 .13 .03 .44 .08 .42 .19 .22 .08 .31 .17 .17 .17 .25 0 .28 .08 0 0 0 0 0 0 0 0 0 0 0

196

0 1 2 3 4 5 6 7 8 9 A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Q .58 0 .06 .17 0 .06 .29 0 .25 .17 0 .11 .17 .08 0 0 .08 0 0 .06 0 0 0 0 .58 .08 0 0 0 0 0 0 0 0 0 0 R .06 .11 .06 .28 .11 .13 .06 .06 .06 .08 .07 .3 .06 .28 .13 .15 .11 .2 .11 .11 .2 .17 .1 .22 .06 .33 .06 0 0 0 0 0 0 0 0 0 S .17 0 .06 .33 0 .14 .21 0 .17 .21 0 .28 .42 .21 0 0 .14 0 0 .06 0 0 0 0 .17 .08 .17 .14 0 0 0 0 0 0 0 0 T 0 .33 .14 0 .28 .28 0 .29 0 0 .14 .14 0 .21 .46 .47 .22 .42 .53 .33 .14 .42 0 .28 0 .21 0 .14 0 0 0 0 0 0 0 0 U .06 .19 .06 .06 .22 .13 .06 .06 .06 .06 .04 .22 .06 .31 .22 .26 .13 .41 .22 .3 .19 .33 0 .37 .06 .31 .06 .2 .06 .28 0 0 0 0 0 0 V 0 .08 .14 0 .14 0 0 .21 0 0 .28 0 0 0 0 0 0 0 0 0 .17 0 .33 .14 0 0 0 .08 0 0 0 0 0 0 0 0 W 0 .11 .11 0 .11 0 0 .17 0 0 .22 0 0 0 0 0 0 0 0 0 .22 0 .33 .11 0 0 0 .11 0 0 0 .33 0 0 0 0 X 0 .08 .19 0 .14 0 0 .29 0 0 .28 0 0 0 0 0 0 0 0 0 .17 0 .33 .19 0 0 0 .08 0 0 0 .42 .33 0 0 0 Y 0 .15 .07 0 .11 .09 0 .11 0 0 .13 .02 0 .06 .07 .07 .09 .09 .07 .06 .22 .06 .28 .13 0 .11 0 .13 0 .08 .07 .17 .22 .17 0 0 Z 0 .26 .3 0 .22 .22 0 .44 0 0 .24 .04 0 .06 .42 .37 .19 .22 .41 .22 .09 .33 .1 .09 0 .06 0 .06 0 .33 .07 .14 .11 .19 .09 0

197

APPENDIX B: CHARACTER CONFUSION MATRIX

198

Table B1: Behavioral confusion matrix from Study 2 (lower triangle only)

1 2 3 4 5 6 7 8 9 A B C D E F G H I J K L M N P Q R S T U V W X Y Z 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 1 19 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 1 7 12 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 4 5 16 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 1 3 2 5 11 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 7 2 1 3 2 6 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 1 6 10 7 9 8 9 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 9 1 6 5 4 2 9 4 25 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 A 2 2 3 27 4 3 0 13 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 B 0 1 4 0 0 5 0 57 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 C 1 4 4 1 2 6 2 1 1 4 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 D 0 3 1 3 1 1 4 2 5 3 11 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 E 3 2 14 4 6 1 2 9 0 2 6 2 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 F 0 0 4 17 8 1 9 2 2 4 9 3 5 30 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 G 0 1 1 1 5 47 3 1 3 3 4 16 5 5 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 H 0 0 0 1 1 0 1 2 1 1 2 2 1 1 5 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 I 231 3 1 2 5 0 2 0 0 2 0 0 0 0 2 1 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 J 7 4 3 3 1 2 28 2 4 0 3 3 3 3 9 18 2 17 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 K 0 0 2 4 3 1 3 1 0 1 2 2 2 0 9 4 3 6 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 L 6 0 2 2 1 4 6 1 0 1 0 2 1 5 2 1 0 10 5 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 M 0 1 0 2 1 0 0 2 0 0 4 4 0 1 2 4 3 0 1 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 N 0 5 0 2 0 1 2 0 3 1 0 2 1 0 0 1 9 1 1 2 0 21 0 0 0 0 0 0 0 0 0 0 0 0 P 0 1 2 2 4 5 4 4 19 3 7 1 7 2 6 5 3 3 3 3 2 1 2 0 0 0 0 0 0 0 0 0 0 0 Q 2 4 1 1 0 3 0 6 2 2 3 8 3 2 4 27 3 1 3 3 0 1 4 5 0 0 0 0 0 0 0 0 0 0 R 1 2 3 4 5 6 3 11 3 0 9 4 3 0 6 3 4 2 2 3 5 0 1 9 2 0 0 0 0 0 0 0 0 0

199

1 2 3 4 5 6 7 8 9 A B C D E F G H I J K L M N P Q R S T U V W X Y Z S 0 4 12 5 96 7 1 2 3 4 3 8 4 1 5 6 0 0 7 3 0 1 4 3 2 5 0 0 0 0 0 0 0 0 T 7 2 0 1 3 0 7 2 0 1 0 5 3 2 2 0 2 21 5 3 4 0 0 3 2 0 1 0 0 0 0 0 0 0 U 1 4 2 3 1 0 3 1 3 3 3 4 4 2 3 8 2 4 3 4 2 4 4 1 3 3 1 1 0 0 0 0 0 0 V 0 1 2 4 1 0 6 1 0 1 5 9 0 2 3 3 6 1 6 7 1 5 4 3 2 4 0 2 18 0 0 0 0 0 W 1 3 2 2 2 2 3 1 3 6 3 1 4 5 5 2 4 2 1 4 1 26 4 2 8 2 1 0 2 13 0 0 0 0 X 1 1 3 2 3 1 6 4 2 0 1 6 0 2 2 3 2 1 3 22 0 1 2 1 6 4 3 0 2 10 22 0 0 0 Y 0 0 2 5 0 2 15 0 2 1 2 1 1 2 5 4 2 0 10 8 3 1 5 2 3 3 5 3 7 25 11 10 0 0 Z 2 11 1 4 10 1 21 0 1 4 3 3 0 2 5 4 5 1 15 11 2 3 5 2 4 3 5 2 0 11 1 32 3 0

200

APPENDIX C: SAME/DIFFERENT RESPONSE TIMES

201

Table C1: SameAll (diagonal) and Different (lower triangle) response times from Study 3

1 2 3 4 5 6 7 8 9 A B C D E F G H I J K L M N P Q R S T U V W X Y Z 1 1.010 2 1.032 1.011 3 1.010 1.096 1.009 4 1.032 .989 .985 1.016 . 5 .952 .994 1.018 .997 1.005 6 .986 1.003 .995 1.021 .981 1.011 7 .998 1.001 .991 .995 .948 1.028 .993 8 1.005 1.023 1.061 .997 1.006 1.054 1.032 .985 9 1.084 1.061 .972 .990 1.011 1.019 1.001 .986 1.003 A .996 1.076 .990 1.093 .939 1.004 .945 1.007 .956 .970 B 1.059 .958 1.040 .954 .997 1.044 .913 1.155 1.064 1.026 .975 C .952 1.078 1.006 .990 1.058 1.025 1.028 .988 .989 1.004 1.005 .990 D .918 1.032 .944 1.012 .985 1.008 .961 1.002 1.034 .996 1.055 1.014 .982 E 1.003 1.008 1.093 .918 1.046 .976 .959 1.023 .980 .996 1.045 .960 .981 1.001 F 1.039 1.033 1.003 .958 1.022 .961 .961 .939 1.026 1.063 .999 1.026 1.020 1.090 1.007 G 1.017 .966 1.014 .973 .974 1.175 1.036 1.040 .997 1.010 1.039 1.108 1.049 .968 .999 .996 H .938 .971 .955 1.005 .979 .963 .949 .981 .958 1.028 1.033 1.000 .954 1.095 .944 1.036 .980 I 1.078 1.003 .977 .959 .939 1.011 .992 1.009 .950 .971 .956 1.003 .969 1.003 .954 1.039 .993 1.016 J .959 1.023 1.013 .944 .982 .952 1.028 .982 .942 .942 1.006 1.000 .987 1.011 .960 .948 .973 1.017 1.024 K 1.046 .915 .946 .959 1.024 .988 .950 .942 1.012 1.001 .989 .970 1.010 1.032 1.007 .980 1.066 1.047 1.008 1.010 L .986 .978 .933 .991 .975 .990 .935 .972 .976 1.009 .991 1.002 1.030 1.013 .976 .970 .996 1.080 1.045 1.047 .981 M .981 .983 .966 1.004 1.022 .974 .961 1.014 .938 1.022 1.029 1.024 .989 .991 1.011 .989 1.113 1.005 .964 1.079 1.007 .986 N 1.017 .980 .987 .972 1.030 .964 1.015 .959 .956 1.001 1.017 1.023 .963 1.001 .956 1.054 1.044 .956 .933 1.057 .953 1.140 1.016 P 1.009 .987 1.005 .982 .961 1.015 .984 1.017 1.021 1.027 1.056 1.037 1.101 .987 1.007 1.010 .965 .961 .991 .982 .993 .940 .959 .985 Q .935 .982 1.015 .931 .981 .980 .965 1.005 1.128 .950 1.024 1.016 .999 .976 .991 .994 .992 1.003 .980 .986 .943 .992 .948 .995 .998 R .999 .966 .975 1.010 .977 .977 .937 1.058 1.025 1.027 1.119 .988 1.033 .992 1.070 .949 1.000 .961 .981 1.012 .995 1.029 .986 1.099 .954 1.002

202

1 2 3 4 5 6 7 8 9 A B C D E F G H I J K L M N P Q R S T U V W X Y Z S .908 1.086 1.068 .996 1.102 1.036 .998 1.016 1.066 .993 .947 1.071 .952 1.018 .962 1.061 .989 .992 .978 1.021 .978 1.038 .977 1.002 1.015 1.056 1.000 T 1.067 1.017 1.008 1.025 1.019 1.036 1.080 .998 .976 1.006 1.017 .939 1.004 .971 1.006 .945 1.025 1.089 .997 1.011 1.036 1.018 .993 .991 .962 .950 1.065 .994 U .998 .960 .961 .987 .959 1.002 .998 .965 1.028 .983 1.021 1.012 1.066 .984 .954 .979 1.081 1.031 .983 .983 .964 1.031 1.053 1.040 .900 .988 .980 .946 1.002 V .989 .952 .984 1.042 .993 .977 1.045 .942 .946 .983 1.002 .942 1.010 1.016 .962 .969 .966 .964 .998 .937 1.007 1.048 1.003 .982 .958 .969 .976 1.005 1.051 1.009 W .906 .936 .983 1.014 1.021 .968 .982 .997 .984 .974 .983 1.003 .970 1.064 1.036 1.057 1.068 .985 1.022 .987 1.023 1.120 1.026 .985 .986 .998 .982 .994 1.023 1.045 1.008 X .999 1.015 1.025 .931 1.026 .997 .975 .971 .969 1.019 .947 .980 1.007 .956 .971 .996 1.041 .974 .963 1.081 1.046 .976 .981 1.003 .965 .982 1.054 1.007 .975 1.018 1.040 1.016 Y .960 1.014 .999 .999 .997 1.027 1.073 .967 .950 1.063 .946 .990 .958 .991 .966 .950 .985 1.005 .979 .970 1.014 1.027 1.012 1.013 1.006 1.027 1.031 1.069 1.027 1.129 1.059 1.067 1.001 Z 1.005 1.061 .997 .980 1.020 .966 1.102 .961 .949 1.018 .961 1.036 .965 1.026 .990 1.019 .996 .982 1.081 .988 .962 1.023 1.009 1.005 .914 1.016 1.027 1.028 .977 1.010 1.022 .963 .985 1.015

203

Table C2: SameFirst RT from Study 3 Pair SameFirst RT 1 1 1.022 2 2 1.048 3 3 1.056 4 4 1.01 5 5 1.042 6 6 1.062 7 7 1.013 8 8 1.036 9 9 1.079 A A 0.996 B B 0.992 C C 0.971 D D 1.002 E E 1.032 F F 1.031 G G 0.971 H H 1.001 I I 1.012 J J 1.025 K K 1.02 L L 0.965 M M 1.015 N N 0.999 P P 1.009 Q Q 1.029 R R 1.012 S S 1.016 T T 1.006 U U 1.042 V V 1.031 W W 1.045 X X 1.033 Y Y 1.064 Z Z 1.014

204

BIBLIOGRAPHY

Appelman, I. B., & Mayzner, M. S. (1981). The letter-frequency effect and the generality of familiarity effects on perception. Perception & Psychophysics, 30(5), 436–46.

Appelman, I. B., & Mayzner, M. S. (1982). Application of geometric models to letter recognition: distance and density. Journal of Experimental Psychology. General, 111(1), 60–100.

Arguin, M., & Bub, D. (1995). Priming and response selection processes in letter classification and identification tasks. Journal of Experimental Psychology: Human Perception and Performance, 21(5), 1199–1219.

Beeli, G., Esslen, M., & Jäncke, L. (2007). Frequency correlates in -color synaesthesia. Psychological Science, 18(9), 788–92.

Behrmann, M., Plaut, D., & Nelson, J. (1998). A literature review and new data supporting an interactive account of letter-by-letter reading. Cognitive Neuropsychology, 15(1), 7–51.

Benford, F. (1938). The law of anomalous numbers. Proceedings of the American Philosophical Society, 78, 551–572.

Besner, D., Coltheart, M., & Davelaar, E. (1984). Basic processes in reading: computation of abstract letter identities. Canadian Journal of Psychology, 38(1), 126–34.

Boles, D. B., & Clifford, J. E. (1989). An upper- and lowercase alphabetic similarity matrix, with derived generation similarity values. Behavior Research Methods, Instruments, & Computers, 21(6), 579–586.

Briggs, R., & Hocevar, D. (1975). A new distinctive feature theory for upper case letters. The Journal of General Psychology, 93, 87–93.

Brunsdon, R., Coltheart, M., & Nickels, L. (2006). Severe developmental letter- processing impairment: a treatment case study. Cognitive Neuropsychology, 23(6), 795–821.

Cantlon, J. F., Pinel, P., Dehaene, S., & Pelphrey, K. a. (2011). Cortical representations of symbols, objects, and faces are pruned back during early childhood. Cerebral Cortex, 21(1), 191–9.

Caramazza, A., & Hillis, A. E. (1990a). Levels of representation, co-ordinate frames, and unilateral neglect. Cognitive Neuropsychology, 7(5/6), 37–41.

205

Caramazza, A., & Hillis, A. E. (1990b). Spatial representation of words in the brain implied by studies of a unilateral neglect patient. Nature, 346, 267–269.

Carreiras, M., Quiñones, I., Hernández-Cabrera, J. A., & Duñabeitia, J. A. (2014). Orthographic coding: brain activation for letters, symbols, and digits. Cerebral CortexCerebral Cortex, 1–13.

Chanceaux, M., & Grainger, J. (2012). Serial position effects in the identification of letters, digits, symbols, and shapes in peripheral vision. Acta Psychologica, 141(2), 149–58.

Chang, Y.-N., Furber, S., & Welbourne, S. (2012). Modelling normal and impaired letter recognition: implications for understanding pure alexic reading. Neuropsychologia, 50(12), 2773–88.

Changizi, M. A., & Shimojo, S. (2005). Character complexity and redundancy in writing systems over human history. Proceedings. Biological Sciences / The Royal Society, 272(1560), 267–75.

Chanoine, V., Teixeira Ferreira, C., Demonet, J.-F., Nespoulous, J., & Poncet, M. (1998). Optic aphasia with pure alexia: a mild form of visual associative agnosia? a case study. Cortex, 34, 437–448.

Chauncey, K., Holcomb, P. J., & Grainger, J. (2008). Effects of stimulus font and size on masked repetition priming: an event-related potentials (erp) investigation. Language and Cognitive Processes, 23(1), 183–200.

Cipolotti, L., Butterworth, B., & Denes, G. (1991). A specific deficit for numbers in a case of dense acalculia. Brain, 114, 2619–37.

Cohen, L., & Dehaene, S. (1991). Neglect dyslexia for numbers? a case report. Cognitive Neuropsychology, 8(1), 39–58.

Cohen, L., & Dehaene, S. (1995). Number processing in pure alexia: the effect of hemispheric asymmetries and task demands. Neurocase, 1(2), 121–137.

Coltheart, M., Davelaar, E., Jonasson, J. T., & Besner, D. (1977). Access to the internal lexicon. In S. Dornic (Ed.), Attention and Performance VI (pp. 535–555). Hillsdale, : Lawrence Erlbaum Associates.

Coltheart, M., Rastle, K., Perry, C., Langdon, R., & Ziegler, J. C. (2001). Drc: a dual route cascaded model of visual word recognition and reading aloud. Psychological Review, 108, 204–256.

Courrieu, P., Farioli, F., & Grainger, J. (2004). Inverse discrimination time as a perceptual distance for alphabetic characters. Visual Cognition, 11(7), 901–919.

206

Dalmás, J. F., & Dansilio, S. (2000). Visuographemic alexia: a new form of a peripheral acquired dyslexia. Brain and Language, 75(1), 1–16.

Davis, C. J. (2010). The spatial coding model of visual word identification. Psychological Review, 117(3), 713–58.

Davis, C. J., & Bowers, J. S. (2004). What do letter migration errors reveal about letter position coding in visual word recognition? Journal of Experimental Psychology. Human Perception and Performance, 30(5), 923–41.

Davis, C. J., & Bowers, J. S. (2006). Contrasting five different theories of letter position coding: evidence from orthographic similarity effects. Journal of Experimental Psychology. Human Perception and Performance, 32(3), 535–57.

Dehaene, S. (2009). Reading in the brain: The science and evolution of a human invention. Chicago: Viking.

Dehaene, S., Cohen, L., Sigman, M., & Vinckier, F. (2005). The neural code for written words: a proposal. Trends in Cognitive Sciences, 9(7), 335–41.

Duñabeitia, J. A., Dimitropoulou, M., Grainger, J., Hernández, J. A., & Carreiras, M. (2012). Differential sensitivity of letters, numbers, and symbols to character transpositions. Journal of Cognitive Neuroscience, 24(7), 1610–24.

Finkbeiner, M., & Coltheart, M. (2009). Letter recognition: from perception to representation. Cognitive Neuropsychology, 26(1), 1–6.

Fischer-Baum, S., Charny, J., & McCloskey, M. (2011). Both-edges representation of letter position in reading. Psychonomic Bulletin & Review, 18(6), 1083–9.

Fischer-Baum, S., McCloskey, M., & Rapp, B. C. (2010). Representation of letter position in spelling: evidence from acquired dysgraphia. Cognition, 115(3), 466–90.

Fischer-Baum, S., & Rapp, B. C. (2012). Underlying cause(s) of letter perseveration errors. Neuropsychologia, 50(2), 305–18.

Fiset, D., Blais, C., Arguin, M., Tadros, K., Ethier-Majcher, C., Bub, D., & Gosselin, F. (2009). The spatio-temporal dynamics of visual letter recognition. Cognitive Neuropsychology, 26(1), 23–35.

Fiset, D., Blais, C., Ethier-Majcher, C., Arguin, M., Bub, D., & Gosselin, F. (2008). Features for identification of uppercase and lowercase letters. Psychological Science, 19(11), 1161–8.

Fisher, D. F., & Monty, R. A. (1969). Visual confusion matrices: fact or artifact? The Journal of Psychology, 71, 111–125.

207

Forster, K. I., Davis, C. J., Schoknecht, C., & Carter, R. (1987). Masked priming with graphemically related forms: repetition or partial activation? The Quarterly Journal of Experimental Psychology Section A, 39(2), 211–251.

Friedmann, N., Kerbel, N., & Shvimer, L. (2010). Developmental attentional dyslexia. Cortex; a Journal Devoted to the Study of the Nervous System and Behavior, 46(10), 1216–37.

Friedmann, N., & Rahamim, E. (2007). Developmental letter position dyslexia. Journal of Neuropsychology, 1(2), 201–236.

Frost, R. (2012). Towards a universal model of reading. The Behavioral and Brain Sciences, 1–17.

García-Orza, J., Perea, M., & Muñoz, S. (2010). Are transposition effects specific to letters? Quarterly Journal of Experimental Psychology (2006), 63(8), 1603–18.

Gauthier, I., Tarr, M. J., Moylan, J., Skudlarski, P., Gore, J. C., & Anderson, a W. (2000). The fusiform ―face area‖ is part of a network that processes faces at the individual level. Journal of Cognitive Neuroscience, 12(3), 495–504.

Gauthier, I., Wong, A. C.-N., Hayward, W. G., & Cheung, O. S. (2006). Font tuning associated with expertise in letter perception. Perception, 35(4), 541–559.

Gervais, M. J., Harvey, L. O., & Roberts, J. O. (1984). Identification confusions among letters of the alphabet. Journal of Experimental Psychology. Human Perception and Performance, 10(5), 655–66.

Geyer, L. H., & DeWald, C. G. (1973). Feature lists and confusion matrices. Perception & Psychophysics, 14(3), 471–482.

Gilmore, G. C., Hersh, H., Caramazza, a, & Griffin, J. (1979). Multidimensional letter similarity derived from recognition errors. Perception & Psychophysics, 25(5), 425– 31.

Gomez, P., Ratcliff, R., & Perea, M. (2008). The overlap model: a model of letter position coding. Psychological Review, 115(3), 577–600.

Grainger, J. (2008). Cracking the orthographic code: an introduction. Language and Cognitive Processes, 23(1), 1–35.

Grainger, J., Granier, J.-P., Farioli, F., Van Assche, E., & van Heuven, W. J. B. (2006). Letter position information and printed word perception: the relative-position priming constraint. Journal of Experimental Psychology. Human Perception and Performance, 32(4), 865–84.

208

Grainger, J., & Hannagan, T. (2014). What is special about orthographic processing? Written Language & Literacy, 17(2), 225–252.

Grainger, J., & Holcomb, P. J. (2009). Watching the word go by: on the time-course of component processes in visual word recognition. Language and Linguistics Compass, 3(1), 128–156.

Grainger, J., Kiyonaga, K., & Holcomb, P. J. (2006). The time course of orthographic and phonological code activation. Psychological Science, 17(12), 1021–6.

Grainger, J., & Van Heuven, walter j B. (2003a). Modeling letter position coding in printed word perception. The Mental Lexicon, 1–23.

Grainger, J., & Van Heuven, W. J. B. (2003b). Modeling letter position coding in printed word perception. The Mental Lexicon, 1–23.

Greenblatt, S. H. (1973). Alexia without agraphia or hemianopsia. Brain, 96, 307–316.

Grossi, D., Fragassi, N. a, Orsini, a, De Falco, F. a, & Sepe, O. (1984). Residual reading capability in a patient with alexia without agraphia. Brain and Language, 23(2), 337–48.

Guerrera, C., & Forster, K. (2008). Masked form priming with extreme transposition. Language and Cognitive Processes, 23(1), 117–142.

Hammond, E. J., & Green, D. W. (1982). Detecting targets in letter and non-letter arrays. Canadian Journal of Psychology, 36(1), 67–82.

Harm, M. W., Seidenberg, M. S., Allen, J., Devlin, J., Gonnerman, L., Keating, P., Macdonald, M., & Plaut, D. (1999). Phonology, reading acquisition, and dyslexia: insights from connectionist models, 106(3), 491–528.

Hillis, A. E., & Caramazza, A. (1995). Spatially specific deficits in processing graphemic representations in reading and writing. Brain and Language, 48, 263–308.

Hillis, A. E., Rapp, B. C., Benzing, L., & Caramazza, A. (1998). Dissociable coordinate frames of unilateral spatial neglect: ―‗viewer-centered‘‖ neglect. Brain and Cognition, 37, 491–526.

Hofstadter, D., & McGraw, G. (1995). Letter spirit: esthetic perception and creative play in the rich microcosm of the roman alphabet. In Fluid Concepts and Creative Analogies (pp. 407–466). New York, NY: Basic Books.

Holcomb, P. J., & Grainger, J. (2006). On the time course of visual word recognition: an event-related potential investigation using masked repetition priming. Journal of Cognitive Neuroscience, 18(10), 1631–43.

209

Holender, D., & Peereman, R. (1987). Differential processing of phonographic and logographic single-digit numbers by the two hemispheres. In G. Deloche & X. Seron (Eds.), Mathematical Disabilities: A Cognitive Neuropsychological Perspective (pp. 43–86). Hillsdale, NJ: Lawrence Erlbaum Associates.

Humphreys, G. W., Evett, L. J., & Quinlan, P. T. (1990). Orthographic processing in visual word identification. Cognitive Psychology, 22(4), 517–60.

Ingles, J. L. J. L., & Eskes, G. A. (2008). A comparison of letter and digit processing in letter-by-letter reading. Journal of the International Neuropsychological Society, 14(01), 164–173.

Jacobs, A. M., Ziegler, J. C., Rey, A., & Grainger, J. (1998). Mrom-p: an interactive activation, multiple readout model of orthographic and phonological processes in visual word recognition. In J. Grainger & A. M. Jacobs (Eds.), Localist Connectionist Approaches to Human Cognition (pp. 147–188). Hoboken.

James, K. H., James, T. W., Jobard, G., Wong, A. C.-N., & Gauthier, I. (2005). Letter processing in the visual system: different activation patterns for single letters and strings. Cognitive, Affective & Behavioral Neuroscience, 5(4), 452–66.

Jones, M. N., & Mewhort, D. J. K. (2004). Case-sensitive letter and bigram frequency counts from large-scale english corpora. Behavior Research Methods, Instruments, & Computers : A Journal of the Psychonomic Society, Inc, 36(3), 388–96.

Katz, R. B., & Sevush, S. (1989). Positional dyslexia. Brain and Language, 37, 266–289.

Kinoshita, S., & Lagoutaris, S. (2010). Priming by numb3r5 does not involve top-down feedback. Journal of Experimental Psychology. Learning, Memory, and Cognition, 36(6), 1422–40.

Kinoshita, S., & Norris, D. (2009). Transposed-letter priming of prelexical orthographic representations. Journal of Experimental Psychology. Learning, Memory, and Cognition, 35(1), 1–18.

Kinoshita, S., Robidoux, S., Mills, L., & Norris, D. (2013). Visual similarity effects on masked priming. Memory & Cognition.

Kohnen, S., Nickels, L., Castles, A., Friedmann, N., & McArthur, G. (2012). When ―slime‖ becomes ―smile‖: developmental letter position dyslexia in english. Neuropsychologia, 50(14), 3681–92.

Krumhansl, C. L., & Thomas, E. A. C. (1977). Effect of level of confusability on reporting letters from briefly presented visual displays. Perception & Psychophysics, 21(3), 269–279.

210

Lake, B. M., Salakhutdinov, R., & Tenenbaum, J. B. (2013). One-shot learning by inverting a compositional causal process, 1–9.

Larsen, J., Baynes, K., & Swick, D. (2004). Right hemisphere reading mechanisms in a global alexic patient. Neuropsychologia, 42(11), 1459–76.

Levenshtein, V. (1966). Binary codes capable of correcting deletions, insertions and reversals. Soviet Physics Doklady, 10, 707.

Loomis, J. M. (1982). Analysis of tactile and visual confusion matrices. Perception & Psychophysics, 31(1), 41–52.

Lupker, S. J., Perea, M., & Davis, C. J. (2008). Transposed-letter effects: consonants, vowels and letter frequency. Language and Cognitive Processes, 23(1), 93–116.

Lupker, S. J., Zhang, S., Perry, J. R., & Davis, C. J. (2014). Superset versus substitution- letter priming: an evaluation of open-bigram models. Journal of Experimental Psychology: Human Perception and Performance.

Madec, S., Rey, A., Dufau, S., Klein, M., & Grainger, J. (2011). The time course of visual letter perception. Journal of Cognitive Neuroscience, 24(7), 1645–55.

Margolin, D. I., & Goodman-Schulman, R. A. (1984). Oral and written spelling impairments (pp. 263–297).

Mason, M. (1982). Recognition time for letters and nonletters: effects of serial position, array size, and processing order. Journal of Experimental Psychology. Human Perception and Performance, 8(5), 724–38.

Mayzner, M. S., & Tresselt, M. E. (1965). Tables of single-letter and digram frequency counts for various word-length and letter-position combinations. Pyschonomic Monograph Supplements, 3(2), 13–32.

McClelland, J. L., & Rumelhart, D. (1981). An interactive activation model of context effects in letter perception: i. an account of basic findings. Psychological Review, 88, 375–407.

McCloskey, M., Fischer-Baum, S., & Schubert, T. (2013). Representation of letter position in single-word reading: evidence from acquired dyslexia. Cognitive Neuropsychology, 30(6), 396–428.

McCloskey, M., & Schubert, T. (2014). Shared versus separate processes for letter and digit identification. Cognitive Neuropsychology, 0(0), 1–24.

Miozzo, M., & Caramazza, A. (1998). Varieties of pure alexia: the case of failure to access graphemic representations. Cognitive Neuropsychology, 15(1), 203–238.

211

Moyer, R., & Landauer, T. (1967). Time required for judgements of numerical inequality. Nature, 215, 1519–1520.

Mueller, S. T., & Weidemann, C. T. (2012). Alphabetic letter identification: effects of perceivability, similarity, and bias. Acta Psychologica, 139, 19–37.

Mycroft, R. H., Behrmann, M., & Kay, J. (2009). Visuoperceptual deficits in letter-by- letter reading? Neuropsychologia, 47(7), 1733–44.

New, B., & Grainger, J. (2011). On letter frequency effects. Acta Psychologica, 138(2), 322–328.

Norris, D., & Kinoshita, S. (2008). Perception as evidence accumulation and bayesian inference: insights from masked priming. Journal of Experimental Psychology. General, 137(3), 434–55.

Norris, D., & Kinoshita, S. (2012). Reading through a noisy channel: why there‘s nothing special about the perception of . Psychological Review, 119(3), 517– 545.

Norris, D., Kinoshita, S., & van Casteren, M. (2010). A stimulus sampling theory of letter identity and order. Journal of Memory and Language, 62(3), 254–271.

Park, J., Chiang, C., Brannon, E. M., & Woldorff, M. G. (2014). Experience-dependent hemispheric specialization of letters and numbers is revealed in early visual processing. Journal of Cognitive Neuroscience, 1–11.

Park, J., Hebrank, A., Polk, T. A., & Park, D. C. (2011). Neural dissociation of number from letter recognition and its relationship to parietal numerical processing. Journal of Cognitive Neuroscience, 24(1), 39–50.

Park, J., Park, D. C., & Polk, T. a. (2012). Investigating unique environmental contributions to the neural representation of written words: a monozygotic twin study. PloS One, 7(2), e31512.

Patterson, K., & Wilson, B. (1990). A rose is a rose or a nose : a deficit in initial letter identification. Cognitive Neuropsychology, 7(5/6), 447–477.

Pelli, D. G., Burns, C. W., Farell, B., & Moore-, D. C. (2006). Feature detection and letter identification. Vision Research, 46(28), 4646–74.

Perea, M., & Carreiras, M. (2006). Do transposed-letter similarity effects occur at a prelexical phonological level? Quarterly Journal of Experimental Psychology (2006), 59(9), 1600–13.

212

Perea, M., & Carreiras, M. (2008). Do orthotactics and phonology constrain the transposed-letter effect? Language and Cognitive Processes, 23(1), 69–92.

Perea, M., Dun, J. A., & Carreiras, M. (2008). R34d1ng w0rd5 w1th numb3r5. Journal of Experimental Psychology: Human Perception and Performance, 34(1), 237–241.

Perea, M., Duñabeitia, J. A., Pollatsek, A., & Carreiras, M. (2009). Does the brain regularize digits and letters to the same extent? Quarterly Journal of Experimental Psychology (2006), 62(10), 1881–8.

Perea, M., & Lupker, S. (2003). Transposed-letter confusability effects in masked form priming. In Masked Priming: State of the (pp. 97–120).

Perea, M., & Lupker, S. J. (2004). Can caniso activate casino? transposed-letter similarity effects with nonadjacent letter positions. Journal of Memory and Language, 51(2), 231–246.

Peressotti, F., Cubelli, R., & Job, R. (2003). On recognizing proper names: the orthographic cue hypothesis. Cognitive Psychology, 47(1), 87–116.

Peressotti, F., & Grainger, J. (1995). Letter-position coding in random consonant arrays. Perception & Psychophysics, 57(6), 875–890.

Peressotti, F., & Grainger, J. (1999). The role of letter identity and letter position in orthographic priming, 61(4), 691–706.

Perri, R., Bartolomeo, P., & Silveri, M. C. (1996). Letter dyslexia in a letter-by-letter reader. Brain and Language, 53(3), 390–407.

Perry, C., Ziegler, J. C., & Zorzi, M. (2010). Beyond single syllables: large-scale modeling of reading aloud with the connectionist dual process (cdp++) model. Cognitive Psychology, 61(2), 106–51.

Petit, J.-P., & Grainger, J. (2002). Masked partial priming of letter perception. Visual Cognition, 9(3), 337–353.

Petit, J.-P., Midgley, K. J., Holcomb, P. J., & Grainger, J. (2006). On the time course of letter perception: a masked priming erp investigation. Psychonomic Bulletin & Review, 13(4), 674–81.

Phillips, J. R., Johnson, K. O., & Browne, H. M. (1983). A comparison of visual and two modes of tactual letter resolution. Perception & Psychophysics, 34(3), 243–9.

Pitchford, N. J., Ledgeway, T., & Masterson, J. (2008). Effect of orthographic processes on letter position encoding. Journal of Research in Reading, 31(1), 97–116.

213

Plaut, D. C. (1999). A connectionist approach to word reading and acquired dyslexia: extension to sequential processing. Cognitive Science, 23(4), 543–568.

Plaut, D. C., McClelland, J. L., Seidenberg, M. S., & Patterson, K. (1996). Understanding normal and impaired word reading: computational principles in quasi-regular domains. Psychological Review, 103(1), 56–115.

Podgorny, P., & Garner, W. R. (1979). Reaction time as a measure of inter- and intraobj ect visual similarity: letters of the alphabet. Perception & Psychophysics, 26(1), 37– 52.

Polk, T. A., & Farah, M. J. (1994). Late experience alters vision. Nature, 376, 648–9.

Polk, T. A., & Farah, M. J. (1995). Brain localization for arbitrary stimulus categories: a simple account based on hebbian learning. Proceedings of the National Academy of Sciences of the United States of America, 92(26), 12370–3.

Polk, T. A., & Farah, M. J. (1998). The neural development and organization of letter recognition: evidence from functional neuroimaging, computational modeling, and behavioral studies. Proceedings of the National Academy of Sciences of the United States of America, 95, 847–52.

Polk, T. A., Stallcup, M., Aguirre, G. K., Alsop, D. C., Esposito, M. D., Detre, J. a, Farah, M. J., & D‘Esposito, M. (2002). Neural specialization for letter recognition. Journal of Cognitive Neuroscience, 14(2), 145–159.

Price, G. R., & Ansari, D. (2011). Symbol processing in the left angular gyrus: evidence from passive perception of digits. NeuroImage, 57(3), 1205–11.

Rapp, B. C., & Caramazza, A. (1989). Letter processing in reading and spelling: some dissociations. Reading and Writing: An Interdisciplinary Journal, 1, 3–23.

Rapp, B. C., & Caramazza, A. (1991). Spatially determined deficits in letter and word processing. Cognitive Neuropsychology, 8(3-4), 275–311.

Rayner, K., Inhoff, a W., Morrison, R. E., Slowiaczek, M. L., & Bertera, J. H. (1981). Masking of foveal and parafoveal vision during eye fixations in reading. Journal of Experimental Psychology. Human Perception and Performance, 7(1), 167–79.

Reinke, K., Fernandes, M., Schwindt, G., O‘Craven, K., & Grady, C. L. (2008). Functional specificity of the visual word form area: general activation for words and symbols but specific network activation for words. Brain and Language, 104, 180– 189.

Roethlein, B. E. (1912). The relative legibility of different faces of printing types. The American Journal of Psychology, 23(1), 1–36.

214

Rothlein, D., & Rapp, B. (2014). The similarity structure of distributed neural responses reveals the multiple representations of letters. NeuroImage, 89, 331–44.

Sanocki, T. (1987). Visual knowledge underlying letter perception: font-specific, schematic tuning. Journal of Experimental Psychology: Human Perception and Performance, 13(2), 267–278.

Sanocki, T. (1988). Font regularity constraints on the process of letter recognition. Journal of Experimental Psychology. Human Perception and Performance, 14(3), 472–80.

Sanocki, T., & Dyson, M. C. (2012). Letter processing and font information during reading: beyond distinctiveness, where vision meets design. Attention, Perception & Psychophysics, 74(1), 132–45.

Schoonbaert, S., & Grainger, J. (2004). Letter position coding in printed word perception: effects of repeated and transposed letters. Language and Cognitive Processes, 19(3), 333–367.

Schubert, T., & McCloskey, M. (2013). Prelexical representations and processes in reading: evidence from acquired dyslexia. Cognitive Neuropsychology, 30(6), 360– 95.

Seidenberg, M. S., & McClelland, J. L. (1989). A distributed, developmental model of visual word recognition and naming. Psychological Review, 96(4), 523–568.

Shum, J., Hermes, D., Foster, B. L., Dastjerdi, M., Rangarajan, V., Winawer, J., Miller, K. J., & Parvizi, J. (2013). A brain area for visual numerals. Journal of Neuroscience, 33(16), 6709–6715.

Simpson, I. C., Mousikou, P., Montoya, J. M., & Defior, S. (2012). A letter visual- similarity matrix for latin-based alphabets. Behavior Research Methods.

Smilek, D., Carriere, J. S. a, Dixon, M. J., & Merikle, P. M. (2007). Grapheme frequency and color luminance in grapheme-color synaesthesia. Psychological Science, 18(9), 793–5.

Solso, R. L., & King, J. F. (1976). Frequency and versatility of letters in the . Behavior Research Methods & Instrumentation, 8(3), 283–286.

Starrfelt, R., & Behrmann, M. (2011). Number reading in pure alexia-a review. Neuropsychologia, 49, 2283–2298.

Stinchcombe, E. J., Lupker, S. J., & Davis, C. J. (2012). Transposed-letter priming effects with masked subset primes : a re-examination of the ―relative position priming constraint.‖ Language, 27, 475–499.

215

Subbiah, I., & Caramazza, A. (2000). Stimulus-centered neglect in reading and object recognition. Neurocase, 6, 13–31.

Tainturier, M.-J., & Rapp, B. C. (2003). Is a single graphemic buffer used in reading and spelling? Aphasiology, 17(6-7), 537–562.

Townsend, J. T. (1971). Theoretical analysis of an alphabetic confusion matrix. Perception & Psychophysics, 9(1A), 40–50.

Tydgat, I., & Grainger, J. (2009). Serial position effects in the identification of letters , digits , and symbols. Journal of Experimental Psychology: Human Perception and Performance, 35(2), 480–498.

Van Assche, E., & Grainger, J. (2006). A study of relative-position priming with superset primes. Journal of Experimental Psychology. Learning, Memory, and Cognition, 32(2), 399–415.

Van der Heijden, A. H. C., Malhas, M. S. M., & van der Roovaart, B. P. (1984). An empirical interletter confusion matrix for continuous-line capitals. Perception & Psychophysics, 35(I), 85–88.

Vinckier, F., Dehaene, S., Jobert, A., Dubus, J. P., Sigman, M., & Cohen, L. (2007). Hierarchical coding of letter strings in the ventral stream: dissecting the inner organization of the visual word-form system. Neuron, 55(1), 143–56.

Vogel, E. K., Woodman, G. F., & Luck, S. J. (2001). Storage of features, conjunctions, and objects in visual working memory. Journal of Experimental Psychology: Human Perception and Performance2, 27(1), 92–114.

Volpato, C., Bencini, G., Meneghello, F., Piron, L., & Semenza, C. (2012). Covert reading of letters in a case of global alexia. Brain and Language, 120(3), 217–25.

Walker, P. (2008). Font tuning: a review and new experimental evidence. Visual Cognition, 16(8), 1022–1058.

Weinzierl, C., Kerkhoff, G., van Eimeren, L., Keller, I., & Stenneken, P. (2012). Error types and error positions in neglect dyslexia: comparative analyses in neglect patients and healthy controls. Neuropsychologia, 50(12), 2764–72.

Welvaert, M., Farioli, F., & Grainger, J. (2008). Graded effects of number of inserted letters in superset priming. Experimental Psychology (formerly “Zeitschrift Für Experimentelle Psychologie”), 55(1), 54–63.

Whitney, C. (2001a). How the brain encodes the order of letters in a printed word: the seriol model and selective literature review. Psychonomic Bulletin & Review, 8(2), 221–43.

216

Whitney, C. (2001b). Position-specific effects within the seriol framework of letter- position coding. Connection Science, 13(3), 235–255.

Whitney, C., & Berndt, R. S. (1999). A new model of letter string encoding: simulating right neglect dyslexia. Progress in Brain Research, 121, 143–163.

Whitney, C., Bertrand, D., & Grainger, J. (2012). On coding the position of letters in words: a test of two models. Experimental Psychology, 59(2), 109–14.

Whitney, C., & Marton, Y. (2013). The SERIOL2 Model of Orthographic Processing (pp. 1–76).

Yarkoni, T., Balota, D., & Yap, M. J. (2008). Moving beyond coltheart‘s n: a new measure of orthographic similarity. Psychonomic Bulletin & Review, 15(5), 971–9.

Ziegler, J. C., Pech-Georgel, C., Dufau, S., & Grainger, J. (2010). Rapid processing of letters, digits and symbols: what purely visual-attentional deficit in developmental dyslexia? Developmental Science, 13(4), F8–F14.

Zorzi, M., Houghton, G., & Butterworth, B. (1998). Two routes or one in reading aloud? a connectionist dual-process model. Journal of Experimental Psychology: Human Perception and Performance, 24(4), 1131–1161.

217

VITA

Teresa Marie Schubert was born on October 9th, 1987 in Durham, North

Carolina. In May 2009 she received a B.A. with honors in Linguistics at the University of

North Carolina at Chapel Hill, with minors in Cognitive Science and Spanish. She joined the Cognitive Science Department at Johns Hopkins University in September 2009 under the mentorship of Drs. Brenda Rapp and Michael McCloskey. In January 2015 she will join the ARC Centre of Excellence in Cognition and its Disorders in the Cognitive

Science Department of Macquarie University in Sydney, Australia as a Postdoctoral

Research Fellow.

218