The automatic form assumption: Evidence for the parasitic model of vocabulary development

CHRISTOPHER J. HALL

Abstract

The Parasitic Hypothesis, formulated to account for early stages of vocabu- lary development in second learners, claims that on initial exposure to a word, learners automatically exploit existing lexical material in the L1 or L2 in order to establish an initial memory representation. At the level of phonological and orthographic form, it is claimed that significant overlaps with existing forms, i.e. , are automatically detected and new forms are subordinately connected to them in the mental lexicon. In the study re- ported here, English nonwords overlapping with real words in Spanish (pseu- docognates), together with noncognate nonwords, were presented to Spanish- speaking learners of English in a word familiarity task. Participants reported significantly higher levels of familiarity with the pseudocognates and showed greater consistency in providing for them. These results, together with measures of the degree of overlap between nonword stimuli and transla- tions, were interpreted as evidence for the automatic use of cognates in early word learning.

1. Introduction1 1.1. The role of cognates in vocabulary development The facilitating role of cognates in the L2 vocabulary learning process has long been recognized (cf. Sweet 1972 [1899]). Cognates are words in two or more which share phonological and/or orthographic form, and normally (but not necessarily) are also related semantically. Ringbom (1987: 41) makes the commonsense observation that “[w]hen both phonological and semantic similarity work together, the effect is like that of a magnet attracting a new word to be stored in the learner’s mental lexicon when he meets it for the Þrst time”. In this way, cognates have been recognized as signiÞcant sources of positive transfer (Ringbom 1987; Odlin 1989; Nation 1990). Not surprisingly, the same authors have pointed out that phonological similarity without (sufÞ-

IRAL 40 (2002), 69–87 0019042X/2002/040-069 c Walter de Gruyter 70 Christopher J. Hall cient) semantic overlap, in the familiar cases of false cognates (faux amis), will lead to negative transfer (or interference). Very little experimental work has been conducted by applied linguists on the effects of cognates in vocabulary learning (although cf. Ard and Hom- burg 1983). There have been relatively more data from naturalistic studies (e.g., Ringbom 1987; Holmes and Ramos 1993; also studies discussed in Hatch and Brown 1995 and Singleton 1999), as well as from word association tasks (e.g., Meara 1984), all demonstrating the signiÞcant role of formal similarity in the development and organization of the L2 mental lexicon. General research on orthographic and phonological aspects of foreign language vocabulary de- velopment and processing remains, however, sparse, while work on top-down strategies has dominated the literature (Koda 1997). In research by experimental psycholinguists on the mental lexicons of bilin- guals, on the other hand, an impressive amount of data has been gathered (cf. Chen and Leung 1989; Jin 1990; Sánchez Casas et al. 1992; de Groot and Nas 1991; de Groot 1992, 1993; Kroll and Stewart 1994). These data suggest that phonological and semantic cognates are more closely associated than noncognate equivalents, but that purely phonological cognates (false cognates) appear to behave like noncognates on a number of psycholin- guistic tasks, such as cued translation, word and picture naming, and priming using translation, repetition and semantic associates. More recent research on the effects of phonological and conceptual aspects of words in bilingual processing has found that purely formal similarity be- tween word competitors can inßuence performance on lexical decision and translation recognition tasks (e.g., Dijkstra et al. 1998; Talamas et al. 1999). Talamas et al. (1999), following Kroll and Stewart’s (1994) asymmetric model of lexical representation and processing in bilinguals, conÞrm that less ßuent bilinguals are more affected by formal similarity than more balanced bilin- guals: “For less ßuent individuals, who are likely to have greater uncertainty about their L2 knowledge than more ßuent individuals, any signiÞcant activa- tion of shared [formal] features may present sufÞcient evidence to respond pos- itively that the pair of words are translation equivalents, regardless of whether or not that is so” (1999: 56). Dijkstra et al. (1998) show that fully ßuent bilinguals also demonstrate false cognate interference effects (from interlin- gual homographs such as room, Eng. ‘room’, Dutch ‘cream’) when performing the lexical decision task in bilingual mode (i.e., when language input is mixed between L1 and L2). Dijkstra and his colleagues have also shown that the neighbourhood density of a word, i.e. the number of lexical neighbours difffering minimally from it in orthography in either L1 or L2, will affect recognition and translation latencies (Grainger and Dijkstra 1992; Van Heuven et al. 1998). Traitors (L1 words with more neighbours in L2 than in L1) are slower to recognize, and provoke The automatic cognate form assumption 71 more errors, than patriots (L1 words with more neighbours in L1 than L2). Van Heuven et al. (1998: 474) conclude that “[s]timulus items automatically acti- vate orthographically similar words in both the target language and the other language of the bilingual participant”. In addition, studies on lexical production and comprehension errors in a for- eign language consistently reveal patterns of formal organization in and be- tween the native and foreign language lexicons. Ecke’s work on tip-of-the- tongue recall stages in second and third language learners (Ecke 1996, 1997; Ecke and Garrett 1998) shows clearly that interlexical inßuence at the level of phonological and orthographic form plays a crucial role in learners’ ex- tended word searches. In a study of cognate reliance in reading comprehen- sion by Brazilian learners of English, Holmes and Ramos (1993) report lexical misidentiÞcation on the basis of formal similarity with other words in the L1 and L2 (e.g., L2 poll interpreted as L1 polo ‘city, central point’ and L2 swing taken as L2 swim). Laufer (1989, 1997) discusses the issue in terms of the phenomenon of de- ceptive transparency,inwhich readers misidentify a word on the basis of its formal similarity with existing words in the L1 and L2, listing false cognates (such as Eng. tramp taken as Hebrew tremp ‘lift’) and identifying a class of error which she calls synforms, which, according to her studies, are the largest category of deceptively transparent words. Synforms (malapropisms in the monolingual literature) are lexical mis-hits selected due to formal resemblance with other L2 forms (such as price for prize and cute for acute). Laufer argues that synforms are identiÞed because of insecure knowledge of the target form or of both target and error forms. In a study of French EFL learners’ errors, Granger (cited in James 1998: 149) found that over 34 % of lexical errors were due to the use of false cognates, i.e., L2 words with partial form and meaning overlap with L1 translation equivalents. The evidence summarized here, much of it collected in studies which tap au- tomatic, non-attentional processes, strongly suggests that similar form features in the L1 and L2 are automatically detected and exploited in the establishment of memory traces for new L2 words. The following section sketches a model of vocabulary development that takes such similarity detection and exploitation as the principal motor which drives early word learning.

1.2. The Parasitic Strategy of vocabulary development On the basis of a series of studies on foreign language errors in L2 and L3, Hall (1992, 1996, 1997; Hall and Schultz 1994) and Ecke and Hall (1998, 2000) have argued that vocabulary development may usefully be viewed as a problem of pattern-matching and assimilation with current lexical knowledge, at least at the onset of the word learning process. This psycholinguistic approach has 72 Christopher J. Hall motivated the postulation of a Parasitic Strategy of vocabulary development: a series of automatic, unconscious cognitive stages that an emerging lexical entry is hypothesized to undergo after the learner Þrst encounters an unknown word. According to the Parasitic Strategy, the key to learning the word is Þrst to establish a form representation, i.e., construct a memory trace of the pronun- ciation and/or spelling, and then to make the right connections with existing lexical and conceptual knowledge. The strategy claims that after registering the form, learners will immediately identify a translation equivalent, should one be available, through overt translation into L1, by an L1 or L2 deÞnition, by some icon (e.g., a picture or mime), contextual cues, or by whatever other medium. This is because when language input is received, it is the immediate and inevitable responsibility of the language faculty in the mind/brain to deal with it, whether it is from the L1 or the L2. The central purpose of the language faculty is, I assume, to assign forms to meanings, and meanings to forms, using any and all linguistic resources available. We witness this assumption of translation equivalence in learner errors, for example where an L2 form is used erroneously in the syntactic frame of a semantically equivalent L1 lexical entry. Hall and Schultz (1994) collected lexical errors from 125 compositions written by Mexican learners of English at the beginner level, in which the majority of examples of incorrect syntactic deployment could be traced to the grammatical behaviour of translation equiv- alents (74 % of the 104 errors detected). Examples include (1a), on the basis of (1b): (1) a. It would like you. (produced) b. Te gustaría you-object it-would-please ‘You would like it.’ (intended meaning) Here, the English verb like is rightly taken as the most appropriate translation equivalent of the Spanish verb gustar. The problem is that the two verbs behave differently with regard to their syntactic deployment. Both verbs select the same thematic roles as arguments, but (simplifying slightly), like places the experiencer in subject (preverbal) position and the theme in object (postverbal) position, whereas Spanish gustar does the opposite (following the pattern of Eng. please).2 We can represent this phenomenon schematically using diagrams that follow the conventions of Hall’s (1992) “Triad Model”, according to which entries in the mental lexicon are built from pairs of linguistic representations (phono- logical/orthographic form and syntactic frame) connected with nonlinguistic conceptual representations (cf. the lexeme/lemma/concept distinction of Levelt (e.g., Levelt 1989; Levelt et al. 1999) and Kroll’s distributed lexical/conceptual The automatic cognate form assumption 73

FORM FRAME CONCEPT

gustar V, ___ ‘like’

like L1 LEXICAL ENTRY

L2 LEXICAL ENTRY

Figure 1. Noncognate translation equivalent feature model of the bilingual lexicon (Kroll and de Groot 1997). Hence, in

Figure 1, the L2 form like is connected in subordinate fashion to the frame

   representation of the L1 lexical entry (where T ___ E represents the appro- priate thematic grid):3 According to the Parasitic Strategy, the normal pattern for learners in the early stages of acquisition will be to connect novel L2 words to the frame representation (lemma) of a translation equivalent (leading to errors such as [1a]). Formal cognates, however, as items sharing some criterial amount of phono- logical and/or orthographic form, might lead automatically to even more eco- nomical representation-building and retention, whether the concept identiÞed is the correct one or not (or even if no concept is identiÞed at all). The identi- Þcation of cognates, both true and false, constitutes a form of cross-linguistic inßuence (Ringbom 1987; Hatch and Brown 1995). True cognates (for exam- ple Eng. rose and Sp. rosa) are lexical items which are common to L1 and L2, despite superÞcial phonological or orthographic differences, as a result of ei- ther shared lexical inheritance from a common ancestor language, or through inter-language borrowing. In true cognates, a form’s meaning equivalence has been maintained, historically, between L1 and L2, whereas in false cognates either the meanings have diverged (e.g., Sp. actual ‘current’ and Eng. actual, from Lat. actualis ‘active, practical’) or the items are actually unconnected historically (e.g., Sp. tuna ‘prickly pear’ and Eng. tuna). The learner’s identiÞcation of a cognate (which may turn out to be true or false), is represented in the model as a direct connection at the form representa- tion level, as diagrammed in the example of the true cognate in Figure 2, where 74 Christopher J. Hall

rosa N ‘rose’

---e

Figure 2. Cognate translation equivalent

(ignoring minor details such as vowel quality) the L1 word rosa differs only in the addition of a Þnal vowel from the L2 form rose. Given this interlexical connection (shown in Figure 2 as dotted lines rep- resenting redundancy connections from the L2 form to the L1 orthographic representation), retention of the L2 form (i.e., the representation’s achievement of permanence in lexical memory) should require less effort, since much of the form is already in place in the (robust) L1 lexicon. A logical consequence of this process is, however, the student’s construc- tion of false cognates. We distinguish here between two types of false cog- nate: (i) true false cognates, and (ii) indirect cognates, representing points on a continuum rather than poles. (Holmes and Ramos (1993) identify a “cline of ‘cognateness’ ” and Granger (1996, cited in James 1998: 148) makes a similar distinction between “totally deceptive” and “partially deceptive” cog- nates). A true false cognate would be Eng. tuna for Sp. tuna ‘prickly pear’ (the Spanish translation of Eng. tuna is ‘atún’). An indirect cognate, on the other hand, shares some criterial amount of salient features in conceptual memory, of which the learner may be consciously aware, for example Eng. library used for Sp. librería ‘bookshop’, instead of for Sp. biblioteca. Here the semantic over- lap is clear: both concepts involve a building containing shelves from which clients take away one or more books (in the second case, they leave money and do not bring the books back). In Figure 3 and Figure 4 these examples are schematized, showing both the learner error (via the dotted line redundancy connections) and the target conÞguration (from L2 form to L1 syntactic frame representation, via the subordinate connection represented here by a dashed line). In essence, learners create such non-target conÞgurations because they are trying to maximize the use of already established language structure, i.e., en- tries in the L1 mental lexicon. In other words, they are exploiting the L1 lex- icon in parasitic fashion. We should not be surprised that L2 learners execute The automatic cognate form assumption 75

atún N ‘tuna’

----

tuna N ‘prickly pear’

Figure 3. True false cognate

biblioteca N

‘library’ ----a-y

‘bookshop’

libre ría N

Figure 4. Indirect cognate 76 Christopher J. Hall the most economical strategy in their acquisition of vocabulary. The parasitic nature of L2 vocabulary learning and representation is in full accord with gen- eral principles governing the nature of mental representation and processing, such as economy (Martinet 1964; Lightfoot 1979; Haiman 1983), least effort (Zipf 1949; Slobin 1977; Chomsky 1991) and accommodation (Piaget and In- helder 1969). Instead of constructing an entirely new knowledge store for the L2, learners utilize the store they have already in place. Instead of duplicating information, at the form level with cognates and at the frame level with gram- matical equivalents, they list it only once and attach new L2 forms to existing representations. Naturally, the account of the vocabulary development process given by the Parasitic Strategy can only be partial, since, as Singleton (1999) and others have pointed out, there is much more to learning words than registering their form and connecting them with representations in the L1 mental lexicon. Clear- ly, forms which remain unconnected with appropriate syntactic, semantic, col- locational and sociocultural features for the L2 will fossilize as interlanguage phenomena. Nevertheless, even though the process is not uniquely form-driven, even at the beginning stages (cf. Talamas et al. 1999), patently all learners must establish some memory representation of a phonological and/or orthographic form to which the other characteristics of wordship may be anchored. If the learner’s native language affords storage and access cues for an L2 word form, in the shape of a cognate, then it seems probable that the language faculty will try to detect and exploit these cues.

2. The experiment In the study reported here, one aspect of the Parasitic Strategy was subjected to scrutiny: the automatic use of form cognates in the construction of a rep- resentation for a new L2 word, and the consequent assumption that they are true cognates, independently of whether evidence is available to suggest that they share semantic content. A simple experiment was conducted to conÞrm or disconÞrm this prediction of the strategy.

2.1. Design and hypotheses Spanish-speaking learners of English as a foreign language participated in a word familiarity report task, where the word list presented included items which were formal cognates with word forms in the L1. In order to isolate form from confounding variables introduced by semantic and syntactic factors, the experimental stimuli were all nonwords, some overlapping in form with real L1 word forms (pseudocognates), others not (noncognates). For each item, participants were instructed to record whether they had seen it before, and to provide a word in the L1 which they judged might be closest to it in meaning. The automatic cognate form assumption 77

Unlike nonword use in tasks such as lexical decision (where participants make speeded decisions as to whether a stimulus is an existing word in the language or not), here the nonword stimuli should have the same status as unknown real L2 words. This is because (a) participants have limited knowledge of English vocabulary and (b) they are required only to judge familiarity with the stimuli, rather than assess their status as words. Thus, they have no reason to believe that stimuli will be anything other than existing words of the language. Following the predictions of the Parasitic Hypothesis, the following hy- potheses were formulated: 1. The mean number of participants reporting having seen the word before will be higher for pseudocognates than for noncognates. 2. The mean number of possible L1 translations given across participants per item will be lower for pseudocognates than for noncognates. 3. The mean number of participants responding with the most favoured trans- lation per item across participants will be higher for pseudocognates than for noncognates. 4. The degree of overlap between nonword and suggested translation will be greater for pseudocognates than for noncognates. Hypothesis 1 reßects the assumption of the Parasitic Strategy that participants will automatically activate any stored form in the L1 which signiÞcantly over- laps with the L2 word, and thus will be more likely to report familiarity with it. Hypotheses 2 and 3 respond to the prediction of the Parasitic Strategy that similar form will be taken automatically as a token of similar meaning (as in the case of false cognates), and therefore that the pseudocognates will con- strain participants’ translation guesses more than the noncognates, both across responses (Hypothesis 2) and across participants (Hypothesis 3). Finally, Hy- pothesis 4 afÞrms the essential corollary that what constrains translation word selection is, precisely, overlap of form.

2.2. Participants Ninety-Þve native Spanish-speaking university students participated in the ex- periment. All were enrolled in sections of an intermediate English for Aca- demic Purposes course. They received extra course credit for their participa- tion.

2.3. Materials A set of English nonwords (orthographically and phonologically well-formed but non-occuring words) was constructed. To detect the role of form overlap with the L1 (Spanish), half of the nonwords were designed so that they had cognates in Spanish (henceforth called pseudocognates). For example, one of the English pseudocognates constructed was *stribe (cognate with Spanish es- 78 Christopher J. Hall tribo ‘stirrup’); another was *campanary (cognate with Spanish campanario ‘bell tower’). The other half of the nonword set were non-cognates, i.e., evinc- ing no form overlap with Spanish words, such as *plude and *thrimble (see Appendix). Fifteen pseudocognates and Þfteen noncognates were designed and random- ly distributed among 60 real English words, half of which were cognates and half not. The complete list of 90 items was controlled for the following vari- ables: ◦ Frequency: The nonwords have zero frequency, so in order to provide a bal- anced list and avoid demoralizing participants, the majority of real words used were of high frequency, as were their translations in Spanish. Forty of the sixty real English words used, together with their Spanish equivalents, occurred in the Þrst 1000 most frequent words in Eaton’s (1940) crosslin- guistic frequency list. The other twenty English words and their Spanish equivalents were taken from the sixth or seventh thousand words in the Eaton list. Of the Þfteen pseudocognates, six had Spanish cognates from the Þrst thousand words in the list, while nine were taken from the sixth or seventh 1000 words. ◦ Length:Anattempt was made to provide a fair spread of word lengths, re- ßecting the kind of distribution these students would have been exposed to, but erring on the side of simplicity (fewer syllables), in order not to make the list appear too offputting. Of the nonwords, Þve were monosyllabic, Þve bisyllabic and Þve polysyllabic. Of the sixty real words, thirty were mono- syllabic, twenty bisyllabic, and ten polysyllabic. ◦ Morphological complexity:Inorder to reßect the reality of English and Spanish vocabulary, the polysyllabic pseudocognates and noncognates all used transparent sufÞxes which were cognate with their Spanish counterparts (e.g., Eng. pseudocognate *sper-ance versus Spanish esper-anza ‘hope’; and noncognate *onter-ize using cognate sufÞx-ize, Sp. -izar). Mono- and bisyl- labic nonwords were morphologically simple (except for one preÞxed form in both the pseudo- and noncognate bisyllables). Only Þve of the real words had a transparent afÞx. ◦ Cognate status: The principal criterion to determine cognate status for both real and nonwords was that the pair must share at least two-thirds of their form, in number of letters, without the substitution, epenthesis, metathesis or omission of any letter.4 The only exceptions admitted, in accord with English-Spanish comparative norms, were the substitution of s for z,orñ for n; the epenthesis of e before s at word onset; or the omission of repeated consonants in the case of English gemination. Exhaustive searches of the Diccionario de la Lengua Española (1984) and The Shorter Oxford English Dictionary (1973), and consultation with two panels of three native speakers, ensured that the noncognates were in fact such. The automatic cognate form assumption 79

To avoid order effects, the ninety (randomized) stimuli were divided into four blocks, and the order of presentation of blocks differed for each of the partici- pant groups.

2.4. Procedure The stimuli were presented visually to participants in seven intact classroom groups, averaging 13.6 in each group. Ten warm-up items, real English words reßecting a range of frequencies of occurrence, were added at the beginning of each session, but not revealed as such to participants. Participants saw the 100 items centred, in large lowercase typeface, on a video monitor. Each word appeared for a duration of twenty seconds, with a blank screen of Þve sec- onds between each word, and a twenty second delay between each of the Þve blocks. After the warm-up block, the experimenter clariÞed any doubts about the procedure. Participants were asked to perform two tasks for each word they saw: (i) record whether or not they thought they had seen the word before: and (ii) write down what they thought could be the Spanish word closest in meaning to the English word presented, even if they had to guess. Participants recorded their responses on a simple answer sheet containing numbers and spaces for writing response words. They recorded familiarity with the word by circling the corresponding number, and unfamiliarity by placing a cross over it. They wrote down the Spanish word closest in meaning in the space next to the cor- responding number.

2.5. Results Results were collected from a total of ninety-Þve participants. Due to experi- menter error, data were lost for one item of the pseudocognate condition. The results were coded, per item per condition, according to Þve separate criteria designed to address the hypotheses presented above: 1. Number of participants reporting familiarity with each word. 2. Number of different Spanish words given as translations, per item. 3. Number of participants responding with the most frequent translation into Spanish, per item. 4. a. Number of participants responding with Spanish translations sharing the same initial letter with the English word. b. Number of participants responding with Spanish translations sharing the same Þrst three consonants as the English word, in the same order. The overlap criteria in (4), of shared initial letter and Þrst three consonants, were used instead of the original cognate criteria, since, obviously, the noncog- nates had no L1 cognates by design. Mean scores and percentages for each criterion are given in Tables 1–5. 80 Christopher J. Hall

Table 1. Mean number of participants reporting familiarity with stimulus, across items

Pseudocognate condition Noncognate condition Participants 38.36 5.33 [n = 95] (40.37 %) (06.01 %)

Table 2. Mean number of translations reported per item, across participants

Pseudocognate condition Noncognate condition Translations 14.14 41.33

Table 3. Mean number of participants responding with the most frequent translation, across items Pseudocognate condition Noncognate condition Participants 54.5 12.6 [n = 95] (57.37 %) (13.76 %)

Table 4. Mean number of participants responding with forms sharing initial letter with stimulus, across items

Pseudocognate condition Noncognate condition Participants 77.57 46.33 [n = 95] (81.65 %) (48.73 %)

Table 5. Mean number of participants responding with forms sharing first three conso- nants with stimulus, across items

Pseudocognate condition Noncognate condition Participants 74.93 17.07 [n = 95] (78.87 %) (19.02 %)

These Þve between-condition scores were subjected to a series of t-tests to verify whether the differences were signiÞcant. All t-tests showed that the differences were statistically signiÞcant at at least p ≤ 0.001 (Criterion 1: t(13) = 5.821, p ≤ 0.0001; Criterion 2: t(13) = 7.796, p ≤ 0.0001; Criterion 3: t(13) = 6.096, p ≤ 0.0001; Criterion 4(a): t(13) = 4.284, p ≤ 0.001; Criterion 4(b): t(13) = 8.945, p ≤ 0.0001).

3. Discussion Participants consistently reported that pseudocognates were more familiar than noncognates, even though neither group contained real words. In addition, a much narrower range of translations was provided for pseudocognates than for The automatic cognate form assumption 81 noncognates, and the most frequent translation given was provided by a much larger group of participants in the former condition than in the latter. An analy- sis of the translations provided shows that the pseudocognate responses shared signiÞcantly more formal features than those of the noncognate condition, thus strongly suggesting that this difference is indeed due to form similarity and not other factors. These results clearly show that, in line with the results for non-ßuent bilin- guals yielded in the study by Talamas et al. (1999), intermediate EFL students assume that shared form indicates shared meaning. In operational terms, this implies that learners are sensitive to form overlap, and not only that they au- tomatically register it, but that this formal connection leads transitively to an assumption of semantic overlap, as shown by the higher rates of familiarity and translation conformity for pseudocognates as opposed to noncognates. These results, though collected with artiÞcial forms in an artiÞcial environ- ment, support a major claim of the Parasitic Hypothesis, by showing that learn- ers, on encountering novel vocabulary items (in this case nonwords), initially use already existing form information from the words they know in order to conÞrm or create hypotheses about word meaning. Of course, in the task they were asked to perform here, independent semantic information (in the form of context sentences, texts, topics, pictures, or real situations of language use) was not provided. Earlier naturalistic studies, however, seem to conÞrm that form familiarity can override such cues, a phenomenon observed by Holmes and Ramos (1993: 92) in their study of cognate use in reading comprehension, and termed by them “reckless guessing”. Laufer (1989) reports similar cases, where overlapping forms in the L2 (syn- forms) inßuence word recognition and subsequent semantic processing. For example, one participant understood sentence (2) below as (3), by confusing the words nurturing with natural, fending with finding, and leaving with living:

(2) This nurturing behaviour, this fending for females instead of leaving them to fend for themselves, may take many different forms. (3) Instead of living natural life, natural behaviour, females and children find many different forms of life.

These synform confusion data are matched by some of the responses given in the present study. For example, the pseudocognate *gan yielded four instances of pistola ‘pistol’ and two of arma ‘weapon’, presumably due to its similarity (at least in American English pronunciation) with the word gun. Similarly, the pseudocognate *tard was translated as duro ‘hard’, by two participants, and the pseudocognate *encendate as fecha ‘date’, by four participants. Such data conÞrm Ecke and Hall’s (1999, 2000) claims that the Parasitic Strategy is essentially promiscuous with regard to the language source of potential form 82 Christopher J. Hall associates accessed, i.e., that any form in L1 or L2 (or L3) can inßuence the processing of any other form in L1 or L2 (or L3) with which it overlaps. The results of this experiment are consistent with connectionist, interactive activation models of the lexicon (cf. Rumelhart and McClelland 1986), an ap- proach which has been extended to the bilingual lexicon (e.g., Dijkstra and Van Heuven 1998; Kroll and de Groot 1997) and to second language acqui- sition (e.g., MacWhinney 1997; Ellis 1998). Connectionism views the mind as a vast network of simple processing units, where complex mental states and behaviours are the result of different conÞgurations of the network (in a style similar to the neurophysiological network of neurons in the brain). In a distributed network, maximal economy is envisaged. For example, instead of separate nodes to represent the word forms bit, sit, fit, kit, lit, etc., there would be activation in the areas of the network representing the syllable nucleus and coda -it, together with activation of mutually-inhibiting patterns representing the consonantal onsets b-, s-, f -, k-, l-, etc. Learning word forms then becomes a process of mapping new input across the existing network in a succession of cycles, through which connection weights and facilitatory or inhibitory effects become settled in different ways. The present results and the Parasitic Strategy in general may be interpreted within such a parallel distributed framework: recall that parasitic vocabulary learning means establishing new form representations and connecting them with existing units (accommodating them into the existing network). With cognates, much of the “new” material is already represented, distributed across the network conÞgured for L1 lexical knowledge, and so little new must be learnt at the form level (the redundancy connections used in Figures 2–4 can be taken as schematic oversimpliÞcations of patterns of activation of shared form features, independent of particular lexical entries). Activation of an L1 form through overlap with the form features of a novel L2 word will automatically trigger activation of the meaning of the L1 word, via spreading activation from lexical to conceptual levels of the network. This much falls out from connec- tionist models such as Dijkstra and colleagues’ Bilingual Interactive Activation (BIA) model of word recognition (Dijkstra and Van Heuven 1998). The Para- sitic Strategy takes this further by making explicit the fact that such transitory activation in the online recognition process can lead to more permanent con- nections, which inßuence the outcomes of vocabulary acquisition (for example, the fossilization of false cognates). It appears, then, that in early word learning, form information is automati- cally and unconsciously exploited, via pattern-matching between the contents of working memory (the new form) and long-term lexical memory (the L1 and L2 mental lexicons). The connections thus generated lead to the establishment of lexical triads of a parasitic nature, where most of the L2 entry is made up of already existing L1 material. Such is the automaticity of this process that a The automatic cognate form assumption 83 signiÞcant number of participants in the study described here report that they are familiar with nonwords, on the basis of the extent of their formal overlap with cognates in the L1. In applied linguistics and language pedagogy, we often underestimate how much learners bring to the learning task. Matthei and Roeper (1983: 86), dis- cussing the instinctual nature of human language processing, refer tangentially to the problem of second language learning:

The reader can ask himself ... whether he thinks that the kind of knowledge [of language] we have and use every day could really be learned or if understanding language is like opening one’s eyes and seeing. For language we must open our ears, make connections between words and things, and adjust our grammars in some slight ways. The rest may be all there. Making ‘slight adjustments’ and a few ‘connections between words and things’ can be taken to refer to the special characteristics of each language. Such operations seem monumental to those in language classes or to tourists trying to make themselves understood in a foreign country, but they may actually be quite minimal. It is like noticing that human beings all look quite different from each other; but, in many respects, we all look exactly the same.

Although it would be absurd to characterize second language learning as fun- damentally instinctual in this way (given all that we know of maturational, environmental, affective, motivational and other factors which distinguish L1 from L2 development), we would do well to recognize that parts of the process must be the result of automatic cognitive procedures, and that psycholinguists can make important contributions to our understanding of language learning (and possibly to the development of more effective teaching practice) by ex- ploring the conditions and scope of such procedures.

Universidad de las Américas-Puebla  [email protected] 84 Christopher J. Hall

Appendix: Items used in the pseudocognate and noncognate conditions (Spanish cognates and English translations are provided in parentheses for the pseudocognates.)

Pseudocognates Noncognates stribe (estribo ‘stirrup’) pirt mirl (mirlo ‘blackbird’) jiss pulge (pulga ‘ßea’) tarm gan (ganar ‘win’) plude tard (tarde ‘late’) rause recort (recortar ‘cut’ (V)) belmer halcone (halcón ‘falcon’) purtent gemel (gemelo ‘twin’) extrow entend (entender ‘understand’) elter bastant (bastante ‘enough’) thrimble campanary (campanario ‘bell tower’) muttlement menesterous (menesteroso ‘needy’) urnimary esperance (esperanza ‘hope’ (N)) onterize atraverse (atravesar ‘cross’ (V)) chailerate encendate (encender ‘light’ (V)) astazance

Notes 1. This research was supported by a grant from the UDLA Institute of Research and Graduate Studies. I gratefully acknowledge the generous time and effort dedicated to this project by my research assistant Moya Schultz. I also thank Peter Ecke and two anonymous reviewers for their helpful comments on the manuscript. 2. In an earlier stage of English, like also shared this syntactic behaviour (e.g. It likes

us well – meaning ‘We like it well’ – from Shakespeare’s King John, II.1.533).

   3. T = theme and E =experiencer. 4. Neighbourhood density and frequency in the L1 and L2 (Grainger and Dijkstra 1992; Van Heuven et al. 1998) were not calculated. A future experiment might proÞtably manipulate such factors, to assess whether degree of formal overlap be- yond a single form in the L1 might affect reported familiarity. To my knowledge, neighbourhood effects have so far been observed only in online word recognition tasks such as lexical decision.

References

Ard, Josh and Taco Homburg (1983). VeriÞcation of . In Language transfer in Language Learning, Susan Gass and Larry Selinker (eds.), 157–176. Rowley, MA: Newbury House. The automatic cognate form assumption 85

Chen, Hsuan-Chih and Yuen-Sum Leung (1989). Patterns of lexical processing in a non-native language. Journal of Experimental Psychology: Learning, Memory and Cognition 15 (2): 316–325. Chomsky, Noam (1991). Some notes on economy of derivation and representation. In Principles and Parameters in Comparative Grammar, Robert Freidin (ed.), 417–454. Cambridge, MA: MIT Press. de Groot, Annette M. B. (1992). Determinants of word translation. Journal of Experimental Psychology: Learning, Memory and Cognition 18 (5): 1001–1018. — (1993). Word-type effects in bilingual processing tasks: Support for a mixed-representational system. In The Bilingual Lexicon, Robert Schreuder and Bert Weltens (eds.), 27–51. Amster- dam: John Benjamins. de Groot, Annette M. B. and Gerard L. J. Nas (1991). Lexical representation of cognates and non-cognates in compound bilinguals. Journal of Memory and Language 30 (1): 90–123. Diccionario de la Lengua Española (1984). 20th Edition. Madrid: Real Academia Española. Dijkstra, Ton and Walter J. B. Van Heuven (1998). The BIA model and bilingual word recognition. In Localist Connectionist Approaches to Human Cognition,J.Grainger and A. Jacobs (eds.), 189–225. Hillsdale, NJ: Lawrence Erlbaum Associates. Dijkstra, Ton, Henk Van Jaarsveld and Sjoerd Ten Brinke (1998). Interlingual homograph recog- nition: Effects of task demands and language intermixing. Bilingualism: Language and Cog- nition 1 (1): 51–66. Eaton, Helen S. (1940). Semantic Frequency List for English, French, German, and Spanish. Chicago: University of Chicago Press. Ecke, Peter (1996). Cross-language Studies of Lexical Retrieval: Tip-of-the-tongue States in First and Foreign Languages. Doctoral dissertation, University of Arizona. — (1997). Tip of the tongue states in Þrst and foreign languages: Similarities and differences of lexical retrieval failures. In Proceedings of the EUROSLA 7 Conference, 505–514. Barcelona: EUROSLA. Ecke, Peter and Merrill Garrett (1998). Lexical retrieval stages of momentarily inaccessible foreign language words. Ilha do Desterro. Special Issue on the Cognitive Aspects of Foreign/Second Language Acquisition/Learning 35: 157–183. Ecke, Peter and Christopher J. Hall (1998). Tres niveles de la representación mental: Evidencia de errores léxicos en estudiantes de un tercer idioma. Estudios de Lingüística Aplicada 28: 15–26. — (2000). Lexikalische Fehler in Deutsch als Drittsprache: Translexikalischer Einßuss auf 3 Ebenen der Mentalen Reprasentation. Deutsch als Fremdsprache 37 (1): 30–36. Ellis, Nicholas C. (1998). Emergentism, connectionism and language learning. Language Learn- ing 48 (4): 631–664. Grainger, Jonathan and Ton Dijkstra (1992). On the representation and use of language information in bilinguals. In Cognitive Processing in Bilinguals,R.J.Harris (ed.), 207–220. Amsterdam: North Holland. Granger, Sylviane (1996). Romance words in English: from history to pedagogy. In Words: KVHAA Conference 36, Jan Svartvik (ed.), 105–121. Stockholm: Almquist and Wiksell. Haiman, John (1983). Iconic and economic motivation. Language 59: 781–819. Hall, Christopher J. (1992). Making the right connections: Vocabulary learning and the mental lexicon. Unpublished manuscript, Universidad de las Américas, Puebla: ERIC Document Reproduction Service No. ED 363 128. — (1996). La estrategia parasítica: Un modelo psicolingüístico del aprendizaje de vocabulario. In La Imaginación y la Intelegencia en el Lenguaje: Homenaje a Roman Jakobson, Susana Cuevas and Julieta Haidar (eds.), 229–238. Mexico City: INAH. — (1997). Palabras concretas, palabras abstractas y rasgos categoriales en el léxico mental bil- ingüe. In Varia Lingüística y Literaria: 50 Años del CELL, Tomo 1. Lingüística, Rebeca 86 Christopher J. Hall

Barriga Villanueva and Pedro M. Butrageño (eds.), 363–381. Mexico City: El Colegio de México. Hall, Christopher J. and Moya Schultz (1994). Los errores de marco sintáctico: Evidencia del Modelo Parasitario del léxico mental en un segundo idioma. In Memorias del II Congreso Nacional de Lingüística, special issue of Estudios en Lingüística Aplicada, 376–389. Mexico City: UNAM, INAH and ECM. Hatch, Evelyn and Cheryl Brown (1995). Vocabulary, Semantics, and Language Education. Cam- bridge: Cambridge University Press. Holmes, John and Rosinda G. Ramos (1993). False friends and reckless guessers. Observing cog- nate recognition strategies. In Second Language Reading and Vocabulary Learning, Thomas Huckin, Margot Haynes and James Coady (eds.), 86–108. Norwood, NJ: Ablex. James, Carl (1998). Errors in Language Learning and Use. London: Longman. Jin, Young-Sun (1990). Effects of concreteness on cross-language priming in lexical decision. Perception and Motor Skills 70: 1139–1154. Koda, Keiko (1997). Orthographic knowledge in L2 lexical processing. A cross-linguistics per- spective. In Second Language Vocabulary Acquisition, James Coady and Thomas Huckin (eds.), 35–52. Cambridge: Cambridge University Press. Kroll, Judith F. and Annette M. B. de Groot (1997). Lexical and conceptual memory in the bilin- gual: Mapping form to meaning in two languages. In Tutorials in Bilingualism. Psycholin- guistic Perspectives, Annette M. B. de Groot and Judith F. Kroll (eds.), 169–199. Mahwah, NJ: Lawrence Erlbaum Associates. Kroll, Judith F. and Erika Stewart (1994). Category interference in translation and picture naming: Evidence for asymmetric connections between bilingual memory representations. Journal of Memory and Language 33 (2): 149–174. Laufer, Batia (1989). A factor of difÞculty in vocabulary learning: Deceptive transparency. AILA Review 6: 10–20. — (1997). The lexical plight in second language reading. Words you don’t know, words you think you know, and words you can’t guess. In Second Language Vocabulary Acquisition, James Coady and Thomas Huckin (eds.), 20–34. Cambridge: Cambridge University Press. Levelt, Willem J. M. (1989). Speaking. From Intention to Articulation. Cambridge, MA: MIT Press. Levelt, Willem J. M., Ardi Roelofs and Antje S. Meyer (1999). A theory of lexical access in speech production. Behavioral and Brain Sciences 22 (1): 1–75. Lightfoot, David (1979). Principles of Diachronic Syntax. Cambridge: Cambridge University Press. MacWhinney, Brian (1997). Second language acquisition and the Competition Model. In Tutorials in Bilingualism. Psycholinguistic Perspectives, Annette M. B. de Groot and Judith F. Kroll (eds), 113–142. Mahwah, NJ: Lawrence Erlbaum Associates. Martinet, André (1964). Elements of General Linguistics. Chicago: University of Chicago Press. Matthei, Edward and Thomas Roeper (1983). Understanding and Producing Speech. London: Fontana. Meara, Paul (1984). The study of lexis in interlanguage. In Interlanguage,A.Davies, C. Criper and A. P. R. Howatt (eds.), 225–235. Edinburgh: Edinburgh University Press. Nation, I. S. Paul (1990). Teaching and Learning Vocabulary.New York: Newbury House. Odlin, Terence (1989). Language Transfer. Cross-linguistic influence in language learning. Cam- bridge: Cambridge University Press. Piaget, Jean and Bärbel Inhelder (1969). The Psychology of the Child. London: Routledge and KeganPaul. Ringbom, Håkan (1987). The Role of the First Language in Foreign Language Learning. Cleve- don, PH: Multilingual Matters. The automatic cognate form assumption 87

Rumelhart, David, James L. McClelland and the PDP Research Group (1986). Parallel Distributed Processing: Explorations in the Microstructure of Cognition, Vol. 1, Foundations. Cam- bridge: MIT Press. Sánchez Casas, Rosa M., Christopher W. Davis and José E. García-Albea (1992). Bilingual lexical processing: Exploring the cognate/noncognate distinction. European Journal of Cognitive Psychology 4: 293–310. Shorter Oxford English Dictionary (1973). Oxford: Oxford University Press. Singleton, David (1999). Exploring the Second Language Mental Lexicon. Cambridge: Cambridge University Press. Slobin, Dan I. (1977). Language change in childhood and history. In Language learning and thought,J.MacNamara (ed.), 185–214. New York: Academic Press. Sweet, Henry (1972 [1899]). The Practical Study of Languages. London: Oxford University Press. Talamas, Adrienne, Judith F. Kroll and Robert Dufour (1999). From form to meaning: Stages in the acquisition of second-language vocabulary. Bilingualism: Language and Cognition 2 (1): 45–58. Van Heuven, Walter J. B., Ton Dijkstra and J. Grainger (1998). Orthographic neighbourhood effects in bilingual word recognition. Journal of Memory and Language 39 (3): 458–483. Zipf, George K. (1949). Human Behavior and the Principle of Least Effort. New York: Addison- Wesley.