Quick viewing(Text Mode)

Psychological Review J Copyright © 1977 C ? by the American Psychological Association, Inc

Psychological Review J Copyright © 1977 C ? by the American Psychological Association, Inc

J Copyright © 1977 C_? by the American Psychological Association, Inc. VOLUME 84 NUMBER 4 JULY 1977

Features of Similarity

Amos Tversky Hebrew University Jerusalem,

The metric and dimensional assumptions that underlie the geometric represen- tation of similarity are questioned on both theoretical and empirical grounds. A new set-theoretical approach to similarity is developed in which objects are represented as collections of features, and similarity is described as a feature- matching process. Specifically, a set of qualitative assumptions is shown to imply the contrast model, which expresses the similarity between objects as a linear combination of the measures of their common and distinctive features. Several predictions of the contrast model are tested in studies of similarity with both semantic and perceptual stimuli. The model is used to uncover, analyze, and explain a variety of empirical phenomena such as the role of common and distinctive features, the between judgments of similarity and differ- ence, the presence of asymmetric similarities, and the effects of context on judgments of similarity. The contrast model generalizes standard representa- tions of similarity data in terms of clusters and trees. It is also used to analyze the relations of prototypicality and family resemblance.

Similarity plays a fundamental role in errors of substitution, and correlation between theories of knowledge and behavior. It serves occurrences. Analyses of these data attempt to as an organizing principle by which individuals explain the observed similarity relations and classify objects, form concepts, and make gen- to capture the underlying structure of the ob- eralizations. Indeed, the concept of similarity jects under study. is ubiquitous in psychological theory. It under- The theoretical analysis of similarity rela- lies the accounts of and response tions has been dominated by geometric generalization in learning, it is employed to models. These models represent objects as explain errors in memory and pattern recogni- points in some coordinate space such that the tion, and it is central to the analysis of con- observed dissimilarities between objects cor- notative . respond to the metric distances between the Similarity or dissimilarity data appear in respective points. Practically all analyses of different forms: ratings of pairs, sorting of proximity data have been metric in nature, objects, communality between associations, although some (e.g., hierarchical clustering) yield tree-like structures rather than dimen- This paper benefited from fruitful discussions with sionallv. organized spaces. However, most Y. Cohen, I. Gati, D. Kahneman, L. Sjeberg, and theoretical and empirical analyses of similarity S. Sattath. assume that objects can be adequately repre- Requests for reprints should be sent to , , , ... ,. , i Department of Psychology, Hebrew University, sented as points m some coordinate space and Jerusalem, Israel. that dissimilarity behaves like a metric dis- 327 328 AMOS TVERSKY tance function. Both dimensional and metric and argues that similarity should not be assumptions are open to question. treated as a symmetric relation. It has been argued by many authors that Similarity judgments can be regarded as dimensional representations are appropriate extensions of similarity statements, that is, for certain stimuli (e.g., colors, tones) but not statements of the form "a is like b." Such a for others. It seems more appropriate to repre- statement is directional; it has a subject, a, sent faces, countries, or personalities in terms and a referent, b, and it is not equivalent in of many qualitative features than in terms of general to the converse similarity statement a few quantitative dimensions. The assessment "b is like a." In fact, the choice of subject of similarity between such stimuli, therefore, and referent depends, at least in part, on the may be better described as a comparison of relative salience of the objects. We tend to features rather than as the computation of select the more salient stimulus, or the proto- metric distance between points. type, as a referent, and the less salient stimu- A metric distance function, 6, is a scale that lus, or the variant, as a subject. We say "the assigns to every pair of points a nonnegative portrait resembles the person" rather than number, called their distance, in accord with "the person resembles the portrait." We say the following three axioms: "the son resembles the father" rather than "the father resembles the son." We say "an Minimality: ellipse is like a circle," not "a circle is like an 8(a,b) > S(a,a) = 0. ellipse," and we say "North Korea is like Red Symmetry: China" rather than "Red China is like North S(a,b) = 5(b,a). Korea." As will be demonstrated later, this asym- The triangle inequality: metry in the choice of similarity statements is «(a,b) + S(b,c) > 8(a,c). associated with asymmetry in judgments of similarity. Thus, the judged similarity of To evaluate the adequacy of the geometric North Korea to Red China exceeds the judged approach, let us examine the validity of the similarity of Red China to North Korea. Like- metric axioms when 8 is regarded as a measure wise, an ellipse is more similar to a circle than of dissimilarity. The minimality axiom implies a circle is to an ellipse. Apparently, the direc- that the similarity between an object and itself tion of asymmetry is determined by the rela- is the same for all objects. This assumption, tive salience of the stimuli; the variant is however, does not hold for some similarity more similar to the prototype than vice versa. measures. For example, the probability of The directionality and asymmetry of simi- judging two identical stimuli as "same" rather larity relations are particularly noticeable in than "different" is not constant for all stimuli. similies and metaphors. We say "Turks fight Moreover, in recognition experiments the off- like tigers" and not "tigers fight like Turks." diagonal entries often exceed the diagonal Since the tiger is renowned for its fighting entries; that is, an object is identified as an- spirit, it is used as the referent rather than other object more frequently than it is identi- the subject of the simile. The poet writes "my fied as itself. If identification probability is love is as deep as the ocean," not "the ocean interpreted as a measure of similarity, then is as deep as my love," because the ocean these observations violate minimality and are, epitomizes depth. Sometimes both directions therefore, incompatible with the distance are used but they carry different meanings. model. "A man is like a tree" implies that man has Similarity has been viewed by both philoso- roots; "a tree is like a man" implies that the phers and as a prime example of a symmetric relation. Indeed, the assumption tree has a life history. "Life is like a play" of symmetry underlies essentially all theo- says that people play roles. "A play is like retical treatments of similarity. Contrary to life" says that a play can capture the essential this tradition, the present paper provides elements of human life. The relations between empirical evidence for asymmetric similarities the interpretation of metaphors and the as- FEATURES OF SIMILARITY 329 sessment of similarity are briefly discussed in ity is somewhat problematic, symmetry is ap- the final section. parently false, and the triangle inequality is The triangle inequality differs from minimal- hardly compelling. ity and symmetry in that it cannot be formu- The next section develops an alternative lated in ordinal terms. It asserts that one theoretical approach to similarity, based on distance must be smaller than the sum of two feature matching, which is neither dimensional others, and hence it cannot be readily refuted nor metric in nature. In subsequent sections with ordinal or even interval data. However, this approach is used to uncover, analyze, and the triangle inequality implies that if a is quite explain several empirical phenomena, such as similar to b, and b is quite similar to c, then the role of common and distinctive features, a and c cannot be very dissimilar from each the relations between judgments of similarity other. Thus, it sets a lower limit to the simi- and difference, the presence of asymmetric larity between a and c in terms of the similari- similarities, and the effects of context on simi- ties between a and b and between b and c. larity. Extensions and implications of the The following example (based on William present development are discussed in the final James) casts some doubts on the psychological section. validity of this assumption. Consider the simi- larity between countries: Jamaica is similar Feature Matching to Cuba (because of geographical proximity); Cuba is similar to Russia (because of their Let A = {a,b,c,...} be the domain of objects political affinity); but Jamaica and Russia are (or stimuli) under study. Assume that each not similar at all. object in A is represented by a set of features This example shows that similarity, as one or attributes, and let A,B,C denote the sets of might expect, is not transitive. In addition, it features associated with the objects a,b,c, re- suggests that the perceived distance of Jamaica spectively. The features may correspond to to Russia exceeds the perceived distance of components such as eyes or mouth; they may Jamaica to Cuba, plus that of Cuba to Russia represent concrete properties such as size or —contrary to the triangle inequality. Although color; and they may reflect abstract attributes such examples do not necessarily refute the such as or complexity. The character- triangle inequality, they indicate that it should ization of stimuli as feature sets has been not be accepted as a cornerstone of similarity employed in the analysis of many cognitive models. processes such as speech (Jakobson, It should be noted that the metric axioms, Fant, & Halle, 1961), pattern recognition by themselves, are very weak. They are satis- (Neisser, 1967), perceptual learning (Gibson, fied, for example, by letting 5 (a,b) = 0 if a = b, 1969), preferential choice (Tversky, 1972), and and 5(a,b) = 1 if a j^ b. To specify the dis- semantic judgment (Smith, Shoben, & Rips, tance function, additional assumptions are 1974). made (e.g., intradimensional subtractivity and Two preliminary comments regarding fea- interdimensional additivity) relating the di- ture representations are in order. First, it is mensional structure of the objects to their important to note that our total data base metric distances. For an axiomatic analysis concerning a particular object (e.g., a person, and a critical discussion of these assumptions, a country, or a piece of furniture) is generally see Beals, Krantz, and Tversky (1968), Krantz rich in content and in form. It in- and Tversky (1975), and Tversky and Krantz cludes appearance, function, relation to other (1970). objects, and any other of the object In conclusion, it appears that despite many that can be deduced from our general knowl- fruitful applications (see e.g., Carroll & Wish, edge of the world. When faced with a particular 1974; Shepard, 1974), the geometric approach task (e.g., identification or similarity assess- to the analysis of similarity faces several ment) we extract and compile from our data difficulties. The applicability of the dimen- base a limited list of relevant features on the sional assumption is limited, and the metric basis of which we perform the required task. axioms are questionable. Specifically, minimal- Thus, the representation of an object as a col- 330 AMOS TVERSKY

not to a. A schematic illustration of these components is presented in Figure 1. A-B 2. Monotonicity: APIB s(a,b) > s(a,c) whenever , A-BCA-C, B-A and Figure 1. A graphical illustration of the relation between B - A C C - A. two feature sets. Moreover, the inequality is strict whenever either inclusion is proper. lection of features is viewed as a product of a That is, similarity increases with addition prior process of extraction and compilation. of common features and/or deletion of distinc- Second, the term, feature usually denotes the tive features (i.e., features that belong to one value of a binary variable (e.g., voiced vs. object but not to the other). The monotonicity voiceless consonants) or the value of a nominal axiom can be readily illustrated with block variable (e.g., eye color). Feature representa- letters if we identify their features with the tions, however, are not restricted to binary or component (straight) lines. Under this as- nominal variables; they are also applicable to sumption, E should be more similar to F than ordinal or cardinal variables (i.e., dimensions). to I because E and F have more common A series of tones that differ only in loudness, features than E and I. Furthermore, I should for example, could be represented as a sequence be more similar to F than to E because I and of nested sets where the feature set associated F have fewer distinctive features than I and E. with each tone is included in the feature sets Any function F satisfying Assumptions 1 associated with louder tones. Such a represen- and 2 is called a matching function. It measures tation is isomorphic to a directional unidimen- the degree to which two objects—viewed as sional structure. A nondirectional unidimen- sets of features—match each other. In the sional structure (e.g., a series of tones that present theory, the assessment of similarity is differ only in pitch) could be represented by a described as a feature-matching process. It is chain of overlapping sets. The set-theoretical formulated, therefore, in terms of the set- representation of qualitative and quantitative theoretical notion of a matching function dimensions has been investigated by Restle rather than in terms of the geometric concept (1959). of distance. Let s(a,b) be a measure of the similarity of In order to determine the functional form a to b denned for all distinct a, b in A. The of the matching function, additional assump- scale s is treated as an ordinal measure of tions about the similarity ordering are intro- similarity. That is, s(a,b) > s(c,d) means that duced. The major assumption of the theory a is more similar to b than c is to d. The (independence) is presented next; the remain- present theory is based on the following ing assumptions and the proof of the represen- assumptions. tation theorem are presented in the Appendix. Readers who are less interested in formal 1. Matching: theory can skim or skip the following para- graphs up to the discussion of the representa- s(a,b) = F(AH B, A - B, B - A). tion theorem. The similarity of a to b is expressed as a Let $ denote the set of all features associated function F of three arguments: AHB, the with the objects of A, and let X,Y,Z,... etc. features that are common to both a and b; denote collections of features (i.e., subsets of A — B, the features that belong to a but not $). The expression F(X,Y,Z) is defined when- to b; B — A, the features that belong to b but ever there exists a, b in A such that A C\ B = X, FEATURES OF SIMILARITY 331

A — B = Y, and B - A = Z, whence s(a,b) = F(A H B, A - B, B - A) = F(X,Y,Z). Next, define V c~ W if one or more of the following hold for some X,Y,Z: F(V,Y,Z) = F(W,Y,Z), F(X,V,Z) = F(X,W,Z), F(X,Y,V) = F(X,Y,W). The pairs (a,b) and (c,d) are said to agree on one, two, or three components, respec- tively, whenever one, two, or three of the following hold: (A C\ B) ~ (C C\ D), (A - B) ~(C-D),(B-A)~(D-C).

3. Independence: Suppose the pairs (a,b) and (c,d), as well as the pairs (a',b') and (c',d'), agree on the same two components, while the c d c' d' pairs (a,b) and (a',b'), as well as the pairs Figure 2. An illustration of independence. (c,d) and (c',d'), agree on the remaining (third) component. Then It should be emphasized that any test of the s(a,b) > s(a',b') iff s(c,d) > s(c',d'). axioms presupposes an interpretation of the To illustrate the force of the independence features. The independence axiom, for example, axiom consider the stimuli presented in Figure may hold in one interpretation and fail in 2, where another. Experimental tests of the axioms, therefore, test jointly the adequacy of the in- \ B = C n D = round profile = X, terpretation of the features and the empirical validity of the assumptions. Furthermore, the A' H B' = C' H D' = sharp profile = X', above examples should not be taken to mean A— B = C — D = smiling mouth = Y, that stimuli (e.g., block letters, schematic A' - B' = C' - D' = frowning mouth = Y', faces) can be properly characterized in terms of their components. To achieve an adequate B-A=B'-A' = straight eyebrow = Z, feature representation of visual forms, more D-C = D'-C' = curved eyebrow = Z'. global properties (e.g., symmetry, connected- ness) should also be introduced. For an inter- By independence, therefore, esting discussion of this problem, in the best tradition of , see Goldmeier s(a,b) = F(AP\ B, A - B, B - A) (1972; originally published in 1936). = F(X,Y,Z) > F(X',Y',Z) In addition to matching (1), monotonicity = F(A'nB', A'-B', B'- A') (2), and independence (3), we also assume = s(a',b') solvability (4), and invariance (5). Solvability requires that the feature space under study be if and only if sufficiently rich that certain (similarity) equa- tions can be solved. Invariance ensures that s(c,d) = F(C n D, C - D, D - C) the equivalence of intervals is preserved across = F(X,Y,Z') > F(X',Y',Z') factors. A rigorous formulation of these as- sumptions is given in the Appendix, along with = F(C'C\ D', C' - D', D' - C') a proof of the following result. Representation theorem. Suppose Assump- tions 1, 2, 3, 4, and 5 hold. Then there exist a Thus, the ordering of the joint effect of any similarity scale S and a nonnegative scale f two components (e.g., X,Y vs. X',Y') is inde- such that for all a,b,c,d in A, pendent of the fixed level of the third factor (e.g.,ZorZ'). (i). S(a,b) > S(c,d) iff s(a,b) > s(c,d); 332 AMOS TVERSKY

(ii). S(a,b) = 0f(AH B) - «f(A - B) mational content. The manner in which the — /8f(B — A), for some 0,«,/8 > 0; scale f and the parameters (0,a,/3) depend on the context and the task are discussed in the (iii). f and S are interval scales. following sections. Let us recapitulate what is assumed and The theorem shows that under Assumptions what is proven in the representation theorem. 1-5, there exists an interval similarity scale S We begin with a set of objects, described as that preserves the observed similarity order collections of features, and a similarity order- and expresses similarity as a linear combina- ing which is assumed to satisfy the axioms of tion, or a contrast, of the measures of the the present theory. From these assumptions, common and the distinctive features. Hence, we derive a measure f on the feature space and the representation is called the contrast model. prove that the similarity ordering of object In parts of the following development we also pairs coincides with the ordering of their con- assume that f satisfies feature additivity. That trasts, denned as linear combinations of the is, f (X U Y) = f (X) + f (Y) whenever X and 1 respective common and distinctive features. Y are disjoint, and all three terms are defined . Thus, the measure f and the contrast model Note that the contrast model does not define are derived from qualitative axioms regarding a single similarity scale, but rather a family of the similarity of objects. scales characterized by different values of the The nature of this result may be illuminated parameters 0, a, and /9. For example, if 0 = 1 by an analogy to the classical theory of deci- and a and 0 vanish, then S(a,b) = f (A H B); sion under (von Neumann & Morgenstern, that is, the similarity between objects is the 1947). In that theory, one starts with a set of measure of their common features. If, on the prospects, characterized as probability dis- other hand, a = /3 = 1 and 0 vanishes then tributions over some consequence space, and -S(a,b) = f (A - B) + f (B - A); that is, the a order that is assumed to satisfy dissimilarity between objects is the measure the axioms of the theory. From these assump- of the symmetric difference between the respec- tions one derives a scale on the conse- tive feature sets. Restle (1961) has proposed quence space and proves that the preference these forms as models of similarity and psycho- order between prospects coincides with the logical distance, respectively. Note that in the order of their expected . Thus, the former model (6 = 1, a = ft = 0), similarity utility scale and the expectation principle are between objects is determined only by their common features, whereas in the latter model derived from qualitative assumptions about (0 = 0, a = ft = 1), it is determined by their preferences. The present theory of similarity distinctive features only. The contrast model differs from the expected-utility model in that expresses similarity between objects as a the characterization of objects as feature sets weighted difference of the measures of their is perhaps more problematic than the char- common and distinctive features, thereby al- acterization of uncertain options as probability lowing for a variety of similarity relations over distributions. Furthermore, the axioms of util- the same domain. ity theory are proposed as (normative) prin- The major constructs of the present theory ciples of rational behavior, whereas the axioms are the contrast rule for the assessment of of the present theory are intended to be de- similarity, and the scale f, which reflects the scriptive rather than prescriptive. salience or prominence of the various features. The contrast model is perhaps the simplest Thus, f measures the contribution of any par- ticular (common or distinctive) feature to the form of a matching function, yet it is not the similarity between objects. The scale value only form worthy of investigation. Another f(A) associated with stimulus a is regarded, therefore, as a measure of the overall salience 'To derive feature additivity from qualitative as- of that stimulus. The factors that contribute sumptions, we must assume the axioms of an extensive to the salience of a stimulus include intensity, structure and the compatibility of the extensive and the frequency, familiarity, good form, and infor- conjoint scales; see Krantz et al. (1971, Section 10.7). FEATURES OF SIMILARITY 333 matching function of interest is the ratio model, Hence, s(a,b) = s(b,a) if either a = /8, or _ f(A - B) = f(B - A), which implies f (A) = .,- ( nB)+af(A-B)+^f(B-A)' f(B), provided feature additivity holds. Thus, f A symmetry holds whenever the objects are equal «,/3>0, in measure (f(A) = f(B)) or the task is non- where similarity is normalized so that S lies directional (a = /3). To interpret the latter between 0 and 1. The ratio model generalizes condition, compare the following two forms: several set-theoretical models of similarity proposed in the literature. If a = ft = 1, S (a,b) (i). Assess the degree to which a and b are reduces to f (A H B)/f (A (J B) (see Gregson, similar to each other. 1975, and Sjoberg, 1972). If « = ft = i S(a,b) (ii). Assess the degree to which a is similar equals 2f(AH B)/(f(A) + f(B)) (see Eisler & tob. Ekman, 1959). If a = 1 and ft = 0, S(a,b) re- In (i), the task is formulated in a nondirectional duces to f (AH B)/f (A) (see Bush & Hosteller, fashion; hence it is expected that a = ft and 1951). The present framework, therefore, en- s(a,b) = s(b,a). In (ii), on the other hand, the compasses a wide variety of similarity models task is directional, and hence a and /3 may that differ in the form of the matching function differ and symmetry need not hold. F and in the weights assigned to its arguments. If s(a,b) is interpreted as the degree to In order to apply and test the present theory which a is similar to b, then a is the subject in any particular domain, some assumptions of the comparison and b is the referent. In about the respective feature structure must be such a task, one naturally focuses on the sub- made. If the features associated with each ject of the comparison. Hence, the features of object are explicitly specified, we can test the the subject are weighted more heavily than axioms of the theory directly and scale the the features of the referent (i.e., a > 0). Con- features according to the contrast model. This sequently, similarity is reduced more by the approach, however, is generally limited to distinctive features of the subject than by the stimuli (e.g., schematic faces, letters, strings distinctive features of the referent. It follows of symbols) that are constructed from a fixed readily that whenever a > ft, feature set. If the features associated with the objects under study cannot be readily speci- s(a,b) > s(b,a) iff f(B) > f(A). fied, as is often the case with natural stimuli, Thus, the focusing hypothesis (i.e., a > /3) we can still test several predictions of the implies that the direction of asymmetry is contrast model ^which involve only general determined by the relative salience of the qualitative assumptions about the feature stimuli so that the less salient stimulus is more structure of the objects. Both approaches were similar to the salient stimulus than vice versa. employed in a series of experiments conducted In particular, the variant is more similar to by Itamar Gati and the present author. The the prototype than the prototype is to the following three sections review and discuss our variant, because the prototype is generally main findings, focusing primarily on the test more salient than the variant. of qualitative predictions. A more detailed de- scription of the stimuli and the data are pre- Similarity of Countries sented in Tversky and Gati (in press). Twenty-one pairs of countries served as Asymmetry and Focus stimuli. The pairs were constructed so that one According to the present analysis, similarity element was more prominent than the other is not necessarily a symmetric relation. Indeed, (e.g., Red China-North Vietnam, USA-Mexico, Belgium-Luxemburg). To verify this relation, it follows readily (from either the contrast or 2 the ratio model) that we asked a group of 69 subjects to select in s(a,b) = s(b,a) iff of (A - B) + 0f (B - A) 2 The subjects in all out experiments were Israeli = of (B - A) + /3f (A - B) college students, ages 18-28. The material was pre- iff (a - /9)f (A - B) = (a - 0)f (B - A). sented in booklets and administered in a group setting. 334 AMOS TVERSKY

each pair the country they regarded as more participated in this study. They received the prominent. The proportion of subjects that same list of 21 pairs except that one group agreed with the a priori ordering exceeded f was asked to judge the degree to which for all pairs except one. A second group of 69 country a differed from country b, denoted subjects was asked to choose which of two d(a,b), whereas the second group was asked phrases they preferred to use: "country a is to judge the degree to which country b was similar to country b," or "country b is similar different from country a, denoted d(b,a). If to country a." In all 21 cases, most of the judgments of difference follow the contrast subjects chose the phrase in which the less model, and a > /3, then we expect the promi- prominent country served as the subject and nent stimulus p to differ from the less promi- the more prominent country as the referent. nent stimulus q more than q differs from p; For example, 66 subjects selected the phrase that is, d(p,q) > d(q,p). This hypothesis was "North Korea is similar to Red China" and tested using the same set of 21 pairs of countries only 3 selected the phrase "Red China is and the prominence ordering established earlier. similar to North Korea." These results demon- The average d(p,q), across all subjects and strate the presence of marked asymmetries in pairs, was significantly higher than the average the choice of similarity statements, whose d(q,p): t test for correlated samples yielded direction coincides with the relative promi- /(20) = 2.72, p < .01. Furthermore, the aver- nence of the stimuli. age asymmetry score, computed as above for To test for asymmetry in direct judgments each subject, was significantly positive, /(45) of similarity, we presented two groups of 77 = 2.24, p < .05. subjects each with the same list of 21 pairs of countries and asked subjects to rate their Similarity of Figures similarity on a 20-point scale. The only differ- ence between the two groups was the order of A major determinant of the salience of geo- the countries within each pair. For example, metric figures is goodness of form. Thus, a one group was asked to assess "the degree to "good figure" is likely to be more salient than which the USSR is similar to Poland," whereas a "bad figure," although the latter is generally the second group was asked to assess "the more complex. However, when two figures are degree to which Poland is similar to the roughly equivalent with respect to goodness USSR." The lists were constructed so that the of form, the more complex figure is likely to be more prominent country appeared about an more salient. To investigate these hypotheses equal number of times in the first and second and to test the asymmetry prediction, two sets positions. of eight pairs of geometric figures were con- For any pair (p,q) of stimuli, let p denote structed. In the first set, one figure in each the more prominent element, and let q denote pair (denoted p) had better form than the the less prominent element. The average other (denoted q). In the second set, the two s(q,p) was significantly higher than the aver- figures in each pair were roughly matched in age s(p,q) across all subjects and pairs: / test goodness of form, but one figure (denoted p) for correlated samples yielded <(20) = 2.92, was richer or more complex than the other p < .01. To obtain a statistical test based on (denoted q). Examples of pairs of figures from individual data, we computed for each subject each set are presented in Figure 3. a directional asymmetry score defined as the A group of 69 subjects was presented with average similarity for comparisons with a the entire list of 16 pairs of figures, where the prominent referent, that is, s(q,p), minus the two elements of each pair were displayed side average similarity for comparisons with a by side. For each pair, the subjects were asked prominent subject, s(p,q). The average differ- to indicate which of the following two state- ence was significantly positive: 2(153) = 2.99, ments they preferred to use: "The left figure p < .01. is similar to the right figure," or "The right The above study was repeated using judg- figure is similar to the left figure." The positions ments of difference instead of judgments of of the stimuli were randomized so that p and similarity. Two groups of 23 subjects each q appeared an equal number of times on the FEATURES OF SIMILARITY 335 left and on the right. The results showed that in each one of the pairs, most of the subjects selected the form "q is similar to p." Thus, the more salient stimulus was generally chosen as the referent rather than the subject of similar- ity statements. To test for asymmetry in judgments of similarity, we presented two groups of 67 sub- jects each with the same 16 pairs of figures and asked the subjects to rate (on a 20-point scale) the degree to which the figure on the left was similar to the figure on the right. The two groups received identical booklets, except that the left and right positions of the figures in Figure 3. Examples of pairs of figures used to test the each pair were reversed. The results showed prediction of asymmetry. The top two figures are ex- that the average s(q,p) across all subjects and amples of a pair (from the first set) that differs in good- ness of form. The bottom two are examples of a pair pairs was significantly higher than the average (from the second set) that differs in complexity. s(p,q). A t test for correlated samples yielded /(IS) = 2.94, p < .01. Furthermore, in both sets the average asymmetry scores, computed A total of 32 subjects participated in the as above for each subject, were significantly experiment. Each subject was tested individ- positive: In the first sett (131) = 2.96, p< .01, ually. On each trial, one letter (known in and in the second set *(131) = 2.79, p < .01. advance) served as the standard. For one half of the subjects the standard stimulus always Similarity of Letters appeared on the left, and for the other half of A common measure of similarity between the subjects the standard always appeared on stimuli is the probability of confusing them in the right. Each one of the eight letters served a recognition or an identification task: The as a standard. The trials were blocked into more similar the stimuli, the more likely they groups of 10 pairs in which the standard was are to be confused. While confusion probabili- paired once with each of the other letters and ties are often asymmetric (i.e., the probability three times with itself. Since each letter served of confusing a with b is different from the as a standard in one block, the entire design probability of confusing b with a), this effect consisted of eight blocks of 10 trials each. is typically attributed to a response bias. To Every subject was presented with three repli- eliminate this interpretation of asymmetry, cations of the entire design (i.e., 240 trials). one could employ an experimental task where The order of the blocks in each design and the the subject merely indicates whether the two order of the letters within each block were stimuli presented to him (sequentially or randomized. simultaneously) are identical or not. This pro- According to the present analysis, people cedure was employed by Yoav Cohen and the compare the variable stimulus, which serves present author in a study of confusion among the role of the subject, to the standard (i.e., block letters. the referent). The choice of standard, there- The following eight block letters served as fore, determines the directionality of the com- stimuli: F, C, l~l, D, F, E, R, B. All pairs parison. A natural partial ordering of the of letters were displayed on a cathode-ray tube, letters with respect to prominence is induced side by side, on a noisy background. The by the relation of inclusion among letters. letters were presented sequentially, each for Thus, one letter is assumed to have a larger approximately 1 msec. The right letter always measure than another if the former includes followed the left letter with an interval of 630 the latter. For example, E includes F and P msec in between. After each presentation the but not D. For all 19 pairs in which one letter subject pressed one of two keys to indicate includes the other, let p denote the more whether the two letters were identical or not. prominent letter and q denote the less promi- 336 AMOS TVERSKY nent letter. Furthermore, let s(a,b) denote the The asymmetry effect is enhanced when we percentage of times that the subject judged consider only those comparisons in which one the variable stimulus a to be the same as the signal is a proper subsequence of the other. standard b. (For example, • • is a subsequence of • • - as It follows from the contrast model, with well as of • - •) • From a total of 195 comparisons a > /3, that the proportion of "same" responses of this type, s(q,p) exceeds s(p,q) in 128 cases, should be larger when the variable is included s(p,q) exceeds s(q,p) in 55 cases, and s(q,p) in the standard than when the standard is equals s(p,q) in 12 cases, p < .001 by sign included in the variable, that is, s(q,p) > test. The average difference between s(q,p) s(p,q). This prediction was borne out by the and s(p,q) in this case is 4.7%, f (194) = 7.58, data. The average s(q,p) across all subjects p < .001. and trials was 17.1%, whereas the average A later study following the same experi- s(p,q) across all subjects and trials was 12.4%. mental paradigm with somewhat different sig- To obtain a statistical test, we computed for nals was conducted by Wish (1967). His sig- each subject the difference between s(q,p) and nals consisted of three tones separated by two s(p,q) across all trials. The difference was silent intervals, where each component (i.e., significantly positive, <(31) = 4.41, p < .001. a tone or a silence) was either short or long. These results demonstrate that the prediction Subjects were presented with all pairs of 32 of directional asymmetry derived from the signals generated in this fashion and judged contrast model applies to confusion data and whether the two members of each pair were not merely to rated similarity. the same or not. The above analysis is readily applicable to Similarity of Signals Wish's (1967) data. From a total of 386 com- parisons between signals of different length, Rothkopf (1957) presented 598 subjects with s(q,p) exceeds s(p,q) in 241 cases, s(p,q) ex- all ordered pairs of the 36 Morse Code signals ceeds s(q,p) in 117 cases, and s(q,p) equals and asked them to indicate whether the two s(p,q) in 28 cases. These data are clearly signals in each pair were the same or not. The asymmetric, p < .001 by sign test. The aver- pairs were presented in a randomized order age difference between s(q,p) and s(p,q) is without a fixed standard. Each subject judged 5.9%, which is also highly significant, <(385) about one fourth of all pairs. = 9.23, p < .001. Let s(a,b) denote the percentage of "same" In the studies of Rothkopf and Wish there responses to the ordered pair (a,b), i.e., the is no a priori way to determine the directional- percentage of subjects that judged the first ity of the comparison, or equivalently to iden- signal a to be the same as the second signal b. tify the subject and the referent. However, if Note that a and b refer here to the first and we accept the focusing hypothesis (a > ft} and second signal, and not to the variable and the the assumption that longer signals are more standard as in the previous section. Obviously, prominent than shorter ones, then the direc- Morse Code signals are partially ordered ac- tion of the observed asymmetry indicates that cording to temporal length. For any pair of the first signal serves as the subject that is signals that differ in temporal length, let p and compared with the second signal that serves q denote, respectively, the longer and shorter the role of the referent. Hence, the direc- element of the pair. tionality of the comparison is determined, ac- From the total of 555 comparisons between cording to the present analysis, from the signals of different length, reported in Rothkopf prominence ordering of the stimuli and the (1957), s(q,p) exceeds s(p,q) in 336 cases, observed direction of asymmetry. s(p,q) exceeds s(q,p) in 181 cases, and s(q,p) equals s(p,q) in 38 cases, p < .001, by sign Rosch's Data test. The average difference between s(q,p) and s(p,q) across all pairs is 3.3%, which is Rosch (1973, 1975) has articulated and sup- also highly significant. A t test for correlated ported the view that perceptual and semantic samples yields <(554) = 9.17, p < .001. categories are naturally formed and denned in FEATURES OF SIMILARITY 337 terms of focal points, or prototypes. Because distance t from a is interpreted as saying that of the special role of prototypes in the forma- the (perceived) distance from b to a equals t. tion of categories, she hypothesized that (i) Rosch (1975) attributed the observed asym- in sentence frames involving hedges such as metry to the special role of distinct prototypes "a is essentially b," focal stimuli (i.e., proto- (e.g., a perfect square or a pure red) in the types) appear in the second position; and (ii) processing of information. In the present the perceived distance from the prototype to theory, on the other hand, asymmetry is the variant is greater than the perceived dis- explained by the relative salience of the tance from the variant to the prototype. To stimuli. Consequently, it implies asymmetry test these hypotheses, Rosch (1975) used three for pairs that do not include the prototype stimulus domains: color, line orientation, and (e.g., two levels of distortion of the same number. Prototypical colors were focal (e.g., form). If the concept of prototypicality, how- pure red), while the variants were either non- ever, is interpreted in a relative sense (i.e., a focal (e.g., off-red) or less saturated. Vertical, is more prototypical than b) rather than in an horizontal, and diagonal lines served as proto- absolute sense, then the two interpretations of types for line orientation, and lines of other asymmetry practically coincide. angles served as variants. Multiples of 10 (e.g., 10, 50, 100) were taken as prototypical Discussion numbers, and other numbers (e.g., 11, 52, 103) were treated as variants. The conjunction of the contrast model and Hypothesis (i) was strongly confirmed in all the focusing hypothesis implies the presence three domains. When presented with sentence of asymmetric similarities. This prediction was frames such as " is virtually . ," sub- confirmed in several experiments of perceptual jects generally placed the prototype in the and conceptual similarity using both judg- second blank and the variant in the first. For mental methods (e.g., rating) and behavioral instance, subjects preferred the sentence "103 methods (e.g., choice). is virtually 100" to the sentence "100 is virtu- The asymmetries discussed in the previous ally 103." To test hypothesis (ii), one stimulus section were observed in comparative tasks in (the standard) was placed at the origin of a which the subject compares two given stimuli semicircular board, and the subject was in- to determine their similarity. Asymmetries structed to place the second (variable) stimulus were also observed in production tasks in which on the board so as "to represent his feeling of the subject is given a single stimulus and asked the distance between that stimulus and the to produce the most similar response. Studies one fixed at the origin." As hypothesized, the of pattern recognition, stimulus identification, measured distance between stimuli was signifi- and word association are all examples of pro- cantly smaller when the prototype, rather than duction tasks. A common pattern observed in the variant, was fixed at the origin, in each of such studies is that the more salient object the three domains. occurs more often as a response to the less If focal stimuli are more salient than non- salient object than vice versa. For example, focal stimuli, then Rosch's findings support "tiger" is a more likely associate to "leopard" the present analysis. The hedging sentences than "leopard" is to "tiger." Similarly, Garner (e.g., "a is roughly b") can be regarded as a (1974) instructed subjects to select from a particular type of similarity statements. In- given set of dot patterns one that is similar— deed, the hedges data are in perfect agreement but not identical—to a given pattern. His re- with the choice of similarity statements. Fur- sults show that "good" patterns are usually chosen as responses to "bad" patterns and not thermore, the observed asymmetry in distance conversely. placement follows from the present analysis of This asymmetry in production tasks has asymmetry and the natural assumptions that commonly been attributed to the differential the standard and the variable serve, respec- availability of responses. Thus, "tiger" is a tively, as referent and subject in the distance- more likely associate to "leopard" than vice placement task. Thus, the placement of b at versa, because "tiger" is more common and 338 AMOS TVERSKY

hence a more available response than "leop- tween all 66 pairs of vehicles on a scale from ard." This account is probably more applicable 1 (no similarity) to 20 (maximal similarity). to situations where the subject must actually Following Rosch and Mervis (1975), we in- produce the response (as in word association structed a second group of 40 subjects to list or pattern recognition) than to situations the characteristic features of each one of the where the subject merely selects a response vehicles. Subjects were given 70 sec to list the from some specified set (as in Garner's task). features that characterized each vehicle. Dif- Without questioning the importance of re- ferent orders of presentation were used for sponse availability, the present theory suggests different subjects. another reason for the asymmetry observed in The number of features per vehicle ranged production tasks. Consider the following trans- from 71 for airplane to 21 for sled. Altogether, lation of a production task to a question-and- 324 features were listed by the subjects, of answer scheme. Question: What is a like? which 224 were unique and 100 were shared Answer: a is like b. If this interpretation is by two or more vehicles. For every pair of valid and the given object a serves as a subject vehicles we counted the number of features rather than as a referent, then the observed that were attributed to both (by at least one asymmetry of production follows from the subject), and the number of features that were present theoretical analysis, since s(a,b) > attributed to one vehicle but not to the other. s(b,a) whenever f (B) > f (A). The frequency of subjects that listed each In summary, it appears that proximity data common or distinctive feature was computed. from both comparative and production tasks In order to predict the similarity between reveal significant and systematic asymmetries vehicles from the listed features, the measures whose direction is determined by the relative of their common and distinctive features must salience of the stimuli. Nevertheless, the sym- be defined. The simplest measure is obtained metry assumption should not be rejected al- by counting the number of common and dis- together. It seems to hold in many contexts, tinctive features produced by the subjects. and it serves as a useful approximation in The product-moment correlation between the many others. It cannot be accepted, however, (average) similarity of objects and the number as a principle of psychological of their common features was .68. The cor- similarity. relation between the similarity of objects and the number of their distinctive features was Common and Distinctive Features — .36. The multiple correlation between simi- larity and the numbers of common and dis- In the present theory, the similarity of tinctive features (i.e., the correlation between objects is expressed as a linear combination, similarity and the contrast model) was .72. or a contrast, of the measures of their common The counting measure assigns equal weight and distinctive features. This section investi- to all features regardless of their frequency of gates the relative impact of these components mention. To take this factor into account, let and their effect on the relation between the Xa denote the proportion of subjects who at- assessments of similarity and difference. The tributed feature X to object a, and let NX de- discussion concerns only symmetric tasks, note the number of objects that share feature where a = /3, and hence s(a,b) = s(b,a). X. For any a,b, define the measure of their common features by f(A f~\ B) = SXaXb/Nx, Eiicitation of Features where the summation is over all X in kt~\ B, and the measure of their distinctive features The first study employs the contrast model by to predict the similarity between objects from f (A - B) + f (B - A) = SY + SZ features that were produced by the subjects. a b The following 12 vehicles served as stimuli: where the summations range over all YeA — B bus, car, truck, motorcycle, train, airplane, and ZeB — A, that is, the distinctive features bicycle, boat, elevator, cart, raft, sled. One of a and b, respectively. The correlation group of 48 subjects rated the similarity be- between similarity and the above measure FEATURES OF SIMILARITY 339

of the common features was .84; the corre- moment correlation between the ratings was lation between similarity and the above again —.98. This inverse relation between measure of the distinctive features was —.64, similarity and difference, however, does not The multiple correlation between similarity always hold. and the measures of the common and the dis- Naturally, an increase in the measure of the tinctive features was .87. common features increases similarity and de- Note that the above methods for defining the creases difference, whereas an increase in the measure f were based solely on the elicited measure of the distinctive features decreases features and did not utilize the similarity data similarity and increases difference. However, at all. Under these conditions, a perfect cor- the relative weight assigned to the common relation between the two should not be ex- and the distinctive features may differ in the pected because the weights associated with two tasks. In the assessment of similarity be- the features are not optimal for the prediction tween objects the subject may attend more to of similarity. A given feature may be fre- their common features, whereas in the assess- quently mentioned because it is easily labeled ment of difference between objects the subject or recalled, although it does not have a great may attend more to their distinctive features. impact on similarity, and vice versa. Indeed, Thus, the relative weight of the common when the features were scaled using the addi- features will be greater in the former task than tive tree procedure (Sattath & Tversky, in in the latter task. press) in which the measure of the features is Let d(a,b) denote the perceived difference derived from the similarities between the between a and b. Suppose d satisfies the objects, the correlation between the data and axioms of the present theory with the reverse the model reached .94. inequality in the monotonicity axiom, that is, The results of this study indicate that (i) d(a,b) < d(a,c) whenever A Pi B D AP\ C, it is possible to elicit from subjects detailed A-BCA-C, andB-ACC-A. Fur- features of semantic stimuli such as vehicles thermore, suppose s also satisfies the present (see Rosch & Mervis, 1975); (ii) the listed theory and assume (for simplicity) that both features can be used to predict similarity ac- d and s are symmetric. According to the cording to the contrast model with a reason- representation theorem, therefore, there exist able degree of success; and (iii) the prediction a nonnegative scale f and nonnegative con- of similarity is improved when frequency of stants 6 and X such that for all a,b,c,e, mention and not merely the number of fea- tures is taken into account. s(a,b) > s(c,e) iff 0f(AP» B) - f(A - B) - f(B - A) > Similarity versus Difference 0f (C HE)- f (C - E) - f (E - C), and It has been generally assumed that judg- ments of similarity and difference are comple- d(a,b) > d(c,e) iff mentary; that is, judged difference is a linear f (A - B) + f (B - A) - Xf (A n B) > function of judged similarity with a slope of f (C - E) + f (E - C) - Xf (C n E). — 1. This hypothesis has been confirmed in several studies. For example, Hosman and The weights associated with the distinctive Kuennapas (1972) obtained independent judg- features can be set equal to 1 in the symmetric ments of similarity and difference for all pairs case with no loss of generality. Hence, d and X of lowercase letters on a scale from 0 to 100. reflect the relative weight of the common fea- The product-moment correlation between the tures in the assessment of similarity and dif- judgments was —.98, and the slope of the ference, respectively. regression line was —.91. We also collected Note that if 6 is very large then the similarity judgments of similarity and difference for 21 ordering is essentially determined by the pairs of countries using a 20-point rating scale. common features. On the other hand, if X is The sum of the two judgments for each pair very small, then the difference ordering is was quite close to 20 in all cases. The product- determined primarily by the distinctive fea- 340 AMOS TVERSKY

tures. Consequently, both s(a,b) > s(c,e) and more different from each other than Ceylon d(a,b) > d(c,e) may be obtained whenever and Nepal. These data demonstrate how the relative weight of the common and the dis- f(Ar\B) > f(Cr\E) tinctive features varies with the task and sup- and port the hypothesis that people attend more f (A - B) + f (B - A) > to the common features in judgments of simi- f (C - E) + f (E - C). larity than in judgments of difference.

That is, if the common features are weighed Similarity in Context more heavily in judgments of similarity than in judgments of difference, then a pair of Like other judgments, similarity depends on objects with many common and many dis- context and frame of reference. Sometimes the tinctive features may be perceived as both relevant frame of reference is specified explic- more similar and more different than another itly, as in the questions, "How similar are pair of objects with fewer common and fewer English and French with respect to sound?" distinctive features. "What is the similarity of a pear and an apple To test this hypothesis, 20 sets of four with respect to taste ?" In general, however, the countries were constructed on the basis of a relevant feature space is not specified explic- pilot test. Each set included two pairs of itly but rather inferred from the general countries: a prominent pair and a nonpromi- context. nent pair. The prominent pairs consisted of When subjects are asked to assess the simi- countries that were well known to our subjects larity between the USA and the USSR, for (e.g., USA-USSR, Red China-Japan). The instance, they usually assume that the relevant nonprominent pairs consisted of countries that context is the set of countries and that the were known to the subjects, but not as well as relevant frame of reference includes all politi- the prominent ones (e.g., Tunis-Morocco, cal, geographical, and cultural features. The Paraguay-Ecuador). All subjects were pre- relative weights assigned to these features, of sented with the same 20 sets. One group of course, may differ for different people. With 30 subjects selected between the two pairs in natural, integral stimuli such as countries, each set the pair of countries that were more people, colors, and sounds, there is relatively similar. Another group of 30 subjects selected little ambiguity regarding the relevant feature between the two pairs in each set the pair of space. However, with artificial, separable countries that were more different. stimuli, such as figures varying in color and Let II8 and Hd denote, respectively, the per- shape, or lines varying in length and orienta- centage of choices where the prominent pair tion, subjects sometimes experience difficulty of countries was selected as more similar or as in evaluating overall similarity and occasion- more different. If similarity and difference are ally tend to evaluate similarity with respect to complementary (i.e., 0 = X), then ns + lid one factor or the other (Shepard, 1964) or should equal 100 for all pairs. On the other change the relative weights of attributes with hand, if 6 > X, then HB + Ed should exceed a change in context (Torgerson, 1965). 100. The average value of ns + nd, across all In the present theory, changes in context or sets, was 113.5, which is significantly greater frame of reference correspond to changes in than 100, *(59) = 3.27, p < .01. the measure of the feature space. When asked Moreover, on the average, the prominent to assess the political similarity between coun- pairs were selected more frequently than the tries, for example, the subject presumably nonprominent pairs in both the similarity and attends to the political aspects of the countries the difference tasks. For example, 67% of the and ignores, or assigns a weight of zero to, subjects in the similarity group selected West all other features. In addition to such restric- Germany and East Germany as more similar tions of the feature space induced by explicit to each other than Ceylon and Nepal, while or implicit instructions, the salience of features 70% of the subjects in the difference group and hence the similarity of objects are also selected West Germany and East Germany as influenced by the effective context (i.e., the FEATURES OF SIMILARITY 341

Set 1

44% 42%

Set 2

12% 80%

Figure 4. Two sets of schematic faces used to test the diagnosticity hypothesis. The percentage of sub- jects who selected each face (as most similar to the target) is presented below the face. 342 AMOS TVERSKY

set of objects under consideration). To under- Set 1 was c and p (smiling faces) versus a and stand this process, let us examine the factors b (nonsmiling faces). The most common parti- that determine the salience of a feature and tion of Set 2 was b and q (frowning faces) its contribution to the similarity of objects. versus a and c (nonfrowning faces). Thus, the replacement of p by q changed the grouping The Diagnosticity Principle of a: In Set 1 a was paired with b, while in Set 2 a was paired with c. The salience (or the measure) of a feature is According to the above analysis, smiling has determined by two types of factors: intensive a greater diagnostic value in Set 1 than in and diagnostic. The former refers to factors Set 2, whereas frowning has a greater diagnos- that increase intensity or signal-to-noise ratio, tic value in Set 2 than in Set 1. By the diagnos- such as the brightness of a light, the loudness ticity hypothesis, therefore, similarity should of a tone, the saturation of a color, the size follow the grouping. That is, the similarity of of a letter, the frequency of an item, the a (which has a neutral expression) to b (which clarity of a picture, or the vividness of an is frowning) should be greater in Set 1, where image. The diagnostic factors refer to the they are grouped together, than in Set 2, classificatory significance of features, that is, where they are grouped separately. Likewise, the importance or prevalence of the classifica- the similarity of a to c (which is smiling) tions that are based on these features. Unlike should be greater in Set 2, where they are the intensive factors, the diagnostic factors grouped together, than in Set 1, where they are highly sensitive to the particular object are not. set under study. For example, the feature To test this prediction, two different groups "real" has no diagnostic value in the set of of 50 subjects were presented with Sets 1 and actual animals since it is shared by all actual 2 (in the form displayed in Figure 4) and animals and hence cannot be used to classify asked to select one of the three faces below them. This feature, however, acquires con- (called the choice set) that was most similar siderable diagnostic value if the object set is to the face on the top (called the target). extended to include legendary animals, such The percentage of subjects who selected each as a centaur, a mermaid, or a phoenix. of the three elements of the choice set is When faced with a set of objects, people presented below the face. The results con- often sort them into clusters to reduce infor- firmed the diagnosticity hypothesis: b was mation load and facilitate further processing. chosen more frequently in Set 1 than in Set 2, Clusters are typically selected so as to maxi- whereas c was chosen more frequently in Set mize the similarity of objects within a cluster 2 than in Set 1. Both differences are statisti- and the dissimilarity of objects from different cally significant, p < .01. Moreover, the re- clusters. Hence, the addition and/or deletion placement of p by q actually reversed the of objects can alter the clustering of the re- similarity ordering: In Set 1, b is more similar maining objects. A change of clusters, in turn, to a than c, whereas in Set 2, c is more similar is expected to increase the diagnostic value of to a than b. features on which the new clusters are based, A more extensive test of the diagnosticity and therefore, the similarity of objects that hypothesis was conducted using semantic share these features. This relation between rather than visual stimuli. The experimental similarity and grouping—called the diagnosti- design was essentially the same, except that city hypothesis—is best explained in terms of a countries served as stimuli instead of faces. concrete example. Consider the two sets of Twenty pairs of matched sets of four countries four schematic faces (displayed in Figure 4), of the form {a,b,c,p} and {a,b,c,q} were con- which differ in only one of their elements structed. An example of two matched sets is (p and q). presented in Figure 5. The four faces of each set were displayed in Note that the two matched sets (1 and 2) a row and presented to a different group of 25 differ only by one element (p and q). The subjects who were instructed to partition them sets were constructed so that a (in this case into two pairs. The most frequent partition of Austria) is likely to be grouped with b (e.g., FEATURES OF SIMILARITY 343

Sweden) in Set 1, and with c (e.g., Hungary) a in Set 2. To validate this assumption, we pre- Austria sented two groups of 25 subjects with all sets Setl b P c of four countries and asked them to partition Sweden Poland Hungary each quadruple into two pairs. Each group 49% 15% 36% received one of the two matched quadruples, which were displayed in a row in random a order. The results confirmed our prior hypothe- Austria sis regarding the grouping of countries. In Set 2 every case but one, the replacement of p by q b q c changed the pairing of the target country in Sweden Norway Hungary the predicted direction, p < .01 by sign test. 14% 26% 60% For example, Austria was paired with Sweden Figure 5. Two sets of countries used to test the diagnos- by 60% of the subjects in Set 1, and it was ticity hypothesis. The percentage of subjects who se- paired with Hungary by 96% of the subjects lected each country (as most similar to Austria) is in Set 2. presented below the country. To test the diagnosticity hypothesis, we presented two groups of 35 subjects with 20 the odd element in the choice set is p, and so sets of four countries in the format displayed on. As in the above examples, the notation is in Figure 5. These subjects were asked to chosen so that b is generally grouped with q, select, for each quadruple, the country in the and c is generally grouped with p. The dif- choice set that was most similar to the target ferences b(p) — b(q) and c(q) — c(p), there- country. Each group received exactly one fore, reflect the effects of the odd elements, p quadruple from each pair. If the similarity and q, on the similarity of b and c to the of b to a, say, is independent of the choice set, target a. In the absence of context effects, then the proportion of subjects who chose b both differences should equal 0, while under rather than c as most similar to a should be the diagnosticity hypothesis both differences the same regardless of whether the third ele- should be positive. In Figure 5, for example, ment in the choice set is p or q. For example, b(p) - b(q) = 49 - 14 = 35, and c(q) - c(p) the proportion of subjects who select Sweden = 60 — 36 = 24. The average difference, across rather than Hungary as most similar to all pairs of quadruples, equals 9%, which is Austria should be independent of whether the significantly positive, 2(19) = 3.65, p < .01. odd element in the choice set is Norway or Several variations of the experiment did not Poland. alter the nature of the results, The diagnosti- In contrast, the diagnosticity hypothesis city hypothesis was also confirmed when (i) implies that the change in grouping, induced each choice set contained four elements, rather by the substitution of the odd element, will than three, (ii) the subjects were instructed to change the similarities in a predictable manner. rank the elements of each choice set according Recall that in Set 1 Poland was paired with to their similarity to the target, rather than Hungary, and Austria with Sweden, while in to select the most similar element, and (iii) the Set 2 Norway was paired with Sweden, and target consisted of two elements, and the sub- Austria with Hungary. Hence, the proportion jects were instructed to select one element of of subjects who select Sweden rather than the choice set that was most similar to the Hungary (as most similar to Austria) should two target elements. For further details, see be higher in Set 1 than in Set 2. This predic- Tversky and Gati (in press). tion is strongly supported by the data in Figure 5, which show that Sweden was selected The Extension Effect more frequently than Hungary in Set 1, while Hungary was selected more frequently than Recall that the diagnosticity of features is Sweden in Set 2. determined by the classifications that are based Let b(p) denote the percentage of subjects on them. Features that are shared by all the who chose country b as most similar to a when objects under consideration cannot be used to 344 AMOS TVERSKY

classify these objects and are, therefore, devoid pairs of countries from the same continent of diagnostic value. When the context is ex- sharing a common border). Unlike Sjoberg's tended by the enlargement of the object set, (Note 1) study, which extended the context some features that had been shared by all by introducing nonhomogeneous pairs, our objects in the original context may not be experiment extended the context by construct- shared by all objects in the broader context. ing heterogeneous sets composed of homogene- These features then acquire diagnostic value ous pairs. Hence, the increase of similarity and increase the similarity of the objects that with the enlargement of context, observed in share them. Thus, the similarity of a pair of the present study, cannot be explained by objects in the original context will usually be subjects' tendency to equate the average smaller than their similarity in the extended similarity for any set of assessments. context. Essentially the same account was proposed The Two Faces of Similarity and supported by Sjoberg (Note 1) in studies of similarity between animals, and between According to the present analysis, the sali- musical instruments. For example, Sjoberg ence of features has two components: intensity showed that the similarities between string and diagnosticity. The intensity of a feature instruments (banjo, violin, harp, electric guitar) is determined by perceptual and cognitive were increased when a wind instrument (clari- factors that are relatively stable across con- net) was added to this set. Since the string texts. The diagnostic value of a feature is instruments are more similar to each other determined by the prevalence of the classifica- than to the clarinet, however, the above result tions that are based on it, which change with may be attributed, in part at least, to subjects' the context. The effects of context on similar- tendency to standardize the response scale, ity, therefore, are treated as changes in the that is, to produce the same average similarity diagnostic value of features induced by the for any set of comparisons. respective changes in the grouping of the This effect can be eliminated by the use of objects. a somewhat different design, employed in the This account was supported by the experi- following study. Subjects were presented with mental finding that changes in grouping (pro- pairs of countries having a common border duced by the replacement or addition of ob- and assessed their similarity on a 20-point jects) lead to corresponding changes in the scale. Four sets of eight pairs were con- similarity of the objects. These results shed structed. Set 1 contained eight pairs of Euro- light on the dynamic interplay between simi- pean countries (e.g., Italy-Switzerland). Set larity and classification. It is generally assumed 2 contained eight pairs of American countries that classifications are determined by similari- (e.g., Brazil-Uruguay). Set 3 contained four ties among the objects. The preceding discus- pairs from Set 1 and four pairs from Set 2, sion supports the converse hypothesis: that while Set 4 contained the remaining pairs from the similarity of objects is modified by the Sets 1 and 2. Each one of the four sets was manner in which they are classified. Thus, presented to a different group of 30-36 subjects. similarity has two faces: causal and derivative. According to the diagnosticity hypothesis, It serves as a basis for the classification of the features "European" and "American" objects, but it is also influenced by the adopted have no diagnostic value in Sets 1 and 2, al- classification. The diagnosticity principle which though they both have a diagnostic value in underlies this process may provide a key to Sets 3 and 4. Consequently, the overall average the analysis of the effects of context on similarity in the heterogeneous sets (3 and 4) similarity. is expected to be higher than the overall aver- age similarity in the homogeneous sets (1 and Discussion 2). This prediction was confirmed by the data, /(IS) = 2.11, p < .05. In this section we relate the present de- In the present study all similarity assess- velopment to the representation of objects in ments involve only homogeneous pairs (i.e., terms of clusters and trees, discuss the con- FEATURES OF SIMILARITY 34S

Table 1 ADCLUS Analysis of the Similarities Among the Integers 0 Through 9 (from Shepard & A rabie, Note 2)

Rank Weight Elements of subset Interpretation of subset

1st .305 248 powers of two 2nd .288 6789 large numbers 3rd .279 369 multiples of three 4th .202 0 1 2 very small numbers 5th .202 13579 odd numbers 6th .175 1 2 3 small nonzero numbers 7th .163 567 middle numbers (largish) 8th .160 0 1 additive and multiplicative identities 9th .146 01234 smallish numbers

cepts of prototypicality and family resem- we obtain blance, and comment on the relation between similarity and metaphor. S(a,b) = W(AH B) - f(A - B) - f(B - A) = 0f (A C\ B) + 2f (A C\ B) - f (A - B) -f(B -A) - 2f(AHB) Features, Clusters, and Trees = (6 + 2)f (A n B) - f (A) - f (B) There is a well-known correspondence be- tween features or properties of objects and the classes to which the objects belong. A red since f(A) = f(B) for all a,b in A. Under the flower, for example, can be characterized as present assumptions, therefore, similarity be- having the feature "red," or as being a member tween objects is a linear function of the mea- of the class of red objects. In this manner we sure of their common features. associate with every feature in €> the class of Since f is an additive measure, f(AO B) is objects in A which possesses that feature. This expressible as the sum of the measures of all correspondence between features and classes the features that belong to both a and b. For provides a direct link between the present each subset A of A, let (A) denote the set of theory and the clustering approach to the features that are shared by all objects in A, representation of proximity data. and are not shared by any object that does not belong to A. Hence, In the contrast model, the similarity be- tween objects is expressed as a function of S(a,b) = their common and distinctive features. Rela- tions among overlapping sets are often repre- XeAHB sented in a Venn diagram (see Figure 1). How- ever, this representation becomes cumbersome AD{a,b}. when the number of objects exceeds four or five. To obtain useful graphic representations Since the summation ranges over all subsets of of the contrast model, two alternative simpli- A that include both a and b, the similarity fications are entertained. between objects can be expressed as the sum of the weights associated with all the sets that First, suppose the objects under study are include both objects. all equal in prominence, that is, f(A) = f(B) This form is essentially identical to the addi- for all a,b in A. Although this assumption is tive clustering model proposed by Shepard and not strictly valid in general, it may serve as a Arabie (Note 2). These investigators have de- reasonable approximation in certain contexts. veloped a computer program, ADCLUS, which Assuming feature additivity and symmetry, selects a relatively small collection of subsets 346 AMOS TVERSKY o

circ ular e a curved g p tailed q d

m

arched n u

long vertical

f

dotted

twisted

k angular y forked v X Figure 6. The representation of letter similarity as an additive (feature) tree. From Sattath and Tversky (in press). and assigns weight to each subset so as to studies including Shepard, Kilpatric, and maximize the proportion of (similarity) vari- Cunningham's (1975) on judgments of simi- ance accounted for by the model. Shepard and larity between the integers 0 through 9 with Arabic (Note 2) applied ADCLUS to several respect to their abstract numerical character. FEATURES OF SIMILARITY 347

A solution with 19 subsets accounted for 95% program, ADDTREE, for the construction of of the variance. The nine major subsets (with additive feature trees from similarity data and the largest weights) are displayed in Table 1 discuss its relation to other scaling methods. along with a suggested interpretation. Note It follows readily from the above discussion that all the major subsets are readily interpret- that if we assume both that the feature set $ able, and they are overlapping rather than is a tree, and that f (A) = f (B) for all a,b in A, hierarchical. then the contrast model reduces to the well- The above model expresses similarity in known hierarchical clustering scheme. Hence, terms of common features only. Alternatively, the additive clustering model (Shepard & similarity may be expressed exclusively in Arabic, Note 2), the additive similarity tree terms of distinctive features. It has been shown (Sattath & Tversky, in press), and the hier- by Sattath (Note 3) that for any symmetric archical clustering scheme (Johnson, 1967) are contrast model with an additive measure f, all special cases of the contrast model. These there exists a measure g denned on the same scaling models can thus be used to discover feature space such that the common and distinctive features of the objects under study. The present development, S(a,b) = Of (AH B) - f(A - B) - f(B - A) in turn, provides theoretical foundations for = X - g(A - B) - g(B - A) the analysis of set-theoretical methods for the for some X > 0. representation of proximities. This result allows a simple representation of dissimilarity whenever the feature space <£ is Similarity, Prototypicality, and Family a tree (i.e., whenever any three objects in A Resemblance can be labeled so that AHB = AnCC Similarity is a relation of proximity that Bn C). Figure 6 presents an example of a holds between two objects. There exist other feature tree, constructed by Sattath and proximity relations such as prototypicality and Tversky (in press) from judged similarities representativeness that hold between an object between lowercase letters, obtained by Kuen- and a class. Intuitively, an object is proto- napas and Janson (1969). The major branches typical if it exemplifies the category to which are labeled to facilitate the interpretation of it belongs. Note that the prototype is not the tree. necessarily the most typical or frequent mem- Each (horizontal) arc in the graph repre- ber of its class. Recent research has demon- sents the set of features shared by all the strated the importance of prototypicality or objects (i.e., letters) that follow from that arc, representativeness in perceptual learning (Pos- and the arc length corresponds to the measure ner & Keele, 1968; Reed, 1972), inductive of that set. The features of an object are the inference (Kahneman & Tversky, 1973), se- features of all the arcs which lead to that mantic memory (Smith, Rips, & Shoben, object, and its measure is its (horizontal) dis- 1974), and the formation of categories (Rosch tance to the root. The tree distance between & Mervis, 1975). The following discussion objects a and b is the (horizontal) length of analyzes the relations of prototypicality and the path joining them, that is, f(A — B) + family resemblance in terms of the present f(B — A). Hence, if the contrast model holds, theory of similarity. a = j8, and * is a tree, then dissimilarity (i.e., Let P(a,A) denote the (degree of) proto- — S) is expressible as tree distance. typicality of object a with respect to class A, A feature tree can also be interpreted as a with cardinality n, defined by hierarchical clustering scheme where each arc length represents the weight of the cluster P(a,A) = pn(XSf(Ar\ B) - S(f(A - B) consisting of all the objects that follow from + f (B - A))), that arc. Note that the tree in Figure 6 differs from the common hierarchical clustering tree where the summations are over all b in A. in that the branches differ in length. Sattath Thus, P(a,A) is defined as a linear combina- and Tversky (in press) describe a computer tion (i.e., a contrast) of the measures of the 348 AMOS TVERSKY

features of a that are shared with the elements their members, and by them alone. Wittgen- of A and the features of a that are not shared stein proposed that natural categories and con- with the elements of A. An element a of A is cepts are commonly characterized and under- a prototype if it maximizes P(a,A). Note that stood in terms of family resemblance, that is, a class may have more than one prototype. a network of similarity relations that link the The factor pn reflects the effect of category various members of the class. The importance size on prototypicality, and the constant X of family resemblance in the formation and determines the relative weights of the common processing of categories has been effectively and the distinctive features. If pn = 1/n, X = 6, underscored by the work of Rosch and her and a = /3 = 1, then P(a,A) = l/n2S(a,b) collaborators (Rosch, 1973; Rosch & Mervis, (i.e., the prototypicality of a with respect to 1975; Rosch, Mervis, Gray, Johnson, & Boyes- A equals the average similarity of a to all Braem, 1976). This research demonstrated members of A). However, in line with the that both natural and artificial categories are focusing hypo theses discussed earlier, it appears commonly perceived and organized in terms likely that the common features are weighted of prototypes, or focal elements, and some more heavily in judgments of prototypicality measure of proximity from the prototypes. than in judgments of similarity. Furthermore, it lent substantial support to Some evidence concerning the validity of the the claim that people structure their world in proposed measure was reported by Rosch and terms of basic semantic categories that repre- Mervis (1975). They selected 20 objects from sent an optimal level of abstraction. Chair, each one of six categories (furniture, vehicle, for example, is a basic category; furniture is fruit, weapon, vegetable, clothing) and in- too general and kitchen chair is too specific. structed subjects to list the attributes associ- Similarly, car is a basic category; vehicle is ated with each one of the objects. The proto- too general and sedan is too specific. Rosch typicality of an object was denned by the argued that the basic categories are selected number of attributes or features it shared so as to maximize family resemblance—defined with each member of the category. Hence, the in terms of cue validity. prototypicality of a with respect to A was The present development suggests the fol- denned by 2N(a,b), where N(a,b) denotes the lowing measure for family resemblance, or cate- number of attributes shared by a and b, and gory resemblance. Let A be some subset of A the summation ranges over all b in A. Clearly, with cardinality n. The category resemblance the measure of prototypicality employed by of A denoted R(A) is defined by Rosch and Mervis (1975) is a special case of R(A) = r (XSf(An B) - S(f(A - B) the proposed measure, where X is large and n f(AMB) = N(a,b). + f(B - A))), These investigators also obtained direct the summations being over all a,b in A. Hence, measures of prototypicality by instructing category resemblance is a linear combination subjects to rate each object on a 7-point scale of the measures of the common and the dis- according to the extent to which it fits the tinctive features of all pairs of objects in that "idea or image of the meaning of the category." category. The factor rn reflects the effect of The rank correlations between these ratings category size on category resemblance, and the and the above measure were quite high in all constant X determines the relative weight of categories: furniture, .88; vehicle, .92; weapon, the common and the distinctive features. If .94; fruit, .85; vegetable, .84; clothing, .91. X = 6, a = /3 = 1, and rn = 2/n(n - 1), then The rated prototypicality of an object in a SS(a,b) category, therefore, is predictable by the R(A) = number of features it shares with other mem- bers of that category. (a)' In contrast to the view that natural cate- the summation being over all a,b in A; that is, gories are definable by a conjunction of critical category resemblance equals average similarity features, Wittgenstein (1953) argued that between all members of A. Although the pro- several natural categories (e.g., a game) do posed measure of family resemblance differs not have any attribute that is shared by all from Rosch's, it nevertheless captures her FEATURES OF SIMILARITY 349

basic notion that family resemblance is highest most interesting property of metaphoric ex- for those categories which "have the most pressions is that despite their novelty and attributes common to members of the category nonliteral nature, they are usually understand- and the least attributes shared with members able and often informative. For example, the of other categories" (Rosch et al., 1976, p. statement that Mr. X resembles a bulldozer is 435). readily understood as saying that Mr. X is a The maximization of category resemblance gross, powerful person who overcomes all ob- could be used to explain the formation of stacles in getting a job done. An adequate categories. Thus, the set A rather than F is analysis of connotative meaning should account selected as a natural category whenever R(A) for man's ability to interpret metaphors with- > R(F). Equivalently, an object a is added to out specific prior learning. Since the message a category A whenever R({A Ua}) > R(A). conveyed by such expressions is often pointed The fact that the preferred (basic) categories and specific, they cannot be explained in terms are neither the most inclusive nor the most of a few generalized dimensions of connotative specific imposes certain constraints on rn. meaning, such as evaluation or potency (Os- If rn = 2/n(n - 1) then R(A) equals the good, 1962). It appears that people interpret average similarity between all members of A. similes by scanning the feature space and This index leads to the selection of minimal selecting the features of the referent that are categories because average similarity can gen- applicable to the subject (e.g., by selecting erally be increased by deleting elements. The features of the bulldozer that are applicable average similarity between sedans, for ex- to the person). The nature of this process is ample, is surely greater than the average left to be explained. similarity between cars; nevertheless, car There is a close tie between the assessment rather than sedan serves as a basic category. of similarity and the interpretation of meta- If rn = 1 then R(A) equals the sum of the phors. In judgments of similarity one assumes similarities between all members of A. This a particular feature space, or a frame of index leads to the selection of maximal cate- reference, and assesses the quality of the gories because the addition of objects increases match between the subject and the referent. total similarity, provided S is nonnegative. In the interpretation of similes, one assumes In order to explain the formation of inter- a resemblance between the subject and the mediate-level categories, therefore, category re- referent and searches for an interpretation of semblance must be a compromise between an the space that would maximize the quality of average and a sum. That is, rn must be a de- the match. The same pair of objects, therefore, creasing function of n that exceeds 2/n(n — 1). can be viewed as similar or different depending In this case, R(A) increases with category size whenever average similarity is held constant, on the choice of a frame of reference. and vice versa. Thus, a considerable increase One characteristic of good metaphors is the in the extension of a category could outweigh contrast between the prior, literal interpreta- a small reduction in average similarity. tion, and the posterior, metaphoric interpreta- Although the concepts of similarity, proto- tion. Metaphors that are too transparent are typicality, and family resemblance are inti- uninteresting; obscure metaphors are uninter- mately connected, they have not been previ- pretable. A good metaphor is like a good ously related in a formal explicit manner. The detective story. The solution should not be present development offers explications of apparent in advance to maintain the reader's similarity, prototypicality, and family resem- interest, yet it should seem plausible after the blance within a unified framework, in which fact to maintain coherence of the story. Con- they are viewed as contrasts, or linear combina- tions, of the measures of the appropriate sets sider the simile "An essay is like a fish." At of common and distinctive features. first, the statement is puzzling. An essay is not expected to be fishy, slippery, or wet. The Similes and Metaphors puzzle is resolved when we recall that (like a Similes and metaphors are essential ingredi- fish) an essay has a head and a body, and it ents of creative verbal expression. Perhaps the occasionally ends with a flip of the tail. 350 AMOS TVERSKY

Reference Notes Posner, M. I., & Keele, S. W. On the genesis of abstract ideas. Journal of , 1968, 77, 1. Sjoberg, L. A cognitive theory of similarity. Gdteborg 353-363. Psychological Reports (No. 10), 1972. Reed, S. K. Pattern recognition and categorization. 2. Shepard, R. N., & Arable, P. Additive cluster analy- , 1972, 3, 382-407. sis of similarity data. Proceedings of the U.S.-Japan Restle, F. A metric and an ordering on sets. Psycho- Seminar on Theory, Methods, and Applications of metrika, 1959, 24, 207-220. Multidimensional Scaling and Related Techniques. Restle, F. Psychology of judgment and choice. New York: San Diego, August 1975. Wiley, 1961. 3. Sattath, S. An equivalence theorem. Unpublished note, Rosch, E. On the internal structure of perceptual and Hebrew University, 1976. semantic categories. In T. E. Moore (Ed.), Cognitive development and the acquisition of language. New York: Academic Press, 1973. References Rosch, E. Cognitive reference points. Cognitive Psychol- ogy, 1975, 7, 532-547. Seals, R., Krantz, D. H., & Tversky, A. Foundations Rosch, E., & Mervis, C. B. Family resemblances: of multidimensional scaling. Psychological Review, Studies in the internal structure of categories. Cog- 1968, 75, 127-142. nitive Psychology, 1975, 7, 573-603. Bush, R. R., & Hosteller, F. A model for stimulus gen- Rosch, E., Mervis, C. B., Gray, W., Johnson, D., & eralization and discrimination. Psychological Review, Boyes-Braem, P. Basic objects in natural categories. 1951, 58, 413-423. Cognitive Psychology, 1976, 8, 382-439. Carroll, J. D., & Wish, M. Multidimensional perceptual Rothkopf, E. Z. A measure of stimulus similarity and models and measurement methods. In E. C. Carter- errors in some paired-associate learning tasks. Journal ette & M. P. Friedman (Eds.), Handbook of perception. of Experimental Psychology, 1957, 53, 94-101. New York: Academic Press, 1974. Sattath, S., & Tversky, A. Additive similarity trees. Eisler, H., & Ekman, G. A mechanism of subjective Psychomelrika, in press. similarity. Acta Psychologica, 1959, 16, 1-10. Shepard, R. N. Attention and the metric structure of the stimulus space. Journal of Mathematical Psy- Garner, W. R. The processing of information and struc- chology, 1964, /, 54-87. ture. New York: JJalsted Press, 1974. Shepard, R. N. Representation of structure in similarity Gibson, E. Principles of perceptual learning and develop- data: Problems and prospects. Psychomelrika, 1974, ment. New York: Appleton-Century-Crofts, 1969. 39, 373-421. Goldmeier, E. Similarity in visually perceived forms. Shepard, R. N., Kilpatric, D. W., & Cunningham, J. P. Psychological Issues, 1972, 8, 1-136. The internal representation of numbers. Cognitive Gregson, R. A. M. of similarity. New Psychology, 1975, 7, 82-138. York: Academic Press, 1975. Smith, E. E., Rips, L. J , & Shoben, E. J. Semantic Hosman, J., & Kuennapas, T. On the relation between memory and psychological semantics. In G. H.Bower similarity and dissimilarity estimates (Report No. (Ed.), The psychology of learning and (Vol. 354). University of Stockholm, Psychological Lab- 8). New York: Academic Press, 1974. oratories, 1972. Smith, E. E., Shoben, E. J., & Rips, L. J. Structure and Jakobson, R., Fant, G. G. M., & Halle, M. Prelimi- process in semantic memory: A featural model for naries to speech analysis: The distinctive features and semantic decisions. Psyctiological Review, 1974, 81, their correlates. Cambridge, Mass.: MIT Press, 1961. 214-241. Johnson, S. C. Hierarchical clustering schemes. Psych- Torgerson, W. S. Multidimensional scaling of similarity. mnelrika, 1967, 32, 241-254. Psychomelrika, 1965, 30, 379-393. Kahneman, D., & Tversky, A. On the psychology of Tversky, A. Elimination by aspects: A theory of choice. prediction. Psychological Review, 1973,80, 237-251. Psychological Review, 1972, 70, 281-299. Krantz, D. H., Luce, R. D., Suppes, P., & Tversky, A. Tversky, A., & Gati, I. Studies of similarity. In E. Foundations of measurement (Vol. 1). New York: Rosch & B. Lloyd (Eds.), On the nature and principle Academic Press, 1971. of formation of categories. Hillsdale, N. J.: Erlbaum, Krantz, D. H., & Tversky, A. Similarity of rectangles: in press. An analysis of subjective dimensions. Journal of Tversky, A., & Krantz, D. H. The dimensional repre- , 1975, 12, 4-34. sentation and the metric structure of similarity data. Kuennapas, T., & Janson, A. J. Multidimensional simi- Journal of Mathematical Psychology, 1970, 7, 572-597. larity of letters. Perceptual and Motor Skills, 1969,28, von Neumann, J., & Morgenstern, O. Theory of games 3-12. and economic behavior. Princeton, N. J.: Princeton Neisser, U. Cognitive psychology. New York: Appleton- University Press, 1947. Century-Crofts, 1967. Wish, M. A model for the perception of Morse Code-like Osgood, C. E. Studies on the generality of affective signals. Human Factors, 1967, 9, 529-540. meaning systems. American , 1962, 17, Wittgenstein, L. Philosophical investigations. New York: 10-28. Macmillan, 1953. FEATURES OF SIMILARITY 351

Appendix

An Axiomatic Theory of Similarity Let A = (a,b,c, ...} be a collection of objects merely asserts that the structure under study is characterized as sets of features, and let A,B,C, sufficiently rich so that certain equations can denote the sets of features associated with be solved. The first part of Axiom 4 is analogous a,b,c, respectively. Let s(a,b) be an ordinal to the existence of a factorial structure. The measure of the similarity of a to b, denned for second part of the axiom implies that the range all distinct a,b in A. The present theory is of s is a real interval: There exist objects in A based on the following five axioms. Since the whose similarity matches any real value that first three axioms are discussed in the paper, is bounded by two similarities. The third part they are merely restated here; the remaining of Axiom 4 ensures that all arguments of F are axioms are briefly discussed. essential. Let $1, $2, and $3 be the sets of features that 1. Matching: s(a,b) = F(An B, A - B, appear, respectively, as first, second, or third B — A) where F is some real-valued function in arguments of F. (Note that $2 = <£ .) Suppose three arguments. 3 X and X' belong to $1, while Y and Y' belong 2. Monotonicity: s(a,b) > s(a,c) whenever to . Define (X.X')i^ (Y,Y')a whenever the A n B D A H C, A-BCA-C, and 2 two intervals are matched, that is, whenever B — A C C — A. Moreover, if either inclusion there exist pairs (a,b) and (a',b') of equally is proper then the inequality is strict. similar objects in A which agree on the third Let $ be the set of all features associated factor. Thus, (X,X')i^ (Y,Y') whenever with the objects of A, and let X,Y,Z, etc. de- 2 note subsets of $. The expression F(X,Y,Z) is s(a,b) = F(X,Y,Z) = F(X',Y',Z) = s(a',b'). defined whenever there exist a,b in A such that A n B = X, A - B = Y, and B - A = Z, This definition is readily extended to any other whence s(a,b) = F(X,Y,Z). Define V=~ W if pair of factors. Next, define (V,V')i=^ (W, one or more of the following hold for some W')i, i = 1,2,3 whenever (V,V')i ^ (X,X')j X.Y.Z: F(V,Y,Z) = F(W,Y,Z), F(X,V,Z) ^ (W,W')i, for some (X,X')j, j ^ i. Thus, two = F(X,W,Z), F(X,Y,V) = F(X,Y,W). The intervals on the same factor are equivalent if pairs (a,b) and (c,d) agree on one, two, or three both match the same interval on another fac- components, respectively, whenever one, two, tor. The following invariance axiom asserts or three of the following hold : (A (~\ B) that if two intervals are equivalent on one fac- ~ (C C\ D), (A - B) ~ (C - D), (B - Ator) , they are also equivalent on another factor. ~ (D - C). 5. Invariance: Suppose V,V', W,W' belong 3. Independence: Suppose the pairs (a,b) to both 4>; and $j, i,j = 1,2,3. Then and (c,d), as well as the pairs (a',b') and (c', (V,V')i^ (W,W')i iff (V,V')j^ (W,W')j. d'), agree on the same two components, while the pairs (a,b) and (a',b'), as well as the pairs Representation Theorem (c,d) and (c',d'), agree on the remaining (third) component. Then Suppose Axioms 1-5 hold. Then there exist a similarity scale S and a nonnegative scale f s(a,b) > s(a',b') iff s(c,d) > s(c',d'). such that for all a,b,c,d in A 4. Solvability: (i). S(a,b) > S(c,d) iff s(a,b) > s(c,d), (i). For all pairs (a,b), (c,d), (e,f), of objects (ii). S(a,b) = 0f(APi B) - af(A - B) - in A there exists a pair (p,q) which agrees with (9f (B - A), for some 0,a,/3 > 0. them, respectively, on the first, second, and (iii). f and S are interval scales. third component, that is, Pr\Q~AC\B, P - Q ~ C — D, andQ— P ~ F - E. While a self-contained proof of the repre- (ii). Suppose s(a,b) > t > s(c,d). Then there sentation theorem is quite long, the theorem exist e,f with s(e,f) = t, such that if (a,b) and can be readily reduced to previous results. (c,d) agree on one or two components, then Recall that i is the set of features that ap- (e,f) agrees with them on these components. pear as the ith argument of F, and let ^i = *i/ (iii). There exist pairs (a,b) and (c,d) of ob- ~, i = 1,2,3. Thus, *i is the set of equivalence jects in A that do not agree on any component. classes of $1 with respect to ^. It follows from Unlike the other axioms, solvability does not Axioms 1 and 3 that each S^i is well defined, impose constraints on the similarity order; it and it follows from Axiom 4 that * = *i X ^2 352 AMOS TVERSKY

X SFj is equivalent to the domain of F. We According to Axiom 5, the equivalence of in- wish to show that ^, ordered by F, is a three- tervals is preserved across factors. That is, for component, additive conjoint structure, in the all V,V, W,W in ^iH^j, i,j, = 1,2,3, sense of Krantz, Luce, Suppes, and Tversky (1971, Section 6.11.1). fi(V) - fi(V') = fi(W) - fi(W') iff This result, however, follows from the analy- fi(V)-fj(V) =f,(W)-f,(W). sis of decomposable similarity structures, de- veloped by Tversky and Krantz (1970). In Hence by part (i) of Theorem 6.15 of Krantz particular, the proof of part (c) of Theorem 1 in et al. (1971), there exist a scale f and constants that paper implies that, under Axioms 1, 3, and 81 such that fs(X) = 6{f (X), i = 1,2,3. Finally, 4, there exist nonnegative functions f; defined by Axiom 2, S increases in fi and decreases in on ^j, i = 1,2,3, so that for all a,b,c,d in A fa and fs. Hence, it is expressible as s(a,b) > s(c,d) iff S(a,b) > S(c,d) S(a,b) = B) - «f(A - B) where S (a, b) = fi(AH B) - 0f (B - A), + fs(A-B) - A), for some nonnegative constants 6,a,{i. and fj, f2, fa are interval scales with a common unit. Received November 17, 1976