OPERATIONS RESEARCH Vol. 64, No. 3, May–June 2016, pp. 564–584 ISSN 0030-364X (print) ISSN 1526-5463 (online) ó http://dx.doi.org/10.1287/opre.2015.1364 © 2016 INFORMS
Preferences, Homophily, and Social Learning
Ilan Lobel, Evan Sadler Stern School of Business, New York University, New York, New York 10012 {[email protected], [email protected]}
We study a sequential model of Bayesian social learning in networks in which agents have heterogeneous preferences, and neighbors tend to have similar preferences—a phenomenon known as homophily. We find that the density of network connections determines the impact of preference diversity and homophily on learning. When connections are sparse, diverse preferences are harmful to learning, and homophily may lead to substantial improvements. In contrast, in a dense network, preference diversity is beneficial. Intuitively, diverse ties introduce more independence between observations while providing less information individually. Homophilous connections individually carry more useful information, but multiple observations become redundant. Subject classifications: network/graphs; probability; games/group decisions. Area of review: Special Issue on Information and Decisions in Social and Economic Networks. History: Received June 2013; revision received June 2014; accepted December 2014. Published online in Articles in Advance April 7, 2015.
1. Introduction tributions, and the extent of homophily, influence social We rely on the actions of our friends, relatives, and neigh- learning. In this paper, we study a sequential model of social learn- bors for guidance in many decisions. This is true of both ing to elucidate how link structure and preference structure minor decisions, like where to go for dinner or what movie interact to determine learning outcomes. We adopt a partic- to watch tonight, and major, life-altering ones, such as ularly simple representation of individual decisions, assum- whether to go to college or get a job after high school. ing binary states and binary actions, to render a clear intu- This reliance may be justified because others’ decisions ition on the role of the network. Our work builds directly are informative: our friends’ choices reveal some of their on that of Acemoglu et al. (2011) and Lobel and Sadler knowledge, which potentially enables us to make better (2015), introducing two key innovations. First, we signif- decisions for ourselves. Nevertheless, in the vast majority icantly relax the assumption of homogeneous preferences, of real-life situations, we view the implicit advice of our considering a large class of preference distributions among friends with at least some skepticism. In essentially any the agents. Although all agents prefer the action matching setting in which we would expect to find social learning, the underlying state, they can differ arbitrarily in how they preferences influence choices as much as information does. weigh the risk of error in each state. Second, we allow For instance, upon observing a friend’s decision to patron- correlations between the link structure of the network and ize a particular restaurant, we learn something about the individual preferences, enabling our study of homophily. restaurant’s quality, but her decision is also influenced by Our principal finding is that network link density deter- her preference over cuisines. Similarly, a stock purchase mines how preference heterogeneity and homophily will signals both company quality and risk preferences. In each impact learning. In a relatively sparse network, diverse case, two individuals with the same information may rea- preferences are a barrier to information transmission: het- sonably choose opposite actions. erogeneity between neighbors introduces an additional The structure of our social network determines what source of signal noise. However, we find that sufficiently information we can obtain through social ties. This struc- strong homophily can ameliorate this noise, leading to ture comprises not only the pattern of links between indi- learning results that are comparable to those in networks viduals, but also patterns of preferences among neighbors. with homogeneous preferences. In a dense network, the
Downloaded from informs.org by [216.165.95.68] on 15 September 2016, at 14:56 . For personal use only, all rights reserved. Since differences in preferences impact our ability to infer situation is quite different. If preferences are sufficiently information from our friends’ choices, these preference pat- diverse, dense connectivity facilitates learning that is highly terns affect the flow of information. In real-world networks, robust, and too much homophily may lead to inefficient homophily is a widespread structural regularity: individu- herding. Homophilous connections offer more useful infor- als interact much more frequently with others who share mation, but the information provided through additional similar characteristics or preferences.1 To understand the ties quickly becomes redundant. Conversely, diverse ties effects of network structure on information transmission, provide less information individually, but there is more we must therefore consider how varying preference dis- independence between observations. Hence, the value of
564 Lobel and Sadler: Preferences, Homophily, and Social Learning Operations Research 64(3), pp. 564–584, © 2016 INFORMS 565
a homophilous connection versus a diverse connection broad, there are always some types that act on their pri- depends on the other social information available to an vate information, creating the needed independence. Goeree individual. et al. (2006) find this leads to asymptotic learning in a Our formal analysis begins with sparse networks, mean- complete network even with bounded private beliefs. The- ing networks in which there is a uniform bound on how orem 4 generalizes this insight to a broad class of net- many neighbors any agent can observe. We first present a work structures, showing that learning robustly succeeds specialized example to highlight key mechanisms underly- within a dense cluster of agents as long as two condi- ing our results. There are two distinct reasons why pref- tions are satisfied: there is sufficient diversity within the erence heterogeneity reduces the informational value of cluster, and agents in the cluster are aware of its exis- an observation. One source of inefficiency is uncertainty tence. Since homophily reduces the diversity of preferences about how our neighbors make trade-offs. Perhaps surpris- among an agent’s neighbors, it has the potential to inter- ingly, there is additional inefficiency simply by virtue of fere with learning in a dense network. Example 4 shows the opposing trade-offs agents with different preferences that if homophily is especially extreme, with agents sorting make. Consider a diner who prefers Japanese food to Ital- themselves into two isolated clusters based on preferences, ian. When choosing between restaurants, this diner might informational cascades can emerge, and learning is incom- default to a Japanese restaurant unless there is strong evi- plete. We might imagine a network of individuals with dence that a nearby Italian option is of higher quality. strongly polarized political beliefs, in which agents on each Observing this individual enter a Japanese restaurant is a side never interact with agents on the other. Interestingly, a much weaker signal of quality than observing the same small amount of bidirectional communication between the individual enter an Italian restaurant. Consequently, this two clusters is sufficient to overcome this failure. The sem- observation is unlikely to influence a person who strongly inal paper of Bikhchandani et al. (1992) emphasized the prefers Italian food. When connections are scarce, long- fragility of cascades, showing that introducing a little out- run learning depends upon individual observations carrying side information can quickly reverse them. Our result is a greater amount of information, and homophily reduces similar in spirit, showing that in the long run, the negative inefficiency from both sources. impact of homophily on learning in dense networks is also Further results show that preference heterogeneity is fragile. generically harmful in a sparse network. Theorem 1 shows We make several contributions to the study of social that we can always render asymptotic learning impossible learning and the broader literature on social influence. First, via a sufficiently extreme preference distribution. Moreover, we show clearly how preference heterogeneity has dis- we find that the improvement principle, whereby agents tinct effects on two key learning mechanisms. This in turn are guaranteed higher ex ante utility than any neighbor, helps us understand how the network affects long-run learn- breaks down with even a little preference diversity. The ing. Most of the social learning literature assumes that improvement principle is a cornerstone of earlier results all individuals in society have identical preferences, dif- in models with homogeneous preferences (Banerjee and fering only in their knowledge of the world.2 Smith and Fudenberg 2004, Acemoglu et al. 2011), but we show that Sorensen (2000) offer one exception, showing that, in a there can be no improvement principle unless strong pref- complete network, diverse preferences generically lead to erences occur much less frequently than strong signals. an equilibrium in which observed choices become unin- Homophily reduces the inefficiency from preference diver- formative. However, a key assumption behind this result sity by rescuing the improvement principle: if an agent can is that preferences are nonmonotonic in the state: differ- identify a neighbor with preferences very close to her own, ent agents may change their actions in opposite directions then she can use her signal to improve upon that neighbor’s in response to the same new information. This excludes choice. We define the notion of a strongly homophilous the case in which utilities contain a common value com- network, in which each agent can find a neighbor with arbi- ponent, which is our focus. Closer to our work, Goeree trarily similar preferences in the limit as the network grows, et al. (2006) consider a model with private and common and we find that learning in a strongly homophilous net- values in a complete network, obtaining a result that we work is comparable to that in a network with homogeneous generalize to complex dense networks. Models that situate preferences. Moreover, adding homophily to a network that learning in the context of investment decisions have also
Downloaded from informs.org by [216.165.95.68] on 15 September 2016, at 14:56 . For personal use only, all rights reserved. already satisfies the improvement principle will never dis- considered preference heterogeneity. This work finds that rupt learning. heterogeneity hinders learning and may increase the inci- In contrast, when networks are dense, preference het- dence of cascades, leading to pathological spillover effects erogeneity plays a positive role in the learning process. (Cipriani and Guarino 2008). Our innovation is the incorpo- When agents have many sources of information, the law ration preference heterogeneity and complex network struc- of large numbers enables them to learn as long as there is ture into the same model. We obtain results for general some independence between the actions of their neighbors. classes of networks, providing new insights on the under- If the support of the preference distribution is sufficiently lying learning mechanisms. Lobel and Sadler: Preferences, Homophily, and Social Learning 566 Operations Research 64(3), pp. 564–584, © 2016 INFORMS
In tandem with our study of preference diversity, we 2. Model shed light on the impact of homophily on social learning. Each agent in a countably infinite set sequentially chooses Despite its prevalence, few papers on learning have con- between two actions, 0 and 1. We index agents by the order sidered homophily. Golub and Jackson (2012) study group- of their decisions, with agent m
Downloaded from informs.org by [216.165.95.68] on 15 September 2016, at 14:56 . For personal use only, all rights reserved. ✓ É lead strong or weak ties to be most influential. is, agent n observes the value of xm for all m B4n5. The We first describe our model before analyzing learning neighborhood B4n5 is a random variable, and the2 sequence in sparse networks. Our analysis emphasizes the difficul- of neighborhood realizations describe a social network of ties that preference diversity creates and the benefits that connections between the agents. We call the probability
homophily confers. We next provide general results on qn ⇣4à 1 B4n5, xm for m B4n55 agent n’s social dense networks, finishing with a brief discussion. Most belief.= = ó 2 proofs are contained in the main body of the paper, with We represent the structure of the network as a joint dis- more technical results given in the appendix. tribution ⌘ over all possible sequences of neighborhoods Lobel and Sadler: Preferences, Homophily, and Social Learning Operations Research 64(3), pp. 564–584, © 2016 INFORMS 567
and types; we call ⌘ the network topology and assume We conclude this section with two basic lemmas required ⌘ is common knowledge. We allow correlations across in our subsequent analysis. These lemmas provide use- neighborhoods and types since this is necessary to model ful properties of belief distributions and characterize best homophily, but we assume ⌘ is independent of the state response behavior. à and the private signals. The types t share a common n Lemma 1. The private belief distributions ⌥ and ⌥ sat- marginal distribution , and the distribution has full sup- 0 1 1 isfy the following properties: port in some range 4É1É5, where 0 ∂ É ∂ ∂ É ∂ 1. The ¯ 2 ¯ (a) For all r 401 15, we have d⌥ /d⌥ 4r5 41 r5/r0 range 4É1É5 provides one measure of the diversity of pref- 2 0 1 = É ¯ (b) For all 0
01 if ⇣ ë 4à 1 In5
asymptotic learning obtains is the main focus of our anal- type. Therefore, agent n’s type tn can be interpreted as the ysis, but we also comment on learning rates to shed some minimum belief agent n must have that the true state is light on shorter term outcomes. à 1 before she will choose action x 1. = n = Lobel and Sadler: Preferences, Homophily, and Social Learning 568 Operations Research 64(3), pp. 564–584, © 2016 INFORMS
3. Sparsely Connected Networks same utility as that neighbor, so utility is strictly increasing We call a network sparse if there exists a uniform bound along a chain of agents. In our example, differing prefer- on the size of all neighborhoods; we say the network is ences mean that an agent earns less utility than her neigh- M-sparse if, with probability 1, no agent has more than M bor if she copies. The improvement component is unable neighbors. We begin our exploration of learning in sparse to make up this loss, and asymptotic learning fails. networks with an example. Although we shall model far more general network struc- tures, in which agents have multiple neighbors and are Example 1. Suppose the signal structure is such that uncertain about their neighbors’ types, this insight is impor- ⌥ 4r5 2r r 2 and ⌥ 4r5 r 2. Consider the network tant throughout this section. Learning fails if there is a 0 = É 1 = topology ⌘ in which each agent observes her immediate chance that an agent’s neighbors have sufficiently different 1 predecessor with probability 1, agent 1 has type t1 5 with preferences because opposing risk trade-offs obscure the 4 = probability 0.5 and type t with probability 0.5. Any information that is important to the agent. Homophily in 1 = 5 other agent n has type tn 1 tn 1 with probability 1. the network allows individuals to identify neighbors who = É É In this network, agents fail to asymptotically learn the are similar to themselves, and an improvement principle true state, despite having unbounded beliefs and satisfying can operate along homophilous connections. the connectivity condition from Acemoglu et al. (2011).
1 3.1. Failure Caused by Diverse Preferences Without loss of generality, suppose t1 5 . We show inductively that all agents with odd indices= err in state 0 This subsection analyzes the effects of diverse preferences 1 in the absence of homophily. We assume that all types and with probability at least 4 , and likewise agents with even 1 neighborhoods are independently distributed. indices err in state 1 with probability at least 4 . For the 1 9 3 first agent, observe that ⌥04 5 5 25 < 4 , so the base case = Assumption 2. The neighborhoods 8B4n59n and types holds. Now suppose the claim holds for all agents of index 2 8tn9n are mutually independent. less than n, and n is odd. The social belief qn is minimized 2 if x 0, taking the value Definition 1. Define the ratio RÖ t41 Ö5/4t41 Ö5 n 1 t ⌘ É É É = 41 t5Ö5. Given private belief distributions 8⌥ 9, we say + É à ⇣ ë 4xn 1 0 à 15 1/4 1 that the preference distribution is M-diverse with respect É = ó = æ 0 1 ⇣ ë 4xn 1 0 à 15 ⇣ ë 4xn 1 0 à 05 1/4 1 = 5 to beliefs if there exists Ö with 0 <Ö< 2 such that É = ó = + É = ó = + It follows from Lemma 2 that agent n will choose action 1 1 1/M Ö 1 Ö 1 Ö 1 whenever p > . We obtain the bound ⌥04Rt 5 É ⌥14Rt É 5 d 4t5 ∂ 00 n 2 0 É Ö Z ✓ ◆ 1 1 A type distribution with M-diverse preferences has ⇣ 4x 1 à 05 1 ⌥ 0 ë n = ó = æ É 0 2 = 4 at least some minimal amount of probability mass located ✓ ◆ near the endpoints of the unit interval. We could also An analogous calculation proves the inductive step for interpret this as a polarization condition, ensuring there agents with even indices. Hence, all agents err with prob- are many people in society with strongly opposed prefer- ability bounded away from zero, and asymptotic learning ences. As M increases, the condition requires that more fails. Note this failure is quite different from the classic mass is concentrated on extreme types. For any N Downloaded from informs.org by [216.165.95.68] on 15 September 2016, at 14:56 . For personal use only, all rights reserved. away from 1 across all M-sparse networks. a neighbor’s decision. We can decompose an agent’s util- ity into two components: utility obtained through copying Proof. Given the Ö in Definition 1, we show the social her neighbor and the improvement her signal allows over belief qn is contained in 6Ö11 Ö7 with probability 1 for this. The improvement component is always positive when all n, which immediately impliesÉ all of the stated results. 1 private beliefs are unbounded, creating the possibility for Proceed inductively; the case n 1 is clear with q1 2 . improvements to accumulate toward complete learning over Suppose the result holds for all=n k. Let B4k 15= B ∂ + = time. With homogeneous preferences, this is exactly what with B ∂ M, and let xB denote the random vector of ó ó B happens because copying a neighbor implies earning the observed actions. If x 801 19ó ó is the realized vector of 2 Lobel and Sadler: Preferences, Homophily, and Social Learning Operations Research 64(3), pp. 564–584, © 2016 INFORMS 569 decisions that agent k 1 observes, we can write the social ⇣ 4x i à 05 d 4t5 + · ë m = ó = belief qk 1 as 1/41 l5, where l is a likelihood ratio: 1 1 + + qn ⇣4pn ∂ Rt 5⇣ ë 4xm i à 05 d 4t5 ⇣ 4x x à 05 = 0 i 0 = ó = l ë B = ó = Z = 1 X = ⇣ ë 4xB x à 15 1 y y = ó = y⌥04Rt É 5 41 y5⌥04Rt 5 d 4t5 ⇣ ë 4xm xm à 01xi xi1i < m5 = 0 + É = ó = = 0 Z h 1 i = m B ⇣ ë 4xm xm à 11xi xi1i < m5 y 1 y 2 = ó = = y 41 y5⌥04Rt 5 y41 ⌥04Rt É 55 d 4t5 Y = + 0 É É É Conditional on a fixed realization of the social belief qm, Z 1h i y y the decision of each agent m in the product terms is inde- y 41 y5⌥04Rt 5 y⌥14R1 t5 d 4t5 = + 0 É É É pendent of the actions of the other agents. Since q m 2 Z 1h i 6Ö11 Ö7 with probability 1, we can fix the social belief of y y É y 41 y5⌥04Rt 5 y⌥14Rt 5 d 4t51 agent m at the end points of this interval to obtain bounds. = + 0 É É Z Each term of the product is bounded above by h i as desired. É 1 Ö 1/M Theorem 2. Suppose the network is 1-sparse, and let As- 0 ⌥04Rt 5 d 4t5 1 Ö 1 ∂ É 0 sumptions 1 and 2 hold. If either of the following conditions 4R1 Ö5 d 4t5 Ö R0 ⌥1 t É ✓ ◆ is met, then asymptotic learning fails. R (a) The preference distribution satisfies This in turn implies that qk 1 æ Ö. A similar calculation using a lower bound on the+ likelihood ratio shows that 4t5 qk 1 ∂ 1 Ö. É lim inf > 00 + t 0 É ! t Regardless of the signal or network structure, we can always find a preference distribution that will disrupt (b) For some K>1 we have asymptotic learning in an M-sparse network. Moreover, ⌥ 4r5 because of the uniform bound, this is a more severe fail- lim 0 c>01 and r 0 r K 1 = ure than what occurs with bounded beliefs and homo- ! É 1 K 1 t É 4K 42K 15t5 geneous preferences. Acemoglu et al. (2011) show that d 4t5< 00 É K É asymptotic learning often fails in an M-sparse network if 0 41 t5 É private beliefs are bounded, but the point at which learn- Z Proof. Each part follows from the existence of an Ö>0 ing stops depends on the network structure and may still such that 4y5 given in Lemma 3 satisfies 4y5< y for be arbitrarily close to complete learning. In this sense, the Z Z y 61 Ö115. This immediately implies that learning is challenges introduced by preference heterogeneity are more incomplete.2 É substantial than those introduced by weak signals. Assume the condition of part (a) is met. Divide by 1 y An analysis of 1-sparse networks allows a more pre- to normalize; for any Ö>0 we obtain the bound É cise characterization of when diverse preferences cause the improvement principle to fail. In the proof of our result, 1 y y y we employ the following convenient representation of the ⌥04Rt 5 ⌥14Rt 5 d 4t5 0 É 1 y improvement function under Assumptions 1 and 2. Recall Z h É i y 1 Ö 1 the notation Rt t41 y5/4t41 y5 41 t5y5. É y y ⌘ É É + É ∂ ⌥04Rt 5 d 4t5 ⌥04Rt 5 d 4t5 0 + 1 Ö Lemma 3. Suppose Assumptions 1 and 2 hold. Define Z Z É y 1 261 1 17 6 1 1 17 by y Z 2 2 ⌥14Rt 5 d 4t5 ! É 1 y y É Z 1 1 y 41 y5 4y5 y 41 y5 4Ry5 y 4Ry5 d 4t50 (1) y Z ⌥0 t ⌥1 t ∂ ⌥04R1 Ö5 4Ö5 ⌥1 É 0 = + 0 É É É + É 2 1 y Z h i ✓ ◆ É In any equilibrium ë, we have ⇣ ë 4xn à B4n5 8m95 Choosing Ö sufficiently small, the second term is negligible, = ó = = Z4⇣ 4x à55. Downloaded from informs.org by [216.165.95.68] on 15 September 2016, at 14:56 . For personal use only, all rights reserved. ë m while for large enough y, the first term approaches zero, = and the third is bounded above by a constant less than zero. Proof. Observe under Assumption 1 we have ⇣ 4x à5 ë n = Hence the improvement term in Equation (1) is negative in ⇣ ë 4xn à à i5 for each i 801 19, and =1 = ó 1 = 2 61 Ö115 for sufficiently small Ö. f4t5d 4t5 f41 t5d 4t5 for any f . Taking y 0 = 0 É = TheÉ proof of part (b) is presented in the appendix. ⇣ 4x à5, we have ⇣ 4x à B4n5 8m95 equal to É R ë m = R ë n = ó = These conditions are significantly weaker than 1- 1 1 diversity. We establish a strong connection between the ⇣ ë 4xn à B4n5 8m91 xm i1 à 01tn t5 0 i 0 = ó = = = = tail thicknesses of the type and signal distributions. The Z X= Lobel and Sadler: Preferences, Homophily, and Social Learning 570 Operations Research 64(3), pp. 564–584, © 2016 INFORMS improvement principle fails to hold unless the preference technical challenges because we cannot retain the inde- distribution has sufficiently thin tails; for instance, part (a) pendence assumptions from the last subsection; we need implies learning fails under any signal structure if types are to allow correlations to represent homophily. In a model uniformly distributed on 401 15. As the tails of the belief with homogeneous preferences, Lobel and Sadler (2015) distributions become thinner, the type distribution must highlight unique issues that arise when neighborhoods are 1 increasingly concentrate around 2 in order for learning to correlated. This creates information asymmetries leading remain possible. In many cases, little preference diversity different agents to have different beliefs about the overall is required to disrupt the improvement principle. structure of connections in the network. To avoid the com- Even if preferences are not so diverse that learning fails, plications this creates, and focus instead on issues related we might expect heterogeneity to significantly slow the to preferences and homophily, we assume the following. learning process. Interestingly, once the distribution of pref- erences is concentrated enough to allow an improvement Assumption 3. For every agent n, the neighborhood B4n5 principle, learning rates are essentially the same as with is independent of the past neighborhoods and types homogeneous preferences. Lobel et al. (2009) show in a 84tm1B4m559m Proof. See appendix. É previous assumption, the distribution of tn conditioned on the value of t is , regardless of agent m’s type. With Once an improvement principle holds, the size of the m homophily, we would expect this conditional distribution improvements, and hence the rate of learning, depends on the tails of the signal structure. Early work on learning to concentrate around the realized value of tm. We define a suggests that diverse preferences can slow learning (Vives notion of strong homophily to capture the idea that agents 1993, 1995), even if information still aggregates in the long are able to find neighbors with similar types to their own run. Perhaps surprisingly, Proposition 1 shows that as long in the limit as the network grows large. as an improvement principle still holds, asymptotic learn- To formalize this, we recall terminology introduced by Lobel and Sadler (2015). We use B4n5 to denote an exten- ing rates are identical to the homogeneous preferences case. ˆ This result suggests that the learning speed we find if pref- sion of agent n’s neighborhood, comprising agent n’s erences are not too diverse, or if we have homophily as in neighbors, her neighbors’ neighbors, and so on. the next section, is comparable to that in the homogeneous Definition 2. A network topology ⌘ features expand- preferences case. ing subnetworks if, for all positive integers K, Downloaded from informs.org by [216.165.95.68] on 15 September 2016, at 14:56 . For personal use only, all rights reserved. lim supn ⌘4 B4n5