How network structure impacts socially reinforced diffusion?
by
Jad Sassine
M.S. Applicable Mathematics London School of Economics (2013)
SUBMITTED TO THE SLOAN SCHOOL OF MANAGEMENT IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF
MASTER OF SCIENCE IN MANAGEMENT RESEARCH
at the
MASSACHUSETTS INSTITUTE OF TECHNOLOGY
MAY 2020
©2020 Massachusetts Institute of Technology. All rights reserved.
Signature of Author: ______
Department of Management April 17, 2020
Certified by: ______
Hazhir Rahmandad Associate Professor of System Dynamics Thesis Supervisor
Accepted by: ______
Catherine Tucker Sloan Distinguished Professor of Management Professor, Marketing Faculty Chair, MIT Sloan PhD Program 2
How network structure impacts socially reinforced diffusion? by
Jad Sassine
Submitted to the Sloan School of Management on April 17, 2020 in Partial Fulfillment of the Requirements for the Degree of Master of Science in Management Research
Abstract
Social scientists have long studied adoption choices that depend on the number of prior adopters. What is the effect of network structure on such adoption dynamics? The emerging consensus holds that when agents require a high reinforcement threshold for adoption, clustered networks are better conduits of social contagion than random ones. Using models with deterministic thresholds this argument formalizes the idea that transmission will get ‘stuck’ should the number of neighboring adopters fall below a threshold. In this paper, we explore the effect of stochastic thresholds on the diffusion races between random and clustered networks. We show that even low probabilities of adoption upon a single contact would tilt the balance in favor of random networks, a tendency that is reinforced with the size of the network. Moreover, if repeated signals from the same adopter can reinforce a message, random networks are further promoted. However, we also show that clustered networks can still be preferred over random networks if adopters become ‘inactive’ – i.e. they stop sending messages - with high probability. These findings refocus our theoretical understanding of how network structure moderates social influence, and raises new questions on contagion phenomena that benefit from clustered networks.
Thesis Supervisor: Hazhir Rahmandad Title: Associate Professor of System Dynamics
3
Introduction
Social influence is at the heart of social sciences. From joining an online social network, to adopting a social norm, participating in a riot or buying a novel product, many important choices are strongly influenced by the existence and the number of other adopters (Karsai, Iniguez, Kaski, & Kertész, 2014; Scott, Konstantin,
& Milan, 2016; Ugander, Backstrom, Marlow, & Kleinberg., 2012). These adoption dynamics have historically been modeled using behavioral thresholds, where a threshold represents the minimum number of social connections who should have signaled their adoption before the focal actor is convinced to adopt
(Granovetter, 1978).
In understanding social influence, a central question of theoretical and practical relevance relates to the type of network that best facilitates diffusion: Would cohesive communities promote the adoption of new social norms? What about the impact of random boundary spanning ties on diffusion of infectious diseases, or social movements? In promoting new organizational practices should intra-organizational social networks be seeded with cohesive clusters or cross-cutting relationships? Communication networks are rarely fully connected, and even neighboring adopters do not always broadcast their choices. Earlier studies highlighted the central role of random ties connecting disparate parts of social networks in promoting diffusion (Granovetter, 1973). However, threshold models identified an important value of repetition: unless located in a highly clustered network, high threshold agents may not receive sufficient reinforcement to adopt, thus breaking the diffusion dynamics in a population of high- threshold individuals.
It is this intuition that led Morris (2000), Centola & Macy (2007), and Montanari & Saberi (2010) to argue that for an important class of diffusion dynamics clustered networks dominate random networks, i.e. they lead to more or faster diffusion, because they decrease the fraction of isolated agents. From a policy perspective, this insight has implied that maintaining social cohesion increases the speed of diffusion when adoption involves risk, complementarity or normative acceptance (Centola, 2018). The emerging consensus has thus separated social diffusion into “simple” and “complex” alternatives, where the simple diffusion is
4 enhanced by more random networks and the corresponding weak ties, but clustered networks engendering significant reinforcement are needed for enabling complex contagion.
The argument in favor of clustered networks relies on the idea that transmission will get ‘stuck’ should the number of neighboring adopters fall below the adoption threshold. This argument is elegant; however, it has only been formally shown using deterministic thresholds where adoption is impossible below the threshold reinforcement. Considering (the arguably more realistic) stochastic activation functions, where transmission likelihood is non-zero for any level of reinforcement but becomes significantly more likely at a given threshold, could change the calculus. The transmission would not be stuck indefinitely for any actor, unless additional mechanisms are invoked. As a result, a tradeoff emerges between the cost of being temporarily slowed down due to low reinforcement and benefits of seeding the contagion in distant parts of a social network. The resulting tradeoff is intricately dependent on the state of diffusion. As the diffusion spreads in a random network, the frontier connecting susceptible to adopters expands creating an exponential growth dynamics not observed in clustered networks. Moreover, later in diffusion susceptible nodes in a random network become increasingly likely to be connected to more than one adopter as the network saturates. Therefore in the later stages random networks may actually facilitate complex contagion by exposing all susceptible nodes to a large level of reinforcement simultaneously. The implication of these mechanisms for the race between random and clustered networks has not been previously studied, and they may be of significant theoretical and practical relevance. If these mechanisms promote random networks significantly, we may need to revisit the dichotomy between simple and complex contagion, or seek other mechanisms that explain why clustered networks may promote diffusion under specific conditions.
We develop a general model allowing us to study this question under a variety of probability activation functions. We build the model on the principle that two people may behave differently, even if they have the same adoption threshold and reinforcement levels. In other words, the thresholds are stochastic and not deterministic. We then consider that repeated signals from the same source may accumulate and reinforce adoption. After all, repetition is a very powerful tool of persuasion and
5 experimental evidence shows that agents generally overweight repeated information (Enke &
Zimmermann, 2019). By comparing the behavior of this model on different network structures, we can identify the conditions under which clustered networks dominate random ones. Our analysis suggests those conditions are rather narrow and strongly depend both on the shape of the probability activation function and the size of the network. As prior theory had suggested, deterministic thresholds promote clustered networks, but this regularity breaks down as soon as realistically stochastic activation functions are considered. Moreover, the larger the network, the stronger are the benefits of random networks for complex contagion. Finally, if repetition can engender adoption, then random networks are further strengthened against clustered diffusion. Overall, incorporating two behaviorally realistic features of social influence, stochastic thresholds and repetition, can significantly tilt the balance of diffusion speed in favor of random networks.
We also explore how the horserace between random and clustered networks is impacted by heterogeneity in adopters’ motivation to share information. In the ‘The Strength of Weak ties’, Granovetter
(1973) had already postulated that “if the motivation to spread [a] rumor is dampened a bit on each wave of retelling […] bridges will not be crossed”. In other words, the spread will get ‘stuck’. We confirm this insight and show that clustered networks can still be preferred over random networks over some parts of the space as the probability that an adopter becomes ‘inactive’ (i.e. stops sending messages) increases.
Therefore, we show that the strength of clustered networks depends on two likelihoods – one for adoption upon a single contact, the other for adopters becoming silent. We end by discussing the policy implication of this insight, proposing that how the messages are sent may be as important as ‘what is to be diffused’.
Specifically, if the environment limits repeated interactions, then maintaining social cohesion may speed up diffusion. Otherwise, from common social networks to the so-called conversational firms (Turco, 2016), where repeated signals are common and low cost, the benefits of distant ties are exponential, increase with network size, and may well apply to ‘whatever is to be diffused’.
6
From deterministic to stochastic thresholds
Connecting socially distant actors promotes diffusion by enabling it to spread simultaneously in different parts of the network (Granovetter, 1973). Thus rewiring highly clustered networks to create such shortcuts increases the speed of diffusion (Watts & Strogatz, 1998). In Complex Contagion and the Strength of Weak ties, Centola & Macy (2007) showed that this simple intuition may not generalize to ‘whatever is to be diffused’. Specifically, they created a typology of contagion in terms of the distinct number of adopters required for transmission to occur. If that threshold is equal to one, then the contagion is simple and random networks dominate clustered ones because of their many shortcuts across different parts of the network.
Centola and Macy showed that in ‘complex’ contagion, where the threshold is higher than one, those shortcuts become less effective, even harmful. Because adoption is contingent upon multiple reinforcements, a single weak tie is an ineffective conduit. In fact rewiring to create those distant connections – by reducing the density of local reinforcement-- may hurt complex diffusion. This intuition has promoted the distinction between simple and complex contagion as fundamental to understanding diffusion processes. However, the basic insight was developed in comparing two extremes: where all that it takes for diffusion is a single contact (simple) vs. a requirement for at least two reinforcements from different adopters (complex diffusion). We know little about the intermediate cases where additional reinforcement adds value but a single contact, with a small probability, may be sufficient for adoption.
Arguably most real-world diffusion processes are better represented by such stochastic thresholds. In practice we expect agents to influence each other over repeated interactions, with each reinforcement incrementally increasing the probability of adoption. How should we think about the relative merits of clustered vs. random networks under such repeated interactions with stochastic adoption thresholds?
We first build intuition about relevant tradeoffs using a very simple example, and then extend the analysis to the more general case. Let us consider the impact of rewiring local links to create shortcuts on a ring lattice with four links per node (Figure 1). We show a random rewiring that replaces two local links
(A-B and C-D) with the distant short-cuts (A-C and B-D). Such a rewiring increases the randomness of the
7 graph while keeping the degree of each node constant at four. In the graph, black nodes are the adopters and white nodes are susceptible.
Figure 1: 1-dimensional lattice network (left) and with a single rewiring (right)
First consider the classic ‘simple’ and ‘complex’ contagion where one and two adopter neighbors are necessary (and sufficient) for adoption. In the local graph (left) the diffusion progresses locally in both clockwise and counter-clockwise directions. In simple contagion each frontier of contagion (region where susceptible nodes and adopters come together) includes two susceptible nodes that adopt every period for a speed of four nodes per period. That rate goes down to two nodes per period in complex contagion where a single susceptible node is connected to two adopters on each of two contagion frontiers. The rewiring
(right graph) impacts the simple and complex contagion differently. In simple contagion rewiring seeds the diffusion in another region of network, opening two new frontiers (and while the different frontiers have not merged) doubling the diffusion speed. In contrast, rewiring hurts complex diffusion because absent A-
B link the B node is only connected to one adopter, which stops the diffusion when a frontier reaches B. In this case rewiring cuts the diffusion speed by half. By developing this simple intuition Centola and Macy
(2007) showed that random rewiring may have very different effects depending on whether one is concerned with simple or complex contagion.
8
But what if we consider complex contagions where with probability p adoption may also happen in response to a signal from a single adopter neighbor? An approximation of early diffusion speed in this more general scenario is informative. The contagion may now get stuck at B for multiple periods. However, if the interaction is repeated, a non-zero value of p implies that transmission will ultimately occur both locally and across the network, doubling the rate of adoption per round moving forward. For the diffusion rate to double, A needs to infect C, and B needs to infect D. However, B needs to adopt (before he can infect D), creating a bottleneck of two successive single-neighbor adoptions with the expected waiting time before (B, C and) D adopt at 1⁄� . During that time, the initial rate of spread (call that r) is divided by two, so the total number of susceptible nodes goes down by �⁄2� leaving us with � − �⁄2� susceptible nodes adopting with rate 2r. Compare that with diffusion speed in the clustered network (left) which after 1⁄� has � − �⁄� susceptibles adopting at rate r. Comparing these adoption rates rewiring helps if
� 2� < � − �⁄� � − �⁄2�
Which offers a threshold for probability of single-contact adoption that promotes rewiring:
3� � > (1) 2�
This result is consistent with Eckles et al. (2019) who proved the existence of a similar threshold on a graph generated by the union of a lattice network and a random network.
The value of rewiring depends on the network size, N: the higher the N, the lower the p can be while rewiring enhances diffusion speed. This analytical solution focused on early diffusion speed, and additional considerations should be accounted for as the number of shortcuts increases. On the one hand, as the number of shortcuts increases, the speed may double further, because diffusion can occur over more chunks at the same time. However, as the number of shortcuts increases, the size of each chunk decreases.
Since the maximum diffusion speed (2r in the above example with a single rewiring) is not realized for a while, the smaller and overlapping chunks are especially costly. Diffusion should wait significantly for any chunk to activate, but once active it is quickly consumed so benefits of rate increase are short-lived. At the
9 extreme, all triangles are rewired to form shortcuts resulting in a regular random graph, the number of chunks grows to the size of the network, but each offers little gain for diffusion speed. The resulting tradeoffs are not analytically tractable, and next we turn to a simulation model to understand the race between clustered and random networks in facilitating a broad class of contagion processes.
A model of Contagion with repeated interactions and stochastic thresholds
We model a population of agents where each ‘active’ node j (i.e. an adopter that is actively sending messages to her neighbors) sends messages to all her neighbors at a rate � . Each susceptible node keeps track of all the messages received from each adopter. Let � (�) represent the total number of messages from adopter j remembered by agent i at time t (we consider forgetting in an extension; the base model increments messages from agent j without forgetting). The susceptible agent (down)weights multiple messages from the same adopter using a discount factor w and combines the resulting signals from all her
� (�) adopter neighbors into an activation score, � (�):