URN MODELS WITH RANDOM MULTIPLE DRAWING AND RANDOM ADDITION
Abstract. We consider a two-color urn model with multiple drawing and random time-dependent addition matrix. The model is very general with respect to previous literature: the number of sampled balls at each time-step is random, the addition matrix is not balanced and it has general random entries. For the proportion of balls of a given color, we prove almost sure convergence results. In particular, in the case of equal reinforcement means, we prove fluctuation theorems (through CLTs in the sense of stable convergence and of almost sure conditional convergence, which are stronger than convergence in distribution) and we give asymptotic confidence intervals for the limit proportion, whose distribution is generally unknown.
Irene Crimaldi1, Pierre-Yves Louis23, Ida G. Minelli4
Keywords. Hypergeometric Urn; Multiple drawing urn; P´olya urn; Random process with rein- forcement; Randomly reinforced urn; Central limit theorem; Stable convergence; Opinion dynamics; Epidemic models
MSC2010 Classification. Primary: 60B10; 60F05; 60F15; 60G42 Secondary: 62P25; 91D30 ; 92C60
Contents 1. Introduction 2 2. The model 4 3. Asymptotic results 5 3.1. Almost sure convergence 7 3.2. Central limit theorems for the case of equal reinforcement means 10 3.3. Probability distribution of the limit proportion in the case of equal reinforcement means 14 3.4. Asymptotic confidence intervals for the limit proportion in the case of equal reinforcement means 14 4. Examples and numerical illustrations 16 Appendix A. Technical results 23 Appendix B. Some auxiliary results 25 arXiv:2102.06287v3 [math.PR] 4 Jul 2021 Appendix C. Stable convergence and its variants 25 References 27
1IMT School for Advanced Studies Lucca, Piazza San Ponziano 6, 55100 Lucca, Italy, irene.crimaldi@imtlucca. it 2PAM UMR 02.102, Universit´eBourgogne Franche-Comt´e,AgroSup Dijon, 1 esplanade Erasme, F-21000, Dijon, France, [email protected] 3Institut de Math´ematiquesde Bourgogne, UMR 5584 CNRS, Universit´eBourgogne Franche-Comt´e,F-21000, Dijon, France, [email protected] 4Dipartimento di Ingegneria e Scienze dell’Informazione e Matematica, Universit`adegli Studi dell’Aquila, Via Vetoio (Coppito 1), 67100 L’Aquila, Italy, [email protected] 1 2 MULTIPLE DRAWING AND RANDOM ADDITION
1. Introduction Reinforcement (see [34] for a review) means the tendency of a stochastic evolution to increase (or sometimes decrease, so called, negative reinforcement) the occurrence of an event in relationship with the number of time this event took place in the past. The P´olya urn stochastic process is the fundamental and paradigmatic example. It led to several generalizations. The original evolution rule of the P´olya urn is based on picking one ball in an urn filled with colored balls and replacing that ball in the urn together with one or more balls, according to some ”updating matrix”. More generalized samples have been considered, leading to multi-drawing based updating rules. In these models, many balls are selected at each time and returned before adding some new ones according to a reinforcement rule. Bi-color and multi-color models have been considered, as well as models where the extraction of the balls is with or without replacement. The number of sampled balls is always a fixed constant and the “replacement matrix” is in general assumed to be balanced, that is, the number of added balls to the urn is constant along time (e.g. [9,10,19,21,23,25,28,31]). In particular, in [20,28,31] the number of added balls is a deterministic function of the composition of the extracted sample. Results deal with the asymptotic behavior, evolution of moments, almost sure convergence and Central Limit Theorems (CLTs) for the fraction of balls of a given color in the urn. In the model considered in [29], m balls are sampled at a time, with replacement, and the distribution of the increment of one color follows, given the past, a binomial distribution with parameters m and p, where p depends on weights associated to the drawn colors. Results mainly deal with regimes where “fixation” happens, which is more interesting for reinforced random walks applications. Moreover, different urn models with multi-drawing were considered in relationship with some specific applications. See for instance [15, 24, 26, 27]. Other urn models merge multi-drawing and random replacement matrix. The paper [2] is a generalization of [1] and it deals with a constant sample size and a random replacement matrix. This matrix can be of P´olya (diagonal) or Friedman (anti-diagonal, reinforcement of the non chosen color) type and its entries have time-homogenous distribution. In particular, we point out that CLTs are not proven for the P´olya type case. As we will see later on, we here fill in this gap. The papers [3, 12] study a multi-drawing model (called HRRU, hypergeometric randomly rein- forced urn model) with a random number Nn of sampled balls and a random replacement matrix of rank 1 (bicolor case). The number of added balls of a given color is proportional to the number of balls of the same color in the sample, but the random reinforcement factor is the same for both col- ors. Note that this model generalizes the one recently given in [8]. The almost sure convergence of the color proportions toward a non degenerate random variable is proven. Necessary and sufficient conditions for no-atoms in the limiting distribution are given. In this paper, we consider a two-color urn model, with multiple drawing and random time- dependent addition matrix. The model is very general with respect to previous literature: the number of sampled balls at each step is random, the addition matrix, defining the number of additional balls, has general random entries. More precisely, for both colors, the random number of added balls is proportional to the number of balls of the same color in the sample, with possibly different random coefficients An, Bn (which may be correlated and their distribution may depend on time n). The model studied in [3,12] corresponds to the particular case An = Bn. The reinforcement rule we consider is not balanced (thus the long-run behavior of the total number Sn of balls in the urn at time n needs to be studied). We prove almost sure convergence results for the proportion as well as fluctuation results, through central limit theorems in the sense of stable convergence and of almost sure conditional convergence, by suitably extending some approaches employed in the urn model literature without multi-drawing (see [4, 5, 32]). Specifically, we consider two cases. If the factors An and Bn have the same mean (equal reinforcement means case), the limit proportion Z is MULTIPLE DRAWING AND RANDOM ADDITION 3 random without atoms. In the case of unequal limit reinforcement means, the proportion converges almost surely to 1 (or 0). When the limit proportion Z is random, the proven central limit theorems are employed in order to obtain asymptotic confidence intervals. Some applications of the urn models with multi-drawing are described in [26]. Moreover, like explained in [3,12], the present model may be applied in the context of technology adoption to model, for example, the evolution of the choice between different operative systems by companies. Below we illustrate other possible interpretations in the contexts of opinion dynamics and propagation of contagious diseases (epidemic models). Applications to opinion dynamics could be developed as follows. Assume to be before an election between two candidates. People decide who they are going to vote for. People who have already decided are represented as the colored balls already in the urn, the color meaning the choice for one candidate. One assume this is a not evolving choice. At each iteration, a group (with random size Nn) of people is sampled (without replacement) and each one is given the opportunity to convince a group of other people. The new-comers will adopt the same choice as the person who convinced them. The heterogeneity of this reproduction mechanism is modeled through the time- dependent randomness of the factors An and Bn. The assumption of equal reinforcement means would mean that in the long-run no advantage is given to any party. We can also consider the evolution of the diffusion of a binary opinion through social networks, like Twitter. Each agent inside a connected community has an un-changing opinion (for instance, a vote or a purchased product). This community will grow dynamically through immigration of followers. At each step, a subset (with random size Nn) of agents is chosen. Each agent of this committee is allowed to call into the community of followers sharing their opinion. Once again, the heterogeneity of this growth mechanism is modeled by allowing the multiplying factors An and Bn for each opinion to be random. Correlation between these growth coefficients are possible. If one of these coefficients is eventually larger in mean, then the associated opinion will dominate eventually (but may take some time). If both coefficients are equal in mean then some random equilibrium takes place. In the original paper [17], where the P´olya urn model was first defined, smallpox epidemy was the context it was applied to (see for instance [22,30] and references therein). Therefore, a second application of our model one could have in mind is the diffusion of genetic variants of viruses (see for instance [33] for a review on epidemic models on networks). We do not pretend to do any modeling study here but want to illustrate the potentialities of our model as a “toy model”. Assume one want to model the propagation of a virus, existing in two forms. Assume to consider a time scale such that there are infinitely many persons to be possibly contaminated and that once a person is contaminated, he/she remains contagious “for ever” (no recovering, no dying). Balls in the urn represent the contaminated persons by one of the two variants of the virus (corresponding to the two possible colors of the balls). We do have in mind the initial exponential regime of the propagation of two competing variants of one virus. Each discrete time-step of the urn’s evolution means a contagion step. People that are contaminating are assimilated to the sample made without replacement in the urn. This is a random number Nn and this randomness may depend on time and on the total number of contaminated persons. One chosen contaminating person diffuse the same variant. Each variant has its own amplifying factor An (resp. Bn): one assume that each selected person, contaminated by a given variant, is contaminating the same number of people. This somewhat unrealistic hypothesis is compensated by the fact that the number of individuals infected by one person is random, with a time-dependent and variant-dependent randomness. Moreover, An and Bn could be correlated. This model gives insights: if the limit means (time-asymptotic reproduction means of each variant in this context) are unequal, one kind of virus will eventually dominate. If they are equal, there is a limiting genuinely random proportion, for which we provide confidence intervals. 4 MULTIPLE DRAWING AND RANDOM ADDITION
Finally, another application context could be population dynamics in case of competitive or cooperative growth. As before, the flexibility of the model lies in the choice of Nn, An and Bn. The joint distribution of [An,Bn] is important to model competition or cooperation. One may think to bacterial populations and the evolution of their respective proportions in the microbial gut. The paper is organized as follows. In Section 2 we formally define the model. In Section 3 we state and prove the main results. In Subsection 3.1 we prove the almost sure convergence towards a limit proportion Z. Different behaviors occur according to equality/unequality of the limit reinforcement means. In particular, in the case of equal reinforcement means, we provide precise asymptotic rates: indeed, in Subsection 3.2 we establish central limit theorems for the proportion Zn of the balls of a given color in the urn and for the empirical mean Mn of the proportion of the balls of a given color in the samples. Moreover, in the case of equal reinforcement means, in Subsection 3.3, we prove that the distribution of the limit proportion Z has no atoms and, in Subsection 3.4, we provide asymptotic confidence intervals for Z, centered in Zn and Mn. We then present in Section 4 more specific examples, illustrated with some numerical simulations. The paper is enriched with an appendix in three parts which collects some more technical lemmas and general results, in particular about stable convergence and its variants.
2. The model An urn contains a ∈ N \{0} balls of color A and b ∈ N \{0} balls of color B. At each discrete time n ≥ 1, we simultaneously (i.e. without replacement) draw a random number Nn of balls. Let Xn be the number of extracted balls of color A. Then we return the extracted balls in the urn together with other AnXn balls of color A and Bn(Nn − Xn) balls of color B. More precisely, we take a probability space (Ω, A,P ) and, on it, some random variables Nn,Xn,An,Bn such that, for each n ≥ 1, we have:
(A1) The conditional distribution of the random variable Nn given
[N1,X1,A1,B1 ...,Nn−1,Xn−1,An−1,Bn−1]
is concentrated on {1,...,Sn−1} where Sn−1 is the total number of balls in the urn at time n − 1, that is n−1 n−1 X X Sn−1 = a + b + AjXj + Bj(Nj − Xj). (1) j=1 j=1
(A2) The conditional distribution of the random variable Xn given
[N1,X1,A1,,B1 ...,Nn−1,Xn−1,An−1,Bn−1,Nn]
is hypergeometric with parameters Nn,Sn−1 and Hn−1, where Hn−1 is the total number of balls of color A at time n − 1, that is n−1 X Hn−1 = a + AjXj. (2) j=1
(A3) The random vector [An,Bn] takes values in N \{0} × N \{0} and it is independent of
[N1,X1,A1,B1,...,Nn−1,Xn−1,An−1,Bn−1,Nn,Xn] .
According to the above notation, the random variable Xn corresponds to the number of balls having the color A in a random sample without replacement of size Nn from an urn with Hn−1 balls of color A and Kn−1 = (Sn−1 − Hn−1) balls of color B. The reinforcement rule is of the “multiplica- tive” type: indeed, each time n, we add to the urn AnXn balls of color A and Bn(Nn − Xn) balls of color B. Therefore, the total number of added balls to the urn, that is AnXn + Bn(Nn − Xn), MULTIPLE DRAWING AND RANDOM ADDITION 5 is random and depends on n.
Note that we do not specify the conditional distribution of the random variable Nn (the sample size) given the past [N1,X1,A1,B1 ...,Nn−1,Xn−1,An−1,Bn−1] nor the distribution of [An,Bn] (the random reinforcement factors An and Bn may have different distributions, they may be correlated and their joint and marginal distributions may vary with n).
It is worthwhile to remark that this model include the Hypergeometric Randomly Reinforced Urn (HRRU) studied in [3, 12] (take An = Bn for all n), which in turn include the model recently given in [8]. In particular, two special cases are the classical P´olya urn (the case with Nn = 1 and An = Bn = k ∈ N \{0} for each n) and the 2-colors randomly reinforced urn with the reinforce- ments for the two colors equal or different in mean (the case with Nn = 1 for each n and [An,Bn] arbitrarily random in N\{0}×N\{0}). Moreover, as told in Section 1, previous literature (we refer to the quoted papers in Sec. 1) deals with the case when the sample size Nn is a fixed constant, not depending on n, and/or the balanced case (constant number of added balls to the urn each time).
We set Zn equal to the proportion of balls of color A in the urn (immediately after the updating of the urn at time n and immediately before the (n + 1)-th extraction), that is Z0 = a/(a + b) and
Hn Zn = for n ≥ 1. Sn Moreover we set F0 = {∅, Ω}, Fn = σ N1,X1,A1,B1,...,Nn,Xn,An,Bn for n ≥ 1 , and Gn = Fn ∨ σ(Nn+1), Hn = Gn ∨ σ(An+1,Bn+1) for n ≥ 0. By the above assumptions and notation, we have
E[An+1 | Gn] = E[An+1] ,E[Bn+1 | Gn] = E[Bn+1] (3) and E[Xn+1 | Hn] = E[Xn+1 | Gn] = Nn+1Zn , (4) E[Nn+1 − Xn+1 | Hn] = E[Nn+1 − Xn+1 | Gn] = Nn+1(1 − Zn) .
Finally, we set Xn = {0 ∨ Nn − (Sn−1 − Hn−1),...,Nn ∧ Hn−1} and, for each k ∈ Xn,