<<

Bull. Org. mond. Bull. Wld Hit/h Org.SanteJ 1971, 45, 201-208

Equilibrium under in Balanced Polymorphisms

I. BARRAI 1 AND S. P. H. MANDEL 2

The results of a study on frequencies of causing lethal or sublethal disease are reported in relation to the level ofconsanguinity and heterozygote advantage in populations. The theoretical background of the population dynamics of genes under heterozygote advantage and in the presence of inbreeding is examined and the theoretical models are applied to the estimation of the level of for lethal and sublethal genes that are responsible for important public health problems in certain populations; these problems include fl-thalassaemia and sickle-cell anaemia. Estimates obtained from the proposed models show a high degree of internal consistency under widely different conditions; data on sickle-cell anaemia from several African populations and on f-thalassaemia from Italy and Malta were analysed. The hypothesis that rare lethal genes, such as those for Tay- Sachs disease, are maintained in the human population by heterozygote advantage is examined and discussed.

In the past few decades there has been a general The closer the consanguinity between parents, the decrease in morbidity and mortality due to infectious higher is the value of F. A second type of genetic and parasitic diseases. As a result, diseases of morbidity is determined by genes that confer an genetic origin, or those with a genetic component, advantage on the individuals carrying them in single have shown an increase in relative frequency with dose in the heterozygote, but which are deleterious respect to total morbidity and have thus become or lethal to the individual in the homozygous form; comparatively of greater importance from the public sickle-cell anaemia and fl-thalassaemia are diseases health point of view. These diseases of genetic origin of this kind. These genes, which may reach pre- are maintained in the human population essentially valence levels as high as 20 %, are called "poly- by two basic mechanisms: recurrent mutation and morphic" genes, and the corresponding situation at heterozygote advantage. Genetic morbidity due to the population level is known as a balanced poly- recurrent mutation is associated with a class of morphism. genes having a relatively low prevalence. These A more detailed account of the relationship be- so-called " oligomorphic " genes give rise to con- tween human genetics and public health, some of the ditions such as , galactosaemia, essential features of which have been mentioned and Tay-Sachs disease, the incidence of which briefly here, is given in a recent review of this subject is determined by the homozygous state of their by Barrai (1970). specific alleles. That is to say, the genes in question Recently it has been suggested that some oligo- are recessive, and the affected individuals are often morphic genes that are deleterious or lethal in found to be the offspring of consanguineous mar- homozygotes, such as those responsible for Tay- riages. Consanguineous marriage generates inbreed- Sachs disease and for of the pancreas, ing of the progeny, the intensity of which is measured may be maintained in human populations by hetero- by a coefficient of inbreeding (F), which is an expres- zygote advantage. For Tay-Sachs disease, this sion of the level of homozygosity in the progeny. hypothesis was prompted by the possibility that these genes might have increased in frequency in 1 Chief Medical Officer, Human Genetics, World Health some groups that separated in historical times from Organization, Geneva, Switzerland. an & Aron- 2Chief Statistician, Mathematics-Statistics, World Health ancestral population (Myrianthopoulos Organization, Geneva, Switzerland. son, 1966; Smith & Shaw, 1969).

2717 - 201 - 202 I. BARRAI & S. P. H. MANDEL

Since the genes under consideration are recessive Table 1. Array of frequencies under and have low frequencies that would respond to inbreeding inbreeding, it is convenient, in examining this Equivalent I Equivalent hypothesis, to consider the properties of a diallelic Genotype random mating selfing system at a single under inbreeding and fraction - fraction selection. The elements of such a system are two allelic genes, symbolically represented by A and a, AA (1-q)2 (1-F) (1-q)F Wii and the possible that may result from Aa 2q(1 -q) (1-F) 0 W12 them, namely the two homozygotes AA and aa and the heterozygote Aa. These alleles have a character- aa q2 (1 - F) qF W22 istic prevalence in any given population of interest, which may be represented by, say, q for allele a so that the prevalence of the complementary allele A is p=l-q. other is equivalent to a selfing fraction. The resulting The objectives and rationale of mathematical array is shown in Table 1. models describing the population dynamics of The whole mathematical treatment that follows is genetical systems are discussed in considerable detail based on the assumption that we are dealing with an by Li (1967). The fundamental mathematical pro- infinite population in which the level of inbreeding is position, which forms the basis of all such models, constant through generations. However, even in a is the celebrated theorem of Hardy & Weinberg, finite population with variable F, the variation in which asserts that, in a population of diploid indivi- inbreeding will be small, provided that the level of duals with random mating and in the absence of inbreeding is low. selection, zygotes AA, Aa, and aa will be formed in The change in frequency for allele a in any the ratio (1- q) 2: 2q (1- q): q2, respectively. one generation is given by the following equation In real population situations, however, matings (Li, 1955): do not occur at random. From the point of view of ~dWR dWil population genetics, these departures from the ran- Aq = q(l1-q +2(2) dom mating situation are adequately described by 2W dq dq J the concept of inbreeding. Moreover, with respect to most genetic loci, it cannot realistically be assumed where WR is the fitness of the random mating that all genotypes are equally well adapted to their fraction of the system, WI is the fitness of the inbred environment. The term selection describes such fraction, and W is the average fitness of the entire differences between genotypes in fitness or adapta- population so that: tion to the environment. Fitness can be defined as the relative ability of a genotype to survive from zlq= q(l-q){(W22-WI,) F+(1 -F) birth to maturity and to produce offspring; sym- bolically, the values of fitness of the three genotypes [W12-Wll-q (2W12-W11-W22)]), (2) can be represented by non-negative numbers W11, W,2, and W22, whose magnitudes are proportional which can be written in the form: to the ratios of the fitnesses of the genotypes AA, Aa, and aa, respectively. aq= q(l(-q) (2W123W1)_W22) (I-F) (3) If we incorporate into the basic mathematical model of Hardy & Weinberg the complicating where phenomena of selection and inbreeding, and if we W12_W11_ (W12_W22)F now let F represent the average inbreeding level in qF:-- (4) the population, where F is a number between 0 (I _F) (2W12_Wn_W22) and 1, it is possible to generate an array describing (Malecot, 1948, p. 44). the structure of the population with respect to genotype, gene prevalence, and fitness. The frequen- The equilibrium condition is given by Aq=0, cies of the various genotypes can be partitioned into which is satisfied by the trivial solutions q=0 and two main fractions, one of which is equivalent to the q=1, and by the solution q =q, which corresponds random mating part of the population, while the to a non-trivial equilibrium if, and only if, 0

(Note that under F = 0, the equation stable non-trivial equilibrium exists if, and only if, W12 W'11 W12-Wul qF F < ---12 (9) 2W12 -Wi11-W22 2 is obtained, which is the usual random mating It may be noted that the equilibrium gene fre- solution.) It follows that a polymorphic equilibrium quency of the gene corresponding to the less-fit exists if, and only if, one of the following pairs of homozygote under inbreeding is smaller than the inequalities holds: corresponding random mating value since

W, 2-WjLj> FMW 2-W22) (5) 15 ~~~(W11-W22)F W12-W22> F(W12-W11)( qF_-O-qF - (1-F) (2W12-W11-W22)>0 (10) or and that this difference increases with increasing W12-WI, 0, while conditions (6) imply A<0 and hence ton is qn it follows that an equilibrium characterized by dt = Wdq (12) conditions (5) is stable, whereas an equilibrium Aq(1 -q)(4Fq) (2 characterized by conditions (6) is unstable. to qo Thus conditions (5) are necessary and sufficient Resolving the right-hand side into partial fractions for the existence of a stable non-trivial balanced and integrating, we have: in the presence of inbreeding. It may be noted that these conditions reduce to the classical A(tn-to) = W L.-ln qn + IW22n Iqn_ conditions, namely W12> W1U, W12> W22, when 4F o (1 qF) 1q F=0. It may also be noted that the conditions WI, W22 +} qln W12>W11, W12>W22 are necessary (but not suffi- A) In 4Fq~+ (1( 1 qF) qF cient) for the existence of a stable equilibrium, and hence we can suppose, without loss of generality, This integral may be used to compute the numbers that W12> W11 > W22. Under these conditions, a of generations that are necessary, under continuity,

5 ')AA I. BARRAI & S. P. H. MANDEL

-- -F = 0.01 F = 0.10 F= .30 F= 0.20 F=i.10 Fo.oo F = 0.20 ."I 0-w238

F = 0.30 Fig. 2. Variation of lq as a function of q and F in a balanced polymorphism where Wi1 = 0.9, W12 = 1.0 and W22 = 0.8, all equilibria being stable for F<0.5. F = 0.40

The intercept of Aq with the abscissa has a nega- 10 20 40 60 80 100 tive slope, for all values of F < 1/2, so that the Generations family of equilibria is seen to be stable under the conditions given. Fig. 1. Deterministic approach to equilibrium in a balanced polymorphism where Wi1 = 0.9, W12 = 1.0, and W22 = 0.8, under inbreeding. CASE OF THE RECESSIVE LETHAL Let us study a system in which W22=0, so that the to reach a frequency qn, starting from a frequency qo. condition (9) on F reduces to It has been obtained previously for the particular case of F=0 (Cavalli, 1950; Livingstone, 1967). It

co I Do F = 0.04 I .D 7 L, F=0.06

F = 0.08 0.06 F = 0.09 F values W"O 00241 Fig. 5. Time in generations necessary to approach 40 60 equilibrium, and equilibrium frequency as a function Generations W"O 00239 of F (qo = 0.001, Wi2 = 1.169). Fig. 3. Deterministic approach to equilibrium under inbreeding in a balanced polymorphism where Wi1 = zygote advantage is required for the maintenance of 0.9, W12 = 1.0, and W22 = 0. a nontrivial stable equilibrium (see Allard, Jain & Workman (1968) for review and discussion). The main effect of inbreeding on a balanced polymorphism, within the permissible range, is the decrease of the equilibrium frequency of the allele corresponding to the less-fit homozygote as com- pared with the equilibrium frequency under random mating. The higher the inbreeding level, the lower the equilibrium frequency. A secondary effect is the increase in the time, measured in numbers of generations, that is necessary for the system to reach near equilibrium. Fig. 5 shows the variation with respect to F of the equilibrium gene frequency and time for a system in which Wl, = 1, W12= 1.169, and W22=0. The initial frequency is taken at 0.001; near equilibrium is defined as a difference from the equilibrium of one-thousandth of the equilibrium Fig. 4. Variation of Aq as a function of q and F in a frequency. In an initial phase of polymorphism, balanced polymorphism where Wii = 0.9, W12 = 1.0, inbreeding might keep gene frequencies indefinitely and W22 = 0, all equilibria being stable for F<1/10. low. APPLICATIONS OF TIHE MODEL inbreeding exist, and this has been shown to be the The model permits estimation of q, F and the case in the situation where the selection coefficients relative values of fitness in the case of a diallelic are fixed. Changes in these coefficients may result system with heterozygote advantage and an allele in a variation of the range of F in which equilibrium that is lethal in the homozygote, such as the systems is possible. However, if the level of inbreeding for haemoglobin S, ,-thalassaemia and presumably increases beyond the critical point, increasing hetero- Tay-Sachs disease and cystic fibrosis. 206 I. BARRAI & S. P. H. MANDEL

If the observed distribution of gene frequencies is Table 2. Distribution of population groups known in a population that is subdivided into villages and samples used for the estimation isolated groups, an equivalent value of F can be of fitness calculated by Wahlund's formula (see Li, 1955) Gene Population group Samples |Total sample cTq2 F = (13) Sickling People's Republic of 4J the Congo 185 62 140 where q is the observed average, p 1 -q, and a 2 Cameroon 22 6 744 is the variance of the gene frequencies. It should be Nigeria 34 22 803 noted that in this case it is assumed that a population with an excess of homozygotes resulting from sub- 8-thalassaemia Sardinia 89 20 154 division is equivalent to a non-subdivided population Sicily 34 9 290 with a level of inbreeding defined by equation (13); Ferrara area 27 26 155 the effect of local inbreeding is disregarded, and this Malta 42 10 385 might be a severe restriction. If adequate sets of data are available the parameters fitting the model that may be estimated are as follows: (1) Equilibrium value of q; this requires that the study of the gene for haemoglobin S, and 4 for The ratio Wn/W12 and F are stable through generations, the study of ,B-thalassaemia in Italy and Malta. and known. number of groups sampled and the total sample size are given in Table 2 by gene and population group. (2) Value of the relative fitness W,1/W12; this Table 3 shows the values of q and the equivalent F requires the assumption of equilibrium of the dis- for the haemoglobin-S gene distributions and for of gene frequencies. tribution ,-thalassaemia in these areas. (3) Value of F; this is also possible under the The distributions of gene frequencies in these assumption of equilibrium. areas are not necessarily in a steady state; and the (4) Time necessary to go from an initial frequency value of the F equivalent computed from them is qo to a frequency qn; it is possible when F and the inflated by sampling variance. The gene frequencies ratio Wn/W12 are stable and known. for haemoglobin S were calculated from sickling rates, under the assumption that no homozygote Sickle-cell anaemia and ,-thalassaemia survives to adulthood; they may have been under- Livingstone (1967) has given extensive data on estimated. The gene frequencies for fl-thalassaemia sickling rates and frequencies of AS heterozygotes in are based on osmotic fragility tests; osmotic fragility various countries, and of,-thalassaemia in Italy and may be due to several causes quite independent of elsewhere. From that report, data for 7 populations the heterozygous state for ,B-thalassaemia, and the have been selected, 3 populations in Africa for the gene frequencies may be largely overestimated. As

Table 3. Relative values of fitness W12/W11 in the populations under study

Gene | Population group | q F [ WI2/W1I Sickling People's Republic of the Congo 0.12557 0.01317 -1.185 Cameroon 0.06369 0.02650 1.104 Nigeria 0.11356 0.00314 1.145

B-thalassaemia Sardinia 0.09378 0.01683 1.136 Sicily 0.02417 0.00548 1.031 Ferrara area 0.06542 0.00836 1.085 Malta 0.03274 0.02181 1.059 EQUILIBRIUM UNDER INBREEDING IN BALANCED POLYMORPHISMS 207 a consequence, the relative value of fitness may be gants should be an obvious alternative to hetero- overestimated for sickle-cell anaemia, whereas it may zygous advantage. It may be suggested that remote be underestimated for B-thalassaemia. inbreeding accounts for part at least of the number Accurate estimation depends on the quality of the of segregants, and that may have played data and it would be premature to accept these a part in the increase in frequency. relative values of heterozygote fitness as more than It should be noted that if the advantage is of the indicative. It seems, however, that the method order of 5 %, this is also the maximum level of proposed may be applied, since the estimates of the inbreeding that would be tolerated by the system. fitness of the same heterozygote are highly consistent Inbreeding in excess of the permissible maximum under different distributions of gene frequencies in would result in the disappearance of the gene, what- different environments. The average estimate of the ever the initial gene frequency. As a consequence, relative fitness for the genotype AS, weighted by if we were to accept the 5 % advantage for Ashkena- sampled population size, is zim, inbreeding in this group should have remained W12= 1.169±0.026, below this level during their history. Even if this were the case, it would be difficult to accept advan- and for the ,B-thalassaemia heterozygote tage under environmental conditions that are, or W12= 1.089±0.035. have been, specific for the Ashkenazi group. In conclusion, it is felt that the indication of Tay-Sachs disease advantage for the Tay-Sachs heterozygote is vague. According to Myrianthopoulos & Aronson (1966), The magnitude of the assumed advantage seems to the heterozygote advantage for the Tay-Sachs gene be very large, and a more complete distribution of in Ashkenazi Jews of the USA could be as low as gene frequencies in several different groups is 4.4% and as high as 6% when compared with the necessary before attempts are made to obtain fitness of the normal homozygote. It is possible to estimates with the proposed models; the estimate apply the model to obtain another independent obtained by use of the available distribution results estimate of the relative fitness of the Tay-Sachs in a rather small advantage. heterozygote. Using the gene frequencies given by It is most important that problems like the one Myrianthopoulos & Aronson (op. cit., Table 1) the posed by Tay-Sachs disease and cystic fibrosis are inbreeding equivalent (F=0.0051) and the average clarified. The assumption of advantage in the urban gene frequency (q =0.0082) can be computed; thus environment has very serious consequences for therelative fitness is W12 = 1.013, which is smaller than public health. Since it seems unavoidable that the the values given by Myrianthopoulos & Aronson urban environment will replace all others, a rise in (op. cit.) and very similar to the value calculated by the frequency of alleles that have an advantage in Smith & Shaw (1969). Should the inbreeding level such environments might be expected, and a 5 % be higher than estimated, one could expect a greater advantage for Tay-Sachs heterozygotes would mean fitness but since there is no way of invoking a an equilibrium frequency of the same magnitude, and larger inbreeding coefficient if data are not available possibly an infant mortality greater than 2 per 1000, the argument becomes purely speculative. as a single cause. However, gene repair and substitu- The possibility that a higher inbreeding level in tion techniques may have been developed and widely Ashkenazim groups distorts the estimation of gene adopted before equilibrium becomes established, frequencies through an excess of Tay-Sachs segre- since that would take a long time.

UME tQUILIBRE EN CAS D'ENDOGAMIE DANS LES POLYMORPHISMES BALANCJ-S

Les auteurs discutent la th6orie de l'equilibre qui des frequences initiales des genes, du degre d'endogamie s'etablit, dans des conditions d'endogamie, dans les et de l'aptitude relative (aptitude d'un g6notype donne a polymorphismes balances. Une formule math6matique survivre jusqu'a l'age adulte et 'a procreer). Grace a permet d'estimer le nombre de gen6rations necessaire cette formule, on demontre que l'endogamie a pour effet pour atteindre un etat voisin de l'equilibre, en fonction principal d'abaisser le niveau ofu se stabilise la frequence 208 I. BARRAT & S. P. H. MANDEL de l'allele le plus rare et d'accroitre la duree necessaire zygote normal, lui est superieure de 16,9 ± 2,6% dans pour parvenir a cet 6quilibre. le cas de la drepanocytose, de 8,9 ± 3,S % dans le cas de Un modele math6matique - destin6 a evaluer la la ,-thalassemie, et de 1,3 % dans le cas de la maladie de valeur de l'aptitude relative dans le cas d'un gene r6cessif Tay-Sachs. On ne disposait, pour cette derniere affection, l6tal dont la transmission persiste parce qu'il confere un que d'un lot unique de donn6es et il n'a pas 6t6 possible avantage au porteur h6terozygote - a ete appliqu6 a de mesurer une erreur type. trois affections d'origine g6n6tique: l'anemie a hematies Les resultats obtenus par l'analyse des donn6es effec- falciformes, la P-thalass6mie et la maladie de Tay- tuee suivant le modele decrit ne plaident pas en faveur Sachs. Par cette methode, on constate que l'aptitude de l'hypothese selon laquelle l'etat d'het6rozygote relative de l'h6terozygote, comparee A celle de l'homo- conf&e un avantage dans la maladie de Tay-Sachs.

REFERENCES

Allard, R. W., Jain, S. K. & Workman, P. L. (1968) Livingstone, F. B. (1967) Abnormal haemoglobins in Advanc. Genet., 14, 55-132 human populations, Chicago, Aldine Publishing Com- Barrai, I. (1970) WHO Chron., 24, 241-247 pany Myrianthopoulos, N. C. & Aronson, S. M. (1966) Cavalli, L. L. (1950) Biometrics, 6, 208-220 Amer. J. hum. Genet., 18, 313-327 Malecot, G. (1948) Les Li, C. C. (1955) Population genetics, University of Chicago mathematiques de l'heredite, Paris, Press Masson Smith, A. P. & Shaw, R. (1969) Nature (Lond.), 224, Li, C. C. (1967) Biometrics, 23, 397484 1214-1215