The Estimation of the Genetic Correlation: the Use of the Jackknife

The Estimation of the Genetic Correlation: the Use of the Jackknife

Heredity 73 (1994) 544-548 Received 21 March 1994 Genetica! Society of Great Britain The estimation of the genetic correlation: the use of the jackknife DEREK A. ROFF* & RICHARD PREZIOSI Department of Biology, McGill University, 1205 Dr Pen field Ave., Montréal, Québec, Canada H3A 1B1 Tounderstand fully the process of evolution of quantitative traits it is necessary to be able to estimate the genetic correlation and its associated standard error. At present, estimation methods are available only for relatively simple designs. An alternative procedure is to use the correlation of family means as an estimate of the genetic correlation. We evaluate the utility of the family mean method and that of the more general procedure, the jackknife. The family mean method is shown to be potentially very biased unless family sizes are very large (20),and therefore its general utility is questionable. However, the jackknife method does provide valid estimates of both the correlations (phenotypic and genetic) and their standard errors. Keywords: bias,confidence limits, geneticcorrelation, heritability, jackknife, phenotypic correlation. Introduction confidence interval. A likely candidate is the jackknife method. The jackknife is one of a number of methods Because of genetic correlations among characters,collectively known as robust techniques because they selection, even if it operates directly only on a single are relatively insensitive to the underlying distribu- character, also affects many other characters (Lande & tional properties of the statistics being estimated Arnold, 1983; Falconer, 1989). For this reason the (Potvin & Roff, 1993). Nevertheless, these methods are estimation of the genetic correlation is of considerable not panaceas in the sense that they can be applied importance both for the application of quantitative indiscriminately, and it is necessary to demonstrate in genetic theory in artificial selection and for the under- each particular circumstance that they are valid standing of evolutionary processes in natural popula- approaches (Miller, 1974; Potvin & Roff, 1993). This tions. In any estimation procedure it is necessary to can be done using simulation. The efficacy of the jack- estimate both the value of the statistic itself and also the knife in estimating the heritability and its standard associated confidence limits. The estimation of confi- error has already been demonstrated (Simons & Roff, dence limits for the genetic correlation is difficult and 1994), and thus a priori it seems likely that the method even in the restricted cases where estimation methods will also work for the estimation of the genetic correla- have been worked out the statistical behaviour is not tion and its standard error. well understood (Robertson, 1959, 1960; Van Vleck & In this paper we present a theoretical analysis of the Henderson, 1961; Van Vleck, 1968; Hammond & family mean method, deriving the relationship between Nicholas, 1972; Grossman & Norton, 1974; Becker, the bias and family size. Having demonstrated that this 1985). An alternative approach suggested by Via approach is, in general, unacceptable, we apply the (1984) is to use the Pearson product-moment correla- jackknife technique to a set of simulated data, showing tion between family means, for which the usual that it correctly estimates the genetic correlation and its methods of estimating confidence intervals on correla- standard error. tions can be applied. It is known that such an estimate is biased, the amount of bias declining with family size. However, the magnitude of this bias has not been Thesimulation model estimated. Becausethe simulation model is used to verify the Clearly, it would be useful to have a general method theoretical analysis of the family mean method as well of estimating the genetic correlation and its associated as to verify the utility of the jackknife method, we present its construction first. The model is based upon *Correspondence the procedure worked out by Ronningen (1974) and 544 ESTIMATION OF GENETIC CORRELATION 545 Olausson & Ronningen (1975). The case simulated is tion of family means: that of full-sibs in which each family is split between two cages. The basic method of estimating the heritabi- = COVm(xy) lities and genetic correlation is through a series of Tffl nested ANOVAS (Becker, 1985). Because the nested varm(y) design is identical in structure to that of the half-sib where COVmandvarm are calculated from family means design without cage effects, the results should also of phenotypic values. This is an approximation because apply to this design. The values of the two characters X the variance and covariance terms contain a fraction of and Y are given by the within-family error term: COVWIth in COVm = + X,J,k= b11,,k1— h+GCx,,k COVamong n where n is the family size. The potential advantage of the above method is that because they are simply ,j,krgax,i a (1—r)h standard product-moment correlations the usual signi- ficance tests can be applied and confidence intervals computed. Specifically, if the number of families is N, +rebx,E,,k r) — h+bYij,kI(1_h)(1_ the confidence interval is computed by first transform- ing the correlation to the z scale + CCyk, 1 1+r where: z=—ln 2 1—r X11 k' k are the values of traits X and Y, respect- ively, for individual jinfamily i of cage k; The standard erroristhen approximately a11, are random standard normal values, N(0,1), 1 /.j( N —3),and confidence limits on r can be estimated common to the ith family; by computing the confidence limits on z and back- b11,Jk,bl,J,kare random standard normal values, transforming (for more details see Sokal & Rohlf N(0,1), of thejth individual from family i in cage k; (1981, pp. 583—591 )).Notethat the family size, n, does Cx 1k' Cyjkarerandom standard normal values, N(0,1), not appear in the formula, and thus any bias due to an common to individuals in cage k of family 1; insufficiently large family size will not be reflected in h, h are the heritabilities of traits X and Y, respect- the confidence interval. ively; It can be shown by use of the equations derived for rg is the genetic correlation between traits X and Y; the simulation model, that the family mean correlation Te is the environmental correlation between traits X is equal to and Y, given by rg+!n (u12_rg)h1h r — Tg Tm Te 1 n/i1 h2 \/(1+! [i]) (+[-])'nh \/1k 2 ) where r, is the phenotypic correlation. Note that, unlike the estimated standard error which contains where r is the phenotypic correlation; and only N, the family mean correlation contains only n, a is the standard deviation of the cage effect. the family size. To examine the potential effect of For each variable combination, 1 000 simulations changing family size on this approximation for the were run. In all simulations the number of families was genetic correlation we assumed h == r=0.5, kept constant at 50. rgO, and no cage effect (a=0). Under these condi- tions rm =1/(n+ 3). The predicted and observed family mean correlations agree very closely (Table 1). It is Themethod of family means evident that for family sizes less than 20 there is a Becausestandard errors are not generally available for considerable bias in the family mean correlation as an complex designs, Via (1984) suggested using as an approximation to the genetic correlation. Further, the approximation of the genetic correlation, the correla- estimated 95 per cent confidence intervals are far from 546 D. A. ROFF & H. PREZIOSI correct when family sizes are small (Table 1). The bias using the usual ANOVA approach (see Becker (1985) for and error in the confidence interval will decline as the the appropriate formulae). A sequence of N pseudo- genetic correlation approaches the phenotypic correla- values is computed by dropping in turn each of the tion. However, given that these are the parameters we families, estimating the resulting correlations and using are attempting to estimate it would certainly be unwise the formula to make any a priori assumptions. If the genetic correlation and phenotypic correlation were even more SN=NrN—(N—1)rN_1,1, different than assumed above then even a family size of where:SN2 is the ith pseudovalue, rN is the correlation 20 might not be sufficient. The formula given above estimated using all N families, and rN_ is the correla- can be used as a guide for the bias resulting from tion obtained by dropping the ith family alone. a particular family size. Nevertheless it would be The jackknife estimate of the correlation, r,, is certainly preferable to have a method of estimation that simply the mean of the pseudovalues is not potentially confounded as this method can be. i=N The S jackknife r' Thejackknife method works as follows: first, the N genetic and phenotypic correlations are estimated An estimate of the standard error, SE, is given by Table I The predicted and observed family mean correlation for different family sizes, given h == r=0.5, (S — = U j=1 rg =0 SE = N(N—1) Family mean correlation For the suite of variable values used previously the Number per family Predicted Observed P confidence limits estimated using the jackknife for both the phenotypic and genetic correlations are not signifi- 3 0.333 0.334 0.34 cantly different from the required 95 per cent (Table 2). 5 0.250 0.256 0.57 10 0.154 0.149 0.83 With the possible exception of the genetic correlation 15 0.111 0.112 0.88 for N= 3, the mean estimates show no evidence of 20 0.087 0.082 0.92 significant bias. Though there is a possible bias for N 3 the confidence limits are still acceptable Results from 1 000 simulations using 50 families per run.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    5 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us