From Classical Genetics to Quantitative Genetics to Systems Biology: Modeling Epistasis

From Classical Genetics to Quantitative Genetics to Systems Biology: Modeling Epistasis David L. Aylor1, Zhao-Bang Zeng1,2,3* 1 Bioinformatics Research Center and Program in Bioinformatics, North Carolina State University, Raleigh, North Carolina, United States of America, 2 Department of Genetics, North Carolina State University, Raleigh, North Carolina, United States of America, 3 Department of Statistics, North Carolina State University, Raleigh, North Carolina, United States of America Abstract Gene expression data has been used in lieu of phenotype in both classical and quantitative genetic settings. These two disciplines have separate approaches to measuring and interpreting epistasis, which is the interaction between alleles at different loci. We propose a framework for estimating and interpreting epistasis from a classical experiment that combines the strengths of each approach. A regression analysis step accommodates the quantitative nature of expression measurements by estimating the effect of gene deletions plus any interaction. Effects are selected by significance such that a reduced model describes each expression trait. We show how the resulting models correspond to specific hierarchical relationships between two regulator genes and a target gene. These relationships are the basic units of genetic pathways and genomic system diagrams. Our approach can be extended to analyze data from a variety of experiments, multiple loci, and multiple environments. Citation: Aylor DL, Zeng Z-B (2008) From Classical Genetics to Quantitative Genetics to Systems Biology: Modeling Epistasis. PLoS Genet 4(3): e1000029. doi:10.1371/journal.pgen.1000029 Editor: David B. Allison, University of Alabama at Birmingham, United States of America Received June 12, 2007; Accepted February 8, 2008; Published March 14, 2008 Copyright: ß 2008 Aylor and Zeng. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Funding: This work was partially supported by NIH GM45344 and by the National Research Initiative of the USDA Cooperative State Research, Education and Extension Service grant number 2005-00754. DLA is supported by a NIEHS training grant (T32 ES007329) in Bioinformatics. Competing Interests: The authors have declared that no competing interests exist. * E-mail: [email protected] Introduction and the effect of the environment. Epistasis is defined as a deviation from these additive gene effects [3]. A quantitative Epistasis has traditionally been discussed in two distinct genetic model can include multiple loci and multiple interactions. contexts, corresponding to the disciplines of classical molecular Epistasis in this sense describes a functional relationship between genetics and quantitative genetics. In each case, the term describes genes in the context of a trait, but it includes both hierarchical an interaction between alleles at two or more loci. However, the relationships and nonhierarchical relationships and there is no way methods for detecting epistasis and interpretations of the to distinguish between these. underlying biology have kept historical divisions in place despite Any genetic effect is only relevant to the population being calls for synthesis [1]. This is largely because the two fields studied due to the presence of genetic background. Background is traditionally study different types of traits in different experimental genetic variation that is unobserved in the population and cannot populations. be modeled. The classical experiment is performed using The classical epistasis experiment compares a double-mutant genetically homogenous laboratory strains so there is no with two associated single-mutants. Epistasis is present if the background. Quantitative genetics studies diverse populations observed double-mutant phenotype is categorized as being the and background variation is almost always present. The same as a single-mutant phenotype. This implies a specific type of implication of this is that epistasis may be detected in one interaction in which an allele at one locus masks the effect of experiment but not in another. This has led to criticisms that variation at the second locus. This relationship is described as the epistasis in the quantitative genetic sense is a statistical construct first locus being epistatic to the second, and can be interpreted as rather than a true representation of biology. one gene acting upstream of the other. This hierarchical In fact, both approaches seek to illustrate underlying molecular interpretation has been used to construct biological pathways architecture and each has its strengths. A hierarchical interpreta- via a series of epistatic gene pairs. However, this approach is tion of epistasis is attractive as increased focus is placed on genetic limited by the necessity of easily observed and categorized pathways and systems diagrams. However, quantitative approach- phenotypes [2]. es are necessary to accommodate continuous data types such as In contrast, quantitative genetics examines traits that vary gene expression, metabolite concentrations, and fitness. Recent continuously and cannot easily be categorized. Such trait literature suggests that such approaches are being adopted. For distributions result from the cumulative effects of many genes. example, while early large-scale fitness profiles in yeast deletion Each additional gene increases the possible combination of alleles, mutants [4,5] were scored categorically, St Onge et al [6] and the number of possible phenotypes grows exponentially. An measured fitness in 650 double-deletion yeast strains and individual’s phenotype is the sum of the allelic effects at each gene employed a novel quantitative analysis. PLoS Genetics | www.plosgenetics.org 1 2008 | Volume 4 | Issue 3 | e1000029 Modeling Epistasis Author Summary Results Epistasis has long had two slightly different meanings Modeling Epistasis for Continuously Variable Traits depending on the context in which it is discussed. The In the classical epistasis analysis, triplets of deletion mutants classical definition describes an allele at one locus combine with a wild type to form a contrast. Each contrast completely masking the effect of an allele at a second includes two single mutants and a double mutant. Each is locus. Such relationships can be interpreted as hierarchical, described relative to the known wild type phenotype. A and they can be combined to infer genetic pathways. In hypothetical example of a trait affected by two genes, A and B, quantitative genetics, epistasis encompasses a wide range can be described as follows, where y is the trait value, m is the of interactions and can be extended to more than two loci. expected value of the wild type, b and b are the effects of These two definitions coexist because they are typically A B deleting each gene, and e is an error term. applied to different types of study populations and different types of traits. The current trend is to treat gene expression as a trait in a variety of genetic backgrounds. z z ~ z This provides reason to revisit epistasis in this new context. A B : y m e We accommodate the continuous nature of gene expres- { z A B : y~mzbAze sion using ideas from quantitative genetics, but retain the z { hierarchical interpretation of the classical experiment. A B : y~mzbBze These hierarchical relationships are the building blocks of ( mzbAze if A is epistatic to B systems diagrams and genetic pathways. This framework A{B{ : y~ can serve as a foundation for future epistasis analyses mzbBze if B is epistatic to A based on genomic data. This adheres strictly to the classical definition, but there is a The rise in genomic techniques has broken down one of the clear problem; there is no provision if the double mutant does not traditional barriers discussed above: the same traits are now being fall neatly into the same category as one of the single mutants. used in both classical and quantitative settings [7]. Gene Gene expression traits fit poorly into the classical framework for expression is perhaps the most prevalent example. Instead of a this reason. Expression is continuous and intermediate levels are single phenotypic trait value, a vector of expression measurements expected. Furthermore, even normalized trait values will inevita- describes each individual. Expression profiling in single-deletion bly include some measurement error. For these reasons, the yeast strains found that 34% of mutants showed twenty or more double mutant observation is rarely the same as either of the single differentially expressed genes [2]. Expression quantitative trait mutant observations or the wild type. Previous studies have locus (eQTL) mapping uses a linear modeling approach to attempted to circumvent this problem by relying on differences associate genetic variation with gene expression traits [8–12]; between the mutants to determine the most similar mutant pair. Storey et al. [13] found over thirty percent of traits were jointly However, the assumption that expression is completely masked is linked to two loci in yeast. When gene expression correlates with a poor. To address these issues, we move away from comparing trait complex phenotype, the corresponding traits may reflect the values directly. Instead, we evaluate each deletion according to molecular basis of that trait at a level intermediate between whether it significantly

From Classical Genetics to Quantitative Genetics to Systems Biology: Modeling Epistasis

Models of the Gene Must Inform Data-Mining Strategies in Genomics

Epigenetics Is Normal Science, but Don't Call It Lamarckian

Using Classical Genetics Simulator (CGS) to Teach Students the Basics of Genetic Research

The Sustained Impact of Model Organisms—In Genetics and Epigenetics

G14TBS Part II: Population Genetics

Mendelian Gene ” and the ” Molecular Gene ” : Two Relevant Concepts of Genetic Units V Orgogozo, Alexandre E

Molecular Population Genetics

The Hardy-Weinberg Equilibrium: Some Helpful Suggestions

A Century of Geneticists Mutation to Medicine a Century of Geneticists Mutation to Medicine

00326673.Pdf

Population Genetics

Chapter 2. the Beginnings of Genomic Biology – Classical Genetics Contents