<<

Gene Regulatory Networks

02-710 Computaonal Genomics Seyoung Kim Transcripon Factor Binding Transcripon Control

transcripon is influenced by – Transcripon factor binding affinity for the regulatory regions of target – Transcripon factor concentraon – Nucleosome posioning and chroman states – acvity Gene Transcriponal Regulatory Network

• The expression of a gene is controlled by cis and trans regulatory elements – Cis regulatory elements: DNA sequences in the regulatory region of the gene (e.g., TF binding sites)

– Trans regulatory elements: and that interact with the cis regulatory elements Gene Transcriponal Regulatory Network

• Consider the following regulatory relaonships:

Target gene1

Target TF gene2

Target gene3 Cis/Trans Regulatory Elements

Binding site: cis Target TF regulatory element TF binding affinity gene1 can influence the TF target gene Target gene2 expression Target TF gene3

TF: trans regulatory element TF concentraon TF can influence the target gene

TF expression Gene Transcriponal Regulatory Network

• Cis and trans regulatory elements form a complex transcriponal regulatory network – Each trans regulatory element (proteins/RNAs) can regulate mulple target genes – Cis regulatory modules (CRMs) • Mulple different regulators need to be recruited to iniate the transcripon of a gene • The DNA binding sites of those regulators are clustered in the regulatory region of a gene and form a CRM How Can We Learn Transcriponal Networks?

• Leverage specific expressions – In diploid , the transcript levels from the two copies of the genes may be different – RNA-seq can capture allele- specific transcript levels How Can We Learn Transcriponal Networks?

• Leverage allele specific gene expressions – Teasing out cis/trans regulatory divergence between two (Wikopp et al. Nature 2004) • First study that makes use of allele-specific gene expressions to learn about • Examines genome/transcriptome data for two parent species and their F1 progeny

– Allele-specific eQTL mapping • Extension of eQTL mapping • Examines populaon genome/transcriptome data Cis/trans regulatory Divergence in Two Species

• Compare two parent species with their F1 progeny Only Cis-regulatory Divergence Between Two Species

D. melanogaster

TF Target Expr F1 High TF Target Expr High High Low

D. sechellia TF Target Expr Low Low Only Trans-regulatory Divergence Between Two Species

D. melanogaster TF Target Expr F1 Hybrid High TF Target Expr High Medium Medium

D. sechellia TF Target Expr Low Low Cis/Trans Regulatory Divergence and Allele- Specific Expression More Precisely…

A, a: Two allele-specific transcript levels of target gene p, F1: parent species and F1 hybrid More Precisely…

A, a: Two allele-specific transcript levels of target gene p, F1: parent species and F1 hybrid How Much Cis/Trans Regulatory Divergence Between D. melanogaster and D. sechellia

• Only cis-regulatory divergence: genes on the diagonal line (y=x)

• Only trans-regulatory divergence: genes on y=1

• Most genes are influenced by a combinaon of cis and trans regulatory divergence How Much Cis/Trans Regulatory Divergence Between D. melanogaster and D. sechellia

• RNA-seq-based studies – Cis + trans • Allele specific raos have the same signs in parents and hybrids • Cis/trans regulatory differences favored expression of the same allele during evoluon – Cis x trans • Allele specific raos have different signs in parents and hybrids • Cis/trans regulatory differences favored expression of the different allele during evoluon – Conserved • no expression differences in the parents but differenally expressed in the hybrids. • cis and trans divergence compensated with each other Overall Regulatory Networks

• Regulatory genes – TFs, signaling molecules Regulatory – Expression changes lead to genes trans-acng effects on downstream genes

• Structural genes – Encode or cellular components Structural genes – Usually terminal nodes with no trans-acng effects Limitaons

• Examined the data between only two species and their progeny – Limited stascal power – Captured only the overall contribuon of cis/trans regulatory divergence – Unable to idenfy the precise cis/trans regulatory elements whose variaon contribute to the divergence How Can We Learn Transcriponal Networks?

• Leverage allele specific gene expressions – Teasing out cis/trans regulatory divergence between two species (Wikopp et al. Nature 2004) • First study that makes use of allele-specific gene expressions to learn about gene regulatory network • Examines genome/transcriptome data for two parent species and their F1 progeny

– Allele-specific eQTL mapping (Sun, Biometrics 2012) • Extension of eQTL mapping • Examines populaon genome/transcriptome data Expression Quantave Trait Locus (eQTL) Mapping

• Study of the genec basis of gene expression using populaon gene expression and data • Essenally GWAS with gene expression as values

eQTLs for HLA-C gene expression in HapMap European samples eQTL Mapping

• Popular approach for studying the genec basis of gene expressions for – Different ssue types – Different diseases Expression Quantave Trait Locus (eQTL) Mapping

• Recombinant inbred lines, instead of populaon of unrelated individuals – Has been applied to model organisms yeast, mouse, etc. Expression Quantave Trait Locus (eQTL) Mapping

• Recombinant inbred lines, instead of populaon of unrelated individuals – Key ideas: shuffle their genomes through mang, to understand the effects of genec differences in parent species. – More generaons of mang means more shuffling through recombinaons and higher resoluon for eQTLs Expression Quantave Trait Locus (eQTL) Mapping

• Recombinant inbred lines, instead of populaon of unrelated individuals – At a given locus, the samples are separated according to the inherited

Expression of gene 1

Expression of gene 2 cis and trans eQTLs

Cis eQTL • cis eQTL: in genomes, the eQTL is located near the gene whose expression is affected by the TF eQTL. X – E.g., mutaons in the upstream of SNP a gene influences the expression target gene 3 level of the gene

• trans eQTL: in genomes, the eQTL is located far away (or on a Trans eQTL TF different chromosome) from the gene whose expression is affected by the eQTL. – E.g., mutaons in the transcripon X factor gene can influence the SNP expression level of the TF target promoter TF gene genes. TF

promoter target gene 2 cis and trans eQTLs

• Cis eQTLs and its downstream effects, leading to trans eQTLs cis and trans eQTLs

• In tradional eQTL mapping, cis eQTLs as “local” eQTLs and trans eQTLs as “distant” eQTLs

• However, to be precise, not all cis eQTLs are “local” and not all trans eQTLs are “distant” (Rockman & Kruglyak, Nature Reviews Genecs 2006). – Precisely, cis eQTLs = cis regulatory variaon – Precisely, trans eQTLs = trans regulatory variaon Not All Cis eQTLs are Cis-regulatory Variaon Distant Regulatory Variaon

Affected gene expression

Pleiotropic effects of the Mutaon in the coding region regulatory of the regulator variaon

Mutaon in the regulatory region of the regulator Data for Allele-Specific eQTL Mapping

• Data preparaon – Gene expression data: Allele-specific expression data can be measured only if the exons for the transcript contain heterozygous loci

Allele-specific expression can be measured only for individuals 1 and 2 Data for Allele-Specific eQTL Mapping

• Data preparaon – Genotype data: cis-regulatory effects can be assessed only if the allele- specific expression is measured AND the locus is heterozygous

Cis-regulatory effects can be assessed only for individual 1

Haplotype informaon btween SNPs and allele- specific transcript should be resolved Allele-specific eQTLs

• Cis eQTLs = cis-regulatory • Trans eQTLs = trans- variaon regulatory variaon – Linear model with respect to – Linear model with respect to the alleles Summary

• Cis/trans regulatory elements and the effects of their variaon on gene expression

• How can we learn the underlying transcriponal network that consist of cis and trans regulatory elements? – Cis/trans regulatory divergence can be assessed based on allele- specific expressions of two parent species and their F1 hybrid. However, this approach has limited power – Cis/trans regulatory variaon and their effects can be assessed in populaon genome sequence and allele-specific expression data