3/2/20

Outline

• History of the

Mouse Genetics • Mouse strains

Heather A Lawson • Genetic mapping Department of Genetics • How do we find ? Spring 2020 • Genetic Engineering • How do we analyze function?

Why Mice?

• 95% of their genome is similar to our own Three Main Subspecies: • Mice have short generation times and lifespans that make musculus castaneus (southern and southeastern them easy to manage Asia) • Variety of inbred strains and genetic backgrounds give us Mus musculus domesticus (western , the opportunity to closely examine the effects of genes and southwestern Asia, Americas, Africa, and ) their interactions Mus musculus musculus ( and northern • We have the technology to manipulate their genome Asia) directly and create models for human diseases • We have control over their environment

Origin of Modern Classical Strains History of Mouse Genetics

~1850 Gregor Mendel begins genetic studies of coat colors in mice but turns to plants when the Bishop of his Abbey, Anton Schaffgoth, is critical of the requisite copulation.

1 3/2/20

Fancy Mice Become Lab Mice Husbandry Basics

• 18-21 day gestation period

~1900 Abbie Lathrop 1908 William Castle opens • 21 day weaning breeds fancy mice at Harvard’s Bussy Institution Granby Farm, MA • Sexual maturity 6-8 weeks

• Average birthweight: 1g

• Average weaning weight: 8-12g

Rex • Average adult weight: 20-30g Golden curly Angora Hereford

History of Mouse Genetics Inbred Strain

• Defined as the product of >20 generations of brother-sister matings 1909 Clarence Little • Individuals are ~98% identical begins to develop the first • After 40 generations of inbreeding, they inbred strain, designated are ~99.5% similar DBA for dilute, brown, • Inbreeding coefficient ~0.99 non-agouti. • maintained at different institutions for many generations may show genetic drift

Inbred Strain History of Mouse Genetics

x

AaBbCc AaBbCc

x

AABbCc AABbCc 1915 J.B.S. Haldane et al. publish the first genetic linkage study in mice. x

AABBCc AABBCc

x

AABBCC AABBCC

Inbred Strain

2 3/2/20

Linkage Mapping History of Mouse Genetics

• Linkage refers to the presence of 2 or more genes on same • Tendency for traits to be inherited together • A linkage map is a genetic map showing the position of genes or genetic markers relative to each other 1926 Clarence Little starts • In terms of meiotic recombination frequency the Jackson Laboratory in Bar Harbor, Maine.

LG/J www.jax.org

Substrains Recombinant Inbred Strain

• Branches of an inbred strain that have known or probable genetic differences • Branches of a strain that are separated before • are alternating th haplotypes of variable length that the 40 generation of inbreeding are inherited intact from the parents • Branches of a strain that have been maintained separately from other branches • Animals within a strain are for more than 10 generations of inbreeding genetically identical and • Example: C57B6 from different vendors are interchangeable NOT equivalent

3 3/2/20

Phenotypic Variation Among RI lines Examine RI Haplotype Structure

High Fat Fed Females High Fat Fed Males LGXSM recombinant inbred line set - 16 RI lines

Lines vary in progress towards diabetes

Low Fat Fed Females

Low Fat Fed Males - Allows studies of genetic LOD factors

Cheverud et al., Diabetes 2004 Toth et al., Pathogens and Disease 2014

Advanced Intercross QTL Mapping

• Statistically links two types of information 1. Phenotypic data • Variation in a measurable quantitative trait (e.g. height, weight, expression level) 2. Genotypic data • Variation at molecular markers • Attempt to explain genetic basis of variation in complex traits

General Experimental Method QTL Mapping: correlating phenotypic variation with genotypic variation population measure phenotype 9 QTL Position 8 7 significance

LOD 6 5 4 3 Significance M1 M2 M3 M4 M… Mn 2 Threshold LOD Scores = log(1/p) 1 Support Region genotype individuals 0 at Genomic Markers 0 50 100 * 150 200 (cM) extract DNA

4 3/2/20

QTL: region of the genome affecting LG/J X SM/J quantitative traits

chr1 (qH2.3-qH3) 1qA1 qA3 1qA5 1qB qC1.1 C2 1qC3 C4 qC5 1qD E2.1 1qE2.3 1qE4 1qF H3 H4 H5 qH6

F1 LGxSM polymorphisms My Track RefSeq Genes Nuf2 Rgs4 Hsd17b7 Uap1 Olfml2b Dusp12 Fcgr3 Nr1i3 Nit1 Itln1 Rgs5 Ddr2 Sh2d1b1 Atf6 Fcgr4 Apoa2 Pvrl4 Cd244 1700084C01Rik Uhmk1 Fcrlb Sdhc Ufc1 F11r Ly9 Sh2d1b2 Fcrla Mpz Dedd Refbp2 1700015E13Rik Fcrla Pcp4l1 Pfdn2 Nos1ap Fcrla Tomm40l Pvrl4 Nos1ap Fcgr2b Fcer1g Usf1 Attribute F Fcgr2b Ndufs2 Tstd1 2 1700009P17Rik Ppox variation in Adamts4 B4galt3 phenotypes to SS LS LL Usp21 Dedd variation in QTL Klhdc9 Arhgap30 genotypes 30-Way Multiz Alignment & Conservation Cons

Fn Fn Fn Fn Fn Fn Fn Fn Interbreeding Leads to Accumulation of Recombination (variation at loci)

5 3/2/20

Modified Quantitative Hybrid Complementation Test

Identifying Genes and X Variants

LG SM KO WT

Identifying QTL

LG KO LG WT SM KO SM WT

QTL Limitations History of Mouse Genetics

The population used defines the genetic variation

ÞWe cannot find loci that are not variable Late 1940s George Snell develops ÞLocation: the actual gene may be far away congenic strains of mice by breeding for differences only at the H2 . ÞSome loci we find might be confounded (correlation with other traits)

ÞInteraction effects: if the effect of a locus differs due to interactions with other loci, environment or phenotype, it can cancel out ÞGenetic background!

Consomic and Congenic Strains T2dm1: A Fasting Plasma Glucose QTL on Chromosome 16

Parentals • Identify an interval of - LOD = 4.4 (p=3e-5) interest. F1 Offspring - Accounts for ~13% of • Backcross to generate variance in the F2 population F1 Gametes congenic strains carrying “Backcross” the interval of interest. - B6 |B6 and BTBR | BTBR homozygotes differ by ~120 mg/dl glucose F2 : Stoehr et al., (2000) Diabetes : Fn

6 3/2/20

Generation of Consomic Mice Figure 1. Chromosome 16 of BTBR mice contains diabetogenic alleles. 16 16

X

B6 (receiver) BBTBR (donor)

Repeated Backcrossing to Receiver Strain

16

B6.16BT Bhatnagar et al., (2013) PLOS Genetics

Generation of Congenic Mice Figure 3. Effect on insulin secretion of introgressing 1.6 Mb of BTBR Chr 16 into B6 mice.

16 16

X

B6 (receiver) BTBR (donor) Repeated Backcrossing to Receiver Strain 16 16 16

Bhatnagar et al., (2013) PLOS Genetics B6.16BT24-37 B6.16BT36-38 B6.16BT37-55

Figure 4. Plasma insulin and glucose levels in the B6.16BT36–38 male mice. Figure 6. Insulin secretion defective region narrowed to 0.94 Mb on mouse chromosome 16 containing 13 genes.

Bhatnagar et al., (2013) PLOS Genetics Bhatnagar et al., (2013) PLOS Genetics

7 3/2/20

Genome Tagged Mice Heterogeneous Stock

Eight strains - C57BL/6J, BALB/c, RIII, AKR, DBA/2, I, A, and C3H/2

Maintained for >60 generations

Average recombination distance <4Mb

The Collaborative Cross: Multi-parental Recombinant Inbred The Collaborative Cross Population “Funnel” mating scheme

Eight strains - A/J, C57BL/6J, 129Sv/ImJ, NOD/LtJ, NZO/H1J, CAST/EiJ, PWK/PhJ, and WSB/EiJ

Spans 90% of segregating variation in laboratory mice.

Churchill et al., Nature Genetics 2004

Diversity Outbred Breeding Schemes for Outbred Populations

Solberg-Woods, Physiological Genomics 2014

8 3/2/20

CC Recombinant Intercross (ccRIX) Panel Overcoming Loss Hybrid Mouse Diversity Panel of Heterozygosity

•Panel of 100 strains •including ‘classic’ inbred strains and RI lines

•Select strains based on experimental needs

•Detailed genomic and phenotypic information for all strains is publically available

Churchill et al., Nature Genetics 2004

Variation in Insulin Resistance Among Strains in HMD Differences in Resolution of GWAS of HDL Cholesterol Panel

Parks et al., Cell Metabolism 2015

Flint and Eskin Nature Genetics 2012

History of Mouse Genetics Mouse Genome Statistics: Genes, Genome Features and Maps

• 20 haploid chromosome (19, X, Y)

• Diploid DNA content: 2.7x109 bp 2001-02 Sequencing of ‘the’ mouse genome is completed • 8-10% repetitive sequence

• 47,116 Genes with nucleotide sequence data

• 24,211 Genes with sequence data

• 18,415 Genes with expression assay results

• 150,339 Mapped genes/markers

9 3/2/20

Mouse Genomic Sequence Reveals Great www.informatics.jax.org/humanDisease.shtml Similarity with the

Extremely high conservation: 560,000 “anchors”

Mouse-Human Comparison both genomes 2.5-3 billion bp long > 99% of genes have homologs > 95% of genome “syntenic”

Whole Genome Sequences Can Inform Quantitative Trait Gene Identifying Variants in Apoa2 To Follow-up

Identification Apoa2

Wang et al. Genome Research (2009)

SM M K L L A M V A L L V T I C S L E G A L V K R Q A D G Q D M Q S L F T Q Y F Q S M T D Y G K D L M E K A K T S E I Q S Q A K A Y F E K T H E Q L T P L V R S A G T S L V N F F S S L M N L E E K P A P A A K

AKR M K L L A M V A L L V T I C S L E G A L V K R Q A D G P D M Q S L F T Q Y F Q S M T E Y G K D L V E K A K T S E I Q S Q A K A Y F E K T H E Q L T P L V R S A G T S L V N F F S S L M N L E E K P A P A A K

A_J M K L L A M V A L L V T I C S L E G A L V K R Q A D G Q D M Q S L F T Q Y F Q S M T E Y G K D L V E K A K T S E I Q S Q A K A Y F E K T H E Q L T P L V R S A G T S L V N F F S S L M N L E E K P A P A A K

C57BL M K L L A M V A L L V T I C S L E G A L V K R Q A D G P D M Q S L F T Q Y F Q S M T D Y G K D L M E K A K T S E I Q S Q A K A Y F E K T H E Q L T P L V R S A G T S L V N F F S S L M N L E E K P A P A A K

CAST M K L L A M V A L L V T I C S L E G A L V K R Q A D G P D M Q S L F T Q Y F Q S M T D Y G K D L M E K A K T S E I Q S Q A K A Y F E K T H E Q L T P L V R S A G T S L V N F F S S L M N L E E K P A P A A K

CBA M K L L A M V A L L V T I C S L E G A L V K R Q A D G P D M Q S L F T Q Y F Q S M T D Y G K D L M E K A K T S E I Q S Q A K A Y F E K T H E Q L T P L V R S A G T S L V N F F S S L M N L E E K P A P A A K

DBA M K L L A M V A L L V T I C S L E G A L V K R Q A D G P D M Q S L F T Q Y F Q S M T D Y G K D L M E K A K T S E I Q S Q A K A Y F E K T H E Q L T P L V R S A G T S L V N F F S S L M N L E E K P A P A A K

NOD M K L L A M V A L L V T I C S L E G A L V K R Q A D G P D M Q S L F T Q Y F Q S M T D Y G K D L M E K A K T S E I Q S Q A K A Y F E K T H E Q L T P L V R S A G T S L V N F F S S L M N L E E K P A P A A K

NZO M K L L A M V A L L V T I C S L E G A L V K R Q A D G Q D M Q S L F T Q Y F Q S M T E Y G K D L V E K A K T S E I Q S Q A K A Y F E K T H E Q L T P L V R S A G T S L V N F F S S L M N L E E K P A P A A K

PWK M K L L A M V A L L V T I C S L E G A L V K R Q A D G Q D M Q S L F T Q Y F Q S M T E Y G K D L V E K A K T S E I Q S Q A K A Y F E K T H E Q L T P L V R S A G T S L V N F F S S L M N L E E K P A P A A K

LG M K L L A M V A L L V T I C S L E G A L V K R Q A D G P D M Q S L F T Q Y F Q S M T D Y G K D L M E K A K T S E I Q S Q V K A Y F E K T H E Q L T P L V R S A G T S L V N F F S S L M N L E E K P A P A A K

129P2 M K L L A M V A L L V T I C S L E G A L V K R Q A D G P D M Q S L F T Q Y F Q S M T E Y G K D L V E K A K T S E I Q S Q V K A Y F E K T H E Q L T P L V R S A G T S L V N F F S S L M N L E E K P A P A A K

129S1 M K L L A M V A L L V T I C S L E G A L V K R Q A D G P D M Q S L F T Q Y F Q S M T E Y G K D L V E K A K T S E I Q S Q V K A Y F E K T H E Q L T P L V R S A G T S L V N F F S S L M N L E E K P A P A A K

129S5 M K L L A M V A L L V T I C S L E G A L V K R Q A D G P D M Q S L F T Q Y F Q S M T E Y G K D L V E K A K T S E I Q S Q V K A Y F E K T H E Q L T P L V R S A G T S L V N F F S S L M N L E E K P A P A A K

BALB M K L L A M V A L L V T I C S L E G A L V K R Q A D G P D M Q S L F T Q Y F Q S M T E Y G K D L V E K A K T S E I Q S Q V K A Y F E K T H E Q L T P L V R S A G T S L V N F F S S L M N L E E K P A P A A K

C3H M K L L A M V A L L V T I C S L E G A L V K R Q A D G P D M Q S L F T Q Y F Q S M T E Y G K D L V E K A K T S E I Q S Q V K A Y F E K T H E Q L T P L V R S A G T S L V N F F S S L M N L E E K P A P A A K

LP_J M K L L A M V A L L V T I C S L E G A L V K R Q A D G P D M Q S L F T Q Y F Q S M T E Y G K D L V E K A K T S E I Q S Q V K A Y F E K T H E Q L T P L V R S A G T S L V N F F S S L M N L E E K P A P A A K

QTG to Candidate Quantitative History of Mouse Genetics – 2012 to Today Trait Nucleotide

• SNPs in nonIBD regions

•SNP in exon of Ptprz1 • Involved in bone formation • P à H • Predicted to be functionally damaging

Nikolskiy et al., BMC Genomics 2015

10 3/2/20

History of Mouse Genetics – 2014 to Today

The Mouse ENCODE Consortium

• Expression profiles of many mouse genes are divergent

• Cis-regulatory landscape is highly diverged between human and mouse • Varies between classes of elements • Tissue context

• Chromatin landscape is relatively stable between human and mouse

11