https://www.alphaknockout.com

Mouse Bag5 Conditional Knockout Project (CRISPR/Cas9)

Objective: To create a Bag5 conditional knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering.

Strategy summary: The Bag5 (NCBI Reference Sequence: NM_027404 ; Ensembl: ENSMUSG00000049792 ) is located on Mouse 12. 2 exons are identified, with the ATG start codon in exon 2 and the TGA stop codon in exon 2 (Transcript: ENSMUST00000054636). Exon 2 will be selected as conditional knockout region (cKO region). Deletion of this region should result in the loss of function of the Mouse Bag5 gene. To engineer the targeting vector, homologous arms and cKO region will be generated by PCR using BAC clone RP24-126L5 as template. Cas9, gRNA and targeting vector will be co-injected into fertilized eggs for cKO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note:

Exon 2 covers 100.0% of the coding region. Start codon is in exon 2, and stop codon is in exon 2. The size of effective cKO region: ~2675 bp. The cKO region does not have any other known gene.

Page 1 of 7 https://www.alphaknockout.com

Overview of the Targeting Strategy

Wildtype allele 5' gRNA region gRNA region 3'

1 1 2 Targeting vector

Targeted allele

Constitutive KO allele (After Cre recombination)

Legends Homology arm Exon of mouse Apopt1 Exon of mouse Bag5 cKO region loxP site

Page 2 of 7 https://www.alphaknockout.com

Overview of the Dot Plot Window size: 10 bp

Forward Reverse Complement

Sequence 12

Note: The sequence of homologous arms and cKO region is aligned with itself to determine if there are tandem repeats. No significant tandem repeat is found in the dot plot matrix. So this region is suitable for PCR screening or sequencing analysis.

Overview of the GC Content Distribution Window size: 300 bp

Sequence 12

Summary: Full Length(7341bp) | A(24.37% 1789) | C(24.12% 1771) | T(25.66% 1884) | G(25.84% 1897)

Note: The sequence of homologous arms and cKO region is analyzed to determine the GC content. Significant high GC-content regions are found. It may be difficult to construct this targeting vector.

Page 3 of 7 https://www.alphaknockout.com

BLAT Search Results (up)

QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ------browser details YourSeq 3000 1 3000 3000 100.0% chr12 - 111711388 111714387 3000 browser details YourSeq 276 128 543 3000 92.1% chr15 + 79274635 79275142 508 browser details YourSeq 266 128 511 3000 92.4% chr16 + 14484329 14484753 425 browser details YourSeq 266 128 510 3000 92.4% chr13 + 97004466 97004889 424 browser details YourSeq 265 143 510 3000 93.8% chr17 + 34163473 34163968 496 browser details YourSeq 263 131 511 3000 93.8% chr15 + 8144606 8145169 564 browser details YourSeq 261 128 517 3000 92.5% chr16 - 96074128 96074762 635 browser details YourSeq 247 128 544 3000 91.9% chr11 - 87413067 87413527 461 browser details YourSeq 243 136 543 3000 93.0% chr12 - 84404425 84405052 628 browser details YourSeq 222 127 517 3000 91.4% chr17 + 46174741 46175136 396 browser details YourSeq 207 136 512 3000 91.2% chrX - 153363861 153364467 607 browser details YourSeq 207 129 494 3000 92.6% chr16 + 22247736 22248227 492 browser details YourSeq 196 172 511 3000 92.6% chr2 - 180941017 180941471 455 browser details YourSeq 184 308 543 3000 88.0% chr4 - 128847636 128847854 219 browser details YourSeq 180 324 543 3000 89.8% chr7 - 27767370 27767573 204 browser details YourSeq 178 324 543 3000 89.3% chr7 - 35326305 35326508 204 browser details YourSeq 178 324 543 3000 89.3% chr4 - 151777027 151777230 204 browser details YourSeq 178 324 543 3000 89.3% chr2 + 112236454 112236657 204 browser details YourSeq 177 324 543 3000 93.2% chr7 + 92751952 92752554 603 browser details YourSeq 176 221 516 3000 88.1% chr12 - 85976199 85976452 254

Note: The 3000 bp section upstream of Exon 2 is BLAT searched against the genome. No significant similarity is found.

BLAT Search Results (down)

QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ------browser details YourSeq 3000 1 3000 3000 100.0% chr12 - 111707047 111710046 3000 browser details YourSeq 165 1340 1525 3000 97.2% chr1 + 15786082 15786267 186 browser details YourSeq 152 1360 1520 3000 97.6% chrX - 152331010 152331178 169 browser details YourSeq 152 1363 1526 3000 97.0% chr7 + 81505203 81505383 181 browser details YourSeq 151 1360 1525 3000 95.6% chr6 - 48787219 48787381 163 browser details YourSeq 151 1359 1520 3000 96.9% chr2 - 164084989 164085152 164 browser details YourSeq 151 1360 1515 3000 98.8% chr2 - 112517658 112517836 179 browser details YourSeq 150 1368 1526 3000 98.2% chr3 - 96762337 96762503 167 browser details YourSeq 150 1335 1507 3000 95.2% chr2 + 144260026 144260199 174 browser details YourSeq 149 1359 1520 3000 96.3% chr3 + 58580282 58580455 174 browser details YourSeq 147 1357 1507 3000 97.4% chr9 - 106845771 106845920 150 browser details YourSeq 146 1356 1507 3000 96.7% chr7 - 13068933 13069083 151 browser details YourSeq 146 1358 1507 3000 97.4% chr2 - 119392187 119392335 149 browser details YourSeq 146 1359 1526 3000 94.4% chr3 + 69478829 69478994 166 browser details YourSeq 146 1358 1507 3000 97.4% chr19 + 44101318 44101466 149 browser details YourSeq 146 1358 1507 3000 98.7% chr19 + 34823129 34823278 150 browser details YourSeq 145 1359 1507 3000 98.7% chr2 - 3495206 3495354 149 browser details YourSeq 145 1359 1507 3000 97.3% chr19 - 23098238 23098385 148 browser details YourSeq 145 1357 1507 3000 98.1% chr13 - 55255756 55255906 151 browser details YourSeq 145 1359 1507 3000 98.7% chr8 + 83950578 83950726 149

Note: The 3000 bp section downstream of Exon 2 is BLAT searched against the genome. No significant similarity is found.

Page 4 of 7 https://www.alphaknockout.com

Gene and information: Bag5 BCL2-associated athanogene 5 [ Mus musculus (house mouse) ] Gene ID: 70369, updated on 12-Aug-2019

Gene summary

Official Symbol Bag5 provided by MGI Official Full Name BCL2-associated athanogene 5 provided by MGI Primary source MGI:MGI:1917619 See related Ensembl:ENSMUSG00000049792 Gene type protein coding RefSeq status VALIDATED Organism Mus musculus Lineage Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus Also known as Bag-5; 1600025G07Rik; 4930405J06Rik Expression Biased expression in testis adult (RPKM 180.4), CNS E11.5 (RPKM 14.4) and 6 other tissues See more Orthologs human all

Genomic context

Location: 12; 12 F1 See Bag5 in Genome Data Viewer

Exon count: 3

Annotation release Status Assembly Chr Location

108 current GRCm38.p6 (GCF_000001635.26) 12 NC_000078.6 (111709488..111713858, complement)

Build 37.2 previous assembly MGSCv37 (GCF_000001635.18) 12 NC_000078.5 (112947699..112951467, complement)

Chromosome 12 - NC_000078.6

Page 5 of 7 https://www.alphaknockout.com

Transcript information: This gene has 4 transcripts

Gene: Bag5 ENSMUSG00000049792

Description BCL2-associated athanogene 5 [Source:MGI Symbol;Acc:MGI:1917619] Gene Synonyms 1600025G07Rik, 4930405J06Rik Location Chromosome 12: 111,709,488-111,713,257 reverse strand. GRCm38:CM001005.2 About this gene This gene has 4 transcripts (splice variants), 196 orthologues, 2 paralogues and is a member of 1 Ensembl protein family. Transcripts

Name Transcript ID bp Protein Translation ID Biotype CCDS UniProt Flags

Bag5-201 ENSMUST00000054636.6 2139 447aa ENSMUSP00000051049.6 Protein coding CCDS26185 Q8CI32 TSL:1 GENCODE basic APPRIS P1

Bag5-202 ENSMUST00000160576.1 1852 447aa ENSMUSP00000125183.1 Protein coding CCDS26185 Q8CI32 TSL:1 GENCODE basic APPRIS P1

Bag5-204 ENSMUST00000162953.1 447 37aa ENSMUSP00000124861.1 Protein coding - E0CX76 CDS 3' incomplete TSL:3

Bag5-203 ENSMUST00000160825.1 445 59aa ENSMUSP00000123839.1 Protein coding - E0CYW3 CDS 3' incomplete TSL:3

23.77 kb Forward strand 111.70Mb 111.71Mb 111.72Mb Gm36635-201 >lncRNA Apopt1-201 >protein coding (Comprehensive set...

Apopt1-204 >protein coding

Apopt1-202 >nonsense mediated decay

Apopt1-203 >protein coding

Contigs < AC152065.4

Genes < Bag5-201protein coding (Comprehensive set...

< Bag5-202protein coding

< Bag5-203protein coding

< Bag5-204protein coding

Regulatory Build

111.70Mb 111.71Mb 111.72Mb Reverse strand 23.77 kb

Regulation Legend CTCF Enhancer Promoter Promoter Flank Transcription Factor Binding Site

Gene Legend Protein Coding

merged Ensembl/Havana Ensembl protein coding

Non-Protein Coding

processed transcript RNA gene

Page 6 of 7 https://www.alphaknockout.com

Transcript: ENSMUST00000054636

< Bag5-201protein coding

Reverse strand 3.77 kb

ENSMUSP00000051... PDB-ENSP mappings Low complexity (Seg) Coiled-coils (Ncoils) Superfamily BAG domain superfamily SMART BAG domain Pfam BAG domain PROSITE profiles BAG domain PANTHER PTHR12329:SF2

Molecular chaperone regulator BAG Gene3D BAG domain superfamily

All sequence SNPs/i... Sequence variants (dbSNP and all other sources)

Variant Legend missense variant synonymous variant

Scale bar 0 40 80 120 160 200 240 280 320 360 400 447

We wish to acknowledge the following valuable scientific information resources: Ensembl, MGI, NCBI, UCSC.

Page 7 of 7