Mouse Chd8 Knockout Project (CRISPR/Cas9)
Total Page:16
File Type:pdf, Size:1020Kb
https://www.alphaknockout.com Mouse Chd8 Knockout Project (CRISPR/Cas9) Objective: To create a Chd8 knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Chd8 gene (NCBI Reference Sequence: NM_201637 ; Ensembl: ENSMUSG00000053754 ) is located on Mouse chromosome 14. 37 exons are identified, with the ATG start codon in exon 1 and the TGA stop codon in exon 37 (Transcript: ENSMUST00000089752). Exon 2~7 will be selected as target site. Cas9 and gRNA will be co-injected into fertilized eggs for KO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Homozygous null embryos are growth retarded starting at E5.5 and exhibit developmental arrest at E6.5. Mutants develop into an egg cylinder but do not form a primitive streak or mesoderm and exhibit increased apoptosis at E7.5. Exon 2 starts from about 10.9% of the coding region. Exon 2~7 covers 15.32% of the coding region. The size of effective KO region: ~8889 bp. The KO region does not have any other known gene. Page 1 of 9 https://www.alphaknockout.com Overview of the Targeting Strategy Wildtype allele 5' gRNA region gRNA region 3' 1 2 3 4 5 6 7 37 Legends Exon of mouse Chd8 Knockout region Page 2 of 9 https://www.alphaknockout.com Overview of the Dot Plot (up) Window size: 15 bp Forward Reverse Complement Sequence 12 Note: The 1228 bp section upstream of Exon 2 is aligned with itself to determine if there are tandem repeats. Tandem repeats are found in the dot plot matrix. The gRNA site is selected outside of these tandem repeats. Overview of the Dot Plot (down) Window size: 15 bp Forward Reverse Complement Sequence 12 Note: The 457 bp section downstream of Exon 7 is aligned with itself to determine if there are tandem repeats. No significant tandem repeat is found in the dot plot matrix. So this region is suitable for PCR screening or sequencing analysis. Page 3 of 9 https://www.alphaknockout.com Overview of the GC Content Distribution (up) Window size: 300 bp Sequence 12 Summary: Full Length(1228bp) | A(27.12% 333) | C(16.86% 207) | T(31.68% 389) | G(24.35% 299) Note: The 1228 bp section upstream of Exon 2 is analyzed to determine the GC content. No significant high GC-content region is found. So this region is suitable for PCR screening or sequencing analysis. Overview of the GC Content Distribution (down) Window size: 300 bp Sequence 12 Summary: Full Length(457bp) | A(24.29% 111) | C(20.79% 95) | T(32.6% 149) | G(22.32% 102) Note: The 457 bp section downstream of Exon 7 is analyzed to determine the GC content. No significant high GC-content region is found. So this region is suitable for PCR screening or sequencing analysis. Page 4 of 9 https://www.alphaknockout.com BLAT Search Results (up) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 1228 1 1228 1228 100.0% chr14 - 52235502 52236729 1228 browser details YourSeq 67 974 1040 1228 100.0% chr14 - 23979763 23979829 67 browser details YourSeq 54 993 1049 1228 98.3% chr10 + 64413648 64413718 71 browser details YourSeq 52 526 647 1228 86.2% chr16 + 10668269 10668391 123 browser details YourSeq 51 519 648 1228 90.5% chr16 + 53112681 53112811 131 browser details YourSeq 50 545 750 1228 93.2% chr10 - 12912581 12912970 390 browser details YourSeq 48 522 640 1228 92.9% chr4 + 117045831 117046203 373 browser details YourSeq 48 522 656 1228 88.8% chr18 + 74370485 74370620 136 browser details YourSeq 47 529 643 1228 91.3% chr16 - 87486752 87487068 317 browser details YourSeq 43 509 566 1228 87.8% chr5 + 123337422 123337484 63 browser details YourSeq 41 544 647 1228 95.6% chr10 - 84649946 84650050 105 browser details YourSeq 41 522 648 1228 93.7% chr3 + 88657625 88657752 128 browser details YourSeq 40 501 647 1228 97.7% chr3 + 87244893 87245042 150 browser details YourSeq 40 511 647 1228 82.3% chr13 + 87942463 87942606 144 browser details YourSeq 40 501 590 1228 71.1% chr11 + 114858912 114858999 88 browser details YourSeq 38 519 569 1228 88.0% chrX - 11952716 11952768 53 browser details YourSeq 37 507 560 1228 84.7% chr2 - 153477716 153477772 57 browser details YourSeq 37 519 569 1228 86.3% chr1 - 156287976 156288026 51 browser details YourSeq 36 487 558 1228 77.5% chr2 - 91949206 91949266 61 browser details YourSeq 36 515 562 1228 87.5% chr11 + 107635373 107635420 48 Note: The 1228 bp section upstream of Exon 2 is BLAT searched against the genome. No significant similarity is found. BLAT Search Results (down) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 457 1 457 457 100.0% chr14 - 52226156 52226612 457 browser details YourSeq 40 65 380 457 95.6% chr18 - 61416920 61417400 481 browser details YourSeq 25 199 227 457 81.5% chr1 + 178855141 178855167 27 browser details YourSeq 24 156 189 457 70.4% chr14 - 103437479 103437505 27 browser details YourSeq 23 219 243 457 96.0% chr19 - 18190725 18190749 25 browser details YourSeq 23 115 140 457 96.2% chr10 + 11975590 11975617 28 browser details YourSeq 22 208 229 457 100.0% chr2 - 143832946 143832967 22 browser details YourSeq 21 191 211 457 100.0% chr5 + 113483420 113483440 21 browser details YourSeq 20 146 165 457 100.0% chr5 - 4059834 4059853 20 browser details YourSeq 20 118 137 457 100.0% chr4 - 69664653 69664672 20 browser details YourSeq 20 204 225 457 95.5% chr13 - 96568296 96568317 22 browser details YourSeq 20 396 415 457 100.0% chr5 + 57776266 57776285 20 browser details YourSeq 20 355 374 457 100.0% chr11 + 22493212 22493231 20 Note: The 457 bp section downstream of Exon 7 is BLAT searched against the genome. No significant similarity is found. Page 5 of 9 https://www.alphaknockout.com Gene and protein information: Chd8 chromodomain helicase DNA binding protein 8 [ Mus musculus (house mouse) ] Gene ID: 67772, updated on 10-Oct-2019 Gene summary Official Symbol Chd8 provided by MGI Official Full Name chromodomain helicase DNA binding protein 8 provided by MGI Primary source MGI:MGI:1915022 See related Ensembl:ENSMUSG00000053754 Gene type protein coding RefSeq status REVIEWED Organism Mus musculus Lineage Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus Also known as Chd-8; Duplin; HELSNF1; AU015341; mKIAA1564; 5830451P18Rik Summary This gene encodes a member of the chromodomain-helicase-DNA binding protein family, which is characterized by a SNF2- Expression like domain and two chromatin organization modifier domains. The encoded protein also contains brahma and kismet domains, which is common to the subfamily of chromodomain-helicase-DNA binding proteins to which this protein belongs. In mammals, this gene has been shown to function in several processes including transcriptional regulation, epigenetic remodeling, promotion of cell proliferation, and regulation of RNA synthesis. Knockout of this gene causes early embryonic lethality due to widespread apoptosis. Heterozygous loss of function mutations result in autism spectrum disorder-like behaviors that include increased anxiety, repetitive behavior, and altered social behavior. [provided by RefSeq, Dec 2016] Orthologs Ubiquitous expression in CNS E11.5 (RPKM 11.5), thymus adult (RPKM 10.3) and 28 other tissues See more human all Genomic context Location: 14; 14 C2 See Chd8 in Genome Data Viewer Exon count: 40 Annotation release Status Assembly Chr Location 108 current GRCm38.p6 (GCF_000001635.26) 14 NC_000080.6 (52198151..52258042, complement) Build 37.2 previous assembly MGSCv37 (GCF_000001635.18) 14 NC_000080.5 (52817826..52857247, complement) Chromosome 14 - NC_000080.6 Page 6 of 9 https://www.alphaknockout.com Transcript information: This gene has 16 transcripts Gene: Chd8 ENSMUSG00000053754 Description chromodomain helicase DNA binding protein 8 [Source:MGI Symbol;Acc:MGI:1915022] Gene Synonyms 5830451P18Rik, Duplin Location Chromosome 14: 52,198,151-52,257,780 reverse strand. GRCm38:CM001007.2 About this gene This gene has 16 transcripts (splice variants), 187 orthologues, 32 paralogues, is a member of 1 Ensembl protein family and is associated with 8 phenotypes. Transcripts Name Transcript ID bp Protein Translation ID Biotype CCDS UniProt Flags Chd8- ENSMUST00000200169.5 8509 2582aa ENSMUSP00000142890.1 Protein coding CCDS36919 Q09XV5 TSL:5 212 GENCODE basic APPRIS P1 Chd8- ENSMUST00000089752.10 8190 2582aa ENSMUSP00000087184.4 Protein coding CCDS36919 Q09XV5 TSL:1 201 GENCODE basic APPRIS P1 Chd8- ENSMUST00000149975.8 3094 1031aa ENSMUSP00000122995.2 Protein coding - F7AL76 CDS 5' and 3' 209 incomplete TSL:5 Chd8- ENSMUST00000227897.1 415 138aa ENSMUSP00000154595.1 Protein coding - A0A2I3BRI1 CDS 5' and 3' 216 incomplete Chd8- ENSMUST00000226307.1 360 22aa ENSMUSP00000154509.1 Protein coding - A0A2I3BR97 CDS 3' incomplete 213 Chd8- ENSMUST00000226681.1 3695 No - Retained - - - 215 protein intron Chd8- ENSMUST00000147827.7 3543 No - Retained - - TSL:1 207 protein intron Chd8- ENSMUST00000147309.1 3350 No - Retained - - TSL:1 206 protein intron Chd8- ENSMUST00000199135.1 3329 No - Retained - - TSL:NA 211 protein intron Chd8- ENSMUST00000134329.1 2444 No - Retained - - TSL:1 203 protein intron Chd8- ENSMUST00000145404.1 911 No - Retained - - TSL:3 205 protein intron Chd8- ENSMUST00000136528.1 741 No - Retained - - TSL:3 204 protein intron Chd8- ENSMUST00000122823.1 674 No - Retained - - TSL:5 202 protein intron Chd8- ENSMUST00000226625.1 488 No - Retained - - - 214 protein intron Chd8- ENSMUST00000155614.1 428 No - Retained - - TSL:3 210 protein intron Chd8- ENSMUST00000149694.1 479 No - lncRNA - - TSL:5 208 protein Page 7 of 9 https://www.alphaknockout.com 79.63 kb Forward strand 52.20Mb 52.22Mb 52.24Mb 52.26Mb Genes Gm26590-201 >lncRNA Gm43766-201 >TEC (Comprehensive set... Gm26590-202 >lncRNA Contigs AC159323.3 > < AC126037.4 Genes (Comprehensive set..