Mouse Ccdc70 Knockout Project (CRISPR/Cas9)
https://www.alphaknockout.com

Objective:
To create a Ccdc70 knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering.

Strategy summary:
The Ccdc70 gene (NCBI Reference Sequence: NM_026459; Ensembl: ENSMUSG00000017049) is located on Mouse chromosome 8.
2 exons are identified, with the ATG start codon in exon 2 and the TAG stop codon in exon 2 (Transcript: ENSMUST00000017193).
Exon 2 will be selected as target site.
Cas9 and gRNA will be co-injected into fertilized eggs for KO Mouse production.
The pups will be genotyped by PCR followed by sequencing analysis.

Note:
Exon 2 starts from about 0.15% of the coding region.
Exon 2 covers 100.0% of the coding region.
The size of effective KO region: ~669 bp.
The KO region does not have any other known gene.

Overview of the Targeting Strategy
[Figure: wildtype allele drawn 5' to 3' with exons 1 and 2 and two gRNA regions marked. Legends: exon of mouse Ccdc70; knockout region.]

Overview of the Dot Plot (up)
Window size: 15 bp
[Figure: dot plot. Legend: Forward; Reverse Complement.]
Note: The 2000 bp section upstream of start codon is aligned with itself to determine if there are tandem repeats. Tandem repeats are found in the dot plot matrix. The gRNA site is selected outside of these tandem repeats.

Overview of the Dot Plot (down)
Window size: 15 bp
[Figure: dot plot. Legend: Forward; Reverse Complement.]
Note: The 2000 bp section downstream of stop codon is aligned with itself to determine if there are tandem repeats. No significant tandem repeat is found in the dot plot matrix. So this region is suitable for PCR screening or sequencing analysis.

Overview of the GC Content Distribution (up)
Window size: 300 bp
[Figure: GC content distribution plot.]
Summary: Full Length (2000 bp)
Base  Percent  Count
A     22.95%   459
C     25.55%   511
T     33.35%   667
G     18.15%   363
Note: The 2000 bp section upstream of start codon is analyzed to determine the GC content.
No significant high GC-content region is found. So this region is suitable for PCR screening or sequencing analysis.

Overview of the GC Content Distribution (down)
Window size: 300 bp
[Figure: GC content distribution plot.]
Summary: Full Length (2000 bp)
Base  Percent  Count
A     29.8%    596
C     19.6%    392
T     24.25%   485
G     26.35%   527
Note: The 2000 bp section downstream of stop codon is analyzed to determine the GC content. No significant high GC-content region is found. So this region is suitable for PCR screening or sequencing analysis.

BLAT Search Results (up)
QUERY    SCORE  START  END   QSIZE  IDENTITY  CHROM  STRAND  START      END        SPAN
---------------------------------------------------------------------------------------
YourSeq  2000   1      2000  2000   100.0%    chr8   +       21971197   21973196   2000
YourSeq  55     232    286   2000   100.0%    chr15  -       31055375   31055429   55
YourSeq  42     676    734   2000   97.8%     chr18  -       70402674   70402732   59
YourSeq  38     648    694   2000   88.1%     chr15  -       53122222   53122266   45
YourSeq  38     606    648   2000   95.4%     chr8   +       117959005  117959048  44
YourSeq  38     636    690   2000   82.3%     chr14  +       110096179  110096229  51
YourSeq  38     626    679   2000   95.5%     chr11  +       91078027   91078088   62
YourSeq  37     622    675   2000   85.2%     chr16  -       75617684   75617739   56
YourSeq  37     636    690   2000   76.2%     chr15  -       31055843   31055888   46
YourSeq  37     737    811   2000   67.5%     chr14  -       68040365   68040408   44
YourSeq  36     600    660   2000   66.7%     chr1   +       5066486    5066527    42
YourSeq  34     639    683   2000   92.4%     chr15  -       36106341   36106385   45
YourSeq  33     598    692   2000   64.2%     chr11  -       64947092   64947140   49
YourSeq  32     616    660   2000   72.3%     chr6   -       30907886   30907921   36
YourSeq  31     646    703   2000   97.1%     chr11  +       36650562   36650623   62
YourSeq  29     608    660   2000   65.7%     chr16  -       71911695   71911726   32
YourSeq  28     736    784   2000   96.7%     chrX   -       97549226   97549276   51
YourSeq  25     634    659   2000   100.0%    chr12  -       43548555   43548754   200
YourSeq  24     232    255   2000   100.0%    chr12  -       92224059   92224082   24
Note: The 2000 bp section upstream of start codon is BLAT searched against the genome. No significant similarity is found.

BLAT Search Results (down)
QUERY    SCORE  START  END   QSIZE  IDENTITY  CHROM  STRAND  START      END        SPAN
---------------------------------------------------------------------------------------
YourSeq  2000   1      2000  2000   100.0%    chr8   +       21973866   21975865   2000
YourSeq  25     1123   1164  2000   81.0%     chrX   -       39118886   39118928   43
YourSeq  25     345    369   2000   100.0%    chr1   +       73635251   73635275   25
YourSeq  24     1150   1175  2000   88.0%     chr1   +       106619170  106619194  25
YourSeq  22     728    749   2000   100.0%    chr4   -       100448571  100448592  22
Note: The 2000 bp section downstream of stop codon is BLAT searched against the genome. No significant similarity is found.

Gene and protein information:
Ccdc70 coiled-coil domain containing 70 [ Mus musculus (house mouse) ]
Gene ID: 67929, updated on 24-Oct-2019

Gene summary
Official Symbol: Ccdc70 (provided by MGI)
Official Full Name: coiled-coil domain containing 70 (provided by MGI)
Primary source: MGI:MGI:1915179
See related: Ensembl:ENSMUSG00000017049
Gene type: protein coding
RefSeq status: VALIDATED
Organism: Mus musculus
Lineage: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
Also known as: 1700112P19Rik
Expression: Restricted expression toward testis adult (RPKM 22.2)
Orthologs: human, all

Genomic context
Location: 8; 8 A2
Exon count: 3
Annotation release  Status             Assembly                      Chr  Location
108                 current            GRCm38.p6 (GCF_000001635.26)  8    NC_000074.6 (21969775..21974055)
Build 37.2          previous assembly  MGSCv37 (GCF_000001635.18)    8    NC_000074.5 (23081068..23084513)
[Figure: Chromosome 8 - NC_000074.6]

Transcript information:
This gene has 2 transcripts

Gene: Ccdc70 ENSMUSG00000017049
Description: coiled-coil domain containing 70 [Source:MGI Symbol;Acc:MGI:1915179]
Gene Synonyms: 1700112P19Rik
Location: Chromosome 8: 21,969,775-21,974,041 forward strand. GRCm38:CM001001.2
About this gene: This gene has 2 transcripts (splice variants), 88 orthologues, 4 paralogues and is a member of 1 Ensembl protein family.

Transcripts
Name        Transcript ID         bp    Protein  Translation ID        Biotype         CCDS       UniProt  Flags
Ccdc70-201  ENSMUST00000017193.1  1054  223aa    ENSMUSP00000017193.1  Protein coding  CCDS22167  Q9D9B0   TSL:1, GENCODE basic, APPRIS P1
Ccdc70-202  ENSMUST00000070649.1  961   223aa    ENSMUSP00000069249.1  Protein coding  CCDS22167  Q9D9B0   TSL:1, GENCODE basic, APPRIS P1

[Figure: region view, 24.27 kb, 21.96 Mb to 21.98 Mb, forward and reverse strands. Tracks: Genes (Comprehensive set...): Fam90a1a-201, Ccdc70-202, Ccdc70-201, all protein coding; Contigs: AL590619.9; Regulatory Build. Regulation Legend: CTCF; Promoter Flank. Gene Legend: Protein Coding (merged Ensembl/Havana; Ensembl protein coding).]

Transcript: ENSMUST00000017193
[Figure: transcript view, 3.45 kb, forward strand, Ccdc70-201 (protein coding), ENSMUSP00000017... Tracks: MobiDB lite; Low complexity (Seg); Coiled-coils (Ncoils); PANTHER (PTHR21533:SF22, PTHR21533); All sequence SNPs/i...: Sequence variants (dbSNP and all other sources). Variant Legend: missense variant; synonymous variant. Scale bar: 0 to 223.]

We wish to acknowledge the following valuable scientific information resources: Ensembl, MGI, NCBI, UCSC.
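
The GC content summaries reported above can be checked from the per-base counts, and the 300 bp windowed profile can be sketched in a few lines. The following is a minimal illustration, not the vendor's actual pipeline: the function names `gc_percent` and `sliding_gc` are hypothetical, the step size is an assumption (the report states only the window size), and no "high GC" threshold is applied because the report does not state one.

```python
# Per-base counts copied from the two "Summary: Full Length (2000 bp)" tables.
UPSTREAM = {"A": 459, "C": 511, "T": 667, "G": 363}
DOWNSTREAM = {"A": 596, "C": 392, "T": 485, "G": 527}


def gc_percent(counts):
    """Overall GC content (%) from a dict of per-base counts."""
    total = sum(counts.values())
    return 100.0 * (counts["G"] + counts["C"]) / total


def sliding_gc(seq, window=300, step=1):
    """GC content (%) of each `window`-bp window along `seq`.

    `window=300` matches the window size stated in the report;
    `step` is an assumption made for this sketch.
    """
    seq = seq.upper()
    profile = []
    for start in range(0, len(seq) - window + 1, step):
        chunk = seq[start:start + window]
        gc = chunk.count("G") + chunk.count("C")
        profile.append(100.0 * gc / window)
    return profile


if __name__ == "__main__":
    for label, counts in (("upstream", UPSTREAM), ("downstream", DOWNSTREAM)):
        print(label, sum(counts.values()), "bp, GC = %.2f%%" % gc_percent(counts))
```

From the reported counts, both flanks sum to 2000 bp, with overall GC content of 43.70% upstream and 45.95% downstream. `sliding_gc` needs the actual flank sequence, which the report does not include.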
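
The ~669 bp size quoted for the effective KO region can be cross-checked against the BLAT results, since the first row of each BLAT table gives the chr8 position of the 2000 bp flank upstream of the start codon and downstream of the stop codon. The sketch below assumes that the interval between those two flanks corresponds to the KO region and that the coordinates are 1-based and inclusive (consistent with each flank's SPAN of 2000); the report does not state how the size was derived, and `gap_between` is a hypothetical helper.

```python
# (chrom, start, end) of the self-hit rows in the BLAT tables (1-based, inclusive).
UP_FLANK = ("chr8", 21971197, 21973196)    # 2000 bp upstream of the start codon
DOWN_FLANK = ("chr8", 21973866, 21975865)  # 2000 bp downstream of the stop codon


def gap_between(up, down):
    """Return (start, end, length) of the interval lying between two flanks."""
    if up[0] != down[0]:
        raise ValueError("flanks are on different chromosomes")
    start = up[2] + 1    # first base after the upstream flank
    end = down[1] - 1    # last base before the downstream flank
    if end < start:
        raise ValueError("flanks overlap or are out of order")
    return start, end, end - start + 1


if __name__ == "__main__":
    start, end, length = gap_between(UP_FLANK, DOWN_FLANK)
    print("chr8:%d-%d, %d bp" % (start, end, length))
```

Under these assumptions the interval is chr8:21973197-21973865, which is 669 bp long and matches the reported size. It also equals 223 codons x 3 nt for the 223 aa protein, although whether the stop codon falls inside the interval is not stated in the report.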