Mouse Actr1b Knockout Project (CRISPR/Cas9)
https://www.alphaknockout.com

Objective:
To create an Actr1b knockout mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering.

Strategy summary:
The Actr1b gene (NCBI Reference Sequence: NM_146107; Ensembl: ENSMUSG00000037351) is located on mouse chromosome 1. Eleven exons are identified, with the ATG start codon in exon 1 and the TAA stop codon in exon 11 (Transcript: ENSMUST00000043951).
Exons 1~11 will be selected as the target site. Cas9 and gRNA will be co-injected into fertilized eggs for KO mouse production. The pups will be genotyped by PCR followed by sequencing analysis.

Note:
Exon 1 starts from about 0.09% of the coding region.
Exons 1~11 cover 100.0% of the coding region.
The size of the effective KO region: ~9773 bp.
The KO region does not contain any other known gene.

Overview of the Targeting Strategy
[Figure: wildtype allele drawn 5' to 3', showing exons 1 to 11 of mouse Actr1b with a gRNA region on each side of the knockout region. Legend: exon of mouse Actr1b; knockout region.]

Overview of the Dot Plot (up)
[Figure: self-alignment dot plot, window size 15 bp, showing forward and reverse-complement matches.]
Note: The 2000 bp section upstream of the start codon is aligned with itself to determine whether there are tandem repeats. Tandem repeats are found in the dot plot matrix. The gRNA site is selected outside of these tandem repeats.

Overview of the Dot Plot (down)
[Figure: self-alignment dot plot, window size 15 bp, showing forward and reverse-complement matches.]
Note: The 2000 bp section downstream of the stop codon is aligned with itself to determine whether there are tandem repeats. No significant tandem repeat is found in the dot plot matrix, so this region is suitable for PCR screening or sequencing analysis.
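The self-alignment behind the dot plots can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it uses exact matching of 15 bp windows, and the function names (`revcomp`, `self_dot_plot`) and the toy sequence are hypothetical. The report does not say which tool or matching rule produced its figures.

```python
from collections import defaultdict

def revcomp(seq):
    """Reverse complement of a DNA string (A/C/G/T/N only)."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}
    return "".join(comp[b] for b in reversed(seq.upper()))

def self_dot_plot(seq, window=15):
    """Compare every window of `seq` against every other window.

    Returns two lists of (i, j) window-start pairs:
      forward: window i is identical to window j (i != j), i.e. a direct repeat
      reverse: window i is the reverse complement of window j
    Assumption: exact matches only; real dot-plot tools may allow mismatches.
    """
    seq = seq.upper()
    n_windows = len(seq) - window + 1
    index = defaultdict(list)           # window text -> start positions
    for i in range(n_windows):
        index[seq[i:i + window]].append(i)

    forward, reverse = [], []
    for i in range(n_windows):
        word = seq[i:i + window]
        for j in index[word]:
            if j != i:                  # skip the trivial main diagonal
                forward.append((i, j))
        for j in index.get(revcomp(word), []):
            reverse.append((i, j))
    return forward, reverse

# Toy example (hypothetical sequence): one 15 bp unit repeated after a spacer.
unit = "ACGTTGCAAGGCTTA"
fwd, rev = self_dot_plot(unit + "TTTT" + unit, window=15)
print(len(fwd), "forward off-diagonal hits;", len(rev), "reverse-complement hits")
```

Off-diagonal forward hits correspond to direct repeats, which is the signal the notes above use when placing the gRNA site outside repeated sequence.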

Overview of the GC Content Distribution (up)
[Figure: GC content distribution, window size 300 bp.]
Summary: Full Length (2000 bp) | A (30.45%, 609) | C (23.25%, 465) | T (26.65%, 533) | G (19.65%, 393)
Note: The 2000 bp section upstream of the start codon is analyzed to determine the GC content. Significant high GC-content regions are found. The gRNA site is selected outside of these high GC-content regions.

Overview of the GC Content Distribution (down)
[Figure: GC content distribution, window size 300 bp.]
Summary: Full Length (2000 bp) | A (23.85%, 477) | C (25.2%, 504) | T (22.6%, 452) | G (28.35%, 567)
Note: The 2000 bp section downstream of the stop codon is analyzed to determine the GC content. No significant high GC-content region is found, so this region is suitable for PCR screening or sequencing analysis.

BLAT Search Results (up)
QUERY    SCORE  START  END   QSIZE  IDENTITY  CHROM  STRAND  START      END        SPAN
-----------------------------------------------------------------------------------------------
YourSeq  2000   1      2000  2000   100.0%    chr1   -       36709854   36711853   2000
YourSeq  100    82     272   2000   88.5%     chr11  +       54448307   54448544   238
YourSeq  97     95     270   2000   89.6%     chr10  +       76987418   77020818   33401
YourSeq  90     1      193   2000   76.9%     chrX   -       151796084  151796259  176
YourSeq  89     2      234   2000   85.0%     chr13  -       51795143   51795447   305
YourSeq  88     95     269   2000   87.8%     chr4   +       34378839   34379016   178
YourSeq  85     95     276   2000   92.9%     chr5   -       25757705   25757889   185
YourSeq  82     16     234   2000   91.2%     chr3   -       94781870   94782372   503
YourSeq  81     95     204   2000   87.2%     chr7   +       65650643   65650752   110
YourSeq  80     95     270   2000   92.7%     chr12  +       8942544    8942744    201
YourSeq  78     95     234   2000   88.3%     chr8   +       25310143   25310280   138
YourSeq  78     95     270   2000   86.8%     chr12  +       12770489   12770687   199
YourSeq  76     95     572   2000   75.6%     chr6   +       15360154   15360511   358
YourSeq  76     95     204   2000   90.5%     chr1   +       120138133  120138242  110
YourSeq  75     96     234   2000   86.4%     chr8   +       105962701  105962837  137
YourSeq  74     95     204   2000   86.3%     chr2   -       70220182   70220286   105
YourSeq  73     95     275   2000   84.8%     chr7   +       6279076    6279253    178
YourSeq  72     110    270   2000   93.1%     chr18  -       56283237   56283399   163
YourSeq  72     95     182   2000   88.6%     chr17  +       24858835   24858921   87
YourSeq  71     97     203   2000   90.7%     chr11  -       82889079   82889186   108
Note: The 2000 bp section upstream of the start codon is BLAT searched against the genome. No significant similarity is found.

BLAT Search Results (down)
QUERY    SCORE  START  END   QSIZE  IDENTITY  CHROM  STRAND  START      END        SPAN
-----------------------------------------------------------------------------------------------
YourSeq  2000   1      2000  2000   100.0%    chr1   -       36698079   36700078   2000
YourSeq  283    165    877   2000   83.5%     chr18  -       12426754   12427424   671
YourSeq  30     49     88    2000   84.3%     chr15  +       102140515  102140553  39
YourSeq  24     1622   1648  2000   84.0%     chr15  -       100699815  100699839  25
YourSeq  24     699    723   2000   100.0%    chr13  -       100344345  100344371  27
YourSeq  24     1035   1062  2000   85.2%     chr15  +       11515885   11515911   27
YourSeq  24     1950   1981  2000   87.5%     chr11  +       110585136  110585167  32
YourSeq  23     340    362   2000   100.0%    chr4   -       97812620   97812642   23
YourSeq  22     1151   1172  2000   100.0%    chr17  +       49319541   49319562   22
YourSeq  22     175    198   2000   95.9%     chr11  +       90025685   90025708   24
YourSeq  21     179    199   2000   100.0%    chr9   +       117208558  117208578  21
YourSeq  21     1025   1045  2000   100.0%    chr9   +       95851406   95851426   21
YourSeq  21     830    850   2000   100.0%    chr15  +       21400126   21400146   21
Note: The 2000 bp section downstream of the stop codon is BLAT searched against the genome. No significant similarity is found.
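
The base counts in the two GC Content Distribution summaries can be turned into overall GC percentages with simple arithmetic, and the 300 bp windowed profile can be approximated with a sliding window. The sketch below is a rough illustration, not the vendor's method: the function names are hypothetical, the window step of 1 bp is an assumption, and the report does not state what threshold it uses to call a region "significantly" GC-rich.

```python
def gc_percent_from_counts(counts):
    """Overall GC percentage from a dict of base counts."""
    total = sum(counts.values())
    return 100.0 * (counts["G"] + counts["C"]) / total

def windowed_gc(seq, window=300, step=1):
    """GC percentage of each `window`-bp stretch of `seq` (step is an assumption)."""
    seq = seq.upper()
    return [
        100.0 * sum(1 for b in seq[i:i + window] if b in "GC") / window
        for i in range(0, len(seq) - window + 1, step)
    ]

# Base counts copied from the two summaries in this report.
upstream = {"A": 609, "C": 465, "T": 533, "G": 393}
downstream = {"A": 477, "C": 504, "T": 452, "G": 567}

print(f"upstream GC:   {gc_percent_from_counts(upstream):.2f}%")
print(f"downstream GC: {gc_percent_from_counts(downstream):.2f}%")
```

The overall percentage averages over the whole 2000 bp flank, so a flank with a moderate overall value can still contain individual 300 bp windows that are strongly GC-rich, which is why the report evaluates a windowed profile.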

Gene and protein information:
Actr1b ARP1 actin-related protein 1B, centractin beta [Mus musculus (house mouse)]
Gene ID: 226977, updated on 15-Aug-2019

Gene summary
Official Symbol: Actr1b (provided by MGI)
Official Full Name: ARP1 actin-related protein 1B, centractin beta (provided by MGI)
Primary source: MGI:MGI:1917446
See related: Ensembl:ENSMUSG00000037351
Gene type: protein coding
RefSeq status: PROVISIONAL
Organism: Mus musculus
Lineage: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
Also known as: Arp1b; AA960180; AI851923; 2310066K23Rik
Expression: Ubiquitous expression in cortex adult (RPKM 55.7), cerebellum adult (RPKM 47.4) and 28 other tissues
Orthologs: human; all

Genomic context
Location: 1; 1 B
Exon count: 12

Annotation release  Status             Assembly                      Chr  Location
108                 current            GRCm38.p6 (GCF_000001635.26)  1    NC_000067.6 (36698114..36710017, complement)
Build 37.2          previous assembly  MGSCv37 (GCF_000001635.18)    1    NC_000067.5 (36756047..36766770, complement)

[Figure: genomic context, Chromosome 1 - NC_000067.6.]

Transcript information:
This gene has 7 transcripts.

Gene: Actr1b ENSMUSG00000037351
Description: ARP1 actin-related protein 1B, centractin beta [Source:MGI Symbol;Acc:MGI:1917446]
Gene Synonyms: 2310066K23Rik, Arp1b
Location: Chromosome 1: 36,698,114-36,714,422 reverse strand. GRCm38:CM000994.2
About this gene: This gene has 7 transcripts (splice variants), 106 orthologues, 27 paralogues and is a member of 1 Ensembl protein family.

Transcripts
Name        Transcript ID         bp    Protein     Translation ID        Biotype          CCDS       UniProt  Flags
Actr1b-201  ENSMUST00000043951.9  3248  376aa       ENSMUSP00000047326.3  Protein coding   CCDS14887  Q8R5C5   TSL:1 GENCODE basic APPRIS P1
Actr1b-205  ENSMUST00000160084.1  728   142aa       ENSMUSP00000125472.1  Protein coding   -          E0CZD4   CDS 3' incomplete TSL:5
Actr1b-202  ENSMUST00000159448.7  643   149aa       ENSMUSP00000124343.1  Protein coding   -          E0CYB4   CDS 3' incomplete TSL:2
Actr1b-207  ENSMUST00000162684.2  780   No protein  -                     Retained intron  -          -        TSL:3
Actr1b-204  ENSMUST00000160043.1  417   No protein  -                     Retained intron  -          -        TSL:1
Actr1b-203  ENSMUST00000159675.1  327   No protein  -                     Retained intron  -          -        TSL:1
Actr1b-206  ENSMUST00000162662.1  356   No protein  -                     lncRNA           -          -        TSL:3

[Figure: Ensembl region view of chromosome 1, 36.69 Mb to 36.72 Mb (36.31 kb). Forward strand: Cox5b-201, -204 and -206 (protein coding), Cox5b-202 and -203 (retained intron), Cox5b-205 (lncRNA); 4933424G06Rik-201 and -204 (protein coding), 4933424G06Rik-203 (retained intron). Contig: AC084389.1. Reverse strand: Actr1b-201, -202 and -205 (protein coding), Actr1b-203, -204 and -207 (retained intron), Actr1b-206 (lncRNA). Regulatory Build legend: CTCF, enhancer, open chromatin, promoter, promoter flank, transcription factor binding site. Gene legend: protein coding (Ensembl protein coding; merged Ensembl/Havana); non-protein coding (processed transcript; RNA gene).]

Transcript: ENSMUST00000043951
[Figure: protein view of Actr1b-201 (protein coding), reverse strand, 11.89 kb, ENSMUSP00000047..., with a scale bar from 0 to 376 amino acids. Domain and feature annotations shown:]
Superfamily: SSF53067
SMART: Actin family
Prints: Actin family
Pfam: Actin family
PROSITE patterns: Actin/actin-like conserved site; Actin, conserved site
PANTHER: Actin family; PTHR11937:SF370
Gene3D: 3.90.640.10; 3.30.420.40
CDD: cd00012
All sequence SNPs/i...: Sequence variants (dbSNP and all other sources)
Variant legend: splice region variant; synonymous variant

We wish to acknowledge the following valuable scientific information resources: Ensembl, MGI, NCBI, UCSC.