Mouse Acy1 Knockout Project (CRISPR/Cas9)
Total Page:16
File Type:pdf, Size:1020Kb
https://www.alphaknockout.com Mouse Acy1 Knockout Project (CRISPR/Cas9) Objective: To create a Acy1 knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Acy1 gene (NCBI Reference Sequence: NM_025371 ; Ensembl: ENSMUSG00000023262 ) is located on Mouse chromosome 9. 15 exons are identified, with the ATG start codon in exon 2 and the TGA stop codon in exon 15 (Transcript: ENSMUST00000024031). Exon 2~15 will be selected as target site. Cas9 and gRNA will be co-injected into fertilized eggs for KO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Exon 2 starts from about 0.08% of the coding region. Exon 2~15 covers 100.0% of the coding region. The size of effective KO region: ~4637 bp. The KO region does not have any other known gene. Page 1 of 9 https://www.alphaknockout.com Overview of the Targeting Strategy Wildtype allele 5' gRNA region gRNA region 3' 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Legends Exon of mouse Acy1 Knockout region Page 2 of 9 https://www.alphaknockout.com Overview of the Dot Plot (up) Window size: 15 bp Forward Reverse Complement Sequence 12 Note: The 2000 bp section upstream of start codon is aligned with itself to determine if there are tandem repeats. Tandem repeats are found in the dot plot matrix. The gRNA site is selected outside of these tandem repeats. Overview of the Dot Plot (down) Window size: 15 bp Forward Reverse Complement Sequence 12 Note: The 2000 bp section downstream of stop codon is aligned with itself to determine if there are tandem repeats. No significant tandem repeat is found in the dot plot matrix. So this region is suitable for PCR screening or sequencing analysis. Page 3 of 9 https://www.alphaknockout.com Overview of the GC Content Distribution (up) Window size: 300 bp Sequence 12 Summary: Full Length(2000bp) | A(26.45% 529) | C(25.1% 502) | T(21.05% 421) | G(27.4% 548) Note: The 2000 bp section upstream of start codon is analyzed to determine the GC content. Significant high GC-content regions are found. The gRNA site is selected outside of these high GC-content regions. Overview of the GC Content Distribution (down) Window size: 300 bp Sequence 12 Summary: Full Length(2000bp) | A(24.0% 480) | C(28.85% 577) | T(24.2% 484) | G(22.95% 459) Note: The 2000 bp section downstream of stop codon is analyzed to determine the GC content. No significant high GC-content region is found. So this region is suitable for PCR screening or sequencing analysis. Page 4 of 9 https://www.alphaknockout.com BLAT Search Results (up) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 2000 1 2000 2000 100.0% chr9 - 106437723 106439722 2000 browser details YourSeq 325 48 408 2000 95.3% chr17 - 45655599 45655943 345 browser details YourSeq 306 36 409 2000 92.9% chr12 + 62931214 62931585 372 browser details YourSeq 270 36 408 2000 90.7% chr2 - 30145534 30146003 470 browser details YourSeq 269 35 404 2000 88.5% chr7 - 127694524 127694858 335 browser details YourSeq 268 36 409 2000 87.9% chr10 - 78431255 78431598 344 browser details YourSeq 268 39 408 2000 92.6% chr1 + 39320733 39321128 396 browser details YourSeq 264 45 404 2000 90.4% chr18 + 20799349 20991849 192501 browser details YourSeq 261 46 398 2000 88.9% chr12 - 69604724 69605042 319 browser details YourSeq 259 36 398 2000 88.6% chr19 + 7313169 7313493 325 browser details YourSeq 256 45 412 2000 88.1% chr6 - 129360634 129360975 342 browser details YourSeq 254 36 404 2000 86.9% chr2 - 128725446 128725786 341 browser details YourSeq 254 36 397 2000 87.1% chr11 - 95933618 95933947 330 browser details YourSeq 251 45 398 2000 91.0% chr12 + 54149701 54583824 434124 browser details YourSeq 250 35 404 2000 87.2% chr14 - 63866608 63866926 319 browser details YourSeq 250 36 392 2000 89.1% chr19 + 7006212 7006538 327 browser details YourSeq 249 36 408 2000 86.2% chr17 - 12807411 12807754 344 browser details YourSeq 249 36 397 2000 88.7% chr12 + 112418932 112419260 329 browser details YourSeq 248 35 398 2000 86.3% chrX - 7225694 7226024 331 browser details YourSeq 248 60 404 2000 88.3% chr7 + 46494666 46494982 317 Note: The 2000 bp section upstream of start codon is BLAT searched against the genome. No significant similarity is found. BLAT Search Results (down) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 2000 1 2000 2000 100.0% chr9 - 106431084 106433083 2000 browser details YourSeq 477 1516 2000 2000 99.2% chr5 + 115801558 115802042 485 browser details YourSeq 476 1517 2000 2000 99.2% chr9 - 7751842 7752325 484 browser details YourSeq 475 1516 2000 2000 99.0% chr2 - 134804063 134804547 485 browser details YourSeq 475 1516 2000 2000 99.0% chr1 - 43189273 43189757 485 browser details YourSeq 472 1521 2000 2000 99.2% chr13 - 55085894 55086373 480 browser details YourSeq 471 1516 2000 2000 98.6% chr1 - 134518198 134518682 485 browser details YourSeq 471 1516 2000 2000 98.6% chr1 + 179520256 179520740 485 browser details YourSeq 469 1517 2000 2000 98.6% chr12 - 11099711 11100195 485 browser details YourSeq 465 1516 2000 2000 97.6% chr2 - 102174393 102174876 484 browser details YourSeq 459 1518 2000 2000 97.6% chr18 - 34737532 34738014 483 browser details YourSeq 457 1522 2000 2000 97.8% chr4 - 55254854 55255332 479 browser details YourSeq 456 1516 2000 2000 97.2% chr6 + 54177079 54177566 488 browser details YourSeq 455 1516 2000 2000 97.0% chr13 - 4614297 4614781 485 browser details YourSeq 452 1516 2000 2000 95.9% chr10 - 116631943 116632422 480 browser details YourSeq 451 1517 2000 2000 96.0% chr3 + 53401610 53402083 474 browser details YourSeq 450 1517 2000 2000 95.9% chr18 + 30458965 30459445 481 browser details YourSeq 445 1521 2000 2000 96.2% chr18 - 67562504 67562965 462 browser details YourSeq 442 1516 1997 2000 95.9% chr12 + 103027825 103028292 468 browser details YourSeq 441 1518 2000 2000 95.4% chr15 - 96188744 96189222 479 Note: The 2000 bp section downstream of stop codon is BLAT searched against the genome. No significant similarity is found. Page 5 of 9 https://www.alphaknockout.com Gene and protein information: Acy1 aminoacylase 1 [ Mus musculus (house mouse) ] Gene ID: 109652, updated on 12-Aug-2019 Gene summary Official Symbol Acy1 provided by MGI Official Full Name aminoacylase 1 provided by MGI Primary source MGI:MGI:87913 See related Ensembl:ENSMUSG00000023262 Gene type protein coding RefSeq status VALIDATED Organism Mus musculus Lineage Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus Also known as Acy-1; 1110014J22Rik Expression Broad expression in duodenum adult (RPKM 66.0), kidney adult (RPKM 51.7) and 19 other tissues See more Orthologs human all Genomic context Location: 9 F1; 9 57.49 cM See Acy1 in Genome Data Viewer Exon count: 15 Annotation release Status Assembly Chr Location 108 current GRCm38.p6 (GCF_000001635.26) 9 NC_000075.6 (106432981..106438236, complement) Build 37.2 previous assembly MGSCv37 (GCF_000001635.18) 9 NC_000075.5 (106335327..106340567, complement) Chromosome 9 - NC_000075.6 Page 6 of 9 https://www.alphaknockout.com Transcript information: This gene has 12 transcripts Gene: Acy1 ENSMUSG00000023262 Description aminoacylase 1 [Source:MGI Symbol;Acc:MGI:87913] Gene Synonyms 1110014J22Rik, Acy-1 Location Chromosome 9: 106,432,981-106,438,319 reverse strand. GRCm38:CM001002.2 About this gene This gene has 12 transcripts (splice variants), 209 orthologues, 3 paralogues and is a member of 1 Ensembl protein family. Transcripts Name Transcript ID bp Protein Translation ID Biotype CCDS UniProt Flags Acy1- ENSMUST00000024031.12 1529 408aa ENSMUSP00000024031.6 Protein coding CCDS23477 A0A0R4J050 TSL:1 201 GENCODE basic APPRIS P1 Acy1- ENSMUST00000214275.1 1284 373aa ENSMUSP00000149789.1 Protein coding - A0A1L1SS90 TSL:5 208 GENCODE basic Acy1- ENSMUST00000215395.1 1051 343aa ENSMUSP00000149394.1 Protein coding - A0A1L1SRC1 TSL:5 209 GENCODE basic Acy1- ENSMUST00000216400.1 1030 336aa ENSMUSP00000149636.1 Protein coding - A0A1L1SRW7 TSL:5 211 GENCODE basic Acy1- ENSMUST00000190972.2 855 219aa ENSMUSP00000139953.1 Protein coding - A0A087WPX1 CDS 3' 207 incomplete TSL:3 Acy1- ENSMUST00000190900.1 616 50aa ENSMUSP00000140582.1 Protein coding - A0A087WRE0 CDS 3' 206 incomplete TSL:3 Acy1- ENSMUST00000217531.1 466 155aa ENSMUSP00000149516.1 Protein coding - A0A1L1SRL5 CDS 5' and 3' 212 incomplete TSL:5 Acy1- ENSMUST00000215506.1 1122 201aa ENSMUSP00000150039.1 Nonsense mediated - A0A1L1SSU4 TSL:5 210 decay Acy1- ENSMUST00000190803.1 771 39aa ENSMUSP00000139931.1 Nonsense mediated - A0A087WPV3 TSL:5 204 decay Acy1- ENSMUST00000189097.6 1885 No - Retained intron - - TSL:5 203 protein Acy1- ENSMUST00000187324.7 1832 No - Retained intron - - TSL:5 202 protein Acy1- ENSMUST00000190851.2 1467 No - Retained intron - - TSL:5 205 protein Page 7 of 9 https://www.alphaknockout.com 25.34 kb Forward strand 106.425Mb 106.430Mb 106.435Mb 106.440Mb 106.445Mb Genes Rpl29-203 >protein coding Abhd14b-209 >protein coding (Comprehensive set... Rpl29-202 >protein coding Abhd14b-208 >protein coding Rpl29-201 >protein coding Abhd14b-205 >protein coding Rpl29-204 >protein coding Abhd14b-206 >protein coding Rpl29-206 >protein coding Abhd14b-204 >protein coding Rpl29-205 >protein coding