https://www.alphaknockout.com

Mouse Ipo8 Knockout Project (CRISPR/Cas9)

Objective: To create an Ipo8 knockout mouse model (C57BL/6J) by CRISPR/Cas9-mediated genome engineering.

Strategy summary: The Ipo8 gene (NCBI Reference Sequence: NM_001081113; Ensembl: ENSMUSG00000040029) is located on mouse chromosome 6. Twenty-five exons are identified, with the ATG start codon in exon 1 and the TGA stop codon in exon 25 (Transcript: ENSMUST00000048418). Exons 2~4 will be selected as the target site. Cas9 and gRNA will be co-injected into fertilized eggs for KO mouse production. The pups will be genotyped by PCR followed by sequencing analysis.

Note: Exon 2 starts from about 2.81% of the coding region. Exons 2~4 cover 13.14% of the coding region. The size of the effective KO region is ~6522 bp. The KO region does not contain any other known gene.
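As a rough cross-check, the percentages in the note can be converted to approximate nucleotide counts. The sketch below assumes a coding sequence of 3 x (1010 + 1) nt, derived from the 1010 aa protein listed for ENSMUST00000048418 plus one stop codon. That length and the helper name percent_to_nt are assumptions for illustration, not values stated in the report.

```python
# Assumption: CDS length derived from the 1010 aa protein plus one stop codon.
PROTEIN_AA = 1010
CDS_NT = 3 * (PROTEIN_AA + 1)  # 3033 nt under this assumption


def percent_to_nt(percent, cds_nt=CDS_NT):
    """Convert a percentage of the coding region to an approximate nt count."""
    return round(percent / 100 * cds_nt)


# Approximate coding nucleotides that precede exon 2 (2.81% of the CDS).
exon2_start_nt = percent_to_nt(2.81)
# Approximate coding nucleotides contained in exons 2~4 (13.14% of the CDS).
ko_coding_nt = percent_to_nt(13.14)

print(exon2_start_nt, ko_coding_nt)
```

Because the reported percentages are rounded, the resulting counts are only accurate to within a few nucleotides.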


Overview of the Targeting Strategy

[Figure: wildtype allele drawn 5' to 3', showing exons 1, 2, 3, 4 and 25 of mouse Ipo8, with one gRNA region on each side of the knockout region. Legend: exon of mouse Ipo8; knockout region.]


Overview of the Dot Plot (up)

Window size: 15 bp

[Figure: dot plot. Legend: Forward; Reverse Complement. Axis label: Sequence 12.]

Note: The 2000 bp section upstream of Exon 2 is aligned with itself to determine if there are tandem repeats. Tandem repeats are found in the dot plot matrix. The gRNA site is selected outside of these tandem repeats.

Overview of the Dot Plot (down)

Window size: 15 bp

[Figure: dot plot. Legend: Forward; Reverse Complement. Axis label: Sequence 12.]

Note: The 1322 bp section downstream of Exon 4 is aligned with itself to determine if there are tandem repeats. Tandem repeats are found in the dot plot matrix. The gRNA site is selected outside of these tandem repeats.
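The self-alignment described in the notes above can be illustrated with a minimal sketch. It marks a dot wherever two windows of the same sequence are identical, then reports the most frequent distance between matching windows, which for a tandem repeat equals the repeat period. This is not the tool used to produce the plots. The function names and toy sequences are hypothetical, only forward-strand matches are considered, and the window is shortened from 15 bp to 4 bp so the toy example stays small.

```python
from collections import Counter, defaultdict


def self_dot_plot(seq, window=15):
    """Return sorted (i, j) pairs, i < j, whose window-length words match exactly."""
    positions = defaultdict(list)
    for i in range(len(seq) - window + 1):
        positions[seq[i:i + window]].append(i)
    dots = []
    for starts in positions.values():
        # Every pair of positions sharing the same word is an off-diagonal dot.
        for a, i in enumerate(starts):
            for j in starts[a + 1:]:
                dots.append((i, j))
    return sorted(dots)


def dominant_offset(dots):
    """Most frequent j - i distance; for a tandem repeat this is its period."""
    if not dots:
        return None
    return Counter(j - i for i, j in dots).most_common(1)[0][0]


# Toy sequences (hypothetical, not taken from the Ipo8 locus).
toy_repeat = "GATTACA" + "CAG" * 4 + "TTGCA"  # contains a CAG tandem repeat
toy_unique = "ACGTTGCA"                        # no repeated 4-mers

repeat_dots = self_dot_plot(toy_repeat, window=4)
unique_dots = self_dot_plot(toy_unique, window=4)

print(dominant_offset(repeat_dots), unique_dots)
```

Dots that line up parallel to the main diagonal at a fixed offset are what mark a tandem repeat in the plot, and a gRNA site would be chosen at positions that take part in no such dots.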


Overview of the GC Content Distribution (up)

Window size: 300 bp

[Figure: GC content distribution plot. Axis label: Sequence 12.]

Summary: Full Length(2000bp) | A(32.4% 648) | C(15.75% 315) | T(29.4% 588) | G(22.45% 449)

Note: The 2000 bp section upstream of Exon 2 is analyzed to determine the GC content. No region of significantly high GC content is found, so this region is suitable for PCR screening or sequencing analysis.

Overview of the GC Content Distribution (down)

Window size: 300 bp

[Figure: GC content distribution plot. Axis label: Sequence 12.]

Summary: Full Length(1322bp) | A(23.98% 317) | C(23.68% 313) | T(33.13% 438) | G(19.21% 254)

Note: The 1322 bp section downstream of Exon 4 is analyzed to determine the GC content. No region of significantly high GC content is found, so this region is suitable for PCR screening or sequencing analysis.
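The overall GC content of each flanking section follows directly from the base counts in the two summaries. The sketch below uses those counts as given. The function name gc_percent is an illustrative choice, and the calculation covers the full-length sections only, not the 300 bp sliding-window distribution shown in the plots.

```python
def gc_percent(counts):
    """Overall GC content (%) from a dict of base counts."""
    total = sum(counts.values())
    return 100 * (counts["G"] + counts["C"]) / total


# Base counts copied from the two summary lines.
upstream = {"A": 648, "C": 315, "T": 588, "G": 449}    # 2000 bp, upstream of exon 2
downstream = {"A": 317, "C": 313, "T": 438, "G": 254}  # 1322 bp, downstream of exon 4

up_gc = gc_percent(upstream)
down_gc = gc_percent(downstream)

print(round(up_gc, 2), round(down_gc, 2))
```

Both values are below 50%, in line with the notes that no high-GC region was found.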


BLAT Search Results (up)

QUERY    SCORE  START  END   QSIZE  IDENTITY  CHROM  STRAND  START      END        SPAN
---------------------------------------------------------------------------------------
YourSeq  2000   1      2000  2000   100.0%    chr6   -       148824486  148826485  2000
YourSeq  301    1136   1797  2000   92.2%     chr12  +       76542904   76543253   350
YourSeq  295    1136   1817  2000   87.8%     chr17  -       87662206   87662642   437
YourSeq  284    1136   1814  2000   88.4%     chr7   +       27648428   27648775   348
YourSeq  277    1136   1798  2000   91.1%     chr5   +       21020047   21020680   634
YourSeq  271    1135   1804  2000   87.9%     chr1   -       180617793  180618231  439
YourSeq  255    1136   1777  2000   88.6%     chr11  +       55462376   55462827   452
YourSeq  246    1134   1768  2000   88.8%     chr7   -       19774894   19775254   361
YourSeq  244    1122   1776  2000   88.6%     chr17  -       23772021   23772640   620
YourSeq  237    1075   1767  2000   86.6%     chr16  +       32128924   32129300   377
YourSeq  236    1136   1740  2000   93.7%     chr13  -       55240576   55241253   678
YourSeq  233    1136   1738  2000   89.5%     chr9   -       110126930  110127382  453
YourSeq  233    1137   1782  2000   85.4%     chr6   -       135100515  135100902  388
YourSeq  233    1135   1768  2000   94.0%     chr19  -       46023964   46024975   1012
YourSeq  229    1134   1748  2000   87.9%     chr7   +       101654382  101654698  317
YourSeq  226    1136   1757  2000   88.6%     chr10  +       123127076  123127355  280
YourSeq  222    1136   1758  2000   86.7%     chr11  +       115504485  115504945  461
YourSeq  212    1144   1741  2000   86.3%     chr1   -       168442156  168442455  300
YourSeq  201    1137   1693  2000   88.0%     chr14  -       55674481   55674863   383
YourSeq  199    1107   1810  2000   87.4%     chr16  +       18260690   18260916   227

Note: The 2000 bp section upstream of Exon 2 is BLAT searched against the genome. No significant similarity is found.

BLAT Search Results (down)

QUERY    SCORE  START  END   QSIZE  IDENTITY  CHROM  STRAND  START      END        SPAN
---------------------------------------------------------------------------------------
YourSeq  1322   1      1322  1322   100.0%    chr6   -       148816642  148817963  1322
YourSeq  100    434    929   1322   77.6%     chr11  +       80345371   80345559   189
YourSeq  77     809    930   1322   91.2%     chr16  +       93282777   93282991   215
YourSeq  73     803    918   1322   86.1%     chr11  +       103754816  103754927  112
YourSeq  69     804    931   1322   85.6%     chr13  +       37853804   37853925   122
YourSeq  68     800    912   1322   80.8%     chr5   +       51684296   51684382   87
YourSeq  66     803    921   1322   93.6%     chr12  -       84942060   84942215   156
YourSeq  63     822    899   1322   92.4%     chr10  -       126219959  126220198  240
YourSeq  60     815    918   1322   80.3%     chr16  -       7006906    7007002    97
YourSeq  60     793    890   1322   77.8%     chr1   +       24533585   24533661   77
YourSeq  59     828    906   1322   94.2%     chr7   -       131871587  131871779  193
YourSeq  56     797    915   1322   95.3%     chr10  -       42595183   42595304   122
YourSeq  53     797    862   1322   88.0%     chr10  -       121119867  121119929  63
YourSeq  52     798    928   1322   98.3%     chr7   -       37029232   37029380   149
YourSeq  49     846    924   1322   75.9%     chr15  -       34679509   34679573   65
YourSeq  48     797    869   1322   77.8%     chr12  +       80784397   80784452   56
YourSeq  48     824    893   1322   82.2%     chr10  +       58827810   58827874   65
YourSeq  46     815    881   1322   88.2%     chr6   -       24370605   24370672   68
YourSeq  46     826    914   1322   71.7%     chr1   -       27301710   27301770   61
YourSeq  45     847    921   1322   75.5%     chr18  -       54761784   54761845   62

Note: The 1322 bp section downstream of Exon 4 is BLAT searched against the genome. No significant similarity is found.
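The top hit of each BLAT search gives the genomic position of the corresponding flanking section on chr6 (minus strand). A short calculation on those coordinates recovers the length of the interval between the two flanks. The sketch below assumes that this interval corresponds to the ~6522 bp effective KO region quoted in the strategy summary, which the report does not state explicitly. The constant and function names are illustrative.

```python
# Genomic coordinates (chr6, minus strand) copied from the top BLAT hit
# of each search.
UP_START, UP_END = 148824486, 148826485      # 2000 bp section upstream of exon 2
DOWN_START, DOWN_END = 148816642, 148817963  # 1322 bp section downstream of exon 4


def span(start, end):
    """Length of a closed interval [start, end] in bp."""
    return end - start + 1


# The gene is on the minus strand, so the upstream flank has the higher
# coordinates; the interval between the flanks runs from the base after
# DOWN_END to the base before UP_START.
between_flanks = span(DOWN_END + 1, UP_START - 1)

print(span(UP_START, UP_END), span(DOWN_START, DOWN_END), between_flanks)
```

The first two values reproduce the stated section lengths (2000 bp and 1322 bp), which serves as a sanity check on the coordinates before the interval length is read off.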


Gene information: Ipo8 importin 8 [Mus musculus (house mouse)]

Gene ID: 320727, updated on 12-Aug-2019

Gene summary

Official Symbol: Ipo8 (provided by MGI)
Official Full Name: importin 8 (provided by MGI)
Primary source: MGI:MGI:2444611
See related: Ensembl:ENSMUSG00000040029
Gene type: protein coding
RefSeq status: PROVISIONAL
Organism: Mus musculus
Lineage: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
Also known as: Om1; MRP7; OM-1; Abcc10; Ranbp8; 6230418K12Rik; C130009K11Rik
Expression: Ubiquitous expression in CNS E11.5 (RPKM 7.3), bladder adult (RPKM 7.3) and 28 other tissues
Orthologs: human, all

Genomic context

Location: 6; 6 G3
Exon count: 26

Annotation release  Status             Assembly                      Chr  Location
108                 current            GRCm38.p6 (GCF_000001635.26)  6    NC_000072.6 (148770683..148832931, complement)
Build 37.2          previous assembly  MGSCv37 (GCF_000001635.18)    6    NC_000072.5 (148719206..148779989, complement)

[Figure: genomic context of Ipo8 on Chromosome 6 - NC_000072.6]


Transcript information: This gene has 12 transcripts

Gene: Ipo8 ENSMUSG00000040029

Description: importin 8 [Source:MGI Symbol;Acc:MGI:2444611]
Gene Synonyms: 6230418K12Rik, C130009K11Rik, OM-1, Om1, Ranbp8
Location: Chromosome 6: 148,770,683-148,831,467 reverse strand. GRCm38:CM000999.2
About this gene: This gene has 12 transcripts (splice variants), 195 orthologues, 4 paralogues, is a member of 1 Ensembl protein family and is associated with 2 phenotypes.

Transcripts

Name      Transcript ID          bp    Protein     Translation ID        Biotype                  CCDS       UniProt     Flags
Ipo8-201  ENSMUST00000048418.13  5360  1010aa      ENSMUSP00000046759.7  Protein coding           CCDS39719  Q7TMY7      TSL:1 GENCODE basic APPRIS P1
Ipo8-212  ENSMUST00000239062.1   3641  513aa       ENSMUSP00000159101.1  Protein coding           -          -           CDS 5' incomplete
Ipo8-206  ENSMUST00000145960.1   1262  361aa       ENSMUSP00000117365.1  Protein coding           -          B2KGD7      CDS 3' incomplete TSL:1
Ipo8-210  ENSMUST00000204936.1   1408  35aa        ENSMUSP00000144744.1  Nonsense mediated decay  -          A0A0N4SUM9  TSL:5
Ipo8-203  ENSMUST00000136896.7   6056  No protein  -                     Retained intron          -          -           TSL:1
Ipo8-209  ENSMUST00000204424.1   4916  No protein  -                     Retained intron          -          -           TSL:NA
Ipo8-205  ENSMUST00000145743.7   3845  No protein  -                     Retained intron          -          -           TSL:1
Ipo8-211  ENSMUST00000205196.1   2907  No protein  -                     Retained intron          -          -           TSL:NA
Ipo8-208  ENSMUST00000204016.1   2413  No protein  -                     Retained intron          -          -           TSL:NA
Ipo8-202  ENSMUST00000123842.7   829   No protein  -                     Retained intron          -          -           TSL:2
Ipo8-207  ENSMUST00000147955.1   553   No protein  -                     Retained intron          -          -           TSL:3
Ipo8-204  ENSMUST00000144357.3   813   No protein  -                     lncRNA                   -          -           TSL:3


[Figure: genome browser view of the Ipo8 locus, 80.78 kb, 148.78 Mb to 148.84 Mb. Forward strand: 2610017A05Rik-201 (TEC); contig AC134542.3. Reverse strand (Genes, Comprehensive set): Ipo8-201 and Ipo8-212 and Ipo8-206 (protein coding); Ipo8-210 (nonsense mediated decay); Ipo8-202, -203, -205, -207, -208, -209 and -211 (retained intron); Ipo8-204 (lncRNA). Additional track: Regulatory Build.]

Regulation Legend: CTCF; Open Chromatin; Promoter; Promoter Flank; Transcription Factor Binding Site

Gene Legend: Protein Coding (Ensembl protein coding; merged Ensembl/Havana); Non-Protein Coding (RNA gene; processed transcript)


Transcript: ENSMUST00000048418

[Figure: Ipo8-201 (protein coding), reverse strand, 60.78 kb, with protein domain annotation for ENSMUSP00000046... on a scale of 0 to 1010. Tracks: MobiDB lite; Low complexity (Seg); Superfamily (Armadillo-type fold); SMART (Importin-beta, N-terminal domain); Pfam (Exportin-2, central domain; Importin-beta, N-terminal domain); PROSITE profiles (Importin-beta, N-terminal domain); PANTHER (PTHR10997:SF26; PTHR10997); Gene3D (Armadillo-like helical); sequence variants (dbSNP and all other sources).]

Variant Legend: missense variant; synonymous variant

We wish to acknowledge the following valuable scientific information resources: Ensembl, MGI, NCBI, UCSC.
