https://www.alphaknockout.com

Mouse Map4k3 Conditional Knockout Project (CRISPR/Cas9)

Objective: To create a Map4k3 conditional knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering.

Strategy summary: The Map4k3 (NCBI Reference Sequence: NM_001290345 ; Ensembl: ENSMUSG00000024242 ) is located on Mouse 17. 34 exons are identified, with the ATG start codon in exon 1 and the TGA stop codon in exon 34 (Transcript: ENSMUST00000025089). Exon 4~6 will be selected as conditional knockout region (cKO region). Deletion of this region should result in the loss of function of the Mouse Map4k3 gene. To engineer the targeting vector, homologous arms and cKO region will be generated by PCR using BAC clone RP23-187J7 as template. Cas9, gRNA and targeting vector will be co-injected into fertilized eggs for cKO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Mice homozygous for a gene trap allele exhibit decreased susceptibility to experimental autoimmune encephalomyelitis, decreased stimulated immunoglobin production, decreased stimulated T cell proliferation, and abnormal Th1, Th2, and Th17 differentiation.

Exon 4 starts from about 9.17% of the coding region. The knockout of Exon 4~6 will result in frameshift of the gene. The size of intron 3 for 5'-loxP site insertion: 8070 bp, and the size of intron 6 for 3'-loxP site insertion: 2734 bp. The size of effective cKO region: ~2560 bp. The cKO region does not have any other known gene.

Page 1 of 8 https://www.alphaknockout.com

Overview of the Targeting Strategy

Wildtype allele 5' gRNA region gRNA region 3'

1 4 5 6 34 Targeting vector

Targeted allele

Constitutive KO allele (After Cre recombination)

Legends Exon of mouse Map4k3 Homology arm cKO region loxP site

Page 2 of 8 https://www.alphaknockout.com

Overview of the Dot Plot Window size: 10 bp

Forward Reverse Complement

Sequence 12

Note: The sequence of homologous arms and cKO region is aligned with itself to determine if there are tandem repeats. No significant tandem repeat is found in the dot plot matrix. So this region is suitable for PCR screening or sequencing analysis.

Overview of the GC Content Distribution Window size: 300 bp

Sequence 12

Summary: Full Length(9060bp) | A(27.11% 2456) | C(20.1% 1821) | T(31.71% 2873) | G(21.08% 1910)

Note: The sequence of homologous arms and cKO region is analyzed to determine the GC content. No significant high GC-content region is found. So this region is suitable for PCR screening or sequencing analysis.

Page 3 of 8 https://www.alphaknockout.com

BLAT Search Results (up)

QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ------browser details YourSeq 3000 1 3000 3000 100.0% chr17 - 80656172 80659171 3000 browser details YourSeq 216 838 1169 3000 83.0% chr18 + 80478348 80478674 327 browser details YourSeq 176 796 1053 3000 85.5% chr18 - 25234803 25235058 256 browser details YourSeq 160 785 1071 3000 88.7% chr5 - 24293670 24293955 286 browser details YourSeq 158 1744 2031 3000 84.6% chr6 - 117691676 117692049 374 browser details YourSeq 156 1746 2044 3000 87.0% chr5 + 9776346 9776729 384 browser details YourSeq 151 1737 2013 3000 88.8% chr16 - 95164941 95165305 365 browser details YourSeq 148 1738 2035 3000 87.5% chrX - 100378044 100378402 359 browser details YourSeq 147 1892 2911 3000 90.3% chr18 - 10985655 11168720 183066 browser details YourSeq 147 1741 2040 3000 90.3% chr17 - 10660536 10660916 381 browser details YourSeq 145 780 1036 3000 85.8% chr14 - 29436623 29436883 261 browser details YourSeq 139 1744 2040 3000 86.9% chr5 - 136777200 136777598 399 browser details YourSeq 136 778 1033 3000 85.1% chr2 + 19832485 19832751 267 browser details YourSeq 135 780 1023 3000 82.1% chr16 - 60698998 60699245 248 browser details YourSeq 132 780 1047 3000 86.3% chr10 - 45816543 45816814 272 browser details YourSeq 131 1743 2030 3000 90.8% chr18 - 82885379 82885728 350 browser details YourSeq 129 1745 2037 3000 87.9% chr18 + 75803124 75803502 379 browser details YourSeq 129 1752 2015 3000 90.6% chr17 + 5809521 5809887 367 browser details YourSeq 127 1743 2017 3000 90.6% chr15 + 59355178 59355493 316 browser details YourSeq 127 1743 2044 3000 85.4% chr13 + 8818875 8819266 392

Note: The 3000 bp section upstream of Exon 4 is BLAT searched against the genome. No significant similarity is found.

BLAT Search Results (down)

QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ------browser details YourSeq 3000 1 3000 3000 100.0% chr17 - 80650612 80653611 3000 browser details YourSeq 29 1 71 3000 94.0% chr11 + 109594336 109594408 73 browser details YourSeq 27 291 321 3000 86.7% chr2 + 72915966 72915995 30 browser details YourSeq 23 1 37 3000 81.1% chr17 - 26042106 26042142 37

Note: The 3000 bp section downstream of Exon 6 is BLAT searched against the genome. No significant similarity is found.

Page 4 of 8 https://www.alphaknockout.com

Gene and information: Map4k3 mitogen-activated protein kinase kinase kinase kinase 3 [ Mus musculus (house mouse) ] Gene ID: 225028, updated on 10-Oct-2019

Gene summary

Official Symbol Map4k3 provided by MGI Official Full Name mitogen-activated protein kinase kinase kinase kinase 3 provided by MGI Primary source MGI:MGI:2154405 See related Ensembl:ENSMUSG00000024242 Gene type protein coding RefSeq status VALIDATED Organism Mus musculus Lineage Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus Also known as Glk; MEKKK3; MEKKK 3; MAPKKKK3; RAB8IPL1; 4833416M01Rik; 4833416M07Rik; 9530052P13Rik Expression Ubiquitous expression in bladder adult (RPKM 16.8), limb E14.5 (RPKM 14.8) and 28 other tissues See more Orthologs human all

Genomic context

Location: 17; 17 E3 See Map4k3 in Genome Data Viewer

Exon count: 36

Annotation release Status Assembly Chr Location

108 current GRCm38.p6 (GCF_000001635.26) 17 NC_000083.6 (80580513..80728806, complement)

Build 37.2 previous assembly MGSCv37 (GCF_000001635.18) 17 NC_000083.5 (80979852..81127433, complement)

Chromosome 17 - NC_000083.6

Page 5 of 8 https://www.alphaknockout.com

Transcript information: This gene has 8 transcripts

Gene: Map4k3 ENSMUSG00000024242

Description mitogen-activated protein kinase kinase kinase kinase 3 [Source:MGI Symbol;Acc:MGI:2154405] Gene Synonyms 9530052P13Rik Location Chromosome 17: 80,580,512-80,728,485 reverse strand. GRCm38:CM001010.2 About this gene This gene has 8 transcripts (splice variants), 211 orthologues, 35 paralogues, is a member of 1 Ensembl protein family and is associated with 16 phenotypes. Transcripts

Name Transcript ID bp Protein Translation ID Biotype CCDS UniProt Flags

Map4k3-201 ENSMUST00000025089.8 4220 894aa ENSMUSP00000025089.7 Protein coding CCDS70847 Q99JP0 TSL:5 GENCODE basic APPRIS P2

Map4k3-202 ENSMUST00000112389.8 4231 896aa ENSMUSP00000108008.2 Protein coding - E9QNE9 TSL:5 GENCODE basic APPRIS ALT1

Map4k3-203 ENSMUST00000234133.1 4088 873aa ENSMUSP00000157308.1 Protein coding - A0A3Q4EGQ9 GENCODE basic APPRIS ALT1

Map4k3-208 ENSMUST00000234585.1 3769 No protein - Retained intron - - -

Map4k3-206 ENSMUST00000234481.1 3647 No protein - Retained intron - - -

Map4k3-204 ENSMUST00000234200.1 2372 No protein - Retained intron - - -

Map4k3-207 ENSMUST00000234486.1 561 No protein - Retained intron - - -

Map4k3-205 ENSMUST00000234249.1 484 No protein - lncRNA - - -

Page 6 of 8 https://www.alphaknockout.com

167.97 kb Forward strand 80.60Mb 80.65Mb 80.70Mb Gm9959-201 >pseudogene (Comprehensive set...

Contigs < AC131712.3 AC169501.2 > Genes (Comprehensive set... < Cdkl4-203protein coding < Map4k3-206retained intron < Map4k3-207retained intron

< Cdkl4-202protein coding < Map4k3-205lncRNA

< Map4k3-202protein coding

< Map4k3-201protein coding

< Map4k3-203protein coding

< Map4k3-208retained intron

< Map4k3-204retained intron

Regulatory Build

80.60Mb 80.65Mb 80.70Mb Reverse strand 167.97 kb

Regulation Legend

CTCF Enhancer Open Chromatin Promoter Promoter Flank Transcription Factor Binding Site

Gene Legend Protein Coding

Ensembl protein coding merged Ensembl/Havana

Non-Protein Coding

RNA gene processed transcript pseudogene

Page 7 of 8 https://www.alphaknockout.com

Transcript: ENSMUST00000025089

< Map4k3-201protein coding

Reverse strand 147.58 kb

ENSMUSP00000025... MobiDB lite Low complexity (Seg) Superfamily Protein kinase-like domain superfamily SMART Protein kinase domain Citron homology (CNH) domain

Pfam Protein kinase domain Citron homology (CNH) domain

PROSITE profiles Protein kinase domain Citron homology (CNH) domain

PROSITE patterns Protein kinase, ATP binding site PIRSF Mitogen-activated protein (MAP) kinase kinase kinase kinase PANTHER PTHR24361:SF205

PTHR24361 Gene3D 1.10.510.10 CDD cd06613

All sequence SNPs/i... Sequence variants (dbSNP and all other sources)

Variant Legend missense variant splice region variant synonymous variant

Scale bar 0 80 160 240 320 400 480 560 640 720 800 894

We wish to acknowledge the following valuable scientific information resources: Ensembl, MGI, NCBI, UCSC.

Page 8 of 8