https://www.alphaknockout.com

Mouse Sla2 Conditional Knockout Project (CRISPR/Cas9)

Objective: To create a Sla2 conditional knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering.

Strategy summary: The Sla2 (NCBI Reference Sequence: NM_029983 ; Ensembl: ENSMUSG00000027636 ) is located on Mouse 2. 8 exons are identified, with the ATG start codon in exon 2 and the TAG stop codon in exon 8 (Transcript: ENSMUST00000029164). Exon 5 will be selected as conditional knockout region (cKO region). Deletion of this region should result in the loss of function of the Mouse Sla2 gene. To engineer the targeting vector, homologous arms and cKO region will be generated by PCR using BAC clone RP23-138M2 as template. Cas9, gRNA and targeting vector will be co-injected into fertilized eggs for cKO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Mice homozygous for a knock-out allele exhibit normal B and T cells.

Exon 5 starts from about 35.52% of the coding region. The knockout of Exon 5 will result in frameshift of the gene. The size of intron 4 for 5'-loxP site insertion: 634 bp, and the size of intron 5 for 3'-loxP site insertion: 2732 bp. The size of effective cKO region: ~604 bp. The cKO region does not have any other known gene.

Page 1 of 7 https://www.alphaknockout.com

Overview of the Targeting Strategy

Wildtype allele gRNA region 5' gRNA region 3'

1 3 4 5 8 Targeting vector

Targeted allele

Constitutive KO allele (After Cre recombination)

Legends Exon of mouse Sla2 Homology arm cKO region loxP site

Page 2 of 7 https://www.alphaknockout.com

Overview of the Dot Plot Window size: 10 bp

Forward Reverse Complement

Sequence 12

Note: The sequence of homologous arms and cKO region is aligned with itself to determine if there are tandem repeats. No significant tandem repeat is found in the dot plot matrix. So this region is suitable for PCR screening or sequencing analysis.

Overview of the GC Content Distribution Window size: 300 bp

Sequence 12

Summary: Full Length(7104bp) | A(25.0% 1776) | C(27.11% 1926) | T(21.97% 1561) | G(25.91% 1841)

Note: The sequence of homologous arms and cKO region is analyzed to determine the GC content. No significant high GC-content region is found. So this region is suitable for PCR screening or sequencing analysis.

Page 3 of 7 https://www.alphaknockout.com

BLAT Search Results (up)

QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ------browser details YourSeq 3000 1 3000 3000 100.0% chr2 - 156879058 156882057 3000 browser details YourSeq 289 818 1176 3000 92.4% chr17 + 79878313 79878657 345 browser details YourSeq 278 837 1179 3000 95.2% chr12 + 102036031 102036454 424 browser details YourSeq 272 818 1180 3000 96.7% chr4 + 103271037 103271817 781 browser details YourSeq 262 818 1179 3000 93.2% chr1 + 139345340 139345696 357 browser details YourSeq 258 840 1180 3000 96.2% chr1 + 176945645 177040648 95004 browser details YourSeq 254 843 1180 3000 95.2% chr1 + 24331411 24331764 354 browser details YourSeq 253 834 1180 3000 96.1% chr12 + 82163621 82163982 362 browser details YourSeq 253 854 1181 3000 95.8% chr11 + 4979650 4980054 405 browser details YourSeq 252 842 1180 3000 95.5% chr9 + 101580095 101580587 493 browser details YourSeq 252 818 1166 3000 94.1% chr12 + 102036078 102036488 411 browser details YourSeq 250 819 1172 3000 95.5% chr10 + 81010549 81010997 449 browser details YourSeq 243 820 1180 3000 89.7% chr6 + 87332104 87332448 345 browser details YourSeq 232 866 1180 3000 95.4% chr1 - 55097380 55097786 407 browser details YourSeq 230 838 1180 3000 94.0% chr14 + 116603192 116603553 362 browser details YourSeq 225 842 1180 3000 95.3% chr19 - 9996420 9996767 348 browser details YourSeq 222 817 1178 3000 95.3% chr1 - 55610304 55610791 488 browser details YourSeq 217 818 1180 3000 89.1% chr1 + 24331451 24331718 268 browser details YourSeq 216 916 1180 3000 97.1% chr1 - 9635418 9635852 435 browser details YourSeq 215 904 1178 3000 94.7% chr4 + 103270948 103271335 388

Note: The 3000 bp section upstream of Exon 5 is BLAT searched against the genome. No significant similarity is found.

BLAT Search Results (down)

QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ------browser details YourSeq 3000 1 3000 3000 100.0% chr2 - 156875454 156878453 3000 browser details YourSeq 256 771 1407 3000 96.4% chr2 + 156877047 156877683 637 browser details YourSeq 42 784 901 3000 81.1% chr7 - 109061660 109061775 116 browser details YourSeq 42 983 1040 3000 89.4% chr2 - 141210765 141210821 57 browser details YourSeq 39 966 1046 3000 95.4% chr10 - 61684238 61684341 104 browser details YourSeq 37 979 1025 3000 90.3% chr9 + 48493834 48493879 46 browser details YourSeq 35 980 1046 3000 76.2% chr17 + 74309045 74309111 67 browser details YourSeq 33 1286 1369 3000 88.4% chr5 - 51851763 51851853 91 browser details YourSeq 32 1112 1162 3000 94.5% chr10 + 68054653 68054703 51 browser details YourSeq 30 966 1002 3000 96.9% chr2 - 86623834 86623871 38 browser details YourSeq 30 868 901 3000 94.2% chr9 + 25551175 25551208 34 browser details YourSeq 30 1117 1147 3000 100.0% chr17 + 12377195 12377226 32 browser details YourSeq 29 966 1002 3000 96.8% chr15 - 101106024 101106061 38 browser details YourSeq 28 1137 1170 3000 90.0% chr5 - 34193749 34193781 33 browser details YourSeq 28 1015 1055 3000 85.4% chr1 - 55130206 55130256 51 browser details YourSeq 28 1286 1322 3000 89.2% chr7 + 96000796 96000833 38 browser details YourSeq 28 864 901 3000 86.9% chr5 + 133924900 133924937 38 browser details YourSeq 25 1115 1145 3000 96.3% chr8 - 94743550 94743581 32 browser details YourSeq 25 1555 1582 3000 84.7% chr3 + 6120466 6120491 26 browser details YourSeq 24 1116 1147 3000 96.2% chr17 + 12759661 12759693 33

Note: The 3000 bp section downstream of Exon 5 is BLAT searched against the genome. No significant similarity is found.

Page 4 of 7 https://www.alphaknockout.com

Gene and information: Sla2 Src-like-adaptor 2 [ Mus musculus (house mouse) ] Gene ID: 77799, updated on 24-Oct-2019

Gene summary

Official Symbol Sla2 provided by MGI Official Full Name Src-like-adaptor 2 provided by MGI Primary source MGI:MGI:1925049 See related Ensembl:ENSMUSG00000027636 Gene type protein coding RefSeq status VALIDATED Organism Mus musculus Lineage Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus Also known as SLAP2; SLAP-2; AI430952; A930009E21Rik Expression Ubiquitous expression in thymus adult (RPKM 36.8), placenta adult (RPKM 10.3) and 27 other tissues See more Orthologs human all

Genomic context

Location: 2; 2 H1 See Sla2 in Genome Data Viewer

Exon count: 8

Annotation release Status Assembly Chr Location

108 current GRCm38.p6 (GCF_000001635.26) 2 NC_000068.7 (156874151..156887250, complement)

Build 37.2 previous assembly MGSCv37 (GCF_000001635.18) 2 NC_000068.6 (156698658..156712814, complement)

Chromosome 2 - NC_000068.7

Page 5 of 7 https://www.alphaknockout.com

Transcript information: This gene has 2 transcripts

Gene: Sla2 ENSMUSG00000027636

Description Src-like-adaptor 2 [Source:MGI Symbol;Acc:MGI:1925049] Gene Synonyms A930009E21Rik, SLAP-2, SLAP2 Location Chromosome 2: 156,872,457-156,887,192 reverse strand. GRCm38:CM000995.2 About this gene This gene has 2 transcripts (splice variants), 186 orthologues, 8 paralogues and is a member of 1 Ensembl protein family. Transcripts

Name Transcript ID bp Protein Translation ID Biotype CCDS UniProt Flags

Sla2-201 ENSMUST00000029164.8 2973 259aa ENSMUSP00000029164.2 Protein coding CCDS38302 Q8R4L0 TSL:1 GENCODE basic APPRIS P1

Sla2-202 ENSMUST00000109561.3 2634 259aa ENSMUSP00000105189.3 Protein coding CCDS38302 Q8R4L0 TSL:1 GENCODE basic APPRIS P1

34.74 kb Forward strand 156.87Mb 156.88Mb 156.89Mb Rab5if-201 >protein coding Gm14248-201 >processed pseudogene (Comprehensive set...

Rab5if-203 >lncRNA

Rab5if-204 >retained intron Rab5if-202 >lncRNA

Contigs AL935150.10 > Genes (Comprehensive set... < 5430405H02Rik-203lncRNA < Sla2-201protein coding < Gm14278-201unprocessed pseudogene

< 5430405H02Rik-202lncRNA < Sla2-202protein coding < Gm14247-201processed pseudogene

< 5430405H02Rik-201lncRNA

Regulatory Build

156.87Mb 156.88Mb 156.89Mb Reverse strand 34.74 kb

Regulation Legend CTCF Enhancer Promoter Promoter Flank

Gene Legend Protein Coding

merged Ensembl/Havana Ensembl protein coding

Non-Protein Coding

processed transcript RNA gene pseudogene

Page 6 of 7 https://www.alphaknockout.com

Transcript: ENSMUST00000029164

< Sla2-201protein coding

Reverse strand 14.62 kb

ENSMUSP00000029... MobiDB lite Low complexity (Seg) Superfamily SH3-like domain superfamily SH2 domain superfamily

SMART SH3 domain SH2 domain

Prints SH2 domain Pfam SH3 domain SH2 domain

PROSITE profiles SH3 domain SH2 domain

PANTHER Src-like-adapter 2

PTHR10155 Gene3D 2.30.30.40 SH2 domain superfamily

CDD SLAP, SH2 domain

All sequence SNPs/i... Sequence variants (dbSNP and all other sources)

Variant Legend inframe insertion missense variant splice region variant synonymous variant

Scale bar 0 40 80 120 160 200 259

We wish to acknowledge the following valuable scientific information resources: Ensembl, MGI, NCBI, UCSC.

Page 7 of 7