https://www.alphaknockout.com

Mouse Pclo Conditional Knockout Project (CRISPR/Cas9)

Objective: To create a Pclo conditional knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering.

Strategy summary: The Pclo gene (NCBI Reference Sequence: NM_011995; Ensembl: ENSMUSG00000061601) is located on mouse chromosome 5. 25 exons have been identified, with the ATG start codon in exon 1 and the TGA stop codon in exon 25 (Transcript: ENSMUST00000030691). Exon 2 will be selected as the conditional knockout region (cKO region). Deletion of this region should result in loss of function of the mouse Pclo gene. To engineer the targeting vector, the homologous arms and the cKO region will be generated by PCR using BAC clone RP23-35C9 as template. Cas9, gRNA and the targeting vector will be co-injected into fertilized eggs for cKO mouse production. The pups will be genotyped by PCR followed by sequencing analysis.

Note: Mice homozygous for one deletion of Pclo are viable and fertile, and display no overt abnormal phenotype. Mice homozygous for another knock-out allele exhibit some premature lethality, decreased body size, and abnormal synaptic vesicle number.

Exon 2 starts at about 1.58% of the coding region. Knocking out exon 2 will cause a frameshift in the gene. The size of intron 1, for 5'-loxP site insertion, is 5381 bp, and the size of intron 2, for 3'-loxP site insertion, is 17085 bp. The size of the effective cKO region is ~1983 bp. The cKO region does not contain any other known gene.
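As a rough illustration of what "about 1.58% of the coding region" means in nucleotides, the figure can be combined with the 5068 aa protein length listed for Pclo-201 in the transcript table of this report. This is a back-of-the-envelope sketch only: it assumes the coding region is counted as all codons plus one stop codon, and the percentage is itself rounded, so the result is approximate.

```python
# Values taken from the report; the stop-codon convention is an assumption.
PROTEIN_LENGTH_AA = 5068        # Pclo-201 (ENSMUST00000030691)
EXON2_START_FRACTION = 0.0158   # "about 1.58% of the coding region"

# Assumption: coding region = all amino-acid codons plus one stop codon.
cds_length_nt = (PROTEIN_LENGTH_AA + 1) * 3

# Approximate 1-based coding nucleotide at which exon 2 begins.
approx_exon2_start_nt = round(EXON2_START_FRACTION * cds_length_nt)

# Codon (1-based) that contains that nucleotide.
approx_exon2_start_codon = (approx_exon2_start_nt - 1) // 3 + 1

print(cds_length_nt, approx_exon2_start_nt, approx_exon2_start_codon)
```

Under these assumptions exon 2 begins within roughly the first hundred codons, which is consistent with the stated expectation that removing it disrupts most of the protein.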


Overview of the Targeting Strategy

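[Figure: schematic of the targeting strategy. Four panels: wildtype allele (5' to 3', exons 1, 2 and 25 labelled, with gRNA regions flanking exon 2); targeting vector; targeted allele; constitutive KO allele (after Cre recombination). Legend: exon of mouse Pclo, homology arm, cKO region, loxP site.]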


Overview of the Dot Plot

Window size: 10 bp

[Figure: dot plot of the sequence aligned against itself, showing forward and reverse-complement matches.]

Note: The sequence of homologous arms and cKO region is aligned with itself to determine if there are tandem repeats. No significant tandem repeat is found in the dot plot matrix. So this region is suitable for PCR screening or sequencing analysis.
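The self-alignment described in the note can be sketched as an exact-match dot plot with a 10 bp window. This is an illustrative sketch, not the tool used to produce the report's figure: the function names are hypothetical, and real dot-plot programs often allow mismatches within the window, which this version does not.

```python
from collections import defaultdict

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def self_dot_plot(seq, window=10):
    """Find exact window-sized matches of a sequence against itself.

    Returns two sorted lists of (i, j) window start positions (0-based):
    forward matches off the main diagonal (i < j), and matches between
    the window at i and the reverse complement of the window at j.
    """
    seq = seq.upper()
    index = defaultdict(list)
    for i in range(len(seq) - window + 1):
        index[seq[i:i + window]].append(i)

    # Forward repeats: the same window occurring at two different positions.
    forward = [(i, j) for hits in index.values()
               for i in hits for j in hits if i < j]

    # Inverted repeats: a window whose reverse complement occurs elsewhere.
    reverse = []
    for i in range(len(seq) - window + 1):
        rc = reverse_complement(seq[i:i + window])
        reverse.extend((i, j) for j in index.get(rc, []))

    return sorted(forward), sorted(reverse)


# Toy sequence with one 10 bp direct repeat separated by a 4 bp spacer.
toy_forward, _ = self_dot_plot("ACGTTGCAAG" + "TTTT" + "ACGTTGCAAG")
# Toy sequence whose second half is the reverse complement of the first.
_, toy_reverse = self_dot_plot("AAAACCCCGG" + "CCGGGGTTTT")
```

Long runs of off-diagonal points in either list would indicate tandem or inverted repeats; an essentially empty result corresponds to the "no significant tandem repeat" conclusion above.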

Overview of the GC Content Distribution

Window size: 300 bp

[Figure: GC content distribution along the sequence.]

Summary: Full length (8483 bp) | A (30.89%, 2620) | C (18.88%, 1602) | T (32.15%, 2727) | G (18.08%, 1534)

Note: The sequence of homologous arms and cKO region is analyzed to determine the GC content. No significant high GC-content region is found. So this region is suitable for PCR screening or sequencing analysis.
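The overall GC content implied by the base counts in the summary can be checked with a few lines of arithmetic. The counts are those reported above; the variable names are illustrative only.

```python
# Base counts from the GC content summary of the analysed sequence.
base_counts = {"A": 2620, "C": 1602, "T": 2727, "G": 1534}

total_bases = sum(base_counts.values())
gc_fraction = (base_counts["G"] + base_counts["C"]) / total_bases

print(f"length = {total_bases} bp, GC = {gc_fraction:.2%}")
```

The counts sum to the stated full length of 8483 bp and give an overall GC content of roughly 37%, in line with the conclusion that the region is not GC-rich.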


BLAT Search Results (up)

QUERY    SCORE  START  END   QSIZE  IDENTITY  CHROM  STRAND  START      END        SPAN
---------------------------------------------------------------------------------------
YourSeq  3000   1      3000  3000   100.0%    chr5   +       14517593   14520592   3000
YourSeq  31     769    806   3000   83.4%     chr18  -       66210547   66210582   36
YourSeq  31     783    818   3000   85.3%     chr2   +       104103348  104103381  34
YourSeq  29     766    806   3000   94.2%     chr2   +       31679124   31679167   44
YourSeq  28     770    797   3000   100.0%    chr17  -       65579921   65579948   28
YourSeq  23     773    796   3000   100.0%    chr1   +       16205376   16205401   26
YourSeq  22     297    321   3000   95.9%     chr3   -       11231950   11231976   27

Note: The 3000 bp section upstream of Exon 2 is BLAT searched against the genome. No significant similarity is found.

BLAT Search Results (down)

QUERY    SCORE  START  END   QSIZE  IDENTITY  CHROM  STRAND  START      END        SPAN
-----------------------------------------------------------------------------------------
YourSeq  3000   1      3000  3000   100.0%    chr5   +       14522576   14525575   3000
YourSeq  294    814    1580  3000   82.8%     chr3   -       56173229   56407715   234487
YourSeq  260    774    1409  3000   81.7%     chr13  -       101648988  101649617  630
YourSeq  250    760    1682  3000   77.3%     chr14  +       40398770   40399695   926
YourSeq  248    760    1596  3000   82.6%     chr8   +       61033206   61457488   424283
YourSeq  238    805    1688  3000   80.2%     chr4   -       21590862   21591740   879
YourSeq  228    777    1620  3000   82.1%     chr5   +       64305526   64306520   995
YourSeq  225    891    1362  3000   76.3%     chr19  +       11369691   11370404   714
YourSeq  224    748    1423  3000   81.0%     chr4   -       35234418   35235080   663
YourSeq  222    774    1542  3000   79.2%     chr4   +       48120989   48121735   747
YourSeq  214    937    1665  3000   81.2%     chr13  -       90706689   90707443   755
YourSeq  208    780    1278  3000   76.6%     chr11  +       30332028   30332532   505
YourSeq  202    774    1278  3000   81.4%     chr13  +       48521324   48521848   525
YourSeq  197    760    1542  3000   84.4%     chr16  -       13062561   13063337   777
YourSeq  192    806    1298  3000   78.0%     chr19  -       20318863   20319352   490
YourSeq  190    904    1414  3000   82.7%     chr7   -       67582214   67582700   487
YourSeq  190    1180   1626  3000   82.3%     chr14  +       56971409   56971839   431
YourSeq  190    1055   1546  3000   80.0%     chr1   +       94613793   94614299   507
YourSeq  185    799    1278  3000   82.6%     chr16  +       12376675   12502113   125439
YourSeq  183    832    1710  3000   78.3%     chr16  -       5275846    5276689    844

Note: The 3000 bp section downstream of Exon 2 is BLAT searched against the genome. No significant similarity is found.
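The chr5 coordinates of the two 3000 bp BLAT queries can be used to cross-check the stated size of the effective cKO region. The sketch below assumes that the upstream and downstream query windows directly flank the cKO region with no gap; that is an interpretation of the report, not something it states explicitly, and the names used are illustrative.

```python
# Top chr5 hits (100% identity) from the two BLAT searches, 1-based inclusive.
UPSTREAM_QUERY = (14517593, 14520592)    # 3000 bp upstream section
DOWNSTREAM_QUERY = (14522576, 14525575)  # 3000 bp downstream section


def span(start, end):
    """Length in bp of a 1-based, inclusive interval."""
    return end - start + 1


# Assumption: the cKO region is exactly the interval between the two queries.
cko_start = UPSTREAM_QUERY[1] + 1
cko_end = DOWNSTREAM_QUERY[0] - 1
cko_size = span(cko_start, cko_end)

print(f"chr5:{cko_start}-{cko_end} ({cko_size} bp)")
```

Under that assumption the interval between the two queries is 1983 bp, which matches the effective cKO region size given in the strategy summary.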


Gene and information: Pclo piccolo (presynaptic cytomatrix protein) [ Mus musculus (house mouse) ] Gene ID: 26875, updated on 12-Aug-2019

Gene summary

Official Symbol: Pclo (provided by MGI)
Official Full Name: piccolo (presynaptic cytomatrix protein) (provided by MGI)
Primary source: MGI:MGI:1349390
See related: Ensembl:ENSMUSG00000061601
Gene type: protein coding
RefSeq status: VALIDATED
Organism: Mus musculus
Lineage: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
Also known as: Acz; Pico
Expression: Biased expression in cerebellum adult (RPKM 10.3), cortex adult (RPKM 8.0) and 5 other tissues
Orthologs: human

Genomic context

Location: 5; 5 A1

Exon count: 25

Annotation release  Status             Assembly                      Chr  Location
108                 current            GRCm38.p6 (GCF_000001635.26)  5    NC_000071.6 (14514906..14863465)
Build 37.2          previous assembly  MGSCv37 (GCF_000001635.18)    5    NC_000071.5 (14514918..14863459)

[Figure: genomic context of Pclo on chromosome 5 (NC_000071.6).]


Transcript information: This gene has 4 transcripts

Gene: Pclo ENSMUSG00000061601

Description: piccolo (presynaptic cytomatrix protein) [Source:MGI Symbol;Acc:MGI:1349390]
Gene Synonyms: Acz, Pico
Location: Chromosome 5: 14,514,918-14,863,459 forward strand. GRCm38:CM000998.2
About this gene: This gene has 4 transcripts (splice variants), 294 orthologues, 1 paralogue, is a member of 1 Ensembl protein family and is associated with 9 phenotypes.

Transcripts

Name      Transcript ID          bp     Protein     Translation ID        Biotype          CCDS       UniProt  Flags
Pclo-201  ENSMUST00000030691.16  20084  5068aa      ENSMUSP00000030691.9  Protein coding   CCDS51415  Q9QYX7   TSL:5, GENCODE basic, APPRIS P1
Pclo-202  ENSMUST00000182407.7   16846  4863aa      ENSMUSP00000138419.1  Protein coding   CCDS59667  Q9QYX7   TSL:5, GENCODE basic
Pclo-204  ENSMUST00000182915.1   6738   1592aa      ENSMUSP00000138607.1  Protein coding   -          S4R2E0   CDS 5' incomplete, TSL:1
Pclo-203  ENSMUST00000182426.2   17063  No protein  -                     Retained intron  -          -        TSL:2


[Figure: Ensembl genome browser view of the Pclo locus, 368.54 kb, chromosome 5 (14.6 Mb to 14.8 Mb), forward and reverse strands. Forward-strand tracks (comprehensive gene set): Pclo-201 and Pclo-202 (protein coding), Pclo-203 (retained intron), Pclo-204 (protein coding), and TEC transcripts Gm42444-201, Gm42445-201, Gm42446-201, Gm42447-201, Gm42448-201, Gm43662-201, Gm43663-201 and Gm43665-201. Contigs: AC125043.4, AC125533.4. Reverse-strand tracks: lncRNAs Gm26918-201, Gm26954-201 and Gm26954-202. Regulatory Build legend: CTCF, enhancer, open chromatin, promoter, promoter flank, transcription factor binding site. Gene legend: protein coding (Ensembl protein coding; merged Ensembl/Havana) and non-protein coding (processed transcript; RNA gene).]


Transcript: ENSMUST00000030691

[Figure: protein domain and sequence variant view of Pclo-201 (protein coding; 348.54 kb, forward strand), drawn along the translated protein on a scale of 0 to 5068 residues. Annotation tracks: MobiDB lite, low complexity (Seg), coiled-coils (Ncoils), Superfamily, SMART, Pfam, PROSITE profiles, PANTHER, Gene3D and CDD. Features named in the figure: Zinc finger, FYVE/PHD-type; Zinc finger, piccolo-type; Zinc finger, RING/FYVE/PHD-type; PDZ domain and PDZ superfamily; C2 domain and C2 domain superfamily; Protein piccolo. Identifiers shown: SSF49562, PTHR14113, 2.30.42.10, cd15774, cd15776, cd00992, cd04031. Variant track: sequence variants (dbSNP and all other sources), with legend entries frameshift variant, inframe insertion, missense variant, splice region variant and synonymous variant.]

We wish to acknowledge the following valuable scientific information resources: Ensembl, MGI, NCBI, UCSC.
