https://www.alphaknockout.com

Mouse Itgb1bp1 Conditional Knockout Project (CRISPR/Cas9)

Objective: To create an Itgb1bp1 conditional knockout mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering.

Strategy summary: The Itgb1bp1 gene (NCBI Reference Sequence: NM_008403; Ensembl: ENSMUSG00000062352) is located on mouse chromosome 12. Seven exons are identified, with the ATG start codon in exon 2 and the TGA stop codon in exon 7 (Transcript: ENSMUST00000076260). Exon 3 will be selected as the conditional knockout region (cKO region). Deletion of this region should result in loss of function of the mouse Itgb1bp1 gene. To engineer the targeting vector, homologous arms and the cKO region will be generated by PCR using BAC clone RP24-149N4 as template. Cas9, gRNA and the targeting vector will be co-injected into fertilized eggs for cKO mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Mice homozygous for a null allele exhibit postnatal lethality, reduced weight and length, reduced ossification, and skull and skeleton abnormalities. Mice homozygous for a gene trap mutation are viable and do not exhibit any obvious abnormalities.

Exon 3 starts at about 12.17% of the coding region. Knockout of exon 3 will result in a frameshift of the gene. The size of intron 2, for 5'-loxP site insertion, is 2501 bp, and the size of intron 3, for 3'-loxP site insertion, is 1917 bp. The size of the effective cKO region is ~579 bp. The cKO region does not contain any other known gene.
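The frameshift statement rests on reading-frame arithmetic: deleting a number of coding nucleotides that is not a multiple of 3 shifts every downstream codon. The sketch below is only an illustration. The exon lengths passed to `causes_frameshift` are hypothetical, because this report does not state the length of exon 3. The second helper converts the reported 12.17% into an approximate coding position, assuming a 600-nt coding sequence (200 codons for the 200 aa protein of ENSMUST00000076260, stop codon excluded).

```python
def causes_frameshift(deleted_coding_nt: int) -> bool:
    """True if deleting this many coding nucleotides shifts the reading frame."""
    if deleted_coding_nt < 0:
        raise ValueError("length must be non-negative")
    return deleted_coding_nt % 3 != 0


def approx_coding_offset(fraction: float, cds_length_nt: int) -> int:
    """Approximate count of coding nucleotides lying before a given fraction of the CDS."""
    return round(fraction * cds_length_nt)


# Hypothetical exon lengths, for illustration only (not taken from the report).
for length in (90, 100):
    print(length, causes_frameshift(length))

# Assumption: 200 codons x 3 nt = 600 nt of coding sequence.
print(approx_coding_offset(0.1217, 600))
```

Under that assumption, roughly 73 coding nucleotides would precede exon 3, so the frameshift would affect most of the protein.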


Overview of the Targeting Strategy

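[Figure: schematic of the targeting strategy showing, from top to bottom, the wildtype allele with the 5' and 3' gRNA regions (exon labels 1, 3, 4 and 7), the targeting vector, the targeted allele, and the constitutive KO allele (after Cre recombination). Legend: exon of mouse Itgb1bp1, homology arm, cKO region, loxP site.]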
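The step from the targeted allele to the constitutive KO allele can be sketched as a string operation: Cre recombination between two loxP sites in the same orientation removes the intervening segment and leaves a single loxP site. This is a toy model, not the project's pipeline. The flanking and floxed sequences are hypothetical placeholders, and only the 34-bp canonical loxP sequence is real.

```python
LOXP = "ATAACTTCGTATAGCATACATTATACGAAGTTAT"  # canonical 34-bp loxP site


def cre_excise(allele: str, site: str = LOXP) -> str:
    """Delete the segment between two same-orientation loxP sites, leaving one site."""
    first = allele.find(site)
    second = allele.find(site, first + len(site)) if first != -1 else -1
    if first == -1 or second == -1:
        return allele  # fewer than two sites: nothing to recombine
    return allele[:first] + allele[second:]


# Toy sequences (hypothetical, not the real Itgb1bp1 locus).
targeted = "aaaa" + LOXP + "ccccgggg" + LOXP + "tttt"
print(cre_excise(targeted))
```

The floxed placeholder stands in for exon 3 and its flanking intronic sequence; an allele with fewer than two sites is returned unchanged.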


Overview of the Dot Plot

Window size: 10 bp

[Figure: dot plot of Sequence 12 aligned against itself, showing forward and reverse-complement matches.]

Note: The sequence of the homologous arms and cKO region is aligned with itself to determine whether there are tandem repeats. No significant tandem repeat is found in the dot plot matrix, so this region is suitable for PCR screening or sequencing analysis.
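A self dot plot of this kind can be approximated by indexing every window-length word in the sequence and reporting the pairs of positions that share a word. The sketch below is a minimal illustration under stated assumptions: it covers forward-strand exact matches only (the report's plot also includes the reverse complement), and the function name and toy sequences are hypothetical.

```python
def self_dot_plot(seq: str, window: int = 10) -> list[tuple[int, int]]:
    """Return off-diagonal (i, j) pairs, i < j, whose window-length words are identical."""
    positions: dict[str, list[int]] = {}
    for i in range(len(seq) - window + 1):
        positions.setdefault(seq[i:i + window], []).append(i)
    hits = []
    for starts in positions.values():
        for a in range(len(starts)):
            for b in range(a + 1, len(starts)):
                hits.append((starts[a], starts[b]))
    return sorted(hits)


# Toy input: a 10-mer repeated in tandem produces one off-diagonal hit.
tandem = "ACGTTGCAAG" * 2
print(self_dot_plot(tandem, window=10))

# Toy input without repeats produces no hits.
print(self_dot_plot("ACGTTGCAAGTC", window=10))
```

Runs of hits parallel to the main diagonal would indicate tandem repeats; an empty or sparse result corresponds to the "no significant tandem repeat" conclusion.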

Overview of the GC Content Distribution

Window size: 300 bp

[Figure: GC content distribution along Sequence 12.]

Summary: Full Length (7079 bp) | A (25.43%, 1800) | C (21.10%, 1494) | T (31.56%, 2234) | G (21.91%, 1551)

Note: The sequence of the homologous arms and cKO region is analyzed to determine the GC content. No significant high GC-content region is found, so this region is suitable for PCR screening or sequencing analysis.
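The overall GC content follows directly from the base counts in the Summary line. The sketch below recomputes it and adds a simple windowed helper. The helper uses non-overlapping windows, which is an assumption: the report does not say how its 300 bp windows are stepped, and the toy sequence is hypothetical.

```python
counts = {"A": 1800, "C": 1494, "T": 2234, "G": 1551}  # from the Summary line

total = sum(counts.values())
gc_fraction = (counts["G"] + counts["C"]) / total

print(total)
print(f"GC content: {gc_fraction:.2%}")


def windowed_gc(seq: str, window: int) -> list[float]:
    """GC fraction of each consecutive, non-overlapping window (assumed stepping)."""
    return [
        (seq[i:i + window].count("G") + seq[i:i + window].count("C")) / window
        for i in range(0, len(seq) - window + 1, window)
    ]


# Toy sequence, for illustration only.
print(windowed_gc("GGCCAATT", 4))
```

The counts sum to the stated full length of 7079 bp, and the overall GC content works out to about 43%.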


BLAT Search Results (up)

QUERY    SCORE  START  END   QSIZE  IDENTITY  CHROM  STRAND  START      END        SPAN
---------------------------------------------------------------------------------------
YourSeq  3000   1      3000  3000   100.0%    chr12  -       21277149   21280148   3000
YourSeq  239    1562   1968  3000   86.5%     chr5   +       106361907  106362358  452
YourSeq  238    1498   1923  3000   86.7%     chr8   +       86691150   86691694   545
YourSeq  228    1408   1918  3000   85.0%     chr13  -       113704795  113705293  499
YourSeq  219    1551   1916  3000   83.8%     chr15  +       41341368   41341759   392
YourSeq  212    1407   1863  3000   88.2%     chr3   -       69818219   69818685   467
YourSeq  209    1469   1930  3000   86.2%     chr7   -       140368946  140369442  497
YourSeq  194    1601   1916  3000   85.7%     chr15  +       42208580   42208911   332
YourSeq  189    1414   1885  3000   86.9%     chr13  +       30404836   30405347   512
YourSeq  186    1436   1918  3000   83.1%     chr12  -       27979444   27979940   497
YourSeq  184    1603   1976  3000   90.8%     chr5   +       32369653   32370045   393
YourSeq  175    1405   1978  3000   84.8%     chrX   -       75595260   75595889   630
YourSeq  175    1407   1896  3000   86.1%     chr1   +       119492161  119492746  586
YourSeq  172    1409   1976  3000   89.6%     chr10  -       45816415   45817062   648
YourSeq  161    1505   1971  3000   84.8%     chr10  -       34053706   34054191   486
YourSeq  151    1612   1971  3000   88.3%     chr9   +       14962650   14963010   361
YourSeq  143    1552   1968  3000   85.8%     chr15  +       37619829   37620234   406
YourSeq  142    1550   1968  3000   77.1%     chr5   +       66243623   66244075   453
YourSeq  139    1407   1916  3000   79.6%     chr18  +       73081332   73081763   432
YourSeq  136    1414   1955  3000   85.2%     chr14  -       120809851  120810391  541

Note: The 3000 bp section upstream of Exon 3 is BLAT searched against the genome. No significant similarity is found.
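One way to examine such a result table is to parse each row, set aside the full-length self-hit, and compare the best remaining score with the query size. The sketch below does this for the first three upstream rows. The `BlatHit` type and `parse_hit` helper are hypothetical names, and the report does not state what threshold it uses for "significant", so none is applied here.

```python
from typing import NamedTuple


class BlatHit(NamedTuple):
    score: int
    q_start: int
    q_end: int
    q_size: int
    identity: float
    chrom: str
    strand: str
    t_start: int
    t_end: int
    span: int


def parse_hit(line: str) -> BlatHit:
    """Parse one row: QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN."""
    f = line.split()
    return BlatHit(int(f[1]), int(f[2]), int(f[3]), int(f[4]),
                   float(f[5].rstrip("%")), f[6], f[7],
                   int(f[8]), int(f[9]), int(f[10]))


# First three rows of the upstream search results.
rows = [
    "YourSeq 3000 1 3000 3000 100.0% chr12 - 21277149 21280148 3000",
    "YourSeq 239 1562 1968 3000 86.5% chr5 + 106361907 106362358 452",
    "YourSeq 238 1498 1923 3000 86.7% chr8 + 86691150 86691694 545",
]
hits = [parse_hit(r) for r in rows]
self_hit = max(hits, key=lambda h: h.score)          # the query matching its own locus
secondary = [h for h in hits if h is not self_hit]   # every other genomic match
best = max(secondary, key=lambda h: h.score)
print(best.chrom, best.score, f"{best.score / best.q_size:.1%} of query length")
```

The best secondary score is a small fraction of the 3000 bp query, which is consistent with the note that no significant similarity is found.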

BLAT Search Results (down)

QUERY    SCORE  START  END   QSIZE  IDENTITY  CHROM  STRAND  START      END        SPAN
---------------------------------------------------------------------------------------
YourSeq  3000   1      3000  3000   100.0%    chr12  -       21273570   21276569   3000
YourSeq  224    1348   2513  3000   97.9%     chr11  +       97725466   97972363   246898
YourSeq  192    2357   2623  3000   93.2%     chrX   +       13250827   13251306   480
YourSeq  173    2280   2518  3000   88.3%     chr4   +       107355013  107355212  200
YourSeq  160    2346   2517  3000   98.3%     chr2   -       26328368   26328539   172
YourSeq  160    2320   2518  3000   90.3%     chr1   -       157288824  157288999  176
YourSeq  154    2361   2616  3000   85.3%     chr4   +       125159867  125160119  253
YourSeq  152    2305   2508  3000   92.6%     chr1   -       99440740   99440958   219
YourSeq  152    2323   2517  3000   89.7%     chr13  +       92093378   92093542   165
YourSeq  149    2346   2513  3000   94.7%     chr10  +       33988750   33988918   169
YourSeq  147    2359   2516  3000   96.2%     chr12  +       28653409   28653565   157
YourSeq  145    2360   2530  3000   92.9%     chr1   -       85449801   85449984   184
YourSeq  143    2346   2519  3000   91.4%     chr15  -       22263998   22264173   176
YourSeq  143    2373   2573  3000   92.4%     chr1   -       86296702   86297280   579
YourSeq  142    2358   2517  3000   95.0%     chr1   -       72618610   72618776   167
YourSeq  141    2333   2513  3000   89.5%     chrX   -       103188729  103188880  152
YourSeq  141    2314   2514  3000   86.7%     chr1   -       61661063   61661240   178
YourSeq  140    2358   2516  3000   94.4%     chr6   -       113708609  113708770  162
YourSeq  139    505    657   3000   95.5%     chr17  +       24172604   24172756   153
YourSeq  138    503    664   3000   91.4%     chr2   -       104944614  104944774  161

Note: The 3000 bp section downstream of Exon 3 is BLAT searched against the genome. No significant similarity is found.
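The two 100% self-hits on chr12 also allow a quick consistency check against the stated cKO region size. The sketch below assumes the displayed coordinates are inclusive at both ends, and that the interval between the two searched blocks corresponds to the effective cKO region; that correspondence is an inference from the numbers, not something the report states.

```python
# Target coordinates (chr12, minus strand) of the two 100% self-hits.
upstream = (21277149, 21280148)    # 3000 bp searched upstream of exon 3
downstream = (21273570, 21276569)  # 3000 bp searched downstream of exon 3


def length(interval: tuple[int, int]) -> int:
    """Length of an interval whose start and end are both inclusive."""
    start, end = interval
    return end - start + 1


# On the minus strand the "upstream" block has the higher coordinates,
# so the region between the blocks runs from downstream end + 1 to upstream start - 1.
between = (downstream[1] + 1, upstream[0] - 1)

print(length(upstream), length(downstream))
print(between, length(between))
```

The interval between the blocks is 579 bp long, which agrees with the ~579 bp effective cKO region given in the strategy summary.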


Gene name and information: Itgb1bp1 integrin beta 1 binding protein 1 [Mus musculus (house mouse)] Gene ID: 16413, updated on 24-Oct-2019

Gene summary

Official Symbol: Itgb1bp1 (provided by MGI)
Official Full Name: integrin beta 1 binding protein 1 (provided by MGI)
Primary source: MGI:MGI:1306802
See related: Ensembl:ENSMUSG00000062352
Gene type: protein coding
RefSeq status: VALIDATED
Organism: Mus musculus
Lineage: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
Also known as: Icap1; AI449260; AU019480
Expression: Ubiquitous expression in testis adult (RPKM 30.9), subcutaneous fat pad adult (RPKM 11.0) and 24 other tissues
Orthologs: human

Genomic context

Location: 12; 12 A1.3

Exon count: 8

Annotation release  Status             Assembly                      Chr  Location
108                 current            GRCm38.p6 (GCF_000001635.26)  12   NC_000078.6 (21267246..21286292, complement)
Build 37.2          previous assembly  MGSCv37 (GCF_000001635.18)    12   NC_000078.5 (21275667..21292098, complement)

[Figure: genomic context of Itgb1bp1 on Chromosome 12 - NC_000078.6.]


Transcript information: This gene has 7 transcripts

Gene: Itgb1bp1 ENSMUSG00000062352

Description: integrin beta 1 binding protein 1 [Source:MGI Symbol;Acc:MGI:1306802]
Gene Synonyms: bodenin
Location: Chromosome 12: 21,240,825-21,286,284 reverse strand. GRCm38:CM001005.2
About this gene: This gene has 7 transcripts (splice variants), 198 orthologues, is a member of 1 Ensembl protein family and is associated with 25 phenotypes.

Transcripts

Name          Transcript ID          bp    Protein     Translation ID        Biotype          CCDS       UniProt  Flags
Itgb1bp1-201  ENSMUST00000076260.11  1921  200aa       ENSMUSP00000075609.4  Protein coding   CCDS25833  O35671   TSL:1 GENCODE basic APPRIS P1
Itgb1bp1-206  ENSMUST00000173729.7   1701  200aa       ENSMUSP00000134627.1  Protein coding   CCDS25833  O35671   TSL:1 GENCODE basic APPRIS P1
Itgb1bp1-207  ENSMUST00000232072.1   954   200aa       ENSMUSP00000156312.1  Protein coding   CCDS25833  O35671   GENCODE basic APPRIS P1
Itgb1bp1-202  ENSMUST00000172834.1   801   188aa       ENSMUSP00000134508.1  Protein coding   -          Q3UGH2   TSL:1 GENCODE basic
Itgb1bp1-205  ENSMUST00000173688.1   512   84aa        ENSMUSP00000133557.1  Protein coding   -          G3UX55   CDS 5' incomplete TSL:3
Itgb1bp1-204  ENSMUST00000173614.1   858   No protein  -                     Retained intron  -          -        TSL:2
Itgb1bp1-203  ENSMUST00000172962.1   448   No protein  -                     lncRNA           -          -        TSL:3


[Figure: Ensembl genome browser view of a 65.46 kb region (21.24 Mb to 21.29 Mb). Forward strand: Asap2-201, Asap2-202, Asap2-203 and Asap2-204 (protein coding); Cpsf3-201, Cpsf3-205, Cpsf3-206, Cpsf3-207, Cpsf3-209 and Cpsf3-211 (protein coding); Cpsf3-204 (nonsense mediated decay); Gm25821-201 (snoRNA). Contig: AC156032.4. Reverse strand: Itgb1bp1-201, Itgb1bp1-202, Itgb1bp1-205, Itgb1bp1-206 and Itgb1bp1-207 (protein coding); Itgb1bp1-204 (retained intron); Itgb1bp1-203 (lncRNA); Gm20535-201 (lncRNA). Regulatory Build legend: CTCF, enhancer, open chromatin, promoter, promoter flank. Gene legend: protein coding (merged Ensembl/Havana, Ensembl protein coding); non-protein coding (processed transcript, RNA gene).]


Transcript: ENSMUST00000076260

[Figure: protein domain and variant view of Itgb1bp1-201 (protein coding, reverse strand, 16.48 kb), translation ENSMUSP00000075... Tracks: MobiDB lite; low complexity (Seg); Superfamily SSF50729; SMART PTB/PI domain; Pfam Integrin binding protein, ICAP-1; PANTHER Integrin binding protein, ICAP-1; Gene3D PH-like domain superfamily; CDD cd13163; sequence variants (dbSNP and all other sources). Variant legend: synonymous variant. Scale bar: 0 to 200.]

We wish to acknowledge the following valuable scientific information resources: Ensembl, MGI, NCBI, UCSC.
