Mouse Apoa2 Knockout Project (CRISPR/Cas9)

Total Page:16

File Type:pdf, Size:1020Kb

Mouse Apoa2 Knockout Project (CRISPR/Cas9) https://www.alphaknockout.com Mouse Apoa2 Knockout Project (CRISPR/Cas9) Objective: To create a Apoa2 knockout Mouse model (C57BL/6N) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Apoa2 gene (NCBI Reference Sequence: NM_013474 ; Ensembl: ENSMUSG00000005681 ) is located on Mouse chromosome 1. 4 exons are identified, with the ATG start codon in exon 2 and the TGA stop codon in exon 4 (Transcript: ENSMUST00000005824). Exon 2~4 will be selected as target site. Cas9 and gRNA will be co-injected into fertilized eggs for KO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Homozygous null mutation of this gene results in a reduction of total cholesterol, HDL cholesterol, free fatty acids, insulin, and glucose levels in both the fasted and unfasted states. Strain specific alleles have been associated with varying degrees of amyloidosis. Exon 2 starts from about 0.33% of the coding region. Exon 2~4 covers 100.0% of the coding region. The size of effective KO region: ~2717 bp. The KO region does not have any other known gene. Page 1 of 9 https://www.alphaknockout.com Overview of the Targeting Strategy Wildtype allele 5' gRNA region gRNA region 3' 1 2 3 4 Legends Exon of mouse Apoa2 Knockout region Page 2 of 9 https://www.alphaknockout.com Overview of the Dot Plot (up) Window size: 15 bp Forward Reverse Complement Sequence 12 Note: The 2000 bp section upstream of start codon is aligned with itself to determine if there are tandem repeats. Tandem repeats are found in the dot plot matrix. The gRNA site is selected outside of these tandem repeats. Overview of the Dot Plot (down) Window size: 15 bp Forward Reverse Complement Sequence 12 Note: The 2000 bp section downstream of stop codon is aligned with itself to determine if there are tandem repeats. No significant tandem repeat is found in the dot plot matrix. So this region is suitable for PCR screening or sequencing analysis. Page 3 of 9 https://www.alphaknockout.com Overview of the GC Content Distribution (up) Window size: 300 bp Sequence 12 Summary: Full Length(2000bp) | A(24.4% 488) | C(22.25% 445) | T(29.35% 587) | G(24.0% 480) Note: The 2000 bp section upstream of start codon is analyzed to determine the GC content. No significant high GC-content region is found. So this region is suitable for PCR screening or sequencing analysis. Overview of the GC Content Distribution (down) Window size: 300 bp Sequence 12 Summary: Full Length(2000bp) | A(28.65% 573) | C(22.45% 449) | T(24.85% 497) | G(24.05% 481) Note: The 2000 bp section downstream of stop codon is analyzed to determine the GC content. No significant high GC-content region is found. So this region is suitable for PCR screening or sequencing analysis. Page 4 of 9 https://www.alphaknockout.com BLAT Search Results (up) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 2000 1 2000 2000 100.0% chr1 + 171223316 171225315 2000 browser details YourSeq 127 1135 1349 2000 93.9% chr10 - 24723126 24723354 229 browser details YourSeq 82 259 340 2000 100.0% chr16 - 52843736 52843817 82 browser details YourSeq 71 38 205 2000 75.9% chr16 + 21683445 21683555 111 browser details YourSeq 65 119 214 2000 92.3% chr3 - 121164923 121165034 112 browser details YourSeq 65 334 445 2000 82.2% chr12 - 55119412 55119528 117 browser details YourSeq 61 334 445 2000 80.2% chr12 - 55268717 55268833 117 browser details YourSeq 61 156 224 2000 94.3% chr15 + 64633115 64633183 69 browser details YourSeq 57 104 213 2000 78.3% chr2 - 132721996 132722083 88 browser details YourSeq 55 158 224 2000 91.1% chr6 - 103556366 103556432 67 browser details YourSeq 55 158 228 2000 88.8% chr12 - 47585765 47585835 71 browser details YourSeq 53 158 214 2000 96.5% chr13 - 99347589 99347645 57 browser details YourSeq 53 158 214 2000 96.5% chr5 + 150512622 150512678 57 browser details YourSeq 53 158 214 2000 96.5% chr19 + 36787701 36787757 57 browser details YourSeq 52 165 223 2000 91.0% chr11 + 102786435 102786491 57 browser details YourSeq 51 158 214 2000 94.8% chr14 - 48639398 48639454 57 browser details YourSeq 51 160 214 2000 92.6% chr6 + 146546179 146546232 54 browser details YourSeq 51 158 214 2000 94.8% chr15 + 73203881 73203937 57 browser details YourSeq 51 158 214 2000 94.8% chr10 + 78556608 78556664 57 browser details YourSeq 50 1128 1180 2000 98.1% chr9 - 67427649 67427702 54 Note: The 2000 bp section upstream of start codon is BLAT searched against the genome. No significant similarity is found. BLAT Search Results (down) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 2000 1 2000 2000 100.0% chr1 + 171226265 171228264 2000 browser details YourSeq 259 574 1039 2000 89.9% chr2 + 22879350 22879857 508 browser details YourSeq 230 565 874 2000 91.1% chr8 + 83304921 83383152 78232 browser details YourSeq 222 568 866 2000 90.1% chr2 - 170689823 170690412 590 browser details YourSeq 221 568 875 2000 91.2% chr12 + 74374250 74374591 342 browser details YourSeq 220 568 846 2000 92.7% chr1 - 182159708 182647492 487785 browser details YourSeq 217 568 849 2000 92.3% chr13 + 58954667 58955030 364 browser details YourSeq 210 585 866 2000 91.5% chr10 - 60517626 60517957 332 browser details YourSeq 210 585 874 2000 90.9% chr7 + 133226656 133226973 318 browser details YourSeq 208 568 849 2000 91.1% chr18 + 49970447 49970760 314 browser details YourSeq 207 590 866 2000 89.4% chrX - 40286934 40287262 329 browser details YourSeq 207 585 861 2000 90.7% chr6 - 49362948 49363256 309 browser details YourSeq 206 568 849 2000 89.4% chr1 - 88755493 88755832 340 browser details YourSeq 205 585 866 2000 93.0% chr9 - 80058186 80058495 310 browser details YourSeq 205 590 865 2000 88.0% chr14 + 106073899 106074196 298 browser details YourSeq 201 590 861 2000 91.5% chr13 - 57070722 57071020 299 browser details YourSeq 200 595 849 2000 91.8% chr16 + 89293976 89294257 282 browser details YourSeq 199 585 843 2000 91.8% chr16 - 35016606 35016927 322 browser details YourSeq 199 590 866 2000 90.5% chr18 + 35002922 35003225 304 browser details YourSeq 198 585 861 2000 89.6% chr1 - 168279794 168280092 299 Note: The 2000 bp section downstream of stop codon is BLAT searched against the genome. No significant similarity is found. Page 5 of 9 https://www.alphaknockout.com Gene and protein information: Apoa2 apolipoprotein A-II [ Mus musculus (house mouse) ] Gene ID: 11807, updated on 10-Oct-2019 Gene summary Official Symbol Apoa2 provided by MGI Official Full Name apolipoprotein A-II provided by MGI Primary source MGI:MGI:88050 See related Ensembl:ENSMUSG00000005681 Gene type protein coding RefSeq status REVIEWED Organism Mus musculus Lineage Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus Also known as Alp-2; Hdl-1; ApoAII; Apoa-2; Apo-AII; ApoA-II Summary This gene encodes a component of high density lipoproteins (HDL). Mice lacking the encoded protein have low HDL- Expression cholesterol levels, smaller HDL particles, increased clearance of triglyceride-rich lipoproteins and insulin hypersensitivity. Transgenic mice overexpressing the encoded protein have elevated levels of HDL-cholesterol and show increased susceptibility to atherosclerosis. Alternative splicing of this gene results in multiple variants. [provided by RefSeq, Mar 2015] Orthologs Biased expression in liver adult (RPKM 4143.8), liver E18 (RPKM 1768.2) and 3 other tissues See more human all Genomic context Location: 1 H3; 1 79.22 cM See Apoa2 in Genome Data Viewer Exon count: 5 Annotation release Status Assembly Chr Location 108 current GRCm38.p6 (GCF_000001635.26) 1 NC_000067.6 (171221564..171226379) Build 37.2 previous assembly MGSCv37 (GCF_000001635.18) 1 NC_000067.5 (173155185..173156510) Chromosome 1 - NC_000067.6 Page 6 of 9 https://www.alphaknockout.com Transcript information: This gene has 4 transcripts Gene: Apoa2 ENSMUSG00000005681 Description apolipoprotein A-II [Source:MGI Symbol;Acc:MGI:88050] Gene Synonyms Alp-2, ApoA-II, Apoa-2, Hdl-1 Location Chromosome 1: 171,225,054-171,226,379 forward strand. GRCm38:CM000994.2 About this gene This gene has 4 transcripts (splice variants), 83 orthologues, is a member of 1 Ensembl protein family and is associated with 8 phenotypes. Transcripts Name Transcript ID bp Protein Translation ID Biotype CCDS UniProt Flags Apoa2-204 ENSMUST00000111321.7 518 102aa ENSMUSP00000106953.1 Protein coding CCDS35773 P09813 TSL:2 GENCODE basic APPRIS P1 Apoa2-201 ENSMUST00000005824.11 492 102aa ENSMUSP00000005824.5 Protein coding CCDS35773 P09813 TSL:1 GENCODE basic APPRIS P1 Apoa2-203 ENSMUST00000111320.7 470 102aa ENSMUSP00000106952.1 Protein coding CCDS35773 P09813 TSL:2 GENCODE basic APPRIS P1 Apoa2-202 ENSMUST00000111319.1 458 102aa ENSMUSP00000106951.1 Protein coding CCDS35773 P09813 TSL:3 GENCODE basic APPRIS P1 Page 7 of 9 https://www.alphaknockout.com 21.33 kb Forward strand 171.220Mb 171.225Mb 171.230Mb 171.235Mb Genes (Comprehensive set... Nr1i3-201 >protein coding Apoa2-204 >protein coding Nr1i3-202 >protein coding Apoa2-201 >protein coding Nr1i3-208 >protein coding Apoa2-203 >protein coding Nr1i3-205 >retained intron Apoa2-202 >protein coding Nr1i3-204 >nonsense mediated decay Nr1i3-207 >retained intron Nr1i3-203 >protein coding Nr1i3-206 >retained intron
Recommended publications
  • Role of Tomm40'523 – Apoe Haplotypes in Alzheimer's Disease Etiology
    ROLE OF TOMM40’523 – APOE HAPLOTYPES IN ALZHEIMER’S DISEASE ETIOLOGY – FROM CLINICS TO MITOCHONDRIA RÈMY CARDOSO Tese para obtenção do grau de Doutor em EnvelHecimento e Doenças Crónicas Doutoramento em associação entre: Universidade NOVA de Lisboa (Faculdade de Ciências Médicas | NOVA Medical ScHool - FCM|NMS/UNL) Universidade de Coimbra (Faculdade de Medicina - FM/UC) Universidade do MinHo (Escola de Medicina - EMed/UM) Novembro, 2020 ROLE OF TOMM40’523 – APOE HAPLOTYPES IN ALZHEIMER’S DISEASE ETIOLOGY – FROM CLINICS TO MITOCHONDRIA Rèmy Cardoso Professora Doutora Catarina Resende Oliveira, Professora Catedrática Jubilada da FM/UC Professor Doutor Duarte Barral, Professor Associado da FCM|NMS/UNL Tese para obtenção do grau de Doutor em EnvelHecimento e Doenças Crónicas Doutoramento em associação entre: Universidade NOVA de Lisboa (Faculdade de Ciências Médicas | NOVA Medical ScHool - FCM|NMS/UNL) Universidade de Coimbra (Faculdade de Medicina - FM/UC) Universidade do MinHo (Escola de Medicina - EMed/UM) Novembro, 2020 This thesis was conducted at the Center for Neuroscience and Cell Biology (CNC.CIBB) of University of Coimbra and Coimbra University Hospital (CHUC) and was a collaboration of the following laboratories and departments with the supervision of Catarina Resende Oliveira MD, PhD, Full Professor of FM/UC and the co-supervision of Duarte Barral PhD, Associated professor of Nova Medical School, Universidade Nova de Lisboa: • Neurogenetics laboratory (CNC.CIBB) headed by Maria Rosário Almeida PhD • Neurochemistry laboratory (CHUC)
    [Show full text]
  • Establishing the Pathogenicity of Novel Mitochondrial DNA Sequence Variations: a Cell and Molecular Biology Approach
    Mafalda Rita Avó Bacalhau Establishing the Pathogenicity of Novel Mitochondrial DNA Sequence Variations: a Cell and Molecular Biology Approach Tese de doutoramento do Programa de Doutoramento em Ciências da Saúde, ramo de Ciências Biomédicas, orientada pela Professora Doutora Maria Manuela Monteiro Grazina e co-orientada pelo Professor Doutor Henrique Manuel Paixão dos Santos Girão e pela Professora Doutora Lee-Jun C. Wong e apresentada à Faculdade de Medicina da Universidade de Coimbra Julho 2017 Faculty of Medicine Establishing the pathogenicity of novel mitochondrial DNA sequence variations: a cell and molecular biology approach Mafalda Rita Avó Bacalhau Tese de doutoramento do programa em Ciências da Saúde, ramo de Ciências Biomédicas, realizada sob a orientação científica da Professora Doutora Maria Manuela Monteiro Grazina; e co-orientação do Professor Doutor Henrique Manuel Paixão dos Santos Girão e da Professora Doutora Lee-Jun C. Wong, apresentada à Faculdade de Medicina da Universidade de Coimbra. Julho, 2017 Copyright© Mafalda Bacalhau e Manuela Grazina, 2017 Esta cópia da tese é fornecida na condição de que quem a consulta reconhece que os direitos de autor são pertença do autor da tese e do orientador científico e que nenhuma citação ou informação obtida a partir dela pode ser publicada sem a referência apropriada e autorização. This copy of the thesis has been supplied on the condition that anyone who consults it recognizes that its copyright belongs to its author and scientific supervisor and that no quotation from the
    [Show full text]
  • Detailed Analysis of Focal Chromosome Arm 1Q
    Detailed Analysis of Focal Chromosome Arm 1q and 6p Amplifications in Urothelial Carcinoma Reveals Complex Genomic Events on 1q, and SOX4 as a Possible Auxiliary Target on 6p. Eriksson, Pontus; Aine, Mattias; Sjödahl, Gottfrid; Staaf, Johan; Lindgren, David; Höglund, Mattias Published in: PLoS ONE DOI: 10.1371/journal.pone.0067222 2013 Link to publication Citation for published version (APA): Eriksson, P., Aine, M., Sjödahl, G., Staaf, J., Lindgren, D., & Höglund, M. (2013). Detailed Analysis of Focal Chromosome Arm 1q and 6p Amplifications in Urothelial Carcinoma Reveals Complex Genomic Events on 1q, and SOX4 as a Possible Auxiliary Target on 6p. PLoS ONE, 8(6), [e67222]. https://doi.org/10.1371/journal.pone.0067222 Total number of authors: 6 General rights Unless other specific re-use rights are stated the following general rights apply: Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. • Users may download and print one copy of any publication from the public portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain • You may freely distribute the URL identifying the publication in the public portal Read more about Creative commons licenses: https://creativecommons.org/licenses/ Take down policy If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim.
    [Show full text]
  • 4-6 Weeks Old Female C57BL/6 Mice Obtained from Jackson Labs Were Used for Cell Isolation
    Methods Mice: 4-6 weeks old female C57BL/6 mice obtained from Jackson labs were used for cell isolation. Female Foxp3-IRES-GFP reporter mice (1), backcrossed to B6/C57 background for 10 generations, were used for the isolation of naïve CD4 and naïve CD8 cells for the RNAseq experiments. The mice were housed in pathogen-free animal facility in the La Jolla Institute for Allergy and Immunology and were used according to protocols approved by the Institutional Animal Care and use Committee. Preparation of cells: Subsets of thymocytes were isolated by cell sorting as previously described (2), after cell surface staining using CD4 (GK1.5), CD8 (53-6.7), CD3ε (145- 2C11), CD24 (M1/69) (all from Biolegend). DP cells: CD4+CD8 int/hi; CD4 SP cells: CD4CD3 hi, CD24 int/lo; CD8 SP cells: CD8 int/hi CD4 CD3 hi, CD24 int/lo (Fig S2). Peripheral subsets were isolated after pooling spleen and lymph nodes. T cells were enriched by negative isolation using Dynabeads (Dynabeads untouched mouse T cells, 11413D, Invitrogen). After surface staining for CD4 (GK1.5), CD8 (53-6.7), CD62L (MEL-14), CD25 (PC61) and CD44 (IM7), naïve CD4+CD62L hiCD25-CD44lo and naïve CD8+CD62L hiCD25-CD44lo were obtained by sorting (BD FACS Aria). Additionally, for the RNAseq experiments, CD4 and CD8 naïve cells were isolated by sorting T cells from the Foxp3- IRES-GFP mice: CD4+CD62LhiCD25–CD44lo GFP(FOXP3)– and CD8+CD62LhiCD25– CD44lo GFP(FOXP3)– (antibodies were from Biolegend). In some cases, naïve CD4 cells were cultured in vitro under Th1 or Th2 polarizing conditions (3, 4).
    [Show full text]
  • Functional Characterization of the New 8Q21 Asthma Risk Locus
    Functional characterization of the new 8q21 Asthma risk locus Cristina M T Vicente B.Sc, M.Sc A thesis submitted for the degree of Doctor of Philosophy at The University of Queensland in 2017 Faculty of Medicine Abstract Genome wide association studies (GWAS) provide a powerful tool to identify genetic variants associated with asthma risk. However, the target genes for many allergy risk variants discovered to date are unknown. In a recent GWAS, Ferreira et al. identified a new association between asthma risk and common variants located on chromosome 8q21. The overarching aim of this thesis was to elucidate the biological mechanisms underlying this association. Specifically, the goals of this study were to identify the gene(s) underlying the observed association and to study their contribution to asthma pathophysiology. Using genetic data from the 1000 Genomes Project, we first identified 118 variants in linkage disequilibrium (LD; r2>0.6) with the sentinel allergy risk SNP (rs7009110) on chromosome 8q21. Of these, 35 were found to overlap one of four Putative Regulatory Elements (PREs) identified in this region in a lymphoblastoid cell line (LCL), based on epigenetic marks measured by the ENCODE project. Results from analysis of gene expression data generated for LCLs (n=373) by the Geuvadis consortium indicated that rs7009110 is associated with the expression of only one nearby gene: PAG1 - located 732 kb away. PAG1 encodes a transmembrane adaptor protein localized to lipid rafts, which is highly expressed in immune cells. Results from chromosome conformation capture (3C) experiments showed that PREs in the region of association physically interacted with the promoter of PAG1.
    [Show full text]
  • High Throughput Computational Mouse Genetic Analysis
    bioRxiv preprint doi: https://doi.org/10.1101/2020.09.01.278465; this version posted January 22, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. High Throughput Computational Mouse Genetic Analysis Ahmed Arslan1+, Yuan Guan1+, Zhuoqing Fang1, Xinyu Chen1, Robin Donaldson2&, Wan Zhu, Madeline Ford1, Manhong Wu, Ming Zheng1, David L. Dill2* and Gary Peltz1* 1Department of Anesthesia, Stanford University School of Medicine, Stanford, CA; and 2Department of Computer Science, Stanford University, Stanford, CA +These authors contributed equally to this paper & Current Address: Ecree Durham, NC 27701 *Address correspondence to: [email protected] bioRxiv preprint doi: https://doi.org/10.1101/2020.09.01.278465; this version posted January 22, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Abstract Background: Genetic factors affecting multiple biomedical traits in mice have been identified when GWAS data that measured responses in panels of inbred mouse strains was analyzed using haplotype-based computational genetic mapping (HBCGM). Although this method was previously used to analyze one dataset at a time; but now, a vast amount of mouse phenotypic data is now publicly available, which could lead to many more genetic discoveries. Results: HBCGM and a whole genome SNP map covering 53 inbred strains was used to analyze 8462 publicly available datasets of biomedical responses (1.52M individual datapoints) measured in panels of inbred mouse strains. As proof of concept, causative genetic factors affecting susceptibility for eye, metabolic and infectious diseases were identified when structured automated methods were used to analyze the output.
    [Show full text]
  • Supplementary Table S4. FGA Co-Expressed Gene List in LUAD
    Supplementary Table S4. FGA co-expressed gene list in LUAD tumors Symbol R Locus Description FGG 0.919 4q28 fibrinogen gamma chain FGL1 0.635 8p22 fibrinogen-like 1 SLC7A2 0.536 8p22 solute carrier family 7 (cationic amino acid transporter, y+ system), member 2 DUSP4 0.521 8p12-p11 dual specificity phosphatase 4 HAL 0.51 12q22-q24.1histidine ammonia-lyase PDE4D 0.499 5q12 phosphodiesterase 4D, cAMP-specific FURIN 0.497 15q26.1 furin (paired basic amino acid cleaving enzyme) CPS1 0.49 2q35 carbamoyl-phosphate synthase 1, mitochondrial TESC 0.478 12q24.22 tescalcin INHA 0.465 2q35 inhibin, alpha S100P 0.461 4p16 S100 calcium binding protein P VPS37A 0.447 8p22 vacuolar protein sorting 37 homolog A (S. cerevisiae) SLC16A14 0.447 2q36.3 solute carrier family 16, member 14 PPARGC1A 0.443 4p15.1 peroxisome proliferator-activated receptor gamma, coactivator 1 alpha SIK1 0.435 21q22.3 salt-inducible kinase 1 IRS2 0.434 13q34 insulin receptor substrate 2 RND1 0.433 12q12 Rho family GTPase 1 HGD 0.433 3q13.33 homogentisate 1,2-dioxygenase PTP4A1 0.432 6q12 protein tyrosine phosphatase type IVA, member 1 C8orf4 0.428 8p11.2 chromosome 8 open reading frame 4 DDC 0.427 7p12.2 dopa decarboxylase (aromatic L-amino acid decarboxylase) TACC2 0.427 10q26 transforming, acidic coiled-coil containing protein 2 MUC13 0.422 3q21.2 mucin 13, cell surface associated C5 0.412 9q33-q34 complement component 5 NR4A2 0.412 2q22-q23 nuclear receptor subfamily 4, group A, member 2 EYS 0.411 6q12 eyes shut homolog (Drosophila) GPX2 0.406 14q24.1 glutathione peroxidase
    [Show full text]
  • Integrative Prediction of Gene Expression with Chromatin Accessibility and Conformation Data
    bioRxiv preprint doi: https://doi.org/10.1101/704478; this version posted July 16, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Integrative prediction of gene expression with chromatin accessibility and conformation data Florian Schmidt1;2;3;y, Fabian Kern1;3;4;y, and Marcel H. Schulz1;2;3;5;6;∗ 1High-throughput Genomics & Systems Biology, Cluster of Excellence on Multimodal Computing and Interaction, Saarland Informatics Campus, 66123 Saarbr¨ucken Germany 2Computational Biology & Applied Algorithmics, Max-Planck Institute for Informatics, Saarland Informatics Campus, 66123 Saarbr¨ucken Germany 3Center for Bioinformatics, Saarland Informatics Campus, 66123 Saarbr¨ucken Germany 4Chair for Clinical Bioinformatics, Saarland Informatics Campus, 66123 Saarbr¨ucken Germany 5Institute of Cardiovascular Regeneration, Goethe-University, Theodor-Stern-Kai 7, 60590 Frankfurt am Main Germany 6German Center for Cardiovascular Research, Partner site Rhein-Main, Theodor-Stern-Kai 7, 60590 Frankfurt am Main Germany y These authors contributed equally ∗ Correspondence: [email protected] Abstract Background: Enhancers play a fundamental role in orchestrating cell state and development. Al- though several methods have been developed to identify enhancers, linking them to their target genes is still an open problem. Several theories have been proposed on the functional mechanisms of en- hancers, which triggered the development of various methods to infer promoter enhancer interactions (PEIs). The advancement of high-throughput techniques describing the three-dimensional organisa- tion of the chromatin, paved the way to pinpoint long-range PEIs.
    [Show full text]
  • Mouse Nr1i3 Knockout Project (CRISPR/Cas9)
    https://www.alphaknockout.com Mouse Nr1i3 Knockout Project (CRISPR/Cas9) Objective: To create a Nr1i3 knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Nr1i3 gene (NCBI Reference Sequence: NM_009803.5 ; Ensembl: ENSMUSG00000005677 ) is located on Mouse chromosome 1. 9 exons are identified, with the ATG start codon in exon 2 and the TGA stop codon in exon 9 (Transcript: ENSMUST00000005820). Exon 2 will be selected as target site. Cas9 and gRNA will be co-injected into fertilized eggs for KO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Mice homozygous for a knock-out allele exhibit decreased sensitivity to TCPOBOP. Exon 2 starts from the coding region. Exon 2 covers 12.76% of the coding region. The size of effective KO region: ~137 bp. The KO region does not have any other known gene. Page 1 of 9 https://www.alphaknockout.com Overview of the Targeting Strategy Wildtype allele gRNA region 5' gRNA region 3' 1 2 9 Legends Exon of mouse Nr1i3 Knockout region Page 2 of 9 https://www.alphaknockout.com Overview of the Dot Plot (up) Window size: 15 bp Forward Reverse Complement Sequence 12 Note: The 137 bp section of Exon 2 is aligned with itself to determine if there are tandem repeats. No significant tandem repeat is found in the dot plot matrix. So this region is suitable for PCR screening or sequencing analysis. Overview of the Dot Plot (down) Window size: 15 bp Forward Reverse Complement Sequence 12 Note: The 137 bp section of Exon 2 is aligned with itself to determine if there are tandem repeats.
    [Show full text]
  • Alu Elements, Evolution of the Human Brain, and the Spectrum of Neurological Disease
    bioRxiv preprint doi: https://doi.org/10.1101/230367; this version posted December 7, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 1 6 December 2017 2 3 Alu elements, evolution of the human brain, and the spectrum of neurological disease 4 Peter A. Larsen*1,2, Kelsie E. Hunnicutt1, Roxanne J. Larsen3, Anne D. Yoder1,2, and Ann M. 5 Saunders4 6 7 1Department of Biology, Duke University, Durham, NC 27712 8 2Duke Lemur Center, Duke University, Durham, NC 27712 9 3Duke University School of Medicine, Duke University, Durham, NC 27710 10 4Zinfandel Pharmaceuticals, Inc., Research Triangle Park, NC 27709 11 12 Running title: Alu elements and neurological disease 13 14 *Corresponding Author: 15 Peter A. Larsen 16 130 Science Drive, Box 90338 17 Department of Biology 18 Duke University 19 Durham, NC 27708 20 [email protected] 21 22 23 24 25 1 bioRxiv preprint doi: https://doi.org/10.1101/230367; this version posted December 7, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 26 Abstract 27 Alu elements are a highly successful family of primate-specific retrotransposons that have 28 fundamentally shaped primate evolution, including the evolution of our own species. Alus 29 play critical roles in the formation of neurological networks and the epigenetic regulation of 30 biochemical processes throughout the central nervous system (CNS), and thus are 31 hypothesized to have contributed to the origin of human cognition.
    [Show full text]
  • WO 2013/064702 A2 10 May 2013 (10.05.2013) P O P C T
    (12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) (19) World Intellectual Property Organization I International Bureau (10) International Publication Number (43) International Publication Date WO 2013/064702 A2 10 May 2013 (10.05.2013) P O P C T (51) International Patent Classification: AO, AT, AU, AZ, BA, BB, BG, BH, BN, BR, BW, BY, C12Q 1/68 (2006.01) BZ, CA, CH, CL, CN, CO, CR, CU, CZ, DE, DK, DM, DO, DZ, EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, (21) International Application Number: HN, HR, HU, ID, IL, IN, IS, JP, KE, KG, KM, KN, KP, PCT/EP2012/071868 KR, KZ, LA, LC, LK, LR, LS, LT, LU, LY, MA, MD, (22) International Filing Date: ME, MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, 5 November 20 12 (05 .11.20 12) NO, NZ, OM, PA, PE, PG, PH, PL, PT, QA, RO, RS, RU, RW, SC, SD, SE, SG, SK, SL, SM, ST, SV, SY, TH, TJ, (25) Filing Language: English TM, TN, TR, TT, TZ, UA, UG, US, UZ, VC, VN, ZA, (26) Publication Language: English ZM, ZW. (30) Priority Data: (84) Designated States (unless otherwise indicated, for every 1118985.9 3 November 201 1 (03. 11.201 1) GB kind of regional protection available): ARIPO (BW, GH, 13/339,63 1 29 December 201 1 (29. 12.201 1) US GM, KE, LR, LS, MW, MZ, NA, RW, SD, SL, SZ, TZ, UG, ZM, ZW), Eurasian (AM, AZ, BY, KG, KZ, RU, TJ, (71) Applicant: DIAGENIC ASA [NO/NO]; Grenseveien 92, TM), European (AL, AT, BE, BG, CH, CY, CZ, DE, DK, N-0663 Oslo (NO).
    [Show full text]
  • Figure S1. 17-Mer Distribution in the Yangtze Finless Porpoise Genome
    Figure S1. 17-mer distribution in the Yangtze finless porpoise genome. The x-axis is 17-mer depth (X); the y-axis is the number of sequencing reads at that depth. Figure S2. Sequence depth distribution of the assembly data. The x-axis shows the sequencing depth (X) and the y-axis shows the number of bases at a given depth. The results demonstrate that 99% of bases sequencing depth is more than 20. Figure S3. Comparison of gene structure characteristics of Yangtze finless porpoise and other cetaceans. The x-axis represents the length of corresponding genetic element of exon number and the y-axis represents gene density. Figure S4. Phylogeny relationships between the Yangtze finless porpoise and other mammals reconstructed by RAxML with the GTR+G+I model. Table S1. Summary of sequenced reads Raw Reads Qualified Reads1 Total Read Sequence Physical Total Read Sequence Physical Library SRA Data Length Coverage2 Coverage2 Data Length Coverage2 Coverage2 Insert Size (bp) Number (Gb) (bp) (×) (×) (Gb) (bp) (×) (×) 289 58.94 150.00 23.67 22.80 57.84 149.75 23.23 22.41 SRR6923836 462 71.33 150.00 28.65 44.12 70.12 149.74 28.16 43.44 SRR6923837 624 67.47 150.00 27.10 56.36 63.90 149.67 25.66 53.50 SRR6923834 791 57.58 150.00 23.12 60.97 55.39 149.67 22.24 58.78 SRR6923835 4,000 108.73 150.00 43.67 582.22 70.74 150.00 28.41 378.80 SRR6923832 7,000 115.4 150.00 46.35 1,081.39 84.76 150.00 34.04 794.27 SRR6923833 11,000 107.37 150.00 43.12 1,581.08 79.78 150.00 32.04 1,174.81 SRR6923830 18,000 127.46 150.00 51.19 3,071.33 97.75 150.00 39.26 2,355.42 SRR6923831 Total 714.28 - 286.87 6,500.27 580.28 - 233.04 4,881.43 - 1Raw reads in mate-paired libraries were filtered to remove duplicates and reads with low quality and/or adapter contamination, raw reads in paired-end libraries were filtered in the same manner then subjected to k-mer-based correction.
    [Show full text]