Mouse Arf5 Conditional Knockout Project (CRISPR/Cas9)

Total Page:16

File Type:pdf, Size:1020Kb

Mouse Arf5 Conditional Knockout Project (CRISPR/Cas9) https://www.alphaknockout.com Mouse Arf5 Conditional Knockout Project (CRISPR/Cas9) Objective: To create a Arf5 conditional knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Arf5 gene (NCBI Reference Sequence: NM_007480 ; Ensembl: ENSMUSG00000020440 ) is located on Mouse chromosome 6. 6 exons are identified, with the ATG start codon in exon 1 and the TAG stop codon in exon 6 (Transcript: ENSMUST00000020717). Exon 2~4 will be selected as conditional knockout region (cKO region). Deletion of this region should result in the loss of function of the Mouse Arf5 gene. To engineer the targeting vector, homologous arms and cKO region will be generated by PCR using BAC clone RP24-176I9 as template. Cas9, gRNA and targeting vector will be co-injected into fertilized eggs for cKO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Exon 2 starts from about 12.59% of the coding region. The knockout of Exon 2~4 will result in frameshift of the gene. The size of intron 1 for 5'-loxP site insertion: 534 bp, and the size of intron 4 for 3'-loxP site insertion: 494 bp. The size of effective cKO region: ~1452 bp. The cKO region does not have any other known gene. Page 1 of 7 https://www.alphaknockout.com Overview of the Targeting Strategy Wildtype allele 5' gRNA region gRNA region 3' 1 2 3 4 5 6 Targeting vector Targeted allele Constitutive KO allele (After Cre recombination) Legends Homology arm Exon of mouse Arf5 cKO region loxP site Page 2 of 7 https://www.alphaknockout.com Overview of the Dot Plot Window size: 10 bp Forward Reverse Complement Sequence 12 Note: The sequence of homologous arms and cKO region is aligned with itself to determine if there are tandem repeats. No significant tandem repeat is found in the dot plot matrix. So this region is suitable for PCR screening or sequencing analysis. Overview of the GC Content Distribution Window size: 300 bp Sequence 12 Summary: Full Length(7949bp) | A(22.52% 1790) | C(26.71% 2123) | T(23.88% 1898) | G(26.9% 2138) Note: The sequence of homologous arms and cKO region is analyzed to determine the GC content. Significant high GC-content regions are found. It may be difficult to construct this targeting vector. Page 3 of 7 https://www.alphaknockout.com BLAT Search Results (up) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN -------------------------------------------------------------------------------------------------------------- browser details YourSeq 3000 1 3000 3000 100.0% chr6 + 28421041 28424040 3000 browser details YourSeq 161 1742 2216 3000 90.5% chr17 + 70677308 70678286 979 browser details YourSeq 147 1852 2203 3000 85.5% chr13 - 3587368 3587692 325 browser details YourSeq 145 1850 2029 3000 90.1% chr6 + 124950067 124950233 167 browser details YourSeq 145 1856 2212 3000 94.0% chr1 + 182056105 182056612 508 browser details YourSeq 143 1851 2028 3000 95.0% chr10 + 76466823 76467070 248 browser details YourSeq 141 1855 2029 3000 95.0% chr16 - 3290173 3290352 180 browser details YourSeq 141 1792 1998 3000 93.9% chr10 - 60004328 60004650 323 browser details YourSeq 141 1851 2004 3000 96.2% chr15 + 99154100 99154255 156 browser details YourSeq 140 1850 2004 3000 95.5% chr3 - 157980298 157980454 157 browser details YourSeq 140 1834 2002 3000 90.7% chr9 + 83593374 83593537 164 browser details YourSeq 140 1856 2029 3000 88.6% chr9 + 64768950 64769106 157 browser details YourSeq 140 1837 2001 3000 89.9% chr7 + 55786184 55786340 157 browser details YourSeq 140 1851 2004 3000 95.5% chr11 + 82777460 82777613 154 browser details YourSeq 139 1850 2004 3000 95.5% chr6 - 113535255 113535412 158 browser details YourSeq 139 1851 2004 3000 95.5% chr17 - 17422766 17422921 156 browser details YourSeq 139 1851 2004 3000 95.5% chr15 - 93727472 93727627 156 browser details YourSeq 139 1836 2004 3000 92.0% chr15 - 38534585 38534752 168 browser details YourSeq 139 1832 2004 3000 88.4% chr5 + 142684874 142685038 165 browser details YourSeq 139 1856 2004 3000 96.7% chr5 + 108254506 108254654 149 Note: The 3000 bp section upstream of Exon 2 is BLAT searched against the genome. No significant similarity is found. BLAT Search Results (down) QUERY SCORE START END QSIZE IDENTITY CHROM STRAND START END SPAN ----------------------------------------------------------------------------------------------- browser details YourSeq 3000 1 3000 3000 100.0% chr6 + 28425493 28428492 3000 browser details YourSeq 52 2706 2766 3000 86.0% chr9 + 67121911 67121967 57 browser details YourSeq 52 2708 2761 3000 98.2% chr7 + 72755100 72755153 54 browser details YourSeq 50 2708 2761 3000 94.3% chr13 - 45969697 45969749 53 browser details YourSeq 50 2708 2761 3000 94.3% chr12 - 13685566 13685618 53 browser details YourSeq 50 2708 2761 3000 94.3% chr12 - 8443011 8443063 53 browser details YourSeq 50 2708 2761 3000 94.3% chr1 - 27264706 27264758 53 browser details YourSeq 50 2708 2762 3000 98.2% chr15 + 91484483 91484538 56 browser details YourSeq 50 2708 2761 3000 94.3% chr13 + 115771860 115771912 53 browser details YourSeq 50 2708 2761 3000 94.3% chr1 + 184741220 184741272 53 browser details YourSeq 50 2708 2761 3000 94.3% chr1 + 140403553 140403605 53 browser details YourSeq 49 2708 2761 3000 98.1% chr14 - 54102937 54103013 77 browser details YourSeq 49 2708 2761 3000 98.1% chr11 - 109059341 109059394 54 browser details YourSeq 49 2708 2762 3000 92.5% chr10 + 19736409 19736462 54 browser details YourSeq 49 2708 2761 3000 98.1% chr1 + 136451881 136451973 93 browser details YourSeq 48 2708 2761 3000 92.4% chr8 - 65426397 65426449 53 browser details YourSeq 48 2708 2761 3000 92.4% chr14 - 9069078 9069130 53 browser details YourSeq 48 2706 2761 3000 94.0% chr12 - 99549959 99550013 55 browser details YourSeq 48 2712 2761 3000 98.0% chr10 + 115428037 115428086 50 browser details YourSeq 47 2715 2761 3000 100.0% chr10 - 96615685 96615731 47 Note: The 3000 bp section downstream of Exon 4 is BLAT searched against the genome. No significant similarity is found. Page 4 of 7 https://www.alphaknockout.com Gene and protein information: Arf5 ADP-ribosylation factor 5 [ Mus musculus (house mouse) ] Gene ID: 11844, updated on 10-Oct-2019 Gene summary Official Symbol Arf5 provided by MGI Official Full Name ADP-ribosylation factor 5 provided by MGI Primary source MGI:MGI:99434 See related Ensembl:ENSMUSG00000020440 Gene type protein coding RefSeq status VALIDATED Organism Mus musculus Lineage Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus Expression Ubiquitous expression in duodenum adult (RPKM 584.2), small intestine adult (RPKM 299.8) and 28 other tissues See Orthologs more human all Genomic context Location: 6; 6 A3.3 See Arf5 in Genome Data Viewer Exon count: 6 Annotation release Status Assembly Chr Location 108 current GRCm38.p6 (GCF_000001635.26) 6 NC_000072.6 (28423604..28426602) Build 37.2 previous assembly MGSCv37 (GCF_000001635.18) 6 NC_000072.5 (28373640..28376499) Chromosome 6 - NC_000072.6 Page 5 of 7 https://www.alphaknockout.com Transcript information: This gene has 3 transcripts Gene: Arf5 ENSMUSG00000020440 Description ADP-ribosylation factor 5 [Source:MGI Symbol;Acc:MGI:99434] Location Chromosome 6: 28,423,560-28,426,602 forward strand. GRCm38:CM000999.2 About this gene This gene has 3 transcripts (splice variants), 209 orthologues, 29 paralogues and is a member of 1 Ensembl protein family. Transcripts Name Transcript ID bp Protein Translation ID Biotype CCDS UniProt Flags Arf5-201 ENSMUST00000020717.11 1116 180aa ENSMUSP00000020717.5 Protein coding CCDS19952 P84084 TSL:1 GENCODE basic APPRIS P1 Arf5-202 ENSMUST00000169841.1 768 180aa ENSMUSP00000127281.1 Protein coding CCDS19952 P84084 TSL:5 GENCODE basic APPRIS P1 Arf5-203 ENSMUST00000202028.1 919 No protein - Retained intron - - TSL:NA 23.04 kb Forward strand 28.415Mb 28.420Mb 28.425Mb 28.430Mb 28.435Mb Genes (Comprehensive set... Arf5-201 >protein coding Fscn3-201 >protein coding Arf5-202 >protein coding Fscn3-202 >lncRNA Arf5-203 >retained intron Contigs AC068608.5 > Genes < Gcc1-202protein coding (Comprehensive set... < Gcc1-201protein coding < Gcc1-203protein coding < Gcc1-204protein coding Regulatory Build 28.415Mb 28.420Mb 28.425Mb 28.430Mb 28.435Mb Reverse strand 23.04 kb Regulation Legend CTCF Promoter Promoter Flank Gene Legend Protein Coding merged Ensembl/Havana Ensembl protein coding Non-Protein Coding processed transcript RNA gene Page 6 of 7 https://www.alphaknockout.com Transcript: ENSMUST00000020717 3.04 kb Forward strand Arf5-201 >protein coding ENSMUSP00000020... Coiled-coils (Ncoils) TIGRFAM Small GTP-binding protein domain Superfamily P-loop containing nucleoside triphosphate hydrolase SMART SM00175 SM00178 SM00177 Prints Small GTPase superfamily, ARF/SAR type Pfam Small GTPase superfamily, ARF/SAR type PROSITE profiles PS51417 PANTHER PTHR11711:SF314 PTHR11711 Gene3D 3.40.50.300 CDD cd04150 Scale bar 0 20 40 60 80 100 120 140 160 180 We wish to acknowledge the following valuable scientific information resources: Ensembl, MGI, NCBI, UCSC. Page 7 of 7.
Recommended publications
  • Mouse Fscn3 Conditional Knockout Project (CRISPR/Cas9)
    https://www.alphaknockout.com Mouse Fscn3 Conditional Knockout Project (CRISPR/Cas9) Objective: To create a Fscn3 conditional knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Fscn3 gene (NCBI Reference Sequence: NM_019569 ; Ensembl: ENSMUSG00000029707 ) is located on Mouse chromosome 6. 7 exons are identified, with the ATG start codon in exon 1 and the TAG stop codon in exon 6 (Transcript: ENSMUST00000031719). Exon 2 will be selected as conditional knockout region (cKO region). Deletion of this region should result in the loss of function of the Mouse Fscn3 gene. To engineer the targeting vector, homologous arms and cKO region will be generated by PCR using BAC clone RP24-176I9 as template. Cas9, gRNA and targeting vector will be co-injected into fertilized eggs for cKO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Exon 2 starts from about 9.71% of the coding region. The knockout of Exon 2 will result in frameshift of the gene. The size of intron 1 for 5'-loxP site insertion: 1817 bp, and the size of intron 2 for 3'-loxP site insertion: 839 bp. The size of effective cKO region: ~1197 bp. The cKO region does not have any other known gene. Page 1 of 7 https://www.alphaknockout.com Overview of the Targeting Strategy Wildtype allele 5' gRNA region gRNA region 3' 1 2 3 7 Targeting vector Targeted allele Constitutive KO allele (After Cre recombination) Legends Homology arm Exon of mouse Fscn3 cKO region loxP site Page 2 of 7 https://www.alphaknockout.com Overview of the Dot Plot Window size: 10 bp Forward Reverse Complement Sequence 12 Note: The sequence of homologous arms and cKO region is aligned with itself to determine if there are tandem repeats.
    [Show full text]
  • Protein Interaction Network of Alternatively Spliced Isoforms from Brain Links Genetic Risk Factors for Autism
    ARTICLE Received 24 Aug 2013 | Accepted 14 Mar 2014 | Published 11 Apr 2014 DOI: 10.1038/ncomms4650 OPEN Protein interaction network of alternatively spliced isoforms from brain links genetic risk factors for autism Roser Corominas1,*, Xinping Yang2,3,*, Guan Ning Lin1,*, Shuli Kang1,*, Yun Shen2,3, Lila Ghamsari2,3,w, Martin Broly2,3, Maria Rodriguez2,3, Stanley Tam2,3, Shelly A. Trigg2,3,w, Changyu Fan2,3, Song Yi2,3, Murat Tasan4, Irma Lemmens5, Xingyan Kuang6, Nan Zhao6, Dheeraj Malhotra7, Jacob J. Michaelson7,w, Vladimir Vacic8, Michael A. Calderwood2,3, Frederick P. Roth2,3,4, Jan Tavernier5, Steve Horvath9, Kourosh Salehi-Ashtiani2,3,w, Dmitry Korkin6, Jonathan Sebat7, David E. Hill2,3, Tong Hao2,3, Marc Vidal2,3 & Lilia M. Iakoucheva1 Increased risk for autism spectrum disorders (ASD) is attributed to hundreds of genetic loci. The convergence of ASD variants have been investigated using various approaches, including protein interactions extracted from the published literature. However, these datasets are frequently incomplete, carry biases and are limited to interactions of a single splicing isoform, which may not be expressed in the disease-relevant tissue. Here we introduce a new interactome mapping approach by experimentally identifying interactions between brain-expressed alternatively spliced variants of ASD risk factors. The Autism Spliceform Interaction Network reveals that almost half of the detected interactions and about 30% of the newly identified interacting partners represent contribution from splicing variants, emphasizing the importance of isoform networks. Isoform interactions greatly contribute to establishing direct physical connections between proteins from the de novo autism CNVs. Our findings demonstrate the critical role of spliceform networks for translating genetic knowledge into a better understanding of human diseases.
    [Show full text]
  • Associated Marks Are Enriched in Imprinted Gene Regions and Predict Allele-Specific Modification
    Downloaded from genome.cshlp.org on October 4, 2021 - Published by Cold Spring Harbor Laboratory Press Methods Overlapping euchromatin/heterochromatin- associated marks are enriched in imprinted gene regions and predict allele-specific modification Bo Wen,1,4 Hao Wu,2,4 Hans Bjornsson,1 Roland D. Green,3 Rafael Irizarry,2 and Andrew P. Feinberg1,5 1Department of Medicine and Center for Epigenetics, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA; 2Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland 21205, USA; 3NimbleGen Systems, Inc., Madison, Wisconsin 53711, USA Most genome-level analysis treats the two parental alleles equivalently, yet diploid genomes contain two parental genomes that are often epigenetically distinct. While single nucleotide polymorphisms (SNPs) can be used to distinguish these genomes, it would be useful to develop a generalized strategy for identifying candidate genes or regions showing allele-specific differences, independent of SNPs. We have explored this problem by looking for overlapping marks in the genome related to both euchromatin (histone H3 dimethyl lysine-4 [H3K4Me2]) and heterochromatin (DNA methylation [DNAm]). “Double hits” were defined by the intersection of H3K4Me2 and DNAm. For the top 5% of marks, defined by a sliding window, imprinted gene regions were enriched for double hits 5.4-fold. When the location information of CTCF binding sites were integrated, the “triple hits” were enriched 76-fold for known imprinted genes in the regions studied. The double hits in imprinted genes were found to occur usually at the site of alternative or antisense transcripts. In addition, four of four imprinted genes tested showing double hits also showed allele-specific methylation.
    [Show full text]
  • 4-6 Weeks Old Female C57BL/6 Mice Obtained from Jackson Labs Were Used for Cell Isolation
    Methods Mice: 4-6 weeks old female C57BL/6 mice obtained from Jackson labs were used for cell isolation. Female Foxp3-IRES-GFP reporter mice (1), backcrossed to B6/C57 background for 10 generations, were used for the isolation of naïve CD4 and naïve CD8 cells for the RNAseq experiments. The mice were housed in pathogen-free animal facility in the La Jolla Institute for Allergy and Immunology and were used according to protocols approved by the Institutional Animal Care and use Committee. Preparation of cells: Subsets of thymocytes were isolated by cell sorting as previously described (2), after cell surface staining using CD4 (GK1.5), CD8 (53-6.7), CD3ε (145- 2C11), CD24 (M1/69) (all from Biolegend). DP cells: CD4+CD8 int/hi; CD4 SP cells: CD4CD3 hi, CD24 int/lo; CD8 SP cells: CD8 int/hi CD4 CD3 hi, CD24 int/lo (Fig S2). Peripheral subsets were isolated after pooling spleen and lymph nodes. T cells were enriched by negative isolation using Dynabeads (Dynabeads untouched mouse T cells, 11413D, Invitrogen). After surface staining for CD4 (GK1.5), CD8 (53-6.7), CD62L (MEL-14), CD25 (PC61) and CD44 (IM7), naïve CD4+CD62L hiCD25-CD44lo and naïve CD8+CD62L hiCD25-CD44lo were obtained by sorting (BD FACS Aria). Additionally, for the RNAseq experiments, CD4 and CD8 naïve cells were isolated by sorting T cells from the Foxp3- IRES-GFP mice: CD4+CD62LhiCD25–CD44lo GFP(FOXP3)– and CD8+CD62LhiCD25– CD44lo GFP(FOXP3)– (antibodies were from Biolegend). In some cases, naïve CD4 cells were cultured in vitro under Th1 or Th2 polarizing conditions (3, 4).
    [Show full text]
  • Mapping of Leptin and Its Syntenic Genes to Chicken Chromosome
    Edinburgh Research Explorer Mapping of leptin and its syntenic genes to chicken chromosome 1p Citation for published version: Seroussi, E, Pitel, F, Leroux, S, Morisson, M, Bornelöv, S, Miyara, S, Yosefi, S, Cogburn, LA, Burt, DW, Anderson, L & Friedman-Einat, M 2017, 'Mapping of leptin and its syntenic genes to chicken chromosome 1p', BMC Genetics, vol. 18, no. 1, pp. 77. https://doi.org/10.1186/s12863-017-0543-1 Digital Object Identifier (DOI): 10.1186/s12863-017-0543-1 Link: Link to publication record in Edinburgh Research Explorer Document Version: Publisher's PDF, also known as Version of record Published In: BMC Genetics General rights Copyright for the publications made accessible via the Edinburgh Research Explorer is retained by the author(s) and / or other copyright owners and it is a condition of accessing these publications that users recognise and abide by the legal requirements associated with these rights. Take down policy The University of Edinburgh has made every reasonable effort to ensure that Edinburgh Research Explorer content complies with UK legislation. If you believe that the public display of this file breaches copyright please contact [email protected] providing details, and we will remove access to the work immediately and investigate your claim. Download date: 07. Oct. 2021 Seroussi et al. BMC Genetics (2017) 18:77 DOI 10.1186/s12863-017-0543-1 RESEARCHARTICLE Open Access Mapping of leptin and its syntenic genes to chicken chromosome 1p Eyal Seroussi1*, Frédérique Pitel2, Sophie Leroux2, Mireille Morisson2, Susanne Bornelöv3, Shoval Miyara1, Sara Yosefi1, Larry A. Cogburn4, David W. Burt5, Leif Anderson3,6,7 and Miriam Friedman-Einat1* Abstract Background: Misidentification of the chicken leptin gene has hampered research of leptin signaling in this species for almost two decades.
    [Show full text]
  • Whole-Genome Rnai Screen Highlights Components of the Endoplasmic Reticulum/Golgi As a Source of Resistance to Immunotoxin-Media
    Whole-genome RNAi screen highlights components of PNAS PLUS the endoplasmic reticulum/Golgi as a source of resistance to immunotoxin-mediated cytotoxicity Matteo Pasettoa,1,2, Antonella Antignania,1, Pinar Ormanoglub, Eugen Buehlerb, Rajarshi Guhab, Ira Pastana,3, Scott E. Martinb, and David J. FitzGeralda,3 aLaboratory of Molecular Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892-4264; and bDivision of Preclinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD 20850 Contributed by Ira Pastan, February 2, 2015 (sent for review October 6, 2014; reviewed by Arthur Frankel and Wayne I. Lencer) Immunotoxins (antibody–toxin fusion proteins) target surface EF2 no longer functions at the elongation step of protein trans- antigens on cancer cells and kill these cells via toxin-mediated in- lation. Death results from a combination of events: the loss of hibition of protein synthesis. To identify genes controlling this short-lived survival proteins (e.g., Mcl1), triggering apoptosis, process, an RNAi whole-genome screen (∼22,000 genes at three and stress responses that cannot be executed because protein siRNAs per gene) was conducted via monitoring the cytotoxicity of translation is shut down (21). Regarding the constituents of the the mesothelin-directed immunotoxin SS1P. SS1P, a Pseudomonas pathway responsible for transporting the immunotoxin from the exotoxin-based immunotoxin, was chosen because it is now in surface to the cytosol, there are only a handful of experimentally clinical trials and has produced objective tumor regressions in established components. These proteins include the surface patients. High and low concentrations of SS1P were chosen to target itself, the protease furin, and KDELR2 (discussed below).
    [Show full text]
  • Supplemental Information
    Supplemental information Dissection of the genomic structure of the miR-183/96/182 gene. Previously, we showed that the miR-183/96/182 cluster is an intergenic miRNA cluster, located in a ~60-kb interval between the genes encoding nuclear respiratory factor-1 (Nrf1) and ubiquitin-conjugating enzyme E2H (Ube2h) on mouse chr6qA3.3 (1). To start to uncover the genomic structure of the miR- 183/96/182 gene, we first studied genomic features around miR-183/96/182 in the UCSC genome browser (http://genome.UCSC.edu/), and identified two CpG islands 3.4-6.5 kb 5’ of pre-miR-183, the most 5’ miRNA of the cluster (Fig. 1A; Fig. S1 and Seq. S1). A cDNA clone, AK044220, located at 3.2-4.6 kb 5’ to pre-miR-183, encompasses the second CpG island (Fig. 1A; Fig. S1). We hypothesized that this cDNA clone was derived from 5’ exon(s) of the primary transcript of the miR-183/96/182 gene, as CpG islands are often associated with promoters (2). Supporting this hypothesis, multiple expressed sequences detected by gene-trap clones, including clone D016D06 (3, 4), were co-localized with the cDNA clone AK044220 (Fig. 1A; Fig. S1). Clone D016D06, deposited by the German GeneTrap Consortium (GGTC) (http://tikus.gsf.de) (3, 4), was derived from insertion of a retroviral construct, rFlpROSAβgeo in 129S2 ES cells (Fig. 1A and C). The rFlpROSAβgeo construct carries a promoterless reporter gene, the β−geo cassette - an in-frame fusion of the β-galactosidase and neomycin resistance (Neor) gene (5), with a splicing acceptor (SA) immediately upstream, and a polyA signal downstream of the β−geo cassette (Fig.
    [Show full text]
  • Functional Annotation of Novel Lineage-Specific Genes Using Co-Expression and Promoter Analysis Charu G Kumar1, Robin E Everts1,3, Juan J Loor1, Harris a Lewin1,2*
    Kumar et al. BMC Genomics 2010, 11:161 http://www.biomedcentral.com/1471-2164/11/161 RESEARCH ARTICLE Open Access Functional annotation of novel lineage-specific genes using co-expression and promoter analysis Charu G Kumar1, Robin E Everts1,3, Juan J Loor1, Harris A Lewin1,2* Abstract Background: The diversity of placental architectures within and among mammalian orders is believed to be the result of adaptive evolution. Although, the genetic basis for these differences is unknown, some may arise from rapidly diverging and lineage-specific genes. Previously, we identified 91 novel lineage-specific transcripts (LSTs) from a cow term-placenta cDNA library, which are excellent candidates for adaptive placental functions acquired by the ruminant lineage. The aim of the present study was to infer functions of previously uncharacterized lineage- specific genes (LSGs) using co-expression, promoter, pathway and network analysis. Results: Clusters of co-expressed genes preferentially expressed in liver, placenta and thymus were found using 49 previously uncharacterized LSTs as seeds. Over-represented composite transcription factor binding sites (TFBS) in promoters of clustered LSGs and known genes were then identified computationally. Functions were inferred for nine previously uncharacterized LSGs using co-expression analysis and pathway analysis tools. Our results predict that these LSGs may function in cell signaling, glycerophospholipid/fatty acid metabolism, protein trafficking, regulatory processes in the nucleus, and processes that initiate parturition and immune system development. Conclusions: The placenta is a rich source of lineage-specific genes that function in the adaptive evolution of placental architecture and functions. We have shown that co-expression, promoter, and gene network analyses are useful methods to infer functions of LSGs with heretofore unknown functions.
    [Show full text]
  • Supplementary Data
    Supplementary Fig. 1 A B Responder_Xenograft_ Responder_Xenograft_ NON- NON- Lu7336, Vehicle vs Lu7466, Vehicle vs Responder_Xenograft_ Responder_Xenograft_ Sagopilone, Welch- Sagopilone, Welch- Lu7187, Vehicle vs Lu7406, Vehicle vs Test: 638 Test: 600 Sagopilone, Welch- Sagopilone, Welch- Test: 468 Test: 482 Responder_Xenograft_ NON- Lu7860, Vehicle vs Responder_Xenograft_ Sagopilone, Welch - Lu7558, Vehicle vs Test: 605 Sagopilone, Welch- Test: 333 Supplementary Fig. 2 Supplementary Fig. 3 Supplementary Figure S1. Venn diagrams comparing probe sets regulated by Sagopilone treatment (10mg/kg for 24h) between individual models (Welsh Test ellipse p-value<0.001 or 5-fold change). A Sagopilone responder models, B Sagopilone non-responder models. Supplementary Figure S2. Pathway analysis of genes regulated by Sagopilone treatment in responder xenograft models 24h after Sagopilone treatment by GeneGo Metacore; the most significant pathway map representing cell cycle/spindle assembly and chromosome separation is shown, genes upregulated by Sagopilone treatment are marked with red thermometers. Supplementary Figure S3. GeneGo Metacore pathway analysis of genes differentially expressed between Sagopilone Responder and Non-Responder models displaying –log(p-Values) of most significant pathway maps. Supplementary Tables Supplementary Table 1. Response and activity in 22 non-small-cell lung cancer (NSCLC) xenograft models after treatment with Sagopilone and other cytotoxic agents commonly used in the management of NSCLC Tumor Model Response type
    [Show full text]
  • A Point Mutation in HIV-1 Integrase Redirects Proviral Integration Into
    bioRxiv preprint doi: https://doi.org/10.1101/2021.01.12.426369; this version posted January 12, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. A point mutation in HIV-1 integrase redirects proviral integration into centromeric repeats Shelby Winansa,b,c and Stephen P. Goffa.b,c# aDepartment of Biochemistry and Molecular Biophysics Columbia University Medical Center, New York, NY bDepartment of Microbiology and Immunology Columbia University Medical Center, New York, NY cHoward Hughes Medical Institute, Columbia University, New York, NY #Lead Contact: address correspondence to Stephen P. Goff, [email protected] bioRxiv preprint doi: https://doi.org/10.1101/2021.01.12.426369; this version posted January 12, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 1 Abstract 2 Retroviruses utilize the viral integrase (IN) protein to integrate a DNA copy of their 3 genome into the host chromosomal DNA. HIV-1 integration sites are highly biased towards 4 actively transcribed genes, likely mediated by binding of the IN protein to specific host 5 factors, particularly LEDGF, located at these gene regions. We here report a dramatic 6 redirection of integration site distribution induced by a single point mutation in HIV-1 IN.
    [Show full text]
  • A 7Q31.33Q32.1 Microdeletion Including LRRC4 and GRM8 Is Associated with Severe Intellectual Disability and Characteristics of Autism
    OPEN Citation: Human Genome Variation (2017) 4, 17001; doi:10.1038/hgv.2017.1 Official journal of the Japan Society of Human Genetics www.nature.com/hgv DATA REPORT A 7q31.33q32.1 microdeletion including LRRC4 and GRM8 is associated with severe intellectual disability and characteristics of autism Noriko Sangu1,2, Keiko Shimojima1, Yuya Takahashi3, Tsukasa Ohashi4, Jun Tohyama5 and Toshiyuki Yamamoto1 A 4-year-old boy with severe intellectual disability (ID) and characteristics of autism was found to have a de novo 1.9-Mb microdeletion in 7q31.33q32.1, in which LRRC4, GRM8, and 11 other genes were included. GRM8 is associated with attention deficit hyperactivity disorder. LRRC4 is related to synaptic cell adhesion molecules, some of which are associated with autism. The deletion of LRRC4 may be responsible for the severe ID and characteristics of autism observed in the present patient. Human Genome Variation (2017) 4, 17001; doi:10.1038/hgv.2017.1; published online 9 February 2017 Interstitial deletions encompassing the 7q31 region are rare, and been obtained from the patient’s family, peripheral blood samples there are few reports of affected patients.1,2 We identified a de were obtained from the patient and his parents. Genomic DNA novo 1.9-Mb deletion in a patient with severe intellectual disability was extracted with a QIAquick DNA extraction kit (Qiagen, Hilden, and characteristics of autism. Here, we report the patient’s clinical Germany). Chromosomal microarray testing was performed characteristics and discuss the genotype–phenotype correlation. with an Agilent 60 K Human Genome CGH Microarray platform The boy, who was 4 years and 7 months old, was born to non- (Agilent Technologies, Santa Clara, CA, USA), as previously consanguineous healthy Japanese parents.
    [Show full text]
  • Mapping of Leptin and Its Syntenic Genes to Chicken Chromosome 1P
    Seroussi et al. BMC Genetics (2017) 18:77 DOI 10.1186/s12863-017-0543-1 RESEARCHARTICLE Open Access Mapping of leptin and its syntenic genes to chicken chromosome 1p Eyal Seroussi1*, Frédérique Pitel2, Sophie Leroux2, Mireille Morisson2, Susanne Bornelöv3, Shoval Miyara1, Sara Yosefi1, Larry A. Cogburn4, David W. Burt5, Leif Anderson3,6,7 and Miriam Friedman-Einat1* Abstract Background: Misidentification of the chicken leptin gene has hampered research of leptin signaling in this species for almost two decades. Recently, the genuine leptin gene with a GC-rich (~70%) repetitive-sequence content was identified in the chicken genome but without indicating its genomic position. This suggests that such GC-rich sequences are difficult to sequence and therefore substantial regions are missing from the current chicken genome assembly. Results: A radiation hybrid panel of chicken-hamster Wg3hCl2 cells was used to map the genome location of the chicken leptin gene. Contrary to our expectations, based on comparative genome mapping and sequence characteristics, the chicken leptin was not located on a microchromosome, which are known to contain GC-rich and repetitive regions, but at the distal tip of the largest chromosome (1p). Following conserved synteny with other vertebrates, we also mapped five additional genes to this genomic region (ARF5, SND1, LRRC4, RBM28, and FLNC), bridging the genomic gap in the current Galgal5 build for this chromosome region. All of the short scaffolds containing these genes were found to consist of GC-rich (54 to 65%) sequences comparing to the average GC-content of 40% on chromosome 1. In this syntenic group, the RNA-binding protein 28 (RBM28)wasin closest proximity to leptin.
    [Show full text]