Table 4 Selected Early and Late Genes in Adult Spo11-/- Testes and Juvenile Wild-Type Testes

Total Page:16

File Type:pdf, Size:1020Kb

Table 4 Selected Early and Late Genes in Adult Spo11-/- Testes and Juvenile Wild-Type Testes Table 4 Selected early and late genes in adult Spo11-/- testes and juvenile wild-type testes Average signal ratio Adult wild- 15dpp wild- Adult 15dpp type vs type vs Spo11 -/- vs Spo11 -/- vs reference adult wild- adult wild- 15dpp wild- GO/ Biological process ID Name RNA type type type Cluster 1 regulation of transcription, DNA-dependent BG086576 ankyrin repeat and FYVE domain containing 1 0.45 1.94 2.76 1.29 induction of programmed cell death BG074245 pleiomorphic adenoma gene-like 2 0.79 1.95 2.72 1.10 induction of programmed cell death BG065030 glutathione peroxidase 1 0.37 1.25 5.23 0.85 regulation of transcription, DNA-dependent BG072554 ankyrin repeat domain 10 0.21 1.84 2.05 0.98 regulation of transcription, DNA-dependent BG075473 nuclear receptor subfamily 3, group C, member 1 0.76 1.16 2.87 1.15 regulation of cell cycle BG077774 v-abl Abelson murine leukemia oncogene 1 0.27 1.70 2.23 1.04 germ-cell development BG063365 chemokine (C-X-C motif) receptor 4 0.48 1.77 2.41 0.98 regulation of transcription, DNA-dependent BG087880 jumonji, AT rich interactive domain 1C (Rbp2 like) 0.37 1.90 3.07 1.27 fertilization (sensu Animalia) BG073671 milk fat globule-EGF factor 8 protein 0.52 1.42 2.90 0.98 regulation of transcription, DNA-dependent BG084537 replication factor C 1 0.64 1.86 2.55 0.80 regulation of cell cycle BG088254 BRCA2 and CDKN1A interacting protein 1.02 1.30 2.02 0.92 regulation of transcription, DNA-dependent BG085396 transforming growth factor beta 1 induced transcript 4 1.47 0.88 2.25 0.88 positive regulation of transcription BG088871 GATA binding protein 4 (Gata4), mRNA 1.97 1.36 2.34 0.94 DNA damage response, signal transduction resulting in cell cycle arrest BG070939 checkpoint kinase 1 homolog (S. pombe) 0.73 1.84 2.78 1.02 apoptosis BG064583 testis enhanced gene transcript 0.48 1.36 2.21 1.03 apoptosis BG064074 B-cell receptor-associated protein 31 0.44 1.55 2.35 1.46 apoptosis BG085048 caspase recruitment domain family, member 10 0.76 1.17 2.40 1.10 translational initiation BG070960 eukaryotic translation initiation factor 4, gamma 2 0.54 1.38 2.06 0.99 DNA repair BG071999 UV radiation resistance associated gene 0.68 1.62 2.29 0.95 regulation of transcription, DNA-dependent BG073938 single-stranded DNA binding protein 3 0.64 1.56 2.12 1.09 cell death BG072209 clusterin 8.21 1.60 4.50 1.15 anti-apoptosis BG084405 baculoviral IAP repeat-containing 3 0.91 1.11 2.05 1.06 regulation of transcription, DNA-dependent BG086687 transcriptional regulator, SIN3B (yeast) 4.22 1.41 3.08 0.93 regulation of transcription, DNA-dependent BG067190 ankyrin repeat domain 6 0.86 1.93 2.79 1.16 DNA repair BG075420 casein kinase 1, delta 0.34 1.37 2.08 0.82 regulation of transcription, DNA-dependent BG076902 purine rich element binding protein A 0.59 1.80 2.21 0.93 apoptosis BG088241 lymphocyte specific 1 0.34 1.86 2.71 1.07 regulation of transcription, DNA-dependent BG066379 jumonji domain containing 2A 0.21 1.96 4.82 1.25 regulation of transcription, DNA-dependent BG067918 B-cell translocation gene 2, anti-proliferative 0.33 1.35 2.26 1.31 apoptosis BG077957 programmed cell death 6 0.56 1.28 2.01 1.11 regulation of transcription from Pol II promoter BG087452 Wilms tumor homolog 0.32 1.96 3.40 1.20 germ-cell development BG087137 kit oncogene 2.69 1.15 3.90 1.19 regulation of transcription from Pol II promoter BG084377 signal transducer and activator of transcription 3 0.51 1.81 2.88 1.05 DNA repair BG067126 single-strand selective monofunctional uracil DNA glycosylase 0.88 1.40 2.22 2.19 positive regulation of transcription from Pol II promoter BG077765 SRY-box containing gene 8 0.51 1.41 2.84 1.04 spermatogenesis BG064308 acyl-Coenzyme A oxidase 1, palmitoyl 1.65 0.25 5.00 1.01 regulation of transcription, DNA-dependent BG074917 general transcription factor II I 0.72 1.43 2.79 1.15 regulation of transcription, DNA-dependent BG064455 longevity assurance homolog 6 (S. cerevisiae) 0.84 1.58 2.51 1.32 transcription BG075851 polymerase (RNA) II (DNA directed) polypeptide B 0.51 1.16 2.45 1.05 regulation of transcription, DNA-dependent BG085373 GLI-Kruppel family member GLI2 0.51 1.54 2.15 1.19 nuclear division BG071683 centromere autoantigen H 0.59 1.44 2.34 0.99 regulation of transcription, DNA-dependent BG066892 upstream regulatory element binding protein 1e 0.27 2.00 4.33 1.82 translational elongation BG083078 LanC (bacterial lantibiotic synthetase component C)-like 2 0.85 1.43 2.05 0.87 male sex determination BG069156 insulin-like growth factor I receptor 0.41 1.92 2.04 1.03 apoptosis BG081632 Janus kinase 2 0.69 1.38 2.03 0.89 regulation of transcription, DNA-dependent BG072106 nuclear factor of activated T-cells, cytoplasmic, calcineurin-dependent 3 0.79 1.60 2.13 0.90 disassembly of cell structures BG074045 superoxide dismutase 1, soluble 1.15 0.85 2.23 0.96 induction of programmed cell death BG073154 BCL2/adenovirus E1B 19kD interacting protein like 0.86 1.50 2.86 1.08 apoptosis BG086954 amyloid beta (A4) precursor protein 0.59 1.75 2.59 1.12 apoptosis BG085493 ubiquitin-activating enzyme E1C 0.34 1.76 4.35 1.12 regulation of transcription, DNA-dependent BG069856 cyclin C 0.57 1.31 2.19 0.98 regulation of transcription, DNA-dependent BG079354 nuclear receptor co-repressor 2 0.82 1.96 2.52 1.08 anti-apoptosis BG086150 craniofacial development protein 1 0.23 1.46 2.09 0.91 regulation of transcription BG066422 prostaglandin E synthase 2 0.81 1.26 2.46 1.04 nucleotide-excision repair BG073656 serine/threonine kinase 2 0.14 1.96 2.33 1.02 apoptosis BG067419 growth arrest and DNA-damage-inducible 45 gamma 0.53 1.22 2.07 1.11 induction of programmed cell death BG076240 caspase 6 1.02 1.47 2.22 1.11 regulation of transcription, DNA-dependent AW552582 deformed epidermal autoregulatory factor 1 (Drosophila) 0.70 1.65 2.10 1.15 regulation of transcription, DNA-dependent BG077537 RIKEN cDNA 0610009M14 gene 0.65 1.22 2.19 1.08 nuclear division BG086107 pescadillo homolog 1, containing BRCT domain (zebrafish) 0.94 1.98 2.86 0.88 regulation of transcription, DNA-dependent BG084078 jumonji, AT rich interactive domain 1A (Rbp2 like) 1.06 1.85 3.61 1.17 regulation of transcription, DNA-dependent BG075386 bobby sox homolog (Drosophila) 0.89 1.42 2.02 1.01 positive regulation of transcription from Pol II promoter BG073001 cellular nucleic acid binding protein 0.72 1.69 2.08 0.87 fertilization (sensu Animalia) BG075310 transketolase 0.16 1.57 3.16 0.90 binding of sperm to zona pellucida BG084869 RIKEN cDNA A430096B05 gene 0.64 1.74 3.29 1.31 regulation of transcription, DNA-dependent BG079235 cellular repressor of E1A-stimulated genes 0.72 1.40 2.84 0.93 regulation of transcription, DNA-dependent BG064318 retinoic acid induced 1 1.19 0.65 2.77 1.01 cytolysis BG077623 hypoxanthine guanine phosphoribosyl transferase 0.90 1.77 2.01 1.21 regulation of transcription BG064108 PDZ and LIM domain 1 (elfin) 0.30 1.45 2.83 0.80 regulation of transcription, DNA-dependent BG087132 ubiquitin specific protease 49 1.39 1.22 2.04 0.94 DNA repair AU021534 flap structure specific endonuclease 1 0.66 1.76 2.59 0.92 positive regulation of transcription from Pol II promoter BG067330 nuclear receptor subfamily 5, group A, member 1 1.16 1.60 2.83 1.40 translational elongation BG074911 ribosomal protein, large, P1 0.32 1.34 2.12 1.13 induction of programmed cell death BG082842 Bcl-2-related ovarian killer protein 0.49 1.58 2.05 1.01 regulation of transcription, DNA-dependent BG064695 RIKEN cDNA A830092L04 gene 0.64 1.59 2.30 0.92 negative regulation of cell cycle BG086151 CDK2 (cyclin-dependent kinase 2)-associated protein 1 0.63 1.46 2.16 0.91 regulation of transcription, DNA-dependent BG086994 RAB7, member RAS oncogene family 0.41 1.63 2.77 1.12 single strand break repair BG069854 X-ray repair complementing defective repair in Chinese hamster cells 1 0.77 1.56 3.48 1.25 regulation of transcription, DNA-dependent BG069095 interferon consensus sequence binding protein 1 0.91 0.88 2.50 0.96 cell cycle arrest BG086554 microtubule-actin crosslinking factor 1 0.56 1.68 3.30 1.27 S phase of mitotic cell cycle BG084969 origin recognition complex, subunit 2-like (S. cerevisiae) 1.09 1.29 2.02 1.03 DNA repair BG072232 three prime repair exonuclease 1 1.17 1.38 2.09 0.84 S phase of mitotic cell cycle BG081111 topoisomerase (DNA) I 0.62 1.81 2.42 1.03 regulation of transcription, DNA-dependent BG086680 longevity assurance homolog 4 (S. cerevisiae) 0.23 1.86 3.17 0.94 regulation of transcription, DNA-dependent BG072359 fumarylacetoacetate hydrolase 0.61 1.76 2.37 1.04 cytolysis BG067456 lysozyme 1.28 1.77 4.27 0.62 transcription from Pol I promoter BG086106 fibrillarin 0.80 1.89 2.62 1.08 regulation of transcription, DNA-dependent BG076024 enhancer of zeste homolog 1 (Drosophila) 1.11 1.28 2.14 1.09 sex differentiation BG087691 anti-Mullerian hormone type 2 receptor 2.79 1.35 2.99 1.01 cell cycle BG074262 telomeric repeat binding factor 2 0.94 1.78 2.62 1.15 positive regulation of transcription from Pol II promoter BG086322 activating transcription factor 4 0.19 1.54 2.05 0.80 regulation of transcription, DNA-dependent BG088708 nuclear factor, erythroid derived 2,-like 1 0.86 1.38 2.11 1.00 regulation of transcription, DNA-dependent BG078428 placentae and embryos oncofetal gene 0.80 1.45 2.97 0.99 regulation of transcription, DNA-dependent BG068772 AE binding protein 2 0.34 1.74 3.94 1.12 regulation of transcription, DNA-dependent BG076162 zinc finger protein 612 1.44 1.57 2.24 1.20 induction of programmed cell death BG086310 caspase 9 0.42 1.40 2.50 0.98 regulation of
Recommended publications
  • Detecting Remote, Functional Conserved Domains in Entire Genomes by Combining Relaxed Sequence-Database Searches with Fold Recognition
    HMMerThread: Detecting Remote, Functional Conserved Domains in Entire Genomes by Combining Relaxed Sequence-Database Searches with Fold Recognition Charles Richard Bradshaw1¤a, Vineeth Surendranath1, Robert Henschel2,3, Matthias Stefan Mueller2, Bianca Hermine Habermann1,4*¤b 1 Bioinformatics Laboratory, Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Saxony, Germany, 2 Center for Information Services and High Performance Computing (ZIH), Technical University, Dresden, Saxony, Germany, 3 High Performance Applications, Pervasive Technology Institute, Indiana University, Bloomington, Indiana, United States of America, 4 Bioinformatics Laboratory, Scionics c/o Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Saxony, Germany Abstract Conserved domains in proteins are one of the major sources of functional information for experimental design and genome-level annotation. Though search tools for conserved domain databases such as Hidden Markov Models (HMMs) are sensitive in detecting conserved domains in proteins when they share sufficient sequence similarity, they tend to miss more divergent family members, as they lack a reliable statistical framework for the detection of low sequence similarity. We have developed a greatly improved HMMerThread algorithm that can detect remotely conserved domains in highly divergent sequences. HMMerThread combines relaxed conserved domain searches with fold recognition to eliminate false positive, sequence-based identifications. With an accuracy of 90%, our software is able to automatically predict highly divergent members of conserved domain families with an associated 3-dimensional structure. We give additional confidence to our predictions by validation across species. We have run HMMerThread searches on eight proteomes including human and present a rich resource of remotely conserved domains, which adds significantly to the functional annotation of entire proteomes.
    [Show full text]
  • Supplementary Data
    Progressive Disease Signature Upregulated probes with progressive disease U133Plus2 ID Gene Symbol Gene Name 239673_at NR3C2 nuclear receptor subfamily 3, group C, member 2 228994_at CCDC24 coiled-coil domain containing 24 1562245_a_at ZNF578 zinc finger protein 578 234224_at PTPRG protein tyrosine phosphatase, receptor type, G 219173_at NA NA 218613_at PSD3 pleckstrin and Sec7 domain containing 3 236167_at TNS3 tensin 3 1562244_at ZNF578 zinc finger protein 578 221909_at RNFT2 ring finger protein, transmembrane 2 1552732_at ABRA actin-binding Rho activating protein 59375_at MYO15B myosin XVB pseudogene 203633_at CPT1A carnitine palmitoyltransferase 1A (liver) 1563120_at NA NA 1560098_at AKR1C2 aldo-keto reductase family 1, member C2 (dihydrodiol dehydrogenase 2; bile acid binding pro 238576_at NA NA 202283_at SERPINF1 serpin peptidase inhibitor, clade F (alpha-2 antiplasmin, pigment epithelium derived factor), m 214248_s_at TRIM2 tripartite motif-containing 2 204766_s_at NUDT1 nudix (nucleoside diphosphate linked moiety X)-type motif 1 242308_at MCOLN3 mucolipin 3 1569154_a_at NA NA 228171_s_at PLEKHG4 pleckstrin homology domain containing, family G (with RhoGef domain) member 4 1552587_at CNBD1 cyclic nucleotide binding domain containing 1 220705_s_at ADAMTS7 ADAM metallopeptidase with thrombospondin type 1 motif, 7 232332_at RP13-347D8.3 KIAA1210 protein 1553618_at TRIM43 tripartite motif-containing 43 209369_at ANXA3 annexin A3 243143_at FAM24A family with sequence similarity 24, member A 234742_at SIRPG signal-regulatory protein gamma
    [Show full text]
  • Supplementary Table 2
    Supplementary Table 2. Differentially Expressed Genes following Sham treatment relative to Untreated Controls Fold Change Accession Name Symbol 3 h 12 h NM_013121 CD28 antigen Cd28 12.82 BG665360 FMS-like tyrosine kinase 1 Flt1 9.63 NM_012701 Adrenergic receptor, beta 1 Adrb1 8.24 0.46 U20796 Nuclear receptor subfamily 1, group D, member 2 Nr1d2 7.22 NM_017116 Calpain 2 Capn2 6.41 BE097282 Guanine nucleotide binding protein, alpha 12 Gna12 6.21 NM_053328 Basic helix-loop-helix domain containing, class B2 Bhlhb2 5.79 NM_053831 Guanylate cyclase 2f Gucy2f 5.71 AW251703 Tumor necrosis factor receptor superfamily, member 12a Tnfrsf12a 5.57 NM_021691 Twist homolog 2 (Drosophila) Twist2 5.42 NM_133550 Fc receptor, IgE, low affinity II, alpha polypeptide Fcer2a 4.93 NM_031120 Signal sequence receptor, gamma Ssr3 4.84 NM_053544 Secreted frizzled-related protein 4 Sfrp4 4.73 NM_053910 Pleckstrin homology, Sec7 and coiled/coil domains 1 Pscd1 4.69 BE113233 Suppressor of cytokine signaling 2 Socs2 4.68 NM_053949 Potassium voltage-gated channel, subfamily H (eag- Kcnh2 4.60 related), member 2 NM_017305 Glutamate cysteine ligase, modifier subunit Gclm 4.59 NM_017309 Protein phospatase 3, regulatory subunit B, alpha Ppp3r1 4.54 isoform,type 1 NM_012765 5-hydroxytryptamine (serotonin) receptor 2C Htr2c 4.46 NM_017218 V-erb-b2 erythroblastic leukemia viral oncogene homolog Erbb3 4.42 3 (avian) AW918369 Zinc finger protein 191 Zfp191 4.38 NM_031034 Guanine nucleotide binding protein, alpha 12 Gna12 4.38 NM_017020 Interleukin 6 receptor Il6r 4.37 AJ002942
    [Show full text]
  • LN-EPC Vs CEPC List
    Supplementary Information Table 5. List of genes upregulated on LN-EPC (LCB represents the variation of gene expression comparing LN-EPC with CEPC) Gene dystrophin (muscular dystrophy, Duchenne and Becker types) regulator of G-protein signalling 13 chemokine (C-C motif) ligand 8 vascular cell adhesion molecule 1 matrix metalloproteinase 9 (gelatinase B, 92kDa gelatinase, 92kDa type IV collagenase) chemokine (C-C motif) ligand 2 solute carrier family 2 (facilitated glucose/fructose transporter), member 5 eukaryotic translation initiation factor 1A, Y-linked regulator of G-protein signalling 1 ubiquitin D chemokine (C-X-C motif) ligand 3 transcription factor 4 chemokine (C-X-C motif) ligand 13 (B-cell chemoattractant) solute carrier family 7, (cationic amino acid transporter, y+ system) member 11 transcription factor 4 apolipoprotein D RAS guanyl releasing protein 3 (calcium and DAG-regulated) matrix metalloproteinase 1 (interstitial collagenase) DEAD (Asp-Glu-Ala-Asp) box polypeptide 3, Y-linked /// DEAD (Asp-Glu-Ala-Asp) box polypeptide 3, Y-linked transcription factor 4 regulator of G-protein signalling 1 B-cell linker interleukin 8 POU domain, class 2, associating factor 1 CD24 antigen (small cell lung carcinoma cluster 4 antigen) Consensus includes gb:AK000168.1 /DEF=Homo sapiens cDNA FLJ20161 fis, clone COL09252, highly similar to L33930 Homo sapiens CD24 signal transducer mRNA. /FEA=mRNA /DB_XREF=gi:7020079 /UG=Hs.332045 Homo sapiens cDNA FLJ20161 fis, clone COL09252, highly similar to L33930 Homo sapiens CD24 signal transducer mRNA
    [Show full text]
  • Supplementary Figures
    Supplementary Figures Supplementary Figure 1 | Sampling locality, genome size estimation, and GC content. (a) Sampling locality in Amami Island (i.e., Amami Oshima, Japan) and its relative location to Okinawa are shown with coordinates (adapted from Google Maps). (b) Sperm cells collected from gravid male gonads were stained with DAPI and subjected to fluorescence-activated cell sorting (FACS) flow cytometry analysis. Sperm with known genome size from zebrafish (Danio rerio) were used as an internal standard to estimate the Lingula genome size. (c) The analysis of stepwise assembly shows that the saturation point is achieved when input sequences reach 10 Gbp from 454 and Illumina reads. (d) K-mer analysis (17-mer) using Illumina reads shows two peaks, in which the homozygous peak coverage is twice the heterozygous peak. The estimated heterozygosity rate calculating the ratio of the peaks, is 1.6%. (e) Distribution of GC content calculated from 3,830 scaffolds. (f) Comparison of GC content in selected lophotrochozoans. Error bars, standard deviation. Supplementary Figure 2 | Schematic flow of sequencing and assembly of the Lingula genome. (a) Genomic DNA from a male gonad was extracted for genome sequencing using Roche 454, Illumina, and PacBio platforms. A total of 96-Gb of data was obtained with approximately 226- fold coverage of the 425-Mb Lingula genome. (b) Ten embryonic stages from egg to larva and seven adult tissues were collected for RNA-seq and reads were assembled de novo using Trinity. (c) Transcript information from RNA-seq was used to generate hints by spliced alignment with PASA and BLAT. Gene models were predicted with trained AUGUSTUS.
    [Show full text]
  • Supplemental Material 1
    Supplemental gure 1 Lin-Sca-1- Marker Thymus Bone Skin 41% ±1 64% ±8 92% ±4 CD29 % of max % of max % of max 50% ±10 58% ±9 87% ±4 CD51 % of max % of max % of max 3% ±1 47% ±3 72% ±7 CD140a % of max % of max % of max 39% ±6 51% ±3 71% ±3 CD140b % of max % of max % of max 4% ±2 1% ±1 7% ±1 CD34 % of max % of max % of max 21% ±16 30% ±10 42% ±10 gp38 % of max % of max % of max 50% ±4 43% ±7 55% ±5 Ly51 % of max % of max % of max 49% ±3 8% ±1 73% ±6 CD90.2 % of max % of max % of max 36% ±10 35% ±10 77% ±5 CD105 % of max % of max % of max 2% ±2 47% ±8 78% ±6 CD73 % of max % of max % of max 9% ±8 8% ±3 38% ±2 CD44 % of max % of max % of max 48% ±3 27% ±1 52% ±4 CD146 % of max % of max % of max 17% ±13 11% ±8 28% ±7 Nestin % of max % of max % of max 6% ±5 23% ±29 37% ±25 CXCL12 % of max % of max % of max SUPPLEMENTAL FIGURE 1. Flow cytometry analysis of Lin- Sca-1- cells from the thymus, bone and skin. Overlay histograms illustrate staining with the relevant antibody (in blue) and an isotype control (in red). Each overlay histogram is representative of three independent experiments (3-5 mice per biological replicate). Numbers represent the mean percentage of positive cells (+/- SD). Supplemental gure 2 A B 1000 250k M 100 tMC 98.2% K P bMC 200k R 10 sMC 1 A - 150k 1 9 1 a 4 8 1 0 5 A 2 5 0 3 3 5 9 0 C S CD CD CD gp Ly CD CD1 FSC 100k CD14 CD140b 1000 tMC 50k bMC M 100 sMC K 0 P Thymocytes R 10 -103 0 103 104 105 cTEC 1 mTEC SCA-1 t 5 e 1 m 1 i 4 3 3 n -k ca x c CD CD CD p o E F C tMC vs bMC tMC vs sMC bMC vs sMC 7 7 ) ) ) 6 695 6 6 365 5 426 sMC sMC bMC 5 5 4 4 4 3 3 3 2 2 2 (ReadCount (ReadCount 362 10 10 1 849 1 1 573 log log 0 log10(ReadCount 0 r = 0.476 0 r = 0.629 r = 0.607 0 1 2 3 4 5 6 0 1 2 3 4 5 6 0 1 2 3 4 5 6 log (ReadCount tMC) log10(ReadCount bMC) log10(ReadCount tMC) 10 SUPPLEMENTAL FIGURE 2.
    [Show full text]
  • SUPPLEMENTARY INFORMATION Gotree/Goalign
    SUPPLEMENTARY INFORMATION Gotree/Goalign : Toolkit and Go API to facilitate the development of phylogenetic workflows Frédéric Lemoine1,2∗ and Olivier Gascuel1,3 1 Unité de Bioinformatique Évolutive, Département de Biologie Computationnelle, Institut Pasteur, Paris, FRANCE, 2 Hub de Bioinformatique et Biostatistique, Département de Biologie Computationnelle, Institut Pasteur, Paris, FRANCE, 3 Current address: Institut de Systématique, Evolution, Biodiversité (ISYEB - UMR 7205), CNRS & Muséum National d’Histoire Naturelle, Paris, FRANCE *To whom correspondence should be addressed: [email protected] Supp. Text 1: Examples of Gotree/Goalign commands pp. 2-4 Supp Figure 1: Representation of the use case workflow and command templates pp. 5-6 Supp. Data 1: Nextflow implementation of the use case pp. 7-8 Supp. Data 2: List of analyzed primate species pp. 9 Supp. Data 3: List of 1,315 orthologous groups from OrthoDB pp. 10-15 1 Supplementary Text 1: Examples of Gotree/Goalign commands The comprehensive list of Gotree/Goalign commands is given on their respective GitHub repositories: https://github.com/evolbioinfo/gotree/blob/master/docs/index.md https://github.com/evolbioinfo/goalign/blob/master/docs/index.md 1) Reformatting a tree from newick to nexus1 gotree reformat nexus -i itol://129215302173073111930481660 The input tree is directly downloaded from iTOL, using its identifier and reformatted in Newick locally. 2) Reformatting an alignment from Fasta to Phylip1 goalign reformat phylip -i https://github.com/evolbioinfo/goalign/raw/master/tests/data/test_xz.xz
    [Show full text]
  • Structure-Based Subfamily Classification of Homeodomains
    Structure-Based Subfamily Classification of Homeodomains by Jennifer Ming-Jiun Tsai A thesis submitted in conformity with the requirements for the degree of Master of Science Graduate Department of Molecular Genetics University of Toronto © Copyright by Jennifer Ming-Jiun Tsai 2008 STRUCTURE-BASED SUBFAMILY CLASSIFICATION OF HOMEODOMAINS A thesis submitted in conformity with the requirements for Master of Science Jennifer Ming-Jiun Tsai Graduate Department of Molecular Genetics, University of Toronto, 2008 Abstract Eukaryotic DNA-binding proteins mediate many important steps in embryonic development and gene regulation. Consequently, a better understanding of these proteins would hopefully allow a more complete picture of gene regulation to be determined. In this study, a structure- based subfamily classification of the homeodomain family of DNA-binding proteins was undertaken in order to determine whether sub-groupings of a protein family could be identified that corresponded to differences in specific function, and identification of subfamily-determining residues was performed in order to gain some insight on functional differences via analysis of the residue properties. Subfamilies appear to have different specific DNA binding properties, according to DNA profiles obtained from TRANSFAC [1] and other sources in the literature. Subfamily-specific residues appear to be frequently associated with the protein-DNA interface and may influence DNA binding via interactions with the DNA phosphate backbone; these residues form a conserved profile uniquely identifying each subfamily. ii Acknowledgements First and foremost, I would like to thank my husband Christopher, my parents David and Virginia, and my sister Margaret for their unfailing love and support that has enabled me to maintain my focus on my studies and research.
    [Show full text]
  • 4 353 Skin Oral 1 B
    A B Supplementary Figure S1: Differentially expressed piRNAs during skin and oral mucosal wound healing. (A) piRNA 0hr-1 0hr-3 0hr-2 24hr-2 24hr-1 24hr-3 5day-3 5day-1 5day-2 0hr-1 0hr-2 0hr-3 24hr-2 24hr-1 24hr-3 5day-1 5day-2 5day-3 profiles were obtained on mouse skin and oral mucosal (palate) wound healing time course (0hr, 24 hr, and 5 day). A total of 357 differentially expressed piRNA were identified during skin wound healing (Bonferroni adjusted P value <0.05). See Supplementary Table 1A for the full list. C (B) Five differentially expressed piRNA skin were identified during oral mucosal wound healing (P value <0.01, list presented in Supplementary Table 1B). Note: more 353 stringent statistical cut-off ((Bonferroni adjusted P value) yield 0 differentially expressed piRNA gene. (C) Venn diagram illustrates overlaps between differentially 4 expressedpiRNAsinskinandoral oral 1 mucosal wound healing. min max Supplementary Table S1a: Differentially expressed piRNAs in skin wound healing Mean StDev piRNA 0 hr 24 hr 5 day 0 hr 24 hr 5 day pVal adj P piR‐mmu‐15927330 5.418351 11.39746 10.799 0.34576 0.253492 0.154802 6.32E‐14 6.97E‐11 piR‐mmu‐49559417 5.301647 10.10777 9.816878 0.441719 0.222335 0.032479 3.95E‐12 4.36E‐09 piR‐mmu‐30053093 6.32531 11.26384 4.020902 0.280841 1.057847 0.178798 5.21E‐12 5.76E‐09 piR‐mmu‐29303577 5.15005 10.47662 9.52175 0.554877 0.26622 0.163283 1.53E‐11 1.69E‐08 piR‐mmu‐49254706 5.187673 10.19644 9.622374 0.520671 0.330378 0.191116 1.96E‐11 2.16E‐08 piR‐mmu‐49005170 5.415133 9.639725 9.565967 0.411507 0.281143
    [Show full text]
  • Homeodomain Proteins: an Update
    Chromosoma (2016) 125:497–521 DOI 10.1007/s00412-015-0543-8 REVIEW Homeodomain proteins: an update Thomas R. Bürglin1,2 & Markus Affolter1 Received: 7 June 2015 /Revised: 20 September 2015 /Accepted: 21 September 2015 /Published online: 13 October 2015 # The Author(s) 2015. This article is published with open access at Springerlink.com Abstract Here, we provide an update of our review on ho- homeodomain protein-DNA interaction. The PRD class of meobox genes that we wrote together with Walter Gehring in homeobox genes was of special interest to Walter Gehring in 1994. Since then, comprehensive surveys of homeobox genes the last two decades. The PRD class comprises six families in have become possible due to genome sequencing projects. Bilateria, and tinkers with four different motifs, i.e., the Using the 103 Drosophila homeobox genes as example, we PAIRED domain, the Groucho-interacting motif EH1 (aka present an updated classification. In animals, there are 16 ma- Octapeptide or TN), the homeodomain, and the OAR motif. jor classes, ANTP, PRD, PRD-LIKE, POU, HNF, CUT (with Homologs of the co-repressor protein Groucho are also pres- four subclasses: ONECUT, CUX, SATB, and CMP), LIM, ent in plants (TOPLESS), where they have been shown to ZF, CERS, PROS, SIX/SO, plus the TALE superclass with interact with small amphipathic motives (EAR), and in yeast the classes IRO, MKX, TGIF, PBC, and MEIS. In plants, (TUP1), where we find an EH1-like motif in MATα2. there are 11 major classes, i.e., HD-ZIP (with four subclasses: I to IV), WOX, NDX, PHD, PLINC, LD, DDT, SAWADEE, Keywords Homeobox .
    [Show full text]
  • Tablet S1. Transcription Factors Induced at Least Two Fold by OHT
    Tablet S1. Transcription factors induced at least two fold by OHT with p<0.01 Probe.Set.ID Gene Symbol Gene Title 207826_s_at ID3 inhibitor of DNA binding 3, dominant negative helix-loop-helix protein 218839_at HEY1 hairy/enhancer-of-split related with YRPW motif 1 203574_at NFIL3 nuclear factor, interleukin 3 regulated 209291_at ID4 inhibitor of DNA binding 4, dominant negative helix-loop-helix protein 44783_s_at HEY1 hairy/enhancer-of-split related with YRPW motif 1 220225_at IRX4 iroquois homeobox protein 4 220493_at DMRT1 doublesex and mab-3 related transcription factor 1 204194_at BACH1 BTB and CNC homology 1, basic leucine zipper transcription factor 1 204622_x_at NR4A2 nuclear receptor subfamily 4, group A, member 2 206140_at LHX2 LIM homeobox 2 212696_s_at RNF4 ring finger protein 4 207144_s_at CITED1 Cbp/p300-interacting transactivator, with Glu/Asp-rich carboxy-terminal domain, 1 41037_at TEAD4 TEA domain family member 4 36711_at MAFF v-maf musculoaponeurotic fibrosarcoma oncogene homolog F (avian) 216248_s_at NR4A2 nuclear receptor subfamily 4, group A, member 2 204131_s_at FOXO3A forkhead box O3A 204132_s_at FOXO3A forkhead box O3A 50221_at TFEB transcription factor EB 204281_at TEAD4 TEA domain family member 4 31845_at ELF4 E74-like factor 4 (ets domain transcription factor) 221558_s_at LEF1 lymphoid enhancer-binding factor 1 208502_s_at PITX1 paired-like homeodomain transcription factor 1 204249_s_at LMO2 LIM domain only 2 (rhombotin-like 1) 204798_at MYB v-myb myeloblastosis viral oncogene homolog (avian) 210319_x_at MSX2
    [Show full text]
  • Suppl Tabl.S4-Bimodal Genes TIVA-Tag
    Lovatt et al. TIVA-tag. Nature Methods Tissue single cells, bimodal genes Culture single cells, bimodal genes UCSC IDS GENE SYMBOL GENE NAME UCSC IDS GENE SYMBOL GENE NAME uc008pup.1 0610031J06Rik RIKEN cDNA 0610031J06 gene uc009kif.1 Urah urate (5-hydroxyiso-) hydrolase uc007wmv.1:uc007wmw.1:uc011zvd.1 1110038F14Rik RIKEN cDNA 1110038F14 gene uc012daf.1 6330407A03Rik RIKEN cDNA 6330407A03 gene uc009nre.1 1190005I06Rik RIKEN cDNA 1190005I06 gene uc008hes.1 8430431K14Rik RIKEN cDNA 8430431K14 gene uc009lpn.1:uc009lpo.1 1700029J07Rik RIKEN cDNA 1700029J07 gene uc007ivs.1 AK038627 uc008mex.2:uc008mey.2 1810024B03Rik RIKEN cDNA 1810024B03 gene uc007fno.1 AK046981 uc007mru.2 1810043H04Rik RIKEN cDNA 1810043H04 gene uc007waz.1 AK085274 uc007rgw.1:uc007rgx.2 2210408I21Rik RIKEN cDNA 2210408I21 gene uc008hiw.1 I830134H01Rik RIKEN cDNA I830134H01 gene uc009igo.1:uc009ign.1 2310010J17Rik RIKEN cDNA 2310010J17 gene uc007ptq.2:uc007ptr.2:uc007ptt.1 Btn1a1 butyrophilin, subfamily 1, member A1 uc007ned.2:uc011ykz.1:uc007nef.2 2410018L13Rik RIKEN cDNA 2410018L13 gene uc008rqf.2 Clca3 chloride channel calcium activated 3 uc008wdb.2:uc008wda.2 2610002J02Rik RIKEN cDNA 2610002J02 gene uc012gkt.1:uc012gkr.1:uc009nlu.1:uc009nlv.1:uc009nls.1:uc009nlr.1:uc009nlt.1:uc012gks.1Clec18a C-type lectin domain family 18, member A uc009pal.2:uc009pak.2 2610203C20Rik RIKEN cDNA 2610203C20 gene uc009gdp.1 Cox7a1 cytochrome c oxidase subunit VIIa 1 uc007nuo.2:uc007nun.2 2700049A03Rik RIKEN cDNA 2700049A03 gene uc008pny.2 Fam198b family with sequence similarity 198, member
    [Show full text]