Data Fusion Tutorial [BC]2 Basel, June 9, 2015
Jane looks foR help! jane’s personal hairball! Hi jane. NAR just published 176 new bio databases*! .... Messy. Think about What’s wrong? Ohhh! all different edge types! I have no idea how to How about stiching Make anything useful. them in a single TRMT61A data table? RecA_monomer-monomer_interface TOP3B Tried it. A nightmare! NFX1 PMS1 RPL11 GBP2 Homologous Recombination Repair RAD52 TP53 Double-Strand Break Repair Think of GO annotationS POLD4 RAD54B
ACACB POLD1 RAD9B DNA Repair I could work this out, RPA1 EXO1 in the data table RPA4 EIF5A ACACA CRYAB MLH1 RPA2 but not for every MND1 TERF2IP PMS2 TOP3A EME1 DNAJA3 of yeast phenotypes! POLR2K CDK2 TERF2 Meiotic Recombination MUS81 different data source RFC5 RFC2 PRKDC RFC4 ATR UBE2I BIOCARTA_ATM_PATHWAY MYO18A RPA3 POLD3 RAD9A out there. MSH3 BARD1 RFC3 FANCD2 BIOCARTA_ATRBRCA_PATHWAY RFC1 PCNA AIRE WRN ZNF280B MLH3 XRCC5 XRCC2 DMC1 MDC1 Told you! MRE11A CSNK1D DNA_recomb/repair_RecA RAD51 COPB2 APEX2 Homologous recombination BRCA1 MSH5 RAD50 DNA_recomb/repair_Rad51_C MSH6 BRIP1 POLD2 FANCL NBN MSH4 MSH2 XRCC6 HSPA9 SEC14L5 H2AFX BRCA2 RAD51D FANCF RAD54L BLM FANCC TOPBP1 CSNK1E MED6 ATM XRCC3 XRCC4 PPP1CC DNA_recomb_RecA/RadB_ATP-bd SHFM1 CHEK2 JUN FANCA Mismatch repair FANCE C10orf2 RAD51AP1 LIG1 MSH5-C6orf26 RAD51C FANCG CHEK1 FEN1 TP53BP1
FIGN SSBP1 RBBP8 UIMC1 PALB2 RAD51B Meiosis Homologous recombination repair of ... GYS1 BARD1 signaling events
Fanconi anemia pathway
CSTF1 FAM175A
ANAPC2
* Fernandez-suarez & galperin, nucleic acids research, 2013.
Large-scale data fusion by collective matrix factorization Tutorial at the Basel Computational Biology Conference, Basel, Switzerland, 2015
These notes include introduction Welcome to the hands-on Data Fusion Tutorial! This tutorial is designed to integrative data analysis with for data mining researchers and biologists with interest in data analysis examples from collaborative and large-scale data integration. We will explore latent factor models, a filtering and systems biology, popular class of approaches that have in recent years seen many and Orange workflows that we successful applications in integrative data analysis. We will describe the will construct during the tutorial. intuition behind matrix factorization and explain why factorization Tutorial instructors: approaches are suitable when collectively analyzing many heterogeneous Marinka Zitnik and Blaz Zupan, data sets. To practice data fusion, we will construct visual data fusion with the help from members of workflows using Orange and its Data Fusion Add-on. Bioinformatics Lab, Ljubljana. If you haven’t already installed Orange, please follow the installation guide at http://biolab.github.io/datafusion-installation-guide.
* See http://helikoid.si/recomb14/zitnik-zupan-recomb14.png for our full award-winning poster on data fusion.