RZ 3918 (# ZUR1801-015) 01/09/2018 Computer Science/Mathematics 19 pages Research Report INtERAcT: Interaction Network Inference from Vector Representations of Words Matteo Manica, Roland Mathis, Maria Rodriguez Martinez IBM Research – Zurich 8803 Rüschlikon Switzerland LIMITED DISTRIBUTION NOTICE This report has been submitted for publication outside of IBM and will probably be copyrighted if accepted for publication. It has been issued as a Research Report for early dissemination of its contents. In view of the transfer of copyright to the outside pub- lisher, its distribution outside of IBM prior to publication should be limited to peer communications and specific requests. After outside publication, requests should be filled only by reprints or legally obtained copies (e.g., payment of royalties). Some re- ports are available at http://domino.watson.ibm.com/library/Cyberdig.nsf/home. Research Africa • Almaden • Austin • Australia • Brazil • China • Haifa • India • Ireland • Tokyo • Watson • Zurich INtERAcT: Interaction Network Inference from Vector Representations of Words Matteo Manica1,2,*, Roland Mathis1,*, Mar´ıa Rodr´ıguez Mart´ınez1, y ftte,lth,
[email protected] 1 IBM Research Z¨urich, S¨aumerstrasse4, 8803 R¨uschlikon, Switzerland 2 ETH Z¨urich, Institute f¨urMolekulare Systembiologie, Auguste-Piccard-Hof 1 8093, Z¨urich, Switzerland * Shared first authorship y Corresponding author Abstract Background In recent years, the number of biomedical publications made freely available through literature archives is steadfastly growing, resulting in a rich source of untapped new knowledge. Most biomedical facts are however buried in the form of unstructured text, and their exploitation requires expert-knowledge and time-consuming manual curation of published articles.