<<

Bringing the tools to the data - Providing scientists with personalized and on-demand services on the cloud of the French Institute of Bioinformatics! ! ! Christophe BLANCHET, Jean-François GIBRAT (Institut Français de Bioinformatique , !IFB-core CNRS UMS3601)! !

Life science researchers, thanks to the continuous tools R BLAST improvement of experimental technologies, face a deluge of FastA OMSSA ClustalW2 SSearch data whose exploitation requires large computing resources PeptideShaker ARIA X!tandem HMMer BWA and appropriate software tools. They simultaneously use many TopHat Galaxy Muscle fastQC of the bioinformatics tools from the arsenal of thousands Omega

Create available from the international community. Usually they new cloud services + combine their data with public data that are too large to be system moved easily. So the computational infrastructure need to be Virtual Machines tightly connected to public biological databases.! Bioinformatics Marketplace

Structures Galaxy ! Sequences Proteomics ... At the French Institute of Bioinformatics (IFB), we developed several bioinformatics cloud services available as cloud appliances. A cloud appliance is a predefined virtual machine that can be run on a remote cloud infrastructure. As cloud appliances have size usually of gigabytes, this is more efficient to moved them where the terabytes of biological data to analyse are stored instead of moving the data. But that requires to have at least few computing resources close to the stored data.! ! We have created bioinformatics appliances Bioinformatics Marketplace providing, for example, an user-devoted Galaxy Select tools Structures Galaxy ... Scientists can filter (1) Sequences Proteomics the appliances through a portal, a virtual desktop environment for (2) Web interface to identify Move and launch (2) the cloud appropriate ones. proteomics analysis or a bioinformatics cluster data VMs tools: BLAST, ClustalW2, BI R, Galaxy… with a lot of standard tools (BLAST, ClustalW2, public data (1) user data UNIPROT Z R, Samtools, Bowtie, TopHat, etc.). Scientists EMBL shared B Genomes PDB W A PROSITE can run their own appliances through an user-

Cloud @ IFB Analyze (3) adapted web interface. To connect our cloud data

Use tools (3) infrastructure to existing public biological Scientists have access to their own cloud resources through web portal, remote virtual desktop or SSH. databases, we have configured it to connect

Cloud Galaxy portal Remote virtual desktop for NGS analyses with proteomics tools automatically all virtual machines to a local repository with public databases like UNIPROT ! or EMBL.! IFB is currently running an academic cloud infrastructure with the appropriate biological data and bioinformatics! tools to meet the needs of the life science community. ! ! ! ! French Institute of Bioinformatics - http://www.france-bioinformatique.fr !