SUPPLEMENTARY MATERIAL Impact of knowledge accumulation on pathway enrichment analysis Lina Wadi1, Mona Meyer1, Joel Weiser1, Lincoln Stein1,3, Jüri Reimand1,2,*
1 - Ontario Institute for Cancer Research 2 - Department of Medical Biophysics, University Of Toronto 3 - Department of Molecular Genetics, University Of Toronto * - Correspondence: [email protected] Supplementary information, Wadi et al.
Supplementary Figures:...... 4
Supplementary Figure 1: The majority of web-based pathway analysis tools are out of date...... 5
Supplementary Figure 2: Vocabulary of biological pathways and processes is growing rapidly...... 6
Supplementary Figure 3: Gene Ontology terms are increasingly specific...... 7
Supplementary Figure 4: GO terms are increasingly interconnected...... 8
Supplementary Figure 5: Human genes and pathways are increasingly annotated...... 9
Supplementary Figure 6: Genes and pathways of model organisms are increasingly annotated...... 10
Supplementary Figure 7: Dark matter of protein-coding genome is decreasing.. 11
Supplementary Figure 8: Annotation analysis is affected by mismatches in gene symbols...... 12
Supplementary Figure 9: Enrichment analysis of breast cancer essential genes using gene annotations from 2010 misses numerous pathways and processes...... 13
Supplementary Figure 10: Pathway analysis with smaller gene lists confirms the large fraction of pathways missed in analysis of breast cancer genes...... 14
Supplementary Figure 11: Analysis of GO and Reactome pathways with breast cancer data confirms loss of information when using outdated software...... 15
Supplementary Figure 12: Evolution of pathway information affects recently updated and out-of-date software tools...... 16