Supplementary Methods and Figures

Supplementary Information Decoding cancer heterogeneity: studying patient-specific signaling signatures towards personalized cancer therapy Efrat Flashner-Abramson1#, Swetha Vasudevan1#, Ibukun Adesoji Adejumobi1, Amir Sonnenblick2 and Nataly Kravchenko-Balasha1,* 1Department for Bio-medical Research, Institute of Dental Sciences, Hebrew University of Jerusalem, Jerusalem 91120, Israel 2 Oncology Division, Tel Aviv Sourasky Medical Center, and Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel *Corresponding author: [email protected] # Equal contribution Table of content: Supplementary Methods ................................................................................................................................................... 3 Surprisal analysis. ......................................................................................................................................................... 3 Determination of the number of significant unbalanced processes. .............................................................................. 6 Generation of functional subnetworks. ......................................................................................................................... 7 Calculation of barcodes. ................................................................................................................................................ 7 Cell lines and reagents. ................................................................................................................................................. 8 Methylene blue assays. ................................................................................................................................................. 8 Western blot analysis. ................................................................................................................................................... 8 Supplementary Figures and legends ................................................................................................................................. 9 Figure S1. ...................................................................................................................................................................... 9 Figure S2. .................................................................................................................................................................... 12 Figure S3. .................................................................................................................................................................... 13 Figure S4. .................................................................................................................................................................... 14 Figure S5. .................................................................................................................................................................... 15 Figure S6. .................................................................................................................................................................... 16 Figure S7. .................................................................................................................................................................... 18 Figure S8. .................................................................................................................................................................... 19 Figure S9. .................................................................................................................................................................... 20 Figure S10. .................................................................................................................................................................. 21 Figure S11. .................................................................................................................................................................. 22 References ....................................................................................................................................................................... 23 2 Supplementary Methods Surprisal analysis. Surprisal analysis is a thermodynamic-based information-theoretic approach [1–3]. The analysis is based on the premise that biological systems reach a balanced state when the system is free of constraints [4–6]. However, when under the influence of environmental and genomic constraints, the system is prevented from reaching the state of minimal free energy, and instead reaches a state which is higher in free energy (in biological systems, which are normally under constant temperature and constant pressure, minimal free energy equals maximal entropy). For example, if the system under study is a living cell, an environmental constraint can be exposure to a drug, which inflicts a change in protein concentrations and activities in the cell. The system can be influenced by genomic constraints as well, such as genomic mutations that in turn affect protein function, often eliciting alteration of specific signaling pathways to oppose the functions of the damaged protein. Surprisal analysis can take as input the expression levels of various macromolecules, e.g. genes, transcripts, or proteins. However, be it environmental or genomic alterations, it is the proteins that constitute the functional output in living systems, therefore we base our analysis on proteomic data. The varying forces, or constraints, that act upon living cells ultimately manifest as alterations in the cellular protein network. Each constraint induces a change in a specific part of the protein network in the cells. The subnetwork that is altered due to the specific constraint is termed an unbalanced process. System can be influenced by several constraints thus leading to the emergence of several unbalanced processes. When tumor systems are characterized, the specific set of unbalanced processes is what constitutes the tumor-specific signaling signature. For every protein, i, surprisal analysis calculates its expected expression level when the system is at the steady state 0 and free of constraints: Xi . This term was shown to be constant, i.e. is independent of time and of the actual state of 0 the system [7–10]. In terms of information theory, Xi represents the state of maximal entropy, or minimal information. 0 Xi(k) is the actual, experimentally measured expression level of the protein i in the tumor k. In cases where Xi(k)≠ Xi , we assume that the expression level of the protein i was altered due to constraints that operate on the system. Surprisal 3 analysis discovers the complete set of constraints operating on the system in any given tumor, k, by utilizing the following equation [7]: 0 ln Xi(k) = ln Xi (k) – ΣGiαλα(k) (1) The term ΣGiαλα(k) represents the sum of deviations in expression level of the protein i due to the various constraints, or unbalanced processes, that exist in the tumor k. The processes are indexed α =1, 2, 3, … , such that the dominance of the process decreases with increasing index, i.e. unbalanced process 1 is active in more tumors than unbalanced processes 2, 3 etc. 0 The difference between the steady state expression level, Xi , and the actual expression level, Xi(k), represents the amount of information we have about protein i. A protein that is influenced by constraints, i.e. is influenced by one or more unbalanced processes and is therefore functionally linked to other proteins, cannot take any possible expression level. Rather, its expression level is affected by the expression levels of other proteins. The term Giα denotes the degree of participation of the protein i in the unbalanced process α, and its sign indicates the correlation or anti-correlation between proteins in the same process. For example, in a certain process α, proteins can be assigned the values: Gprotein1,α = -0.50, Gprotein2,α = 0.24, and Gprotein3,α = 0.00, indicating that this process altered proteins 1 and 2 in opposite directions (i.e. protein 1 is upregulated and protein 2 is downregulated, or vice versa due to the process α), while not affecting protein 3. Note that each protein can take part in a number of unbalanced processes at once. Importantly, not all processes are active in all tumors. The term λα(k) represents the importance of the unbalanced process α in the tumor k. Its sign indicates the correlation or anti-correlation between the same processes in different tumors. For example, if the process α is assigned the values: λα(1) = 3.1, λα(20) = 0.0, and λα(138) = 2.5, it means that this process influences the tumors of the patients indexed 1 and 138 in the same direction, while it is inactive in patient 20. The partial deviations in expression level of the protein i due to the different constraints sum up to the total change in expression level (relative to the balance state level), ΣGiαλα(k) . Surprisal analysis uses a covariance matrix of the surprisals, meaning the covariance of the natural logarithms, as an intermediate numerical step (11). This has a physical, rather than purely statistical, significance [7]. That matrix has a 4 mathematical form dictated by the theory, which relates changes in protein concentrations to changes in chemical potentials (see mathematical explanation below). A change in chemical potential is directly related to the change in the free energy of the system. Therefore, changes in chemical

Supplementary Methods and Figures

A Thermodynamic Based Interpretation of Protein Expression Heterogeneity in Different GBM Tumors Identifies Tumor Specific Unbalanced Processes

ON STATISTICAL MODELS and PRIOR DISTRIBUTIONS in the THEORY of CHEMICAL REACTIONS A. BEN-SHAUL Main Characteristics: (I) the E

Surprisal Analysis Characterizes the Free Energy Time Course of Cancer Cells Undergoing Epithelial-To- Mesenchymal Transition

Information-Theoretic Analysis of Phenotype Changes in Early Stages of Carcinogenesis

A Synaptic Neural Network and Synapse Learning

Personalized Disease Signatures Through Information-Theoretic

General Rate Equation for Maximal Entropy Generation in a Maximum-Entropy Landscape with Time-Dependent Constraints

Single Cell Proteomics in Biomedicine: High-Dimensional Data Acquisition

Resistance Is Futile: Physical Science, Systems Biology and Single-Cell Analysis to Understanding the Plastic and Heterogeneous

Surprisal Analysis of Genome-Wide Transcript Profiling Identifies

Convergence of Logic of Cellular Regulation in Different

Surprisal Analysis of Glioblastoma Multiform (GBM) Microrna Dynamics Unveils Tumor Specific Phenotype