COLLAGEN CROSSLINKING AND EXTRACELLULAR MATRIX REMODELING IN
BREAST AND PANCREATIC CANCER
by
ALEXANDER STEVEN BARRETT
B.S., Florida State University, 2010
M.S., University of South Florida, 2013
A thesis submitted to the
Faculty of the Graduate School of the
University of Colorado in partial fulfillment
of the requirements for the degree of
Doctor of Philosophy
Structural Biology and Biochemistry
2017
This thesis for the Doctor of Philosophy degree by Alexander S. Barrett has been approved for the Structural Biology and Biochemistry Program by
Jeff S. Kieft, Chair Elan Z. Eisenmesser Robert S. Hodges Traci R. Lyons Virginia F. Borges Kirk C. Hansen, Advisor
Date: 12/15/2017
ii
Barrett, Alexander Steven (Ph.D., Structural Biology and Biochemistry)
Collagen Crosslinking and Extracellular Matrix Remodeling in Breast and Pancreatic
Cancer
Thesis directed by Associate Professor Kirk C. Hansen
ABSTRACT
Much effort has been devoted to understanding the molecular mechanisms by which stromal remodeling contributes to solid tumor progression. The extracellular matrix (ECM) is a major component of the stroma that normally regulates tissue development and homeostasis, however; its dysregulation contributes to tumor progression. While the ECM serves as the scaffold upon which tissues are organized, it also provides essential biochemical and biomechanical cues that direct cell growth, survival, migration, differentiation and immune function. While it is known that genetic mutations initiate and drive malignant transformation, cancer progresses within a dynamic ECM that is capable of modulating the hallmarks of cancer. Many solid tumors are characterized by tissue fibrosis that contributes to poor prognosis, however; comprehensive characterization of ECM composition and biomechanically relevant features, such as lysyl oxidase (LOX) mediated collagen crosslinking, remain understudied in tumor progression. Significant hurdles in characterizing a desmoplastic stroma include methods capable of characterizing insoluble ECM components and downstream analytical techniques to analyze these components. Therefore, this research aims at developing and applying new
iii analytical methods to unravel the complex connection between fibrosis, tissue stiffness, and collagen crosslinking by revealing a more complete and detailed molecular view of how the ECM changes during tumor progression. As such, we have applied these methods to characterize ECM composition and crosslinking in breast and pancreatic cancer, both of which are characterized by substantial ECM remodeling leading to formation of a desmoplastic stroma. We have used genetically engineered mouse models of breast and pancreatic cancer aimed at modulating tumor fibrosis in vivo and go on to assess similarities and differences in these models with clinical patient samples. Taken together, the work included in this thesis provides an updated toolkit for ECM characterization and sheds light into new prognostic markers and therapeutic targets that may improve the survival of patients whose tumors are characterized by fibrosis.
The form and content of this abstract are approved. I recommend its publication.
Approved: Kirk C. Hansen
iv
I dedicate this work to my grandmother, Eunice Barrett.
v
ACKNOWLEDGEMENTS
Completion of this thesis and my academic journey has been one of the great joys of my life thus far. The discoveries presented here are a testament to the love, support, and encouragement from a number of people, without whom none of this would be possible. Many of these people have helped to shape me into the scientist that I am today and to you all, I am forever grateful.
First and foremost, I would like to thank my thesis advisor and mentor Dr. Kirk
Hansen. Thank you for giving me intellectual freedom in my work, engaging me in new ideas, and demanding a high quality of work in all my endeavors. Your passion and drive to push the limits of scientific discovery is a constant unwavering beacon that has always drawn me in closer. Thank you for always making me look forward to the next experiment with your endless passion for discovery.
Next, I would like to thank the members of the Hansen Lab and Mass
Spectrometry Core Facilities. Without the help of Ryan Hill and Monika
Dzieciatkowska, I would not have become the expert in mass spectrometry and proteomics that I am today and I am standing on their shoulders with the completion of this thesis. Monika, you are an exceptional scientist and have taught me more about mass spectrometers than I thought I would ever get to know. Thank you for always being there for me with kind words and intelligent solutions. Ryan, you have laid the groundwork for all the exciting work I was able to accomplish in the lab. I count some of the projects we have worked on together among some of my best times in the lab and am grateful for your help along the way. Travis Nemkov, thank you for being my buddy in the lab, motivating me to accomplish my goals and going
vi through the journey with me. The fun times we have had along the way are some of my greatest memories from school. Angelo D’Alessandro, your unwavering support of my professional development and your dedication to scientific discovery has directly motivated me to do more each day. I have cherished the intelligent discussions we have had and your words of encouragement. I would like to thank the many collaborators that I have worked with during my time in school who were integral to the success of the projects presented here. In particular, I would like to thank Ori Maller for his friendship and help along the way. I would also like to thank the members of my thesis committee for their insightful comments and suggestions:
Robert Hodges, Elan Eisenmesser, Jeff Kieft, Traci Lyons and Ginger Borges. Also, thank you to the students, faculty and program administrators for their support during my time in school. Elizabeth Wethington, thank you for your friendship and always being there for me.
Lastly, I would like to thank my friends and family. To my parents, Steve and
Veronica, you have taught me to believe in myself and pushed me to achieve the goals I set out for myself. Thank you for always knowing just what to say when the going got tough. I would like to thank my brother Cam Barrett for always being there for me and reminding me to have fun. I would like to thank my grandma, Eunice for always being my number one supporter in everything that I do. To my soon to be wife, Katie: I would not have come this far without your love and belief in me. You are my best friend and your unbreakable spirit has motivated me to accomplish my goals and strive for success in everything that I do.
vii
TABLE OF CONTENTS CHAPTER
I: THE EXTRACELLULAR MATRIX AT A GLANCE ...... 1
Thesis Direction and Summary ...... 1
Introduction ...... 3
The Role of the ECM in Tissue Development and Homeostasis ...... 3
Properties and Features of the ECM ...... 5
Collagen and Elastin ...... 6
Collagen ...... 6
History ...... 6
The collagen family ...... 8
Collagen biosynthesis ...... 9
Collagen Crosslinking ...... 10
Immature bivalent reducible crosslinks ...... 11
Lysyl hydroxylases ...... 12
Hydroxy lysine aldehyde crosslinking pathway ...... 12
Lysine aldehyde crosslinking pathway ...... 13
Mature trivalent crosslinks ...... 14
Elastin ...... 15
Properties of elastin ...... 15
Elastin crosslinking ...... 16
Fibronectin ...... 17
Proteoglycans ...... 18
Laminin and Basement Membranes ...... 19
Matricellular Proteins ...... 20
The ECM and Cancer ...... 22
viii
The Provisional ECM and the Wound Healing Response ...... 22
Biomechanical Signaling in the Tumor Microenvironment ...... 23
Dysregulated ECM Dynamics are a Hallmark of Cancer ...... 23
ECM Stiffness and Collagen Crosslinking are Tumor Promotional ..... 25
Abnormal ECM Architectures are Associated with
Tumor Progression ...... 26
Figures ...... 27
II: HYDROXYLAMINE CHEMICAL DIGESTION FOR INSOLUBLE EXTRACELLULAR MATRIX CHARACTERIZATION ...... 32
Introduction ...... 32
Materials and Methods ...... 36
Reagents ...... 36
QconCAT Design...... 36
Sample Preparation for Proteomics ...... 36
Hydroxylamine (NH2OH) Treatment ...... 37
Cyanogen Bromide (CNBr) Treatment ...... 38
Trypsin Digestion ...... 38
LC-SRM Analysis ...... 38
LC-MS/MS Analysis ...... 39
Database Searching and Protein Identification ...... 40
Error Tolerant Searches ...... 41
Results ...... 41
CNBr Versus NH2OH Digestion of Insoluble ECM Components ...... 41
Femur ...... 43
Skin ...... 45
ix
Lung ...... 46
Muscle ...... 48
Liver ...... 50
Evaluation of Chemical Digestion Cleavage Specificity ...... 51
Error Tolerant Searches ...... 52
Discussion ...... 53
Figures ...... 58
III: EXTRACELLULAR MATRIX REMODELING IN MAMMARY GLAND
AND LIVER TISSUE MICROENVIRONMENTS DURING
THE REPRODUCTIVE CYCLE ...... 67
Introduction ...... 67
Materials and Methods ...... 70
Rodent Studies ...... 70
Cell Lines ...... 71
Portal Vein Injections ...... 71
Immunofluorescence and Imaging ...... 71
Sample Preparation for Proteomic Analysis ...... 72
Detergent/chaotrope Removal & Protein Digestion ...... 72
Liquid Chromatography Tandem Mass Spectrometry
& Data Analysis ...... 72
Statistics ...... 73
Results ...... 73
Development of Quantitative ECM Proteomic Methodology ...... 73
QconCAT Based Proteomics Reveals Unique and Shared
Mammary Gland and Liver ECM Profiles ...... 75
x
Mammary Gland ECM Proteomics Across the Reproductive Cycle .. 78
Discussion ...... 80
Note About Use of CNBr ...... 85
Figures ...... 86
IV: INCREASED MAMMOGRAPHIC DENSITY IS CORRELATED WITH
FIBRILLAR COLLAGEN ABUNDANCE ...... 93
Introduction ...... 93
Results ...... 95
Discussion ...... 100
Figures ...... 104
V: HYDROXY LYSINE DERIVED COLLAGEN CROSSLINKS
PROMOTE POOR BREAST CANCER PATIENT PROGNOSIS AND
TREATMENT RESISTANCE ...... 109
Introduction ...... 109
Materials and Methods ...... 112
Preparation of Tissue for Hydrolysis ...... 112
Protein Hydrolysis ...... 112
Preparation of Crosslink Enrichment Column ...... 113
UHPLC Analysis ...... 113
MS Data Acquisition ...... 114
Quantification of Crosslinked Amino Acids ...... 115
Human Breast Tissue ...... 116
Picrosirius Red Staining and Quantification ...... 116
Tissue Preparation for AFM Measurements of ECM Stiffness ...... 117
AFM Measurements of ECM Stiffness on Tissue Sections ...... 117
xi
SHG Image Acquisition ...... 118
LH2 IHC and Prognostic Analyses ...... 118
Study population ...... 118
Tumor evaluation ...... 119
IHC statistical analyses ...... 121
Statistical Analysis ...... 122
Results ...... 123
Overexpression of Lysyl Oxidase in the Mammary Tumor
Stroma Results in Increased Collagen Crosslinking ...... 123
Association Between Collagen Crosslinking Abundance
and Fiber Organization in Human Breast Tissue ...... 125
Characterization of Collagen Crosslinking in Human
Breast Tumor Subtypes ...... 126
High PLOD2 Expression is Associated with Poor Breast
Cancer Prognosis and Treatment Resistance ...... 127
Discussion ...... 129
Figures ...... 133
VI: GENOTYPE TUNES PDAC TENSION TO DRIVE MATRICELLULAR-
ENRICHED FIBROSIS AND TUMOR AGGRESSION ...... 142
Introduction ...... 142
Materials and Methods ...... 146
Mice Studies ...... 146
Histology ...... 147
LC-MS/MS and LC-SRM Proteomic Analysis ...... 147
Atomic Force Microscopy Measurements ...... 147
xii
Two-Photon Second Harmonic Microscopy and Analysis ...... 147
Results ...... 148
Discussion ...... 152
Human PDAC Pilot Summary ...... 155
Figures ...... 158
VII: EXAMINATION OF MATRICELLULAR FIBROSIS AND WOUND HEALING
IN A MODEL OF PANCREATIC DUCTAL ADENOCARCINOMA
PROGRESSION ...... 165
Introduction ...... 165
Materials and Methods ...... 168
Reagents ...... 168
Proteomic Sample Preparation ...... 168
Hydroxylamine (NH2OH) Treatment ...... 169
Trypsin Digestion ...... 170
LC-SRM Analysis ...... 170
LC-MS/MS Analysis ...... 170
Proteomic Data Analysis ...... 171
xAAA Sample Preparation ...... 171
Protein Hydrolysis ...... 172
xAAA Data Analysis ...... 172
Database Searching and Protein Identification ...... 173
H&E Staining ...... 173
Picrosirius Red Staining and Quantification ...... 173
Results ...... 174
Discussion ...... 182
xiii
Figures ...... 187
VIII: SUMMARY OF FINDINGS AND FUTURE DIRECTIONS ...... 193
Tools to Investigate the Extracellular Matrix ...... 193
Mammary Gland ECM Remodeling ...... 196
Human Breast Cancer ...... 197
Pancreatic Cancer ...... 200
Figures ...... 203
IX: COLORADO CLINICAL AND TRANSLATIONAL
SCIENCE FELLOWSHIP ...... 204
Medical Oncology - Breast Cancer Center ...... 204 Autopsy Pathology ...... 205 Summary ...... 207
REFERENCES ...... 208
APPENDIX ...... 237
A. Crosslinked Amino Acid Standard Characterization ...... 237
B. Publications ...... 240
xiv
LIST OF FIGURES FIGURE 1.1 Schematic Diagram of Lysine Hydroxylation and Crosslinking in Collagen .. 27
1.2 Hydroxy Lysine Aldehyde (Hylald) Crosslinking Pathway ...... 28
1.3 Lysine Aldehyde (Lysald) Collagen Crosslinking Pathway ...... 29
1.4 Proposed Scheme of Natural Crosslinking Reactions in Elastin ...... 30
1.5 Abnormal Breast ECM Architectures are Associated with Tumor Progression ...... 31 2.1 Workflow Diagram for Comparative Analysis of CNBr and NH2OH Using Quantitative QconCAT ECM Proteomics ...... 58
2.2 Quantitative Comparison of Chemical Digestion Methods for Enrichment of Insoluble Femur Matrix ...... 59
2.3 Quantitative Comparison of Chemical Digestion Methods for Enrichment of Insoluble Femur Matrix ...... 60
2.4 Quantitative Comparison of Chemical Digestion Methods for Enrichment of Insoluble Skin Matrix ...... 61
2.5 Quantitative Comparison of Chemical Digestion Methods for Enrichment of Insoluble Lung Matrix ...... 62
2.6 Quantitative Comparison of Chemical Digestion Methods for Enrichment of Insoluble Muscle Matrix ...... 63
2.7 Quantitative Comparison of Chemical Digestion Methods for Enrichment of Insoluble Liver Matrix ...... 64
2.8 CNBr Selectively Brominates Tyrosine Containing Collagen Peptides ...... 65
2.9 Fibrillar Collagen and ECM Abundance Across Tissues ...... 66
3.1 Quantitative QconCAT ECM Proteomics Pipeline ...... 86
3.2 QconCAT Based ECM Proteomics Reveals Unique and Shared Mammary Gland and Liver ECM Profiles ...... 88
3.3 Principal Component Analysis Reveals Dynamic and Cyclical Mammary
xv
Gland ECM Remodeling Across the Reproductive Cycle ...... 90
3.4 Quantitative ECM Proteomics Unravels The Unique Composition and Abundance of ECM Proteins Across the Reproductive Cycle ...... 91
4.1 Comparing the ECM Composition of Human Breast Tissue at Different Densities ...... 104
4.2 Fibrillar Collagen Abundance Correlates with Increased Mammographic Density ...... 106
4.3 Mammographic Density is Not Associated with Significant Changes in Collagen Crosslinking ...... 108
5.1 xAAA Workflow Diagram and Crosslink Analysis of Normal Tissues ...... 133
5.2 Overexpression of Lysyl Oxidase in the Mammary Tumor Epithelium Does Not Alter Collagen Crosslinking ...... 134
5.3 Collagen Crosslinking Closely Correlates with Fibrillar Collagen Accumulation and ECM Stiffness in a Mammary Tumors Overexpressing Lysyl Oxidase in the Stromal Compartment ...... 136
5.4 Curly and Straightened Invasive Ductal Adenocarcinoma Architectures are Both Associated with Increased in Collagen Crosslinking ...... 138
5.5 Triple-Negative Breast Cancer Patients Favor the Formation of Hydroxy Lysine Derived Collagen Crosslinks ...... 139
5.6 High LH2 Expression Correlates with Poorly Differentiated Tumors and Cumulative Distant Metastasis-Free Survival in Triple-Negative Patients .. 140
6.1 PDAC Genotype Tunes Epithelial Tension to Regulate Fibrosis ...... 158
6.2 Targeted Proteomics and Crosslinking Analysis Reveals Changes in Protein Abundance, Solubility and Crosslinking ...... 160
6.3 JAK-Stat3 Signaling Drives ECM Remodeling and Stiffening ...... 162
6.4 Human PDAC Lesions are Characterized by Increased Matricellular Protein Abundance and Crosslinking ...... 163
xvi
7.1 Characterization of Cell Population Markers in Normal Pancreas and During PDAC Progression ...... 187
7.2 KTC PDAC Demonstrate a Unique ECM Composition Relative to Normal Pancreatic Tissue ...... 188
7.3 KTC PDAC Demonstrate a Unique ECM Composition Relative to Normal Pancreatic Tissue ...... 190
7.4 Collagen Crosslinking Decreases During KTC PDAC Progression ...... 191
8.1 Towards Comprehensive Characterization of the Solid Tumor ECM ...... 203
8.2 Broad Applicability of xAAA Method to Hard and Soft Tissues ...... 204
A.1 Summary of xAA Standard Characterization ...... 255
A.2 xAA Standard Characterization by MS ...... 256
xvii
ABBREVIATIONS
BAPN ß-aminopropionitrile BGN Biglycan CELA2A Chymotrypsin Like Elastase Family Member 2A CHAPS 3-[(3-cholamidopropyl)dimethylammonio]-1-propanesulfonate CNBr Cyanogen Bromide COMP Cartilage Oligomeric Matrix Protein CPA1 Carboxypeptidase A1 CSPG Chondroitin Sulphate Proteoglycans CTRB1 Chymotrypsinogen B1 CV Coefficient of Variance CV Coefficient of Variance DCIS Ductal Carcinoma In Situ DCN Decorin deH-DHLNL dehydro-Dihydroxylysinonorleucine deH-HHMD dehydro-Histidino-Hydroxymerodesmosine deh-HLNL dehydro-Hydroxylysinonorleucine deH-HLNL dehydro-Hydroxylysinonorleucine deH-LNL dehydro-Lysinonorleucine DHLNL Dihydroxy Lysinonorleucine DPT Dermatopontin dPyr deoxy Lysyl Pyridinoline ECM Extracellular Matrix EDTA Ethylenediaminetetraacetic Acid EGFR Epidermal Growth Factor EHS Engelbreth-Holm Swarm EMT Epithelial to Mesenchymal Transition ER Endoplasmic Reticulum FA Formic Acid FACIT Fibril-Associated Collagens with Interrupted Triple Helices FGA Fibrinogen α FGB Fibrinogen FGG Fibrinogen FN Fibronectin GAG Glycosaminoglycan GCG Glucagon Precursor GEMM Genetically Engineered Mouse Model GFAP Glial Fibrillary Acidic Protein GFP Green Fluorescent Protein GPCR G-Protein Coupled Receptor H&E Hematoxylin and Eosin
xviii
HHMD Histidino-Hydroxymerodesmosine His Histidine ald HLCC Hydroxy Lysine aldehyde (Hyl )-derived Collagen Crosslinks HLKNL Hydroxylysino-Keto-Norleucine HLNL Hydroxy Lysinonorleucine HSPG Heparan Sulfate Proteoglycans Hyl Hydroxy Lysine ald Hyl Hydroxy Lysine Aldehydes Hyp Hydroxy Proline IDC Invasive Ductal Carcinoma iECM Insoluble ECM IHC Immunohistochemistry INS2 Insulin-2 Precursor JAK Janus Kinase KRT20 Keratin, type-I, Cytoskeletal 20 KRT7 Keratin type-II, Cytoskeletal 7 ald LCC Lysine aldehyde (Hyl )-derived Collagen Crosslinks LC-MS/MS Liquid Chromatography Mass Spectrometry LC-SRM Liquid Chromatography Select Reaction Monitoring LH Lysyl Hydroxylase LKNL Lysino-Keto-Norleucine LNL Lysinonorleucine LOX Lysyl Oxidase LOXL Lysyl Oxidase Like LUM Lumican Lys Lysine ald Lys Lysine Aldehydes MD Mammographic Density NaBH4 Sodium Borohydride NG Asparagine Glycine (Asn-Gly) NH2OH Hydroxylamine P3H Prolyl-3 Hydroxylase P4H Prolyl-4 Hydroxylase PanIN Pancreatic Intraepithelial Neoplasia PCA Principal Component Analysis PDAC Pancreatic Ductal Adenocarcinoma PEDF Pigment Epithelium-Derived Factor PLOD Procollagen-Lysine, 2-Oxoglutarate 5-Dioxygenase PLS-DA Partial Least Squares – Discriminant Analysis Pro Proline PRSS2 Anionic Trypsin-2 Precursor
xix
PTM Post-Translational Modification Pyr Hydroxy Lysyl Pyridinoline QconCAT Quantitative Concatemer QM-PDA Quasi-Mesenchymal PDAC ROCK Rho-associated Kinase sECM Soluble ECM SHG Second Harmonic Generation SHH Stromal Sonic Hedgehog SIL Stable Isotope Labeled SLRP Small Leucine Rich Proteoglycans SLRPs Small Leucine-Rich Proteoglycans SMA Smooth Muscle Actin SPARC Secreted Protein Acidic and Rich in Cysteine SPE Solid Phase Extraction SRY Sex-determining Region Y STAT3 Signal Transducer and Activator of Transcription 3 TACS Tumor Associated Collagen Signatures TFA Trifluoracetic Acid TFM Traction Force Microscopy TGF Transforming Growth Factor Beta THBS Thrombospondin TIC Total Ion Current TNC Tenascin-C TPM2 Tropomyosin beta chain TSR Thrombospondin type-1 Repeats VIM Vimentin VPLIR Virgin, Pregnancy, Lactation, Involution, Regression YAP Yes-associated Protein 1
xx
CHAPTER I
THE EXTRACELLULAR MATRIX AT A GLANCE
Thesis Direction and Summary
Tumors are stiffer than normal tissue and are characterized by a collagen-rich extracellular matrix (ECM) that is dynamically remodeled during disease progression. Recently, much effort has been put forth to understanding how ECM architectures and compositions are involved in the pathogenesis and progression of solid tumors. However, the majority of studies have taken reductionist approaches where only a handful of ECM components are studied at one time. While these studies remain highly influential, a lack of knowledge exists regarding ECM composition and crosslinking in tumor microenvironment. The problem is two-fold: 1) there is a lack of methods that exist capable of solubilizing insoluble ECM proteins
(i.e. fibrillar collagen) and 2) direct characterization of collagen crosslinking in the human tumors has not been attempted. Together these problems have led to a perpetual underrepresentation of ECM components in large proteomic datasets and a gap in knowledge regarding how alterations in the activity of crosslinking-related enzymes, such as lysyl oxidase (LOX) and lysyl hydroxylase (LH), directly affect the abundance and specificity of collagen crosslinks in tumors. In this chapter, I will provide a primer on the components, properties and features of the ECM and how the ECM can modulate some of the hallmarks of cancer. The rest of the chapters have been organized such that each represents an individual manuscript that is either accepted, under review, or in preparation. In Chapter II, I develop a new approach for the characterization of insoluble ECM components and demonstrate
1 how it may be applied to tissues that vary in their overall ECM content. Application of this approach to solubilize the insoluble ECM components of breast and pancreatic tissue in several different contexts is applied throughout this work (Chapters III, IV,
VI, VII) and further demonstrates the broad utility of the approach. In Chapter III, targeted proteomics is utilized to quantitatively characterize ECM remodeling in the rodent mammary gland and liver during the reproductive cycle. In Chapter IV, I investigate the relationship between mammographic density, fibrillar collagen abundance and collagen crosslinking in clinical patient samples using targeted proteomics and crosslinked amino acid analysis (xAAA). In Chapter V, I will provide a detailed description of the xAAA method. Additionally, I will offer evidence for the direct relationship between LOX, collagen crosslinking, and tissue stiffness in human breast cancer. I go on to demonstrate the importance of LH in the formation of hydroxy lysine derived collagen crosslinks (HLCCs) and how its expression affects distant metastasis free survival and treatment resistance in a large breast cancer patient cohort. In Chapters VI and VII, I apply both ECM proteomics and crosslinking analysis to study matricellular fibrosis and wound healing in human pancreatic ductal adenocarcinoma (PDAC) and three genotypically distinct murine models of PDAC. I also perform a follow up study that explores differences between the normal pancreas stroma and PDAC stroma at early and late timepoints to further investigate matricellular fibrosis in the context of disease progression. Taken together, the work included in this thesis delivers an extensive resource regarding
ECM composition and crosslinking in the tumor microenvironment and provides insight into potential therapeutic strategies that may improve the survival of breast
2 and pancreatic cancer patients whose tumors are characterized by fibrosis. A summary of findings is presented in Chapter VIII. A reflection on my experience in the Colorado Clinical and Translational Scientist training program is described in
Chapter IX.
Introduction
The Role of the ECM in Normal Tissue Development and Homeostasis
The extracellular matrix (ECM) is the non-cellular component present within all tissues and organs and is a major component of the cellular microenvironment (1,
2). In addition to functioning as a physical scaffold for cells, the ECM initiates and maintains essential biochemical and biomechanical cues required for tissue development, differentiation and homeostasis through sequestration and release of growth factors and matricryptins, modulation of hydration levels, and pH of the local microenvironment, to name a few (3). Taking this into account, the structure of tissues and organs is critical for their function. Indeed, loss of normal tissue architecture is a prerequisite for, and one of the hallmarks of most cancers. As such, normal organ architecture can act as a powerful tumor suppressor, capable of preventing and reverting malignant phenotypes even in cells with disease causing mutations (4-6). The fact that organ function and homeostasis are driven by organ architecture and that cells in every organ carry the same genetic information, begs the question of how tissue-specific form and function is achieved? Dynamic reciprocity is a concept that describes how tissue-specific function is achieved through interactions between the cell and surrounding extracellular matrix (7). The model describes the dynamic bi-directional cross talk between the ECM and the
3 cellular microenvironment that determines the structure and function of a given tissue (8).
The continuously regulated process of differentiation can be defined as the acquisition of tissue specific functions that modulate interactions between the ECM and the cellular microenvironment. In this way, bi-directional dynamic reciprocity plays a major role in maintaining stable expression of differentiation-specific genes
(9). However, final tissue specific architecture, form and function are influenced by the unique context in which they develop. Every organ is composed of tissues derived from one of the three embryonic germ layers: 1. Endoderm which forms the epithelium of the lungs, pancreas, liver and digestive organs, among others, 2.
Mesoderm which generates blood vessels, bone, muscle, and mesenchymal connective tissue, among others, and 3. Ectoderm which gives rise to the epithelium of the skin and its derivatives, including the mammary gland (8). Interactions between epithelial and mesenchymal constituents during development direct the creation of normal tissue architecture (e.g. morphogenesis). The idea that tissue development is not cell autonomous, but is instead instructed by the surrounding environment was hypothesized as early as 1817 (10).
While cell-ECM crosstalk is integral for the initial development of tissues, it is also important within the context of tissue repair after injury. The process of somatic wound healing, regardless of direct cause (e.g. immunological, physical etc.), proceeds similarly with that of initial embryonic development. As such, the various stages of wound healing are characterized by massive cell migration, phenotypic differentiation, and a heightened biosynthetic activity at the site of repair. The
4 dynamic interactions that occur between growth factors and ECM during these stages are integral to wound healing.
Properties and Features of the ECM
The direct and indirect mechanisms by which the ECM regulates cell behavior are complex. The ECM is composed of biochemically distinct macromolecules that can be broken down into three broad groups: fibrous proteins (including collagens and elastin), glycoproteins (including fibronectin, proteoglycans, laminin and basement membranes) and matricellular proteins (including tenascin and thrombospondins). The protein and non-protein constituents of the ECM vary not only in terms of their functional roles but also in terms of their structure. To that end, these components work together to form both a basement membrane as well as an interstitial matrix. Stromal cells primarily lay down the fibrous proteins that form the interstitial matrix such as fibrillar collagen, matricellular proteoglycans like tenascin C and structural glycoproteins such as fibronectin. The basement membrane is produced by epithelial, endothelial and stromal cells to separate the epithelium from the stroma. The basement membrane component is primarily composed of type IV collagens, and laminins (among others) which help to connect the basement membrane to the stroma through interactions with collagen. While the basement membrane is readily solubilized in high salt, detergent-rich buffers, the stroma is far more insoluble and requires extensive extraction in a strong chaotrope and subsequent chemical digestion. However, insoluble and soluble ECM components are ordered in a tissue-specific manner which can impart distinct positive or negative biochemical and biomechanical cellular cues. Mechanisms of ECM function include,
5 anchorage to the basement membrane (11, 12), block or facilitate cell migration (e.g. tumor cell dissemination) (13, 14), acting as a signal reservoir for growth factor and cytokines that helps form a concentration gradient and promote timed release mechanisms (15, 16), bind to growth factors and act as a low affinity co-receptor or presenter influencing cell-cell crosstalk (12, 17), action of ECM cleavage products
(i.e. matricryptins) on signaling mechanisms (18, 19), and biomechanical force generation through the focal adhesion complex (20, 21)
Each of these unique properties is related to one another and contributes to the importance of the ECM in development and disease. The ECM is highly dynamic with all of the above mentioned properties occurring simultaneously in space and time. Changes in dynamics can result from genotypic alterations that drive changes in ECM composition and architecture through synthesis and degradation of individual components (14).
Collagen and Elastin
Collagen
History. Collagen is the main structural protein that composes the ECM and is widely considered to be the most abundant protein in the human body. Originally a term coined in France in the 19th century, collagene, was meant to describe the constituent of connective tissues that produced glue. The word was later adapted to its current English form, collagen, in 1865, however; was not officially defined by the
Oxford Dictionary until 1893 – “that constituent of connective tissue that yields gelatin on boiling” (22). However, the uniqueness of the collagen fibril itself had been
6 documented in great detail much earlier by pathologists in the 19th century such as
Henle and Ranvier (23).
During this time, an active controversy was brewing between two schools of thought regarding the origin of collagen fibers. One school believed that collagen fibers developed directly from the cytoplasm of the fibroblast (24-27), and the other that they formed apart from the cell in what was called the “intercellular ground substance” (28-30). In the middle of this controversy, Jean Nageotte proved that acid solubilized collagen could be precipitated into observable fibers which looked similar to those of the intact tissue. This led him to suggest that extracted collagen was a precursor of collagen fibers – providing strong support for the theory of extracellular formation of collagen fibers in vivo (31).
Within the next century, significant progress had been made including the discovery of the existence of the monomeric building block (i.e. tropocollagen) that orders collagen fibrillogenesis (Jerome Gross, 1956) (32). It was soon discovered that the general “collagen molecule” (which we now know was collagen alpha-1 (I)), was composed of three polypeptide chains (2 α1 chains and 1 α2 chain) that assembled into a triple helix with a coiled-coil conformation. The primary sequence for this protein is exceptionally unique – being made for the most part of repeating
Gly-Xaa-Yaa triplets where proline (Pro) is often in the Xaa position, and hydroxy proline (Hyp) in the Yaa position. An extraordinary finding at the time, these modifications were found to be necessary for the maturation of collagen into its fibrillar form that is found in collagen-rich tissues by Ramachandran in 1967 (33).
7
The collagen family. In broader terms, we now know that collagens are comprised of a family of extracellular proteins that collectively make up ~30% of total protein mass (31, 34, 35). In vertebrates, there are now at least 28 collagen types, encoded by more than 40 different genes, which can be categorized into families based on their supramolecular assemblies and other specific features. Specific collagen families include fibrillar collagens, fibril-associated collagens with interrupted triple helices (FACITs), network-forming collagens, transmembrane collagens and some others. The common structural feature that all collagens share is the presence of a triple helix which ranges from 96% of total structure in the case of collagen I to as little as 10% of total structure in the case of collagen XII (35).
Interestingly, we continue to find proteins that contain triple-helical collagenous domains (e.g. C1q, adiponectin, collectins, ficolins, macrophage scavenger receptors, among others), many of which are involved in innate immunity (36).
Fibrillar collagens (type I, II, III, V, XI, XXIV and XXVII) are considered unique among other collagens because they form fibrils with unique repeated banding pattern called D-periodicity, which arises from the ordered staggering of collagen molecules (22, 35). Each molecule is formed from three polypeptide chains named α chains, either in a homotrimeric or a heterotrimeric fashion, depending on the collagen type and some existing variants. Another common characteristic of fibrillar collagens is their propensity to form longer triple helical region with unique Gly-X-Y triplet repeats, flanked by the short non-helical parts called telopeptides. Because type I collagen is the most abundant collagen found in various connective tissues
8 such as skin, bone, tendon and dentin, it is critical in order to maintain the integrity and elasticity of tissues (22, 35).
Collagen biosynthesis. The assembly of collagen molecules into fibrils is an entropy-driven process, similar to other protein self-assembly systems, such as actin filaments. It is believed that these processes are driven by the loss of solvent molecules from the surface of the collagen molecule and results in assemblies with a circular cross-section, minimizing the surface area/volume ratio of the final assembly. An essential feature of collagen fibril formation is the fact that they are synthesized as soluble procollagens in the endoplasmic reticulum (ER). In the rough
ER, procollagens undergo extensive post-translational modifications including the unique hydroxylation of Pro and lysine (Lys) residues (Figure 1.1A), subsequent O- linked glycosylation of specific hydroxylysine (Hyl) residues, and asparagine-linked glycosylation. Proline residues in the Y position of the Gly-X-Y repeat are mostly hydroxylated by prolyl-4-hydroxylase (P4H), while some of those at the X position are hydroxylated by prolyl-3 hydroxylase (P3H) (37). During this process, several proteins act as chaperones during trimerization and folding of the α chains, either by selectively binding to the unfolded procollagen α chains to prevent premature triple helix formation (e.g. P4H, protein disulphide isomerase (PDI), Bip/Grp78, Grp94, and immunophillins), or binding to the completely folded collagen molecule to stabilize the triple helix and possibly prevent premature aggregation. The modified procollagen molecules are then transported through the Golgi network and secreted.
Shortly after secretion, N- and C-terminal propeptides are removed by proteases, leaving short N- and C-terminal telopeptides. Following processing, collagen
9 molecules spontaneously self-assemble in ordered staggered arrays to form a right- handed super-helix whose intramolecular sterics force the center of the helix to be occupied by glycine residues (any amino acid sequence other than the Gly-X-Y repeat disrupts triple helix formation). Adjacent monomers overlap each other by 234 residues, forming the 67 nm D-period repetitive regions of collagen which consist of the unique “hole zone channels” and overlap zones in the collagen fibrils (36, 38).
There are several factors known to influence collagen fibrillogenesis and organization. The levels of post-translational modifications on the collagen molecule itself (e.g. lysine hydroxylation and glycosylation) can alter the type of fiber that forms, with more highly modified collagen molecules being associated with the formation of fibrils of a smaller diameter (39). Additionally, proteoglycans such as small leucine-rich proteoglycans (SLRPs) like decorin (DCN) that contain attachments of glycosaminoglycan (GAG) chains influence collagen fibril outgrowth through interactions with fibronectin (FN), thrombospondin (THBS) and transforming growth factor-beta (TGF ). As such, ECM proteins work together to promote collagen fibril formation through dense networks of protein-protein interactions that act to stabilize the initial aggregation of the collagen and its subsequent outgrowth
(36, 38).
Collagen Crosslinking
After collagen’s spontaneous aggregation into fibrils, it is further stabilized by a final post-translational modification, the formation of intra and intermolecular crosslinks. They are initiated by the generation of aldehydes at specific Lys or Hyl residues in the telopeptide regions of the α chains by the lysyl oxidase (LOX) family
10 of enzymes (Figure 1.1B). LOX is a copper-dependent amine oxidase that initiates the process of covalent intra- and intermolecular crosslinking of collagen by oxidatively deaminating specific Lys and Hyl residues in the telopeptide domains. It includes LOX and LOX-like (LOXL), LOXL1, LOXL2, LOXL3, and LOXL4 proteins.
LOX and LOXL enzymes have been shown to be active for fibrillar collagen while
LOXL2 has been associated with basement membrane type IV collagen (40). The substrate specificity for other isoforms has not been clearly defined (41). It was reported that LOX is highly functional for growing, native collagen fibrils and was suggested that the intermolecular interaction between collagen molecules was important for the enzyme activity (42). Studies have shown that the binding sites for
LOX in type I collagen are in the triple helical region, potentially in the area with highly conserved sequences (Hyl-Gly-His-Arg) where it can catalyze the formation of aldehydes in the telopeptides of the adjacent collagen molecule (43). The oxidative deamination activity of LOX enzymes can be inhibited by the lathyritic agent, ß- aminopropionitrile (BAPN) (44). The resulting Lys aldehydes (Lysald) or Hyl aldehydes (Hylald) are determinants for the tissue-specific crosslinking pathway by their involvement in a series of spontaneous intra- or intermolecular reactions – thus providing the matrices with tensile strength and elasticity which are essential for the functional integrity of the tissue. To that end, mechanical properties of collagen fibers primarily depend on the formation of head to tail Schiff base crosslinks between end-overlapped collagen molecules (44, 45).
Immature bivalent reducible crosslinks. The types of collagen crosslinks that form in normal connective tissues are determined prior to crosslink formation by
11 the optional hydroxylation of specific telopeptide and helical lysine residues on collagen by lysyl hydroxylases (LH1, LH2, and LH3) encoded by distinct procollagen-lysine, 2-oxoglutarate 5-dioxygenase (PLOD) genes (46). In fibrillar collagens, the telopeptides each contain one crosslinking site on the N-terminal telopeptide (Ntx) and one on the C-terminal telopeptide (Ctx) at residues 9 and 16, respectively. In the helical region of processed mature collagen, crosslinking sites are found at restudies 87 and 930 (47). After optional lysine hydroxylation, LOX acts on Lys and Hyl residues to initiate the process of covalent intra- and intermolecular crosslinking of mature collagen, thereby increasing the structural integrity, strength, and stiffness of the ECM. The dual action of LHs and LOXs ultimately form two unique, tissue-specific, crosslinking pathways: 1) Hydroxy lysine aldehyde (Hylald)- derived collagen crosslinks (HLCCs) and 2) Lysine aldehyde (Lysald)-derived collagen crosslinks (LCCs) (44, 45).
Lysyl hydroxylases. Lysine hydroxylation is important to the formation of collagen crosslinks as well as glycosylation (48). Lysyl hydroxylase (LH) is one of the 2-oxoglutarate dioxygenases which catalyzes the hydroxylation reaction of Lys residues in the procollagen α chains and proteins with collagenous sequences, co- and post-translationally. The reaction requires ferrous iron, oxygen, 2-oxoglutarate and ascorbate as cofactors and in turn releases succinate and carbon dioxide (CO2) along with the formation of Hyl (52). In collagens, Hyl residues are present exclusively in the Y position of G-X-Y triplets while in the non-helical region of α chains, Gly is replaced by serine in the N-telopeptide (X-Hyl-Ser) and by alanine in the C-telopeptide (X-Hyl-Ala) of the α chains (49).
12
Hydroxy lysine aldehyde crosslinking pathway. Lysine residues in the telopeptide region of fibrillar collagen are primarily hydroxylated prior to conversion to Hylald by LOX (Figure 1.2). Once LOX acts and the Hylald is formed, it reacts with the ε-amino group of Hyl or Lys residues in the helical region of the adjacent collagen molecule. The Schiff bases formed (dehydro-dihydroxylysinonorleucine
(deH-DHLNL) and dehydro-hydroxylysinonorleucine (deh-HLNL)) then undergo
Amadori rearrangements to form hydroxylysino-keto-norleucine (HLKNL, Hylald ×
Hyl) or lysine-keto-norleucine (LKNL, Hylald × Lys), respectively. These keto-amine forms are more stable crosslinks and likely contribute to the insolubility of fibrillar collagen in mineralized tissues. However, for these immature crosslinks to withstand acid hydrolysis, they have to be stabilized by the reduction process with sodium borohydride (NaBH4), and analyzed in their reduced forms (i.e. dihydroxy lysinonorleucine (DHLNL) and hydroxy lysinonorleucine (HLNL)) (44, 50).
Lysine aldehyde crosslinking pathway. The Lysald in the telopeptides reacts with Hyl or Lys in the helical region forming dehydro-hydroxylysinonorleucine
(deH-HLNL, Lysald × Hyl) or dehydro-lysinonorleucine (deH-LNL, Lysald × Lys)
(Figure 1.3). In addition, a unique reducible tetravalent crosslink, dehydro-histidino- hydroxymerodesmosine (deH-HHMD) can be formed between the aldol condensation product of two Lysald, histidine (His) and a helical Hyl (Lysald × Lysald ×
His × Hyl). Similar to the keto-amine crosslinks, the aldimines are also analyzed in their reduced forms i.e. hydroxylysinonorleucine (HLNL), lysinonorleucine (LNL) and histidino-hydroxymerodesmosine (HHMD) (44, 45, 50).
13
Mature trivalent crosslinks. As we age, the abundance of immature reducible crosslinks declines over time along with collagen solubility, yet paradoxically the strength of connective tissue increases (51, 52). How can this be so? This paradox led to the speculation of the maturation process from immature bivalent crosslinks to mature trivalent crosslinks. In 1977 Fujimoto was the first to isolate and characterize these crosslinks (53). So far only two major forms of the trivalent crosslinks have been identified and characterized (discussed below in detail). Bone is unique example of need for both immature and mature crosslinks.
The reducible crosslinks in bone collagen decreased steeply in content between birth and 25 years, but persist in significant amounts throughout adult life (54). It could be that bone is being constantly remodeled, therefore new collagen is always formed. In addition, it is suggested that the mineralization process may inhibit the maturation of bivalent into trivalent crosslinks (55). In the example of human cartilage, reducible crosslinks have virtually disappeared by 10-15 years of age, being replaced by crosslinked pyridinoline residues, their maturation products (54).
The best characterized and most widely distributed mature crosslinks are the hydroxy lysyl pyridinoline (Pyr) and its deoxy analog, lysyl pyridinoline (dPyr) crosslinks (Figures 1.2 and 1.3) (53, 55, 56). Pyr is widely distributed in the collagens of most vertebrate connective tissues, whereas dPyr, although widely distributed, predominates in bone and dentin (57). Pyr was first discovered in bovine achilles tendon among two Hylald and one Hyl (Hylald × Hylald × Hyl) (53). Later it was discovered that deoxy pyridinoline crosslinks two Hylald and one Lys (Hylald × Hylald ×
Lys) and is formed from the condensation reaction of lysine ketonorleucine (LKNL)
14 with Hylald (56). Their intrinsic fluorescence has allowed them to be characterized previously by traditional HPLC approaches (45, 58).
Pyrrole crosslinks were first identified by Kuyper in 1990 from bovine tendon collagen and it has been shown that decreased levels are found in aged connective tissue (59). Interestingly, pyrrole crosslinks have only been found at the N- telopeptide (Hylald × Lysald × Hyl or Lys) (57, 59).
Elastin
Properties of elastin. Elastin is the major protein component of the elastic fiber and is critical to the structural integrity and function of tissues in which reversible extensibility or deformability are a requirement, such as in major arterial vessels, lungs, and skin. In contrast to collagen, elastin is encoded by a single gene.
Elastin matures in the ECM through the assembly of a soluble precursor molecule
(i.e. tropoelastin) into a highly crosslinked polymer. Of the 37 lysines per 800 residues in tropoelastin, approximately 10 are involved in desmosine and isodesmosine crosslinks, approximately 15 are present in crosslink intermediates, and approximately 5 remain as lysine in mature elastin, with 7 residues unaccounted for (47, 60). In contrast to collagen which primarily self assembles, elastin requires the assistance of helper proteins to align the multiple crosslinking sites on elastin monomers (60). Once the maturation process is complete, crosslinked elastin is among the most insoluble hydrophobic proteins known, with few polar groups. In fact, elastin from higher vertebrates including humans contains over 30% glycine and approximately 75% of the entire sequence is made of just four hydrophobic amino acids (Gly, Val, Ala, Pro) (61). In particular, this property makes elastin
15 among the most stable proteins in the body, able to last the entire lifetime of the organism. Tissues that are rich in elastin include aorta and major vascular vessels
(28-32% dry mass), lung (3-7%), elastic ligaments (50%), tendon (4%), and skin (2-
3%) (61, 62) .
Elastin crosslinking. The highly insoluble and hydrophobic elastin molecule is stabilized by the formation of covalent crosslinks. It is crosslinked by two amino acids unique to elastin that form the centers of tetravalent crosslinked fibers, desmosine and isodesmosine (63). In fact, these were the first discovered of all of the crosslinks in collagen or elastin (64). All known elastin crosslinks are derived from lysine only, as hydroxy lysine is not present in the protein (61). The exact route of formation remains unknown, however; it has been proposed that condensation of allysine aldol with dehydrolysinonorleucine forms dihydrodesmosines, that are oxidized to form desmosine and isodesmosine (Figure 1.4) (47, 60). The entire sequence of pig tropoelastin has been sequenced, 50% by analysis of tryptic peptides which are separated into two groups 1) hydrophobic peptides ranging in size from 17 to 81 residues and 2) polar di, tri, and tetravalent peptides. The polar peptides present in elastin contain mostly Ala and Lys residues in the form of -Ala-
Ala-Lys- and -Ala-Ala-Ala-Lys-, which are the crosslinking sequences of elastin.
Three Lys residues in these sequence motifs are converted to allysine residues and the fourth, distinguished by the sequence Lys-Tyr (in pig elastin), provides the ring nitrogen of the pyridinium crosslink (65). Indeed, much of the biochemistry of elastin was elucidated from pig and bovine samples. However, in bovine elastin phenylalanine replaces the tyrosine C-terminal to the desmosine crosslink in several
16 of the crosslinking sequence motifs and it has been observed that bovine elastin contains less tyrosine residues (6 per 1000) than pig elastin (16 per 1000) (65).
Fibronectin
Fibronectin is a large adhesive glycoprotein that primarily functions by binding membrane spanning receptor proteins called integrins. Through these interactions it plays a role in cell adhesion, migration, growth and differentiation (66, 67).
Fibronectin also interacts directly with ECM components such as collagen, fibrin and other proteoglycans, such as heparan sulfate proteoglycans (HSPGs), among others. Fibronectins are produced from a single gene by a wide variety of epithelial and mesenchymal cells in vitro, including fibroblasts, chondrocytes, myofibroblasts, macrophages, and hepatocytes. It is secreted as a dimer consisting of two similar subunits bonded through a disulfide linkage (68). Each domain of fibronectin is responsible for one of fibronectins many binding functions. Three repeating sequence motifs (I, II, III) are organized into functional domains that contain binding sites for ECM proteins and cell surface receptors (e.g. integrins) (67, 69). Type three repeats account for more than 60% of the sequence and is considered the predominant structural feature of fibronectin (66). However, mRNAs for fibronectin have been shown to give rise to multiple versions of the protein through alternative
RNA spicing that occurs predominately at three sites 1) extra type III domain A (EDA or EIIIA), 2) extra type III domain B (EDB or EIIIB), 3) the connecting segment between the fourteenth and fifteenth type III repeat (IIICS) (70, 71). Five variants are produced from splicing in the IIICS segment. Splicing at these three segments can result in over twenty different fibronectin subunits (72). In addition, several malignant
17 cell lines in three-dimensional, laminin-rich ECM have been shown to preferentially upregulate protein of the fibronectin splice variant, EDA+, compared with nonmalignant cells (73). Despite this knowledge, the biological functions of fibronectin isoforms remains poorly understood.
Fibronectin and its integrin receptors have been shown to play an integral role in the progression of metastatic disease (74). Fibronectin exhibits at least two independent cell adhesion regions with different integrin receptor specificities. As cancer cells are less adhesive (i.e. more invasive) than normal cells, these processes are implicated in tumor progression and are important for tumor cell migration, invasion and metastasis. This interaction may also play a role in chemotaxis and control of proliferative pathways.
Proteoglycans
In contrast to the predominantly fibrillar structure of collagens, proteoglycans form the basis of higher order ECM structures around cells and are composed of genetically distinct families of multidomain proteins that have one or more covalently attached glycosyl amino glycan (GAG) chains (75). GAGs are long, negatively charged, linear chains of disaccharide repeats. At least 25 gene products with at least one GAG modification have been identified with many structural variants. Of note, there are no structural domains common to all proteoglycans. The primary biological function of proteoglycans derives from the biochemical and hydrodynamic characteristics of the GAG components, which bind water to provide hydration and compressive resistance. As such, proteoglycans are highly abundant in compressible ECM tissues like cartilage. Names of proteoglycan classes of proteins
18 are based on the type of GAG chain that is attached, as well as the distribution and the density of these chains along the core protein. This distinguishing characteristic allows proteoglycans to be grouped into several broad categories: heparin sulphate proteoglycans (HSPGs), chondroitin sulphate proteoglycans (CSPGs), small leucine rich proteoglycans (SLRPs), hyaluronan and keratin sulphate, each having a unique function in the ECM (76, 77).
In addition to being dominant components of the ECM, proteoglycans can also function as accessory proteins in tissues rich in other matrix proteins. For example, the SLRPs are a family of proteoglycans that have been implicated in fibrillar collagen assembly and includes well-known members such as decorin, biglycan and lumican, which act to stabilize collagen through association with the mature fibril. SLRPs have also been show to participate in cell – ECM signaling with binding sites for cytokines and growth factors being recently discovered (78).
Laminin and Basement Membranes
Laminin and other basement membrane components (i.e. collagen IV) are primarily found in the basal lamina and mesenchymal compartments and function to bridge the gap between structural ECM molecules and reinforce the network which provides support for cells and soluble molecules within the matrix (79). These large glycoproteins are primarily composed of laminin-type epidermal growth factor (EGF)- like repeats and alpha helical domains. Laminins mediate cell interactions with other
ECM components through cell surface receptors (e.g. integrins) and consist of α, , and chains that combine via a triple-helical coiled-coil domain at the center of each chain to form a cruciform shape (80). Perhaps the most famous laminin isoform is
19 laminin 111 which is composed of single α, and chains that associate to form a cruciform shape. Dr. Mina Bissell’s work over the last γ0 years has provided extensive insight into how laminins mediate tissue-specific gene expression in the mammary epithelium and can enhance functional differentiation.
Collagen-IV is also a major component of the basement membrane, associates with laminin directly, and is integral to the formation of normal physiologic collagen networks (81, 82). Collagen-IV assembles to form tetramers that are stabilized by a covalent Met-Lys crosslink (S-hydroxylysino-methionine) between
Met93 and Hyl211 (83). Loss of an intact basement membrane is a pre-requisite for tumor cell invasion and metastasis and it has been reported via proteomic studies in our own lab that collagen-IV abundance changes drastically during malignant transformation (84).
Matricellular Proteins
Matricellular proteins are a diverse group of proteins that modulate cell function by interacting with cell-surface receptors, proteases, and hormones, among others (85). Matricellular proteins are secreted into the ECM but do not actually have a structural role (86). Instead, distinguishing characteristics of these groups of proteins include 1) increased expression during development and wound healing, 2) binding to many cell – surface receptors, ECM components, growth factors and cytokines and proteases, 3) roles in de-adhesion or counter-adhesion in contrast to the adhesive nature of most structural ECM proteins, 4) a subtle phenotype that is observed in mice with a targeted disruption or some matricellular protein genes (86,
20
87). Their role in these processes is very contextual and dependent upon individual properties of a tissue-specific matrix.
Initially, only three members comprised the matricellular group of proteins –
SPARC, TSP-1, and TNC – mainly grouped as secreted proteins that modulated cell-ECM interactions (88-90). This narrow classification has been expanded to include additional SPARC, TSP (TSPs 1-4, COMP) and tenascin (TN-C, R, W, X, Y) family members with new proteins being introduced in recent years including osteopontin, CCN family of proteins, periostin, R-spondins, short fibulins, galectins,
SLRPs, PEDF, and Plasminogen activator inhibitor-1 (91). Although new and old members are considered to be structurally diverse, most contain repeats of common
ECM structural motifs such as thrombospondin type 1 repeats (TSRs), fibronectin type-III repeats, EGF-like repeats, and are able to bind calcium (91).
Much attention has been given to the role of tenascins and thrombospondins during tumor progression and fibrosis. It has been suggested that these help to regulate formation of provisional matrix during solid tumor progression that is characterized by a wound healing response (92). Indeed, our group and others have observed drastic changes in the expression of tenascins (i.e. TNC), fibrinogens and thrombospondins that support a de-adhesive and pro-migratory cellular phenotype
(93). Along these same lines, a myriad of studies have also implicated matricellular protein expression to the increased production of fibrillar collagen and the formation of fibrosis, a hallmark of many solid tumors (94). Through a diverse set of interactions with soluble growth factors, cytokines and ECM molecules themselves,
21 these proteins can either promote or suppress cancer progression in a tissue- specific manner.
The ECM and Cancer
The Provisional ECM and the Wound Healing Response
Repair of tissue after injury depends on the synthesis and deposition of a fibrous extracellular matrix to replace lost or damaged tissue (92). The architecture of the newly synthesized ECM is remodeled over time to emulate a normal tissue matrix. The ECM plays an integral role in the repair process through regulation of the behavior of the wide variety of cell types that are mobilized to the damaged area in order to rebuild the tissue (90, 95). Acute inflammation, re-epithelialization, and contraction all depend on cell–ECM interactions and help to minimize infection and promote rapid wound closure (96). Matricellular proteins are up-regulated during wound healing where they modulate interactions between cells and the extracellular matrix to exert control over events that are essential for efficient tissue repair (86).
Many solid cancers are characterized by a perpetual wound healing response that contributes to fibrotic disease. Aberrant wound healing processes act to stiffen a tissue, offsetting biomechanically sensitive signaling cascades involved in cell proliferation (87). The role of this process in tumor progression is only now beginning to be appreciated. Further exploration of the interplay between these mechanisms may better identify how proteins implicated in these processes may be exploited for therapeutic intervention.
22
Biomechanical Signaling in the Tumor Microenvironment
In vivo studies have demonstrated how aberrant mechanical properties in cancerous tissue contributes to tumor aggression and reduced treatment efficiency
(94, 97-99). Studies have shown that stromal cells regulate the mechanical properties of the ECM through associations with numerous growth factors that act as a signaling reservoir to modulate cellular behavior (17, 100). For example, transforming growth factor 1 (TGF 1) is a potent primary activator of quiescent fibroblasts to form highly contractile myofibroblasts involved in tissue fibrosis. Under fibrotic conditions where the ECM is stiff, TGF 1 is more highly retained in the matrix and promotes TGF induced epithelial to mesenchymal transition (EMT) and induces a basal like‐ tumor‐ cell phenotype that promotes invasion and metastasis
(95, 101). As another‐ example, several KRAS-driven genetically engineered mouse models of pancreatic cancer exhibit both the loss of TGF- signaling and elevated