The persistence of haploinsufficiency and its role in genome evolution
by
Summer Ashlee Morrill
B.S. Biology Tufts University, 2015
Submitted to the Department of Biology in Partial Fulfillment of the requirements for the Degree of
Doctor of Philosophy in Biology
at the
MASSACHUSETTS INSTITUTE OF TECHNOLOGY
September 2020
©2020 Summer A. Morrill. All rights reserved.
The author hereby grants to MIT permission to reproduce and to distribute publicly paper and electronic copies of this thesis document in whole or in part in any medium now known or hereafter created.
Signature of Author: ______Department of Biology July 31st, 2020
Certified by: ______Angelika Amon Kathleen and Curtis Marble Professor of Cancer Research Investigator, Howard Hughes Medical Institute Thesis Supervisor
Accepted by: ______Amy Keating Professor of Biology and Biological Engineering Co-Director, Biology Graduate Committee
2
The persistence of haploinsufficiency and its role in genome evolution
by
Summer Ashlee Morrill
Submitted to the Department of Biology On July 31st, 2020 in Partial Fulfillment of the Requirements for Degree of Doctor of Philosophy in Biology Abstract
In diploid organisms there are two copies of every gene, one from each parent. While the majority of genes are robust to deletion of one of the two copies, a subset of genes remains highly dosage sensitive, causing a significant decrease in fitness when heterozygously deleted. These genes, known as haploinsufficient (HI) genes, are present in eukaryotic species from yeast to humans. Why haploinsufficiency persists over evolutionary time is not known. To answer this, I systematically tested two existing models of haploinsufficiency: 1) the dosage stabilizing hypothesis, which states that haploinsufficiency is caused by imbalances among protein complex members, and 2) the insufficient amounts hypothesis, which says that haploinsufficient gene products are limiting for growth. In this thesis I find that having a single extra copy of haploinsufficient genes was sufficient to cause a growth defect in Saccharomyces cerevisiae. This showed that HI genes are sensitive to both over- and under-expression. Although having an extra copy of HI genes resulted in heightened sensitivity to proteotoxic stress agents, proteotoxicity could not wholly explain the fitness defect that occurred when HI genes were heterozygously deleted. Haploinsufficiency phenotypes were still present even when all members of a complex were deleted at the once, restoring protein balance but not expression levels. In creating a new dosage sensitivity dataset by pooled fitness competition, I found that genes sensitive to increased copy number and HI genes are not mutually defined. All together, these data suggested that HI genes are unique among dosage sensitive genes, and that HI genes must also be rate-limiting for maximal growth. Many HI genes showed strong evidence of growth-limiting phenotypes, including ribosomal genes, and genes involved in protein folding. I propose a “dosage-stabilizing” model for haploinsufficiency, which states that HI genes are unable to increase or decrease their expression without fitness penalty. This is due to both the growth-limiting nature of HI genes, and to the proteotoxicity of dosage imbalance. From these two selective pressures, HI genes have very narrow ranges of expression – unable to modulate expression over time. This has caused haploinsufficiency to persist throughout the evolution of the eukaryotic genome.
Thesis supervisor: Angelika Amon Title: Kathleen and Curtis Marble Professor of Cancer Research
3
For my parents – who taught me to ask questions, find my own answers, and never be afraid to try a different path.
“Curiosity is an essential part of the way human beings learn, and it always has been. In order to learn something, we must first wonder about it.”
― Joshua R. Eyler, How Humans Learn
4
Acknowledgements
Foremost, I would like to thank my thesis advisor, Angelika, for her enthusiasm, support, and seemingly never-ending source of ideas. You are a wonderful scientific role model and an incredibly thoughtful mentor. Thank you for talking me through failed experiments with kindness and compassion, and always celebrating my accomplishments with a hug.
Thank you to my amazing lab mates, who helped to make lab a home away from home. I will miss seeing you every day! I am grateful to have had such smart, kind, and silly people by my side through the ups and downs of graduate school. I especially need to thank Chris for being the best first bay mate, showing me the ropes, and tolerating all of my random questions. Thank you to Ian for being my early morning breakfast buddy. And of course, a huge thank you to my lab ladies – Juliet, Allegra, Cassie, Becca, Wendy, Teresa, and Franny (honorary Amon Lab lady) – for your encouragement, excellent cooking/baking skills, and deep love of Netflix romcoms. I am so thankful for all of our adventures outside of lab that kept me sane, especially pink wine nights, old lady nights, reality TV Fridays, and pizza making parties. I look forward to celebrating all of the Amon Lab successes in the years to come.
Thank you to my thesis committee members, Steve Bell and Gene-Wei Li, for providing me with a lot of help and advice on this long journey, especially as I explored new scientific areas. Thank you to Mike Springer for generously serving as my outside committee member.
Thank you to my undergraduate mentor, Steve Fuchs, who somehow convinced me that graduate school was the right decision (he was right!), and to my undergraduate advisor, Susan Koegel, who encouraged me to apply to MIT and fueled my passion for teaching in biology.
Thank you to my fellow BioREFS for making MIT Biology such a wonderful community. You are such a huge asset to our program – I could always count on you for cookies, and to point me in the right direction through any issue.
Thank you to the MIT Teaching and Learning Lab for continuing to foster my love for teaching, and for helping me to develop my educational philosophy. Thank you to Iain Cheeseman and Anu Seshan for being such wonderful mentors and role models in teaching, and for helping me to explore a career path in education.
Thank you to the real actual doctor in my life who beat me to the degree (but still has lots of school left), Catherine Coughlin. I can’t thank you enough for your endless love and pep talks.
Finally, and most importantly, thank you to my husband, Alex, and my parents, Tami and Jeff, who have been my constant source of motivation. You have been my greatest cheerleaders, and I am so happy to get to share in this accomplishment with you – I’m sure we will shed many happy tears now that the moment is here. None of us really knew what I was getting myself into, but I’m finally done with school and I couldn’t have done it without you!
5
6
Table of Contents
Abstract 3 Acknowledgements 4 Table of Contents 7
Chapter 1: Introduction 9
The diversity of genomes 10
Genome imbalance 10 Copy number variation 11 The origin of CNVs 13 Replicative mechanisms 14 Non-replicative mechanisms 15 Functional deletions 17 Chromosome missegregation 18 Consequences of genomic imbalance on fitness 22 Gene expression and gene dosage 23 Global cellular consequences 24 Gene-specific cellular consequences 27 Organismal consequences 28
Haploinsufficiency 29 Defining haploinsufficiency in model organisms 29 Conditional and morphological phenotypes 33 Computational predictions of haploinsufficiency 35 Early theories of dominance 36 The cause of haploinsufficiency: current hypotheses 37 Evolutionary conservation of haploinsufficient genes 40
Haploinsufficiency and human disease 42 Human case studies 43 Ribosomopathies 43 Tumor suppressors 44 Implications for clinical treatment 45 Gene therapies 46 Drug screening 47
Concluding Remarks 47
References 49
7
Chapter 2: Why haploinsufficiency persists 55
Significance Statement 56
Abstract 56
Introduction 57
Results 59 Haploinsufficient genes are sensitive to increased copy number 59 Why are haploinsufficient genes toxic upon dosage increase? 66 Most dosage sensitive genes are not haploinsufficient 67 Dosage imbalance does not fully explain haploinsufficiency 73 Haploinsufficient genes are rate limiting for cellular fitness 78 Haploinsufficient genes have a narrow expression range 79
Discussion 81
Materials and Methods 83
Strain Tables 91
Acknowledgements 93
References 93
Chapter 3: Conclusions and Future Directions 97
Summary of key conclusions 98
Adaptations to haploinsufficiency 101 Duplication of haploinsufficient genes 101 Haploinsufficiency and karyotype maintenance 107
Concluding remarks 111
References 112
8
Chapter 1: Introduction
9
The diversity of genomes
Eukaryotic genomes range in size from 10 million base pairs in molds and unicellular fungi, to
>100 billion base pairs in some flowering plants (1). This 10,000-fold difference reflects large changes to genome content, regulation, and complexity. Yet, all organisms are tasked with the same objective: to faithfully maintain and transmit genetic material throughout every division, and into every new generation. How do organisms achieve this objective when there is such variability in genome size? How often do organisms get it wrong, and do these mistakes matter?
In particular, does the number of copies of each gene or each chromosome impact organismal fitness?
Genome imbalance
Every genome, big or small, is characterized by a balanced karyotype, meaning that all chromosomes are inherited proportionally with respect to each other. If two copies of chromosome 1 are present, then one would expect to find two copies of chromosome 2, chromosome 3, and so on, excluding the sex chromosomes. From the perspective of protein complex stoichiometry, this makes sense. Different chromosomes encode different members of shared protein complexes and cellular pathways, and the expression of these proteins must remain coordinated to carry out their functions. Necessarily, gene regulation must also be a part of maintaining the balance of expression. To disrupt this balance, either at the level of a single gene or a whole chromosome, is to disrupt the proportionality of the cell. Such disruption can have severe consequences for the cell, leading to decreased cellular proliferation and fitness,
10
development of disease, or cell death. Characterizing the prevalence of genome imbalance, and studying the mechanisms for how such changes occur, has helped us to better understand the critical role that balance plays in shaping genomes.
Copy number variation
A copy number variant (CNV) is any change to the number of copies of genetic information carried by a cell or organism. Such changes can happen during DNA replication,
DNA repair, or chromosome segregation, and may result in both gains and losses of genetic information. CNVs range in size from a few base pairs to millions of base pairs, and a single alteration event may span one or many genes. For the purpose of this thesis, I will also consider gene-inactivating mutational events to be CNVs, as they functionally reduce the copy number of the gene. Using this broad definition of a CNV, we can identify three main sources of copy number alteration: changes to chromosome structure, disruption of gene function, or altered karyotype. As a result, chromosomes may contain amplifications and deletions, genes may be inactivated, or whole chromosomes may be gained and lost (Fig. 1).
The detection of CNVs is achieved primarily through microarray and next-generation sequencing technologies. CNVs may be found in all cells of an organism if the mutation is inherited through the germline, or may vary from cell to cell. Although analysis of bulk DNA collected from tissues provides robust detection of germline CNVs across individuals, recent advances in single-cell sequencing technologies have also allowed the detection of somatic cell
CNVs within particular cells of individuals (2). Single cells collected from human tissues harbor megabase-scale non-clonal CNVs at low frequencies in normal, healthy tissues, across multiple
11
tissue types (2). Humans are not unique in this. All branches of life, from bacteria to humans, carry CNVs that must be accommodated by cells and organisms throughout development. It is therefore crucial to understand both how CNVs occur, and what impact they have on the cells and organisms that harbor them.
A genome A B C D E Changes to chromosome structure amplification A B B C D E
deletionA A C D E genome A B C D E B Disruption of gene function inactivatingamplification A B B C D E mutation A B C D E deletion inversion A A C B C C D D E E
C inversion ChangesA toC karyotypeB D E
B
euploid cell aneuploid cell
Fig. 1: Types of copy number variants (CNVs).
12
The chromosomal DNA shown is split into five segments A,B,C,D,E for visual reference; these segments could contain one or multiple genes, ranging from kilobases to megabases in size.
Copy number alterations focus on chromosome segment B.
(A) Changes to chromosomal structure such as the amplification or deletion of region B, increase or decrease the copy number of genes within B, respectively.
(B) Disruption of gene function is a mechanism for decreasing copy number. This could occur through a gene-inactivating mutation, or an inversion event which disrupts the function of genes lying at the boundaries of the inversion.
(C)Whole-chromosome copy number alterations lead to gain or loss of chromosomes. Cells with a normal karyotype are referred to as euploid; those which have an abnormal number of chromosomes (not a multiple of the haploid number) are considered aneuploid. Here, the aneuploid cell has gained a copy of the orange chromosome.
The origin of CNVs
Any event which leads to DNA breakage, misalignment, deleterious mutation, or the misappropriation of DNA from one cell into the next can lead to the formation of CNVs. Such events can result from the repair of exogenous attacks on the genome, like the DNA damage that occurs with the sun’s intense radiation, or through problems encountered during DNA replication and mitosis. The type of mutational event depends on which cellular process is affected, and where in the genome it occurs. For example, sub-chromosomal rearrangements are particularly likely to occur in repetitive areas of the genome, where templates for replication and repair can misalign, whereas whole chromosomal gains and losses are only likely to occur during cell
13
division. Here we will explore five potential mechanisms of generating CNVs: replicative, non- replicative, functional deletion, chromosome missegregation, and whole genome duplication.
Replicative mechanisms
When normal DNA replication becomes blocked, attempts to continue DNA synthesis can lead to expansion or contraction of DNA at the site of replication fork stalling. Replication may become blocked as a result of DNA damage or the depletion of nucleotide pools. There are also many intrinsic mechanisms of fork stalling; DNA-binding proteins, transcriptional machinery, and unusual DNA secondary structures can all create barriers to fork progression that lead to blocked or stalled replication (3). For DNA synthesis to continue and the cell cycle to be completed, the cell must make a choice to either synthesize through the lesion that caused the stall (if possible) or to find another template. To synthesize across a stalled region requires a specialized DNA polymerase known as a translesion polymerase, which is capable of bypassing
DNA lesions because of the increased size of its active site. Employing translesion synthesis can lead to a contraction event (Fig. 2B), deleting the affected region of the genome. Alternatively, replication forks may “skip over” the affected region until an alternative template can be found, as in template switching. Template switching is thought to resolve stalled forks in vivo through recombination-mediated progression (Fig. 2C), where partially replicated sister strands serve as templates for repair at the site of stalling (4). Template switching can result in duplication, deletion, inversion, or translocation depending on how the template used for homology is aligned, and how the new replication fork is oriented (5).
14
Non-replicative mechanisms
Cells can encounter a broad range of events that damage DNA throughout their lifetime.
This can occur through chemical or physical means, such as through contact with carcinogenic agents or irradiation, or through the natural accumulation of oxidative stress in proliferating cells. DNA damage which results in a double-stranded break of the DNA helix is most relevant to the creation of CNVs as it requires either a rough end-joining of the broken DNA ends, or the process of seeking out a homologous template for repair. Here, we will specifically consider what happens to free DNA ends in the absence of active DNA replication. Ideally, DNA double- stranded breaks are repaired through homologous recombination with a properly aligned template, such that all genetic information is retained. In this case, no copy number variants are generated. If, however, a homologous template for repair is found but misaligned, the subsequent repair event can lead to duplication or deletion of the affected chromosomal region, increasing or decreasing its copy number in the genome, respectively (5). In the case that there is no suitable homologous template, the break must be repaired by removing regions of the surrounding DNA and ligating them back together in the process of non-homologous end-joining (NHEJ). NHEJ results in deletions of regions of the genome surrounding the DNA break, leading to a loss of genetic information.
Non-replicative duplications and deletions can also occur in the absence of exogenous
DNA damage, as occurs during unequal crossover events in meiosis. In meiosis, double-stranded breaks are induced endogenously and require homology to resolve. Misalignments during crossing over result in alterations to chromosome structure (Fig. 2A).
15
A Unequal crossover B Translesion Synthesis homologous regions A B C D E A B C D E
duplication A B B C D E deletion A C D E
Fig. 2: Potential mechanisms leading to copy number alteration.
(A) Unequal crossover of chromosome arms between repetitive regions of the chromosome can lead to gene duplication or deletion events, depending on how repeats are aligned. Such misalignments can occur during meiotic crossover as depicted here, or during recombinational repair events following DNA damage during the mitotic cell cycle.
(B) Copy number changes can occur when DNA replication is blocked. In some cases, this can be due to the formation of secondary structure on the lagging strand template, which is single- stranded. If the fork continues to replicate over the extruded DNA, as with a translesion polymerase, this results in a deletion of the affected region. Such an event is often referred to as replication slippage.
16
(C) Exposed single-stranded DNA at a stalled replication fork can seek out a new template to copy from. The 3’ end is freed from its blocked template on the lagging strand, seeking out other regions of homology in the genome from which to replicate. This mechanism of replication is known as template switching, and can lead to a variety of copy number changes depending on the source and orientation of the new template.
Figures adapted from Hastings et. al. (2009) (5).
Functional deletions
So far, the mechanisms we have focused on involve direct rearrangements of chromosomal structure. Some mutations which affect copy number, however, can involve gene inactivation, rather than whole-gene deletions. Ranging from point mutations to single nucleotide insertions and deletions, these mutations are relatively small in scale, but have a large effect on the gene product. A missense point mutation which changes a critical portion of the protein’s structure could prevent it from binding with essential binding partners, or disrupt the active site of the protein. Nonsense mutations introduce stop codons, truncating the protein products, often making it non-functional or leading to the degradation of its mRNA or protein. Small insertions or deletions of just a few nucleotides can cause a frameshift in reading out the genetic code of a protein, leading to protein that no longer takes on the same shape or function. Without a working copy of the protein being produced the gene copy number is effectively reduced by half, functionally deleting one of two copies of the gene. Notably, such functional deletions may also result from large-scale chromosomal rearrangements, such as inversions and translocations, when they disrupt the coding sequence of genes at the boundaries of the rearrangement.
17
Chromosome missegregation
Whole chromosome changes to DNA copy number must occur through errors in mitosis or meiosis. Such errors can result from genetic perturbation to the mitotic machinery itself, or to the surveillance mechanisms that monitor chromosome attachments and tension. For example, mutations in kinetochore components that attach chromosomes to the spindle (Fig. 3C), or cohesion proteins that hold replicated sister chromatids together (Fig. 3B), can result in widespread chromosome missegregation events. Chromosome missegregation can also occur if the structure or stability of the mitotic spindle is disrupted. In particular, correct spindle formation depends on the presence of two centrosomes (or spindle pole bodies in yeast), one at each pole. If more than two centrosomes are present, a multipolar spindle develops, leading to the production of highly aneuploid cells (Fig. 3D). Mitotic spindle formation also depends on cells having kinetically-dynamic microtubules. Spindle dynamics are potently altered by cold temperatures, and by microtubule-stabilizing or -destabilizing agents, leaving cells unable to form a stable mitotic spindle and segregate chromosomes. Finally, the spindle assembly checkpoint (SAC) is in place to monitor all chromosome attachments to the spindle. If chromosomes are attached properly, they experience equal tension on sister chromatids at the metaphase plate, originating from the microtubules that attach them to the spindle poles on either side of the cell. If the forces pulling on each side of the chromosome are not balanced, the SAC arrest cells until proper chromosome attachments and tension can be restored, so that chromosomes are divided evenly between daughter cells. As such, perturbation of the SAC can lead to cells being unaware of unattached or improperly attached chromosomes in mitosis or meiosis, leading to chromosome missegregation (Fig. 3A). Chromosome missegregation events occur at a frequency of 10-4 or 10-5, meaning that chromosome copy number alterations occur
18
frequently even in otherwise wild-type populations. If we compare this rate of mutation to the number of cells in the human body, which is approximately 1013, this means that ~108-109 cells have potentially undergone copy number changes throughout the course of human development.
Fig. 3: Mechanisms of chromosomal gain and loss.
(A) Inactivation of the spindle assembly checkpoint (SAC) causes cells to enter anaphase without checking that all chromosomes were properly attached and oriented.
(B) Premature loss of sister chromatid cohesion results in the release of sister chromatid pairs prior to anaphase. This leads to missegregation of the affected chromosomes.
(C) Incorrect attachments of spindle microtubules to the kinetochores, either resulting in too few or too many attachments, can lead to improper segregation of the aberrantly attached chromosome. Here, a merotelic attachment is shown, where a single kinetochore becomes attached to microtubules emanating from both spindle poles.
19
(D) Microtubule organizing centers, such as centrosomes, dictate the axis of spindle formation.
Having the wrong number of centrosomes leads to formation of a multipolar spindle instead of a bipolar spindle, and thus results in improper segregation of chromosomes to the daughter cells.
Figure adapted from Siegel & Amon (2012) (6).
Whole genome duplication
Several times throughout eukaryotic evolution, genomes have seen increases in ploidy.
These increases in ploidy are typically unstable, and are followed by widespread gene copy loss, generating a large number of CNVs at once (7). One such shift occurred in the Saccharomyces clade of budding yeast approximately 100 million years ago (Fig. 4A), leading to the rapid accumulation of CNVs (8). Approximately 20% of all genes that were duplicated during the whole genome duplication event are still maintained as paralogs in the modern descendants of the ancestral strain, including the model organism Saccharomyces cerevisiae (Fig. 4B) (9).
Paralogs formed in this way are also called ohnologs, to honor Susumu Ohno who proposed that whole genome duplication was a potential mechanism for driving evolutionary novelty through copy number variation (10). Because the copy number of so many genes is affected at once, whole genome duplication and subsequent gene loss represent a powerful and relatively swift mechanism for generating widespread CNVs.
20
A ~108 years
B
Fig. 4: Gene dosage imbalance following whole genome duplication in Saccharomyces cerevisiae.
(A) An abbreviated phylogenetic history of the Saccharomycetaceae family of yeast, depicting the whole genome duplication event in the lineage of budding yeasts.
21
Figure modified from Byrne et. al. (2005) (9).
(B) Schematic showing how duplication of the ancestral yeast genome led to massive genomic rearrangement and gene loss. Here, the ancestral genome is shown in pink, with individual ORFs numbered for reference. Immediately following the whole genome duplication (WGD), two copies of each ORF were present, one shown in purple and the other in orange. As large-scale rearrangements occurred following the whole genome duplication, causing widespread loss of gene copies, some duplicated genes were maintained over evolutionary time as paralogs
(highlighted in the bottom panel). These paralogs have altered copy number compared to the reference genome, and are interleaved throughout the modern yeast genome. Saccharomyces cerevisiae is considered a post-WGD species of budding yeast, maintaining these paralogs.
Figure adapted from Kellis et. al. (2004) (7).
Consequences of genomic imbalance on fitness
Changes to DNA copy number, whether at the level of whole chromosomes or single genes, have the potential to negatively affect the fitness of organisms. From a theoretical standpoint, duplications and chromosome gains lead to increased gene expression, placing burden on transcriptional, translational, and protein homeostasis machinery. Deletions and chromosome loss events lead to the depletion of key cellular components, and can also lead to protein imbalances that must be dealt with by the cell. This section will discuss how genome balance, and its effect on gene expression, impacts cellular and organismal fitness.
In general, the fitness defect of an organism which has acquired a CNV is proportional to its degree of genome imbalance (11). For example, a cell which gains a whole chromosome is
22
likely to show a greater fitness defect on average than one with a sub-chromosomal gain.
However, we also know that changes in the expression of individual genes can contribute significantly to the fitness defect of organisms, and that the strength of the fitness defect and the range of expression across which it occurs varies widely among genes (12). It is therefore necessary to consider both the global and the gene-specific effects of copy number alterations when describing the consequences of genome imbalance on cellular fitness. Additionally, among multicellular organisms, one must also consider whether the CNV is present in one cell or many cells of the organism, and what consequence this has on organismal fitness.
Gene expression and gene dosage
To understand the consequence of genomic imbalance on fitness, we must first know what happens to the functionality of DNA regions which are affected. DNA copy number changes have been shown to cause corresponding changes to gene expression, both at the RNA and protein level (Fig. 5) (6, 13). Duplications result in increased expression of genes within the affected region, and deletions cause decreased expression. Although it is thought that some genes possess mechanisms to correct for these changes in dosage, the majority of genes are not dosage- compensated fully, or their expression does not return to wild-type levels (13–15). A notable exception to this is genes which are contained on sex chromosomes, which must be dosage- compensated to keep expression consistent between the biological sexes. In general though,
DNA copy number changes are proportionally relayed into their RNA and protein products, carrying the potential to affect the cellular processes in which they play a role.
23
Fig. 5: Changes to DNA copy number lead to corresponding changes in gene expression.
A gain of chromosome V in yeast, as shown by microarray DNA content analysis (top), is accompanied by a 2-fold increase in expression of genes encoded on chromosome V (log2 ratio over wild-type = 1). Microarray gene expression data are used to measure RNA expression
(middle), and SILAC protein profiling is used to measure protein levels (bottom).
Data from Torres et. al. (2007, 2010a) (11, 16); Figure from Siegel & Amon (2012) (6).
Global cellular consequences
Large scale dosage imbalances are known to have a number of severe physiological consequences which affect the fitness and proliferation of cells. This has been studied extensively within the context of aneuploidy, where whole chromosomes are gained or lost.
There are several stresses found to be shared amongst aneuploid strains in eukaryotes, regardless
24
of the identity of the chromosome(s) gained or lost. Shared cellular responses among aneuploid cells include proteotoxicity, the induction of a generalized stress-associated transcriptional program, and genome instability, among others (17).
Cells containing copy number alterations experience proteotoxic stress due to the increased burden placed on protein homeostasis machinery (18). The burden on protein homeostasis triggered by CNVs can originate in two ways: First, copy number gains can increase the amount of protein being produced, overburdening chaperones which aid in protein folding.
Second, both gains and losses create stoichiometric imbalances among protein complex members, and these uncomplexed protein subunits tend to misfold. Therefore, depending on the amount of uncomplexed subunits and the level at which the proteins are expressed, a potentially large quantity of misfolded protein must be remedied by the cell, either through protein degradation (19), aggregation (20), or autophagy (21) (Fig. 6).