bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
1 APEX2 proximity proteomics resolves flagellum subdomains and identifies flagellum tip-specific
2 proteins in Trypanosoma brucei
3
4 Daniel E. Vélez-Ramírez,a,b Michelle M. Shimogawa,a Sunayan Ray,a* Andrew Lopez,a Shima
5 Rayatpisheh,c* Gerasimos Langousis,a* Marcus Gallagher-Jones,d Samuel Dean,e James A. Wohlschlegel,c
6 Kent L. Hill,a,f,g#
7
8 aDepartment of Microbiology, Immunology, and Molecular Genetics, University of California Los
9 Angeles, Los Angeles, California, USA
10 bPosgrado en Ciencias Biológicas, Universidad Nacional Autónoma de México, Coyoacán, Ciudad de
11 México, México
12 cDepartment of Biological Chemistry, University of California Los Angeles, Los Angeles, California, USA
13 dDepartment of Chemistry and Biochemistry, UCLA-DOE Institute for Genomics and Proteomics, Los
14 Angeles, California, USA
15 eWarwick Medical School, University of Warwick, Coventry, England, United Kingdom
16 fMolecular Biology Institute, University of California Los Angeles, Los Angeles, California, USA
17 gCalifornia NanoSystems Institute, University of California Los Angeles, Los Angeles, California, USA
18
19 Running title: APEX2 proteomics in Trypanosoma brucei
20
21 #Address correspondence to Kent L. Hill, [email protected].
Page 1 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
22 *Present address: Sunayan Ray: Invivoscribe, San Diego, California, USA. Shima Rayatpisheh: Genomics
23 Institute of the Novartis Research Foundation, San Diego, California, USA. Gerasimos Langousis:
24 Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland.
Page 2 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
25 ABSTRACT
26 Trypanosoma brucei is the protozoan parasite responsible for sleeping sickness, a lethal vector-borne
27 disease. T. brucei has a single flagellum that plays critical roles in parasite biology, transmission and
28 pathogenesis. An emerging concept in flagellum biology is that the organelle is organized into
29 subdomains, each having specialized composition and function. Overall flagellum proteome has been
30 well-studied, but a critical gap in knowledge is the protein composition of individual flagellum
31 subdomains. We have therefore used APEX-based proximity proteomics to examine protein composition
32 of T. brucei flagellum subdomains. To assess effectiveness of APEX-based proximity labeling, we fused
33 APEX2 to the DRC1 subunit of the nexin-dynein regulatory complex, an axonemal complex distributed
34 along the flagellum. We found that DRC1-APEX2 directs flagellum-specific biotinylation and purification
35 of biotinylated proteins yields a DRC1 “proximity proteome” showing good overlap with proteomes
36 obtained from purified axonemes. We next employed APEX2 fused to a flagellar membrane protein that
37 is restricted to the flagellum tip, adenylate cyclase 1 (AC1), or a flagellar membrane protein that is
38 excluded from the flagellum tip, FS179. Principal component analysis demonstrated the pools of
39 biotinylated proteins in AC1-APEX2 and FS179-APEX2 samples are distinguished from each other.
40 Comparing proteins in these two pools allowed us to identify an AC1 proximity proteome that is
41 enriched for flagellum tip proteins and includes several proteins involved in signal transduction. Our
42 combined results demonstrate that APEX2-based proximity proteomics is effective in T. brucei and can
43 be used to resolve proteome composition of flagellum subdomains that cannot themselves be readily
44 purified.
45 IMPORTANCE
46 Sleeping sickness is a neglected tropical disease, caused by the protozoan parasite Trypanosoma brucei.
47 The disease disrupts the sleep-wake cycle, leading to coma and death if left untreated. T. brucei motility,
Page 3 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
48 transmission, and virulence depend on its flagellum (aka cilium), which consists of several different
49 specialized subdomains. Given the essential and multifunctional role of the T. brucei flagellum, there is
50 need of approaches that enable proteomic analysis of individual subdomains. Our work establishes that
51 APEX2 proximity labeling can, indeed, be implemented in the biochemical environment of T. brucei, and
52 has allowed identification of proximity proteomes for different subdomains. This capacity opens the
53 possibility to study the composition and function of other compartments. We further expect that this
54 approach may be extended to other eukaryotic pathogens, and will enhance the utility of T. brucei as a
55 model organism to study ciliopathies, heritable human diseases in which cilia function is impaired.
56 INTRODUCTION
57 Trypanosoma brucei is a flagellated parasite that is transmitted between mammalian hosts by a
58 hematophagous vector, the tsetse fly, and is of medical relevance as the causative agent of sleeping
59 sickness in humans [1]. T. brucei also presents a substantial economic burden in endemic regions due to
60 infection of livestock, causing an estimated loss of over 1 billion USD/year [2]. As such, T. brucei is
61 considered both cause and consequence of poverty in some of the poorest regions in the world. T.
62 brucei also provides an excellent model system for understanding the cell and molecular biology of
63 related flagellates T. cruzi and Leishmania spp., which together present a tremendous health burden
64 across the globe.
65 T. brucei has a single flagellum that is essential for parasite viability, infection and transmission [3-5].
66 The flagellum drives parasite motility, which is necessary for infection of the mammalian host [4], and
67 for transmission by the tsetse fly [5]. In addition to its canonical function in motility, the flagellum plays
68 important roles in cell division and morphogenesis [6-8] and mediates direct interaction with host
69 tissues [9]. Moreover, recent work has demonstrated that the trypanosome flagellum is the site of
Page 4 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
70 signaling pathways that control the parasite’s response to external signals and are required for
71 transmission and virulence [4, 10-15].
72 The trypanosome flagellum is built on a canonical “9 + 2” axoneme that originates at the basal body in
73 the cytoplasm near the posterior end of the cell [3]. Triplet microtubules of the basal body extend to
74 become doublets in the transition zone, which marks the boundary between the basal body and 9 + 2
75 axoneme [16]. The axoneme exits the cytoplasm through a specialized invagination of the plasma
76 membrane, termed the “flagellar pocket” (FP) [17]. As it emerges from the flagellar pocket, the
77 axoneme is attached to an additional filament, termed the “paraflagellar rod” (PFR) [18], that extends
78 alongside the axoneme to the anterior end of the cell. The axoneme and PFR remain surrounded by cell
79 membrane and this entire structure is laterally attached to the cell body along its length, except for a
80 small region at the distal tip that extends beyond the cell’s anterior end [19]. Lateral flagellum
81 attachment is mediated by proteins in the flagellum and cell body that hold the flagellum and plasma
82 membranes in tight apposition, constituting a specialized “flagellar attachment zone” (FAZ) that extends
83 from the flagellar pocket to the anterior end of the cell [19, 20].
84 As in other flagellated eukaryotes, the trypanosome flagellar apparatus (Fig. 1) can be subdivided into
85 multiple subdomains, each having specialized function and protein composition. The FP, for example,
86 demarcates the boundary between the flagellar membrane and cell membrane. In T. brucei, the FP is
87 the sole site for endocytosis and secretion, thus presenting a critical portal for host-parasite interaction
88 [17]. The basal body functions in flagellum duplication, segregation and axoneme assembly [21]. The
89 region encompassing the transition zone lies between the cytoplasm and flagellar compartment and
90 includes proteins that control access into and out of the flagellum [13, 16]. The 9 + 2 axoneme is the
91 engine of motility [7], while the PFR in trypanosomes is considered to physically influence flagellum
92 beating and to serve as a scaffold for assembly of signaling and regulatory proteins [22-25]. The FAZ is a
Page 5 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
93 trypanosome-specific structure and is critical for parasite motility and cell morphogenesis [26]. The
94 flagellum tip marks the site of cleavage furrow initiation during cytokinesis [27] and mediates
95 attachment to the tsetse fly salivary gland epithelium, which is crucial for development into human
96 infectious parasites [28]. The flagellum tip is also the site of signaling proteins that function in cell-cell
97 communication and signaling [11, 29].
98 Given the essential and multifunctional roles of the trypanosome flagellum, much effort has been made
99 to define the protein composition of the organelle. However, although we know a lot about protein
100 composition of the flagellum as a whole, a critical knowledge gap is the protein composition of
101 individual flagellum subdomains. Classical proteomics approaches typically require purification of the
102 flagellum, or subfractions of the flagellum [16, 30-34]. This approach is very useful and has been used to
103 determine proteomes of the transition zone [16] and axoneme tip [31], but it can be cumbersome, is
104 dependent on quality of the purified fraction and cannot resolve subdomains that are not part of a
105 specific structure that can be purified.
106 To overcome limitations of conventional flagellum proteomics approaches, we have adapted APEX2
107 proximity labeling [35] for use in T. brucei. APEX2 is an engineered monomeric ascorbate peroxidase
108 that converts biotin-phenol into a short-lived biotin radical that is highly reactive. The biotin-phenol
109 radical interacts with nearby proteins, resulting in covalent attachment of a biotin tag. Biotinylated
110 proteins can be affinity-purified with streptavidin and identified by shotgun proteomics, allowing for
111 facile identification of proteins within a specific subcellular location from a complex and largely
112 unfractionated sample [35, 36]. Here, we report successful implementation of APEX2 proximity labeling
113 in T. brucei, to define the proximity proteome of flagellar proteins that are either distributed along the
114 axoneme or restricted to the tip of the flagellum membrane. Our results establish APEX-based proximity
115 proteomics as a powerful tool for T. brucei, demonstrate the approach can resolve flagellum
Page 6 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
116 subdomains that are not separated by a physical boundary and support the idea that the flagellum tip
117 subdomain is specialized for cell signaling.
118 RESULTS
119 To evaluate APEX2 labeling in T. brucei, we selected an axonemal protein as bait, because the axoneme
120 is a well-defined cellular component whose protein composition in T. brucei has been examined in prior
121 studies [16, 30, 31, 34]. We selected the DRC1 subunit of the nexin-dynein regulatory complex (NDRC)
122 [37], because this protein has a defined localization along the axoneme and its position relative to major
123 axonemal substructures, e.g. microtubule doublets, radial spokes and dynein arms, is known [38].
124 Having selected DRC1 as bait, we used in situ gene tagging [39] to generate cell lines expressing DRC1
125 fused to a C-terminal APEX2 tag that includes an HA-tag following the APEX2 tag, referred to as “DRC1-
126 APEX2”. Expression of DRC1-APEX2 was demonstrated in Western blots of whole cell lysates (Fig. 2A).
127 Extraction with non-ionic detergent leaves the axoneme intact in a detergent-insoluble cytoskeleton
128 fraction that can be isolated from detergent-soluble proteins by centrifugation [40]. We found that
129 DRC1-APEX2 fractionates almost completely with the detergent-insoluble cytoskeleton as expected for
130 an NDRC protein (Fig. 2A). Growth curves demonstrated that expression of DRC1-APEX2 does not affect
131 growth of T. brucei in vitro (Supplemental Fig. S1).
132 We next asked if APEX2 is functional within the biochemical environment of the T. brucei cell. Cells
133 expressing DRC1-APEX2 were incubated with biotin-phenol, which was then activated with brief H2O2
134 treatment followed by quenching with trolox and L-ascorbate. Cells were then probed with streptavidin-
135 Alexa 594 and subjected to fluorescence microscopy to assess biotinylation. As shown in Figure 2C, we
136 observed APEX2-dependent biotinylation and this was highly enriched in the flagellum. There was some
137 background staining in the cytoplasm, as revealed by parallel analysis of parental cells lacking the
138 APEX2-tagged protein (Fig. 2C), but flagellum staining was only observed in cells expressing DRC1-
Page 7 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
139 APEX2. In the proximal region of the flagellum, the streptavidin signal extended further than the PFR
140 (Fig. 2C), indicating streptavidin labeling is on the axoneme. Therefore, DRC1-APEX2 directs specific
141 biotinylation in the flagellum.
142 Having established that DRC1-APEX2 directs flagellum-specific biotinylation, we used shotgun
143 proteomics to identify biotinylated proteins. Samples were extracted with non-ionic detergent and
144 separated into detergent-soluble supernatant and detergent-insoluble pellet fractions. Biotinylated
145 proteins in each fraction were then isolated using streptavidin purification and subjected to shotgun
146 proteomics for protein identification. WT and DRC1-APEX2 cells were processed in parallel (Fig. 3A). Our
147 focus was the detergent-insoluble pellet because this fraction includes the axoneme and PFR, and the
148 protein composition of these structures has been characterized [19, 25, 30, 34, 41]. The pellet fraction
149 also includes non-axonemal structures such as the basal body, tripartite attachment complex,
150 cytoplasmic FAZ filament and subpellicular cytoskeleton, thus enabling us to test for enrichment of
151 axonemal proteins. The analysis was done using three independent biological replicates. In one case, the
152 sample was split into two aliquots and shotgun proteomics was done on both in parallel, giving a total of
153 four replicates each for DRC1-APEX2 and WT pellet samples.
154 Principal component analysis demonstrated that the biotinylated protein profile of the DRC1-APEX2
155 pellet samples (DRC1p) was distinct from that of WT samples (WTp) processed in parallel (Fig. 3B). We
156 detected bona fide axonemal proteins, including DRC1 and axonemal dynein subunits in some WT
157 replicates, but these were enriched in DRC1-APEX2 samples relative to WT. We therefore assembled a
158 “DRC1p proximity proteome” that included only proteins meeting the following three criteria: i)
159 detected in all four replicates of the DRC1p sample, ii) had a normalized spectrum count of two or more,
160 and iii) were enriched in the DRC1p versus WTp sample. This yielded a DRC1p proximity proteome of
161 697 proteins (Supplemental Table S1). Human homologues were identified for 372 proteins in the
Page 8 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
162 DRC1p proximity proteome and among these, 38 are linked to human diseases that have been
163 connected to cilium defects (Table 1).
164 To evaluate whether APEX proximity labeling was effective in identifying flagellar proteins, we used
165 Gene Ontology (GO) analysis [42], comparison to prior T. brucei flagellar proteomes [30, 32, 34] and
166 independent tests of localization [43]. GO analysis demonstrated significant enrichment of flagellar
167 proteins in the DRC1p proximity proteome compared to the genome as a whole (Fig. 3C and D). As
168 discussed above, our efforts were focused on the pellet fraction. We did however, complete GO analysis
169 on the detergent-soluble “DRC1s proximity proteome”, which also showed significant enrichment of
170 flagellar proteins, as well as signaling proteins (Supplemental Fig. S2).
171 When compared with prior proteomic analyses of T. brucei flagella, the DRC1p proximity proteome
172 encompassed a larger fraction (45%) of the flagellum skeleton proteome [30] than of the intact
173 flagellum proteome (36%) [34], perhaps due to the fact that the latter includes detergent-soluble
174 proteins, which are not expected in the DRC1p proximity proteome. As anticipated, minimal overlap was
175 observed with the flagellum surface plus matrix proteome [32], which includes only detergent-soluble
176 proteins.
177 TrypTag localization data [43] were available for 677 proteins in the DRC1p proximity proteome and 509
178 of these (75%) are annotated as having TrypTag localization that includes one or more flagellum
179 structures. The DRC1p proximity proteome includes 350 proteins that were not identified in prior
180 proteomic analyses of the T. brucei flagellum or axoneme fragments [16, 25, 30-32, 34] (Supplemental
181 Table S1A). TrypTag localization data were available for 337 of these and 241 (71%) are annotated as
182 having a TrypTag localization to one or more flagellum structures. In some cases, localization was
183 specific to flagellum structures, while in others the protein showed multiple locations. This finding
184 supports the idea that many of these 350 proteins are bona fide flagellar proteins despite going
Page 9 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
185 undetected in earlier flagellum proteome studies. The combined results demonstrate that APEX2
186 proximity labeling is functional in T. brucei and enables identification of flagellar proteins without the
187 need to purify the flagellum. The data also indicate protein composition of the T. brucei flagellum is
188 more complex than indicated by earlier studies alone.
189 APEX labeling readily distinguishes proteins in close proximity but separated by a membrane [36] and
190 this is evidenced in our data when considering protein components of the FAZ [19]. Proteins on the
191 flagellar side of the FAZ are substantially enriched in the DRC1-APEX2 sample, whereas proteins on the
192 cell body side of the FAZ are not (Supplemental Fig. S3). Furthermore, the short half-life of the biotin-
193 phenol radical [35] means that APEX labeling can resolve proteins separated by distance even in the
194 absence of a membrane boundary. As discussed above and shown previously [44], APEX resolves
195 flagellar versus cytoplasmic proteins despite these two compartments being contiguous. Within the
196 DRC1p proximity proteome, we noted that proteins distributed similarly to DRC1 along the axoneme
197 were well-represented, while proteins restricted to the distal or proximal end of the flagellum were less
198 represented (Fig. 4 and Supplemental Table S2). While total abundance may contribute to this result, it
199 nonetheless suggested that beyond flagellum versus cytoplasm, APEX labeling might also be able to
200 resolve proteins from different subdomains within the flagellum. We were particularly interested in the
201 flagellum tip because of its importance in trypanosomes and other organisms for signal transduction
202 [10, 11, 45, 46], flagellum length regulation [47-51] and interaction with host tissues [9].
203 We generated APEX2-tagged versions of flagellar proteins that are tip-specific, AC1 [29], or tip-excluded,
204 FS179 [32]. Expression of either AC1-APEX2 or FS179-APEX2 did not affect T. brucei doubling time
205 (Supplemental Fig. S1) and both tagged proteins fractionated in the detergent-soluble fraction as
206 expected. To assess biotinylation, AC1-APEX2 and FS179-APEX2-expressing cells were incubated with
207 biotin-phenol, activated with H2O2, then quenched, probed with streptavidin-Alexa 594 and examined
Page 10 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
208 by fluorescence microscopy (Fig. 5A-D). The signal in AC1-APEX2 expressors was enriched at the
209 flagellum tip, while the signal in FS179-APEX2 expressors was distributed along the flagellum but lacking
210 or diminished at the flagellum tip (Fig. 5A-D). Samples were solubilized with detergent and centrifuged
211 to remove insoluble material. Biotinylated proteins were isolated from the soluble fraction by
212 streptavidin affinity purification and then identified by shotgun proteomics. As negative controls,
213 samples from WT cells without an APEX2 tag and cells expressing AC145-APEX2, an AC1 truncation that
214 lacks the C-terminal 45aa and is localized to the cytoplasm instead of the flagellum [29], were processed
215 in parallel (Fig. 5E).
216 Principal component analysis demonstrated that the biotinylated protein profile of AC1-APEX2
217 detergent-soluble (AC1s) samples was readily distinguished from that of FS179-APEX2 (FS179s) and WT
218 (2913s) controls (Fig. 6A). Therefore, APEX2 proximity proteomics was able to distinguish protein
219 composition of the tip versus FAZ subdomains within the flagellum, even though they have no physical
220 barrier between them.
221 To define an “AC1s proximity proteome”, we compared the biotinylated protein profile of the AC1s
222 sample to that of 2913s and AC145s samples processed in parallel. We used known flagellum tip
223 proteins (Supplemental Table S3) to set the enrichment threshold for inclusion in the AC1s proximity
224 proteome. Finally, we included only those proteins that were enriched in AC1s versus FS179s and DRC1s.
225 This yielded a final AC1s proximity proteome of 48 proteins, including 27 adenylate cyclases and 21
226 additional proteins (Table 2). Extensive sequence identity among adenylate cyclases poses challenges for
227 distinguishing between some isoforms, so the number of 27 might be an overestimate. Nonetheless,
228 adenylate cyclases represent the majority of proteins identified. We performed a parallel analysis to
229 define an “FS179s proximity proteome”, in this case comparing to 2913s and AC145s samples as
230 negative controls and using intraflagellar transport (IFT) proteins [52] to set the enrichment threshold
Page 11 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
231 for inclusion. Gene ontology analysis showed enrichment of flagellar proteins and signaling proteins in
232 the AC1s proximity proteome, while the FS179s proximity proteome is enriched for flagellar proteins,
233 but not signaling proteins (Fig. 6B and C).
234 Prevalence of adenylate cyclases within the AC1s proximity proteome supports the idea that the dataset
235 is enriched for tip proteins, because all T. brucei adenylate cyclases studied to date are flagellar and, in
236 procyclics, many are enriched at the flagellum tip [29]. AC2 is localized all along the flagellum [29], yet it
237 is found in the AC1s proximity proteome, perhaps due to the fact that AC2 and AC1 dimerize and share
238 ~90% amino acid sequence identity [29].
239 Among the 21 non-AC proteins in the AC1s proximity proteome, 20 have independent data on
240 localization [43]. Four of these have previously been published as being flagellum tip-specific (FLAM8
241 and CALP1.3), flagellum-specific and tip-enriched (KIN-E), or located throughout the cell but also found
242 in the flagellum tip (CALP7.2) [34, 53, 54]. For the remaining 16 proteins we assessed localization by
243 referencing the TrypTag database [43] and/or epitope tagging directly. We find that half of these 16
244 proteins are either tip-specific or enriched at the flagellum tip while also being located elsewhere in the
245 cell (Fig. 7 and Table 2). Notably, most of the proteins that did not exhibit tip localization were among
246 the least enriched in the AC1s versus FS179s samples (Table 2). Among 11 proteins enriched >2-fold in
247 AC1s versus FS179s and having localization data, 9 are enriched in the flagellum tip. Therefore, the AC1s
248 proximity proteome is enriched for flagellum tip proteins.
249 DISCUSSION
250 The protein composition of flagellum subdomains in T. brucei is a knowledge gap in understanding the
251 biology of these pathogens. To overcome this, we implemented APEX2 proximity proteomics. Our
252 results demonstrate that APEX-based proximity labeling is effective in T. brucei and is capable of
253 resolving flagellum subdomains even if they are not separated by physical barriers. Use of the APEX
Page 12 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
254 system has allowed us to define a soluble flagellum tip proteome that indicates the tip is enriched for
255 signaling proteins. While this tip proteome is likely incomplete, our work represents an important step in
256 defining protein composition of flagellum subdomains and other trypanosome cellular compartments
257 that cannot themselves be purified.
258 One major advantage of proximity labeling-based proteomics versus other proteomic approaches to
259 define organelle protein composition is that it allows for isolation of proteins of interest from crude cell
260 lysates in a simple, one-step purification, without the need to purify the organelle. Prior proteomic
261 analyses of the T. brucei flagellum have required purification of the flagellum away from the cell body
262 [34]. This is problematic, because the flagellum in T. brucei is laterally connected to the cell body along
263 most of its length. Therefore, purification approaches have employed genetic manipulation to remove
264 lateral connections, followed by sonication or shearing to detach the flagellum at its base [32, 34] or
265 have employed detergent extraction followed by selective depolymerization of subpellicular
266 microtubules while leaving axoneme microtubules intact [30]. The latter approach does not allow for
267 identification of flagellum matrix or membrane proteins that are detergent-soluble. In both cases,
268 flagellum detachment is followed by centrifugation to separate flagellum fractions from cell bodies and
269 solubilized material, then electron microscopy to evaluate sample quality. The APEX approach eliminates
270 the need for subcellular fractionation and evaluation of the purified flagellum sample, although it can be
271 coupled with fractionation to distinguish detergent-soluble from insoluble proteins, as done in our case.
272 APEX labeling also enables easy detection of detergent-soluble and insoluble proteins from the same
273 cells. A disadvantage of proximity labeling-based proteomics is that owing to the requirement for
274 proteins to be close to the APEX-tagged bait, one might miss proteins that are part of the organelle in
275 question, but distant from the bait protein. For this reason, efforts to define a comprehensive, whole-
276 organelle proteome should employ multiple independent approaches.
Page 13 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
277 Our ability to distinguish protein profiles of AC1s and FS179s samples (Fig 6A) illustrates the capacity of
278 APEX to distinguish between subcellular regions that are not separated by a physical barrier. Although
279 this capacity was shown previously in mammalian cells for the cilium compartment versus the cytoplasm
280 [44] and for distinct locations in the cytoplasm [55], our studies now demonstrate this is also possible
281 within the spatially-restricted volume of the flagellum. The eukaryotic flagellum is comprised of specific
282 subdomains, each with specialized functions and protein composition [29]. The distal tip of the flagellum
283 in many organisms, for example, is important for transduction of extracellular signals, flagellum length
284 regulation and cell-cell adhesion [10, 11, 45-51]. The transition zone at the flagellum’s proximal end has
285 specialized functions controlling access into and out of the flagellar compartment [16, 56]. Even dyneins
286 are distributed differentially from proximal to distal ends of the axoneme [57]. APEX labeling has
287 previously been used to identify proteins throughout the cilium in mammalian cells [44]. To our
288 knowledge however, our studies are the first to extend this system to distinguishing protein composition
289 of specific flagellum subdomains, thus providing a powerful addition to tools available for dissecting
290 flagellum function and mechanisms of ciliary compartmentation.
291 Prior work has provided important advances by determining protein composition of two specific
292 subdomains within the T. brucei flagellum, the transition zone [16] and tip [58] of the detergent-
293 insoluble axoneme, the latter including an “axonemal capping structure” (ACS) and the T. brucei-specific
294 flagellar connector (FC). Those studies developed a novel method termed “structure
295 immunoprecipitation” (SIP), in which detergent-insoluble axonemes are prepared, then fragmented into
296 small pieces and immunoprecipitated using antibody to a marker protein localized to the domain of
297 interest. Independent localization, biochemical and functional analysis were then used to define a
298 corresponding transition zone, ACS and FC proteomes. Given that the FC and ACS are at the tip of the
299 flagellum, we compared their proteomes to the AC1s proximity proteome. We did not find any overlap,
300 as might be expected because the FC and ACS analyses were restricted to detergent-insoluble
Page 14 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
301 components, while the AC1s proximity proteome uniquely identifies detergent-soluble proteins. Our
302 work therefore complements and extends earlier proteome analyses of T. brucei flagellum sub-domains
303 and expands our knowledge of proteins that mediate flagellum tip-specific functions.
304 The AC1s proximity proteome is enriched for known and previously unknown flagellum tip proteins (Fig.
305 7 and Table 2). Function for many of these proteins remains to be determined but features and
306 properties in some cases indicate a role in signal transduction. One, previously identified as “cAMP
307 response protein 3” (CARP3), is a candidate cAMP effector [59] and over half of the proteins are
308 adenylate cyclases. A role for the flagellum tip in cAMP signaling has been previously demonstrated [10,
309 11] and flagellar cAMP signaling is required for the T. brucei transmission cycle in the tsetse fly [12].
310 Another protein group well-represented in the AC1s proximity proteome is calpain-like proteins.
311 Calpains function in Ca++ signal transduction, although the calpain-like proteins in the AC1s proximity
312 proteome possess only a subset of the domains typically seen in more classical calpains. Interestingly,
313 the flagellar tip kinesin KIN-E [53] contains a domain III-like domain typically found in calpains and
314 thought to function in lipid binding [60]. The AC1s proximity proteome includes four proteins annotated
315 as having homology to a serine-rich adhesin protein from bacteria [61]. Whether these function as
316 adhesins in the flagellum is unknown, but adhesion functions could contribute to attachment of the
317 axoneme to the flagellar membrane or in attachment functions of the FC [58].
318 The DRC1p proximity proteome showed substantial overlap with earlier proteome analyses of purified
319 flagellum skeletons [16, 30, 31, 34], while also including 350 proteins not found in those prior studies.
320 Independent data on 241 of these 350 proteins support flagellum association for 71%, indicating protein
321 composition of the T. brucei flagellum is more complex than suggested from earlier studies. Many
322 proteins uncovered in prior flagellum proteome analyses were not found in the DRC1p proximity
323 proteome. There are several potential explanations for this. First, some earlier studies include
Page 15 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
324 detergent-soluble matrix and membrane proteins [32, 34], which are not expected in the DRC1p
325 proximity proteome. Second, the threshold for inclusion in the DRC1p dataset is high, as proteins must
326 be present in all four DRC1p replicates and enriched over negative controls, while some of the earlier
327 studies examined a limited number of replicates [30], or lacked extensive negative controls [32]. Third,
328 each approach will contain false positives and false negatives. Lastly, proteins that are far away from
329 DRC1 might not be biotinylated in our analysis. We recognize that several bona fide axonemal proteins
330 were identified in WT, negative control samples processed in parallel to DRC1p samples. We do not
331 presently know the reason for this, but the problem was less evident in the analysis of detergent-soluble
332 AC1s samples, so may relate to residual insolubility of the axoneme, or high abundance of axoneme
333 proteins, which may be present at thousands of copies per axoneme.
334 To our knowledge, this is the first work to demonstrate APEX2 proximity proteomics in a pathogenic
335 eukaryote. This is an important point, because presence of systems for removing reactive oxygen
336 species in any given organism may limit suitability of APEX-based labeling [36]. For example,
337 Plasmodium spp. rely on a series of redundant glutathione- and thioredoxin-dependent reactions to
338 remove H2O2 and maintain redox equilibrium [62]. BioID [63] is an alternative proximity-labeling method
339 that has been used widely, including several applications in T. brucei [64] and other parasites [65]. BioID
340 and APEX methods are complementary, with APEX offering the added advantage of capacity for doing
341 time course analyses. Another advantage is that APEX catalyzes oxidation of 3,3’-diaminobenzidine
342 (DAB), which polymerizes and can be visualized by electron microscopy to determine precise subcellular
343 location of the APEX-tagged protein [66]. Therefore, our studies expand the capacity in T. brucei for
344 proximity-labeling, which is an increasingly important tool in post-genomic efforts to define protein
345 location and function in pathogenic organisms [67].
346 MATERIALS AND METHODS
Page 16 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
347 Trypanosoma brucei culture
348 Procyclic Trypanosoma brucei brucei (strain 29-13) [68] was used as control and to generate all the
349 APEX2-tagged cell lines. Cells were cultivated in SM media supplemented with 10% heat inactivated fetal
350 bovine serum and incubated at 28°C with 5% CO2. Selection for transformants was done using 1 μg/mL
351 of puromycin.
352 In situ Tagging
353 APEX2-tagged cell lines were generated by in situ tagging [69]. In each case, cells were transfected with
354 a cassette containing the APEX2-NES tag [35] followed by a puromycin resistance marker and flanked on
355 the 5’ end by the 3’ end of the target gene ORF and on the 3’ end by the target gene 3’ UTR. For AC1
356 (Tb927.11.17040) and FS179 (Tb927.10.2880), the AC1-HA [32] and FS179-HA [29] in situ tagging vectors
357 were modified by inserting the APEX2-NES tag in-frame between the target gene ORF and HAx3 tag to
358 generate the final APEX2 tagging cassettes. For DRC1 (Tb927.10.7880), the last 592 bp of the ORF were
359 PCR-amplified from genomic DNA and cloned upstream of the HAx3 tag in the pMOTag2H [69] vector
360 backbone. Similarly, the first 404 bp of the DRC1 3’-UTR were PCR-amplified from genomic DNA and
361 cloned downstream of the puromycin resistance marker. The APEX2-NES tag was then inserted between
362 the target gene ORF and HAx3 tag as described above. All sequences were verified by DNA sequencing.
363 Tagging cassettes were excised from the pMOTag backbone and gel purified. Trypanosome cells were
364 transfected by electroporation and selected with puromycin as described [39].
365 Biotinylation
366 Biotinylation was done using a modified version of [36]. Briefly, cells were harvested by centrifugation
367 for 10 min at 1,200 x g, then resuspended at 2 × 107 cells/mL in growth medium supplemented with 5
368 mM biotin-phenol (biotin tyramide, Acros Organics). After a one-hour incubation, cells were treated
Page 17 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
369 with 1 mM H2O2 for one minute. To quench unreacted hydrogen peroxide, an equal volume of 2x
370 quenching buffer (10 mM Trolox and 20 mM L-Ascorbic Acid Sodium Salt in PBS, pH 7.2) was added, cells
371 were harvested by centrifugation and two additional washes were made with 1x quenching buffer.
372 Immunofluorescence
373 Cells were washed once in PBS and fixed by addition of paraformaldehyde to 0.1% for 5 min on ice.
374 Fixed cells were washed once in PBS and air-dried onto coverslips. The coverslips were incubated for 10
375 min in -20°C methanol followed by 10 min in -20°C acetone, and then air-dried. Cells were re-hydrated
376 for 15 min in PBS and blocked overnight in blocking solution (PBS + 5% bovine serum albumin [BSA] + 5%
377 normal donkey serum). Coverslips were incubated with Streptavidin coupled to Alexa 594 (Life
378 Technologies) and anti-PFR [29] diluted in blocking solution for 1.5 h. After three washes in PBS + 0.05%
379 Tween-20 for 10 min each, coverslips were incubated with donkey anti-rabbit IgG coupled to Alexa 488
380 (Invitrogen) in blocking solution for 1.5 h. After three washes in PBS + 0.05% Tween-20 for 10 min each,
381 cells were fixed with 4% paraformaldehyde for 5 min. Coverslips were washed three times in PBS +
382 0.05% Tween-20 and one time in PBS for 10 min each. Cells were mounted with Vectashield containing
383 DAPI (Vector). Images were acquired using a Zeiss Axioskop II compound microscope and processed
384 using Axiovision (Zeiss, Inc., Jena, Germany) and Adobe Photoshop (Adobe Systems, Inc., Mountain
385 View, CA).
386 Fractionation of Whole Cells and Purification of Biotinylated Proteins
387 For purification of biotinylated proteins, 6 × 108 cells were washed once in PBS and lysed in lysis buffer:
388 PEME buffer (100 mM PIPES • 1.5 Na, 2 mM EGTA, 1 mM MgSO4 • 7 H2O, and 0.1 mM EDTA-Na2 • 2
389 H2O) + 0.5% Nonidet P-40 (NP40) + EDTA free protease inhibitors (Sigma), for 10 min on ice. Lysates
390 were centrifuged for 8 min at 2,500 x g at room temperature (RT) to separate the NP40-soluble
Page 18 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
391 supernatant and the insoluble pellet. The pellet was boiled for 5 min in lysis buffer + 1% SDS, centrifuged
392 for 3 min at 21,000 x g at RT to remove insoluble debris and SDS supernatant collected.
393 Capture of biotinylated proteins from the NP40-soluble and SDS supernatants on streptavidin beads was
394 done essentially as described [32]. Cell fractions were incubated with 120 µL streptavidin beads (GE
395 Healthcare) overnight at 4°C with gentle agitation. Biotinylated proteins bound to Streptavidin beads
396 were separated from the unbound molecules by centrifugation. Beads were washed once with lysis
397 buffer at 4°C, whereas the rest of the washes were made at RT as follows: once in buffer A (8 M urea,
398 200 mM NaCl, 2% SDS, and 100 mM Tris), once in buffer B (8 M urea, 1.2 M NaCl, 0.2% SDS, 100 mM
399 Tris, 10% ethanol, and 10% isopropanol), once in buffer C (8 M urea, 200 mM NaCl, 0.2% SDS, 100 mM
400 Tris, 10% ethanol, and 10% isopropanol), and 5 times in buffer D (8 M urea, and 100 mM Tris); pH of all
401 wash buffers was 8.
402 Shotgun Proteomics
403 Shotgun proteomics was done based on [32]. Streptavidin-bound proteins were digested on beads by
404 the sequential addition of lys-C and trypsin protease. Peptide samples were fractionated online using
405 reversed phase chromatography followed by tandem mass spectrometry analysis on a Thermofisher Q-
406 Exactive™ mass spectrometer (ThermoFisher). Data analysis was performed using the IP2 suite of
407 algorithms (Integrated Proteomics Applications). Briefly, RawXtract (version 1.8) was used to extract
408 peaklist information from Xcalibur-generated RAW files. Database searching of the MS/MS spectra was
409 performed using the ProLuCID algorithm (version 1.0) and a user assembled database consisting of all
410 protein entries from the TriTrypDB for T. brucei strain 927 (version 7.0). Other database search
411 parameters included: 1) precursor ion mass tolerance of 10 ppm, 2) fragment ion mass tolerance of
412 10ppm, 3) only peptides with fully tryptic ends were considered candidate peptides in the search with
413 no consideration of missed cleavages, and 4) static modification of 57.02146 on cysteine residues.
Page 19 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
414 Peptide identifications were organized and filtered using the DTASelect algorithm which uses a linear
415 discriminate analysis to identify peptide scoring thresholds that yield a peptide-level false discovery rate
416 of less than <1.8% as estimated using a decoy database approach. Proteins were considered present in
417 the analysis if they were identified by two or more peptides using the <1.8% peptide-level false
418 discovery rate.
419 For principal component analysis (PCA) and comparison of proteins identified in each sample,
420 proteomics data were parsed using the IP2 Integrated Proteomics Pipeline Ver. 5.1.2. The output from
421 the Protein Identification STAT Compare tool (IDSTAT_COMPARE) in IP2 was processed using a custom
422 R-based web app (https://uclaproteomics.shinyapps.io/iscviewer/) to generate the PCA graphs. For
423 proteins identified in different samples, the ID_COMPARE output was exported to Excel to generate
424 mass spectrometry data analysis tables (Supplemental Tables S4 and S5).
425 For the DRC1p proximity proteome, data were from three independent experiments each for DRC1-
426 APEX2 and 29-13 (WT) pellet samples. In one case each, the sample was split into two aliquots and
427 shotgun proteomics was done on both in parallel, giving a total of four replicates each for DRC1-APEX2
428 and WT samples. Ratios of average normalized spectra were used to determine inclusion in the DRC1p
429 proximity proteome as described in the text. Of 739 proteins identified, 42 redundant GeneIDs were
430 removed for a total proximity proteome of 697 (Table S1 and S4). For the AC1s proximity proteome,
431 data were from four independent experiments for 29-13 samples (0313, 0410, 0629-A and 0629-B) and
432 two independent experiments each for AC1 (0313, 0410) and FS179 (0629-A and O629-B). For the 0313
433 and 0410 experiments, each sample was split into two aliquots and shotgun proteomics done in parallel
434 on each, for a total of six replicates for WT and four replicates for AC1. Ratios of average normalized
435 spectra were used to determine inclusion in the AC1s proximity proteome as described in the text.
436 Bioinformatics
Page 20 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
437 To identify human homologues, we developed an algorithm to automatically return reciprocal best blast
438 hits from the large lists of proteins produced by shotgun proteomics. The algorithm was implemented in
439 python using the biopython library
440 (https://academic.oup.com/bioinformatics/article/25/11/1422/330687) and works as follows. Individual
441 sequences were parsed sequentially and used as a query for BLASTp to find a list of similar sequences
442 with an e-value threshold of 0.1 from a database of human protein sequences retrieved from NCBI. The
443 top three most similar sequences were then used as query sequences for a subsequent BLASTp call
444 against a database of T. brucei protein sequences. If the original sequence was found in the top three
445 from this call then the query was then returned as a homologue. The python code for performing this
446 can be found at the following URL: https://github.com/marcusgj13/Reciprocal-BB. Comparison to
447 published T. brucei flagellar proteomes [30, 32, 34] was done using the search tools in the TriTryp
448 Genome Database (https://tritrypdb.org/tritrypdb/) [61]. Word clouds (Figures 3, 6 and S2) were
449 obtained using the GO Enrichment tool of TriTrypDB, using T. brucei brucei TREU927 and a 0.05 P-Value
450 cutoff.
451 To assess protein localization as determined by TrypTag [43], proteins in the DRC1p proximity proteome
452 dataset were cross-referenced with TrypTag to look at matches vs mis-matches. The GO terms searched
453 for were: flagellum, transition zone, flagellar cytoplasm, pro-basal body, basal body, flagellar membrane,
454 flagella connector, paraflagellar rod, flagellum attachment zone, intraflagellar transport particle, hook
455 complex, flagellar tip, and axoneme. For the full DRC1p proximity proteome, the number of genes that
456 were searched for was 739. The number of genes for which there is TrypTag localization data for at least
457 one terminus was 677. The number of TrypTag-tagged genes that matched a flagellum GO term were
458 509. The number of TrypTag-tagged genes that DID NOT match a flagellum GO term was 168. Therefore
459 of 677 proteins with TrypTag localization data, 75% have a TrypTag localization that matches the query
460 GO terms. For DRC1p proximity proteome proteins not found in earlier proteome analyses, the number
Page 21 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
461 of query genes searched for was 350. The number of query genes for which there is TrypTag localization
462 data for at least one terminus was 337. The number of query genes where TrypTag localization
463 MATCHES the query GO term was 241. The number of query genes where TrypTag localization MIS-
464 MATCHES the query GO term was 96. Therefore, of 337 proteins with TrypTag localization data, 71%
465 have a TrypTag localization that MATCHES the query GO terms.
466 Data availability
467 The python code used to identify human homologues was deposited in GitHub, and it can be found at
468 the following URL: https://github.com/marcusgj13/Reciprocal-BB. The code, explanation of it, as well as
469 instructions to install and run the program can be found there.
470 ACKNOWLEDGMENTS
471 We thank Vincent Tran for help with molecular cloning. Funding was provided by National Institutes of
472 Health grants AI052348 and AI142544 to K.H., and GM089778 to J.A.W. The authors would like to thank
473 the TrypTag consortium, which is supported by the Wellcome Trust [108445/Z/15/Z], for providing data
474 which enabled part of this work. D.E.V.R. was recipient of a Fulbright scholarship trough the United
475 States-Mexico Commission for Educational and Cultural Exchange (COMEXUS), a postdoctoral fellowship
476 of the Mexican National Council of Science and Technology (CONACyT) (CVU #325790), and a
477 postdoctoral fellowship of the University of California Institute for Mexico and the United States (UC
478 MEXUS), in conjunction with CONACyT. The funders did not have any role on the study design, data
479 collection and interpretation, nor the decision to submit the work for publication.
480 REFERENCES
Page 22 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
481 1. Stuart, K., R. Brun, S. Croft, A. Fairlamb, R.E. Gurtler, J. McKerrow, S. Reed, and R. Tarleton,
482 Kinetoplastids: related protozoan pathogens, different diseases. J Clin Invest, 2008. 118(4): p.
483 1301-10.
484 2. FAO, Controling tsetse and trypanosomosis to protect African livestock keepers, public health
485 and farmers' livelihoods. 2019.
486 3. Langousis, G. and K.L. Hill, Motility and more: the flagellum of Trypanosoma brucei. Nat Rev
487 Microbiol, 2014. 12(7): p. 505-18.
488 4. Shimogawa, M.M., S.S. Ray, N. Kisalu, Y. Zhang, Q. Geng, A. Ozcan, and K.L. Hill, Parasite motility
489 is critical for virulence of African trypanosomes. Sci Rep, 2018. 8(1): p. 9122.
490 5. Rotureau, B., C.P. Ooi, D. Huet, S. Perrot, and P. Bastin, Forward motility is essential for
491 trypanosome infection in the tsetse fly. Cell Microbiol, 2014. 16(3): p. 425-33.
492 6. Ralston, K.S., A.G. Lerner, D.R. Diener, and K.L. Hill, Flagellar motility contributes to cytokinesis in
493 Trypanosoma brucei and is modulated by an evolutionarily conserved dynein regulatory system.
494 Eukaryot Cell, 2006. 5(4): p. 696-711.
495 7. Branche, C., L. Kohl, G. Toutirais, J. Buisson, J. Cosson, and P. Bastin, Conserved and specific
496 functions of axoneme components in trypanosome motility. J Cell Sci, 2006. 119(Pt 16): p. 3443-
497 55.
498 8. Moreira-Leite, F.F., T. Sherwin, L. Kohl, and K. Gull, A trypanosome structure involved in
499 transmitting cytoplasmic information during cell division. Science, 2001. 294(5542): p. 610-2.
500 9. Vickerman, K., On the surface coat and flagellar adhesion in trypanosomes. J Cell Sci, 1969. 5(1):
501 p. 163-93.
502 10. Oberholzer, M., E.A. Saada, and K.L. Hill, Cyclic AMP Regulates Social Behavior in African
503 Trypanosomes. MBio, 2015. 6(3): p. e01954-14.
Page 23 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
504 11. Lopez, M.A., E.A. Saada, and K.L. Hill, Insect stage-specific adenylate cyclases regulate social
505 motility in African trypanosomes. Eukaryot Cell, 2015. 14(1): p. 104-12.
506 12. Shaw, S., S.F. DeMarco, R. Rehmann, T. Wenzler, F. Florini, I. Roditi, and K.L. Hill, Flagellar cAMP
507 signaling controls trypanosome progression through host tissues. Nat Commun, 2019. 10(1): p.
508 803.
509 13. Langousis, G., M.M. Shimogawa, E.A. Saada, A.A. Vashisht, R. Spreafico, A.R. Nager, W.D.
510 Barshop, M.V. Nachury, J.A. Wohlschlegel, and K.L. Hill, Loss of the BBSome perturbs endocytic
511 trafficking and disrupts virulence of Trypanosoma brucei. Proc Natl Acad Sci U S A, 2016. 113(3):
512 p. 632-7.
513 14. Salmon, D., G. Vanwalleghem, Y. Morias, J. Denoeud, C. Krumbholz, F. Lhomme, S. Bachmaier,
514 M. Kador, J. Gossmann, F.B. Dias, G. De Muylder, P. Uzureau, S. Magez, M. Moser, P. De
515 Baetselier, et al., Adenylate cyclases of Trypanosoma brucei inhibit the innate immune response
516 of the host. Science, 2012. 337(6093): p. 463-6.
517 15. Gould, M.K. and H.P. de Koning, Cyclic-nucleotide signalling in protozoa. FEMS Microbiol Rev,
518 2011. 35(3): p. 515-41.
519 16. Dean, S., F. Moreira-Leite, V. Varga, and K. Gull, Cilium transition zone proteome reveals
520 compartmentalization and differential dynamics of ciliopathy complexes. Proc Natl Acad Sci U S
521 A, 2016. 113(35): p. E5135-43.
522 17. Field, M.C. and M. Carrington, The trypanosome flagellar pocket. Nat Rev Microbiol, 2009. 7(11):
523 p. 775-86.
524 18. Cachon, J., M. Cachon, M.-P. Cosson, and J. Cosson, The paraflagellar rod: a structure in search
525 of a function. Biology of the Cell, 1988. 63(2): p. 169-181.
526 19. Sunter, J.D. and K. Gull, The Flagellum Attachment Zone: 'The Cellular Ruler' of Trypanosome
527 Morphology. Trends Parasitol, 2016. 32(4): p. 309-324.
Page 24 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
528 20. Sun, S.Y., C. Wang, Y.A. Yuan, and C.Y. He, An intracellular membrane junction consisting of
529 flagellum adhesion glycoproteins links flagellum biogenesis to cell morphogenesis in
530 Trypanosoma brucei. J Cell Sci, 2013. 126(Pt 2): p. 520-31.
531 21. Hoog, J.L., C. Bouchet-Marquis, J.R. McIntosh, A. Hoenger, and K. Gull, Cryo-electron
532 tomography and 3-D analysis of the intact flagellum in Trypanosoma brucei. J Struct Biol, 2012.
533 178(2): p. 189-98.
534 22. Hughes, L.C., K.S. Ralston, K.L. Hill, and Z.H. Zhou, Three-dimensional structure of the
535 Trypanosome flagellum suggests that the paraflagellar rod functions as a biomechanical spring.
536 PLoS One, 2012. 7(1): p. e25700.
537 23. Koyfman, A.Y., M.F. Schmid, L. Gheiratmand, C.J. Fu, H.A. Khant, D. Huang, C.Y. He, and W. Chiu,
538 Structure of Trypanosoma brucei flagellum accounts for its bihelical motion. Proc Natl Acad Sci U
539 S A, 2011. 108(27): p. 11105-8.
540 24. Oberholzer, M., P. Bregy, G. Marti, M. Minca, M. Peier, and T. Seebeck, Trypanosomes and
541 mammalian sperm: one of a kind? Trends Parasitol, 2007. 23(2): p. 71-7.
542 25. Portman, N., S. Lacomble, B. Thomas, P.G. McKean, and K. Gull, Combining RNA interference
543 mutants and comparative proteomics to identify protein components and dependences in a
544 eukaryotic flagellum. J Biol Chem, 2009. 284(9): p. 5610-9.
545 26. Sunter, J.D., C. Benz, J. Andre, S. Whipple, P.G. McKean, K. Gull, M.L. Ginger, and J. Lukes,
546 Modulation of flagellum attachment zone protein FLAM3 and regulation of the cell shape in
547 Trypanosoma brucei life cycle transitions. J Cell Sci, 2015. 128(16): p. 3117-30.
548 27. Hoog, J.L., S. Lacomble, C. Bouchet-Marquis, L. Briggs, K. Park, A. Hoenger, and K. Gull, 3D
549 Architecture of the Trypanosoma brucei Flagella Connector, a Mobile Transmembrane Junction.
550 PLoS Negl Trop Dis, 2016. 10(1): p. e0004312.
Page 25 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
551 28. Rotureau, B. and J. Van Den Abbeele, Through the dark continent: African trypanosome
552 development in the tsetse fly. Front Cell Infect Microbiol, 2013. 3: p. 53.
553 29. Saada, E.A., Z.P. Kabututu, M. Lopez, M.M. Shimogawa, G. Langousis, M. Oberholzer, A. Riestra,
554 Z.O. Jonsson, J.A. Wohlschlegel, and K.L. Hill, Insect stage-specific receptor adenylate cyclases
555 are localized to distinct subdomains of the Trypanosoma brucei Flagellar membrane. Eukaryot
556 Cell, 2014. 13(8): p. 1064-76.
557 30. Broadhead, R., H.R. Dawe, H. Farr, S. Griffiths, S.R. Hart, N. Portman, M.K. Shaw, M.L. Ginger, S.J.
558 Gaskell, P.G. McKean, and K. Gull, Flagellar motility is required for the viability of the
559 bloodstream trypanosome. Nature, 2006. 440(7081): p. 224-7.
560 31. Varga, V., F. Moreira-Leite, N. Portman, and K. Gull, Protein diversity in discrete structures at the
561 distal tip of the trypanosome flagellum. Proc Natl Acad Sci U S A, 2017. 114(32): p. E6546-e6555.
562 32. Oberholzer, M., G. Langousis, H.T. Nguyen, E.A. Saada, M.M. Shimogawa, Z.O. Jonsson, S.M.
563 Nguyen, J.A. Wohlschlegel, and K.L. Hill, Independent analysis of the flagellum surface and
564 matrix proteomes provides insight into flagellum signaling in mammalian-infectious
565 Trypanosoma brucei. Mol Cell Proteomics, 2011. 10(10): p. M111.010538.
566 33. Shimogawa, M.M., E.A. Saada, A.A. Vashisht, W.D. Barshop, J.A. Wohlschlegel, and K.L. Hill, Cell
567 Surface Proteomics Provides Insight into Stage-Specific Remodeling of the Host-Parasite
568 Interface in Trypanosoma brucei. Mol Cell Proteomics, 2015. 14(7): p. 1977-88.
569 34. Subota, I., D. Julkowska, L. Vincensini, N. Reeg, J. Buisson, T. Blisnick, D. Huet, S. Perrot, J. Santi-
570 Rocca, M. Duchateau, V. Hourdel, J.C. Rousselle, N. Cayet, A. Namane, J. Chamot-Rooke, et al.,
571 Proteomic analysis of intact flagella of procyclic Trypanosoma brucei cells identifies novel
572 flagellar proteins with unique sub-localization and dynamics. Mol Cell Proteomics, 2014. 13(7):
573 p. 1769-86.
Page 26 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
574 35. Lam, S.S., J.D. Martell, K.J. Kamer, T.J. Deerinck, M.H. Ellisman, V.K. Mootha, and A.Y. Ting,
575 Directed evolution of APEX2 for electron microscopy and proximity labeling. Nat Methods, 2015.
576 12(1): p. 51-4.
577 36. Hung, V., N.D. Udeshi, S.S. Lam, K.H. Loh, K.J. Cox, K. Pedram, S.A. Carr, and A.Y. Ting, Spatially
578 resolved proteomic mapping in living cells with the engineered peroxidase APEX2. Nat Protoc,
579 2016. 11(3): p. 456-75.
580 37. Wirschell, M., H. Olbrich, C. Werner, D. Tritschler, R. Bower, W.S. Sale, N.T. Loges, P.
581 Pennekamp, S. Lindberg, U. Stenram, B. Carlen, E. Horak, G. Kohler, P. Nurnberg, G. Nurnberg, et
582 al., The nexin-dynein regulatory complex subunit DRC1 is essential for motile cilia function in
583 algae and humans. Nat Genet, 2013. 45(3): p. 262-8.
584 38. Oda, T., H. Yanagisawa, and M. Kikkawa, Detailed structural and biochemical characterization of
585 the nexin-dynein regulatory complex. Mol Biol Cell, 2015. 26(2): p. 294-304.
586 39. Oberholzer, M., M.A. Lopez, K.S. Ralston, and K.L. Hill, Approaches for functional analysis of
587 flagellar proteins in African trypanosomes. Methods Cell Biol, 2009. 93: p. 21-57.
588 40. Kabututu, Z.P., M. Thayer, J.H. Melehani, and K.L. Hill, CMF70 is a subunit of the dynein
589 regulatory complex. J Cell Sci, 2010. 123(Pt 20): p. 3587-95.
590 41. Lacomble, S., N. Portman, and K. Gull, A protein-protein interaction map of the Trypanosoma
591 brucei paraflagellar rod. PLoS One, 2009. 4(11): p. e7685.
592 42. Thomas, P.D., The Gene Ontology and the Meaning of Biological Function. Methods Mol Biol,
593 2017. 1446: p. 15-24.
594 43. Dean, S., J.D. Sunter, and R.J. Wheeler, TrypTag.org: A Trypanosome Genome-wide Protein
595 Localisation Resource. Trends Parasitol, 2017. 33(2): p. 80-82.
596 44. Mick, D.U., R.B. Rodrigues, R.D. Leib, C.M. Adams, A.S. Chien, S.P. Gygi, and M.V. Nachury,
597 Proteomics of Primary Cilia by Proximity Labeling. Dev Cell, 2015. 35(4): p. 497-512.
Page 27 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
598 45. Nager, A.R., J.S. Goldstein, V. Herranz-Perez, D. Portran, F. Ye, J.M. Garcia-Verdugo, and M.V.
599 Nachury, An Actin Network Dispatches Ciliary GPCRs into Extracellular Vesicles to Modulate
600 Signaling. Cell, 2017. 168(1-2): p. 252-263.e14.
601 46. He, M., S. Agbu, and K.V. Anderson, Microtubule Motors Drive Hedgehog Signaling in Primary
602 Cilia. Trends Cell Biol, 2017. 27(2): p. 110-125.
603 47. Blaineau, C., M. Tessier, P. Dubessay, L. Tasse, L. Crobu, M. Pages, and P. Bastien, A novel
604 microtubule-depolymerizing kinesin involved in length control of a eukaryotic flagellum. Curr
605 Biol, 2007. 17(9): p. 778-82.
606 48. Bertiaux, E., B. Morga, T. Blisnick, B. Rotureau, and P. Bastin, A Grow-and-Lock Model for the
607 Control of Flagellum Length in Trypanosomes. Curr Biol, 2018. 28(23): p. 3802-3814.e3.
608 49. Keeling, J., L. Tsiokas, and D. Maskey, Cellular Mechanisms of Ciliary Length Control. Cells, 2016.
609 5(1).
610 50. Fort, C. and P. Bastin, [Elongation of the axoneme and dynamics of intraflagellar transport]. Med
611 Sci (Paris), 2014. 30(11): p. 955-61.
612 51. He, M., R. Subramanian, F. Bangs, T. Omelchenko, K.F. Liem, Jr., T.M. Kapoor, and K.V. Anderson,
613 The kinesin-4 protein Kif7 regulates mammalian Hedgehog signalling by organizing the cilium tip
614 compartment. Nat Cell Biol, 2014. 16(7): p. 663-72.
615 52. Absalon, S., T. Blisnick, L. Kohl, G. Toutirais, G. Dore, D. Julkowska, A. Tavenet, and P. Bastin,
616 Intraflagellar transport and functional analysis of genes required for flagellum formation in
617 trypanosomes. Mol Biol Cell, 2008. 19(3): p. 929-44.
618 53. An, T. and Z. Li, An orphan kinesin controls trypanosome morphology transitions by targeting
619 FLAM3 to the flagellum. PLoS Pathog, 2018. 14(5): p. e1007101.
620 54. Liu, W., K. Apagyi, L. McLeavy, and K. Ersfeld, Expression and cellular localisation of calpain-like
621 proteins in Trypanosoma brucei. Mol Biochem Parasitol, 2010. 169(1): p. 20-6.
Page 28 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
622 55. Mavylutov, T., X. Chen, L. Guo, and J. Yang, APEX2- tagging of Sigma 1-receptor indicates
623 subcellular protein topology with cytosolic N-terminus and ER luminal C-terminus. Protein Cell,
624 2018. 9(8): p. 733-737.
625 56. Fisch, C. and P. Dupuis-Williams, Ultrastructure of cilia and flagella - back to the future! Biol Cell,
626 2011. 103(6): p. 249-70.
627 57. Edwards, B.F.L., R.J. Wheeler, A.R. Barker, F.F. Moreira-Leite, K. Gull, and J.D. Sunter, Direction of
628 flagellum beat propagation is controlled by proximal/distal outer dynein arm asymmetry. Proc
629 Natl Acad Sci U S A, 2018. 115(31): p. E7341-e7350.
630 58. Briggs, L.J., P.G. McKean, A. Baines, F. Moreira-Leite, J. Davidge, S. Vaughan, and K. Gull, The
631 flagella connector of Trypanosoma brucei: an unusual mobile transmembrane junction. J Cell Sci,
632 2004. 117(Pt 9): p. 1641-51.
633 59. Gould, M.K., S. Bachmaier, J.A. Ali, S. Alsford, D.N. Tagoe, J.C. Munday, A.C. Schnaufer, D. Horn,
634 M. Boshart, and H.P. de Koning, Cyclic AMP effectors in African trypanosomes revealed by
635 genome-scale RNA interference library screening for resistance to the phosphodiesterase
636 inhibitor CpdA. Antimicrob Agents Chemother, 2013. 57(10): p. 4882-93.
637 60. Tompa, P., Y. Emori, H. Sorimachi, K. Suzuki, and P. Friedrich, Domain III of calpain is a ca2+-
638 regulated phospholipid-binding domain. Biochem Biophys Res Commun, 2001. 280(5): p. 1333-9.
639 61. Aslett, M., C. Aurrecoechea, M. Berriman, J. Brestelli, B.P. Brunk, M. Carrington, D.P. Depledge,
640 S. Fischer, B. Gajria, X. Gao, M.J. Gardner, A. Gingle, G. Grant, O.S. Harb, M. Heiges, et al.,
641 TriTrypDB: a functional genomic resource for the Trypanosomatidae. Nucleic Acids Res, 2010.
642 38(Database issue): p. D457-62.
643 62. Jortzik, E. and K. Becker, Thioredoxin and glutathione systems in Plasmodium falciparum. Int J
644 Med Microbiol, 2012. 302(4-5): p. 187-94.
Page 29 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
645 63. Varnaite, R. and S.A. MacNeill, Meet the neighbors: Mapping local protein interactomes by
646 proximity-dependent labeling with BioID. Proteomics, 2016. 16(19): p. 2503-2518.
647 64. Morriswood, B., K. Havlicek, L. Demmel, S. Yavuz, M. Sealey-Cardona, K. Vidilaseris, D. Anrather,
648 J. Kostan, K. Djinovic-Carugo, K.J. Roux, and G. Warren, Novel bilobe components in
649 Trypanosoma brucei identified using proximity-dependent biotinylation. Eukaryot Cell, 2013.
650 12(2): p. 356-67.
651 65. Chen, A.L., E.W. Kim, J.Y. Toh, A.A. Vashisht, A.Q. Rashoff, C. Van, A.S. Huang, A.S. Moon, H.N.
652 Bell, L.A. Bentolila, J.A. Wohlschlegel, and P.J. Bradley, Novel components of the Toxoplasma
653 inner membrane complex revealed by BioID. MBio, 2015. 6(1): p. e02357-14.
654 66. Martell, J.D., T.J. Deerinck, S.S. Lam, M.H. Ellisman, and A.Y. Ting, Electron microscopy using the
655 genetically encoded APEX2 tag in cultured mammalian cells. Nat Protoc, 2017. 12(9): p. 1792-
656 1816.
657 67. Rees, J.S., X.W. Li, S. Perrett, K.S. Lilley, and A.P. Jackson, Protein Neighbors and Proximity
658 Proteomics. Mol Cell Proteomics, 2015. 14(11): p. 2848-56.
659 68. Wirtz, E., S. Leal, C. Ochatt, and G.A. Cross, A tightly regulated inducible expression system for
660 conditional gene knock-outs and dominant-negative genetics in Trypanosoma brucei. Mol
661 Biochem Parasitol, 1999. 99(1): p. 89-101.
662 69. Oberholzer, M., S. Morand, S. Kunz, and T. Seebeck, A vector series for rapid PCR-mediated C-
663 terminal in situ tagging of Trypanosoma brucei genes. Mol Biochem Parasitol, 2006. 145(1): p.
664 117-20.
665
666 Legends
Page 30 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
667 Figure 1. Schematic diagram of major flagellum substructures. Schematic depicts emergence of the
668 flagellum from the basal body near the cell’s posterior end (left), extending to the cell’s anterior end
669 (right). Major substructures are labeled.
670
671 Figure 2. APEX2 directs organelle-specific biotinylation in T. brucei.
672 Panel A. Western blot of whole cell lysate (W), NP40-extracted supernatant (S) and pellet (P) samples
673 from WT and DRC1-APEX2-expressing cells. Samples were probed with anti-HA antibody.
674 Panel B. Samples as in A were stained with SYPRO Ruby to assess loading.
675 Panel C. WT and DRC1-APEX2 cells were examined by immunofluorescence with anti-PFR antibody
676 (Alexa 488, green), Streptavidin (Alexa 594, red) and DAPI.
677
678 Figure 3. DRC1-APEX2 proximity proteome is enrichened for flagellar proteins.
679 Panel A. Scheme used to identify biotinylated proteins from WT and DRC1-APEX2 expressing T. brucei
680 cells.
681 Panel B. Principal Component Analysis of proteins identified in pellet fractions from WT cells (WTp) and
682 DRC1-APEX2 cells (DRC1p). The experiment was done using three independent biological replicates
683 (0313, 0623-A and 0623-B), and the 0313 protein sample was split into two aliquots and shotgun
684 proteomics done on each in parallel (0313-A1 and 0313-A2).
685 Panels C and D. Word clouds showing GO analysis of the DRC1p proximity proteome. Size and shading of
686 text reflects the p-value according to the scale shown.
687
688 Figure 4. Spatial distribution of proteins identified in the DRC1p proximity proteome.
689 Bars in the histogram indicate the percentage of proteins from each of the indicated complexes that
690 were identified in the DRC1p proximity proteome. Schematic below the histogram illustrates the relative
Page 31 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
691 distribution of the complexes indicated in the histogram, with the mitochondrion and kinetoplast
692 indicated in black at left.
693
694 Figure 5. APEX2-labeling resolves flagellum sub-domains.
695 Panels A, B. AC1-APEX2 cells were fixed and examined by fluorescence microscopy after staining with
696 Streptavidin (red) and DAPI (blue).
697 Panels C, D. FS179-APEX2 cells were examined by immunofluorescence with anti-PFR antibody (Alexa
698 488, green), Streptavidin (Alexa 594, red) and DAPI (blue). White brackets indicate the distal region of
699 the flagellum that is not labelled by streptavidin. Box shows zoomed in version of the cell in the middle
700 of the field.
701 Panel E. Scheme used to identify biotinylated proteins from the indicated cell lines (WT, AC-APEX2 and
702 FS179-APEX2).
703
704 Figure 6. APEX2 proximity proteomics differentiates protein composition of distinct flagellum
705 subdomains.
706 Panel A. Principal Component Analysis of proteins identified in supernatant fractions from WT (2913s),
707 AC1-APEX2 (AC1s) and FS179-APEX2 (FS179s) cells. For 2913 controls, three independent experiments
708 are shown (0410, 0629-A and 0629-B) and for one experiment the sample was split into two aliquots
709 (0410-A1 and 0410-A2) that were subjected to shotgun proteomics in parallel. For FS179s, two
710 independent experiments are shown (0629-A, 0629-B). For AC1s, one sample was split into two aliquots
711 that were subjected to shotgun proteomics in parallel (0410-A1, 0410-A2).
712 Panel B. Word cloud representing GO analysis for Cellular Component and Biological Process of the
713 proteins identified in the AC1 proximity proteome.
Page 32 of 33
bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
714 Panel C. Word cloud representing GO analysis for Cellular Component and Biological Process of the
715 proteins identified in the FS179 proximity proteome.
716
717 Figure 7. AC1s proximity proteome identifies tip proteins.
718 Fluorescence microscopy of trypanosomes expressing the indicated protein tagged with neon green
719 (Green). Samples are stained with DAPI (Blue). Top panel shows fluorescence plus phase contrast
720 merged images. Bottom panel shows fluorescence image.
721
722 Supplemental Figure S1. Growth curves of APEX2-tagged cell lines.
723 Comparison of doubling times of 29-13 (parental cell line), AC1-APEX2, DRC1- APEX2, and FS179- APEX2
724 cell lines in suspension culture over the course of 8 days. Cell densities were adjusted daily to 1e6
725 cells/mL in order to insure logarithmic growth.
726
727 Supplemental Figure S2. Word cloud representing GO analysis for Cellular Component (A) and
728 Biological Process (B) of the proteins identified in the DRC1s proximity proteome.
729
730 Supplemental Figure S3. FAZ proteins identified on the DRC1p proximity proteome.
731 Panel A is a schematic of the FAZ. Panel B indicates the FAZ zone to which each protein has been
732 localized, as well as the ratio of spectra identified in the DRC1p versus 2913p sample. FAZ schematic and
733 zone locations are based on Sunter, J.D. and K. Gull, The Flagellum Attachment Zone: 'The Cellular Ruler'
734 of Trypanosome Morphology. Trends Parasitol, 2016. 32(4): p. 309-324.
735
Page 33 of 33 bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
Table 1. Human homologues with disease association T. brucei gene Human Human gene product Human disease phenotype e value homologue description Tb927.3.4020 XP_016884319.1 PI4KA phosphatidylinositol 4- Polymicrogyria, perisylvian, with cerebellar hypoplasia 2.00411e-94 kinase alpha isoform X4 and arthrogryposis Tb927.5.3650 EAW71112.1 MLPH melanophilin, isoform Griscelli syndrome type 3, identified in cancer 0.00600032 CRA_a susceptibility loci NP_055710.2 CEP162 centrosomal protein Associated with diabetic retinopathy 2.69684e-17 of 162 kDa isoform a Tb927.10.1510 AAH00779.2 CNOT1 protein, partial Associated with QT interval duration, PR interval, and 2.47459e-60 QRS duration Tb927.10.5380 XP_005247666.1 IFT122 intraflagellar transport Craneoectodermal dysplasia 1 0.0 protein 122 homolog isoform X13 EAW91274.1 asp (abnormal spindle)-like, Primary autosomal recessive microcephaly 5, lupus 0.000116493 microcephaly associated nephritis susceptibility in women Tb927.10.970 EAW91271.1 asp (abnormal spindle)-like, Primary autosomal recessive microcephaly 5, lupus 4.26269e-12 microcephaly associated, nephritis susceptibility in women, locus associated with isoform CRA_b ischemic stroke Tb927.8.6870 BAG06714.1 MYO5B variant protein Congenital microvillous atrophy 9.0414e-05 Tb927.7.7260 XP_011512661.1 kinesin-like protein KIF6 ADHD, antibody response to smallpox vaccine, dental 4.63801e- isoform X4 caries 135 Tb927.5.950 CAA09375.1 thioredoxin-like protein Susceptibility to HIV-1 1.87138e-32 Tb927.10.13000 EAW93986.1 PDE1C phosphodiesterase 1C, Endometriosis 8.12662e-66 calmodulin-dependent 70kDa, isoform CRA_b AAD50326.1 truncated RAD50 protein Nijmegen breakage syndrome-like disorder 5.70368e-59 Tb927.9.9580 AAF23275.2 KIAA0998, partial (tubulin Cone-rod dystrophy 19, susceptibility to ischemic 7.80409e-56 tyrosine ligase like 5) stroke and coronary heart disease EAW97654.1 UTP20, small subunit (SSU) Influences neurodegeneration in Alzheimer’s disease, 2.98428e-20 processome component, associated with subclinical atherosclerosis isoform CRA_c Tb927.10.9380 BAC04112.1 UBXN11 UBX domain protein Pathophysiology of childhood obesity 1.17265e-23 bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
11 Tb927.8.5290 BAG64996.1 ITCH itchy E3 ubiquitin protein Autoimmune disease 2.61032e-17 ligase Tb927.11.3140 AAH31244.1 Dual-specificity tyrosine-(Y)- Allergic rhinitis 5.35331e- phosphorylation regulated 119 kinase 4 Tb927.5.1920 EAW64669.1 CCDC13 coiled-coil domain Clozapine-induced agranulocytosis, chronic 6.40896e-13 containing 13, isoform CRA_a periodontitis, pathophysiology of childhood obesity Tb927.8.5970 XP_011529701.1 CUL4B cullin-4B isoform X3 Syndromic X-linked mental retardation, Cabezas type 1.67244e-50 Tb927.3.4510 BAH13910.1 TRAF3 TNF receptor Schizophrenia, susceptibility to herpes simplex 0.0509298 associated factor 3 encephalitis 3 Tb927.8.4510 AAH36816.1 Thioredoxin domain Primary ciliary dyskinesia 6, risk of fracture, 2.4852e-06 containing 3 susceptibility for Alzheimer’s disease BAG37189.1 CCDC65 coiled-coil domain Primary ciliary dyskinesia 27 1.04609e-36 containing 65 Tb927.8.1890 AAA35730.1 cytochrome c1, partial Nuclear type 6 mitochondrial complex III deficiency 2.43602e-49 Tb927.8.5020 BAA25448.1 KIAA0522 protein, partial X-linked 1 mental retardation, triplosensitivity, 0.0592341 haploinsufficeincy CAD38878.1 VWA8 von Willebrand factor A Emphysema, TB risk 1.15659e- domain containing 8 116 Tb927.5.4110 AAH50721.1 CEP104 centrosomal protein Joubert syndrome 25 7.07883e-14 104 Tb927.3.1190 EAW64508.1 DLEC1 deleted in lung and Lung cancer, malignant tumor of esophagus 0.00498254 esophageal cancer 1, isoform CRA_a Tb927.6.4750 BAG56787.1 BUB3, mitotic checkpoint Age of menarche and natural menopause 0.000868412 protein Tb927.11.6350 XP_006712094.1 ATAD 2B ATPase family AAA Pathophysiology of childhood obesity 4.90261e- domain-containing protein 2B 102 isoform X10 Tb927.11.4210 AAC18038.1 antigen NY-CO-7 Autosomal recessive 16 spinocerebellar ataxia 0.00394461 Tb927.10.2900 BAG37316.1 KPNB1 karyopherin subunit Circulating levels of very long-chain saturated fatty 5.10086e-78 beta 1 acids, multiple sclerosis susceptibility Tb927.10.8390 AAH40929.1 HERC1 protein, partial Macrocephaly, dysmorphic facies, psychomotor 3.352e-64 bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
retardation, neuronal pattern development Tb927.4.3810 BAF82512.1 POLR2B RNA polymerase II Age-related macular degeneration, adult height 0.0 subunit B Tb927.11.12430 AAT38107.1 CDC14A cell division cycle 14 Autosomal recessive 32 deafness 4.10273e-46 homolog A Tb927.7.1560 AAI03916.1 TTC6 protein Allergy-specific susceptibility 1.23027e-09 Tb927.7.290 XP_005250264.1 FBXL13 F-box/LRR-repeat Blood pressure response to interventions 1.17865e-09 protein 13 isoform X5 Tb927.11.14680 NP_001341508.1 serine/threonine-protein Familial cutaneous telangiectasia and cancer 1.05991e- kinase ATR isoform 2 syndrome, Seckel syndrome 1 113 Tb927.11.6430 XP_011524505.1 CCDC68 coiled-coil domain- Schizophrenia, severe diabetic retinopathy, psychiatric 0.0250287 containing protein 68 isoform disorders X3 bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
Table 2. AC1s proximity proteome.
Source for location AC1s/FS179s TriTryp GeneID TriTryp Annotation Localization* data avg ratio&
Tb927.2.5860 Hypothetical Tip TrypTag1 15.88 Tb927.2.5870 Hypothetical Tip TrypTag 1 + this work 10.71 Tb927.4.1500 RNA editing associated helicase 2 Not Flagellar TrypTag1 7.45 Tb927.4.4400 Hypothetical Flagellum, distal points + Endocytic TrypTag1 5.18 Tb927.2.5760 Flagellar Member 8 Tip 2 5.03 Tb927.11.14410 Ankyrin repeats Not Flagellar TrypTag1 3.89 Tb927.1.2120 Calpain-like protein CALP1.3 Tip 3 3.60 Tb927.11.17040 AC1# Tip 4 3.37 Tb927.7.4060 cysteine peptidase, Clan CA, family C2 Flagellum, Tip-enriched + Cytoplasm TrypTag1 + 3 3.02 Tb927.9.7540 cysteine peptidase, Clan CA, family C2, putative Flagellum, Tip-enriched TrypTag 1 + this work 2.98 Tb927.7.4070 cysteine peptidase, Clan CA, family C2 Tip + Cytoplasm 3 2.86 Tb927.7.5340 cAMP response protein 3 Flagellum, Tip-enriched + Cytoplasm TrypTag + this Work 2.41 Tb927.10.15700 Hypothetical NA 2.11 Tb927.4.4220 small GTP-binding rab protein NA 2.11 1 Tb927.10.9380 SEP domain/UBX domain containing protein New Flagellum-distal points TrypTag 1.86 Tb927.5.2090 kinesin Flagellum + Cytoplasm TrypTag1 1.74 Tb927.6.4710 calmodulin Flagellum TrypTag1 1.68 Tb927.3.4640 VIT family Not Flagellar TrypTag1 1.50 Tb927.7.7260 kinesin Flagellum-puncta 5 1.32 Tb927.9.9690 Hypothetical Flagellum, Tip-enriched TrypTag1 1.22 Tb927.7.3090 Galactose Oxidase-central domain =-containing Not Flagellar TrypTag1 1.17 Tb927.5.2410 kinesin Flagellum, Tip-enriched TrypTag1 + 6 1.16 bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.
*Localization defined as: Tip = the prominent or sole location is flagellum tip. Flagellum, Tip-enriched = in the flagellum and enriched at tip. Tip- enriched = enriched at the flagellum tip, but also present outside the flagellum. 1Dean, S., J.D. Sunter, and R.J. Wheeler, TrypTag.org: A Trypanosome Genome-wide Protein Localisation Resource. Trends Parasitol, 2017. 33(2): p. 80-82. 2Subota, I., et al., Proteomic analysis of intact flagella of procyclic Trypanosoma brucei cells identifies novel flagellar proteins with unique sub- localization and dynamics. Mol Cell Proteomics, 2014. 13(7): p. 1769-86. 3Liu, W., et al., Expression and cellular localisation of calpain-like proteins in Trypanosoma brucei. Mol Biochem Parasitol, 2010. 169(1): p. 20-6. 4Saada, E.A., et al., Insect stage-specific receptor adenylate cyclases are localized to distinct subdomains of the Trypanosoma brucei Flagellar membrane. Eukaryot Cell, 2014. 13(8): p. 1064-76. 5Demonchy, R., et al., Kinesin 9 family members perform separate functions in the trypanosome flagellum. J Cell Biol, 2009. 187(5): p. 615-22. 6An, T. and Z. Li, An orphan kinesin controls trypanosome morphology transitions by targeting FLAM3 to the flagellum. PLoS Pathog, 2018. 14(5): p. e1007101. #AC1 represents a total of 39 ACs identified in the AC1s proximity proteome (Supplementary Table S4). &Normalized spectral count average of all experiments and replicates of AC1s, over normalized spectral count average of all experiments and replicates of FS179s. bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. bioRxiv preprint doi: https://doi.org/10.1101/2020.03.09.984815; this version posted March 12, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission.