Proteomic analysis of polymeric salivary : No evidence for MUC19 in saliva Karine Rousseau, Sara Kirkham, Lindsay Johnson, Brian Fitzpatrick, Marj Howard, Emily J Adams, Duncan F Rogers, David Knight, Peter Clegg, David J Thornton

To cite this version:

Karine Rousseau, Sara Kirkham, Lindsay Johnson, Brian Fitzpatrick, Marj Howard, et al.. Proteomic analysis of polymeric salivary mucins: No evidence for MUC19 in human saliva. Biochemical Journal, Portland Press, 2008, 413 (3), pp.545-552. ￿10.1042/BJ20080260￿. ￿hal-00478964￿

HAL Id: hal-00478964 https://hal.archives-ouvertes.fr/hal-00478964 Submitted on 30 Apr 2010

HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

Proteomic analysis of polymeric salivary mucins: No evidence for MUC19 in human saliva

Karine Rousseau1, Sara Kirkham1, Lindsay Johnson1, Brian Fitzpatrick1, Marj Howard1, Emily J. Adams2, Duncan F. Rogers2, David Knight1, Peter Clegg1,3 & David J. Thornton1

1 Wellcome Trust Centre for Cell-Matrix Research, Faculty of Life Sciences, University of Manchester, Manchester, M13 9PT, UK. 2Airway Disease, National Heart & Lung Institute, Imperial College London, London SW3 6LY, UK, 3University of Liverpool, Faculty of Veterinary Science, Leahurst, Neston, CH64 7TE, UK.

Running head: Salivary mucins Corresponding author: David J. Thornton Wellcome Trust Centre for Cell-Matrix Research Faculty of Life Sciences The Michael Smith Building University of Manchester Manchester M13 9PT, UK.

Tel: 0161 275 5647 Fax: 0161 275 1505 E-mail: [email protected]

Key words: saliva, mucins, Muc19, Muc5b THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260

Abbreviations: TR: tandem repeat; GdmCl: Guanidinium chloride; PAS: Periodic Acid Schiff's; PSM: Pig submaxillary ; BSM: Bovine submaxillary mucin; DTT: ditiothretiol; CsCl: Caesium chloride; MS-MS; Tandem mass spectrometry.

Stage 2(a) POST-PRINT 1

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

ABSTRACT

MUC5B is the predominant polymeric mucin in human saliva (Thornton, Khan, Mehrotra, Howard, Veerman, Packer, and Sheehan (1999) Glycobiology 9, 293-302) where it contributes to oral cavity hydration and protection. More recently the for another putative polymeric mucin, MUC19, has been shown to be expressed in human salivary glands (Chen, Zhao, Kalaslavadi, Hamati, Nehrke, Le, Ann, and Wu, (2004) Am.J.Respir.Cell Mol.Biol. 30, 155-165). However, to date, the MUC19 mucin has not been isolated from human saliva. Our aim was therefore to purify and characterize the MUC19 from human saliva. Saliva was solubilised in 4M guanidinium chloride and the high-density mucins were purified by density gradient centrifugation. The presence of MUC19 was investigated using tandem mass spectrometry of tryptic peptides derived from this mucin preparation. Using this approach we found multiple MUC5B- derived tryptic peptides but were unable to detect any putative MUC19 peptides. These data suggest that MUC19 is not a major component in human saliva. In contrast, using the same experimental approach we identified Muc19 and Muc5b in horse saliva. Moreover, we also identified Muc19 from pig, cow and rat saliva; the saliva of cow and rat also contained Muc5b, although, due to the lack of pig Muc5b genomic sequence data, we were unable to identify Muc5b in pig saliva. Our data suggest that unlike human saliva which contains MUC5B, cow, horse and rat saliva is a heterogeneous mixture of Muc5b and Muc19. The functional consequence of these species differences remains to be elucidated. THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260

Stage 2(a) POST-PRINT 2

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

INTRODUCTION Saliva is a dilute, complex mixture of , glycoproteins, lipids and ions secreted from the major and minor salivary glands [1-3]. Saliva has many diverse functions including; aiding mastication and speech, digestion of food and maintenance of oral health [4]. The major macromolecular components of normal, unstimulated saliva are the mucins. There are two-populations of salivary mucins; the high molecular weight polymeric, gel- forming mucin MUC5B and the lower molecular weight, non-polymeric mucin MUC7 [5- 7]. Two distinctive structural characteristics of these glycoproteins have important functional consequences [8]. First, the MUC5B and MUC7 mucin polypeptides, have a large central domain, with tandemly repeated sequences enriched in serine and/or threonine residues (the TR or mucin domain), which are the sites of extensive substitution with O- glycans. Second, upstream and downstream of the MUC5B mucin domain are regions of the polypeptide similar to von Willebrand Factor D domains, which are important for mucin polymer formation [8]. Thus, as a result of their extreme size and abundance of negatively charged O-glycans these glycoproteins play key roles in hydration and lubrication of the oral surfaces. Furthermore, mucins bind to, and sequester bacteria via their glycans and domains [9;10].

Until 2003, four gel-forming mucins had been reported, namely MUC6, MUC2, MUC5AC and MUC5B which are encoded by four consecutive on 11p15.5 [11]. These mucins are expressed in a tissue, and cell specific manner. For instance, in , MUC6 is expressed in mucous cells of submucosal glands in the stomach, MUC5B is mainly expressed by the mucous cells of the salivary glands and the submucosal glands in the airways, MUC5AC is expressed by the goblet cells of the airways and the stomach, and MUC2 is mainly expressed by the goblet cells in the intestine [12-17]. This tissue-specific distribution has been found to be relatively well conserved between species [18;19]. For example, in the mouse, Muc5b, along with Muc5ac, have been found to be expressed in the airways, Muc2 is mainly expressed in the intestine and Muc5ac and Muc6 are both found in the stomach [20-23]. However, differences in expression between mouse and human have also been reported. For example, in contrast to humans, no evidence of Muc5b (nor Muc2, Muc5ac or Muc6) expression has been reported in mouse saliva [22;23]. In 2003, MUC19 was reported as a fifth human gel-forming mucin. Although, neither the complete amino acid nor mRNA sequence have been published, MUC19 is predicted to have similar structural features to the other human gel-forming mucins [24]. THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260 MUC19 gene expression has been reported in human airways and, in particular, in human salivary glands [24]. In the mouse, the complete sequence of the Muc19 gene has been described and it is strongly expressed in sublingual and submandibular salivary glands [24;25]. Furthermore, mRNA sequences from porcine and bovine submaxillary mucins show high sequence similarity to the mouse Muc19 sequence [26-28]. On the basis of these data, MUC19/Muc19 would be expected to be present in saliva from humans and other mammals. However, studies on human saliva have identified MUC5B as the predominant polymeric mucin [7;16], and the MUC19 glycoprotein has yet to be identified in human saliva, or for that matter, in the saliva from other mammals. Moreover, in other animals, it Stage 2(a) POST-PRINT 3

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

is not known whether Muc5b is a major salivary mucin. Therefore, in this study, our first aim was to purify the polymeric mucins from human saliva in order to identify whether MUC19 was present. Subsequently, we analysed the saliva collected from horse, rat, pig and cow to identify if Muc5b and/or Muc19 were present. THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260

Stage 2(a) POST-PRINT 4

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

EXPERIMENTAL METHODS Saliva collection: Unstimulated saliva was collected from 6 healthy human volunteers (between 20 and 35 of age) by spitting into 50 ml tubes that were kept on ice during the collection. Horse, cow and pig saliva were collected from excess secretions drooled while feeding. Necessary ethical approval was obtained for all the above animal sampling. Rats were anaesthetised with ketamine hydrochloride (100mg/kg) and xylazine (10mg/kg) by intraperitoneal injection and were injected intraperitoneally with the cholinergic agonist pilocarpine (25mg/kg) to stimulate salivary secretion. The anaesthetised rats were laid on their side and saliva dripping from the mouth was collected into plastic petri dishes. Experiments were carried out in accordance with the UK Home office guidelines for animal welfare based on the Animals (Scientific Procedures) Act 1986. Equal volumes of 8M guanidinium chloride (GdmCl) were added to each sample at the time of collection and up to 5 volumes of 4M GdmCl were subsequently added to solubilise the mucins. This step was performed by gentle agitation for at least 24 h at 4°C.

Purification and identification of human and horse salivary mucins: Human and equine salivary mucins, solubilised as above, were purified by caesium chloride/4M GdmCl density gradient centrifugation at a starting density of 1.4g/ml. The samples were centrifuged for at least 68 hours at 40 000 rpm and 15°C in a Beckman Ti45 rotor. After centrifugation tubes were emptied from the top into 20 fractions. An aliquot of each fraction was slot blotted onto a nitrocellulose membrane and stained with periodic acid Schiff’s (PAS) reagent according to our previously published method [29]. The PAS-rich fractions were pooled, reduced and carboxymethylated, dialysed into water and then lyophilised. The mucins present in the sample were identified by using tandem mass spectrometry as described previously [19]. In brief, samples were re-suspended into 0.1M ammonium bicarbonate prior to digestion with trypsin. The low molecular weight tryptic peptides were isolated by Sephacryl S-100 gel filtration chromatography and then separated by reverse- phase chromatography and analysed in-line by positive ion electrospray ionisation-MS-MS using a Q-ToF micro mass spectrometer (Waters, Manchester, UK). MS-MS data were analysed using a custom database containing putative mucin sequences (see below).

Mucin sequence prediction: Published Muc19 and MUC5B sequences (accession numbers NM_207243, AF005273 and AJ004862 [25;27;30]) obtained from mRNA sequencing data were used to perform a BLAST search of the genome sequence database of human, horse, bovine, pig and rat genomes. The Muc5b and Muc19 genes in each genome were THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260 determined based on gene synteny and sequence similarity. In the human (chromosome 12), horse (chromosome number not identified) and rat (chromosome 7) genomes, the LRRK2 (Leucine Rich Repeat Kinase 2) gene is located upstream of Muc19 while the CNTN1 (Contactin 1) gene is located downstream of Muc19. In the current version of the cow genome, the Muc19 gene is located downstream of the LRRK2 gene on chromosome 5 but the CNTN1 gene is not in the same loci. However, the genome contains sequence gaps and the assembly might not be complete. In the , MUC5B is part of a MUC gene complex located between the APA2 gene (adaptor-related protein complex 2, alpha 2 subunit) and the Tollip gene (Toll-interacting protein). This was also the case in the horse Stage 2(a) POST-PRINT 5

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

genome, while, in the rat and cow genomes, it is not yet clear if synteny is conserved because of the gaps in the genomic sequence and the incomplete assembly of these genomes. The structure of the genes was predicted based on the conservation of the exon/intron structure and amino acid sequence, which has been shown to be very tightly conserved between oligomeric mucins across species [22;31]. The genome databases were accessed through the UCSC browser (http://genome.cse.ucsc.edu/) and the ensemble browser (http://www.ensembl.org/index.html). It should be noted that the predicted N- terminal sequences do not represent the entire amino terminal sequence, but start from exon 4 and 2 for Muc19 and Muc5b respectively. The predicted exonic sequences were translated and the resultant amino acid sequences of putative Muc5b and Muc19 mucin sequences were added to an in-house custom mucin database.

Database searching-- Tandem mass spectra were extracted, charge state deconvoluted and deisotoped by Masslynx version 4.0 (Waters, Manchester, UK). All MS-MS samples were analyzed using Mascot (Matrix Science, London, UK; version 2.2.03) and X! Tandem (www.thegpm.org; version 2007.01.01.1). X! Tandem was set up to search a subset of our in-house custom mucin database. Mascot was set up to search the MSDB_2006-Sep-08 database (selected for Mammalian) and our in-house custom mucin database. Mascot and X! Tandem were searched with a fragmentation mass tolerance of 0.4 Da and a parent ion tolerance of 1.3 Da. Iodoacetamide derivative of cysteine was specified in Mascot and X! Tandem as a fixed modification. Oxidation of methionine was specified in Mascot and X! Tandem as a variable modification.

Criteria for protein identification-- Scaffold (version Scaffold-01_07_00, Proteome Software Inc., Portland, OR) was used to validate MS-MS based peptide and protein identifications. Peptide identifications were accepted if they could be established at greater than 90% probability as specified by the PeptideProphet algorithm [32]. Protein identifications were accepted if they could be established at greater than 90% probability and contained at least 1 identified peptide. Protein probabilities were assigned by the ProteinProphet algorithm [33]. Proteins that contained similar peptides and could not be differentiated based on MS-MS analysis alone were grouped to satisfy the principles of parsimony.

Identification of mucins in the saliva of cow, rat and pig: To determine if Muc19 and Muc5b were present in cow, rat and pig saliva, we employed a less lengthy procedure to isolate the mucins. This was because we only wished to determine if the mucin was present THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260 as opposed to obtaining maximum coverage of the polypeptide. For this approach, our aim was to enrich rather than purify the mucins. In brief, saliva from cow, rat and pig was dissolved with 8M GdmCl and then dialysed into 6M urea. An aliquot of this non-purified saliva was reduced and subjected to SDS-PAGE using NuPAGE 4 to12% bis-acrylamide gels (Invitrogen, Paisley, UK). Gels were run for 2 hours at 150V in NuPAGE MES-SDS running buffer and NuPAGE LDS sample buffer. After electrophoresis the gels were stained with PAS [29] and the high molecular weight smears at the top of the gel were excised, in-gel digested with trypsin [34] and analysed by tandem mass spectrometry as described above. Stage 2(a) POST-PRINT 6

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

Rate-zonal centrifugation: To determine if Muc19 is a polymeric mucin stabilised by disulphide bonds, untreated mucins and reduced, and carboxymethylated mucins from horse and rat, were layered onto 6-8M GdmCl gradients and centrifuged at 40000 rpm in a SW40 Ti Beckman rotor for 2 ½ h at 15°C as described previously [19]. The tubes were emptied into 24 fractions from the top and the mucin distribution analysed by PAS- staining.

MUC5B and MUC19 mRNA relative expression in human salivary glands: Normal human salivary gland total RNA (Clontech 636552, 24 male/female Caucasians, age 16 to 60) was reverse transcribed into cDNA using random primers. The SYBR green system (Eurogentec) was then used to amplify MUC5B and MUC19 cDNA by Real-Time quantitative PCR using the protocol supplied by the manufacturer. The oligonucleotide primers used to amplify MUC5B were; MUC5B-forward primer 5’ ACCCAACGGTGCAATGTCA 3’and MUC5B-reverse primer 5’CGGCCACTCTCTTGTACTCAAAG 3’ and the oligonucleotides primers used to amplify MUC19 were; MUC19-forward primer 5’GAGTTCAGATGGCAAAATGCACA 3’ and MUC19-reverse primer 5’ TGCCATCAGGACAGTCAAGTACA 3’ [24]. The efficiency of each pair of primers was determined by using 5 serial dilutions of the cDNA and was ≥ 99.8%. The identity of the amplified products was confirmed by sequencing. THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260

Stage 2(a) POST-PRINT 7

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

RESULTS The first aim of the study was to determine whether MUC19 was present in human saliva. Our approach, which we have employed previously to identify human and horse polymeric mucins [19;35], was to purify salivary mucins, and then use tandem mass spectrometry of tryptic peptides to identify MUC19. However, only a partial MUC19 sequence, corresponding to the C-terminus of the mucin, has been reported [24]. Therefore, to maximise the chance of identifying MUC19, most of the polypeptide sequence of the N- terminus of MUC19 was predicted (for details see the experimental section). This resulted in a 1178 amino acid sequence, which was added to an in-house mucin protein sequence database for subsequent analysis of tandem MS data.

Identification of polymeric mucins in human saliva: Human saliva, pooled from 6 individuals, was solubilised in 4M GdmCl and fractionated by density gradient centrifugation. PAS-staining of fractions across the density distribution showed that the high-density fractions contained the bulk of the mucins (Fig. 1). The major PAS-rich peak was pooled (fractions 11-18), reduced and carboxymethylated, dialysed against water and lyophilised in order to concentrate the sample. The lyophilised material was digested with trypsin and the low molecular weight tryptic peptides were analysed by tandem MS [19]. This revealed 84 unique peptides which matched MUC5B (Fig. 2), but no peptides that matched MUC19 (Fig 3). Since MUC19 was not found in the high-density, PAS-rich fractions, the lower density material (fractions 1 to 10) from the density gradient was pooled and analysed in the same manner as above. The tandem MS analysis showed the presence of 5 peptides that matched MUC5B but no peptides that matched MUC19.

mRNA expression in human salivary glands: The apparent absence of MUC19 in human saliva extracts is at odds with published mRNA expression data [24]. Therefore, to confirm previous expression data and, to determine the relative expression of MUC5B to MUC19 mRNA we performed quantitative Real-Time PCR on pooled human salivary gland cDNA. This analysis showed MUC19 mRNA was expressed in human salivary glands and that there was approximately 10-fold more MUC5B mRNA than MUC19 mRNA. The presence of MUC19 mRNA in salivary glands suggests that our mass spectrometry-based approach might not have been able to detect this mucin. Therefore, to investigate whether our approach could identify this mucin, we collected saliva from other mammals (cow, horse, rat, and pig), isolated the mucins and tested for Muc19, and the homologue of the major human salivary mucin, Muc5b, using tandem MS of tryptic THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260 peptides. However, for this to be successful we first had to predict the polypeptide sequences for horse, cow, pig and rat Muc19 and Muc5b.

Predicted N-terminal sequences of Muc5b and Muc19 from cow, horse, rat and pig: Mucin polypeptide sequences for cow, horse, rat, and pig were predicted as described in the experimental section. The Muc19 and Muc5b N-terminal deduced amino acid sequence alignments are shown in Figures 3 and 4 respectively. The Muc19 sequences show approximately 68% identity and that 99% of the cysteine residues (100/101) are conserved. Stage 2(a) POST-PRINT 8

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

The alignment of the Muc5b sequences shows they share 67% identity and all of the 90 cysteine residues are conserved.

Identification of Muc19 and Muc5b mucins from horse saliva: The purification and identification strategy used for analysis of human salivary mucins was employed for the analysis of horse saliva. The CsCl/4M GdmCl density gradient fractionation of horse saliva is shown in Figure 5. The high-density mucins (fractions 10-16) were reduced and carboxymethylated, digested with trypsin and subjected to tandem MS analysis. The results showed that 13 peptides matched the predicted N-terminal polypeptide of Muc5b and 25 peptides matched the predicted N-terminal polypeptide sequence of Muc19 (Table 1, Figs. 3 & 4). This validated our approach and showed the presence of Muc5b and Muc19 mucins in horse saliva. We then investigated whether one or both of these two mucins were present in the saliva of the cow, pig and rat.

Identification of Muc19 and Muc5b mucins from cow, pig and rat saliva: To more rapidly identify the mucins present in the saliva of these 3 animals we exploited the low electrophoretic migration of polymeric mucins in SDS-PAGE [36]. Solubilised saliva collected from rat, cow and pig was subjected to SDS-PAGE on a 4 to 12% gradient gel. After electrophoresis, gels were stained with the PAS reagent and the carbohydrate-rich smears located at the top of the gels (data not shown), were excised and digested with trypsin. Tandem MS analysis showed the presence of both Muc5b and Muc19 in rat and cow saliva (Table 1, Fig 3 & 4). Muc19 was also present in pig saliva with 8 peptide matches (Table 1, Fig. 3). However, the presence of Muc5b in pig saliva could not be tested because no Muc5b sequence is currently available in the pig genome.

Is Muc19 a polymeric mucin?: Muc19 is reported to be a polymeric mucin based on the presence of the von Willebrand factor-like D domains which are necessary for the disulphide-bond mediated oligomerisation of gel-forming mucins [24]. In order to test this, we examined if there was a change in size distribution of horse and rat salivary mucins (both mixtures of Muc5b and Muc19), before and after treatment with a reducing agent (10mM DTT), by rate zonal centrifugation (Fig. 6). The unreduced mucins were characterized by a broad range of sedimentation rates, characteristic of a polydisperse distribution of mucins [37;38], which, after reduction, exhibited a lower sedimentation rate. These results provide evidence that Muc19 (and Muc5b) is a polymeric mucin stabilised by disulphide bonds. THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260

Stage 2(a) POST-PRINT 9

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

DISCUSSION The finding that the MUC19 gene is expressed in human salivary glands [24] raised the issue that a major component of the protective barrier in the mouth has been overlooked in previous studies on saliva. For instance, we have reported that MUC5B was the polymeric mucin responsible for the properties of saliva [7;35;39]. However, using standard methods for extraction and purification of polymeric mucins [40], we were unable in the present study to detect MUC19 either in the main mucin preparation after density gradient purification, or in any of the other fractions from the density gradient. Thus, the data presented here confirms these earlier studies and suggests that MUC5B is the predominant polymeric mucin in adult human saliva (pooled from 6 healthy volunteers); and that MUC19 is apparently absent. There are a number of reasons why MUC19 might not have been detected, (i) it was not solubilised in 4M GdmCl; (ii) peptides may not have been produced by trypsin digestion; thus the mucin would not have been identified using the MS-based approach; (iii) it is not present in saliva, or (iv) MUC19 is present but at levels below the limit of detection of the MS analysis. It seems unlikely that the first two reasons explain the lack of MUC19 since, using the same methodology, Muc19 was found in horse saliva. In addition, using the MS-based identification strategy, Muc19 was also present in rat, pig and cow saliva. We therefore propose that MUC19 is either not present in adult human saliva or is present at comparatively low levels. The latter proposal would seem to be favoured by the mRNA expression data which showed, in pooled human salivary gland mRNA, that MUC19 mRNA is only around 10% of the level of MUC5B mRNA. Thus, if the two mRNAs are translated with similar efficiency this would suggest MUC19 is a more minor component of saliva than MUC5B. However, on the basis on the large number of peptides (84 in total) matched to MUC5B from the mass spectrometry data, we might reasonably have expected to detect peptides from MUC19 if it were present at 10% of the level of MUC5B in saliva. Therefore, we cannot discount other possibilities to explain the apparent absence of MUC19 from human saliva. It is possible that the mRNA is not, or is only poorly translated into protein, the mucin is only present during development or childhood, or the protein is made but not secreted under normal circumstances. Interestingly, MUC19 levels (mRNA and protein) have been shown to be altered in the conjunctival tissue from a patient with Sjögrens syndrome [41], a condition that also affects saliva production. However, these authors did not investigate changes in MUC19 production in salivary glands.

Compositional differences of saliva have been reported between different mammals; for example, in amylase, histatin and proline-rich proteins [42;43]. Thus, the difference in THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260 mucin composition between the different animals is not surprising. The functional significance of having or not having Muc19 in saliva is at present obscure. One might speculate that it impacts on the physical protective properties of saliva, in terms of hydration and lubrication, and/or it influences the innate defence properties of saliva, i.e. pathogen binding and sequestration of toxins. It is possible that the absence or low level of MUC19 in human saliva is an evolutionary adaptation, relating to diet or oral flora.

The polymeric properties of MUC5B/Muc5b are an established feature of this mucin [16;19;44-46], however, the oligomerisation potential of Muc19 was based on its Stage 2(a) POST-PRINT 10

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

sequence similarity to the other gel-forming mucins [24]. Here, we have provided experimental evidence that Muc19 from horse and rat forms polymers held together by disulphide bounds. While this is the first study that specifically demonstrates Muc19 is a polymeric glycoprotein, earlier studies on recombinant N- and C-terminal fragments of PSM (now known to be Muc19) showed they were able to assemble in a disulphide-bond dependent manner [47-49], further confirming the polymeric nature of Muc19/PSM.

Horse, rat, and cow saliva are a mixture of two polymeric mucins; Muc5b and Muc19. Mucus gels from other tissues have also been shown to be mixtures of two mucins; in the airways MUC5AC/Muc5ac and MUC5B/Muc5b [19;50] and in the stomach MUC5AC and MUC6 [13]. In all of these tissues the specific role of each mucin is not well understood. However, potential roles for the glycan structures on the mucins in the stomach have been proposed in defence of the epithelium against the bacterium H. pylori. Leb and sialyl Lex glycans on MUC5AC can act as receptors for H. pylori [51] and α1-4 linked N- acetylglucosamine termini of glycans on MUC6 act as an antibiotic [52]. Maybe Muc5b and Muc19 have similar roles in the oral cavity. Interestingly, in the rat we do have preliminary evidence that Muc19 has higher charge density than Muc5b (K. Rousseau and D.J. Thornton, unpublished observation).

In summary, our results confirm the earlier findings of Chen and co-workers [24] that MUC19 mRNA is expressed in human salivary glands; however, we have demonstrated that the MUC19 glycoprotein is not a major component of human adult saliva. Furthermore, we have shown for the first time that horse, cow and rat saliva do contain Muc19 and these secretions are mixtures of Muc5b and Muc19. Future studies are required to investigate the functional relevance of differences in mucin composition of saliva in the physiology of the oral cavity.

ACKNOWLEDGMENTS We would like to thank Emma Keevill and Julian Selley for their technical assistance with mass spectrometry analyses. This work was supported by the Wellcome Trust, the Horserace Betting Levy Board and the BBSRC. THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260

Stage 2(a) POST-PRINT 11

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

Table 1: Numbers of peptides identified by tandem mass spectrometry from saliva extracts. species MUC5B* Muc5b MUC19 Muc19 Human 84 - none - Purified mucin Horse - 13 - 25 Rat - 5 - 20 Non-purified Cow - 7 - 3 saliva Pig - NT - 8

NT: not tested, the pig Muc5b sequence was not available in the pig genome. *Peptides matched to full length MUC5B sequence, for the others we only searched for matches against N-terminal polypeptide sequences. Dashes mean that samples were not analysed against these sequences. THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260

Stage 2(a) POST-PRINT 12

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

FIGURE LEGENDS

Fig.1. Purification of human salivary mucins. Human salivary mucins were extracted from saliva with 4M GdmCl and subjected to CsCl/ 4M GdmCl density gradient centrifugation. After centrifugation the tube was emptied from the top and fractions were analysed for carbohydrate by PAS assay [29] (filled circles) and their density was determined by weighing (open squares). Fractions 11 to 18, which contain the mucins, were pooled for subsequent tandem MS analysis.

Fig.2. MUC5B peptide matches from tandem mass spectrometry analyses of purified human salivary mucins. Tryptic peptides generated from the mucins purified from human saliva by density gradient centrifugation were analysed by tandem MS. The 84 peptides that matched the human MUC5B sequence (Q9HC84) are highlighted in the black boxes.

Fig.3. Alignment of human MUC19, with horse, cow, pig (AAC62527) and rat Muc19 partial N-terminal sequences. The amino acids which are identical between all species are shown with an asterisk (*) under the alignment. The cysteine residues are highlighted in grey. The translations shown here are from the second exon of each Muc19 gene. The peptide matches found by tandem mass spectrometry analysis of mucins from the saliva from each animal are highlighted in the black boxes. No peptides were found that matched MUC19 in the human salivary mucin preparation. In contrast, peptides that matched to the putative Muc19 sequence were found in horse (25 peptides), cow (3 peptides), pig (8 peptides) and rat (20 peptides).

Fig.4. Alignment of human MUC5B (AAG33673) with horse, cow and rat Muc5b partial N-terminal sequence. The amino acids which are identical between all species are shown with an asterisk (*) under the alignment. The cysteine residues are highlighted in grey. The translations shown here are from the fourth exon of each Muc5b gene. The cow Muc5b sequences contains two gaps (due to the incomplete genome data) which are indicated with an x. The peptide matches found by tandem mass spectrometry analysis of mucins from the saliva from each animal are highlighted in the black boxes. Peptides that matched to the putative Muc5b sequence were found in human (42 peptides) horse (13 peptides), cow (7 peptides), rat (5 peptides). The pig Muc5b sequence was not found in the pig genome and therefore no peptide matches could be identified. THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260

Fig.5. Purification of horse salivary mucins Horse saliva was fractionated by CsCl/4M GdmCl density gradient centrifugation. After centrifugation the tube was emptied from the top and fractions were analysed for carbohydrate by PAS assay [29] (filled circles) and their density was determined by weighing (open squares). Fractions 10 to 16, which contain the mucins, were pooled for subsequent tandem MS analysis.

Stage 2(a) POST-PRINT 13

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

Fig.6. Size distribution of horse and rat salivary mucins. Horse (top panel) and rat (bottom panel) saliva extracts (4M GdmCl), before (filled circles) and after reduction (open circles), were centrifuged on 6-8M GdmCl gradients. After centrifugation tubes were emptied from the top and fractions were analyzed for carbohydrate using a PAS assay [29]. Fractions 1 (6M GdmCl) and 24 (8M GdmCl) are the top and the bottom of the tube respectively. THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260

Stage 2(a) POST-PRINT 14

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

Reference List

1 Helmerhorst, E. J. and Oppenheim, F. G. (2007) Saliva: a dynamic proteome. J. Dent. Res. 86, 680-93 2 Hu, S., Xie, Y., Ramachandran, P., Ogorzalek Loo, R. R., Li, Y., Loo, J. A., and Wong, D. T. (2005) Large-scale identification of proteins in human salivary proteome by liquid chromatography/mass spectrometry and two-dimensional gel electrophoresis-mass spectrometry. Proteomics 5, 1714-28 3 Whelton, H. (1996) in Saliva in oral health (Edgar, W. M. and O'Mullane, D. M., eds.), pp. 1-8, British Dental Journal, London 4 Nieuw Amerongen, A. V., Ligtenberg, A. J., and Veerman, E. C. (2007) Implications for diagnostics in the biochemistry and physiology of saliva. Ann. N. Y. Acad. Sci. 1098, 1-6 5 Bobek, L. A., Tsai, H., Biesbrock, A. R., and Levine, M. J. (1993) Molecular cloning, sequence, and specificity of expression of the gene encoding the low molecular weight human salivary mucin (MUC7). J. Biol. Chem. 268, 20563-69 6 Mehrotra, R., Thornton, D. J., and Sheehan, J. K. (1998) Isolation and physical characterization of the MUC7 (MG2) mucin from saliva: evidence for self- association. Biochem. J. 334, 415-22 7 Thornton, D. J., Khan, N., Mehrotra, R., Howard, M., Veerman, E., Packer, N. H., and Sheehan, J. K. (1999) Salivary mucin MG1 is comprised almost entirely of different glycosylated forms of the MUC5B gene product. Glycobiology 9, 293-302 8 Thornton, D. J., Rousseau, K., and McGuckin, M. A. (2008) Structure and Function of the Polymeric Mucins in Airways Mucus. Annu. Rev. Physiol 70, 459-86 9 Veerman, E. C., Ligtenberg, A. J., Schenkels, L. C., Walgreen-Weterings, E., and Nieuw Amerongen, A. V. (1995) Binding of human high-molecular-weight salivary mucins (MG1) to Hemophilus parainfluenzae. J. Dent. Res. 74, 351-57 10 Veerman, E. C., Bank, C. M., Namavar, F., Appelmelk, B. J., Bolscher, J. G., and Nieuw Amerongen, A. V. (1997) Sulfated glycans on oral mucin as receptors for Helicobacter pylori. Glycobiology 7, 737-43 11 Pigny, P., Guyonnet-Duperat, V., Hill, A. S., Pratt, W. S., Galiegue-Zouitina, S., d'Hooge, M. C., Laine, A., Van-Seuningen, I., Degand, P., Gum, J. R., Kim, Y. S., Swallow, D. M., Aubert, J. P., and Porchet, N. (1996) Human mucin genes assigned to 11p15.5: identification and organization of a cluster of genes. Genomics 38, 340-52 12 Davies, J. R., Svitacheva, N., Lannefors, L., Kornfalt, R., and Carlstedt, I. (1999) Identification of MUC5B, MUC5AC and small amounts of MUC2 mucins in cystic THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260 fibrosis airway secretions. Biochem. J. 344, 321-30 13 Nordman, H., Davies, J. R., Lindell, G., De Bolos, C., Real, F., and Carlstedt, I. (2002) Gastric MUC5AC and MUC6 are large oligomeric mucins that differ in size, glycosylation and tissue distribution. Biochem. J. 364, 191-200 14 Toribara, N. W., Gum, J. R., Jr., Culhane, P. J., Lagace, R. E., Hicks, J. W., Petersen, G. M., and Kim, Y. S. (1991) MUC-2 human small intestinal mucin gene structure. Repeated arrays and polymorphism. J. Clin. Invest. 88, 1005-13 15 Vinall, L. E., Fowler, J. C., Jones, A. L., Kirkbride, H. J., De Bolos, C., Laine, A., Porchet, N., Gum, J. R., Kim, Y. S., Moss, F. M., Mitchell, D. M., and Swallow, D. Stage 2(a) POST-PRINT 15

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

M. (2000) Polymorphism of human mucin genes in chest disease: possible significance of MUC2. Am. J. Respir. Cell Mol. Biol. 23, 678-86 16 Wickstrom, C., Davies, J. R., Eriksen, G. V., Veerman, E. C., and Carlstedt, I. (1998) MUC5B is a major gel-forming, oligomeric mucin from human salivary gland, respiratory tract and endocervix: identification of glycoforms and C-terminal cleavage. Biochem. J. 334, 685-93 17 Ho, S. B., Takamura, K., Anway, R., Shekels, L. L., Toribara, N. W., and Ota, H. (2004) The adherent gastric mucous layer is composed of alternating layers of MUC5AC and MUC6 mucin proteins. Dig. Dis. Sci. 49, 1598-1606 18 Inatomi, T., Tisdale, A. S., Zhan, Q., Spurr-Michaud, S., and Gipson, I. K. (1997) Cloning of rat Muc5AC mucin gene: comparison of its structure and tissue distribution to that of human and mouse homologues. Biochem. Biophys. Res. Commun. 236, 789-97 19 Rousseau, K., Kirkham, S., McKane, S., Newton, R., Clegg, P., and Thornton, D. (2007) Muc5b and Muc5ac are the major oligomeric mucins in equine airway mucus. Am. J. Physiol. Lung Cell Mol. Physiol. 292, L1396-L1404 20 Borchers, M. T., Wesselkamper, S., Wert, S. E., Shapiro, S. D., and Leikauf, G. D. (1999) Monocyte inflammation augments acrolein-induced Muc5ac expression in mouse lung. Am. J. Physiol. 277, 489-97 21 Chen, Y., Zhao, Y. H., and Wu, R. (2001) In silico cloning of mouse Muc5b gene and upregulation of its expression in mouse asthma model. Am. J. Respir. Crit. Care Med. 164, 1059-66 22 Escande, F., Porchet, N., Aubert, J. P., and Buisine, M. P. (2002) The mouse Muc5b mucin gene: cDNA and genomic structures, chromosomal localization and expression. Biochem. J. 363, 589-98 23 Escande, F., Porchet, N., Bernigaud, A., Petitprez, D., Aubert, J. P., and Buisine, M. P. (2004) The mouse secreted gel-forming mucin gene cluster. Biochim. Biophys. Acta 1676, 240-50 24 Chen, Y., Zhao, Y. H., Kalaslavadi, T. B., Hamati, E., Nehrke, K., Le, A. D., Ann, D. K., and Wu, R. (2004) Genome-Wide Search and Identification of a Novel Gel- Forming Mucin MUC19/Muc19 in Glandular Tissues. Am. J. Respir. Cell Mol. Biol. 30, 155-65 25 Culp, D. J., Latchney, L. R., Fallon, M. A., Denny, P. A., Denny, P. C., Couwenhoven, R. I., and Chuang, S. (2004) The gene encoding mouse Muc19: cDNA, genomic organization and relationship to Smgc. Physiol. Genomics 19, 303- 18 THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260 26 Bhargava, A. K., Woitach, J. T., Davidson, E. A., and Bhavanandan, V. P. (1990) Cloning and cDNA sequence of a bovine submaxillary gland mucin-like protein containing two distinct domains. Proc. Natl. Acad. Sci. U.S.A. 87, 6798-6802 27 Eckhardt, A. E., Timpte, C. S., DeLuca, A. W., and Hill, R. L. (1997) The complete cDNA sequence and structural polymorphism of the polypeptide chain of porcine submaxillary mucin. J. Biol. Chem. 272, 33204-210 28 Jiang, W., Woitach, J. T., Gupta, D., and Bhavanandan, V. P. (1998) Sequence of a second gene encoding bovine submaxillary mucin: implication for mucin heterogeneity and cloning. Biochem. Biophys. Res. Commun. 251, 550-56 Stage 2(a) POST-PRINT 16

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

29 Thornton, D. J., Carlstedt, I., and Sheehan, J. K. (1996) Identification of glycoproteins on nitrocellulose membranes and gels. Mol. Biotechnol. 5, 171-76 30 Desseyn, J. L., Buisine, M. P., Porchet, N., Aubert, J. P., and Laine, A. (1998) Genomic organization of the human mucin gene MUC5B. cDNA and genomic sequences upstream of the large central exon. J. Biol. Chem. 273, 30157-164 31 Buisine, M. P., Desseyn, J. L., Porchet, N., Degand, P., Laine, A., and Aubert, J. P. (1998) Genomic organization of the 3'-region of the human MUC5AC mucin gene: additional evidence for a common ancestral gene for the 11p15.5 mucin gene family. Biochem. J. 332, 729-38 32 Keller, A., Nesvizhskii, A. I., Kolker, E., and Aebersold, R. (2002) Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search. Anal. Chem. 74, 5383-92 33 Nesvizhskii, A. I., Keller, A., Kolker, E., and Aebersold, R. (2003) A statistical model for identifying proteins by tandem mass spectrometry. Anal. Chem. 75, 4646-58 34 Shevchenko, A., Wilm, M., Vorm, O., and Mann, M. (1996) Mass spectrometric sequencing of proteins silver-stained polyacrylamide gels. Anal. Chem. 68, 850-58 35 Raynal, B. D., Hardingham, T. E., Sheehan, J. K., and Thornton, D. J. (2003) Calcium-dependent protein interactions in MUC5B provide reversible cross-links in salivary mucus. J. Biol. Chem. 278, 28703-10 36 Swallow, D. M., Gendler, S., Griffiths, B., Corney, G., Taylor-Papadimitriou, J., and Bramwell, M. E. (1987) The human tumour-associated epithelial mucins are coded by an expressed hypervariable gene locus PUM. Nature 328, 82-84 37 Sheehan, J. K. and Thornton, D. J. (2000) Heterogeneity and size distribution of gel- forming mucins. Methods Mol.Biol. 125, 87-96 38 Thornton, D. J., Davies, J. R., Kraayenbrink, M., Richardson, P. S., Sheehan, J. K., and Carlstedt, I. (1990) Mucus glycoproteins from 'normal' human tracheobronchial secretion. Biochem. J. 265, 179-86 39 Raynal, B. D., Hardingham, T. E., Thornton, D. J., and Sheehan, J. K. (2002) Concentrated solutions of salivary MUC5B mucin do not replicate the gel-forming properties of saliva. Biochem. J. 362, 289-96 40 Davies, J. R. and Carlstedt, I. (2000) in Glycoprotein Methods and protocols, The mucins (Corfield, A. P., ed.), pp. 3-13, Humana Press, 41 Yu, D. F., Chen, Y., Han, J. M., Zhang, H., Chen, X. P., Zou, W. J., Liang, L. Y., Xu, C. C., and Liu, Z. G. (2008) MUC19 expression in human ocular surface and lacrimal gland and its alteration in Sjogren syndrome patients. Exp. Eye Res. 86, 403-11 42 Jacobsen, N., Lyche Melvaer, K., and Hensten-Pettersen, A. (1972) Some Properties THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260 of Salivary Amylase: A Survey of the Literature and Some Observations. J. Dent. Res. Supplement to No 2 51, 381-88 43 Shimada, T. (2006) Salivary proteins as a defense against dietary tannins. J. Chem. Ecol. 32, 1149-63 44 Sheehan, J. K., Richardson, P. S., Fung, D. C., Howard, M., and Thornton, D. J. (1995) Analysis of respiratory mucus glycoproteins in asthma: a detailed study from a patient who died in status asthmaticus. Am. J. Respir. Cell Mol. Biol. 13, 748-56 Stage 2(a) POST-PRINT 17

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

45 Thornton, D. J., Sheehan, J. K., Lindgren, H., and Carlstedt, I. (1991) Mucus glycoproteins from cystic fibrotic sputum. Macromolecular properties and structural 'architecture'. Biochem. J. 276, 667-75 46 Wickstrom, C., Christersson, C., Davies, J. R., and Carlstedt, I. (2000) Macromolecular organization of saliva: identification of 'insoluble' MUC5B assemblies and non-mucin proteins in the gel phase. Biochem. J. 351, 421-28 47 Perez-Vilar, J., Eckhardt, A. E., and Hill, R. L. (1996) Porcine submaxillary mucin forms disulfide-bonded dimers between its carboxyl-terminal domains. J. Biol. Chem. 271, 9845-50 48 Perez-Vilar, J. and Hill, R. L. (1998) Identification of the half-cystine residues in porcine submaxillary mucin critical for multimerization through the D-domains. Roles of the CGLCG motif in the D1- and D3-domains. J. Biol. Chem. 273, 34527-34 49 Perez-Vilar, J. and Hill, R. L. (1998) The carboxyl-terminal 90 residues of porcine submaxillary mucin are sufficient for forming disulfide-bonded dimers. J. Biol. Chem. 273, 6982-88 50 Kirkham, S., Sheehan, J. K., Knight, D., Richardson, P. S., and Thornton, D. J. (2002) Heterogeneity of airways mucus: variations in the amounts and glycoforms of the major oligomeric mucins MUC5AC and MUC5B. Biochem. J. 361, 537-46 51 Linden, S., Nordman, H., Hedenbro, J., Hurtig, M., Boren, T., and Carlstedt, I. (2002) Strain- and blood group-dependent binding of Helicobacter pylori to human gastric MUC5AC glycoforms. Gastroenterology 123, 1923-30 52 Kawakubo, M., Ito, Y., Okimura, Y., Kobayashi, M., Sakura, K., Kasama, S., Fukuda, M. N., Fukuda, M., Katsuyama, T., and Nakayama, J. (2004) Natural antibiotic function of a human gastric mucin against Helicobacter pylori infection. Science 305, 1003-06

THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260

Stage 2(a) POST-PRINT 18

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

Figure 1

1.6 (g/ml) Density 3

PAS response 1.4

2 1.2

1

0 5101520 Fraction number THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260

Stage 2(a) POST-PRINT

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

FIGURE 2

MGAPSACRTLVLALAAMLVVPQAETQGPVEPSWGNAGHTMDGGAPTSSPTRRVSFVPPVTVFPSLSPLNPAHNGRVCSTWGDFHYKTFDGD VFRFPGLCNYVFSEHCRAAYEDFNVQLRRGLVGSRPVVTRVVIKAQGLVLEASNGSVLINGQREELPYSRTGLLVEQSGDYIKVSIRLVLT FLWNGEDSALLELDPKYANQTCGLCGDFNGLPAFNEFYAHNARLTPLQFGNLQKLDGPTEQCPDPLPLPAGNCTDEEGICHRTLLGPAFAE CHALVDSTAYLAACAQDLCRCPTCPCATFVEYSRQCAHAGGQPRNWRCPELCPRTCPLNMQHQECGSPCTDTCSNPQRAQLCEDHCVDGCF CPPGSTVLDDITHSGCLPLGQCPCTHGGRTYSPGTSFNTTCSSCTCSGGLWQCQDLPCPGTCSVQGGAHISTYDEKLYDLHGDCSYVLSKK CADSSFTVLAELRKCGLTDNENCLKAVTLSLDGGDTAIRVQADGGVFLNSIYTQLPLSAANITLFTPSSFFIVVQTGLGLQLLVQLVPLMQ VFVRLDPAHQGQMCGLCGNFNQNQADDFTALSGVVEATGAAFANTWKAQAACANARNSFEDPCSLSVENENYARHWCSRLTDPNSAFSRCH SIINPKPFHSNCMFDTCNCERSEDCLCAALSSYVHACAAKGVQLSDWRDGVCTKYMQNCPKSQRYAYVVDACQPTCRGLSEADVTCSVSFV PVDGCTCPAGTFLNDAGACVPAQECPCYAHGTVLAPGEVVHDEGAVCSCTGGKLSCLGASLQKSTGCAAPMVYLDCSNSSAGTPGAECLRS CHTLDVGCFSTHCVSGCVCPPGLVSDGSGGCIAEEDCPCVHNEATYKPGETIRVDCNTCTCRNRRWECSHRLCLGTCVAYGDGHFITFDGD RYSFEGSCEYILAQDYCGDNTTHGTFRIVTENIPCGTTGTTCSKAIKLFVESYELILQEGTFKAVARGPGGDPPYKIRYMGIFLVIETHGM AVSWDRKTSVFIRLHQDYKGRVCGLCGNFDDNAINDFATRSRSVVGDALEFGNSWKLSPSCPDALAPKDPCTANPFRKSWAQKQCSILHGP TFAACRSQVDSTKYYEACVNDACACDSGGDCECFCTAVAAYAQACHDAGLCVSWRTPDTCPLFCDFYNPHGGCEWHYQPCGAPCLKTCRNP SGHCLVDLPGLEGCYPKCPPSQPFFNEDQMKCVAQCGCYDKDGNYYDVGARVPTAENCQSCNCTPSGIQCAHSLEACTCTYEDRTYSYQDV IYNTTDGLGACLIAICGSNGTIIRKAVACPGTPATTPFTFTTAWVPHSTTSPALPVSTVCVREVCRWSSWYNGHRPEPGLGGGDFETFENL RQRGYQVCPVLADIECRAAQLPDMPLEELGQQVDCDRMRGLMCANSQQSPPLCHDYELRVLCCEYVPCGPSPAPGTSPQPSLSASTEPAVP TPTQTTATEKTTLWVTPSIRSTAALTSQTGSSSGPVTVTPSAPGTTTCQPRCQWTEWFDEDYPKSEQLGGDVESYDKIRAAGGHLCQQPKD IECQAESFPNWTLAQVGQKVHCDVHFGLVCRNWEQEGVFKMCYNYRIRVLCCSDDHCRGRATTPPPTTELETATTTTTQALFSTPQPTSSP GLTRAPPASTTAVPTLSEGLTSPRYTSTLGTATTGGPRQSAGSTEPTVPGVATSTLPTRSALPGTTGSLGTWRPSQPPTLAPTTMATSRAR PTGTASTASKEPLTTSLAPTLTSELSTSQAETSTPRTETTMSPLTNTTTSQGTTRCQPKCEWTEWFDVDFPTSGVASGDMETFENIRAAGG KMCWAPKSIECRAENYPEVSIDQVGQVLTCSLETGLTCKNEDQTGRFNMCFNYNVRVLCCDDYSHCPSTLATSSTATPSSTPGTTWILTKP TTTATTTASTGSTATASSTQATAGTPHVSTTATTPTVTSSKATPFSSPGTATALPALRSTATTPTATSFTAIPSSSLGTTWTRLSQTTTPM ATMSTATPSSTPETVHTSTVLTTTATTTGATGSVATPSSTPGTAHTTKVLTTTTTGFTATPSSSPGRARTLPVWISTTTTPTTRGSTVTPS SIPGTTHTPTVLTTTTTTVATGSMATPSSSTQTSGTPPSLTTTATTITATGSTTNPSSTPGTTPIPPVLTTTATTPAATSSTVTPSSALGT THTPPVPNTTATTHGRSLSPSSPHTVCTAWTSATSGILGTTHITEPSTGTSHTPAATTGTTQHSTPALSSPHPSSRTTESPPSPGTTTPGH TTATSRTTATATPSKTRTSTLLPSQPTSAPITTVVTMGCEPQCAWSEWLDYSYPMPGPSGGDFDTYSNIRAAGGAVCEQPLGLECRAQAQP GVPLRELGQVVECSLDFGLVCRNREQVGKFKMCFNYEIRVFCCNYGHCPSTPATSSTATPSSTPGTTWILTELTTTATTTESTGSTATPTS TLRTAPPPKVLTTTATTPTVTSSKATPSSSPGTATALPALRSTATTPTATSVTPIPSSSLGTTWTRLSQTTTPTATMSTATPSSTPETAHT STVLTATATTTGATGSVATPSSTPGTAHTTKVPTTTTTGFTATPSSSPGTALTPPVWISTTTTPTTRGSTVTPSSIPGTTHTATVLTTTTT TVATGSMATPSSSTQTSGTPPSLTTTATTITATGSTTNPSSTPGTRPIPPVLTTTATTPAATSSTVTPSSALGTTHTPPVPNTTATTHGRS LSPSSPHTVRTAWTSATSGTLGTTHITEPSTGTSHTPAATTGTTQHSTPALSSPHPSSRTTESPPSPGTTTPGHTTATSRTTATATPSKTR TSTLLPSSPTSAPITTVVTMGCEPQCAWSEWLDYSYPMPGPSGGDFDTYSNIRAAGGAVCEQPLGLECRAQAQPGVPLRELGQVVECSLDF GLVCRNREQVGKFKMCFNYEIRVFCCNYGHCPSTPATSSTATPSSTPGTTWILTEQTTAATTTATTGSTAIPSSTPGTAPPPKVLTSQATT PTATSSKATSSSSPRTATTLPVLTSTATKSTATSFTPIPSSTLGTTGTSQNRPPHPMATMSTIHPSSTPETTHTSTVLTTKATTTRATSSM STPSSTPGTTWILTELTTAATTTAALPHGTPSSTPGTTWILTEPSTTATVTVPTGSTATASSTRATAGTLKVLTSTATTPTVISSRATPSS SPGTATALPALRSTATTPTATSVTAIPSSSLGTAWTRLSQTTTPTATMSTATPSSTPETVHTSTVLTTTATTTRTGSVATPSSTPGTAHTT KVPTTTTTGFTATPSSSPGTALTPPVWISTTTTPTTRGSTVTPSSIPGTTHTATVLTTTTTTVATGSMATPSSSTQTSGTPPSLTTTATTI TATGSTTNPSSTPGTTPIPPVLTTTATTPAATSSTVTPSSALGTTHTPPVPNTTATTHGRSLPPSSPHTVPTAWTSATSGILGTTHITEPS TGTSHTPAATTGTTQPSTPALSSPHPSSRTTESPPSPGTTTPGHTRGTSRTTATATPSKTRTSTLLPSSPTSAPITTVVTTGCEPQCAWSE WLDYSYPMPGPSGGDFDTYSNIRAAGGAVCEQPLGLECRAQAQPGVPLRELGQVVECSLDFGLVCRNREQVGKFKMCFNYEIRVFCCNYGH CPSTPATSSTATPSSTPGTTWILTKLTTTATTTESTGSTATPSSTQGPPAGTPHVSTTATTPTVTSSKATPFSSPGTATALPALRSTATTP TATSFTAIPSSSLGTTWTRLSQTTTPMATMSTATPSSTPETVHTSTVLTTTATTTGATGSVATPSSTPGTAHTTKVPTTTTTGFTVTPSSS PGTARTPPVWISTTTTPTTSGSTVTPSSIPGTTHTPTVLTTTTQPVATGSMATPSSSTQTSGTPPSLITTATTITATGSTTNPSSTPGTTP IPPELTTTATTPAATSSTVTPSSALGTTHTPPVPNTTATTHGRSLSPSSPHTVRTAWTSATSGTLGTTHITEPSTGTSHTPAATTGTTTTS TPALSSPHPSSRTTESPPSPGTTTPGHTTATSRTTATATPSKTRTSTLLPSQPTSAPITTVVTTGCEPQCAWSEWLDYSYPMPGPSGGDFD TYSNIRAAGGAVCEQPLGLECRAQAQPGVPLGELGQVVECSLDFGLVCRNREQVGKFKMCFNYEIRVFCCNYGHCPSTPATSSTAMPSSTP GTTWILTELTTTATTTASTGSTATPSSTPGTAPPPKVLTSPATTPTATSSKATSSSSPRTATTLPVLTSTATKSTATSVTPIPSSTLGTTG TLPEQTTTPVATMSTIHPSSTPETTHTSTVLTTKATTRATSSTSTPSSTPGTTWILTELTTAATTTAGTGPTATPSSTPGTTWILTELTTT ATTTASTGSTATLSSTPGTTWILTEPSTTATVTVPTGSTATASSTQATAGTPHVSTTATTPTVTSSKATPSSSPGTATALPALRSTATTPT ATSFTAIPSSSLGTTWTRLSQTTTPTATMSTATPSSTPETVHTSTVLTTTATTTGATGSVATPSSTPGTAHTTKVPTTTTTGFTATPSSSP GTALTPPVWISTTTTPTTTTPTTSGSTVTPSSIPGTTHTARVLTTTTTTVATGSMATPSSSTQTSGTPPSLTTTATTITATGSTTNPSSTP GTTPIPPVLTSMATTPAATSSKATSSSSPRTATTLPVLTSTATKSTATSFTPIPSSTLWTTWTVPAQTTTPMSTMSTIHTSSTPETTHTST THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260 VLTTTATMTRATNSTATPSSTLGTTRILTELTTTATTTAATGSTATLSSTPGTTWILTEPSTIATVMVPTGSTATTSSTLGTAHTPKVVTA MATMPTATASTVPSSSTVGTTRTPAVLPSSLPTFSVSTVSSSVLTTLRPTGFPSSHFSTPCFCRAFGQFFSPGEVIYNKTDRAGCHFYAVC NQHCDIDRFQGACPTSPPPVSSAPLSSPSPAPGCDNAIPLRQVNETWTLENCTVARCVGDNRVVLLDPKPVANVTCVNKHLPIKVSDPSQP CDFHYECECICSMWGGSHYSTFDGTSYTFRGNCTYVLMREIHARFGNLSLYLDNHYCTASATAAAARCPRALSIHYKSMDIVLTVTMVHGK EEGLILFDQIPVSSGFSKNGVLVSVLGTTTMRVDIPALGVTVTFNGQVFQARLPYSLFHNNTEGQCGTCTNNQRDDCLQRDGTTAASCKDM AKTWLVPDSRKDGCWAPTGTPPTASPAAPVSSTPTPTPCPPQPLCDLMLSQVFAECHNLVPPGPFFNACISDHCRGRLEVPCQSLEAYAEL CRARGVCSDWRGATGGLCDLTCPPTKVYKPCGPIQPATCNSRNQSPQLEGMAEGCFCPEDQILFNAHMGICVQACPCVGPDGFPKFPGERW VSNCQSCVCDEGSVSVQCKPLPCDAQGQPPPCNRPGFVTVTRPRAENPCCPETVCVCNTTTCPQSLPVCPPGQESICTQEEGDCCPTFRCR PQLCSYNGTFYGVGATFPGALPCHMCTCLSGDTQDPTVQCQEDACNNTTCPQGFEYKRVAGQCCGECVQTACLTPDGQPVQLNETWVNSHV DNCTVYLCEAEGGVHLLTPQPASCPDVSSCRGSLRKTGCCYSCEEDSCQVRINTTILWHQGCETEVNITFCEGSCPGASKYSAEAQAMQHQ CTCCQERRVHEETVPLHCPNGSAILHTYTHVDECGCTPFCVPAPMAPPHTRGFPAQEATAV Stage 2(a) POST-PRINT

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

Human K G S H I P E A T P K Y S E T N A IIG E I S T W S K G A Y K S F N G R I F F F E S S C P Y T F C R H C I E S G G D F N I E I K R N N D S E I E K I T V L I D N Horse K G S H I P E A T P K Y S E T D A I I G E A S T W G K G A Y K A F N G R I F S F E S S C T Y T F C R H C V D S G G D F N V E I K R N D D N E I E R I T V V I D S Cow ------E A S T W G K G A Y K A F N G R V F S F E S S C T Y T F C R H C V E S G G D F N I E I K R N N D S E I E K I T V IID N Pig K G S H I P E A T P K Y S E T D A IIG E A S T W G K G A Y K A F N G R V F S F E S S C S Y T F C R H C V E S G G D F N V E I K R N N D S E I E K I T V IID N Rat K A S H V P E A T P T Y S E A N E V A G E A A T W G K G T Y K A L N G R I F S F E S E C T F T F C R D C A E S G E D F N V E I K R H E N G D I E E I K A L I D D * ** ** ** *** * *** * **** * ** *** **** ** * ** Human N D V S I F G D T I L V N G E S V Q I P Y N N K L I H I K K Y G E H N V L N S R R G I L T L M W D K N N K L S L T L H K Q Y P T C G L C G N F N S T P G Q D IN Horse N D V S I L G D G I L V N G E S V Q I P Y N N K L I H I K K Y G E H N V L N S R R G I L T L M W D K N N K L S L T L H K Q Y S T C G L C G N F N S T P G D N IN Cow N D V S I F G D I L L V N G E S V Q I P Y N N K L I H I K K Y G E H N V L N S R R G I L S L M W D K N N K L S L T L H K Q Y P T C G L C G N F N S T P G D D IN Pig N D I S I S G D V I L V N G E S V H I P Y N N K L I H I K K Y G E H N V L N S R R G I L S L M W D K N N K L S L T I H K Q Y P T C G L C G N F N N T P G E D ID Rat V G I L V V R D T I S V N E E R V Q M P F S N K M I H I K K Q G D H Y V L K T R R K I L S L S W G K - N K L S L T L Y R Q Y I T C G L C G D F N S V P G E D IN * ** * * * ** *********************** ** ****** ** ** * Human E H I A N S K I P G D C P N A V G K S Y E V C E D G I Q H C N K IIG T Y F E K C G K V A A L S N D Y K M I C I D E Y C Q T - R D K T S T C D T Y S E L S R L C Horse E H I A N S K I P G D C P N A V S K S Y E V C E D G V Q H C N K IIE T Y F E K C G K V T A L S S D Y K M I C V D E Y C Q N - G G K E S T C D T Y S E L S R L C Cow E H I A D S K I P D D C S K A V S K S Y E V C E D G V Q Y C N K I I G T Y F E K C G K V S T L S S D Y K M I C I D E Y C Q S - R D R T S T C D T Y S E L S R L C Pig E H I A N S K I P G N C P N A V S K S Y E L C E D G V Q H C E K IIG T Y F E K C Q K V S S L S N D Y K IIC I D E Y C Q N - R D A K S T C D T Y S E L S R L C Rat E H I A N S K I S N D C P S T L S R N N E V C E D G V Q Y C D R I I G T Y F E K C S K V S P L S S E Y K N V C V D E Y C R T G G N K Q T T C N T Y S E L A R L C **** *** * * **** * * ** ****** ** ** ** * **** ** ***** *** Human A S D G P G T F E S W R S D S D V V C G T Q R C P E Q H I Y K E C G P S N P A T C S N V A P F Q D S E C V S G C T C P E G Y L L D D I G E K G K C V L K A E C P Horse A S D G P G T Y E Y W R D D S E V V C E K P I C P E K H I Y K E C G P S N P A T C S N V A P F Q D S E C V S G C T C L E G Y L L D D I G E K G R C V L K T E C P Cow A S D G P G T F E S W R D D P D V V C E K P I C P E K H I Y K E C G P S N P A T C S N V A P F Q D T E C V S G C T C P E G Y L L D D I G E K G R C V L K S D C P Pig A S D G P G V Y E S W R D D S A V V C E K P N C P E K H I Y K E C G P S N P A T C S N V A P F Q D S E C V S G C T C P E G Y L L D D I G K K G K C V L K A Y C P Rat A Y D G P G V F E H W R D D S A V V C A K Q Q C P G K H I Y K E C G P S N P P T C S N V A P F Q D S E C V S G C T C P E G Y L L D D I R E K G K C V L K E K C P * **** * ** * *** ** *********** ********** ******** ******** ** **** ** Human C E S S G T V Y Q P G E V R E G P C G S Q C T C Q D A K W S C T E A L C P G R C K V E G S S L T T F D G V K Y N F P G N C H F L A V H N E D W S I S V E L R P C Horse C E S N G K V Y Q S G E V R E G P C G S Q C T C Q E A K W S C T E A L C P G R C K V E G S S I T T F D G V K Y N H P G N C H F L A I H D E D W S V S V E L R P C Cow C E S N G K V Y Q S G E V R E G S C G S L C T C Q E A K W S C T K T L C P G R C K I E G S L I T T F D G V K Y N H P G N C H F L A I H D K D W S I S V E L R P C Pig C E S N G N V Y Q S G E V R E G P C G S Q C T C Q E A K W S C T E A L C P G R C K V E G S L I T T F D G V K Y N H P G N C H F L A I H D K D W S I S V E L R P C Rat C E S N G K V Y K P G E V R E G P C G S Q C T C Q E A K W S C T E A R C P G I C K V E G S S F T T F D D N K F S H P G D C H F L A V H N D E I S I S V E I H P C *** * ** ****** *** **** ****** *** ** *** **** * ** ***** * * *** ** Human P S G Q T G T C L N S V T L L L N S S V P V D K Y V F N S D G T V T N D K I R N Q G Y Y Y S D K I Q I F N A S S S Y L Q V E T Y F H V K L Q I Q I V P V M Q L Y Horse P S G Q S G T C L N S V T L L L N S S V S V D K Y V F N R D G T V T N D K I R N E S Y Y Y S D K I Q I F K A S S S Y L Q V E T Y F H V K M Q I Q I V P V M Q L Y Cow P S G Q S G T C L N S V T L L L N S S V Q V D K Y V F N R D G T V T N D K F G N L G Y Y Y S D K I Q I F N A S S S Y L Q A E T Y F H G K M Q I Q I F P V M Q L Y Pig Q S G Q S G T C L N S V T L L L N S S V S V D K Y I F N R D G T V T N D K I R N Q S Y Y Y S D K V Q I F K V S S S Y L Q A E T Y F H V K M Q I Q I V P V M Q L Y Rat G D G Q T G S C L T S V M V L L N S S T S G N R F V F N R D G T V T K D G V IIK G Y Y Y S D G V Q I F N S S S L Y M Q A E I L S H I K L Q I Q M V P K M Q L Y ** * ** ** ***** ** ***** * ***** *** ***** ********** Human V S M P P N Q F T D T V G L C G S Y N N K A E D D F M S S Q N I L E K T S Q A F A N S W E M M S C P K G N P S S C I S I E K E K F A E R H C G I L L D S S G P L Horse V S M P P N Q F T D T V G L C G S Y N N R A E D D F M S S Q N I L E K T S Q A F A N S W E M M S C P K G K P A S C I S I E T E K F A D S Y C G I L L D S G G P F Cow V S M P P N Q F T D T V G L C G S H N N R A E D D F M S S Q N I L E K T S Q A F A S S W E M M P C P K A S T A S C I S I E K E R F A E R H C G I L L D L S G P F Pig V S M P P N Q F T D T V G L C G S Y N N R A E D D F M S S Q N I L E K T S Q A F A N S W E M M S C S K G N T A S C I S I E K E K F A E K N C G I L L D S S G P F Rat V S L A P D T S T D T V G L C G S F N N K A E D D F M S S Q K I L E S T A Q A F A N S W E M M P C P K G S P S S C V S I E T E K F A E S N C E I L L S S S G P F ** * ********* ** ********* *** * **** ***** * * ** *** * ** * *** ** Human A S C H P I V N P K P Y H E E C K K Y T C T C E N S Q D C L C T I L G N Y V K A C A E K E T Y I V G W R T G L C E H S C P S G L V F K Y N V K A C N S S C R S L Horse A S C H A I V N P K T Y H E E C K K Y T C T C E N S Q D C L C T V L G N Y V K A C A E K E T Y I V G W R A G R C D H S C P S G L V F K Y N V K A C N S S C R S L Cow A S C H S I V D P K P Y H E E C K K Y T C T C E N S Q D C L C T I L G N Y V K A C A E K E T S M V G W R A G L C D Q S C P S G L V F K Y N V K T C N S S C R S L Pig A S C H P I V N P K P Y H E E C K K Y T C A C E N S K D C L C T I L G N Y V K A C A E K E T Y I V G W R A E L C D H S C P S G L V F K Y N V K A C N S S C R S L Rat A A C H Q T V D P K F Y H E E C K R Y T C S C E N S Q D C L C T V L G N Y V K A C A E K E T Y L V G W R D G L C E V S C P T G L V F N Y K V K T C N S S C R S L * ** * ** ****** *** **** ***** ************* **** * *** **** * ** ******** Human S E R D R S C D V E D V P V D G C T C P D A M Y Q N N E G N C V L K S Q C D C Y I N D E V M Q P G K L I H I D D N K C V C R D G I L L C Q I P I D L T L Q N C S Horse S E R D R S C D V E D V P V D G C T C P D G M Y Q N N E G N C V A K S Q C D C Y I N D E V I Q P G K L I Q I D D N N C V C R D G I L L C Q I P I N L T P Q N C S Cow S E R D K S C D M E G I S V D G C T C P D G M Y K N N E G N C V S K S Q C D C Y I N D E V M Q P G K L I H I D D N K C V C R D G I L L C Q T P I D L T L Q N C S Pig S E R D R S C D V E D V P I D G C T C P D G M Y H N N E G T C V P K S Q C D C Y I N D E V M Q P G K L I H I G N N K C V C R D G I L L C Q T P I D L T L Q N C T Rat S A R D R S C D I E D I L V D G C T C P D G M Y Q N N E G N C V Q K S E C D C Y V E D E T V Q P G K S I L I D D N K C V C Q D G V L H C Q T P L D L T L Q N C S * ** *** * ******* ** **** ** ** **** ** **** * * * *** ** * ** * ** *** Human G G A E Y V D C S D P K A Q R R T N R T C S T R N I P V F D E N L P C K R G C F C P E G M V R N S K G I C V F P N D C P C S F G G R E Y D E G S V T S V G C N E Horse G G A E Y V D C S D P K A Q R R V D S T C S T R N I P S F E E N L P C K R G C Y C P V G M V R N S K G N C V F P D D C P C S F G G R E Y D Q G S V T S V G C N K Cow G G A E Y V D C R N P K A Q R R V D S T C S T R N I P S F D E N L P C K R G C Y C P E G M V R N S K G S C V F P D D C P C S F G G R E Y D Q G S V T S V G C N K Pig G G A E Y V D C R D P K A Q R R A D S T C S T W N I P S F E E N L P C K R G C Y C P V G M V R N S K G N C V F P E D C P C S F G G R E Y D Q G S V T S V G C N K Rat R G A E Y I D C K D P K A Q R R T E R T C A T R N I P D F E G D L P C K R G C Y C P V G M V R N S K G I C I H P D D C P C S F G D R E Y E Q G S V T S V G C N E **** ** ****** ** * *** * ******* ** ******** * * ******* *** ********* Human C T C I K G S W S C T Q N E C Q T I C H I Y G E G H V R T F D G K S Y S F D G L C Q Y S F L E D Y C G H E N G T F R I L T E S V P C C E D G L T C S R K IIV A Horse C T C I K G S W N C T Q N E C Q T T C H I Y G E G H V Q T F D G K S Y S F D G L C Q Y T F L E D Y C G S Q N G T F R I L T E S V P C C E D G L T C S R K I V V A Cow C T C I K G S W N C T Q N E C Q T T C H I Y G E G H V R T F D G K S Y S F D G L C Q Y S F I E D Y C G R E N G T F R I L T E S V P C C E D G L T C S R K IIV A Pig C T C I K G S W N C T Q N E C Q T T C H I Y G E G H I R T F D G K T Y S F D G L C Q Y S F L E D Y C G S E N G T F R I L T E S V P C C E D G L T C S R K IIV A Rat C T C I K G S W N C T Q N E C Q S T C H V Y G E G H Y R T F D G E S Y S F D G L C Q Y T F L E D Y C G Q E N G T F R I L T E S V P C C E N G L T C S R K IIV A ******** ******* ** ***** **** ********* * ***** *************** ******** ** Human F Q D Q N I V L Q D G K V T A V K S T E S K K C E L N A N A Y S I H T V G L Y L I L K F Q N G IIV I W D K N T R L S V I L D P N W N G K V C G L C G N N N G D Horse F Q D Q N I V L H D G K V T A A K T T E S K E C E P N G N T Y S V H T V G L Y L I V K F L N G I T IIW D K N T R M S V I L D P S W N G K V C G L C G N N N G D Cow F Q D Q N I V L H D G K V T A V K T T E S K E C E L N G N S Y S V H T V G L Y L I L K F L N G I T IIW D K N T R I S V I L D P R W N G Q V C G L C G N N N G D Pig F Q D Q N V V L H D G K V T A V K T T E S K E C E L E E N S Y S V H T V G L Y L I L K F L S G I T IIW D K N T R I S V I L D P R W N G K V C G L C G N N N G D Rat F Q D Q N IIL Q D G K V T A V Q T A E S E D C E R S A S S Y S I H T V G L Y L I V K L L N G IIL I W D K Y T K V S V I L D P S W Q N K V C G L C G N N N G D ***** * ****** ** ** ** ******** * ** **** * ****** * *********** Human L K D D F T T R Y S S V A S G A L E F G N S W K T S Q E C S D T V A Q T F P C D S N P Y C K A W A V R K C E I L R D S T F R D C H N K V D P S A Y H D V C I E E Horse L K D D F T T R Y S S V A T G A L E F G N S W K T S Q E C S D T V T Q T F P C D S N P Y C K A W A V R K C E IIR D S T F R D C H N K V D P S A Y Y D A C I E E Cow L K D D F T T R Y S S V A A G T L E F G N S W K T S Q E C S D T V A Q T F P C D S N P Y C K A W A V R K C E IIR D S T F R E C H N K V D P N E Y Y D A C I E E Pig L K D D F T T R Y S S V A V G E L E F G N S W K T S Q E C S D T V T Q S F P C D S N P Y C K A W A V R K C E IIR D S T F R D C H N K V D P S A Y Y D A C I E E THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260 Rat L K D D F T T R H S S V A A G A L E F A N S W K T S Q E C S D T V T Q S F P C D S N P Y C K A W A E K K C E IIR D D T F R D C H S K V D P T T Y Y N A C I E E ******** **** * *** ************* * ************* **** ** *** ** **** * **** Human A C A C D M E G K Y L G F C T A V A M Y A E A C S A V G V C V S W R K P N L C P V Y C D Y Y N A P G E C R W H Y E P C G T V T A K T C K D Q L V G Q K F S S L L Horse A C A C D M E G K Y L G F C T A V A M Y A E A C S A V G V C V T W R K P D L C P V Y C D Y Y N A P G E C S W H Y E P C G T V A A K T C K D G V I G Q K F S A V L Cow A C A C D M E G K Y L G F C T A V A M Y A E A C S A V G V C V T W R K P D L C P V Y C D Y Y N A P G E F S W H Y E P C G T V T A K T C K D R V I G Q K F S A L L Pig A C A C D M E G K Y L G F C T A V A M Y A E A C S A V G V C V T W R K P D L C P V Y C D Y Y N A P G E C S W R Y E P C G T V T A K T C K D R V I G Q K F S A L L Rat A C S C D M E G K Y L G F C T A V A M Y A E A C N A A G V C V S W R K P N L C P V Y C D Y Y N A P G E C S W H Y E P C G T V T A K T C K D Q V I G Q K F S S L L ** ********************* * **** **** ************** * ******* ****** ***** * Human E G C Y A K C P D S A P Y L D E N T M K C V S L S E C S C F Y N D V I P A G G V I E D N C G R T C Y C I A G Q L E C S 1178 amino acids Horse E G C Y A K C P D S A P Y L D E N I M K C V S L S E C S C F Y N D IIP A G G V I Q D N C G R T ------1167 amino acids Cow E G C Y A K C P D S A P Y L D E N T M K C V S L A E C S C F Y N D I V P A G G V I Q D N C G R T C Y C I A G E L E C - 1157 amino acids Pig E G C Y A K C P D S A P Y L D E N T M K C V S L S E C S C F Y N D IIP A G G V I Q D N C G R I C Y C I A G E L E C S 1178 amino acids Rat E G C Y A K C P E N A P Y L D E N T M K C V Q L S E C S C F Y N D V I P A G G A V V D D C G R T C S C T A G E L E C S 1179 amino acids ******** ******* **** * ******** **** * *** FIGURE 3 Stage 2(a) POST-PRINT

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260

Human L N P A H N G R V C S T W G D F H Y K T F D G D V F R F P G L C N Y V F S E H C R A A Y E D F N V Q L R R G L V G S R P V V T R V V I K A Q G L V L K A S N G S Horse V N P A H N G R V C S T W G D F H Y K T F D G Y V F R F P G L C N Y V F S A H C G A A Y E D F N I Q L R R G L A G S K P T I T H V V L K T Q G L V L E I S N G S Cow L N A A H G G R V C S T W G D F H Y K T F D G D V F R F P G L C N Y I F S A H C G S A Y E D F N L Q L R R G L L G S R P T I T H I V L R S Q G L V L E V S N G S Rat L H L A H N G R V C S T W G D F H Y K T F D G D V F R F P G L C N Y V F S S H C G A T Y E D F N I Q L R R G L E G S R P T V T Y V F L R A Q G L V I E L S N G S ** ***************** ********** ** ** ***** ****** ** * * **** **** Human V L I N G Q R E E L P Y S R T G L L V E Q S G D Y I K V S I R L V L T F L W N G E D S A L L E L D P K Y A N Q T C G L C G D F N G L P A F N E F Y A H N A R L T Horse I L I N G Q Q E E L P Y S R A G L L V E Q S S V Y V K V N V R L V L T F M W N G E D S A L L E L D P K Y A N Q T C G L C G D F N G L P A I N E F Y A H N T R L T Cow V L I N G W R E E L P Y S R A G L L V E R S S T Y V K I N I R L M L T F M W N G E D S A L L E L D P K Y A N Q T C G L C G D F N G L R A V S E F Y A H N T R L S Rat V L V N G H R E E L P Y S R A G L L M E K T S G Y V K I S I R L V L N F L W N E E D S V L L E L D S K Y T N Q T C G L C G D F N G L P A V N E F Y A H N T R L T * ** ******* *** * * * ************** ** ************* * ****** ** Human P L Q F G N L Q K L D G P T E Q C P D P L P L P - A G N C T D E E G I C H R T L L G P A F A E C H A L V D S T A Y L A A C A Q D L C R C P T C P C A T F V E Y S Horse P L Q F G N L Q K L D G P T E Q C Q D P L P S P - A D N C T D E E G I C R R T L L G L A F A Q C H G L V D T D V Y V A A C T Q D L C R C P T C P C A T F A E Y S Cow P L Q F G N L Q K L D G P T E Q C Q D P L P S P A A D N C T D E x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x Rat P V Q F G N L Q K L D G P T E Q C Q D V L L S P - A N N C T D T E G I C R H I L L G P A F A E C I A L V D V N V Y L D A C V Q D L C R C P T C P C A T F A E Y S * *************** * * * * **** Human R Q C A H A G G Q P R N W R C P E L C P R T C P L N M Q H Q E C G S P C T D T C S N P Q R A Q L C E D H C V D G C F C P P G S T V L D D I T H S G C L P L G Q C Horse R Q C T H A G G Q P Q N W R G P D L C P R T C P L N L Q H Q E C G S P C T D T C S N P D H S Q L C E D H C V D G C F C P P G - T V L D D V S H A G C L P L E Q C Cow x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x Rat R Q C A H A G G H P Q N W R R S D L C N W T C P F N M E Y Q E C S S P C V D T C S N P Q R S Q L C E D H C M D G C F C P P G - T V L D D V R H L G C L P L E Q C

Human P C T H G G R T Y S P G T S F N T T C S S C T C S G G L W Q C Q D L P C P G T C S V Q G G A H I S T Y D E K L Y D L H G D C S Y V L S K K C A D S S F T V L A E Horse S C T H S G R T Y A P G A S F T T S C S S C T C S S G M L Q C Q D L P C P G T C S V Q G G S H I S T Y D K K L Y N V H G D C S Y I L S K T C A D S G F T V L A E Cow x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x C P G T C S V Q G G S H I S T Y D E K L Y D V H G D C S Y V L T K V C A D S A L T V L A E Rat H C T H G G H I Y A P G E S F N T T C S S C T C F G G L W Q C Q D L P C P G T C S V Q G G S H I S T F D E K L Y N V H G D C S Y I L T K K C A D S S F T V L V D ********** **** * *** ****** * * **** *** Human L R K C G L T D N E N C L K A V T L S L - D G G D T A I R V Q A D G G V F L N S I Y T Q L P L S A A N I T L F T P S S F F I V V Q T G L G L Q L L V Q L V P L M Horse L R Q C G L T D N E N C L K T V T L K M T S G G D T V I Q I Q A N G G V F M N S I Y T Q L P V S A A N I T I F K P S S F F I L V Q T G L G L Q L Q V Q L V P L M Cow L R K C G L T D N E N C L K T V T L S L - N G G D T T V Q I Q A N G G V F V N S I Y S Q L P M S V A D V T V F R P S S F F I L V Q T G P W L Q L Q V Q L V P L M Rat L R K C G M T D T E N C L K A V T L S L - N S G D M V V R I Q V N G A V F L N S I F T Q L P L S A G N V T I F R P S S F F V I V Q V G T G L Q L Q V Q L V P F M ** ** ** ***** *** ** * * ** *** *** * * * ***** ** * *** ***** * Human Q V F V R L D P A H Q G Q M C G L C G N F N Q N Q A D D F T A L S G V V E A T G A A F A N T W K A Q A A C A N A R N S F E D P C S L S V E N E N Y A R H W C S R Horse Q V Y L R L D P S Y R G Q M C G L C G N F N Q N Q A D D F R I L G G V V E G T A A A F A N T W K T Q A A C P N V K N S F E D P C S L S V E N E N F A Q H W C S L Cow Q V F L R L D P A Y R G Q M C G L C G N F N Q N Q A D D F R T V S G V V E A T A A A F A N T W K T Q A A C P N V K N S F E D P C S L S V E N E N Y A Q H W C S L Rat Q V F V R L D R S Y Q G Q M C G L C G N F N Q N Q A D D F T A L S G V V E G T G A A F S N T W K T Q A S C P N S K N T Y E D P C S Y S V E N E N F A R E W C S M ** *** ****************** **** * *** **** ** * * * ***** ****** * *** Human L T D P N S A F S R C H S I I N P K P F H S N C M F D T C N C E R S E D C L C A A L S S Y V H A C A A K G V Q L S D W R D G V C T K Y M Q N C P K S Q R Y A Y V Horse L T D P T G A F S P C H S I V N P A P F H S N C L F D T C N C E K S E A C M C A A L S S Y V W A C A A K G V L L H G W R D G V C T K Y M S S C P K S Q N Y T Y V Cow L T R P A G P F S P C H S V I S P G P F H S N C L F D T C N C E K S E D C M C A A L S S Y V Q A C A A R G V L L S G W R D G V C T K Y A S S C P K T Q S Y A H V Rat L T E S S G V F S A C H A T V S P V P F Y S N C L F D T C N C E N S E D C M C A A L S S Y V R A C A A K G V L L S G W R G K A C Y K Y M N N C P Q T K E Y S Y S ** ** ** * ** *** ******* ** * ******** **** ** * ** * ** ** * Human V D A C Q P T C R G L S E A D V T C S V S F V P V D G C T C P A G T F L N D A G A C V P A Q E C P C Y A H G T V L A P G E V V H D E G A V C S C T G G K L S C L Horse V D S C Q P T C R R L S Q A D V T C G I A F V P V D G C T C P S G T F L D D T S T C V P A E A C P C Y F R G S V V A P G E V V H D D G V V C S C V S G K L S C L Cow V D S C Q P T C R S L S Q P D V S C D V A F V P V D G C V C P R G x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x S C V S G R L S C L Rat V S T C Q P T C R S L S E V D V T C S I P F V P V D G C T C P E G T F L N D K D H C V P V E E C P C Y F H G T V V A S G E V V M D N G V V C S C T N G K L T C L * ****** ** ** * ******* ** * ** * * ** Human G A S L Q K S T G C A A P M V Y L D C S N S S A G T P G A E C L R S C H T L D V G C F S T H C V S G C V C P P G L V S D G S G G C I A E E D C P C V H N E A T Y Horse G A E E Q R S I G C V A P M L Y L D C T N A S A G T P G A E C L R S C H V L D V D C F S T H C V S G C V C P P G L V S D G S G G C V A E E D C P C L H N E A T Y Cow G A T E Q - S T G C V A P M V F L D C S N A S A D A P G A E C V R S C H T L D V D C F S T H C V S G C V C P V G L L S D G S G G C V A E E D C P C M H N E A A Y Rat G A L M Q R N K E C Q A P M V Y L D C N N A S V G D H G A E C L R S C H T L D V D C F S T Q C V S G C V C P S G L V A D G N G G C I A E E D C P C V H N E A T Y ** * * *** *** * * **** **** *** **** ******** ** ** *** ******* **** * Human K P G E T I R V D C N T C T C R N R R W E C S H R L C L G T C V A Y G D G H F I T F D G D R Y S F E G S C E Y I L A Q D Y C - G D N T T H G T F R I V T E N I P Horse K P G E T I R V D C N T C T C R N R R W E C S H Q P C L G T C V A Y G D G H F I T F D G E R Y S F E G S C E Y T L V Q D H C G G N G T A N G T F R I V T E N V P Cow K P G E V I K V D C N T C T C R G R R W E C S D R P C L G T C V A Y G D G H F L T F D G E R Y G F E G S C E Y T L A Q D Y C V G S D T A N G T F R I V T E N V P Rat R P G E IIR V D C N N C T C R N R R W E C T N Q P C M G A C V A Y G D G H F V T F D G E R Y I F E G N C E Y T L A Q D Y C R G N T S T D G T F R I V T E N V P *** * **** **** ***** * * ********* **** ** *** *** * ** * * ********* * Human C G T T G T T C S K A I K L F V E S Y E L I L Q E G T F K A V A R G P G G D P P Y K I R Y M G I F L V I E T H - G M A V S W D R K T S V F I R L H Q D Y K G R V Horse C G T T G V T C S K T I K L F L E N Y E L I L H E G T Y K V V Q R G P G R D L P Y R I R Y M G I Y L T I E T R S G M V V S W D R K T S V F I R L R Q D Y K G R V Cow C G T T G V T C S K A I K I F L G S Y E L I L H E G T H R V L Q R G P G G D L P Y R V R Y M G I Y L T V E T H G G V V V S W D R K T S V IIR L R H E Y K G R V Rat C G T T G T T C S K A I K I F V E S Y E L I L H E G N F K V V A R G P S G D P P Y K I R Y M G I F L V I E I R S G I V V S W D R K T S V F V R L Q Q H Y K G R V ***** **** ** * ***** ** *** * ** ***** * * * ********* ** ***** Human C G L C G N F D D N A I N D F A T R S R S V V G D A L E F G N S W K L S P S C P D A L A P K D P C T A N P F R K S W A Q K Q C S I L H G P T F A A C R S Q V D S Horse C G L C G N F D D N A I N D F T T R S Q S V V G N A L E F G N S W K F S P S C P D A P A P K D P C T A N P Y R K S W A Q K Q C S I I N S A T F A T C R S Q V D S Cow C G L C G N F D D N A L N D F T T R S Q S V A S D V L E F G N S W K F S P S C P D A L A P R D P C T T N P Y R R S W A Q K Q C S I I N S A T F S A C R S Q V D P Rat C G L C G N F D D N A I N D F T T R S Q S V V G D V L E F G N S W K F S P S C P D A P V P K D P C I A N P Y R K S W A Q K K C S IIN S A T F A A C H S Q V D S *********** *** *** ** ******** ******* * *** ** * ***** *** ** * **** Human T K Y Y E A C V N D A C A C D S G G D C E C F C T A V A A Y A Q A C H D A G L C V S W R T P D T C P L F C D F Y N P H G G C E W H Y Q P C G A P C L K T C R N P Horse T K Y Y E A C V S D A C A C N S G G D C E C F C T A V A A Y A Q A C R D V G V C V S W R S P D V C P L F C D Y Y N P H G E C E W H Y E P C G A P C L R T C R N P Cow T R Y Y E A C V S D A C A C D S G G D C E C F C T A V A A Y A Q A C H E A G V C V S W R T P D V C P L F C D Y Y N P H G Q C E W H Y Q P C G A P C L K T C R N P Rat T K Y Y E A C V H D V C A C D S G G D C E C F C T A V A A Y A Q A C R D V G V C L S W R T P D I C P L F C D Y Y N P H G Q C E W H Y Q P C G A P C L K T C R N P * ****** * *** ******************* * * *** ** ****** ***** ***** ******* ***** Human S G H C L V D L P G L E G C Y P K C P P S Q P F F N E D Q M K C V A Q C - G C Y D K D G N Y Y D V G A R V P T A E N C Q S C N C T P S G I Q C A H S L E A C T C Horse S G H C L V D L P G L E G C Y P R C P P G R P F F S E D E M E C V A Q C - G C Y D E D G N Y H E V G A R V P A A E N C Q S C E C T P N G I Q C I H S L E A C R C Cow S G L C L M D L P G L E G C Y P K C P S S K P F F N E D Q M E C V A Q C S G C Y D G D G N Y Y D A G T R V P S T E N C Q S C D C T S S G L Q C T H S P E A C T C Rat S G H C L V D L P G L E G C Y P Q C P P S Q P F F N E D Q M K C V A Q C - G C Y D D D G N Y H D I G T Q V P T A E N C Q S C L C T P G G L Q C A Y N L T A C T C

** ** ********** ** *** ** * ***** **** **** * ** ****** ** * ** ** * THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260 Human T Y E D R T Y S Y Q D V I Y N T T D G L G A C L I A I C G S N G T IIR K A V A C P G T P A T T P F T F T T A W V P H S T T 1257 amino acids Horse T Y E G K T Y G Y E E V I Y N T T D G L G A C L L A I C G N N G T IIR S V V E C P G T L S T T P F T F T T T A A P P S T T 1259 amino acids Cow T Y E G R T Y A Y G D V I Y N T T D G L G A C L I A I C R D N G T I V R R A V E C P G T L F K T P F T F T S T A A P P S T T 940 amino acids Rat I Y E G R T Y N Y N D V V Y N T T D G L G A C L V A I C K D N G T I K R T T E E C P E V P S T T P F I F T T T L T P L T T N 1258 amino acids ** ** * * *********** *** **** * ** *** ** * *

Figure 4

Stage 2(a) POST-PRINT

Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. © 2008 The Authors Journal compilation © 2008 Biochemical Society THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260 see - VERSION FINAL THE NOT IS THIS

Density (g/ml) 1.6 1.4 1.2 Figure 5 Figure Fraction number 5101520

5 3 1 0 4 2

PAS response Stage 2(a) POST-PRINT 2(a) Stage © 2008 The Authors Journal compilation © 2008 Biochemical Society Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. Licenced copy. Copying is not permitted, except with prior permission Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260 as manuscript 22 Apr 2008 on Published Publication. Immediate Journal Biochemical THIS IS NOT THE FINAL VERSION - see doi:10.1042/BJ20080260 see - VERSION FINAL THE NOT IS THIS 6

Fraction number Figure 5101520

4 3 2 1 0

1.5 1.0 0.5 0.0 PAS response Stage 2(a) POST-PRINT 2(a) Stage © 2008 The Authors Journal compilation © 2008 Biochemical Society Licenced copy. Copying is not permitted, except with prior permission and as allowed by law. Licenced copy. Copying is not permitted, except with prior permission Biochemical Journal Immediate Publication. Published on 22 Apr 2008 as manuscript BJ20080260 as manuscript 22 Apr 2008 on Published Publication. Immediate Journal Biochemical