Biology Direct BioMed Central

Discovery notes Open Access Identification of a crenarchaeal orthologue of Elf1: implications for chromatin and transcription in Jan-Peter Daniels1, Steven Kelly1,2,3, Bill Wickstead1 and Keith Gull*1

Address: 1Sir William Dunn School of Pathology, University of Oxford, South Parks Road, Oxford, OX1 3RE, UK, 2Centre for Mathematical Biology, University of Oxford, 24-29 St Giles', Oxford, OX1 3LB , UK and 3Oxford Centre for Interactive Systems Biology, Department of Biochemistry, University of Oxford, South Parks Road, Oxford, OX1 3QU, UK Email: Jan-Peter Daniels - [email protected]; Steven Kelly - [email protected]; Bill Wickstead - [email protected]; Keith Gull* - [email protected] * Corresponding author

Published: 29 July 2009 Received: 4 July 2009 Accepted: 29 July 2009 Biology Direct 2009, 4:24 doi:10.1186/1745-6150-4-24 This article is available from: http://www.biology-direct.com/content/4/1/24 © 2009 Daniels et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract The transcription machineries of Archaea and eukaryotes are similar in many aspects, but little is understood about archaeal chromatin and its role in transcription. Here, we describe the identification in hyperthermophilic and a Korarchaeon of an orthologue of the eukaryotic transcription elongation factor Elf1, which has been shown to function in chromatin structure maintenance of actively transcribed templates. Our discovery has implications for the relationship of chromatin and transcription in Archaea and the evolution of these processes in eukaryotes. Reviewers: This article was reviewed by Chris P. Ponting and Eugene V. Koonin.

Findings In eukaryotes, transcription elongation factors assist RNAP Despite their prokaryotic morphology, Archaea are more in overcoming pausing and arrest on the template [6]. TFIIS similar to eukaryotes in their mechanisms of copying and releases RNAPII from transcriptional arrest by supporting expression of their genetic information than to bacteria [1]. RNA transcript cleavage, whereas the yeast DSIF complex With the recent description of an RPB8 orthologue (RpoG) consisting of Spt4 and Spt5 (bacterial homologue NusG) is in hyperthermophilic Crenarchaeota and a Korarchaeon [2] thought to belong to the class of chromatin elongation fac- and demonstration of its constitutive incorporation into tors that affect RNAP transcription through chromatin [7]. archaeal RNAP [3], homologues of all of the twelve DNA- The archaeal TFIIS-homologue TFS appears to operate in an dependent RNA polymerase (RNAP) eukaryotic core subu- equivalent manner to eukaryotic TFIIS [1]. Also, an ortho- nits have been identified in Archaea [4]. The structure of the logue of Spt5/NusG and a protein with sequence and struc- archaeal RNAP closely resembles the eukaryotic RNAPII [5]. tural similarity to Spt4 have been identified in Archaea, Eukaryotic transcription furthermore depends on accessory further supporting the ancestral link between archaeal and factors aiding initiation. Of those, homologues of eukaryotic eukaryotic transcription [1]. basal transcription initiation factors TBP (TATA-binding pro- tein), TFIIB (transcription factor II B) and of the α-subunit of Elf1 is a transcription elongation factor that has recently been TFIIE are found in Archaea (TBP, TFB and TFE) [1]. identified and characterized in Saccharomyces cerevisiae in a screen for mutations that cause synthetic lethality with muta-

Page 1 of 5 (page number not for citation purposes) Biology Direct 2009, 4:24 http://www.biology-direct.com/content/4/1/24

tions in other genes coding for transcription elongation fac- searches of its predicted proteome was pen- tors [8]. A role for Elf1 in transcription elongation was dens. In order to ensure that Elf1 orthologues were not demonstrated by genetic interaction with several transcrip- missed in our searches because they had not been anno- tion elongation factor genes, including those coding for tated in the predicted proteomes, we conducted tBLASTn TFIIS, Spt4 and Spt5. Elf1 is recruited to regions of active [16] queries of the archaeal genome sequences with all 14 transcription [8]. Transcription initiation from a gene-inter- identified archaeal Elf1 proteins and gene order analyses nal site in elf1Δ cells and the production of short transcripts using the UCSC Archaea genome browser [http:// in an elf1Δ hir1Δ background strongly suggested that Elf1 acts archaea.ucsc.edu/; Additional file 1, Table S5; [17]]. By by maintaining the chromatin structure of active transcrip- this approach, Elf1 was found in all hyperthermophilic tion units [8]. Crenarchaeota, including T. pendens, and Candidatus K. cryptofilum, but not in any Euryarchaeota, C.symbiosum or During a sensitive sequence-similarity search for transcrip- N.maritimus (Figure 1). Failure to detect an Elf1 ortho- tion elongation factors in an evolutionary wide range of logue in C.symbiosum and N.maritimus is in agreement organisms, we noticed high-scoring hits for Elf1 in a subset with their phylogenetic position in a separate archaeal of archaeal predicted proteomes. Consequently, a thorough phylum, the Thaumarchaeota, as recently proposed [15]. search of more archaeal genomes was initiated. A multiple Although Pfam family PF05129.5 describes many of the sequence alignment of Homo sapiens Elf1 (NP_115753.1) orthologues relationships identified in this work, we and S.cerevisiae Elf1 (NP_012762.1) was generated using chose our iterative hidden Markov model approach, MAFFT [9] and trimmed to well aligning sequence blocks. because this enabled us to define a set and gather- The alignment was then utilized to build a profile-hidden ing threshold which would be both sensitive and selective Markov model [10], which was queried against the predicted for this specific task. Comparison confirms that our proteomes of 48 archaeal and 28 eukaryotic organisms with method was more selective (Additional file 1, Fig. S1). a wide evolutionary diversity (Additional file 1, Tables S1 Other iterative procedures, such as PSI-BLAST [16], may and S2). Hits with expectation values lower than a threshold achieve similar results, however. of 10-3 were selected and aligned. The alignment was trimmed to aligned sequence containing no more than 50% Interestingly, this phylogenetic distribution is identical to gaps and sequences were removed so that no sequence pairs that of RpoG, the divergent archaeal orthologue of the with an identity higher than 95% remained in order to pre- eukaryotic RNAP subunit RPB8 [2]. The same group of vent sequence bias. A new hidden-Markov model was built archaeal organisms that contain the full set of all twelve and the whole process was repeated iteratively until no new RNAP subunits also has an orthologue of the eukaryotic sequences could be identified [11]. This method provides a transcription elongation factor Elf1. This raises interesting sensitive and high-quality dataset of Elf1 homologues. questions as to whether and how Elf1 functions in archaeal transcription and whether it interacts with particular subu- In this way, we found 46 sequences from 14 archaeal and nits of RNAP. Considering the function of yeast Elf1 in the 25 eukaryotic organisms (Figure 1A; Additional file 1, maintenance of chromatin structure on actively transcribed Tables S3 and S4). All the Elf1 homologues identified con- genes [8], the issue arises as to how chromatin is constructed tain a C4-zinc finger signature and most sequences are in Archaea. In eukaryotic chromatin, nucleosomes consist of characterized by a stretch of up to seven consecutive basic the highly conserved histone proteins H2A, H2B, H3 and amino acids at their N-terminus (Figure 1B). All Archaea H4. Archaeal histones have been found in many euryar- and most eukaryotes with an Elf1 only contain a single chaeal organisms and Cenarchaeum [18] and have recently gene, although some eukaryotic organisms have lineage- also been described in T. pendens and Korarcheum [19,20]. In specific gene duplications (Figure 1A). Importantly, Elf1 order to revisit this question in the light of new sequence homologues were restricted to a distinct subset of the data, we built a profile-hidden Markov model for archaeal archaeal predicted proteomes. Elf1 was identified only in histones. In addition to T. pendens (ABL77757.1), and Can- hyperthermophilic Crenarchaeota and the Korarchaeon didatus K.cryptofilum (ACB06807.1 and ACB07883.1), we Candidatus Korarchaeum cryptofilum. It was found in none were able to also identify a histone homologue in of the predicted proteomes from Euryarchaeota or in the maquilingensis (ABW02527.1) (Figure 1A; Additional file 1, mesophilic marine group I Cenarchaeum symbiosum and Figure S2). Thus, only some of the archaeal genomes in Nitrosopumilus maritimus [12-14] (now classified as which we identified an Elf1 orthologue also code for his- belonging to the Thaumarchaeota) [15]. This demon- tones. It is therefore likely that the function of archaeal Elf1 strates that these organisms either do not have Elf1 ortho- is independent of histone-containing chromatin. logues or that homologues in these lineages are very divergent from both crenarchaeal and eukaryotic versions. Other, non-histone chromatin proteins have also been identified in Archaea. Alba is found in Crenarchaeota and a The only hyperthermophilic Creanarchaeon without a hit subset of Euryarchaeota [18], while Sul7d is restricted to for Elf1 in the profile-hidden Markov model-based Sulfolobus [21]. In eukaryotes, the transcription of a tem-

Page 2 of 5 (page number not for citation purposes) Biology Direct 2009, 4:24 http://www.biology-direct.com/content/4/1/24

Crenarchaeota Thaumarchaeota Euryarchaeota A bacter thermautotrophicus Methanoregula boonei Korarchaeum cryptofilum Korarchaeum olobus acidocaldarius Archaeoglobus fulgidus Methanosphaera stadtmanae Methanosarcina mazei Methanosarcina mazei Methanosarcina barkeri Methanothermo Methanosarcina acetivorans Halobacterium sp. Halobacterium Haloquadratum walsbyi lacusprofundi Halorubrum Methanoculleus marisnigri Methanopyrus kandleri Methanosaeta thermophila Haloarcula marismortui Methanocorpusculum labreanum Ferroplasma acidarmanus Methanococcus maripaludis Methanococcus maripaludis Methanococcus voltae Candidatus Methanococcoides burtonii Methanococcus aeolicus Methanocaldococcus jannaschii Natronomonas pharaonis Picrophilus torridus Pyrococcus abyssi Methanospirillum hungatei Pyrococcus furiosus Pyrococcus horikoshii Nanoarchaeum equitans kodakaraensis Thermococcus Thermoplasma acidophilum Thermoplasma volcanium Pyrobaculum arsenaticum Pyrobaculum Pyrobaculum calidifontis Sulf Pyrobaculum aerophilum Pyrobaculum islandicum Staphylothermus marinus Thermoproteus neutrophilus Ignicoccus hospitalis Metallosphaera sedula maritimus Nitrosopumilus Sulfolobus tokodaii Thermofilum pendens Sulfolobus solfataricus Aeropyrum pernix Aeropyrum Caldivirga maquilingensis symbiosum Cenarchaeum Candidatus

Elf1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 histone 1 2 1 1 1 2 1 1 1 1 1 3 1 3 4 3 3 1 2 3 1 1 1 1 6 5 2 1 2 2 2 2

Eukaryota ces cerevisiae Saccharomy Plasmodium falciparum Phytophtora ramorum Phytophthora sojae Physcomitrella patens Paramecium tetraurelia Paramecium tricornutum Phaeodactylum Naegleria gruberi Monosiga brevicollis Trichomonas vaginalis Giardia lamblia sapiens Homo Leishmania major Toxoplasma gondii Toxoplasma Trypanosoma brucei Entamoeba histolytica Entamoeba Theileria annulata Thalassiosira pseudonana Drosophila melanogaster Cryptosporidium parvum merolae Cyanidioschyzon discoideum Dictyostelium Tetrahymena thermophila Tetrahymena Chlamydomonas reinhardtii Schizosaccharomyces pombe Schizosaccharomyces Takifugu rubripes Arabidopsis thaliana Caenorhabditis elegans

Elf1 1 1 1 1 1 1 2 1 2 2 1 3 1 2 1 1 1 1 1 1 1 1 1 1 2 B

A. pernix (1) GRRRRKRM--RRQV --- RTVPQV (4)FTCPRCGS-LTLSITLTQAKDE (4)KAGVKRAVAVCGNCGL-- RC-EME-VPEGLEKVDVYNLVSDAYFEGRG (20) M. sedula (1) GGKRKKR---TKIV RVK PQIPKT FECPRCGK-IAITIKFDKS--- DSGPRTAHIKCGSCGL-- ST-DLEGLPPVYDEANVYGKFLDLYLEGKL (27) S. acidocaldarius (1) GGRKKRRK--LQLQ RPK PKIPSV FECPRCGK-VTISITVKEG------IAKVMCGNCKL-- ED-QFD-VPLVYDEANAYGKFIDRYYEGKI (20) S. tokodaii (4) GGRRKRRK--QLLL RPK PKIPNT FECPRCGK-VAITVEIKNG------IAKIKCGNCGL-- ED-QFE-VPPIFDEANAYGKFIDRYFEGKI (28) S. solfataricus (1) GGKRRKR---TKII RVK PKLPKT FECPRCGK-VSVSVKIKDN------IATITCGSCGL-- YT-EVE-VPPVFDEANAYSKFIDMYLEGRL (29) S. marinus (1) GRRRKKY---RKPI RKV RKIPKI FQCPNCGS-KTLTIRLEKLN-- IPGYKKAIITCGTCGL-- HA-EMQ-VPELYGPLDVYAKFIDAFEEGNI (21) C. maquilingensis (26) GKRRKTR---KLLV RPK KRLPTV FQCPYCGS-VAVNVSVIKKE------NKVKVTCGVCGL-- EA-VFD-KVNGLLPVDYYNKFVDKYFAGEL (43) P. aerophilum (1) GRRRRRR---KVIL KPK RTLPKI FTCPHCGA-QLVSVKKTGN------GYLVICGNCGL-- RY-EFE-ERQGWMPVDYYNAFVDLYLEGKI (23) P. arsenaticum (1) GKRRKRR---KVLL RPK RTLPKI FTCPHCGA-QLVSVKKTKN------GYLVVCGNCGL-- SY-EFE-ERQGWMHVDYYNAFVDLYLEGKI (23) P. calidifontis (1) GKRRKRR---KILL KPK KTLPKI FTCPHCGA-QLVTVKKTKN------GYLVICGSCGL-- SH-EFE-ERAGWMPVDYYNAFVDLYLEGKI (18) T. neutrophilus (1) GRRRKTR---KVLL RPK KTLPKI FTCPHCGA-QLVTVKKTED------GYLVVCGNCGL-- RH-EFE-ERPGWMPVDYYNAFVDLYLEGKI (18) P. islandicum (1) GRRRKTR---KIIP KLK KTLPKI FTCPHCGA-QLVTVKKTEN------GYLVICGNCGL-- RY-EFE-ERPGWMAVDYYNAFVDLYLEGKI (16) T. pendens (1) GRRRKKRA--RRVV VVK RKLPTV FQCPRCGA-SAVGVSVKKGGRE ----NYVVVSCSNCGL-- KA-RYD-YVEYLEPVDYFSRFLDDYEAGKI (23) Candidatus K. cryptofilum (1) GRRKRRTI--TPVR R-- RRKSAF LRCPSCQH-NSITVEVSKKD------GKAIIRCYNCGL-- VR-EIE-MKPGEGKIDAYTRFFDAFIEGQA I. hospitalis ------MPTV FQCPNCGA-HALIIDIIQNK-- KTKEMKAIIRCAECGL-- VH-EEPVESPIIDRAVIYGRFIDKFYAGEI (10) A. thaliana (1) GKRKSRAK--PAPT KRM DKLDTI FSCPFCNHGSSVECIIDMKH------LIGKAACRICEE-- SF-STT-ITALTEAIDIYSEWIDECERVNT (38) P. patens (XP_001765330.1) (1) GKRKSAAK--PAPR RGQ EKLATV FSCPFCNHDNSVECRMDRKS------LIGEASCRICLE-- TF-STP-IDSLSDPIDVYSDWIDECERING (4) P. patens (XP_001763568.1) (1) GKRKSAAK--PAPR RGQ EKLATV FSCPFCNHDNSVECRMDRKN------QIGEASCRICLE-- KF-STP-IDSLSDPIDVYSDWIDECERING (4) P. sojae (312) GRRKKSTK--KIQT RKK QVVSTV FKCPFCSHDEAVECKMDRER------NIGHLSCRVCTE-- SF-QTP-IHYLSAPIDVYTDWIDECEALNA C. elegans (1) GKRKSKRKA-PTKA KAV MPLDTQ FNCPFCNHERVCEVKMDREK------NVGYISCRVCSE-- DF-QTN-INYLSEPIDVYSDWVDACEQANN (1) D. melanogaster (NP_001104426.1) (1) GRRKSKRKP-PPKR KNI EPLDQQ FNCPFCNHEKSCEVKMDKSR------NTAKITCRVCLE-- DF-QTG-INFLSEPIDVYNDWVDACETAN- H. sapiens (EAW84230.1) (22) GRRKSKRKP-PPKK KMT GTLETQ FTCPFCNHEKSCDVKMDRAR------NTGVISCTVCLE-- EF-QTP-ITYLSEPVDVYSDWIDACEAANQ T. rubripes (1) GRRKSKRKP-PPKK KMT GDLETQ FTCPFCNHEKSCDVKMERSR------NTGIISCSVCLE-- EF-QTP-ITYLSEPVDVYSDWIDACEAANQ D. melanogaster (NP_996099.1) (1) GRRKSKRKG-APRK KNI QPLPIL FDCPFCNHKQSCEAKLDKAK------KIGRITCTVCQE-- FF-QTH-INYLTEAIDVFNDWIDACEEEN- H. sapiens (OTTHUMP00000077650) (1) GRRKSKRKP-PPKK KMT GTLETQ FTCPFCNHEKSCDVKMDRAR------NTGVISCTVCLE-- EF-QTP-ITSSAHPLLSVMSWVLRLPGHQK (31) P. tricornutum ------QVQT KKR VKLSKR FKCPFCANEDTVECKMDFRA------GVGSLNCRLCAA-- AY-QMP-IHHLHEPIDVFSEWLDDCEAAAN (4) T. pseudonana (1) AKKGKKA---PVQT KKK VTLAKR FKCPFCANEDVVEAKMDLKN------GTGSLACRICSA-- SY-QMP-IHHLHEPIDVFSEWLDDCEAA-- D. discoideum (1) GKRKSSK---PPPK KVV QKLPKH FDCLFCAHSQSVDCILDREL------GKGVAKCRVCNA-- SY-ETP-IHELSDAIDVYSDWIDACESIR- N. gruberi (1) GKRATRK---PPPK KVV PKLPTS FDCPFCNNTSTVECTIDRKK------SQGRLNCQICGA-- QY-QTN-ITYITDPIDVYSEWIDECEKV-- C. merolae (1) GKRKKSSK--PVVK RVQ PKLDKR FDCPFCNQPKAVEVLLDRSTVP ----MRGRLECGVCGA-- SY-QCA-INSLSEPVDVYSDWIDTIEEVNA (6) S. cerevisiae (1) GKRKKSTR--KPTK RLV QKLDTK FNCLFCNHEKSVSCTLDKKN------SIGTLSCKICGQ-- SF-QTR-INSLSQPVDVYSDWFDAVEEVNS (63) S. pombe (1) GKRKAKAK--VKPK RRA PPLDTT FTCLFCNHEKSVSCSLDKQS------GVGNLHCKICGQ-- SH-QCL-ITALSAPIDVYSDWIDACDAVAN (25) C. reinhardtii ------PPK KAR AKLDTL FSCPFCNSSKSVSINFDRDM------GRATAKCSQCSQ-- KY-ESR-CTPLTDAVDVYHDWLDSCEEANK P. falciparum (1) ARSKVKKV--KPRK KKP LKLDKQ FNCPFCSYKKSVDIKLHRSK------GIGELACLKCGV-- KY-VNQ-ITSLDECIDVYSEWVDKCLEANK (29) C. parvum (1) GKRKTKKV--EVKK SKV PKLDKE FNCPFCNNVKTVGVRMNHKE------RLGHLSCRVCGV-- EY-TSR-IGKFDEAVDIYSNWIDKCYEVNS (166) T. vaginalis (1) GKRKKST---PKVQ KKF YRLPKN FHCPYCNCEDSVHVTMDLKN------LRADIHCVKCKE-- GVPNAK-ITKISEPIDVYDDWVDQIREQNK (65) T. thermophila (1) GGRKKTGK--KEKK LRT YAIPKV FDCENCSSDQTLHINISKKQNT ----KVAKIYCIKCKY-- ST-KYEKIGALDSAIDIYNKWCEDIQNNN- (25) P. tetraurelia (XP_001346933.1) (1) GGRRKSNR--MPKK EKV QSIKTI FDCALCGYKNCVIVKIKKII------KIAELNCDKCQV-- SF-HTK-IKGLDEAIDVYHKWLAELKQKKL (118) P. tetraurelia (XP_001458266.1) (1) GGRRKSNR--MPKK EKV QSIKTI FDCALCGYKNCVIVKIKKVI------KIAELNCDKCQA-- AF-TTK-IKGLDEAIDVYHKWLADLKQKKL (115) P. tetraurelia (XP_001450964.1) (1) GGRKRTFK--VLKK PKT MKPRAY FDCAVCGWKDCIIIKIKKCY------KTSKLECERCET-- KF-DMK-VRSLDEPIDIYHEWLHKLKTKAI (122) T. annulata (1) GKRKTKKI--KPSK(5) KNR NKPLRE FHCHYCQNDKSVWIRISKST------ATANLKCRICGI-- ES-SFP-VTSLDEPIDVYSIWVDSSRQSLS (227) L. major (XP_001682840.1) (1) GGKRKKSNNGPVKK ESK YKIPTR FDCPLCDAKASIVVRMFRAT------SDATVQCRVCGAGG TK-RWN-VLRLEKPVDVFFRFHEALVQRDH (134) T. brucei (XP_001218865.1) (1) GGTKKKATV-PIKK ESK YKIPSH FDCPFCDAKKSIAVRLKRSD------GLASVHCRVCRVGE NR-HYN-FSPLEKPVDVFFRFREELMEKDH (75) G. lamblia (61) GKKKRTRA--APPK K-- AGRSKF FPCPVCNTIESVGSERISKE------GVFVLTCKMCKE-- RY-TYK-YGLLDTPNEAHRSWGDFIYKKKK (148) L. major (XP_001682841.1) (25) GSKSSSSS--SKNK EHS(14)IHALRT FDCPVCDSKNTIRVTLNPRL------GEGTVSCTYCCSLR(8)PS-VQRFVPSLENRADVFFRFNEQYAALLR (161) T. brucei (XP_001218870.1) (44) GKGDSSGG--AKFQ KRS LQCLKK FTCPVCRMKGAVRIDVNAKE------QQAVVSCSYCMQLK(8)PF-TTSFLPKLQNRADVFFEFNELYRRLQL (101)

A.Figure Organisms 1 with Elf1 and histone orthologues identified in this study A. Organisms with Elf1 and histone orthologues identified in this study. The number in the black circle indicates the number of Elf1 and histone orthologues found. An empty circle indicates that no orthologue was detected. Histone searches were not done in for eukaryotic organisms. B. Alignment of Elf1 orthologues identified in this study. Organisms are indicated on the left. Numbers of trimmed residues are indicated in brackets at their respective position. Residues that are identical or similar to the consensus are shown with a blue or cyan background, respectively.

Page 3 of 5 (page number not for citation purposes) Biology Direct 2009, 4:24 http://www.biology-direct.com/content/4/1/24

plate is affected by its chromatin state, which can be regu- for Elf1-like proteins provided by Pfam. However, because lated by post-translational modifications of terminal tails Pfam families are built with a slightly different aim from that protrude from the histone core. In Archaea, chromatin that of the paper, we chose to iteratively build a bespoke can also impair transcription, as nucleosomes slow RNAP HMM, because this enabled us to define a species set and down in vitro [22]. Archaeal histones lack protruding tail gathering threshold which would be both sensitive and sequences [18], but Alba can be post-translationally selective for this specific task. acetylated at a lysine residue. Deacetylation of Alba by the conserved deacetylase Sir2 increases the DNA-binding Attached (Additional file 1, Figure S1) is a comparison of affinity of Alba and impairs transcription in vitro [23]. the results of searches using our HMM with those per- Thus, it appears that Archaea have the ability to regulate formed with the Elf1 HMM provided by Pfam transcription at the chromatin level in a similar way to (PF05129.5). These data demonstrate that, while both eukaryotes. Our identification of an Elf1 orthologue in models identify the true Elf1 homologues in the organ- Archaea indicates that these mechanisms might be even isms searched (labelled in red), our HMM provides a more similar than previously thought, despite major differ- greater ability to distinguish between positives (red) and ences in the composition of the chromatin template. It thus negatives (black). seems possible that a common ancestor of eukaryotes and Archaea already employed chromatin structure-modulat- Reviewer's response ing factors to regulate transcription. It will be very exciting I support publication of this manuscript with this addi- in the future to learn about a role of archaeal Elf1 in tran- tional Supplemental Figure. scription and how its function can be applied to non-his- tone chromatin templates. Reviewer 2 Dr. Eugene V. Koonin, NCBI/NLM/NIH, USA Competing interests The authors declare that they have no competing interests. Reviewer's comments This is a straightforward but quite valuable paper that Authors' contributions extends the homologous relationship between the JPD and SK conceived the study and carried out the anal- archaeal and eukaryotic transcription machineries to ysis for Elf1 and histones, respectively. All authors include the crenarcaheal orthologs of the eukaryotic chro- designed the experiments and analysed the data. JPD matin remodeling factor Elf1. The results are valid and drafted the manuscript which was read and approved by complete (see comments below for some possible exten- all authors. sions), and also rather unexpected because Elf1 is thought to function in the maintenance of eukaryote-specific chro- Reviewers' comments matin structure. This paper shows that there might be Reviewer 1 even more functional similarity between the process of Dr. Chris Ponting, University of Oxford, UK transcription in eukaryotes and some archaea (in particu- lar, Crenarchaeota) than currently thought. Reviewer's comments This manuscript describes a thorough analysis and a com- Additional material pelling argument for the existence of archaeal Elf1 ortho- logues. Whilst these findings are relatively straightforward to derive using standard database search tools, including Additional file 1 Supplementary tables and figures. PSI-BLAST and the authors' method of choice HMMer, the Click here for file manuscript provides a detailed case for the evolutionary [http://www.biomedcentral.com/content/supplementary/1745- and functional importance of these findings. One addi- 6150-4-24-S1.pdf] tion that I would recommend is for the authors to acknowledge that Pfam has already collated many or all of these orthology relationships (see http:// pfam.sanger.ac.uk/family?acc=PF05129) and differences Acknowledgements from this Pfam family need to be stated explicitly. This work was supported by the Wellcome Trust. KG is a Wellcome Trust Principal Research Fellow. JPD is supported by the EP Abraham Trust, Lin- Author's response coln College, Oxford, and the Studienstiftung des deutschen Volkes (Ger- We probably didn't make the relationship between the man National Merit Foundation). SK is supported by the BBSRC and the EPSRC. We are grateful to Stephen D. Bell for critical reading and com- Pfam Elf1 model and our HMM sufficiently clear in the ments on the manuscript. Predicted protein data sets were obtained from manuscript. As mentioned, there already exists an HMM the sources specified in Supplementary material. We thank each of these

Page 4 of 5 (page number not for citation purposes) Biology Direct 2009, 4:24 http://www.biology-direct.com/content/4/1/24

organizations and the respective genome-sequencing projects for making sequence, gene model and annotation data publicly available.

References 1. Werner F: Structure and function of archaeal RNA polymer- ases. Mol Microbiol 2007, 65(6):1395-1404. 2. Koonin EV, Makarova KS, Elkins JG: Orthologs of the small RPB8 subunit of the eukaryotic RNA polymerases are conserved in hyperthermophilic Crenarchaeota and "Korarchaeota". Biol Direct 2007, 2:38. 3. Korkhin Y, Unligil UM, Littlefield O, Nelson PJ, Stuart DI, Sigler PB, Bell SD, Abrescia NG: Evolution of Complex RNA Polymer- ases: The Complete Archaeal RNA Polymerase Structure. PLoS Biol 2009, 7(5):e102. 4. Kwapisz M, Beckouet F, Thuriaux P: Early evolution of eukaryotic DNA-dependent RNA polymerases. Trends Genet 2008, 24(5):211-215. 5. Hirata A, Klein BJ, Murakami KS: The X-ray crystal structure of RNA polymerase from Archaea. Nature 2008, 451(7180):851-854. 6. Sims RJ 3rd, Belotserkovskaya R, Reinberg D: Elongation by RNA polymerase II: the short and long of it. Genes Dev 2004, 18(20):2437-2468. 7. Svejstrup JQ: Chromatin elongation factors. Curr Opin Genet Dev 2002, 12(2):156-161. 8. Prather D, Krogan NJ, Emili A, Greenblatt JF, Winston F: Identifica- tion and characterization of Elf1, a conserved transcription elongation factor in Saccharomyces cerevisiae. Mol Cell Biol 2005, 25(22):10122-10135. 9. Katoh K, Misawa K, Kuma K, Miyata T: MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res 2002, 30(14):3059-3066. 10. Eddy SR: Profile hidden Markov models. Bioinformatics 1998, 14(9):755-763. 11. Devaux S, Kelly S, Lecordier L, Wickstead B, Perez-Morga D, Pays E, Vanhamme L, Gull K: Diversification of Function by Different Isoforms of Conventionally Shared RNA Polymerase Subu- nits. Mol Biol Cell 2007, 18(4):1293-1301. 12. DeLong EF: Archaea in coastal marine environments. Proc Natl Acad Sci USA 1992, 89(12):5685-5689. 13. Fuhrman JA, McCallum K, Davis AA: Novel major archaebacte- rial group from marine plankton. Nature 1992, 356(6365):148-149. 14. Robertson CE, Harris JK, Spear JR, Pace NR: Phylogenetic diver- sity and ecology of environmental Archaea. Curr Opin Microbiol 2005, 8(6):638-642. 15. Brochier-Armanet C, Boussau B, Gribaldo S, Forterre P: Mesophilic Crenarchaeota: proposal for a third archaeal phylum, the Thaumarchaeota. Nat Rev Microbiol 2008, 6(3):245-252. 16. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lip- man DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 1997, 25(17):3389-3402. 17. Schneider KL, Pollard KS, Baertsch R, Pohl A, Lowe TM: The UCSC Archaeal Genome Browser. Nucleic Acids Res 2006:D407-410. 18. Sandman K, Reeve JN: Archaeal chromatin proteins: different structures but common function? Curr Opin Microbiol 2005, 8(6):656-661. 19. Sandman K, Reeve JN: Archaeal histones and the origin of the histone fold. Curr Opin Microbiol 2006, 9(5):520-525. Publish with BioMed Central and every 20. Elkins JG, Podar M, Graham DE, Makarova KS, Wolf Y, Randau L, Hedlund BP, Brochier-Armanet C, Kunin V, Anderson I, et al.: A scientist can read your work free of charge korarchaeal genome reveals insights into the evolution of "BioMed Central will be the most significant development for the Archaea. Proc Natl Acad Sci USA 2008, 105(23):8102-8107. disseminating the results of biomedical research in our lifetime." 21. White MF, Bell SD: Holding it together: chromatin in the Archaea. Trends Genet 2002, 18(12):621-626. Sir Paul Nurse, Cancer Research UK 22. Xie Y, Reeve JN: Transcription by an archaeal RNA polymer- Your research papers will be: ase is slowed but not blocked by an archaeal nucleosome. J Bacteriol 2004, 186(11):3492-3498. available free of charge to the entire biomedical community 23. Bell SD, Botting CH, Wardleworth BN, Jackson SP, White MF: The peer reviewed and published immediately upon acceptance interaction of Alba, a conserved archaeal chromatin protein, with Sir2 and its regulation by acetylation. Science 2002, cited in PubMed and archived on PubMed Central 296(5565):148-151. yours — you keep the copyright

Submit your manuscript here: BioMedcentral http://www.biomedcentral.com/info/publishing_adv.asp

Page 5 of 5 (page number not for citation purposes)