life

Article Natural Product Gene Clusters in the Filamentous Cyanobacterium HT-58-2

Xiaohe Jin 1,*, Eric S. Miller 2 and Jonathan S. Lindsey 1

1 Department of Chemistry, North Carolina State University, Raleigh, NC 27695-8204, USA; [email protected] 2 Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC 27695-7615, USA; [email protected] * Correspondence: [email protected]

Abstract: are known as rich repositories of natural products. One cyanobacterial- microbial consortium (isolate HT-58-2) is known to produce two fundamentally new classes of natural products: the tetrapyrrole pigments tolyporphins A–R, and the diterpenoid compounds tolypodiol, 6-deoxytolypodiol, and 11-hydroxytolypodiol. The genome (7.85 Mbp) of the Nostocales cyanobacterium HT-58-2 was annotated previously for tetrapyrrole biosynthesis genes, which led to the identification of a putative biosynthetic gene cluster (BGC) for tolyporphins. Here, bioinformatics tools have been employed to annotate the genome more broadly in an effort to identify pathways for the biosynthesis of tolypodiols as well as other natural products. A putative BGC (15 genes) for tolypodiols has been identified. Four BGCs have been identified for the biosynthesis of other natural products. Two BGCs related to nitrogen fixation may be relevant, given the association of nitrogen  stress with production of tolyporphins. The results point to the rich biosynthetic capacity of the  HT-58-2 cyanobacterium beyond the production of tolyporphins and tolypodiols. Citation: Jin, X.; Miller, E.S.; Lindsey, J.S. Natural Product Gene Clusters in Keywords: anatoxin-a/homoanatoxin-a; hapalosin; heterocyst glycolipids; natural products; sec- the Filamentous Nostocales ondary metabolites; shinorine; tolypodiols; tolyporphins Cyanobacterium HT-58-2. Life 2021, 11, 356. https://doi.org/10.3390/ life11040356 1. Introduction Academic Editors: Paul M. D’Agostino, Nádia Eusébio and Tolyporphins represent a class of compounds in the pigments of life family with struc- Ângela Brito tural features distinct from other prominent constituents, including heme, chlorophylls, bacteriochlorophylls, cobalamin, and coenzyme F430. Tolyporphin A, the first member of Received: 14 March 2021 the tolyporphins family, was found in the lipophilic extract of a cyanobacterial sample. The Accepted: 15 April 2021 sample, HT-58-2, was obtained from Nan Madol on the island of Pohnpei in Micronesia [1]. Published: 18 April 2021 The discovery and identification of tolyporphin A originated with a broad screen of diverse cyanobacteria for anti-cancer activity [2]. Indeed, tolyporphin A was found to exhibit efflux Publisher’s Note: MDPI stays neutral pump inhibition and photocytotoxicity toward tumor cells [3–5]. with regard to jurisdictional claims in An astonishing aspect of the discovery of tolyporphin A is the presence of a compound published maps and institutional affil- with a dioxobacteriochlorin chromophore in a cyanobacterial sample. Cyanobacteria iations. employ chlorophyll, not bacteriochlorophyll, for , and the presence of compounds with a bacteriochlorin chromophore in a chlorin-based photosynthetic system is unprecedented. Over the years, fractionation of lipophilic extracts from the HT-58-2 culture has led to the identification of a family of tolyporphins, now numbering 18 (A– Copyright: © 2021 by the authors. R) [6–8] (Figure1). Most of the tolyporphins are dioxobacteriochlorins, but three are Licensee MDPI, Basel, Switzerland. oxochlorins, and one is a porphyrin. The structural diversity suggests an in vivo role for This article is an open access article tolyporphins as secondary metabolites in as-yet undefined defense processes. If so, this distributed under the terms and represents a fundamentally new biological function in the pigments of life family. To date, conditions of the Creative Commons the HT-58-2 culture remains the only known producer of tolyporphins. Attribution (CC BY) license (https:// creativecommons.org/licenses/by/ 4.0/).

Life 2021, 11, 356. https://doi.org/10.3390/life11040356 https://www.mdpi.com/journal/life Life 2021, 11, x FOR PEER REVIEW 2 of 18

Life 2021, 11, 356 2 of 17 (http://creativecommons.org/li- censes/by/4.0/).

O R

N HN N HN N HN

NH N NH N R NH N R O O Tolyporphins A -J , L-O Tolyporphin K , Q , R Tolyporphin P R = C -glycoside, OH or OAc R = C -glycoside, OH or OAc

OH OH OH OH Me Me Me Me C -glycosides O O O O

OAc OH HO OH OH HO

Figure 1. StructuresStructures of of dioxobacteriochlorins dioxobacteriochlorins (tolyporphins A A–J,–J, L L–O),–O), oxochlorins (tolyporphins K, Q, R), and a porphyrin (tolyporphin P). The π-chromophore-chromophore isis shownshown inin magenta,magenta, green,green, andand red,red, respectively.respectively.

ToTo pursue questions of biosynthesis and in vivo functionfunction of tolyporphins, in late 2015 we obtained obtained the the tolyporphin-producing tolyporphin-producing culture culture HT-58-2. HT-58 Subsequent-2. Subsequent studies stud haveies haverevealed re- vealedthe following: the following: (1) the HT-58-2(1) the HT culture-58-2 isculture dominated is dominated by a single by filamentousa single filamentous cyanobacterium cyano- bacteriumin a non-axenic in a non cyanobacterial–microbial-axenic cyanobacterial consortium–microbial (Figureconsortium2A,B) (Figure [ 9,10]; (2) 2A several-weeks,B) [9,10]; (2) severalgrowth-weeks under growth stress imparted under stress by theimparted deprivation by the ofdeprivation aqueous-soluble of aque nitrateous-soluble profoundly nitrate profoundlyincreases the increases production the of production tolyporphins, of tolyporphins,reaching a level reaching rivalling a that level of rivalling chlorophyll that [11 of]; chlorophyll(3) tolyporphins [11]; are(3) presenttolyporphins in the sheathare present and cellin the septa sheath of the and filamentous cell septa cyanobacterium, of the filamen- tousas revealed cyanobacterium, by hyperspectral as revealed confocal by hyperspectral fluorescence confocal imaging fluorescence [12]; (4) the imaging cyanobacterium [12]; (4) thecontains cyanobacterium a circular genome contains (7.85 a circular Mbp) withgenome genes (7.85 for Mbp) biosynthesis with genes of heme, for biosynthesis chlorophyll ofa, heme,phycocyanobilin, chlorophyll anda, phycocyanobilin, cobalamin (in part), and ascobalamin well as a (in putative part), as biosynthetic well as a putative gene cluster bio- synthetic(here termed gene BGC-T) cluster for (here the termed biosynthesis BGC- ofT) tolyporphinsfor the biosynthesis (Figure 2ofE) tolyporphins [9]. Notably, (Figure BGC-T 2E)contains [9]. Notably, all genes BGC in the-T corecontains pathway all genes (from in L-glutamic the core pathway acid to protoporphyrinogen (from L-glutamic acid IX) to of protoporphyrinogentetrapyrrole biosynthesis, IX) of except tetrapyrrolehemD, whichbiosynthesis, is located except elsewhere hemD, inwhich the genome. is located else- whereIn in an the initial genome. report in 1992 [1], the Nostocales cyanobacterium HT-58-2 was categorized as Tolypothrix nodosa on the basis of visual inspection of the characteristic filamentous morphology (Figure2C,D). The term “tolypothrix” refers to “a hairy ball of yarn,” whereas “nodosa” denotes the presence of nodules, which are believed to be responsible for nitrogen fixation. The availability of the sequence information enabled phylogenomic analysis, which showed the HT-58-2 cyanobacterium to be more closely aligned with the genus Brasilonema according to 16S rRNA [9]. Scrutiny of the lipophilic extracts of the HT-58-2 culture has revealed other new natural products. The natural products include tolypodiol (Figure3), the first diterpenoid compound obtained from cyanobacteria [13]. A monoacetate derivative of tolypodiol was subsequently synthesized by chemical means [13]. Both tolypodiol and the synthetic O-acetate derivative showed potent anti-inflammatory activity in a mouse ear edema assay. Recently, two tolypodiol analogues, 6-deoxytolypodiol and 11-hydroxytolypodiol, were also found in extracts from the HT-58-2 culture. The structures and absolute configuration of the two tolypodiol analogues were determined, and the compounds were assayed for anti-inflammatory activity [14]. Prior work [9] annotating the genome of the Nostocales HT-58-2 suggested numerous genes for diverse functions, but did not delve deeply into the BGCs of natural products, including those for tolypodiols.

Life 2021, 11, 356 3 of 17 Life 2021, 11, x FOR PEER REVIEW 3 of 18

Life 2021, 11, x FOR PEER REVIEW 4 of 18

FigureFigure 2. ((AA)) Optical Optical image image of of the the HT-58-2 HT-58-2 culture culture on on a a BG-11 BG-11 agar agar plate plate shows shows community community bacteria (red (red arrows) arrows) in in the the presence of filamentous cyanobacteria.Recently, two (B) Scanningtolypodiol electron analogues, microscopy 6-deoxytolypodiol image shows community and 11-hydroxytolypodiol, bacteria (red arrows) were presence of filamentous cyanobacteria. (B) Scanning electron microscopy image shows community bacteria (red arrows) attached to the sheath of thealso filamentous found in bacteria. extracts (C from,D) Optical the HT micrographs-58-2 culture. show The the structures clumping of and HT-58-2 absolute cyanobacteria configuration attached to the sheath of the filamentous bacteria. (C,D) Optical micrographs show the clumping of HT-58-2 cyanobacteria (Neofluar 40X 0.75, Zeiss) grownof the in two BG-11 tolypodiol for 35 days. analogues (E) Genes identifiedwere determined, for tetrapyrrole and the biosynthesis compounds pathways were (termedassayed for (Neofluar 40X 0.75, Zeiss) grown in BG-11 for 35 days. (E) Genes identified for tetrapyrrole biosynthesis pathways (termed as hem genes) are distributedanti throughout-inflammatory the genome activity of [14].the Nostocales Prior work HT-58-2 [9] annotating (7.85 Mbp, thegreen genome rings are of codingthe Nostocales se- as hem genes) are distributed throughout the genome of the Nostocales HT-58-2 (7.85 Mbp, green rings are coding sequences), quences), except in the proposedHT-58 BGC-2 suggested for tolyporphins numerous (BGC-T). genes for diverse functions, but did not delve deeply into except in the proposed BGC for tolyporphins (BGC-T). the BGCs of natural products, including those for tolypodiols. In an initial report in 1992 [1], the Nostocales cyanobacterium HT-58-2 was categorized as Tolypothrix nodosa on the basis of visual inspection of the characteristic filamentous morphology (Figure 2C,D). The term “tolypothrix” refers to “a hairy ball of yarn,” whereas “nodosa” denotes the presence of nodules, which are believed to be responsible for . The availability of the sequence information enabled phylogenomic analysis, which showed the HT-58-2 cyanobacterium to be more closely aligned with the genus Brasilonema according to 16S rRNA [9]. Scrutiny of the lipophilic extracts of the HT-58-2 culture has revealed other new nat- FigureFigure 3. 3. StructureStructureural of ofproducts. tolypodiol tolypodiol The and and naturaltwo two analogues analogues products isolated isolated include from from tolypodiolcyanobacterial cyanobacterial (Figure strain strain 3),HT HT-58-2. -the58- 2.first diterpenoid compound obtained from cyanobacteria [13]. A monoacetate derivative of tolypodiol was subsequentlyIn this paper, synthesized we report by anchemical evaluation mean ofs genes[13]. Boththat supporttolypodiol the and biosynthesis the synthetic of toly- O- acetatepodiols derivativeas well as othershowed natural potent products, anti-inflammatory deepening theactivity coding in apotential mouse earsurvey edema of the assay. ge- nome that was initially reported [9]. A large impetus for this work stems from two facts: (1) the in vivo functions of tolyporphins and tolypodiols are not known; and (2) the biosynthe- sis of tolyporphins depends markedly on environmental conditions. The production of myr- iad natural products by cyanobacteria can be prompted by a host of environmental stimuli [15,16]. The issue then arises as to the biosynthetic capacity of HT-58-2, i.e., what other nat- ural products might be produced either under normal or stressed conditions?

2. Materials and Methods The HT-58-2 sample was incubated in BG-11 medium as described previously [9], under continuous white light (62 µmol m−2 s−1) at 28 °C with shaking at 120 rpm. BG-11 medium provides aqueous-soluble nitrogen in the form of NaNO3. Analysis of genes for the biosynthesis of natural products within the genome of Nos- tocales HT-58-2 (accession number: CP019636) was performed by the use of AntiSMASH [17], PRISM [18], and ARTS [19] with default settings. Putative BGCs for tolypodiols and nitrogen fixation were annotated and labeled manually based on protein homology searching and conserved domain analysis via the BLASTP program [20]. Individual gene or protein alignments were performed with Clustal Omega [21] and formatted with ES- Pript 3.0 [22]. GenBank accession numbers pertaining to the hapalosin BGC from sp. PCC 9431 for protein alignment are as follows: HapA (WP_026723805), HapB (WP_035121546), HapC (WP_081656241), HapD (WP_051206727), and HapE (WP_035122279). GenBank accession numbers for the anatoxin-a BGC from Oscillatoria sp. PCC 6506 are as follows: AnaJ (ACR33072), AnaA (ACR33073), AnaB (ACR33074), AnaC (ACR33075), AnaD (ACR33076), AnaE (ACR33077), AnaF (ACR33078), and AnaG (ACR33079). GenBank accession numbers for the shinorine BGC from variabilis ATCC 29,413 are as follows: Ava_3855 (ABA23460), Ava_3856 (ABA23461), Ava_3857 (ABA23462), and Ava_3858 (ABA23463). GenBank accession numbers for the HGs BGC from sp. ‘Peltigera membra- nacea cyanobiont’ are as follows: HetI (AGJ76607), SDR (AGJ76606), HglB (AGJ76605), PfaD (AGJ76604), HglC (AGJ76603), HglG (AGJ76602), and HglE (AGJ76601).

3. Results 3.1. Putative Tolypodiols BGC

Life 2021, 11, 356 4 of 17

In this paper, we report an evaluation of genes that support the biosynthesis of tolypodiols as well as other natural products, deepening the coding potential survey of the genome that was initially reported [9]. A large impetus for this work stems from two facts: (1) the in vivo functions of tolyporphins and tolypodiols are not known; and (2) the biosynthesis of tolyporphins depends markedly on environmental conditions. The production of myriad natural products by cyanobacteria can be prompted by a host of environmental stimuli [15,16]. The issue then arises as to the biosynthetic capacity of HT-58-2, i.e., what other natural products might be produced either under normal or stressed conditions?

2. Materials and Methods The HT-58-2 sample was incubated in BG-11 medium as described previously [9], under continuous white light (62 µmol m−2 s−1) at 28 ◦C with shaking at 120 rpm. BG-11 medium provides aqueous-soluble nitrogen in the form of NaNO3. Analysis of genes for the biosynthesis of natural products within the genome of Nosto- cales HT-58-2 (accession number: CP019636) was performed by the use of AntiSMASH [17], PRISM [18], and ARTS [19] with default settings. Putative BGCs for tolypodiols and nitro- gen fixation were annotated and labeled manually based on protein homology searching and conserved domain analysis via the BLASTP program [20]. Individual gene or protein alignments were performed with Clustal Omega [21] and formatted with ESPript 3.0 [22]. GenBank accession numbers pertaining to the hapalosin BGC from Fischerella sp. PCC 9431 for protein alignment are as follows: HapA (WP_026723805), HapB (WP_035121546), HapC (WP_081656241), HapD (WP_051206727), and HapE (WP_035122279). GenBank accession numbers for the anatoxin-a BGC from Oscillatoria sp. PCC 6506 are as follows: AnaJ (ACR33072), AnaA (ACR33073), AnaB (ACR33074), AnaC (ACR33075), AnaD (ACR33076), AnaE (ACR33077), AnaF (ACR33078), and AnaG (ACR33079). GenBank accession numbers for the shinorine BGC from ATCC 29,413 are as follows: Ava_3855 (ABA23460), Ava_3856 (ABA23461), Ava_3857 (ABA23462), and Ava_3858 (ABA23463). GenBank accession numbers for the HGs BGC from Nostoc sp. ‘Peltigera membranacea cyanobiont’ are as follows: HetI (AGJ76607), SDR (AGJ76606), HglB (AGJ76605), PfaD (AGJ76604), HglC (AGJ76603), HglG (AGJ76602), and HglE (AGJ76601).

3. Results 3.1. Putative Tolypodiols BGC To pursue the biosynthesis of tolypodiols, a search for BGCs related to terpenoids in the Nostocales HT-58-2 was performed (hereafter in this study, “HT-58-2” will refer to the cyanobacterium unless otherwise noted). The biosynthesis of the terpenoid skeleton in cyanobacteria proceeds via a non-mevalonate pathway, termed the methylerythritol-4- phosphate (MEP) pathway. The pathway is shown in Figure4[ 23,24]. Specific genes that encode enzymes involved in the MEP pathway were annotated throughout the genome of HT-58-2; the annotation was performed manually according to protein homology. A putative BGC of ~17 kbp with genes that encode almost the entire MEP pathway was found at 38,701–55,799 bp of the HT-58-2 genome (CP019636). Two of the requisite eight genes were absent, which coded for 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase (MCT) and 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase (MDS), corresponding to the third and fifth steps, as shown in Figure4. The two genes missing in the BGC, ispD and ispF, occur elsewhere in the genome. The other six genes in the MEP pathway, including dxs, dxr, ispE, ispG, ispH, and crtE, appear both in the BGC and at least once elsewhere distributed in the genome of HT-58-2 (Figure5). This situation resembles that of the putative BGC for tolyporphins, where a subset of genes within the BGC-T also appear elsewhere in the genome. Life 2021, 11, x FOR PEER REVIEW 5 of 18

To pursue the biosynthesis of tolypodiols, a search for BGCs related to terpenoids in the Nostocales HT-58-2 was performed (hereafter in this study, “HT-58-2” will refer to the cyanobacterium unless otherwise noted). The biosynthesis of the terpenoid skeleton in cyanobacteria proceeds via a non-mevalonate pathway, termed the methylerythritol-4- phosphate (MEP) pathway. The pathway is shown in Figure 4 [23,24]. Specific genes that encode enzymes involved in the MEP pathway were annotated throughout the genome of HT-58-2; the annotation was performed manually according to protein homology. A putative BGC of ~17 kbp with genes that encode almost the entire MEP pathway was found at 38,701–55,799 bp of the HT-58-2 genome (CP019636). Two of the requisite eight genes were absent, which coded for 2-C-methyl-D-erythritol 4-phosphate cytidylyltrans- ferase (MCT) and 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase (MDS), corre- sponding to the third and fifth steps, as shown in Figure 4. The two genes missing in the BGC, ispD and ispF, occur elsewhere in the genome. The other six genes in the MEP path- way, including dxs, dxr, ispE, ispG, ispH, and crtE, appear both in the BGC and at least once Life 2021, 11, 356 elsewhere distributed in the genome of HT-58-2 (Figure 5). This situation resembles5 that of 17 of the putative BGC for tolyporphins, where a subset of genes within the BGC-T also ap- pear elsewhere in the genome.

Figure 4. TerpenoidTerpenoid biosynthesis via the MEP pathway in bacteria. Proteins Proteins (genes) (genes) labeled red red are present in the putative tolypodioltolypodiolss BGC in HT HT-58-2.-58-2. Carboxylates Carboxylates and and phosphates phosphates are are shown shown in in the the protonated protonated (unionized) (unionized forms.) forms.

The assigned tolypodiols BGC is shown in Figure6. In addition to the six genes for enzymes corresponding to the biosynthesis of the terpenoid backbone, four additional genes were identified that are expected to encode enzymes of the ubiquinone/terpenoid- quinone pathway. The latter pathway entails the attachment of an all-trans-polyprenyl unit (such as geranylgeranyl) to the 3-position of 4-hydroxybenzoate. The arene motif in the resulting polyprenyl benzoquinol has intriguing structural resemblance to the arene in tolypodiols. The corresponding enzymes UbiC, UbiA, UbiH, and UbiE are known to catalyze the reactions shown in Figure7[ 25–28]. The product from the backbone MEP pathway of terpenoid biosynthesis, geranylgeranyl diphosphate, might participate in the biosynthesis of terpenoid-quinone compounds (such as tolypodiols), given the co- clustering of ubi/isp/dxr/dxs/crtE genes. The list of aligned proteins and corresponding genes is provided in Table1. Life 2021, 11, 356 6 of 17 Life 2021, 11, x FOR PEER REVIEW 6 of 18

Figure 5. TolypodiolFigure 5.s BGCTolypodiols and distribution BGC and of distributiongenes for the ofMEP genes pathway for the of MEPterpenoid pathway biosynthesis of terpenoid in the biosynthesisgenome of HT in- the genome 58-2. of HT-58-2.

Table 1. Aligned proteins from the putative tolypodiols BGC in HT-58-2 a. The assigned tolypodiols BGC is shown in Figure 6. In addition to the six genes for HT-58-2 Accession Aligned Protein (Gene) enzymes corresponding to the biosynthesis of the terpenoid backbone, four additional b WP_015211021genes were identified that are expected2-polyprenyl-6-methoxyphenol to encode enzymes of the hydroxylaseubiquinone/terpenoid (ubiH) - ARV62772 4-hydroxybenzoate polyprenyltransferase (ubiA) quinone pathway. The latter pathway entails the attachment of an all-trans-polyprenyl ARV57253 chorismate lyase (ubiC) ARV57254unit (such as geranylgeranyl) to the 3-position 1-deoxy-D-xylulose-5-phosphate of 4-hydroxybenzoate.synthase The arene (dxs motif) in ARV57255the resulting polyprenyl benzoquinol 1-hydroxy-2-methyl-2-(E)-butenyl has intriguing structural4-diphosphate resemblance synthaseto the arene (ispG ) ARV62773in tolypodiols. The corresponding enzymes geranylgeranyl UbiC, UbiA, pyrophosphate UbiH, and synthase UbiE are (crtE) known to ARV57256catalyze the reactions shown in Figure 7 [25–28]. The hypothetical product proteinfrom the backbone MEP ARV57257pathway of terpenoid biosynthesis, phenylpropionate geranylgeranyl dioxygenase diphosphate, or ring-hydroxylating might participate dioxygenase in the ( hcaE) ARV57258 ubiquinone/menaquinone biosynthesis C-methylase (ubiE) biosynthesis of terpenoid-quinone compounds (such as tolypodiols), given the co-cluster- ARV57259 4-Hydroxy-3-methylbut-2-enyl diphosphate reductase (ispH) ARV57260ing of ubi/isp/dxr/dxs/crtE genes. The list of aligned proteins cytochrome and P450 corresponding genes is ARV62774provided in Table 1. 4-diphosphocytidyl-2C-methyl-D-erythritol kinase (ispE) ARV62775 dephospho-CoA kinase (coaE) ARV57261 1-deoxy-D-xylulose 5-phosphate reductoisomerase (dxr) ARV57262 Dienelactone hydrolase a All E-values of aligned proteins are <2.8e-10. b E-value is 1.6e-04.

Figure 6. Assigned tolypodiols BGC at region 38,701–55,799 (17,099 bp) in HT-58-2. Genes encoding enzymes involved in the MEP pathway for terpenoid backbone biosynthesis are in dark blue, while those for ubiquinone/terpenoid-quinone are in light blue.

Life 2021, 11, x FOR PEER REVIEW 6 of 18

Figure 5. Tolypodiols BGC and distribution of genes for the MEP pathway of terpenoid biosynthesis in the genome of HT- 58-2. Life 2021, 11, x FOR PEER REVIEW 7 of 18

The assigned tolypodiols BGC is shown in Figure 6. In addition to the six genes for Tableenzymes 1. Aligned correspond proteinsing from to the putative biosynthesis tolypodiol of thes BGC terpenoid in HT-58 backbone,-2 a. four additional genes were identified that are expected to encode enzymes of the ubiquinone/terpenoid- HT-58-2 Accession Aligned Protein (Gene) quinone pathway. The latter pathway entails the attachment of an all-trans-polyprenyl b unitWP_015211021 (such as ger anylgeranyl)2-polyprenyl to the 3-position-6-methoxyphenol of 4-hydroxybenzoate. hydroxylase The (ubiH arene) motif in the resultingARV62772 polyprenyl benzoquinol4-hydroxybenzoate has intriguing polyprenyltransferase structural resemblance (ubiA to) the arene in tolypodiols.ARV57253 The corresponding enzymeschorismate UbiC, UbiA, lyase UbiH, (ubiC )and UbiE are known to catalyzeARV57254 the reactions shown 1in-deoxy Figure-D 7-xylulose [25–28].- 5The-phosphate product synthasefrom the ( dxsbackbone) MEP pathwayARV57255 of terpenoid 1 biosynthesis,-hydroxy-2-methyl geranylgeranyl-2-(E)-butenyl diphosphate, 4-diphosphate might synthase participate (ispG in )the Life 2021, 11, 356 biosynthesisARV62773 of terpenoid-quinonegeranylgeranyl compounds pyrophosphate (such as tolypodiols) synthase, given (crtE) the co-cluster-7 of 17 ing ARV57256of ubi/isp/dxr/dxs/crtE genes. The list of hypotheticalaligned proteins protein and corresponding genes is provided in Table 1. phenylpropionate dioxygenase or ring-hydroxylating dioxygenase ARV57257 (hcaE) ARV57258 ubiquinone/menaquinone biosynthesis C-methylase (ubiE) ARV57259 4-Hydroxy-3-methylbut-2-enyl diphosphate reductase (ispH) ARV57260 cytochrome P450 ARV62774 4-diphosphocytidyl-2C-methyl-D-erythritol kinase (ispE) ARV62775 dephospho-CoA kinase (coaE) FigureFigure 6.6. AssignedAssigned tolypodiolstolypodiols BGCBGC atat regionregion 38,701–55,79938,701–55,799 (17,099(17,099 bp)bp) inin HT-58-2.HT-58-2. GenesGenes encodingencoding enzymesenzymes involvedinvolved inin ARV57261 1-deoxy-D-xylulose 5-phosphate reductoisomerase (dxr) thethe MEPMEP pathway pathway for for terpenoid terpenoid backbone backbone biosynthesis biosynthesis are are in darkin dark blue, blue, while while those those for ubiquinone/terpenoid-quinonefor ubiquinone/terpenoid-quinone are ARV57262 Dienelactone hydrolase inare light in light blue. blue. a All E-values of aligned proteins are <2.8e-10. b E-value is 1.6e-04.

Figure 7. Reactions catalyzed by enzymes in thethe ubiquinone/terpenoid-quinoneubiquinone/terpenoid-quinone pathway. pathway. All genes for the Ubi enzymes shown here are present inin thethe putativeputative tolypodiolstolypodiols BGCBGC inin HT-58-2.HT-58-2.

3.2. BGCs for Diverse Natural Products In the genome of HT-58-2, eighteen clusters of genes were identified through the use of AntiSMASH (Table2). Further analysis showed that four of the eighteen clusters

(hereafter, BGCs) aligned with relatively high similarity (>50%) with known BGCs for the following compounds: (1) hapalosin; (2) anatoxin-a/homoanatoxin-a; (3) shinorine; and (4) heterocyst glycolipids (Figure8). In-depth examination concerning the four BGCs is provided in the following section. Life 2021, 11, 356 8 of 17

Table 2. Putative BGCs for natural products identified in HT-58-2 by AntiSMASH.

No. Region (bp) Length (nt) Type Similar Known Cluster 1 308,169–329,626 21,458 lassopeptide vioprolides, 2 344,287–393,016 49,759 NRPS a xenoamicins, cyanopeptin 3 855,604–866,222 10,619 RiPP-like b 4 * 958,778–1,011,938 53,161 hglE-KS c, T1PKS d heterocyst glycolipids 5 1,080,296–1,132,473 52,178 T1PKS carbamidocyclophanes 6 1,225,583–1,267,343 41,761 phosphonate, terpene 7 1,620,824–1,643,303 22,480 lassopeptide 8 1,748,080–1,798,143 50,064 NRPS hapalosin 9 * 2,268,098–2,291,911 240,666 NRPS, T1PKS hapalosin 10 2,303,699–2,508,763 20,507 NRPS, T1PKS nostopeptolide A2 11 * 2,683,411–2,768,967 85,557 T1PKS anatoxin-a 12 3,624,228–3,644,311 20,084 terpene 13 3,794,851–3,850,264 55,414 T1PKS chondrochloren A 14 4,005,987–4,016,220 10,234 bacteriocin 15 * 4,105,987–4,016220 42,348 NRPS hexose-palythine-serine 16 4,214,166–4,256,226 41,891 bacteriocin 17 4,262,518–4,353,871 91,354 NRPS, lanthipeptide nostopeptolide A2 18 5,824,405–5,845,337 20,933 terpene * Clusters of known BGCs wherein >4 core genes were identified with high similarity. a Non-ribosomal peptide synthetase cluster. b Other Life 2021unspecified, 11, x FOR ribosomally PEER REVIEW synthesized and post-translationally modified peptide product clusters. c Heterocyst glycolipid synthase-like9 of 18 polyketide synthase (PKS). d Type I polyketide synthase (PKS).

Figure 8. Natural products derived from putative BGCs (>50% similaritysimilarity with reported ones) in HT-58-2.HT-58-2.

3.2.1. Hapalosin BGC Hapalosin is a cyclodepsipeptide that has been shown to cause the reversal of multi multi-- drug resistance in tumor cell lines [[29].29]. The compound has been detected from lipophiliclipophilic extracts of of three three cyanobacterial cyanobacterial strains: strains:Hapalosiphon Hapalosiphon welwitschii welwitschiiUH strain UH IC-52-3, strain Westiella IC-52-3, intricataWestiella UHintricata strain UH HT-29-1, strain andHT-29Fischerella-1, and Fischerellasp. PCC 9431.sp. PCC The 9431. UH strains The UH were strains collected were bycollected the same by the team same at the team University at the University of Hawaii of that Hawaii discovered that discovered HT-58-2 asHT part-58-2 of as a part global of searcha global for search novel naturalfor novel products natural from products diverse from cyanobacteria diverse cyanobacteria [2]. Cloning and[2]. heterologousCloning and expressionheterologous of theexpression majority of of the the hapalosinmajority of BGC the fromhapalosinFischerella BGCsp. from PCC Fischerella 9431 was sp. recently PCC described9431 was recently [30]. described [30]. In HT-58-2, a non-ribosomal peptide synthase/Type I polyketide synthase (NRPS/T1PKS)-related gene cluster was found spanning 240 kb (region nine), as shown in Table 2. More than ten T1PKS/NRPS domains are arranged in this region, which are likely involved in the biosynthesis of cyclic peptide–polyketides or lipopeptides, such as the cy- clodepsipeptide hapalosin. Five genes in the BGC aligned with all five members from the hapalosin BGC from Fischerella sp. PCC 9431 (Figure 9) [31]. The overall percent identity between amino acid sequences of HapA-E in the two BGCs is around 50–60% with an E- value of 0.0 as given by BLASTP; the module domains of PKS and NRPS from HT-58-2 differed from those in Fischerella sp. PCC 9431 (Table 3). The distinct difference in se- quences and modules suggests that hapalosin might not be the only product produced by this long BGC. A second group of hapalosin genes was identified (entry eight, Table 2), but the corresponding pathway appears to be incomplete.

Life 2021, 11, x FOR PEER REVIEW 9 of 18

Figure 8. Natural products derived from putative BGCs (>50% similarity with reported ones) in HT-58-2.

3.2.1. Hapalosin BGC Hapalosin is a cyclodepsipeptide that has been shown to cause the reversal of multi- drug resistance in tumor cell lines [29]. The compound has been detected from lipophilic extracts of three cyanobacterial strains: Hapalosiphon welwitschii UH strain IC-52-3, Westiella intricata UH strain HT-29-1, and Fischerella sp. PCC 9431. The UH strains were collected by the same team at the University of Hawaii that discovered HT-58-2 as part of Life 2021, 11, 356 9 of 17 a global search for novel natural products from diverse cyanobacteria [2]. Cloning and heterologous expression of the majority of the hapalosin BGC from Fischerella sp. PCC 9431 was recently described [30]. InIn HT-58-2, HT-58- a2, non-ribosomal a non-ribosomal peptide synthase/Typepeptide synthase/Type I polyketide Isynthase polyketide (NRPS/T1PKS)- synthase related(NRPS/T1PKS) gene cluster-related was gene found cluster spanning was found 240 kb spanning (region nine),240 kb as (region shown nine in Table), as shown2. More in thanTable ten 2. T1PKS/NRPSMore than ten domainsT1PKS/NRPS are arranged domains in are this arranged region, which in this are region, likely which involved are inlikely the biosynthesisinvolved in the of cyclicbiosynthesis peptide–polyketides of cyclic peptide or– lipopeptides,polyketides or such lipopeptides, as the cyclodepsipeptide such as the cy- hapalosin.clodepsipeptide Five hapalosin. genes in the Five BGC genes aligned in the BGC with aligned all five with members all five from members the hapalosin from the BGChapalosin from BGCFischerella from sp.Fischerella PCC 9431 sp. PCC (Figure 94319)[ (Figure31]. The 9) overall[31]. The percent overall identity percent between identity aminobetween acid amino sequences acid sequences of HapA-E of inHapA the- twoE in BGCsthe two is aroundBGCs is 50–60% around with50–60% an E-valuewith an ofE- 0.0value as givenof 0.0 byas given BLASTP; by BLASTP; the module the domainsmodule domains of PKS and of PKS NRPS and from NRPS HT-58-2 from differedHT-58-2 fromdiffered those from in Fischerella those in Fischerellasp. PCC 9431 sp. PCC(Table 94313). The (Table distinct 3). The difference distinct in difference sequences in and se- modulesquences and suggests modules that suggests hapalosin that might hapalosin not be might the onlynot be product the only produced product produced by this long by BGC.this long A second BGC. A group second of group hapalosin of hapalosin genes was genes identified was identified (entry eight, (entry Table eight2),, Table but the 2), correspondingbut the corresponding pathway pathway appears appears to be incomplete. to be incomplete.

Figure 9. The putative hapalosin BGC identified from HT-58-2 (middle), aligned with that from Fischerella sp. PCC 9431 (top). Genes are color-coded to represent similarities. The lower panel shows the expansion of the 75.7 kb region of Nostocales HT-58-2 that contains 10 genes for NRPS/PKS systems.

Table 3. Alignment between proteins of the putative HT-58-2 hapalosin BGC and those from Fisherella sp. PCC 9431.

HT-58-2 Accession Aligned Protein E-Value % Identity * ARV58870 HapA, AMP-dependent synthetase 0.0 60.2% ARV58846 HapB, polyketide synthetase 0.0 51.6% ARV58869 HapC, non-ribosomal peptide synthetase 0.0 58.8% ARV58868 HapD, non-ribosomal peptide synthetase 0.0 63.6% ARV62947 HapE, polyketide synthetase 0.0 65.3% * The percent identity included >75% of aligned protein length from the two BGCs, except for HapD (42%).

3.2.2. Anatoxin-a/Homoanatoxin-a BGC Anatoxin-a belongs to a class of toxins produced by cyanobacteria that have been reported as threats to humans and animals [32,33]. Anatoxin-a was first isolated from Anabaena flos-aquae in 1977 [34], and several analogues, including homoanatoxin-a, dihydroantoxin-a, and dihydrohomoanatoxin-a, were later identified from multiple genera of cyanobacteria [35]. Known as Very Fast Death Factor (VFDF), anatoxin-a can cause tremors, paralysis, and death within a few minutes when injected into the body cavity of mice [36], and also has high acute oral toxicity [37]. Thus, monitoring water supplies for the presence of cyanobacteria that produce anatoxins has become an important issue. However, due to the limitations of many traditional means of observations such as light microscopy, it is preferable to employ genomic methods to identify cyanobacteria that have the potential to produce anatoxins. A putative anatoxin-a/homoanatoxin-a BGC was found between nucleotides 2,729,361 and 2,752,515 bp in the genome of HT-58-2 (Figure 10). Aligned with the reported ana BGC from Oscillatoria sp. PCC 6506 [38,39], most genes (anaA-G) share 80–90% nucleotide and amino acid identity, except for that of AnaJ with 65% aligned amino acid identity (Table4). The Life 2021, 11, 356 10 of 17

main difference between the two ana BGCs is the position of the thioesterase gene (anaA). anaA is observed upstream of the anaB-G cluster in Oscillatoria sp. PCC 6506, versus down- stream of the BGC in HT-58-2 (Figure 10). Additionally, both predicted protein functions and identified functional domains in the polyketide synthases from HT-58-2 are extremely similar to those from Oscillatoria sp. PCC 6506, except for that of AnaG. In particular, three module domains (KS-AT-ACP) were detected in AnaG of HT-58-2, while an additional SAM-dependent methyltransferase domain exists in Oscillatoria sp. PCC 6506. In Figure 11, a 599 amino acid gap can be observed in the alignment of the two respective AnaG proteins, which corresponds to the methyl transferase region on the basis of module searching. The lack of a methyltransferase in the ana BGC of HT-58-2 might preclude methyl extension of the acetyl group (forming the propionyl group) in the conversion of anatoxin-a and dihydroanatoxin-a to the respective homoanatoxin-a and dihydrohomoanatoxin-a, unless a methyltransferase encoded elsewhere in the genome is used. The BGC for anatoxins found in HT-58-2 is most closely related to that identified in stagnale PCC 7417 (Figure 10) on the basis of BLAST results (data not shown). C. stagnale PCC 7417 has been reported to produce dihydroanatoxin-a, rather than anatoxin-a/homoanatoxin-a [40]. In addition to the presence and alignment of AnaA-G and AnaJ, a MATE efflux transporter is encoded by both BGCs but at distinct positions: downstream from the BGC for anatoxins in HT-58-2, while upstream in C. stagnale PCC 7417. The MATE efflux transporter was reported to export cyanotoxin (encoded within Life 2021, 11, x FOR PEER REVIEW 11 of 18 the saxitoxin BGC) from growing cyanobacteria [41] and might play the same role in transporting anatoxins.

Figure 10. BGCs for anatoxins from OscillatoriaOscillatoria sp. PCC 6506, Cylindrospermum stagnale PCC 7417 and HT-58-2.HT-58-2. Genes are color-codedcolor-coded to represent similarities.similarities.

Table 4. Alignment between proteins of the putative HT-58-2 anatoxin-a BGC and those from Oscillatoria sp. PCC 6506. Table 4. Alignment between proteins of the putative HT-58-2 anatoxin-a BGC and those from Oscillatoria sp. PCC 6506. HT-58-2 Accession Aligned Protein E-Value % identity * HT-58-2 Accession Aligned Protein E-Value % Identity * ARV59080 AnaJ, cyclase 0.0 69.0% ARV59087ARV59080 AnaJ,AnaA, cyclase thioesterase 0.03e-157 82.8% 69.0% ARV59087 AnaA, thioesterase 3e-157 82.8% ARV59081ARV59081 AnaB,AnaB, acyl-CoA acyl-CoA dehydrogenase dehydrogenase 0.00.0 87.9% 87.9% ARV59082ARV59082 AnaC,AnaC, proline proline adenylation adenylation 0.00.0 87.7% 87.7% ARV59083ARV59083 AnaD,AnaD, acyl acyl carrier carrier protein protein 2e-512e-51 87.4% 87.4% ARV59084ARV59084 AnaE,AnaE, polyketide polyketide synthase synthase 0.00.0 86.7% 86.7% ARV59085 AnaF, polyketide synthase 0.0 82.2% ARV59085ARV59086 AnaG,AnaF, polyketide polyketide synthase synthase 0.00.0 82.2% 83.9% ARV59086 AnaG, polyketide synthase 0.0 83.9% * The percent identity included >90% of aligned protein length from the two anatoxin-a BGCs. * The percent identity included >90% of aligned protein length from the two anatoxin-a BGCs.

LifeLife2021 2021, 11, 11, 356, x FOR PEER REVIEW 1112 of of 17 18

FigureFigure 11. 11.Alignment Alignment between between AnaG AnaG from from HT-58-2 HT-58-2 and and from fromOscillatoria Oscillatoriasp. sp. PCC PCC 6506 6506 via via ClustalW ClustalW Omega. Omega. The The image image waswas generated generated by by ESPript ESPript 3. 3. Red Red highlight highlight indicates indicates residues residues of of identity identity or or high high similarity, similarity, non-highlighted non-highlighted fonts fonts show show non-matchingnon-matching amino amino acids, acids, and and dots dots indicate indicate gaps gaps in in the the sequence sequence alignment. alignment.

3.2.3. Hexose-ShinorineThe BGC for anatoxins BGC found in HT-58-2 is most closely related to that identified in CylindrospermumCyanobacteria stagnale are known PCC to 7417 produce (Figure pigments 10) on that the afford basis protection of BLAST against results ultraviolet (data not light.shown). Mycosporine-like C. stagnale PCC amino 7417 has acids been (MAAs) report anded to scytonemins produce dihydroanatoxin are two types of-a, ultravioletrather than (UV)-absorbinganatoxin-a/homoanatoxin compounds- fora [40]. protection In addition against to UV-B the andpresence UV-A and radiation, alignment respectively of AnaA [42-].G Shinorineand AnaJ, is a one MATE of the efflux ~30 MAAs transporter earlier is identified encoded inby 28 both cyanobacterial BGCs but at strains distinct [43 ]positions: and red algae,downstreamPorphyra from umbilicalis the BGC[44 ].for The anatoxins structure in of HT shinorine-58-2, while contains upstream a cyclohexenimine in C. stagnalecore PCC bearing7417. The glycine MATE and efflux serine transporter substituents. was Shinorine reported has to a export strong cyanotoxin UV-absorbing (encoded potential within due tothe a largesaxitoxin molar BGC) absorption from growing coefficient cyanobacteria(ε = 28,100–50,000 [41] and M −might1 cm− play1) in the the same UV-A role range in trans- [45], whichporting generally anatoxins. is more penetrating than UV-B [46]. A putative BGC for the biosynthesis of shinorine was identified from HT-58-2 at the region3.2.3. Hexose of 4,124,014–4,131,995-Shinorine BGC bp, which can be compared to the reported MAA producer AnabaenaCyanobacteria variabilis ATCC are known 29,413 [to47 produce] (Figure pigments12). The shinorine that afford BGC protection contains against four coding ultra- sequencesviolet light. (CDSs), Mycosporine with the-like arrangement amino acids of genes (MAAs) and and the predictedscytonemins functions are two of types each CDSof ul- showntraviolet in (UV) Figure-absorbing 12 and Table compounds5. Comparison for protection can be against made with UV- anB and NRPS UV- inA radiaA. variabilistion, re- ATCCspectively 29,413 [42]. (Ava_3855) Shinorine and is one in HT-58-2 of the ~30 (ARV60046). MAAs earlier The identified latter contains in 28 ancyanobacterial additional condensationstrains [43] and domain, red algae which, Porphyra may affect umbilicalis the final [44]. product The derivedstructure from of shinorine the shinorine contains BGC. a Suchcyclohexenimine a condensation core module bearing was glycine also observed and serine in NRPSssubstituents. from other Shinorine cyanobacterial has a strong strains, UV- e.g., Chlorogloeopsis fritschii PCC 6912 [48] and Chlorogloeopsis sp. PCC 9212 [49]. The two absorbing potential due to a large molar absorption coefficient (ε = 28,100–50,000 M−1 cm– Chlorogloeopsis strains were identified by the presence of a shinorine BGC similar to that 1) in the UV-A range [45], which generally is more penetrating than UV-B [46]. of HT-58-2. A putative BGC for the biosynthesis of shinorine was identified from HT-58-2 at the region of 4,124,014–4,131,995 bp, which can be compared to the reported MAA producer Anabaena variabilis ATCC 29,413 [47] (Figure 12). The shinorine BGC contains four coding

LifeLife 2021 2021, 11,, 11x ,FOR x FOR PEER PEER REVIEW REVIEW 13 of13 18 of 18

sequences (CDSs), with the arrangement of genes and the predicted functions of each CDS sequences (CDSs), with the arrangement of genes and the predicted functions of each CDS shown in Figure 12 and Table 5. Comparison can be made with an NRPS in A. variabilis shown in Figure 12 and Table 5. Comparison can be made with an NRPS in A. variabilis ATCC 29,413 (Ava_3855) and in HT-58-2 (ARV60046). The latter contains an additional ATCC 29,413 (Ava_3855) and in HT-58-2 (ARV60046). The latter contains an additional condensation domain, which may affect the final product derived from the shinorine BGC. condensation domain, which may affect the final product derived from the shinorine BGC. Such a condensation module was also observed in NRPSs from other cyanobacterial Suchstrains, a condensation e.g., Chlorogloeopsis module fritschii was PCC also 6912 observed [48] and in Chlorogloeopsis NRPSs from othersp. PCC cyanobacterial 9212 [49]. strains, e.g., Chlorogloeopsis fritschii PCC 6912 [48] and Chlorogloeopsis sp. PCC 9212 [49]. Life 2021, 11, 356 The two Chlorogloeopsis strains were identified by the presence of a shinorine BGC12 similar of 17 Theto thattwo of Chlorogloeopsis HT-58-2. strains were identified by the presence of a shinorine BGC similar to that of HT-58-2.

Figure 12. Alignment between shinorine BGCs from A. variabilis ATCC 29,413 and HT-58-2. Genes are color-coded to FigurerepresentFigure 12. Alignment 12.similarities.Alignment between between shinorine shinorine BGCs BGCs from from A.A. variabilis ATCCATCC 29,413 29,413 and and HT-58-2. HT-58 Genes-2. Genes are color-codedare color-coded to to representrepresent similarities. similarities. Table 5. Alignment between proteins of the putative HT-58-2 shinorine BGC and those from A. variabilis ATCC 29413.

TableHT-58Table 5.-2 Alignment Accession 5. Alignment between between proteins proteins of ofthe the Alignedputative putative HTProtein HT-58-2-58-2shinorine shinorine BGC BGC and and those thoseE from-Value fromA. variabilis A. variabilisATCC% ATCCidentity 29413. 29413 * . HT-58-ARV60049HT-58-22 Accession Accession 3-dehydroquinate AlignedAligned Protein Protein synthase E-Value3eE--Value149 % Identity%78.7% identity * * ARV60048 SAM-dependent methyltransferase 1e-109 74.2% ARV60049ARV60049 3 3-dehydroquinate-dehydroquinate synthase synthase 3e-1493e-149 78.7%78.7% ARV60047ARV60048 SAM-dependentATP-grasp methyltransferase enzyme 1e-1091e-215 74.2%79.0% ARV60048 SAM-dependent methyltransferase 1e-109 74.2% ARV60046ARV60047 ATP-graspNRPS enzyme 1e-2150.0 79.0%72.9% ARV60047ARV60046 ATP-grasp NRPS enzyme 0.01e-215 72.9%79.0% * The percent identity included >66% of aligned protein length from the two shinorine BGCs. ARV60046 * The percent identity included >66%NRPS of aligned protein length from the two shinorine0.0 BGCs. 72.9% * The percent identity3.2.4. Heterocyst included >66% Glycolipid of aligned BGC protein length from the two shinorine BGCs. 3.2.4. Heterocyst Glycolipid BGC Heterocyst glycolipids (HGs) form a protective layer for oxygen-sensitive nitrogen- 3.2.4. HeterocystHeterocyst Glycolipid glycolipids (HGs)BGC form a protective layer for oxygen-sensitive nitrogenase ase enzymes [50] in the envelope of heterocystous nitrogen-fixing cyanobacteria. The HT- enzymes [50] in the envelope of heterocystous nitrogen-fixing cyanobacteria. The HT- 58-2Heterocyst cyanobacterium glycolipids is filamentous (HGs) form and isa protectivecapable of layernitrogen for fixationoxygen -[9]sensitive; therefore nitrogen-, the 58-2 cyanobacterium is filamentous and is capable of nitrogen fixation [9]; therefore, the ase enzymes [50] in the envelope of heterocystous nitrogen-fixing cyanobacteria. The HT- presencepresence of of a a BGC BGC for for HG HG biosynthesis inin thethe genomegenome of of HT-58-2 HT-58- seemed2 seemed likely. likely. Indeed, Indeed, 58an-an2 HGcyanobacterium HG BGC BGC occurs occurs in inis HTHT-58-2filamentous-58-2 atat positionposition and is 974,174–991,938 974,174capable– 991,938of nitrogen bp. bp. Six Six fixation genes genes therein [9]therein; therefore encode encode, the presenceproteinsproteins ofaligned aligned a BGC with with for highHG biosynthesis similarity (>63%(>63% in the identity)identity) genome to to those ofthose HT of of- a58 reporteda- 2reported seemed BGC BGC likely. for for HGs Indeed,HGs anfrom fromHG NostocBGCNostoc occurssp.sp. ‘ ‘PeltigeraPeltigera in HT -membranacea 58-2 at position cyanobiont’cyanobiont 974,174–’ [991,938 51[51].]. The The bp. alignment alig Sixnment genes of of the therein the two two HG encode HG proteinsBGCsBGCs and andaligned the the predicted predicted with high product product similarity functions (>63% areare identity) shownshown in toin Figure thoseFigure 13of 13 anda andreported Table Table6. ABGC6. short-A shortfor HGs- fromchainchain Nostoc dehydrogenase/reductase dehydrogenase/reductase sp. ‘Peltigera membranacea (SDR) (SDR) is is cyanobiontabsent absent in in the the’ [51]. HG TheBGC alig fromnment HT-58-2,HT- 58of- 2,the whilewhile two aHG BGCscarboxypeptidasea carboxypeptidase and the predicted regulatory regulatory-like product-like domainfunctions domain protein proteinare shown is present is present in Figure between between 13 Hg and HglGlG Table and and HglE. 6. HglE. A Ad-short- chainjacentAdjacent dehydrogenase/reductase to the to assumed the assumed HG HGBGC BGC are (SDR) twelve are twelve is photosynthesisabsent photosynthesis in the HGand and phycobilisomeBGC phycobilisome-related from HT--related58-2, while pro- a proteins, for which the relationship to the formation of HGs remains unknown. Overall, carboxypeptidaseteins, for which the regulatory relationship-like to domain the formation protein of is HGspresent remains between unknown. HglG andOverall, HglE. the Ad- putativethe putative HG BGC HG BGC in HT in- HT-58-258-2 aligned aligned with with 85% 85% similarity similarity to tothe the protein proteinss of of the the NostocNostoc sp., jacentsp., to although the assumed the effect HG of theBGC absent are twelve SDR on photosynthesis the biosynthesis ofand HGs phycobilisome requires further-related study. pro- teins,although for which the effect the ofrelationship the absent SDRto the on formation the biosynthesis of HGs of remains HGs requires unknown. further Overall, study. the putative HG BGC in HT-58-2 aligned with 85% similarity to the proteins of the Nostoc sp., although the effect of the absent SDR on the biosynthesis of HGs requires further study.

FigureFigure 13. 13. HeterocystHeterocyst glycolipids glycolipids BGCs BGCs from from NostocNostoc sp.sp. ‘Peltigera ‘Peltigera membranacea membranacea cyanobiont cyanobiont’’ and and HT HT-58-2.-58-2. Genes Genes are arecolor- codedcolor-coded to represent to represent similarities. similarities.

Figure 13. Heterocyst glycolipids BGCs from Nostoc sp. ‘Peltigera membranacea cyanobiont’ and HT-58-2. Genes are color- coded to represent similarities.

Life 2021, 11, x FOR PEER REVIEW 14 of 18

Table 6. Alignment between proteins of the putative HT-58-2 heterocyst glycolipids BGC and those from Nostoc sp.

HT-58-2 Accession Aligned Protein E-Value %identity * ARV57925 HetI, 4’-phosphopantetheinyl transferase 1e-125 71.2% Life 2021, 11, 356 13 of 17 ARV57926 HglB, thioester reductase 0.0 73.0% PfaD, polyunsaturated fatty acid/polyketide biosyn- ARV62843 0.0 79.5% thesis protein TableARV57927 6. Alignment between proteinsHglC, of the beta putative-ketoacyl HT-58-2 synthase heterocyst glycolipids BGC0.0 and those from Nostoc63.6%sp. HT-58-2ARV57928 Accession HglG, Alignedpolyketide Protein synthase 0.0 E-Value 65.3% %Identity * ARV57929 carboxypeptidase regulatory protein N/A N/A ARV57925 HetI, 40-phosphopantetheinyl transferase 1e-125 71.2% ARV57926ARV57930 HglE, HglB, polyketide thioester reductase synthase 0.0 0.066.8% 73.0% ARV62843* The PfaD,percent polyunsaturated identity included fatty >97% acid/polyketide of aligned protein biosynthesis length from protein the two HG 0.0 BGCs. 79.5% ARV57927 HglC, beta-ketoacyl synthase 0.0 63.6% ARV579283.3. Putative BGC HglG, for polyketide Nitrogen synthaseFixation 0.0 65.3% ARV57929 carboxypeptidase regulatory protein N/A N/A ARV57930Genes for HglE, nitrogen polyketide fixation synthase enable cyanobacteria to grow 0.0in the absence 66.8%of dissolved nitrogenous compounds. The presence of a minimum set of six genes, termed nifHDK and * The percent identity included >97% of aligned protein length from the two HG BGCs. nifENB for catalysis and biosynthesis proteins, respectively, has been reported to be es- sential for nitrogen fixation [52]. In HT-58-2, genes for nitrogen fixation were found to be 3.3. Putative BGC for Nitrogen Fixation concentrated in two regions: 4,862,885–4,900,602 bp (nif cluster 1) and 4,972,977–4,986,558 bp (nifGenes cluster for 2) nitrogen (Figure fixation 14). The enable former cyanobacteria cluster contains to grow five in of the the absence six genes of dissolved(missing nifBnitrogenous), whereas compounds. the latter cluster The presence contains of three a minimum genes (missing set of six nifENK genes,); termed regardless,nifHDK alland six requirednifENB for genes catalysis are represented and biosynthesis among proteins, the two respectively, clusters. The has expression been reported of the to genes be essential in the for nitrogen fixation [52]. In HT-58-2, genes for nitrogen fixation were found to be con- two clusters may differ under distinct growth conditions [53]. centrated in two regions: 4,862,885–4,900,602 bp (nif cluster 1) and 4,972,977–4,986,558 bp In addition to the nif and vnf genes encoding nitrogenases (the latter a vanadium- (nif cluster 2) (Figure 14). The former cluster contains five of the six genes (missing nifB), dependent nitrogenase), modABC genes are present in cluster 1; such genes encode pro- whereas the latter cluster contains three genes (missing nifENK); regardless, all six required teins involved in the high affinity molybdate/tungsten uptake system. The presence of genes are represented among the two clusters. The expression of the genes in the two Mo-related proteins in the nif gene cluster 1 supports the inferred role of Mo transport in clusters may differ under distinct growth conditions [53]. regulating nitrogen fixation by HT-58-2.

FigureFigure 14. GeneGene clusters clusters related related to to nitrogen fixation fixation in HT HT-58-2.-58-2. Nitrogen Nitrogen fixation fixation genes genes are are labeled in magenta, while additional genes related to ion transport or energy production are shown in maroon. additional genes related to ion transport or energy production are shown in maroon.

4. DiscussionIn addition to the nif and vnf genes encoding nitrogenases (the latter a vanadium- dependentCyanobacteria nitrogenase), are essentialmodABC constituentsgenes are present of the in earth’s cluster biota 1; such and genes occupy encode diverse proteins ter- restrialinvolved and in aquatic the high ecosystems affinity molybdate/tungsten [54,55]. Cyanobacteria uptake provide system. a tremendous The presence treasury of Mo- of naturalrelated products proteins with in the richnif applicationsgene cluster to 1 serve supports human the needs inferred in the role pharmaceutical, of Mo transport food, in andregulating energy nitrogenindustries fixation [56]. The by HT-58-2.culture HT-58-2, which contains a filamentous cyanobac- terium and several other bacteria (mostly photoheterotrophic bacteria), is known to pro- duce4. Discussion two distinct classes of natural products: tolyporphins, new members of the tetrapyrroleCyanobacteria macrocycle are essentialfamily; and constituents tolypodiols, of which the earth’s are diterpenoids. biota and occupy Both diversesets of nat- ter- uralrestrial products and aquatic have ecosystems been subjected [54,55 to]. initial Cyanobacteria evaluation provide for biological a tremendous activities. treasury Toly- of porphinnatural productsA was found with richto inhibit applications efflux pump to serve activity human and needs thereby in the reverse pharmaceutical, multidrug food, re- sistanceand energy (MDR) industries in tumor [56 cells]. The [1,3,4] culture. Tolypodiols HT-58-2, whichand analogues contains were a filamentous evaluated cyanobac- for anti- terium and several other bacteria (mostly photoheterotrophic bacteria), is known to produce two distinct classes of natural products: tolyporphins, new members of the tetrapyrrole macrocycle family; and tolypodiols, which are diterpenoids. Both sets of natural products have been subjected to initial evaluation for biological activities. Tolyporphin A was found to inhibit efflux pump activity and thereby reverse multidrug resistance (MDR) in tumor cells [1,3,4]. Tolypodiols and analogues were evaluated for anti-inflammatory properties Life 2021, 11, 356 14 of 17

germane to neurological disorders [13,14]. In contrast with the intriguing reports concern- ing bioactive properties, the biosynthetic pathways to tolyporphins and tolypodiols have been little explored, and essentially nothing is known concerning the functional roles of these natural products in the cyanobacterium and associated community bacteria. Knowledge of the full genome of the Nostocales HT-58-2 enables the identification of genes or BGCs that might contribute to the biosynthesis of tolyporphins and tolypodiols. Studies of the tolyporphins BGC (BGC-T) will be reported elsewhere. Here, a BGC was examined that contains almost all genes in the MEP pathway related to the backbone biosyn- thesis of terpenoids. Genes co-localizing with those in the MEP pathway are ubi genes, which are involved in the terpenoid-quinone pathway. Additionally, the significant product of the MEP pathway, geranylgeranyl diphosphate, could react with 4-hydroxybenzoate (a product of the UbiC-catalyzed reaction derived from the common precursor chorismic acid) to afford many important aromatic compounds such as ubiquinone and vitamin K [57]. Such a BGC for tolypodiols requires further analysis to establish the complete biosynthetic steps derived from the MEP pathway. The absence of genes (ispD and ispF) corresponding to MCT and MDS proteins in the tolypodiols BGC is (1) not believed to result from an incompleteness of sequencing reads when assembling and closing the circular genome of Nostocales HT-58-2 [9]; and (2) is not concerning with regard to biosynthesis, because such genes are present elsewhere in the genome. The HT-58-2 culture grows under a medium deprived of aqueous-soluble nitrate (BG- 11o), consistent with a presumed nitrogen fixation capacity of heterocyst-forming cyanobac- teria [11]. The production of tolyporphins is profoundly increased in the absence versus presence of soluble nitrate [11]. A minimum set of six conserved nif genes (nifHDKENB) is required for nitrogen fixation [52]. Genes for two different nitrogenases (nif and vnf) were found in two clustered regions. Nitrogen fixation in HT-58-2 is probably regulated by metal ion concentrations, because molybdate-dependent transporters (mod) and nitrogenases (nif, vnf) are co-localized. Further studies may probe co-relationships of gene expression for nitrogen fixation and tolyporphins production in the face of environmental stimuli. The studies reported here suggest that HT-58-2 may be capable of producing diverse natural products beyond tolyporphins and tolypodiols. Four BGCs are aligned with relatively high similarity (above 50% similarity) with those reported for the biosynthesis of hapalosin, anatoxin-a/homoanatoxin-a, hexose-shinorine, and heterocyst glycolipids. The alignments are not complete, because a gap region occurs in the putative hapalosin BGC and at least one gene is absent in each of the BGCs. Alignment and conserved domain analysis of the protein AnaG did not reveal a means for the methylation of anatoxin-a and dihydroanatoxin-a to form the corresponding homoanatoxin-a and dihydrohomoanatoxin- a. However, the effects on product structures of absent domains, or differences in domains of individual proteins, remain unknown. Traditional methods of discovering bioactive products from microorganisms are often limited by available cultivation conditions or purification methodologies [58]. Genome mining and bioinformatics provide complementary strategies for exploring biosynthetic pathways of metabolites. Although none of the four natural products (hapalosin, anatoxin- a/homoanatoxin-a, hexose-shinorine, and heterocyst glycolipids) has yet been isolated from the HT-58-2 culture, the identification of the putative BGCs highlights potential op- portunities. The potential presence of anatoxins and analogues in particular should prompt caution in handling the HT-58-2 cultures and extracts thereof, given their known physiolog- ical effects. Moreover, biological assays with crude extracts must be cautiously interpreted given the possible presence of numerous bioactive natural products. Identification of putative BGCs enables further work to address issues concerning possible evolutionary origin. Taken together, the putative BGCs described in this study establish a framework for investigation of the biosynthesis of tolypodiols and other natural products, as well as nitrogen fixation and regulation in the HT-58-2 cyanobacterial–bacterial consortium. Life 2021, 11, 356 15 of 17

Author Contributions: Conceptualization, X.J., E.S.M. and J.S.L.; methodology, X.J.; formal analysis, X.J.; writing—original draft preparation, X.J. and J.S.L.; writing—review and editing, E.S.M. and J.S.L.; funding acquisition, J.S.L. All authors have read and agreed to the published version of the manuscript. Funding: This research grew out of research in the Photosynthetic Antenna Research Center, an Energy Frontier Research Center funded by the U.S. Department of Energy, Office of Science, Of- fice of Basic Energy Sciences, under award no. DESC0001035, and also was supported by NC State University. Institutional Review Board Statement: Not applicable. Informed Consent Statement: Not applicable. Data Availability Statement: Data are available from the corresponding author on reasonable request. Conflicts of Interest: The authors declare no conflict of interest.

References 1. Prinsep, M.R.; Caplan, F.R.; Moore, R.E.; Patterson, G.M.L.; Smith, C.D. Tolyporphin, a Novel Multidrug Resistance Reversing Agent from the Blue-green Alga Tolypothrix nodosa. J. Am. Chem. Soc. 1992, 114, 385–387. [CrossRef] 2. Patterson, G.M.L.; Baldwin, C.L.; Bolis, C.M.; Caplan, F.R.; Karuso, H.; Larsen, L.K.; Levine, I.A.; Moore, R.E.; Nelson, C.S.; Tschappat, K.D.; et al. Antineoplastic Activity of Cultured Blue-Green Algae (Cyanophyta). J. Phycol. 1991, 27, 530–536. [CrossRef] 3. Smith, C.D.; Prinsep, M.R.; Caplan, F.R.; Moore, R.E.; Patterson, G.M.L. Reversal of Multiple Drug Resistance by Tolyporphin, a Novel Cyanobacterial Natural Product. Oncol. Res. 1994, 6, 211–218. 4. Morlière, P.; Mazière, J.-C.; Santus, R.; Smith, C.D.; Prinsep, M.R.; Stobbe, C.C.; Fenning, M.C.; Golberg, J.L.; Chapman, J.D. Tolyporphin: A Natural Product from Cyanobacteria with Potent Photosensitizing Activity against Tumor Cells in Vitro and in Vivo. Cancer Res. 1998, 58, 3571–3578. [PubMed] 5. Prinsep, M.R.; Appleton, T.G.; Hanson, G.R.; Lane, I.; Smith, C.D.; Puddick, J.; Fairlie, D.P. Tolyporphin Macrocycles from the Cyanobacterium Tolypothrix nodosa Selectively Bind Copper and Silver and Reverse Multidrug Resistance. Inorg. Chem. 2017, 56, 5577–5585. [CrossRef][PubMed] 6. Prinsep, M.R.; Patterson, G.M.L.; Larsen, L.K.; Smith, C.D. Further Tolyporphins from the Blue-Green Alga Tolypothrix nodosa. Tetrahedron 1995, 51, 10523–10530. [CrossRef] 7. Prinsep, M.R.; Patterson, G.M.L.; Larsen, L.K.; Smith, C.D. Tolyporphins J and K, Two Further Porphinoid Metabolites from the Cyanobacterium Tolypothrix nodosa. J. Nat. Prod. 1998, 61, 1133–1136. [CrossRef] 8. Gurr, J.R.; Dai, J.; Philbin, C.S.; Sartain, H.T.; O’Donnell, T.J.; Yoshida, W.Y.; Rheingold, A.L.; Williams, P.G. Tolyporphins L–R: Unusual Tetrapyrroles from a Brasilonema sp. of Cyanobacterium. J. Org. Chem. 2020, 85, 318–326. [CrossRef] 9. Hughes, R.-A.; Zhang, Y.; Zhang, R.; Williams, P.G.; Lindsey, J.S.; Miller, E.S. Genome Sequence and Composition of a Tolyporphin- Producing Cyanobacterium-Microbial Community. Appl. Environ. Microbiol. 2017, 83, e01068-17. [CrossRef][PubMed] 10. Hughes, R.-A.; Jin, X.; Zhang, Y.; Zhang, R.; Tran, S.; Williams, P.G.; Lindsey, J.S.; Miller, E.S. Genome Sequence, Metabolic Properties and Cyanobacterial Attachment of Porphyrobacter sp. HT-58-2 Isolated from a Filamentous Cyanobacterium-Microbial Consortium. Microbiology 2018, 164, 1229–1239. [CrossRef] 11. Zhang, Y.; Zhang, R.; Hughes, R.-A.; Dai, J.; Gurr, J.R.; Williams, P.G.; Miller, E.S.; Lindsey, J.S. Quantitation of Tolyporphins, Diverse Tetrapyrrole Secondary Metabolites with Chlorophyll-Like Absorption, from a Filamentous Cyanobacterium–Microbial Community. Phytochem. Anal. 2018, 29, 205–216. [CrossRef][PubMed] 12. Barnhart-Dailey, M.; Zhang, Y.; Zhang, R.; Anthony, S.M.; Aaron, J.S.; Miller, E.S.; Lindsey, J.S.; Timlin, J.A. Cellular Localization of Tolyporphins, Unusual Tetrapyrroles, in a Microbial Photosynthetic Community Determined using Hyperspectral Confocal Fluorescence Microscopy. Photosynth. Res. 2019, 141, 259–271. [CrossRef][PubMed] 13. Prinsep, M.R.; Thomson, R.A. Tolypodiol, an Antiinflammatory Diterpenoid from the Cyanobacterium Tolypothrix nodosa. J. Nat. Prod. 1996, 59, 786–788. [CrossRef][PubMed] 14. Gurr, J.R.; O’Donnell, T.J.; Luo, Y.; Yoshida, W.Y.; Hall, M.L.; Mayer, A.M.S.; Sun, R.; Williams, P.G. 6-Deoxy- and 11- Hydroxytolypodiols: Meroterpenoids from the Cyanobacterium HT-58-2. J. Nat. Prod. 2020, 83, 1691–1695. [CrossRef][PubMed] 15. Tidgewell, K.; Clark, B.R.; Gerwick, W.H. The Natural Products Chemistry of Cyanobacteria. In Comprehensive Natural Products II Chemistry and Biology; Mander, L., Lui, H.-W., Eds.; Elsevier: Oxford, UK, 2010; Volume 2, pp. 141–188. ISBN 978-008-045-382-8. 16. Leão, P.N.; Engene, N.; Antunes, A.; Gerwick, W.H.; Vasconcelos, V. The Chemical Ecology of Cyanobacteria. Nat. Prod. Rep. 2012, 29, 372–391. [CrossRef][PubMed] 17. Weber, T.; Blin, K.; Duddela, S.; Krug, D.; Kim, H.U.; Bruccoleri, R.; Lee, S.Y.; Fischbach, M.A.; Müller, R.; Wohlleben, W.; et al. AntiSMASH 3.0–A Comprehensive Resource for the Genome Mining of Biosynthetic Gene Clusters. Nucleic Acids Res. 2015, 43, W237–W243. [CrossRef] 18. Skinnider, M.A.; Merwin, N.J.; Johnston, C.W.; Magarvey, N.A. PRISM 3: Expanded Prediction of Natural Product Chemical Structures from Microbial Genomes. Nucleic Acids Res. 2017, 45, W49–W54. [CrossRef][PubMed] Life 2021, 11, 356 16 of 17

19. Mungan, M.D.; Alanjary, M.; Blin, K.; Weber, T.; Medema, M.H.; Ziemert, N. ARTS 2.0: Feature Updates and Expansion of the Antibiotic Resistant Target Seeker for Comparative Genome Mining. Nucleic Acids Res. 2020, 48, W546–W552. [CrossRef] 20. States, D.J.; Gish, W. Combined Use of Sequence Similarity and Codon Bias for Coding Region Identification. J. Comput. Biol. 1994, 1, 39–50. [CrossRef] 21. Madeira, F.; Park, Y.M.; Lee, J.; Buso, N.; Gur, T.; Madhusoodanan, N.; Basutkar, P.; Tivey, A.R.N.; Potter, S.C.; Finn, R.D.; et al. The EMBL-EBI Search and Sequence Analysis Tools APIs in 2019. Nucleic Acids Res. 2019, 47, W636–W641. [CrossRef] 22. Robert, X.; Gouet, P. Deciphering Key Features in Protein Structures with the New ENDscript Server. Nucleic Acids Res. 2014, 42, W320–W324. [CrossRef] 23. Rohmer, M. The Discovery of a Mevalonate-Independent Pathway for Isoprenoid Biosynthesis in Bacteria, Algae and Higher Plants. Nat. Prod. Rep. 1999, 16, 565–574. [CrossRef] 24. Rohmer, M. Mevalonate-Independent Methylerythritol Phosphate Pathway for Isoprenoid Biosynthesis. Elucidation and Distribution. Pure Appl. Chem. 2003, 75, 375–387. [CrossRef] 25. Young, I.G.; Stroobant, P.; MacDonald, C.G.; Gibson, F. Pathway for Ubiquinone Biosynthesis in Escherichia coli K-12: Gene-Enzyme Relationships and Intermediates. J. Bacteriol. 1973, 114, 42–52. [CrossRef][PubMed] 26. Siebert, M.; Severin, K.; Heide, L. Formation of 4-Hydroxybenzoate in Escherichia coli: Characterization of the ubiC Gene and its Encoded Enzyme Chorismate Pyruvate-lyase. Microbiology 1994, 140, 897–904. [CrossRef] 27. Melzer, M.; Heide, L. Characterization of Polyprenyldiphosphate: 4-Hydroxybenzoate Polyprenyltransferase from Escherichia coli. Biochim. Biophys. Acta 1994, 1212, 93–102. [CrossRef] 28. Lee, P.T.; Hsu, A.Y.; Ha, H.T.; Clarke, C.F. A C-Methyltransferase Involved in Both Ubiquinone and Menaquinone Biosynthesis: Isolation and Identification of the Escherichia coli ubiE Gene. J. Bacteriol. 1997, 179, 1748–1754. [CrossRef][PubMed] 29. Stratmann, K.; Burgoyne, D.L.; Moore, R.E.; Patterson, G.M.L.; Smith, C.D. Hapalosin, a Cyanobacterial Cyclic Depsipeptide with Multidrug-Resistance Reversing Activity. J. Org. Chem. 1994, 59, 7219–7226. [CrossRef] 30. D’Agostino, P.M.; Gulder, T.A.M. Direct Pathway Cloning Combined with Sequence- and Ligation-Independent Cloning for Fast Biosynthetic Gene Cluster Refactoring and Heterologous Expression. ACS Synth. Biol. 2018, 7, 1702–1708. [CrossRef] 31. Micallef, M.L.; D’Agostino, P.M.; Sharma, D.; Viswanathan, R.; Moffitt, M.C. Genome Mining for Natural Product Biosynthetic Gene Clusters in the Subsection V Cyanobacteria. BMC Genom. 2015, 16, 669. [CrossRef][PubMed] 32. Codd, G.; Bell, S.; Kaya, K.; Ward, C.; Beattie, K.; Metcalf, J. Cyanobacterial Toxins, Exposure Routes and Human Health. Eur. J. Phycol. 1999, 34, 405–415. [CrossRef] 33. Rastogi, R.P.; Madamwar, D.; Incharoensakdi, A. Bloom Dynamics of Cyanobacteria and Their Toxins: Environmental Health Impacts and Mitigation Strategies. Front Microbiol. 2015, 6, 1254. [CrossRef][PubMed] 34. Devlin, J.P.; Edwards, O.E.; Gorham, P.R.; Hunter, N.R.; Pike, R.K.; Stavric, B. Anatoxin-a, A Toxic Alkaloid from Anabaena flos-aquae NRC-44h. Can. J. Chem. 1977, 55, 1367–1371. [CrossRef] 35. Hemscheidt, T.; Rapala, J.; Sivonen, K.; Skulberg, O.M. Biosynthesis of Anatoxin-a in Anabaena flos-aquae and Homoanatoxin-a in Oscillatoria formosa. J. Chem. Soc. Chem. Commun. 1995, 13, 1361–1362. [CrossRef] 36. Carmichael, W.W.; Gorham, P.R.; Biggs, D.F. Two Laboratory Case Studies on the Oral Toxicity to Calves of the Freshwater Cyanophyte (Blue-green Alga) Anabaena flos-aquae NRC-44-1. Can. Vet. J. 1977, 18, 71–75. [PubMed] 37. Rogers, E.H.; Hunter, E.S., III; Moser, V.C.; Phillips, P.M.; Herkovits, J.; Muñoz, L.; Hall, L.L.; Chernoff, N. Potential Developmental Toxicity of Anatoxin-a, A Cyanobacterial Toxin. J. Appl. Toxicol. 2005, 25, 527–534. [CrossRef][PubMed] 38. Cadel-Six, S.; Iteman, I.; Peyraud-Thomas, C.; Mann, S.; Ploux, O.; Méjean, A. Identification of a Polyketide Synthase Coding Sequence Specific for Anatoxin-a-Producing Oscillatoria Cyanobacteria. Appl. Environ. Microbiol. 2009, 75, 4909–4912. [CrossRef] 39. Méjean, A.; Mann, S.; Maldiney, T.; Vassiliadis, G.; Lequin, O.; Ploux, O. Evidence that Biosynthesis of the Neurotoxic Alkaloids Anatoxin-a and Homoanatoxin-a in the Cyanobacterium Oscillatoria PCC 6506 Occurs on a Modular Polyketide Synthase Initiated by L-Proline. J. Am. Chem. Soc. 2009, 131, 7512–7513. [CrossRef] 40. Méjean, A.; Dalle, K.; Paci, G.; Bouchonnet, S.; Mann, S.; Pichon, V.; Ploux, O. Dihydroanatoxin-a Is Biosynthesized from Proline in Cylindrospermum stagnale PCC 7417: Isotopic Incorporation Experiments and Mass Spectrometry Analysis. J. Nat. Prod. 2016, 79, 1775–1782. [CrossRef] 41. Kellmann, R.; Mihali, T.K.; Jeon, Y.J.; Pickford, R.; Pomati, F.; Neilan, B.A. Biosynthetic Intermediate Analysis and Functional Homology Reveal a Saxitoxin Gene Cluster in Cyanobacteria. Appl. Environ. Microbiol. 2008, 74, 4044–4053. [CrossRef] 42. Rastogi, R.P.; Incharoensakdi, A. Characterization of UV-screening Compounds, Mycosporine-like Amino Acids, and Scytonemin in the Cyanobacterium Lyngbya sp. CU2555. FEMS Microbiol. Ecol. 2014, 87, 244–256. [CrossRef] 43. Sinha, R.P.; Singh, S.P.; Häder, D.-P. Database on Mycosporines and Mycosporine-like Amino Acids (MAAs) in Fungi, Cyanobac- teria, Macroalgae, Phytoplankton and Animals. J. Photochem. Photobiol. B Biol. 2007, 89, 29–35. [CrossRef] 44. Hartmann, A.; Murauer, A.; Ganzera, M. Quantitative Analysis of Mycosporine-like Amino Acids in Marine Algae by Capillary Electrophoresis with Diode-array Detection. J. Pharm. Biomed. Anal. 2017, 138, 153–157. [CrossRef] 45. Singh, S.P.; Sinha, R.P.; Klisch, M.; Häder, D.-P. Mycosporine-like Amino Acids (MAAs) Profile of a Rice-Field Cyanobacterium Anabaena doliolum as Influenced by PAR and UVR. Planta 2008, 229, 225–233. [CrossRef][PubMed] 46. Gao, Q.J.; Ferran, G.-P. An ATP-Grasp Ligase Involved in the Last Biosynthetic Step of the Iminomycosporine Shinorine in ATCC 29133. J. Bacteriol. 2011, 193, 5923–5928. [CrossRef] Life 2021, 11, 356 17 of 17

47. Balskus, E.P.; Walsh, C.T. The Genetic and Molecular Basis for Sunscreen Biosynthesis in Cyanobacteria. Science 2010, 329, 1653–1656. [CrossRef][PubMed] 48. Llewellyn, C.A.; Greig, C.; Silkina, A.; Kultschar, B.; Hitchings, M.D.; Farnham, G. Mycosporine-like Amino Acid and Aromatic Amino Acid Transcriptome Response to UV and Far-Red Light in the Cyanobacterium Chlorogloeopsis fritschii PCC 6912. Sci. Rep. 2020, 10, 20638. [CrossRef] 49. Ho, M.-Y.; Bryant, D.A. Global Transcriptional Profiling of the Cyanobacterium Chlorogloeopsis fritschii PCC 9212 in Far-Red Light: Insights into the Regulation of Chlorophyll d Synthesis. Front. Microbiol. 2019, 10, 465. [CrossRef][PubMed] 50. Wörmer, L.; Cirés, S.; Velázquez, D.; Quesada, A.; Hinrichs, K.-U. Cyanobacterial Heterocyst Glycolipids in Cultures and Environmental Samples: Diversity and Biomarker Potential. Limnol. Oceanogr. 2012, 57, 1775–1788. [CrossRef] 51. Kampa, A.; Gagunashvili, A.N.; Gulder, T.A.M.; Morinaka, B.I.; Daolio, C.; Godejohann, M.; Miao, V.P.W.; Piel, J.; Andrésson, Ó. Metagenomic Natural Product Discovery in Lichen Provides Evidence for a Family of Biosynthetic Pathways in Diverse Symbioses. Proc. Natl. Acad. Sci. USA 2013, 110, 3129–3137. [CrossRef] 52. Dos Santos, P.C.; Fang, Z.; Mason, S.W.; Setubal, J.C.; Dixon, R. Distribution of Nitrogen Fixation and Nitrogenase-like Sequences amongst Microbial Genomes. BMC Genom. 2012, 13, 162. [CrossRef][PubMed] 53. Thiel, T.; Pratte, B.S. Regulation of Three Nitrogenase Gene Clusters in the Cyanobacterium Anabaena variabilis ATCC 29413. Life 2014, 4, 944–967. [CrossRef][PubMed] 54. Aráoz, R.; Molgó, J.; Tandeau de Marsac, N. Neurotoxic Cyanobacterial Toxins. Toxicon 2010, 56, 813–828. [CrossRef][PubMed] 55. Gaysina, L.A.; Saraf, A.; Singh, P. Cyanobacteria in Diverse Habitats. In Cyanobacteria—From Basic Science to Applications; Mishra, A.K., Tiwari, D.N., Rai, A.N., Eds.; Academic Press: Cambridge, MA, USA, 2019; pp. 1–28. ISBN 978-012-814-667-5. 56. Rastogi, R.P.; Sinha, R.P. Biotechnological and Industrial Significance of Cyanobacterial Secondary Metabolites. Biotechnol. Adv. 2009, 27, 521–539. [CrossRef] 57. Gibson, F. The Elusive Branch-Point Compound of Aromatic Amino Acid Biosynthesis. Trends Biochem. Sci. 1999, 24, 36–38. [CrossRef] 58. Medema, M.H.; Fischbach, M.A. Computational Approaches to Natural Product Discovery. Nat. Chem. Biol. 2015, 11, 639–648. [CrossRef]