<<

Genome

Genome sequence of flavus A7, a marine- derived with antibacterial activity

Journal: Genome

Manuscript ID gen-2020-0066.R3

Manuscript Type: Note

Date Submitted by the 27-Nov-2020 Author:

Complete List of Authors: Gao, Yaru; China Pharmaceutical University College of Life Science and Technology, Du, Xinyang; China Pharmaceutical University College of Life Science and Technology Li, Huanhuan;Draft China Pharmaceutical University College of Life Science and Technology Wang, Ying; China Pharmaceutical University College of Life Science and Technology

Keyword: genome sequence, secondary metabolism, BGCs, Aspergillus flavus

Is the invited manuscript for consideration in a Special Not applicable (regular submission) Issue? :

© The Author(s) or their Institution(s) Page 1 of 46 Genome

1 Genome sequence of Aspergillus flavus A7, a marine-derived fungus with

2 antibacterial activity

3 Yaru Gao, Xinyang Du, Huanhuan Li, Ying Wang*

4 School of Life Science and Technology, China Pharmaceutical University, Nanjing 211198, PR

5 China.

6 ———————————————

7 *Corresponding author. Tel: +86-25-86185219. fax: +86-25-8321249. E-mail: [email protected]

Draft

© The Author(s) or their Institution(s) Genome Page 2 of 46

9 Abstract

10 Due to the specific properties of the marine environment, marine microorganisms

11 have exclusive physicochemical characteristics that are different from those of

12 terrestrial microorganisms, which can produce various secondary metabolites (SMs)

13 with considerable structural diversity and biological activity. In this study, three

14 strains of coepiphytic Aspergillus with potential antibacterial activities, A7

15 (Aspergillus flavus), B27 (Aspergillus flavipes) and R12 (Aspergillus sydowii), were

16 isolated from the South China Sea. Via the Illumina MiSeq sequencing platform, the

17 genomes of the three strains were sequenced, and genome comparison showed the 18 highest diversity of the biosyntheticDraft gene clusters (BGCs) in A7. Meanwhile, a 19 comparison of physiological and genomic characteristics between A7 and other

20 Aspergillus flavus strains demonstrated the superior environmental adaptability of A7,

21 which is apparently consistent with the genetic richness of BGCs. By assigning reads

22 to known BGCs, putative BGCs were allocated in A7 that corresponded to various

23 SMs, including naphthopyrone, pyranonigrin E, cyclopiazonic acids, etc. Based on

24 gene homology analysis, we surmise that a region is involved in the biosynthesis of

25 ustiloxin-like RiPPs, a less thoroughly studied SM in fungi. Our results provide

26 genetic information for the investigation of marine Aspergillus sp., which may help to

27 elucidate their chemical diversity and adaptive strategies.

28 Keywords: Aspergillus flavus; genome sequence; secondary metabolism; BGCs

© The Author(s) or their Institution(s) Page 3 of 46 Genome

30 Abbreviation List:

31 BGCs: biosynthetic gene clusters

32 SMs: secondary metabolites

33 MH: Mueller-Hinton

34 MEA: Malt Extract Agar

35 MeOH: Methanol

36 LPCB: lactophenol cotton blue

37 ITS: internal transcribed spacer

38 BenA: β-tubulin 39 CaM: calmodulin Draft 40 ML: maximum-likelihood

41 GO: Gene Ontology

42 KEGG: Kyoto Encyclopedia of Genes and Genomes

43 KOG: euKaryotic Orthologous Groups

44 NRPS: nonribosomal peptide synthase

45 PKS: polyketide synthase

46 RiPP: ribosomally synthesized and posttranslationally modified peptide

© The Author(s) or their Institution(s) Genome Page 4 of 46

48 Introduction

49 The prevailing extreme conditions of the marine environment have induced marine

50 microorganisms to develop exclusive adaptations in their genetic and physiological

51 characteristics, resulting in changes in metabolic patterns (Lindequist 2016), which

52 produce abundant secondary metabolites (SMs) (Rateb and Ebel 2011). Accumulating

53 evidence has highlighted the potential of marine-derived fungi and their biologically

54 active metabolites in drug discovery as sustainable sources of therapeutic agents;

55 special promise is exhibited by symbiotic microorganisms with marine invertebrates,

56 e.g., sponges, corals and ascidians (Lee et al. 2013; Skropeta and Wei 2014). A wide 57 range of chemically diverse compounds,Draft exhibiting antibacterial, antiviral and 58 antitumor properties, have been obtained from marine fungi. Among those fungal

59 strains, Aspergillus, a genus of well-defined asexual spore-forming fungi, is known as

60 a pathogen in view of its detrimental mycotoxins; however, Aspergillus can

61 prolifically produce valuable compounds, such as degraded starches, polysaccharides

62 and enzymes, which have been exploited on an industrial scale for the production of

63 biochemical and pharmaceutical agents (Lee et al. 2016). In previous work, a series of

64 fungi, including Aspergillus sp., were isolated from sponges and corals in the South

65 China Sea to screen potentially useful strains with antimicrobial activities. Among

66 these fungi, three Aspergillus sp., A7 (Aspergillus flavus), B27 (Aspergillus flavipes)

67 and R12 (Aspergillus sydowii), displayed broad bacteriostatic actions, underlining the

68 necessity of further research.

69

© The Author(s) or their Institution(s) Page 5 of 46 Genome

70 On the other hand, with the rapid development of sequencing technologies, the body

71 of knowledge concerning marine microorganisms and their natural products is in

72 transition. Notably, genomic sequencing is being utilized increasingly frequently as a

73 tool in microbial , as well as in phylogenetic analysis and functional genome

74 mining (Nielsen and Nielsen 2017). In addition, this method could provide genetic

75 insights into the biosynthetic pathways and structural analysis of SMs, which are may

76 be employed to facilitate combinatorial biosynthesis of bioactive natural products

77 (Kjærbølling et al. 2018). In recent years, the genomes of various Aspergillus flavus

78 strains have been extensively studied and compared with those of other strains (Yin et 79 al. 2018), but marine-derived AspergillusDraft flavus strains have rarely been reported. 80

81 In this study, we present the draft genomes of the abovementioned Aspergillus strains

82 (A7, B27, and R12). Following the sequencing of the genomes of these strains,

83 fragment assembly, functional annotation and comparative genomic analysis were

84 performed, with an emphasis being placed on the prediction of SM gene clusters

85 present in A7, which exhibited compelling effects against bacteria. Moreover, certain

86 biosynthesis pathways were verified by chemical separation and identification in A7,

87 which may help to elucidate the ecological and biochemical profiles of

88 marine-derived Aspergillus.

89

90 Materials and methods

91 Preliminary Screening for Antibacterial strains

© The Author(s) or their Institution(s) Genome Page 6 of 46

92 Strains A7 and R12 were isolated from corals (A7: 17° 6'12.4"N, 111° 29'57.8"E; B27:

93 16° 50'6.15"N, 112° 21'10.58"E) in the South China Sea. Strain B27 was isolated

94 from sponges in a nearby maritime location (17° 5'0.70"N, 111° 31'1.39"E). The

95 purified strains were activated and inoculated onto rice medium (rice 100 g, MgSO4

96 0.2 g, sea salt 1.5 g, H2O 100 mL) to culture at 28 °C for 15 days. The fermentation

97 products were extracted by methanol (MeOH) followed by decompressing distillation

98 to yield a crude extract. The crude extract was dissolved in 2 mL MeOH and

99 subsequently screened for antibacterial activity by the agar diffusion method.

100 101 Escherichia coli ATCC 25922, KlebsiellaDraft aerogenes ATCC 700603, Pseudomonas 102 aeruginosa ATCC 27853, methicillin-resistant Staphylococcus aureus (MRSA)

103 ATCC 43300), methicillin-resistant Staphylococcus epidermidis (MRSE) ATCC

104 35984, Micrococcus luteus ACCC11001, and Acinetobacter baumannii ATCC 19606

105 were utilized as control strains. All the above strains were stored in the Marine

106 Pharmaceutical Laboratory of China Pharmaceutical University.

107

108 In brief, single colonies of control strains were inoculated into Mueller-Hinton (MH,

109 Solarbio, China) broth and cultured at 37 °C and 200 rpm for 18 h. The bacterial

110 inoculum was spread onto MH agar (Solarbio, China). Wells measuring 6 mm in

111 diameter were punched onto the surface of the agar using a sterile hole puncher. Next,

112 30 µL crude extract was added to the wells and incubated for 24 h at 28 °C. Each

113 assessment was developed in triplicate. MeOH was used as a vehicle control, while a

© The Author(s) or their Institution(s) Page 7 of 46 Genome

114 standard antibiotics (0.1 mM chloramphenicol) were used as a positive control. The

115 diameters (in mm) of the inhibition zone were recorded to estimate antimicrobial

116 activities, which were expressed by the ratio of the inhibition zone relative to that of

117 the positive control. The bacteriostatic activities were considered strong if the ratio

118 was greater than 1.0, moderate when the scale was between 0.5 and 1, and weak if it

119 was less than 0.5.

120

121 Morphological analysis

122 The spore suspensions of three strains were inoculated and glowed on Malt Extract 123 Agar (MEA, OXOID, UK) containingDraft 3% sea salt at 28 ℃ for 5 days. The colony 124 morphology was observed and characterized regarding parameters including size,

125 texture, color, soluble pigments, and exudates. Next, microscopic examination was

126 performed on spores and hyphae followed by lactophenol cotton blue (LPCB) staining

127 (Leck 1999) under a BA210 light microscope (Motic, Xiamen, China).

128

129 Molecular identification and phylogenetic analysis

130 For molecular identification, fungal mycelium was scratched by a sterile blade and

131 used for DNA extraction. The nuclear ribosomal internal transcribed spacer region

132 (ITS), partial β-tubulin (BenA) and calmodulin (CaM) genes were amplified by

133 primers ITS1 and ITS4 (White et al. 1990), Bt2a and Bt2b (Glass and Donaldson

134 1995), CMD5 and CMD6 (Hong et al. 2005), respectively. The primers sequences are

135 shown in Table S1. PCR amplification was performed using the following program:

© The Author(s) or their Institution(s) Genome Page 8 of 46

136 95 °C for 10 min, 30 cycles of 95 °C for 30 s, 55 °C for 30 s, and 72 °C for 1 min

137 followed by a final extension at 72 °C for 10 min (Visagie et al. 2014). Next, the PCR

138 products were subcloned for sequencing to establish the Basic Local Alignment

139 Search Tool (BLAST) search engine. Homologous ITS, BenA and CaM sequences

140 from various were downloaded from the GenBank database (Table S2), and

141 multiple sequence alignment was conducted using ClustalW. Phylogenetic trees were

142 constructed by dataset of ITS, BenA and CaM sequences and concatenated sequences

143 using maximum-likelihood (ML) analysis in MEGA7 (Kumar et al. 2016).

144 145 Genome sequencing and assemblyDraft 146 Based on the bioactive screening, three strains (A7, B27, and R12) were believed to

147 be potentially antibacterial strains, for which genetic analysis is essential and effective.

148 Hence, the whole genomes of A7, B27 and R12 were sequenced by Genewiz Co.

149 (Suzhou, China) on the Illumina MiSeq sequencing platform.

150

151 In brief, the construction of DNA libraries was performed using 100 ng genomic

152 DNA, which was randomly fragmented to 500 bp by sonication (Covaris S220, USA).

153 Sequencing was subsequently performed using a 2×150-bp paired-end (PE)

154 configuration; image analysis and base calling were performed using HiSeq Control

155 Software. The adapter and low-quality sequences were removed from the raw

156 sequencing data by cutadapt (v1.9.1). The ideal reads were assembled and gap-filled

157 using Velvet (Zerbino and Birney 2008), SSPACE (Boetzer et al. 2011) and GapFiller

© The Author(s) or their Institution(s) Page 9 of 46 Genome

158 (Boetzer and Pirovano 2012), respectively. All genome sequencing data have been

159 submitted to the NCBI SRA database. The SRA accession numbers of A7, B27 and

160 R12 are PRJNA627810, PRJNA628551 and PRJNA628557, respectively.

161

162 Genome comparison of A7, B27 and R12

163 Cd-hit (Version 4.6) was used to cluster the cds sequences of A7, B27 and R12.

164 Clustering parameters were set as follows: the minimal sequence similarity was 70%,

165 and only hits with > 60% identity over 60% of the length of the query sequences were

166 considered. 167 Draft 168 The genomic sequences of three Aspergillus genomes were analyzed with the

169 antiSMASH (Blin et al. 2019) online server with the ClusterFinder algorithm to

170 identify the potential biosynthetic gene clusters (BGCs) based on homology analysis.

171

172 Intraspecies comparison of Aspergillus flavus

173 Comparison of tolerance of pH level and salinity

174 Aspergillus flavus ATCC 11492, the reference strain derived from terrestrial

175 ecosystems, was purchased from the Guangdong Microbial Culture Collection Center

176 (GDMCC, China).

177

178 Both strains, A7 and ATCC 11492, were cultured under the same photomixotrophic

179 conditions. Briefly, frozen stocks were inoculated into potato dextrose broth (PDB)

© The Author(s) or their Institution(s) Genome Page 10 of 46

180 and incubated at 28 °C with shaking for 3 days. Subsequently, 0.1 mL of the activated

181 fungal cultures were coated onto MEA solid medium plates for 3 days at 28 °C. Next,

182 the spores were harvested with sterilized water and adjusted to 1.0×107 spores/mL.

183 Finally, 2.5 µL dilutions of spores were plated on different MEA media. The pH

184 range (4, 5, 6, 7, 8, 9, and 10) for growth and NaCl tolerance (0, 1, 3, 5, 7, and 10%

185 NaCl, w/v) was examined on MEA at 28℃ for 5 days. Colony diameter was measured

186 daily for five days.

187

188 Genome comparison of different Aspergillus flavus 189 A total of 9 genomic sequences fromDraft terrestrial Aspergillus flavus were downloaded 190 from NCBI GenBank for comparative analyses (Table 1). Next, the downloaded

191 genomic sequences were uploaded to antiSMASH to predict the SM BGCs.

192

193 Gene prediction and functional annotation

194 The software Augustus (version 3.3) (Stanke et al. 2006) was used to predict coding

195 genes and high-GC regions. Through a homology-based approach, the gene structures

196 were mapped to the reference genome Aspergillus flavus NRRL 3357 (Payne et al.

197 2009). Next, the coding genes were annotated with the NCBI nr database by BLAST,

198 and the functions of genes were annotated by the Gene Ontology (GO)(Harris et al.

199 2004) and Kyoto Encyclopedia of Genes and Genomes (KEGG)(Kanehisa and Goto

200 2000) databases. In addition, the predicted proteins were classified by the euKaryotic

201 Orthologous Groups (KOG) database (Tatusov et al. 2000).

© The Author(s) or their Institution(s) Page 11 of 46 Genome

202

203 Results

204 Antibacterial activities screening

205 Four kinds of G- bacteria and 3 kinds of G+ bacteria were employed to determine the

206 spectrum of antibacterial activity of the isolated strains (Table 2). Specifically, the

207 potent effect was visible in A7, which exhibited extensive inhibitory effects against 4

208 kinds of G- bacteria (Escherichia coli, Pseudomonas aeruginosa, Klebsiella

209 aerogenes and Acinetobacter baumannii), as well as 2 kinds of G+ bacteria (MRSA

210 and MRSE). B27 also actively inhibited the G- strain (Acinetobacter baumannii) and 211 G+ bacteria (MRSA, MRSE and MicrococcusDraft luteus). However, a modest action was 212 observed for R12 with weak efficiency on G+ bacteria (Micrococcus luteus). The

213 antibacterial properties of these strains stressed the necessity of studying them further.

214

215 Morphological analysis

216 Generally, morphology analysis provides a preliminary approach for microorganism

217 taxonomy. Morphological features, such as colony appearance, pigments, hyphae, and

218 conidiophores, are shown in Fig. 1. Apparently, A7 grows on MEA media with a

219 powdery appearance that is caused by numerous condiophores, exhibiting

220 yellow-brown mycelial areas but not soluble pigments and exudate, while fertile

221 vesicles are visible under microscope, exhibiting globose to subglobose morphology

222 (Fig. 1A). B27 grows poorly on MEA media, its colony surface is flocculent, its

223 sporulation and mycelial areas are yellowish white to pale yellow, exhibiting no

© The Author(s) or their Institution(s) Genome Page 12 of 46

224 soluble pigment and no exudate, its spores are brownish orange without soluble

225 pigment, and its conidia are globose to subglobose (Fig. 1B). R12 grows poorly on

226 MEA media, its colony surface is velutinous when sporulating, it is mainly composed

227 of white sterile mycelia and becomes grayish green when spores are formed, soluble

228 pigment is mostly absent from this strain, exudate is mostly absent, the conidia head is

229 broom-shaped, similar to Penicillium, the phialides is ampulliform and the conidia are

230 globose to subglobose (Fig. 1C). The three strains conformed to the morphological

231 characteristics of Aspergillus in terms of colony texture, colors of mycelia, and

232 conidia; however, molecular species identification was warranted. 233 Draft 234 Molecular identification and phylogenetic analysis

235 Molecular markers have been widely accepted as crucial for the taxonomic

236 identification of fungi, among which ITS and some protein-coding genes, such as

237 ribosomal polymerase B2, BenA, and CaM, are popular barcode‐like molecules

238 (Schoch et al. 2012). In this study, the phylogenetic status of the bioactive strains (A7,

239 B27, and R12) was analyzed based on the ITS, CaM and BenA sequences. As shown

240 in the phylogenetic tree (Fig. 2), Aspergillus and Talaromyces are divided into

241 apparently separated genera. More specifically, A7, B27 and R12 cluster into the

242 Aspergillus clades supported by a 100% bootstrap value, which is consistent with the

243 highest identity scores. More specifically, A7 and Aspergillus flavus cluster into the

244 same branch as a sister taxon with B27, which appears in the same branch as

© The Author(s) or their Institution(s) Page 13 of 46 Genome

245 Aspergillus flavipes. R12 is clustered into a clade with Aspergillus sydowii, close to

246 Aspergillus creber, Aspergillus cvjetkovicii and Aspergillus puulaauensis.

247

248 Genome sequencing

249 In view of the potent bactericidal activity of A7, this strain is clearly the most distinct

250 one, which underscores the vital role played by SMs and the generation process.

251 Clearly, genomic analysis could demonstrate the inventory of all SM gene clusters

252 and the underlying mechanisms governing their production. Hence, the genome of A7

253 was sequenced with a coverage of 157.8 X. The draft genome was assembled into a 254 total size of 36.73 Mb, with a G+CDraft content of 48.39% composing 338 scaffolds. The 255 average length of consensus contigs was 108660 bp with an N50 of 634952 bp.

256 Augustus software was employed to predict the protein-coding genes, resulting in a

257 total of 11,711 protein-coding genes with an average length of 1575 bp. The resulting

258 genome assembly of B27 had a length of 32.33 Mb divided into 158 scaffolds. The

259 N50 scaffold length was 1,367,452 bp, the GC content was 52.42%, and the genome

260 coverage was 162.97X. The draft genome of R12 was observed to contain 1857

261 scaffolds with a total size of 35.39 Mb (50.59% G+C content). The N50 value and the

262 average scaffold size were 563,844 bp and 19,987.64 bp, respectively. Prediction of

263 protein-coding genes was performed using AUGUSTUS, and a total of 13,035 genes

264 were predicted. The general genomic characteristics of strains A7, B27 and R12 are

265 listed in Table 3.

266

© The Author(s) or their Institution(s) Genome Page 14 of 46

267 Genome comparison of A7, B27 and R12

268 The core and unique genes represent phylogenetic conservation and species

269 specificity, respectively, to a certain extent. Comparative genomics could provide

270 useful information, such as effective identification of orthologous or diverse genes,

271 which could facilitate the understanding of those bioactive strains. As shown in the

272 Venn diagram (Fig. 3A), there were 2369 genes in common among A7, B27 and R12,

273 which accounted for 20.48%, 21.26%, and 18.79% of the total coding genes of A7,

274 B27, and R12, respectively. In addition, pairwise comparisons identified 3547 coding

275 genes shared between A7 and B27, accounting for 30.66% of the total coding genes in 276 A7. When these strains were comparedDraft with R12, less commonality was discernible, 277 demonstrated by 2988 genes in the intersection and an overlap rate of 24.98%, which

278 coincided with the phylogenetic relationship.

279

280 The genes responsible for SM production are usually arranged in multigene BGCs in

281 fungi (Marcet-Houben et al. 2012). In this study, the potential for SM production by

282 the A7 strain was genetically profiled by searching for BGCs in the antiSMASH

283 database. As shown in Fig. 3B, 58 gene clusters were involved in SM biosynthesis in

284 A7, including 26 nonribosomal peptide synthase (NRPS) clusters, 13 polyketide

285 synthase (PKS) clusters, 5 terpene clusters, 4 indole clusters, 2 NRPS-PKS hybrid

286 clusters, 1 fungal-ribosomally synthesized and posttranslationally modified peptide

287 (RiPP) cluster, and 7 other clusters. Interestingly, the SM gene clusters of A7,

© The Author(s) or their Institution(s) Page 15 of 46 Genome

288 especially hybrid gene clusters, are more diverse than those of B27 and R12,

289 indicating the diversity of SMs as well as the abundance of chemical skeletons.

290

291 Intraspecies comparison of Aspergillus flavus

292 Comparison of tolerance of pH and salinity

293 As the reference strain, ATCC 11492 was used to estimate the growth adaptability to

294 environmental stress, such as salinity and pH (Fig. 4). As shown in Fig. 4A, both

295 strains had variable growth in the MEA medium at all salinity levels (1-10%),

296 although with apparent growth restriction induced by increasing salinity. However, 297 A7 exhibited a robust pattern in Draft contrast to its terrestrial counterpart at the same 298 salinity, indicating an extensive tolerance to salt stresses. In terms of ambient pH (Fig.

299 4B), both strains were more tolerant to alkalinity than acidic media. Even so, an

300 adaptive advantage was more obvious in A7, at least within the detection range of this

301 work (pH 4-10). From this plot, A7 exhibited a tendency to survive in harsh

302 environments, which might be attributed to a higher genetic richness and metabolic

303 robustness.

304

305 Genome comparison of different Aspergillus flavus

306 Based on the above observations, we attempted to obtain further structural and

307 functional clues from the genomic information. Therefore, a genome-wide

308 comparison was performed among A7 and all available Aspergillus flavus genomes (9

309 available in GenBank at the time of this study). Consistent with its environmental

© The Author(s) or their Institution(s) Genome Page 16 of 46

310 adaptive plasticity, the genome of A7 exhibits substantial richness and relative

311 abundance, characterized by the most significant diversity in BGCs, which may be an

312 exploitable genetic resource, possibly to produce a variety of SMs (Fig. 5).

313

314 Functional annotation

315 In terms of the structural-functional correlation, the gene functional annotations may

316 be inferred from primary structure homology. The genome of A7 yielded similar

317 annotation results in the GO, KEGG and KOG analyses: a large number of genes are

318 involved in metabolic pathways. Using KOG functional classification, we assigned 319 7467 proteins based on sequenceDraft similarity, accounting for 63.76% of the total 320 protein-coding genes (Fig. 6). For functional classification, the proteins were

321 categorized into 4 main KOG groups: intracellular processes (22.37%), metabolism

322 (39.91%), information storage processing (15.36%) and poorly characterized function

323 (22.37%). Notably, the most functional category of related protein-coding genes is

324 metabolism, such as amino acids (458), carbohydrates (453), lipids (529) or secondary

325 metabolites (506). The significant proportion (16.98%) of metabolism-related

326 gene-associated SMs might contribute to their potential antibacterial activities.

327

328 Secondary metabolite gene clusters of A7

329 The analysis of the BGCs of A7 highlighted 7 gene clusters showing 100% similarity

330 to known BGCs, corresponding to a variety of SMs, such as naphthopyrone, aflavarin,

331 6-methylsalicyclic acid, (-)-mellein, asperlactone, aculeacin A, leporin B,

© The Author(s) or their Institution(s) Page 17 of 46 Genome

332 cyclopiazonic acid (CPA) and pyranonigrin E (Fig. 7). As symbolic SMs of

333 filamentous fungi, these compounds have been assumed to have various bioactivities

334 by accumulating studies, including insecticidal (Tepaske et al. 1992), antifeedant

335 (Cary et al. 2015) and antifungal (Iwata et al. 1982) effects. In terms of the

336 biosynthetic pattern, the PKS (Fig. 7A-7D), NRPS (Fig. 7E) and NRPS-PKS hybrid

337 (Fig. 7F-7H) pathways are all involved, which may imply a more diverse toolkit, as

338 well as their derived compounds, as determined from the current limited

339 genome-mining scope.

340 341 For instance, a gene cluster presentDraft in Region 55.1 is presumed to synthesize aflatoxin 342 and CPA, which are important fungal metabolites and major concerns from a safety

343 perspective, as symbolic toxins are mainly produced by Aspergillus flavus (Frisvad et

344 al. 2019). The gene cluster of CPA is reported to be located in the subtelomeric region

345 close to the aflatoxin cluster in the Aspergillus flavus and Aspergillus oryzae genomes

346 (Chang et al. 2009). As shown in Fig. 8, the CPA BGC of A7 shares relatively high

347 amino acid homology with that of Aspergillus flavus NRRL 3357 (> 98%) and

348 Aspergillus oryzae NBRC 4177 (> 84%), despite the insertion of cpaH and cpaM in

349 Aspergillus oryzae NBRC 4177. To some extent, the hypotoxicity of Aspergillus

350 oryzae could be attributed to the detoxifying properties of cpaH (Kato et al. 2011),

351 which has been modified into as a relatively safe strain in food processing (Machida

352 et al. 2005). Essentially, mycotoxins are a very limited group of fungal SMs that are

353 produced more commonly for species competition or nutrient stress than for growth

© The Author(s) or their Institution(s) Genome Page 18 of 46

354 (Frisvad et al. 2018). Therefore, it is not difficult to understand the conservation of

355 gene clusters of mycotoxins in marine-derived Aspergillus sp., especially considering

356 the intricate and dystrophic marine environment.

357

358 For another example, a genome mining approach demonstrated a high degree of

359 homology between A7 and Aspergillus flavus NRRL3357 for the BGC of ustiloxin B

360 (Fig. 9B): among the verified biosynthetic genes of the ustiloxin B BGC, there is

361 considerable similarity (over 60% amino acid identity), despite the lack of ustA, ustP1,

362 ustM and ustS. Notably, ustiloxin B is a secondary metabolite originally identified in 363 the rice pathogen UstilaginoideaDraft virens, which represents a class of macrocyclic 364 peptides (Fig. 9A) with strong potential to suppress microtubule assembly and mitosis

365 (Umemura et al. 2013). Recently, based on concurrent expression of contiguous genes

366 in the genome of Aspergillus flavus, a BGC termed ust has been identified, producing

367 ustiloxin B constructed by oxidative cyclization between the amino acid side chains,

368 which is classified into RiPPs. Among the 3 genes missing in A7, utsA is of great

369 importance, since it encodes the putative precursor peptide of ustiloxin B. It is

370 possible that numerous unidentified RiPP genes exist in fungi (Tsukui et al. 2015;

371 Umemura et al. 2014), which implies the presence of miscellaneous ustiloxin-like

372 products due to the potentially morphed precursor scaffolds. Recently, dozens of

373 ustiloxin-like ribosomal peptide precursor genes have been identified from Aspergilli,

374 computationally supporting this possibility (Nagano et al. 2016). The precursor

375 peptide UstA is highly unique in its characteristic that its core peptide repeats 16

© The Author(s) or their Institution(s) Page 19 of 46 Genome

376 times, which strongly contributes to identifying the gene in the complete sequences

377 (Umemura et al. 2014). Therefore, regarding current bioinformatics, the utsA

378 deficiency in the A7 genome may be attributable to insufficient knowledge of

379 homologous biosynthesis and precursor peptides, which warrants further study.

380

381 Discussion

382 Marine fungi, an important marine microorganism, have become potentially prolific

383 sources of highly bioactive SMs (Rateb and Ebel 2011). The first clinical applications

384 of SMs derived from marine fungi date back to the discovery of cephalosporin C in 385 the 1960s, which led to the developmentDraft of a series of valuable antibiotics (Fischbach 386 2009). During the following decades, an increasing number of marine fungi have been

387 characterized as sources of novel and potential therapeutic agents (Gomes et al. 2015).

388

389 In this study, we isolated a range of strains from corals and sponges in the South

390 China Sea and screened three bioactive strains, which were identified as Aspergillus

391 flavus (A7), Aspergillus flavipes (B27) and Aspergillus sydowii (R12), through

392 morphological and phylogenetic analysis. The fermentation products exhibited

393 promising bacteriostatic effects against both G+ and G- bacteria, with superior efficacy

394 being demonstrated in A7. The antibacterial properties called for consequent study on

395 the strains, for which the availability of a complete genome sequence would clearly be

396 practical and supportive.

397

© The Author(s) or their Institution(s) Genome Page 20 of 46

398 Next, the whole genomes of A7, B27 and R12 were sequenced followed by

399 comparative genomics analysis and functional prediction for A7. Comparative

400 genomics labeled 2369 core genes, while the pattern of pairwise comparison appeared

401 consistent with the phylogenetic relationship, which paved the way for subsequent

402 genome analysis and mining. Thereafter, referring to a terrestrial counterpart,

403 Aspergillus flavus ATCC 11492, the stress tolerance of A7 to salinity or pH was

404 investigated for integrative characterization from phylogenetic and phenotypic aspects.

405 The results of this analysis showed that A7 possessed a broader adaptive capacity to

406 harsh environmental conditions, which was consistent with its oceanic origin. Clearly, 407 intraspecies comparison of AspergillusDraft flavus genomes could help to elucidate the 408 acclimation and adaptation of A7 to harsh marine environments, genetically

409 highlighting the mechanisms governing ecological survivability and adaptive

410 plasticity.

411

412 These potent bioactivities underscored the necessity for the follow-up investigation,

413 especially for A7, of which the classification is confirmed at both the 16S rRNA and

414 genomic levels to Aspergillus flavus. Known as a saprophytic and pathogenic fungus

415 with a cosmopolitan distribution, Aspergillus flavus produces significant quantities of

416 SMs, such as sterigmatocystin, CPA, kojic acid, and β-nitropropionic acid, to cope

417 with environmental or ecological stress (e.g., UV radiation, survival competition and

418 fungivorous predators) (Hedayati et al. 2007; Reverberi et al. 2010). In the case of A7,

419 kojic acid and CPA were isolated and characterized from the rice fermentation extract

© The Author(s) or their Institution(s) Page 21 of 46 Genome

420 through chemical separation (presented in another paper), to which the bacteriostatic

421 efficacy of fermentation production could be at least partly attributed. Therefore, from

422 a genetic perspective, it is reasonable to characterize the highly homologous CPA

423 BGC in A7, and it remains to be further investigated.

424

425 Early studies have affirmed that the Aspergillus flavus genome contains hallmark

426 enzymatic genes involved in SM synthesis, which are far from fully characterized

427 (Amaike and Keller 2011). Consistently, a total of 57 putative BGCs were predicted

428 in the A7 genome. Except for the BGCs mentioned above, there are a variety of gene 429 clusters that show low similaritiesDraft to the BGCs characterized in the database (Table 430 S3), hinting that a wider range of BGCs might designate putative products from

431 Aspergillus sp. to a large chemodiversity.

432

433 Following spurting reports on the bioactive metabolites from microorganisms

434 associated with marine invertebrates, these microbes represent an important reservoir

435 of untapped chemical diversity that enriches therapeutic arsenal (Haygood et al. 1999).

436 Conceivably, the utilization of microbial resources provides a possibility for scale-up

437 production to hurdle the limited supply and unravel their full potential through

438 fermentation or suitable biotechnology approaches (Gomes et al. 2016; Newman

439 2018). From a structural-functional perspective, sequencing-based genomics assays

440 could present a large amount of biological information for the clarification of the

441 biosynthetic pathways of some potent biological agents. As far as Aspergillus sp. is

© The Author(s) or their Institution(s) Genome Page 22 of 46

442 still somewhat difficult to obtain enough natural or semisynthetic compounds due to

443 limitations in current approaches, such as culturable conditions, purification

444 techniques and biosynthesis schemes. In this sense, the current study adds to our

445 knowledge of the genetic profiles and architectures of marine Aspergillus, although

446 further investigation is necessary.

447

448 Conclusion

449 In this study, we performed whole-genome sequencing, comprehensive phylogenetic

450 analysis and genome comparisons on several bioactive strains of Aspergillus 451 previously derived from South China.Draft The subsequent analysis of the physiological 452 characteristics and biosynthetic potential of A7, which was the most potent strain

453 among them, presented a range of putative BGCs for representative fungal SMs,

454 including naphthopyrone, aflavarin, 6-methylsalicyclic acid, (-)-mellein, asperlactone,

455 aculeacin A, leporin B, CPA and pyranonigrin E. Additionally, a gene cluster

456 responsible for the production of ustiloxin-like RiPP was also predicted. Moreover,

457 there are a variety of uncharacterized gene clusters, which awaits comprehensive

458 explorations of their biological functions and putative products. The results of this

459 study may provide a foundation for further research investigating marine Aspergillus

460 sp., facilitating the understanding and exploitation of those fungal species.

461

462 Acknowledgments

© The Author(s) or their Institution(s) Page 23 of 46 Genome

463 This work was supported by funds from the National Key R&D Program of China

464 (2018YFC0311001), the Priority Academic Program Development of the Jiangsu

465 Higher Education Institutions (PAPD) and the Fundamental Research Funds for the

466 Central Universities (SM20190116021).

467

468 References

469 Amaike, S., and Keller, N. P. 2011. Aspergillus flavus. Annual review of phytopathology. 49:

470 107-133. doi:10.1146/annurev-phyto-072910-095221.PMID:21513456

471 Blin, K., Shaw, S., Steinke, K., Villebro, R., Ziemert, N., Lee, S. Y., et al.Weber, T. 2019. 472 antiSMASH 5.0: updates to theDraft secondary metabolite genome mining pipeline. 473 Nucleic Acids Res. 47(W1): W81-W87. doi:10.1093/nar/gkz310.PMID:31032519

474 Boetzer, M., and Pirovano, W. 2012. Toward almost closed genomes with GapFiller. Genome

475 biology. 13(6): R56. doi:10.1186/gb-2012-13-6-r56.PMID:22731987

476 Boetzer, M., Henkel, C. V., Jansen, H. J., Butler, D., and Pirovano, W. 2011. Scaffolding

477 pre-assembled contigs using SSPACE. Bioinformatics (Oxford, England). 27(4):

478 578-579. doi:10.1093/bioinformatics/btq683.PMID:21149342

479 Cary, J. W., Uka, V., Han, Z., Buyst, D., Harris-Coward, P. Y., Ehrlich, K. C., et al.Di Mavungu,

480 J. D. 2015. An Aspergillus flavus secondary metabolic gene cluster containing a

481 hybrid PKS-NRPS is necessary for synthesis of the 2-pyridones, leporins. Fungal

482 Genetics & Biology. 81: 88-97. doi:10.1016/j.fgb.2015.05.010.PMID:26051490

© The Author(s) or their Institution(s) Genome Page 24 of 46

483 Chang, P. K., Ehrlich, K. C., and Fujii, I. 2009. Cyclopiazonic acid biosynthesis of Aspergillus

484 flavus and Aspergillus oryzae. Toxins. 1(2): 74-99.

485 doi:10.3390/toxins1020074.PMID:22069533

486 Fischbach, M. A. 2009. Antibiotics from microbes: converging to kill. Current Opinion in

487 Microbiology. 12(5): 520-527. doi:10.1016/j.mib.2009.07.002.PMID:19695947

488 Frisvad, J. C., Mller, L. L. H., Larsen, T. O., Kumar, R., and Arnau, J. 2018. Safety of the

489 fungal workhorses of industrial biotechnology: update on the mycotoxin and

490 secondary metabolite potential of Aspergillus niger , Aspergillus oryzae , and

491 Trichoderma reesei. Applied Microbiology and Biotechnology. 102(22): 9481-9515. 492 Frisvad, J. C., Hubka, V., Ezekiel, C. N.,Draft Hong, S.-B., Nováková, A., Chen, A. J., et 493 al.Mahakarnchanakul, W. 2019. Taxonomy of Aspergillus section Flavi and their

494 production of aflatoxins, ochratoxins and other mycotoxins. Studies in Mycology. 93:

495 1-63. doi:10.1016/j.simyco.2018.06.001.PMID:30108412

496 Glass, N. L., and Donaldson, G. C. 1995. Development of primer sets designed for use with

497 the PCR to amplify conserved genes from filamentous ascomycetes. Applied and

498 environmental microbiology. 61(4): 1323-1330.PMID:7747954

499 Gomes, N. G., Lefranc, F., Kijjoa, A., and Kiss, R. 2015. Can Some Marine-Derived Fungal

500 Metabolites Become Actual Anticancer Agents? Marine drugs. 13(6): 3950-3991.

501 doi:10.3390/md13063950.PMID:26090846

502 Gomes, N. G., Dasari, R., Chandra, S., Kiss, R., and Kornienko, A. 2016. Marine Invertebrate

503 Metabolites with Anticancer Activities: Solutions to the "Supply Problem". Marine

504 drugs. 14(5): 98. doi:10.3390/md14050098.PMID:27213412

© The Author(s) or their Institution(s) Page 25 of 46 Genome

505 Harris, M. A., Clark, J., Ireland, A., Lomax, J., Ashburner, M., Foulger, R., et al.R, W. 2004.

506 The Gene Ontology (GO) database and informatics resource. Nucleic acids research.

507 32: D258-261. doi:10.1093/nar/gkh036.PMID:14681407

508 Haygood, M. G., Schmidt, E. W., Davidson, S. K., and Faulkner, D. J. 1999. Microbial

509 Symbionts of Marine Invertebrates: Opportunities for Microbial Biotechnology. Journal

510 of Molecular Microbiology & Biotechnology. 1(1): 33-43.PMID:10941782

511 Hedayati, M. T., Pasqualotto, A. C., Warn, P. A., Bowyer, P., and Denning, D. W. 2007.

512 Aspergillus flavus: human pathogen, allergen and mycotoxin producer. Microbiology

513 153(6): 1677-1692. doi:10.1099/mic.0.2007/007641-0.PMID:17526826 514 Hong, S. B., Go, S. J., Shin, H. D., Frisvad,Draft J. C., and Samson, R. A. 2005. Polyphasic 515 taxonomy of Aspergillus fumigatus and related species. Mycologia. 97(6): 1316-1329.

516 doi:10.3852/mycologia.97.6.1316.PMID:16722222

517 Iwata, K., Yamamoto, Y., Yamaguchi, H., and Hiratani, T. 1982. In vitro studies of aculeacin A,

518 a new antifungal antibiotic. The Journal of antibiotics. 35(2): 203-209.

519 doi:10.7164/antibiotics.35.203.PMID:7042670

520 Kanehisa, M., and Goto, S. 2000. KEGG: kyoto encyclopedia of genes and genomes. Nucleic

521 acids research. 28(1): 27-30. doi:10.1093/nar/28.1.27.PMID:10592173

522 Kato, N., Tokuoka, M., Shinohara, Y., Kawatani, M., Uramoto, M., Seshime, Y., et al.Osada,

523 H. 2011. Genetic Safeguard against Mycotoxin Cyclopiazonic Acid Production in

524 Aspergillus oryzae. Chembiochem. 12(9): 1376-1382.

525 doi:10.1002/cbic.201000672.PMID: 21608094

© The Author(s) or their Institution(s) Genome Page 26 of 46

526 Kjærbølling, I., T.C., V., Frisvad, J. C., Nybo, J. L., Theobald, S., Kuo, A., et al.Andersen, M.

527 R. 2018. AspergillusLinking secondary metabolites to gene clusters through genome

528 sequencing of six diverse species. Proceedings of the National Academy of Sciences

529 of the United States of America. 115(4): E753-E761.

530 doi:10.1073/pnas.1715954115.PMID:29317534

531 Kumar, S., Stecher, G., and Tamura, K. 2016. MEGA7: Molecular Evolutionary Genetics

532 Analysis Version 7.0 for Bigger Datasets. Molecular biology and evolution. 33(7):

533 1870-1874. doi:10.1093/molbev/msw054.PMID:27004904

534 Leck, A. 1999. Preparation of lactophenol cotton blue slide mounts. Community eye health. 535 12(30): 24.PMID:17491984 Draft 536 Lee, S., Park, M. S., and Lim, Y. W. 2016. AspergillusDiversity of Marine-Derived from Tidal

537 Mudflats and Sea Sand in Korea. Mycobiology. 44(4): 237-247.

538 doi:10.5941/myco.2016.44.4.237.PMID:28154481

539 Lee, Y. M., Kim, M. J., Li, H., Zhang, P., Bao, B., Lee, K. J., and Jung, J. H. 2013.

540 Marine-derived Aspergillus species as a source of bioactive secondary metabolites.

541 Marine biotechnology. 15(5): 499-519.

542 doi:10.1007/s10126-013-9506-3.PMID:23709045

543 Lindequist, U. 2016. Marine-Derived Pharmaceuticals - Challenges and Opportunities.

544 Biomolecules & therapeutics. 24(6): 561-571.

545 doi:10.4062/biomolther.2016.181.PMID:27795450

© The Author(s) or their Institution(s) Page 27 of 46 Genome

546 Machida, M., Asai, K., Sano, M., Tanaka, T., Kumagai, T., Terai, G., et al.H, K. 2005.

547 Genome sequencing and analysis of Aspergillus oryzae. Nature. 438(7071):

548 1157-1161. doi:10.1038/nature04300.PMID:16372010

549 Marcet-Houben, M., Ballester, A. R., de la Fuente, B., Harries, E., Marcos, J. F.,

550 González-Candelas, L., and Gabaldón, T. 2012. Genome sequence of the

551 necrotrophic fungus Penicillium digitatum, the main postharvest pathogen of citrus.

552 BMC genomics. 13(1): 1-18. doi:10.1186/1471-2164-13-646.PMID:23171342

553 Nagano, N., Umemura, M., Izumikawa, M., Kawano, J., Ishii, T., Kikuchi, M., et al.Asai, K.

554 2016. Class of cyclic ribosomal peptide synthetic genes in filamentous fungi. Fungal 555 genetics and biology : FG & B.Draft 86: 58-70. 556 doi:10.1016/j.fgb.2015.12.010.PMID:26703898

557 Newman, D. J. 2018. “From Large-Scale Collections to the Potential Use of Genomic

558 Techniques for Supply of Drug Candidates”. Frontiers in Marine Science. 5: 401.

559 doi:10.3389/fmars.2018.00401

560 Nielsen, J. C., and Nielsen, J. 2017. Development of fungal cell factories for the production of

561 secondary metabolites: Linking genomics and metabolism. Synthetic and systems

562 biotechnology. 2(1): 5-12. doi:10.1016/j.synbio.2017.02.002.PMID:29062956

563 Payne, G. A., Nierman, W. C., Wortman, J. R., Pritchard, B. L., Brown, D., Dean, R. A., et

564 al.Yu, J. 2009. Whole genome comparison of Aspergillus flavus and A. oryzae.

565 Medical Mycology. 44(Supplement_1): S9-S11.

566 doi:10.1080/13693780600835716.PMID:30408940

© The Author(s) or their Institution(s) Genome Page 28 of 46

567 Rateb, M. E., and Ebel, R. 2011. Secondary metabolites of fungi from marine habitats.

568 Natural Product Reports. 28(2): 290. doi:10.1039/C0NP00061B.PMID:21229157

569 Reverberi, M., Ricelli, A., Zjalic, S., Fabbri, A. A., and Fanelli, C. 2010. Natural functions of

570 mycotoxins and control of their biosynthesis in fungi. Applied microbiology and

571 biotechnology. 87(3): 899-911. doi:10.1007/s00253-010-2657-5.PMID:20495914

572 Schoch, C. L., Seifert, K. A., Huhndorf, S., Robert, V., Spouge, J. L., Levesque, C. A., and

573 Chen, W. 2012. Nuclear ribosomal internal transcribed spacer (ITS) region as a

574 universal DNA barcode marker for Fungi. Proceedings of the National Academy of

575 Sciences of the United States of America. 109(16): 6241-6246. 576 doi:10.1073/pnas.1117018109.PMID:22454494Draft 577 Skropeta, D., and Wei, L. 2014. Recent advances in deep-sea natural products. Natural

578 product reports. 31(8): 999-1025. doi:10.1039/c3np70118b.PMID:24871201

579 Stanke, M., Keller, O., Gunduz, I., Hayes, A., Waack, S., and Morgenstern, B. 2006.

580 AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic acids research. 34:

581 W435-439. doi:10.1093/nar/gkl200.PMID:16845043

582 Tatusov, R. L., Galperin, M. Y., Natale, D. A., and Koonin, E. V. 2000. The COG database: a

583 tool for genome-scale analysis of protein functions and evolution. Nucleic acids

584 research. 28(1): 33-36. doi:10.1093/nar/28.1.33.PMID:10592175

585 Tepaske, M. R., Gloer, J. B., Wicklow, D. T., and Dowd, P. F. 1992. Aflavarin and β-Aflatrem:

586 New Anti-Insectan Metabolites from the Sclerotia of Aspergillus flavus. Journal of

587 Natural Products. 55(8): 1080-1086. doi:10.1021/np50086a008

© The Author(s) or their Institution(s) Page 29 of 46 Genome

588 Tsukui, T., Nagano, N., Umemura, M., Kumagai, T., Terai, G., Machida, M., and Asai, K. 2015.

589 Ustiloxins, fungal cyclic peptides, are ribosomally synthesized in Ustilaginoidea virens.

590 Bioinformatics (Oxford, England). 31(7): 981-985.

591 doi:10.1093/bioinformatics/btu753.PMID:25414363

592 Umemura, M., Nagano, N., Koike, H., Kawano, J., Ishii, T., Miyamura, Y., et al.Machida, M.

593 2014. Characterization of the biosynthetic gene cluster for the ribosomally

594 synthesized cyclic peptide ustiloxin B in Aspergillus flavus. Fungal genetics and

595 biology : FG & B. 68: 23-30. doi:10.1016/j.fgb.2014.04.011.PMID:24841822

596 Umemura, M., Koike, H., Nagano, N., Ishii, T., Kawano, J., Yamane, N., et al.Machida, M. 597 2013. MIDDAS-M: motif-independentDraft de novo detection of secondary metabolite 598 gene clusters through the integration of genome sequencing and transcriptome data.

599 PloS one. 8(12): e84028. doi:10.1371/journal.pone.0084028.PMID:24391870

600 Visagie, C. M., Houbraken, J., Frisvad, J. C., Hong, S. B., Klaassen, C. H., Perrone, G., et

601 al.Samson, R. A. 2014. Identification and nomenclature of the genus Penicillium.

602 Studies in mycology. 78: 343-371. doi:10.1016/j.simyco.2014.09.001.PMID:25505353

603 White, T., Bruns, T., Lee, S., and Taylor, J. 1990. Amplification and direct seqencing of fungal

604 ribosomal RNA genes for phylogenetics. Pcr Protocols: 315-312.

605 doi:10.1016/B978-0-12-372180-8.50042-1

606 Yin, G., Hua, S., Pennerman, K. K., Yu, J., Bu, L., Sayre, R. T., and Bennett, J. W. 2018.

607 Genome sequence and comparative analyses of atoxigenic Aspergillus flavus WRRL

608 1519. Mycologia. 110(3): 482-493.

609 doi:10.1080/00275514.2018.1468201.PMID:29969379

© The Author(s) or their Institution(s) Genome Page 30 of 46

610 Zerbino, D. R., and Birney, E. 2008. Velvet: algorithms for de novo short read assembly using

611 de Bruijn graphs. Genome research. 18(5): 821-829.

612 doi:10.1101/gr.074492.107.PMID:18349386

Draft

© The Author(s) or their Institution(s) Page 31 of 46 Genome

614 Table 1. Genome overview of 9 Aspergillus flavus from the terrestrial ecosystem.

615 Table 2. Assessment of antimicrobial activities of fermentation products

616 E. coli: Escherichia coli; P. aeruginosa: Pseudomonas aeruginosa; K. aerogenes:

617 Klebsiella aerogenes; MRSA: methicillin-resistant Staphylococcus aureus; MRSE:

618 methicillin-resistant Staphylococcus epidermidis; M. luteus: Micrococcus luteus; A.

619 baumannii: Acinetobacter baumannii. The antimicrobial activities were estimated by

620 measuring the diameters of the inhibition zones and expressed on a scale relative to

621 the standard antibiotics (0.1 mM chloramphenicol). The activities were considered

622 weak (+), moderate (++), or strong (+++) when the value was < 0.5, between 0.5 and 623 1.0 or ≥ 1.0. Draft 624 Table 3. General features of the A7, B27 and R12 genomes

625 Fig. 1. Morphology of characterized strains. A-C: Colony and microscopic

626 morphology after 7 days of incubation. From left to right: obverse colonies on MEA,

627 reverse on MEA, conidiophores and conidiogenous at 40× magnification (scale bar 10

628 µm).

629 Fig. 2. Maximum likelihood tree of A7, B27, and R12 based on the concatenated

630 sequences of ITS, BenA and CaM. Multiple sequence alignment was conducted

631 using Clustal W (default settings), and phylogenetic relationships were based on ML

632 analysis with 1000 bootstrap replications in MEGA7. Numbers above branches

633 indicate bootstrap values. The sequence accession numbers and detailed strain

634 information for the phylogenetic tree are provided in supplemental Table S2.

© The Author(s) or their Institution(s) Genome Page 32 of 46

635 Fig. 3. Genome comparison of A7, B27 and R12. A. Venn diagram of core and

636 specific genes in each strain (A7, B27, R12) through cluster analysis of cds sequence;

637 B. Comparative analysis of types of BGCs among the three Aspergillus species (A7,

638 B27, R12). NRPS: nonribosomal peptide synthase; PKS: polyketide synthase; RiPP:

639 ribosomally synthesized and posttranslationally modified peptide.

640 Fig. 4. Comparison of the physiological characteristics between A7 and ATCC

641 11492. A. Effect of different NaCl concentrations on colony growth of Aspergillus

642 flavus (A7 and ATCC 11492) at 5 days; B. Effect of pH values on the growth of

643 Aspergillus flavus colonies (A7 and ATCC 11492) at 5 days after inoculation. 644 Fig. 5. Comparison of A7 with otherDraft Aspergillus flavus. Different colors represent 645 different BGC types.

646 Fig. 6. KOG classifications of putative proteins in the genome of A7. I:

647 intracellular processes; II: metabolism; III: information storage/processing; IV: poorly

648 characterized function.

649 Fig. 7. Schematic representation of A7 putative BGCs showing high similarity

650 with genes from characterized BGCs. A-G. The upper part represents the BGC in

651 A7, followed by the known BGCs in the MIBiG database.

652 Fig. 8. Schematic comparison of the CPA BGC between A7 and reference strains.

653 The homologous genes in region 55.1 and known CPA BGCs from Aspergillus oryzae

654 NBRC 4177 or Aspergillus flavus NRRL 3357 are marked with the same color, and

655 missing genes are marked with a red frame. BGC descriptions and amino acid

656 homology (query cover and identity) are listed.

© The Author(s) or their Institution(s) Page 33 of 46 Genome

657 Fig. 9. Putative ustiloxin-like BGC in A7. A. ustiloxin B structure; B. The putative

658 ustiloxin-like BGC of A7 and the comparison of this cluster with the ustiloxin B

659 cluster reported for Aspergillus flavus NRRL3357. Compared with the ustiloxin B

660 BGC of Aspergillus flavus NRRL3357, ustA, ustP1, ustM and ustS were missing in

661 region 46.1, which is marked with a read frame. The homologous genes are marked

662 with the same color. Known BGC descriptions and amino acid homologies (query

663 cover and identity) are listed below.

Draft

© The Author(s) or their Institution(s) Genome Page 34 of 46

Table 1. Genome overview of 9 Aspergillus flavus from the terrestrial ecosystem

Strain number NRRL 3357 AF12 E1404 CS1137 WRRL 1519 CA14 NRRL TERIBRI SU16

21882

Isolation source moldy peanuts Cotton field Peanut Cotton seed almond nut Pistacia vera Zea mays muddy water Wheat Qu

soil Total sequence 36.00 36.27 35.94 Draft35.68 36.28 35.96 35.30 34.10 35.82 length (Mb)

Assembly level Complete Scaffold Scaffold Scaffold Scaffold Scaffold Scaffold Contig Chromosome

Genome

Genome coverage 600.0x 40.0x 124.0x 123.0x 248.0x 121.0x 100.0x 200.0x 570.0x

Sequencing Illumina Illumina Illumina Illumina Illumina Illumina Illumina Illumina Illumina technology HiSeq HiSeq HiSeq HiSeq MiSeq MiSeq HiSeq HiSeq HiSeq

GenBank assembly GCA_009017 GCA_00371 GCA_013146 GCA_003711 GCA_00286 GCA_00370 GCA_00244 GCA_00415 GCA_00985 accession 415.1 1345.1 025.1 285.1 4195.1 9025.1 3195.2 0275.1 6665.1

© The Author(s) or their Institution(s) Page 35 of 46 Genome

Draft

© The Author(s) or their Institution(s) Genome Page 36 of 46

Table 2. Assessment of Antimicrobial activities of fermentation products

E. coli P. aeruginosa K. aerogenes MRSA MRSE M. luteus A. baumannii

A7 + + ++ + ++ − +

B27 − − − ++ +++ ++ +

R12 − − − − − + −

E. coli: Escherichia coli; P. aeruginosa: Pseudomonas aeruginosa; K. aerogenes: Klebsiella aerogenes;

MRSA: methicillin-resistant Staphylococcus aureus; MRSE: methicillin-resistant Staphylococcus

epidermidis; M. luteus: Micrococcus luteus; A. baumannii: Acinetobacter baumannii. The antimicrobial

activities were estimated by measuring the diameters of the inhibition zones and expressed on a scale

relative to the standard antibiotics (0.1 mM chloramphenicol). The activities were considered weak (+), moderate (++), or strong (+++) when the Draftvalue was < 0.5, between 0.5 and 1.0 or ≥ 1.0.

© The Author(s) or their Institution(s) Page 37 of 46 Genome

Table 3. General features of the A7, B27 and R12 genomes

Genome A7 B27 R12

Assembly size (Mp) 35.02 32.33 35.39

G+C (%) 48.39 52.42 50.59

Assembled scaffolds 338 158 1857

N50 length (bp) 634952 1367452 563844

verage length(bp) 108659.86 210934.2 19987.64

Predicted Protein-Coding Genes 11711 11307 13035

Average length of Predicted Protein-Coding Genes (bp) 1575.27 1662.22 1548.37 Average depth of reads cover Draft157.8 162.97 218.12 Sequencing Method Illumina HiSeq Illumina HiSeq Illumina HiSeq

© The Author(s) or their Institution(s) Genome Page 38 of 46

Draft

Fig.1. Morphology of characterized strains. A-C: Colony and microscopic morphology after 7 days of incubation. From left to right: obverse colonies on MEA, reverse on MEA, conidiophores and conidiogenous at 40× magnification (scale bar 10 µm).

© The Author(s) or their Institution(s) Page 39 of 46 Genome

Draft

Fig. 2. Maximum likelihood tree of A7, B27, and R12 based on the concatenated sequences of ITS, BenA and CaM. Multiple sequence alignment was conducted using Clustal W (default settings), and phylogenetic relationships were based on ML analysis with 1000 bootstrap replications in MEGA7. Numbers above branches indicate bootstrap values. The sequence accession numbers and detailed strain information for the phylogenetic tree are provided in supplemental Table S2.

© The Author(s) or their Institution(s) Genome Page 40 of 46

Draft

Fig. 3. Genome comparison of A7, B27 and R12. A. Venn diagram of core and specific genes in each strain (A7, B27, R12) through cluster analysis of cds sequence; B. Comparative analysis of types of BGCs among the three Aspergillus species (A7, B27, R12). NRPS: nonribosomal peptide synthase; PKS: polyketide synthase; RiPP: ribosomally synthesized and posttranslationally modified peptide.

© The Author(s) or their Institution(s) Page 41 of 46 Genome

Draft

Fig. 4. Comparison of the physiological characteristics between A7 and ATCC 11492. A. Effect of different NaCl concentrations on colony growth of Aspergillus flavus (A7 and ATCC 11492) at 5 days; B. Effect of pH values on the growth of Aspergillus flavus colonies (A7 and ATCC 11492) at 5 days after inoculation.

© The Author(s) or their Institution(s) Genome Page 42 of 46

Draft Fig. 5. The comparison of A7 with other Aspergillus flavus. Different colors represent different BGC types.

© The Author(s) or their Institution(s) Page 43 of 46 Genome

Draft

Fig. 6. KOG classifications of putative proteins in the genome of A7. I: intracellular processes; II: metabolism; III: information storage/processing; IV: poorly characterized function.

© The Author(s) or their Institution(s) Genome Page 44 of 46

Draft

Fig. 7. Schematic representation of A7 putative BGCs showing high similarity with genes from characterized BGCs. A-G. The upper part represents the BGC in A7, followed by the known BGCs in the MIBiG database.

© The Author(s) or their Institution(s) Page 45 of 46 Genome

Draft

Fig. 8. Schematic comparison of the CPA BGC between A7 and reference strains. The homologous genes in region 55.1 and known CPA BGCs from Aspergillus oryzae NBRC 4177 or Aspergillus flavus NRRL 3357 are marked with the same color, and missing genes are marked with a red frame. BGC descriptions and amino acid homology (query cover and identity) are listed.

© The Author(s) or their Institution(s) Genome Page 46 of 46

Draft

Fig. 9. Putative ustiloxin-like BGC in A7. A. ustiloxin B structure; B. The putative ustiloxin-like BGC of A7 and the comparison of this cluster with the ustiloxin B cluster reported for Aspergillus flavus NRRL3357. Compared with the ustiloxin B BGC of Aspergillus flavus NRRL3357, ustA, ustP1, ustM and ustS were missing in region 46.1, which is marked with a read frame. The homologous genes are marked with the same color. Known BGC descriptions and amino acid homologies (query cover and identity) are listed below.

© The Author(s) or their Institution(s)