Dataset overview
The final dataset analysed in this study comprised 66 MAGs, 65 of which derived from Bacteria and one from Archaea (Table S1, Additional File 2). Twenty-five MAGs were obtained from healthy octocoral samples (all species together), 14 MAGs from necrotic E. gazella tissue and 27 MAGs, including the archaeal one, from seawater. Of the 66 MAGs, 30 and 36 were of high and medium quality, respectively (Table S2, Additional File 2). Average genome size and GC-content ranged from only 0.63 Mb and 22.3% in Metamycoplasmataceae to 4.2 Mb and 59.6% in Ca. Inquilinaceae (Alphaproteobacteria), both symbionts of healthy octocorals (Table S1, Additional File 2).
Taxonomic affiliation of MAGs
The 66 MAGs belonged to six phyla, seven classes, 15 orders, 16 families, and at least 30 species as defined by 95% ANI thresholds (Figure 1a-c; Table S1, Additional File 2). The 21 Gammaproteobacteria MAGs retrieved from octocorals all represented so-far uncultured and unclassified lineages, with 11 MAGs affiliating with the Endozoicomonadaceae family (Figure 1b). Of these, nine derived from the microbiomes of healthy octocoral tissue and two from necrotic tissue. Six octocoral-derived MAGs affiliated with the family Ca. Thioglobaceae, five of them obtained from the microbiomes of healthy tissue. Moreover, two Gammaproteobacteria MAGs from healthy L. sarmentosa were affiliated with candidate taxon DT-91 of the order Pseudomonadales,while two MAGs from necrotic E. gazella were identified as unclassified Cardiobacterales. Contrarily, the three Gammaproteobacteria MAGs found in seawater belonged to the genus Luminiphilus (Halieaceae, Cellvibrionales). Notably, four MAGs from healthy octocoral tissue were affiliated with intracellular bacterial symbionts [55] of the families Metamycoplasmataceae and Ca. Rhabdochlamydiaceae. The five Alphaproteobacteria MAGs from healthy octocoral tissue were either unclassified at order level (N=2) or belonged to the candidate family Inquilinaceae and the Rhodobacteraceae genera Ruegeria and Yoonia (Figure 1c). Among the MAGs obtained from necrotic E. gazella tissue was also one affiliating with the genus Aquimarina (Bacteroidetes), a taxon frequently cultured from octocorals [14, 22], as well as three MAGs affiliating with the Rhizobiaceae family, one of them identified as Lentilitoribacter. Overall, little overlap was observed at species level between MAGs reconstructed from healthy octocoral tissue versus necrotic tissue versus seawater. Of 11 bacterial species recovered from healthy octocorals, only two species were also recovered from necrotic samples. However, their average genome coverage, a proxy for relative abundance, was much lower in necrotic than healthy tissue (Figure 1c).
Phylogenomics of Endozoicomonadaceae and Ca. Thioglobaceae MAGs
Phylogenomics analysis of the Endozoicomonadaceae family comprised 29 publicly available type genomes, MAGs, and SAGs plus the 11 octocoral-derived MAGs retrieved in this study (Figure 2). The latter formed their own, well supported and deeply branching clade, separate from the genomes of all described genera with cultured representatives (i.e., Kistimonas, Parendozoicomonas and Endozoicomonas). This clade comprised two subclusters, each representing a novel species, sharing ~89.8% ANI between them. Subcluster I contained seven MAGs, all obtained from Eunicella spp. Subcluster II comprised four MAGs, three from L. sarmentosa and one from E. gazella. The closest type strain genome to our MAGs, was Endozoicomonas atrinae GCA_001647025T, which shared only 52-53% AAI with our MAGs (Table S1, Additional File 2), well below the 65% threshold considered by MiGA for same-genus classification. This indicates that our 11 Endozoicomonadaceae MAGs represent two distinct species, forming a novel yet uncultured genus unique to temperate octocorals. We propose the names Candidatus Gorgonimonas eunicellae (corresponding to subcluster I) andCa. Gorgonimonas leptogorgiae (corresponding to subcluster II) for the two species.
Phylogenomics inference of the Ca. Thioglobacaceae family showed that our six octocoral-derived Thioglobaceae MAGs formed two separate clusters, representing distinct, novel species as judged by ANI values way below 80% compared with the remainder genomes of the family (Figure S1, Additional File 1). The first clade comprised two E. gazella-derived MAGs which formed a subcluster within other Ca. Thioglobaceae clusters of the genera Thioglobus and Thiomultimodus. The second clade was composed of the otherfour Thioglobaceae MAGs, derived from healthy L. sarmentosa and E. gazella tissue, which formed a well-supported, deeply branching phylogenomic node on their own, sharing only 46% AAI with genomes of their closest type strains, namely Sulfurivirga caldicuralii GCA_900141795T and Thiohalobacter thiocyanaticus GCA_003932505T (Table S1, Additional File 2). This indicates that the six octocoral-derived Ca. Thioglobaceae MAGs not only represent two distinct species but most likely two distinct, novel genera, here proposed Ca. Thiocorallibacter gorgonii and Ca. Microaerophilica antagonistica (Figure S1, Additional File 1), which so far lack cultured representatives.
Functional profiling of MAGs from octocoral and seawater microbiomes
Multivariate analysis based on COG functional profiles showed that the MAGs grouped primarily according to their (order level) taxonomic affiliations (PERMANOVA, F = 9.869, P = 0.0001) (Figure 3). The 11 Endozoicomonadaceae MAGsformed a very tight cluster, much distant from all other MAGs. Such separate clustering was mostly determined by the high copy number of ankyrin repeat motifs (COG0666) and serine/threonine protein kinase-encoding genes (COG0515) on the Endozoicomonadaceae MAGs. SIMPER analysis showed that these two COGs were indeed the functions that contributed most to the dissimilarity between all MAGs at order level (Tables S3, S4, Additional File 2).
The six Ca. Thioglobaceae MAGs formed two, well-separated clusters in the ordination space, one comprising the two Ca. Thiocorallibacter gorgonii MAGs and another one with the four Ca. Microaerophilica antagonistica MAGs, congruent with our phylogenomic assessment. The positioning of Alphaproteobacteria MAGs in the PCA diagram was influenced, amongst others, by the presence and abundance of genes encoding for LysR family transcriptional regulator (COG0583), ABC sugar transport system (COG3839), Acyl-CoA and NAD(P)-dependent alcohol dehydrogenases (COG1960 and COG1028) and drug metabolite transporters (COG0697).
Functional features enriched in Endozoicomonadaceae symbionts of octocorals
The 11 Endozoicomonadadeae MAGs were drastically enriched in COGs related to eukaryotic-like proteins (Figure 4), mainly ankyrin repeats (q < 0.0001; Welch’s test)and, to a lesser extent, WD40 repeats and tetratricopeptide repeats (Table S5, Additional File 2, Figure S2, Additional File 1). These MAGs also displayed high abundance of COG entries related to thetype III secretion system (q < 0.0001; Welch’s test), serine/threonine protein kinases, serine protease inhibitors (q < 0.0001; Welch’s test) and the membrane-anchored periplasmic protein YejM.Other typical features of all 11 Endozoicomonadaceae MAGs were several COG entries associated with Type IV-pilus (Tfp) production (Figures 4 and 5), and the consistent presence of COG3206 encoding a protein involved in exopolysaccharide (EPS) biosynthesis, which was not observed to such extent in any of the other 55 MAGs investigated here. The two Endozoicomonadaceae species identified in this study were distinguished by a consistent presence of serine/threonine phosphatase encoding genes on the four MAGs of Ca. Gorgonimonas leptogorgiae which were absent on the seven MAGs of Ca. G. eunicellae.
The Endozoicomonadaceae MAGs show capacity for pyruvate metabolism and to convert acetyl-CoA to acetate via acetyl-phosphate, through a phosphate acetyl transferase (EC 2.3.1.8) and an acetate kinase (EC 2.7.2.1), respectively, a process able to generate ATP independently from aerobic conditions (Figure S3, Additional File 1). The consistent presence of pyruvate-formate lyase activating enzyme encoding genes (COG1189) on the 11 Endozoicomonadaceae genomes (Figure S2, Additional File 1, Welsh’s-test q < 0.0001) further suggests that these symbionts may supply the citric acid cycle with acetyl-CoA from pyruvate during anaerobic glycolysis. Evidence for adaptation of Endozoicomonadaceae symbionts to suboxic conditions could also be found through the consistent presence of feoA/B genes (COG1918, COG0370) encoding for ferrous iron (Fe2+) uptake systems, distinguishing this taxon from the other 55 MAGs of this study (Welch’s t-test, q < 0.0001). In this regard, Endozoicomonadaceae MAGs were also distinguished by the presence of rubredoxin encoding genes (Figures 4-6;Figure S2, Additional File 1), a class of iron-containing proteins which play an important role in superoxide reduction that can be found in several anaerobic and sulphate-reducing bacteria.
All Endozoicomonadaceae MAGs possessed a gene encoding an endo-chitinase (COG3469) involved in the extracellular breakdown of chitin polymers (Figures 4-6). We found a high degree of novelty within these 11 endo-chitinase encoding genes, as they clustered into two distinct groups and possessed less than 50% amino acid sequence similarity to publicly available endo-chitinases (Blastp search). Remarkably, these groups mirror the phylogenomic relatedness of the Endozoicomonadaceae MAGs, representing two endo-chitinase gene clades, each from Ca. species Gorgonimonas leptogorgiae and G. eunicellae (Figure S4, Additional File 1). Protein family (Pfam) analysis confirmed the presence of a GH18 domain with an active site on all 11 genes. These genes were all complete (start and stop codon present) and carried a signal peptide sequence, indicating the protein can be excreted from the cell (Table S6A, Additional File 2). We also screened all publicly availablegenomes from cultured and uncultured Endozoicomonadaceae representatives and detected endo-chitinases on 32 out of 42 Endozoicomonadaceae genomes (Table S6B, Additional File 2).
Metabolic inference of Ca. Thioglobaceae symbionts of octocorals
The Ca. Thioglobaceae MAGs were significantly enriched in CRISPR/Cas system-associated endoribonuclease Cas2 (COG1343) (Figure S5, Additional File 1). Several other CRISPR/Cas protein-encoding genes were found on all Ca. Thioglobaceae, all Metamycoplasmataceae, and many Endozoicomonadaceae MAGs (Figure 4; Table S5, Additional File 2). Ca. Thioglobaceae MAGs were further significantly enriched in Na+-translocating ferredoxin:NAD+ oxidoreductase (Rnf complex) encoding genes (Figure S5, Additional File 1), which were also present in great abundance in nine of the 11 Endozoicomonadaceae MAGs. An assimilatory sulfite (SO32) reductase (EC 1.8.1.2) was also found on the Ca. Thioglobaceae MAGs, pointing towards a role in sulfur cycling in the octocoral holobiont. The four Ca. Microaerophilica antagonistica MAGs encoded for versatile taurine utilization pathways and its metabolization to aminoacetaldehyde and sulfite, via taurine deoxygenase (TauD; EC 1.14.11.17), or to sulfoacetaldehyde via taurine-pyruvate-aminotransferase (EC 2.6.1.77) (Figure S6, Additional File 1).
All Ca. Thioglobaceae MAGs showed an extensive genetic repertoire for ammonium assimilation and transformation of inorganic nitrogen into amino acids, possessing genes coding for glutamine and asparagine synthetase (EC 6.3.1.2; EC 6.3.5.4), L-asparaginase (EC 3.5.1.1), glutamine amidotransferase (EC 6.3.1.5), aminoethyltransferase (EC 2.1.2.10), and glutamate synthase (EC 1.4.1.13; EC 1.4.7.1). The latter was significantly enriched in this symbiotic family compared to the other 60 MAGs obtained in this study (One-sided Welch’s-test, p>0.001, Figure S5 Additional File 1). Finally, Ca. Thiocorallibacter gorgonii MAG EG15H_bin1 stood out as the only MAG harboring ribulose 1,5-bisphosphate carboxylase - RuBisCo (EC 4.1.1.39, COG4451, COG1850) and several other genes involved in the reductive dicarboxylate cycle (Figure S7, Additional File 1), suggesting a chemoautotrophic lifestyle and “dark carbon fixation” ability of this octocoral symbiont.
Secondary metabolite biosynthetic capacities of octocoral symbionts
Genome mining with antiSMASH revealed that 46 MAGs harboured between one and 16 (Ca. Inquilinaceae EV04_Bin1) SM-BGCs, while 20 MAGs lacked SM-BGCs, among them all Endozoicomonadaceae, Metamycoplasmataceae, Ca. Rhabdochlamydiaceae and unclassified Aphaproteobacteria MAGs from octocorals (Figure 7a; Table S7, Additional File 2). Notably, the two Ca. Thioglobaceae species presented distinct secondary metabolite coding potential. While Ca. Thiocorallibacter gorgonii MAGs harboured one or two arylpolyene cluster(s) (which may function as antioxidants), Ca. Microaerophilica antagonistica MAGs harbored one T3PKS and one bacteriocin/RiPP SM-BGC each, which may indicate antagonistic potential. The two Pseudomonadales MAGs from healthy L. sarmentosa samples showed rich SM-BGC profiles, with 7-8 NRPS and 2-3 bacteriocin/RiPP/proteusin clusters, plus a siderophore SM-BGC (Figures 6 and 7a).
Only 24 out of 163 SM-BGCs detected across all MAGs showed some homology with SM-BGCs encoding known compounds present in the MIBiG database, with 14 SM-BGCs sharing a similarity of 60-100% (Figure 7b; Table S8, Additional File 2). One of the NRPS clusters of Pseudomonadales MAG LS06H_Bin2 showed 100% similarity to the NRPS cluster of the antimicrobial peptide bicornutin A [56] (Figure 7c, d). Another NRPS cluster with 100% similarity to a SM-BGC of a known compound, namely the antibiotic and cyclic depsipeptide isocalide A [57], was identified on the Ca. Inquilinaceae MAG from healthy E. verrucosa, which was also the MAG with the richest SM-BGC profile across the entire dataset.