Building a diverse library of mycobacterial species
A collection of 44 tractable mycobacterial species was assembled to cover most of the mycobacterial phylogenetic tree (Fig. 1a). To evaluate the biological diversity of our library we analyzed the ecological and genomic information available for the selected species. In terms of niche/pathogenicity, mycobacteria can be broadly divided into bona fide avirulent saprophytes, opportunistic pathogens, and professional pathogens. However, we considered necessary to add further nuance to the opportunistic category by assessing the strength of the evidence of pathogenicity provided in the literature and used this knowledge to establish a five-level scoring system (Fig. 1b). We determined doubling times in liquid culture in equivalent experimental conditions for 26 of the species (Fig. 1c). The results revealed large differences within the genus. For example, M. szulgai and M. flavescens divided every 1.3 and 2.0 hours, respectively, while M. marinum and M. tuberculosis, divided every 17.1 hours. Strikingly, among slow-growers a 13-fold change difference exists between the fastest and the slowest-growing species (M. szulgai and M. tuberculosis respectively). The genome size of the library species is also highly different, with genomes as small as 3.59 Mbp and 4.08 Mbp, for M. triviale and M. koreense, respectively, to genomes as large as 6.99 Mbp and 8.01 Mbp for M. smegmatis and M. mageritense, respectively (Fig. 1d). These genome size differences suggest that the nature of the accessory genome varies widely from species to species. Interestingly, there is only a very small difference when comparing the average genome size of all fast and slow growers, 6.1 vs 5.8 Mbp, respectively. The high guanine-cytosine content (GC%) is a defining characteristic of mycobacteria. Within our library, GC% varies from 66.9–68.8%, in slow growers and the M. terrae clade respectively (Fig. 1e). Next, we investigated the number of ribosomal RNA encoding genes (rrn operon), a feature that has often been linked to growth rate. Twenty-five species possess a single copy of the rrn operon while the other 19 possess two copies (Fig. 1f). In general, but not always, “fast growers” tend to have two copies while “slow growers” one. In the genus Bacillus a correlation between growth rate and rrn operon copy number has been experimentally disproven [22], and number of rrn operon copies could be related to response of resource availability [23]. Furthermore, comparative pangenome analyses conducted by Bachmann and collaborators suggested that growth limitation in slow growing mycobacteria might instead be related to loss of amino acid transporters [24]. Finally, we analyzed the gene ontology (GO) distribution for our library in order to evaluate functional genetic diversity at the genome level. GO distribution varies noticeably across the analyzed genomes. The number of genes associated with transcription and membrane transport (categories indicated by arrows in Fig. 1g) appeared to be particularly variable. We noted that much of the genome of several of these species is poorly annotated. Therefore, Fig. 1 supports the potential usefulness of our library as a resource to investigate mycobacterial biology, antibiotic resistance, and pathogen evolution.
Wide variation in antibiotic resistance profiles in mycobacteria
To harness the potential of our library, we tested the antibiotic potency and the extent of its variation across the Mycobacterium genus to identify biologically-relevant differences to be further studied. We determined minimal inhibitory concentrations (MIC99) for 15 antibiotics, spanning most of the classes employed to treat mycobacterial infections, including TB (Fig. 2a, Supplementary Table 1). We found that several species displayed at least one MIC99 value that is considerably different from the mean (Supplementary Figs. 1 and 2), highlighting the biological diversity of the genus with respect to antibiotic action. As expected, notoriously multi-drug resistant M. abscessus was resistant to several antibiotics (Fig. 2a) [21, 25], yet M. abscessus was not the most resistant of the species studied. M. mageritense, M. salmoniphilum and M. houstonense were highly resistant to most of the antibiotics tested. M. abscessus was somewhat sensitive to amikacin (AMK) and bedaquiline (BDQ), which is consistent with other findings in the literature [26, 27]. Also, the magnitude of the changes in MIC99 is remarkable, of the order of 100- to 1000-fold in some instances. Of note, when the data were re-ordered and unbiased clustered, based on the overall antibiotic sensitivity of each individual species, the species distribution were divided in three main clusters, which are dramatically different when compared to their phylogenetic positioning, as shown by the tanglegram between the two heatmaps in Fig. 2a. To illustrate the absence of a taxonomic trend in antibiotic resistance we explored in detail the clade composed by M. holsaticum, M. phlei, M. flavescens, M. tusciae and M. moriokaense (Supplementary Fig. 3). While most antibiotics behaved similarly across this group (i.e., MIC99 FC < 3-fold). M. holsaticum was highly sensitive to para-aminosalicylic acid (PAS) and highly resistant to BDQ. High-level BDQ resistance was also observed with M. flavescens. Interestingly, M. flavescens is unusually sensitive to d-cycloserine (DCS). The mechanisms underpinning these distinct responses to antibiotics are currently unknown. We also observed that once ordered by antibiotic sensitivity, M. tuberculosis is positioned at the middle of the heatmap and the number of NTM more resistant to antibiotics compared to M. tuberculosis is nearly equal to the number of NTM that are more sensitive. Therefore, NTM are not generally intrinsically more drug resistant to the antibiotics tested than M. tuberculosis. In summary, antibiotic sensitivity varies dramatically across the Mycobacterium genus and our data provide the first quantitative blueprint of this variation.
There are striking differences in antibiotic response across the genus that highlight the value of a genus-wide approach to inform antibiotic research efforts. For example, M. smegmatis is frequently used as a model organism for M. tuberculosis in TB antibiotic discovery [28], but it displayed a completely different sensitivity profile from M. tuberculosis, being highly resistant to PAS, ethionamide (ETH), DCS and RIF. Our results suggest that M. marinum is a better M. tuberculosis proxy [29] as both have a similar sensitivity profile except to ofloxacin (OFX) (Fig. 2b). The overall distribution of antibiotic potency (MIC99) against different species is shown in Fig. 2c. Cell-envelope-targeting antibiotics and PAS exhibit a weaker potency across the genus, while antibiotics that target protein synthesis, DNA gyrase and the ATP synthase on average displayed an overall lower MIC99, indicating that most mycobacteria are sensitive to them. From the antibiotics that inhibit protein synthesis, linezolid (LZD) displays the lowest overall MIC99 and was effective against most species (Fig. 2c). To verify whether there is a correlation between doubling time and sensitivity to antibiotics we compared the doubling time of a subset of species (Fig. 1c) with the MIC99 of a subset of antibiotics. As it can be seen in Fig. 2d, no correlation is apparent between growth rate and antibiotic sensitivity in mycobacteria. Below, we explore the molecular causes of these dramatic changes in antibiotic potency observed across the Mycobacterium genus.
Intra-bacterial antibiotic accumulation does not predict potency.
We employed liquid chromatography–time-of-flight mass spectrometry (LC-MS) to determine the relative internal concentration of antibiotic ([ABX]IB) with an antibiotic concentration in the growth medium of 6×MIC99. [ABX]IB is a function of three parameters: antibiotic uptake, efflux, and modification. Figure 3a shows extracted ion chromatograms (EICs) in five mycobacterial species for BDQ, LZD and RIF (Fig. 3b). Quantification of [BDQ]IB, [LZD]IB and [RIF]IB illustrates the variability and the magnitude of the changes observed in [ABX]IB, spanning from 2- to 200-fold (Fig. 3c). As the experiment was performed at a concentration of antibiotic where every antibiotic was equally potent, we replotted these data as a function of each antibiotic MIC99 (Fig. 3d). Only for BDQ we could observe a correlation between antibiotic potency and [BDQ]IB which could be indicative of efflux playing a role in antibiotic efficacy. In the case of RIF, where there is no correlation between antibiotic potency and its accumulation in mycobacteria (Fig. 3d), factors other than uptake and efflux as the dominant drivers of RIF potency in mycobacteria.
A minor role for pre-existing target modification in RIF resistance.
Considering the importance of rifamycins for the treatment of TB, leprosy, Buruli ulcer, MAC and M. kansasii infections, we focused on RIF resistance mechanisms operating in mycobacteria. Figure 4a highlights the diversity in RIF potency across our library, ranging from an MIC99 of more than 100 µg/mL to less than 0.2 µg/mL. Arranging species by decreasing MIC99 value highlights that there are species better suited for the identification of target-mediated resistance mechanisms (dark orange), and species that are better suited for the identification of non-target-based resistance mechanisms (dark purple). At this stage, we focused our work on four species, all of which are resistant (MIC99 = 12.5 µg/mL) or super-resistant (MIC99 ≥ 100.0 µg/mL) to RIF, compared to M. tuberculosis (MIC99 = 0.9 µg/mL): M. smegmatis and M. flavescens (MIC99 = 12.5 µg/mL), M. houstonense (MIC99 = 25.0 µg/mL), and M. conceptionense (MIC99 ≥ 100.0 µg/mL) (Fig. 4b).
In M. tuberculosis, the fixation of mutations that decrease the affinity of RIF to the RNA polymerase β subunit (RpoB) represents the dominant cause of RIF resistance [7, 30], and therefore target modification is an obvious starting point to explore probable mechanisms of resistance to the rifamycin class of antibiotics in other mycobacteria. Figure 4c shows the Rifampicin Resistance Determining Region (RRDR), the segment of RpoB where most mutations conferring resistance to RIF are found. Except for M. branderi, no amino acid variations are found in our species of interest. This observation suggests that in contrast to M. tuberculosis, most mycobacteria are not resistant to RIF due to variations in the RIF binding region of RpoB.
As [RIF]IB or RpoB target variation cannot account for the observed resistance to RIF, we evaluated the remaining major mechanism of resistance to rifamycins, drug modification. RIF modification is widely found in nature and is carried out by various enzyme types, including phosphotransferases, glycosyltransferases, ADP-ribosyltransferases (ARTs) and monooxygenases [31–35]. Importantly, a RIF-ART has been characterized in M. smegmatis [36]; it is encoded by the gene MSMEG_1221, also known as arr-ms, and it has been showed to be the sole determinant of RIF resistance in M. smegmatis by chemical and genetic methods [37, 38]. We employed proteomics to first check whether Arr-ms is expressed in the absence of RIF and if it is differentially expressed in the presence of RIF. Figure 4d shows that expression of Arr-ms is stimulated (5.6-fold) in the presence of RIF at 6×MIC99, and therefore, proteomics can assist on the identification of RIF modifying enzymes in mycobacteria. Next, we evaluated whether the annotated Arr homologous proteins in M. conceptionense (SAMEA3305051) and in M. flavescens (SAMN05729960) were also induced in the presence of RIF (Fig. 4e), this was indeed the case (4.12- and 2.75-fold change respectively). To confirm that these putative RIF-ARTs are inactivating RIF, we employed LC-MS, to identify ribosyl-RIF (m/z 955.4601), a fragment of the larger ADP-ribosyl-RIF product, which fragments under LC-MS conditions. Figure 4f illustrates that ribosyl-RIF was observed in M. smegmatis, M. conceptionense and M. flavescens treated with RIF. Additionally, other mycobacteria with annotated putative arr genes also displayed high levels of RIF ADP-ribose (Supplementary Fig. 4a and 4b), indicating that RIF modification, and precisely ADP-ribosylation, is the dominant mechanism of resistance to RIF in mycobacteria.
A novel group of rifamycin ADP-ribosyltransferases
In order to have a comprehensive understanding of the distribution of Arrs in mycobacteria we mined for arr sequences in reference genomes and built a phylogenetic tree. Arr proteins were found to be widespread in both fast- and slow-growing mycobacteria, but in a dispersed pattern suggesting that both local vertical inheritance and gene losses and acquisitions have taken place. Mycobacterial Arrs form two monophyletic groups (Fig. 5a; Supplementary Table 2). One of the groups, which we designated Arr-1, corresponds to sequences closely related to Arr-ms (median sequence identity of 80%). Arr-1 group members are predominantly Actinomycetota of the orders Geodermatophilales, Propionibacteriales, Micrococcales and Mycobacteriales. The second group, which we have named Arr-X, is taxonomically more broadly distributed, including members from Actinomycetota, Bacillota, Pseudomonadota and Bacteroidota. Within mycobacteria, more species have an arr-1 gene than arr-X and a few species have both, for example M. conceptionense and M. flavescens (Supplementary Fig. 4a). M. conceptionense Arr-1 (Uniprot A0A0U1D6J3) and Arr-X (Uniprot A0A0U1DL14) share 50% identity and 63% similarity (BLOSUM62). The equivalent of the three residues showed by Baysarowich and collaborators to be necessary for enzymatic activity in Arr-ms (Asp84, His19 and Tyr49) are conserved in all mycobacterial Arr-1 and Arr-Xs, suggesting that they are all active ADP ribosyltransferases [34]. However, the hydrophobic nature of the RIF binding cleft of Arr-ms is not completely preserved in the Arr-X group (Supplementary Table 3) hinting at probable differences in substrate binding preference.
To assess that Arr-X enzymes are indeed RIF-ARTs and to understand why some species have two arr genes, we cloned, overexpressed, purified, and tested the enzymatic activity of Arr-ms (as a control) and Arr-1 and Arr-X from both M. conceptionense and M. flavescens. Figure 5c displays the catalytic activity (Vapp) of the different proteins with six rifamycins. All Arr-1 enzymes had similar activity and substrate preference, but Arr-X enzymes were much superior at inactivating rifamycins. For example, M. flavescens Arr-X is 3.9-fold faster with rifapentine than Arr-ms. Surprisingly, M. conceptionense Arr-X is 29-fold faster to inactivate rifabutin, compared to Arr-ms. These results therefore demonstrate that Arr-Xs are not only bona fide rifamycin inactivating enzymes, but also that they are significantly more efficient than Arr-1s. We also determined the MIC99 for different rifamycins in selected species (Supplementary Table 4). Interestingly, these species are resistant to all rifamycins except for rifabutin. To probe whether Arr-X is active in bacterio, we used CRISPR interference to reduce the transcription of arr-1, arr-X and both genes in M. conceptionense. M. conceptionense continued to be resistant to rifabutin upon arr-1 silencing but became more sensitive when arr-X was knocked down (Fig. 5d). Thus, Arr-X is an active “rifabutinase” that confers rifamycin resistance in M. conceptionense.