Patient characteristics
68 strains were included. Five (7.4%) strains were obtained from patients diagnosed in 2011, two (2.9%) in 2012, 18 (26.5%) in 2013, 23 (33.8%) in 2014 and 20 (29.4%) in 2015. Likewise, 38 (55.9%) strains were initially obtained from men and 30 (44.1%) from women. Their age ranged from 15 to 78 years with a mean of 36 years (interquartile range 25.3–44.5). 51 (75.0%) cases had received a previous antiTB treatment (Table 1). 58 (85.3%) strains belonged to patients from Lima and Callao (51 from Lima and 07 from Callao). Regarding the strains of Lima, 30 (59%) came from the east zone (Lima este), 16 (31%) from the center and north (Lima ciudad), 3 (6%) from the south (Lima sur), and 2 (4%) from the other provinces (Lima provincias). In addition, Piura, La Libertad, Loreto, Ancash, Ucayali, Arequipa and Madre de Dios regions were represented by one strain each, whereas Ica region with two strains (Supplementary Table S2).
Table 1
Characteristics of patients with XDR-TB.
Characteristics | n [%] |
Sex | |
Male | 38 [56] |
Female | 30 [44] |
Treatment History | |
Previously treated | 51 [75] |
Newly diagnosed | 15 [22] |
Unknown | 2 [3] |
Age | |
≤ 15 year. | 1 [1] |
16–30 year. | 24 [35] |
31–45 year. | 25 [37] |
> 45 year. | 16 [24] |
Unknown | 2 [3] |
Mean (IQR) | 36.3 (25–45) |
Second Line Inyectables* | |
Kanamycin resistant | 11 [16] |
Capreomycin resistant | 14 [21] |
Kanamycin & Capreomycin resistant | 43 [63] |
*All strains (n = 68) were additionally resistant to rifampicin, isoniazid and levofloxacin drugs.
IQR
interquartile range.
Sequencing and genome assembly
An average of 935,183 raw sequencing reads per fastq file was obtained. Two fastq files (forward and reverse) were generated for every sample. The minimum, maximum and average depth of sequencing obtained were 53x, 153x and 88x, respectively (Table S2). All strains had reads covering more than 99% of the H37Rv genome.
Antimicrobial resistance
All strains showed simultaneous phenotypic resistance to rifampicin, isoniazid and levofloxacin. However, they showed differences in second line injectable drugs resistance, as described in Table 1. Discordant results between phenotypic and genotypic methods were found for isoniazid (strain ‘XDR_28’) and rifampicin (strain ‘XDR_19’) showing only resistance outputs through phenotypic method (Table S1) and analysed elsewhere 19. Concerning rifampicin resistance, all strains had resistant mutations located at rpoB gene. From this, 67 (98.5%) were considered high-confident mutations located inside the rifampicin-resistance-determining region (RRDR) of the rpoB gene, while only one strain (1.5%) presented a mutation outside this region. The most frequent mutations were S450L, D435V and H445R. Only one strain had the double mutation H445N + S431R. The isoniazid-resistant phenotypes showed mutations in katG and inhA genes. There was one mutation in katG (S315T), whereas three differente mutations located in the promoter region (g-17t, c-15t and t-8c) and one mutation located in the codificant region (S94A) of inhA were found. There were four strains with double mutation, S315T + c-15t, and one with S315T + t-8c.
Levofloxacine resistance was predominately caused by mutations occurring in the quinolone-resistance-determining region (QRDR) of gyrA gene and contained nine different mutations, whereas gyrB showed showed only three different mutations. In the gyrA gene, the codon 94 showed the greatest variability (five different mutations). Only two strains presented mutations in both genes. Finally, one strain (1.5%) strain showed no mutations in any of these genes.
Resistance to kanamycin and capreomycin was driven by mutations occurring at rrs (a1401g, c1402t and g1484t), tlyA (all frame shifts) and and eis (c-14t) genes. However, no mutations were detected in two strains for the screened genes. Strains with exclusive resistance to kanamicine only showed the a1401g mutation of rrs gene, whereas two strains had no mutation in any of the three genes analyzed. Exclusive resistance to capreomycin was caused by several frameshifts’ mutations occurring in tlyA gene. However, there were three strains with no detected mutations in any of the analyzed genes. In general, 96, 85 and 90% (average 90%) of strains had high-confident mutations for resistance to rifampicin, isoniazid and both second line drugs, respectively (Table 2).
Several synonymous and nonsynonymous mutations located in additional resistant-associated genes were evidenced to be present together with the mutations described above (Supplementary Table S3).
Table 2
Resistance-associated mutations in peruvian XDR MTB strains.
Drug | Gene | Mutation | Confidence grading ** | n (%) |
Rifampicin | rpoB | Q432P | High | 1 (1.5) |
D435V | High | 22 (32.4) |
H445N, S431R | Minimal | 1 (1.5) |
H445R | High | 4 (5.9) |
H445S | No data | 1 (1.5) |
S450L | High | 38 (55.9) |
I491F | Minimal | 1 (1.5) |
| Total | | 68 (100) |
Isoniazid | katG | S315T* | High | 53 (77.9) |
inhA | c-15t | Moderate | 7 (10.3) |
g-17t | No data | 2 (2.9) |
S94A | No data | 1 (1.5) |
katG + inhA | S315T* + c-15t | High | 4 (5.9) |
S315T* + t-8c | High | 1 (1.5) |
| Total | | 68 (100) |
Levofloxacin | gyrA | G88C | High | 2 (2.9) |
A90V | High | 13 (19.1) |
S91P | High | 2 (2.9) |
D94A | High | 2 (2.9) |
D94G | High | 32 (47.1) |
D94H | No data | 3 (4.4) |
D94N | High | 5 (7.4) |
D94Y | High | 1 (1.5) |
A90V, D94H | High | 1 (1.5) |
D94N, D94G | High | 1 (1.5) |
D94N, D94A | High | 1 (1.5) |
gyrB | S447F | No data | 2 (2.9) |
gyrA + gyrB | G88A + T500P | No data | 1 (1.5) |
D94A + E501D | High | 1 (1.5) |
Mutation not detected | | 1 (1.5) |
| Total | | 68 (100) |
Kanamycin & Capreomycin | rrs | a1401g | High | 34 (50.0) |
c1402t | High | 1 (1.5) |
g1484t | High | 1 (1.5) |
tlyA | S92fs | Generally high | 2 (2.9) |
A111fs | Generally high | 1 (1.5) |
H68fs, S92fs, Q202fs | Generally high | 1 (1.5) |
rrs, eis, tlyA | A1401G, c-14t, R60fs | High | 1 (1.5) |
Mutation not detected | | 2 (2.9) |
Kanamycin | rrs | a1401g | High | 9 (13.2) |
Mutation not detected | | 2 (2.9) |
Capreomycin | tlyA | V198fs | Generally high | 4 (5.9) |
G232D | Generally high | 2 (2.9) |
L209dup | Generally high | 1 (1.5) |
C86fs | Generally high | 1 (1.5) |
S252fs | Generally high | 1 (1.5) |
R133fs, L209dup | Generally high | 1 (1.5) |
L139fs, V198fs | Generally high | 1 (1.5) |
Mutation not detected | | 3 (4.4) |
| Total | | 68 (100) |
Codon numbering systems are according H37Rv genome (NC_000962.3). fs: frame shift, dup: duplication.
* AGC ➡ ACC
** According to WHO-NGS Technical guide 20
Lineages and evolutionary relationships
Lineages analysis was performed in the 68 strains determining that 62 (91.2%) XDR strains belong to the Euro-American Lineage (Lineage 4) and 6 (8.8%) to the East-Asian Lineage (Lineage 2). 59 strains from Lineage 4 were able of sublineages classification, whereas all Lineage 2 strains were only represented by one sublineage. Moreover, in silico Spoligotypes analysis showed that within Lineage 4 there was a statistically higher frequency (p = 0.23) of the H, LAM and T families followed by family X. Likewise, the existence of only five (7.4%) strains with orphan spoligotypes and of three (4.4%) strains of unknown spoligotypes was evidenced (Fig. 1, Supplementary Fig. S1).
In general, the clustering of strains according to the patterns of spoligotypes agreed with the lineages and sublineages obtained; however, the strains labelled as belonging to the family T1 were assigned to different sublineages of Lineage 4. The northern strains of the country belonged to sublineages 4.1.1 (Piura, n = 1), 4.1.1.3 (La Libertad, n = 1) and 4.3.3 (Loreto, n = 1). In the center, the city of Lima contained 7 sublineages (except 4.1.1.3); likewise, the strains from the constitutional province of Callao were aligned in the sub-clustering groupings 4.3.3 (n = 3), 4.3.4.2 (n = 2), 4.1.2.1 (n = 1) and 2.2.1 (n = 1), evidencing a great diversity of strains that circulate in these areas. In the rest of the cities, strains of the subfamilies 4.3.3 (Ica and Ucayali, n = 2), 4.1.1 (Ancash, n = 1) and 4.3.4.2 (Huancavelica, n = 1) were found. Finally, in the South of the country, strains were found to belong to the sublineages 4.1.1.3 (Arequipa, n = 1) and 4.1.2.1 (Madre de Dios, n = 1) (Fig. 1).
The maximum likelihood phylogenomic tree confirmed the lineage classification and showed adittional subclassification for strains belonging to the same sublineage. Interestingly, there was no association between phylogenetic clades and specific resistant mutations conferring resistance. Finally, spoligotypes with SITs ‘Orphans’ and ‘Unknown’ could be characterized by evolutionary and lineage analysis. All strains with ‘Unknown’ SITs aligned within the sublineage 4.3.3 group, showing an evolutionary similarity with members of the H3 family, while the strains with ‘Orphans’ SITs were located in the groups belonging to sublineages 4.3.3 (n = 4), 4.1 (n = 1) and 4.1.2.1 (n = 1). A close evolutionary relationship was found between strains from Arequipa (‘XDR_10’) and La Libertad (‘XDR_05’), being the only representatives of the sublineage 4.1.1.3 in the entire country and exhibiting a greater degree of genomic differentiation with the other strains (Fig. 2). Global evolutionary relatedness showed that most of Peruvian XDR strains were grouped in defined clusters and had a close relatedness with XDR strains of Lineages 4 and 2 of european countries (Supplementary Fig. S2).
Transmission cluster determination
The analysis of pairwise genetic differences showed a high number of strains that differed by large amounts of SNPs. This genetic distance between strains varied from 5 to 1272 nucleotide differences with an interquartile range of 373 and a median of 772 (Supplementary Fig. S3). The WGS analysis determined that most strains were not related at genetic level. Only 23 (34%) strains were grouped into nine transmission clusters, each one comprising between two and five strains (Fig. 3, Supplementary Fig. S4). These clusters were mostly (91.3%) from Lineage 4. The average genetic distance between the clusters belonging to this lineage was 304.3 nucleotides; while the cluster belonging to Lineage 2 had a considerable minimum distance of 1198 with the nearest cluster of Lineage 4. Likewise, it was evidenced that clusters integrated by strains belonging to the same sublineage showed a lower genetic distance between them compared with the clusters integrated with different sublineages (Fig. 3). All clustered strains came from Lima (n = 21) and Callao (n = 2). Regarding the strains of Lima, 17 belonged to the east zone, 1 to the center (in the area bordering the east zone), 2 to the south zone and 1 to the outskirts (Supplementary Table S4). Regarding the variability of sources of infection, clusters 2 and 6 were composed of strains belonging to a single infection district; while the rest of clusters were integrated by at least two districts. The cluster with more members (cluster 5) was integrated by five strains from the bordering districts of San Juan de Lurigancho and El Agustino. In the same way, cluster 4 was formed by four strains from the geographically close districts of San Juan de Lurigancho, Santa Anita and La Victoria. The remaining clusters were composed of two members, these coming mainly from geographically separated districts (Fig. 3, Supplementary Table S4).