Patient characteristics and clinical samples.
In this study, the clinical information of 144 patients with newly diagnosed pulmonary infections was retrospectively analyzed. The mean age of all patients was 67.1 years, ranging from 15 to 95. Among them, 77.0% of patients were male. 61.1% patients had underlying comorbidities, including respiratory failure (32, 22.2%), disorders of consciousness (14, 9.7%), cardiovascular disease (9, 6.3%), malignancies (11, 7.6%), and diabetes (6, 4.2%). The bacteria with ARGs (ARB) positive results were associated with the department (P = 0.001) but gender, age, antibiotic history and comorbidities (Table 1).
Table 1
Basic clinical characteristics of 144 patients with suspected pulmonary infections
Characteristics | N (%) | ARB detected (Alignment) | ARB detected (Assembly) |
Yes | P | Yes | P |
Demographics | | | | | |
Age | | | | | |
Mean age (years) | 66.6 | | | | |
<=50 | 27 (18.75) | 25 | 0.172 | 7 | 0.894 |
> 50 | 117(81.25) | 98 | 44 |
Sex | | | | | |
Male | 108 (77) | 93 | 0.817 | 42 | 0.119 |
Female | 36 (25) | 30 | 9 |
Antibiotic history | | | | | |
Yes | 119(82.63) | 98 | 0.694 | 43 | 0.889 |
No | 25(17.36) | 25 | | 8 | |
Department | | | | | |
ICU | 94 (65.27) | 76 | 0.001*** | 28 | 0.001*** |
PD | 13 (9.03) | 13 | | 10 |
PCCM | 23 (15.97) | 21 | | 3 |
Others | 14 (9.72) | 13 | | 10 |
Comorbidities | | | | | |
Respiratory failure | 32 (22.22) | 10 | 0.633 | 4 | 0.149 |
Disorders of consciousness | 14 (9.72) | 10 | 4 |
Cardiovascular disease | 9 (6.25) | 8 | 5 |
Malignancies | 11 (7.64) | 10 | 7 |
Diabetes | 6 (4.17) | 5 | 3 |
Others | 17 (11.80) | 15 | 6 |
None | 55 (38.19) | 65 | 22 |
ICU: intensive care unit; PD: Pneumology Department; PCCM: Pneumology Critical Care Medicine. *** P < 0.001 |
A total of 151 clinical specimens were collected, which included BALF (n = 134, 88.7%) and sputum (n = 17, 11.3%). The number of effective reads ranged from 103 to 107 counts (Fig. 1A). Primary mNGS analysis were performed for 151 samples, of which 114 samples were regarded as ARB positive by the alignment method, and 52 samples were assessed positive by the assembly method (Table S1). To better identify the ARB, the mNGS assay 52 samples with positive ARGs by both approaches are used for comprehensive evaluation. And then, 14 culture positive samples were performed the consistency ratio analysis combined with the mNGS alignment and assembly analysis. Finally, the predominant ARB-ARGs and the reference application conditions were assessed (Fig. 1B).
Evaluating the performance of mNGS alignment and assembly analysis with conventional method for the identification of ARB
52 out of 151 samples with positive ARB by both mNGS alignment and mNGS assembly methods are used for comprehensive evaluation. A total of 165 ARB was obtained by these two approaches, of which 142 ARB detected by the alignment method (red bar) were much more than the 48 by the assembly method (blue bar), respectively (Fig. 2A). Among them, 25 ARB was obtained in both methods (Fig. 2B, Table S2 ). The alignment method detected much more ARB than the assembly method (142 > 48). However, many congeneric species in the same genus, such as Acinetobacter baumannii, Acinetobacter pittii and Acinetobacter nosocomialis were detected as ARB by alignment, only A. baumannii in Acinetobacter was detected as ARB by the assembly method. This may be caused by the false-positive ARB due to mapping reads directly to large data sets can inflate false-positive predictions, as reads derived from protein-coding sequences may spuriously align to other genes as a result of local sequence homology [11, 16]. In addition, it was found that substantial non-ARB (gray bar) evaluated by the mNGS assembly method were classified as ARB by the alignment method. This may be caused by the over-reliant on the reference antibiotic resistance database by the alignment method, which requires the existence of relationship between bacteria and its corresponding ARG (ARG-bacteria attribution). However, mNGS assembly method in longer sequence (contigs) has the advantages of facilitating more accurate judgment of the issue of ARG-bacteria attribution as described previously [11]. To evaluate whether the above susception were the major cause, we then compared the mNGS analysis and the culture test results.
We assessed the detection efficiency analysis of the mNGS method and the conventional method. Among the 52 patients, 14 of them showed the culture positive results (Table S3). Among the 14 double-positive patients’ samples, mNGS alignment method (87%, 36/42 vs. 14%, 6/42) and the assembly method (81%, 25/31 vs. 19%, 6/31) yields a higher positive detection efficiency for pathogens than compared with the culture method (Fig. 2C, culture, pathogens). The detected ARB of mNGS alignment in 12 patients and 85.7% matched with were the culture and drug susceptibility tests results (DST), and 64.2% (9/14) matching by the assembly method (Fig. 2C, *CD). In total, mNGS yields a higher positive detection efficiency for pathogens and ARB compared with the conventional method.
We further assessed the consistency analysis according to the number of times of detected pathogens to distinguish between true- and false-positive ARB. Comparing with the DST results, mNGS assembly method indicated consistency results with a matching accuracy of 46% (12/26), which was much higher than the mNGS alignment method with 13% (19/141) (Fig. 2D, 2E). Those results revealed that comparing to conventional detection methods, the false-positive detection rate of ARB was significantly higher using mNGS alignment method. The assembly method could assist the determining of the detected pathogens by the alignment method as true ARB and improve the predictive capabilities. Intrigued by our finding, we further performed ARGs evaluation through a combination of mNGS alignment and assembly methods to explore relationship between the bacteria and its corresponding ARG (ARG-bacteria attribution).
Analysis of the ARG-ARB network based on the alignment and assembly methods.
To gain an insight into the potential interplay of ARG-bacteria attribution, we performed an ARG-ARB network analysis by the mNGS alignment and assembly methods. A total of 361 ARGs were detected, of which 141 were detected by the assembly method and 352 were detected by the alignment method from 52 clinical samples. According to the antibiotic categories, these ARGs mainly comprised multidrug resistance (n = 91), macrolide-lincosamide-streptogramin B (MLSB) (n = 11), tetracycline (n = 14), aminoglycoside (n = 36), β-lactam (n = 145), aminocoumarin (n = 3), phenicol (n = 7), peptide (n = 9), fluoroquinolone (n = 11), disinfecting agents and antiseptics (n = 10) (Table S4), which mostly belonged to the multidrug class and β-lactam antibiotic classes.
Specifically, we found substantial associations between a ARG and multiple bacteria, especially complex network results displayed by the alignment method (Fig. 3, alignment). Nevertheless, the assembly method results showed a ARG was attributed a particular bacterium (Fig. 3, assembly). For example, among the 101ARGs existing in both two approaches, correlations between the ADE family (adea, adeb, adec, adef, adeg, adeh, adel, aden, ader, ades) and multiple bacteria (A. baumannii, K. pneumonia, P. aeruginosa and S. maltophilia), were attributed by the alignment method. However, they were only attributed to the A. baumannii by the assembly method. Similarly, the other genes including MEX and OPM were only attributed to the P. aeruginosa, were aligned to multiple bacteria (Table S5). In addition, there were 34 ARGs detected only by assembly method, including C. striatum (carA), Neisseria sicca (farB, mtrC, mtrD), Streptococcus mitis (patB, pmrA, RlmA (II)) (Table S5, blue font), which may play a role in improving the reference antibiotic resistance database. The results revealed that the interactions potentially could contribute to determining the ARG-bacteria attribution and optimizing the antibiotic resistance database, via the introduction of mNGS assembly method.
In line with our hypothesis, improved the determining of ARG-bacteria attribution was obtained by combining mNGS alignment method and assembly method, suggesting an additive detection value for the combination of those two approaches. Based on the common detected numbers of ARGs, we explore the most prevalent ARGs and drug class in its corresponding ARB. As follows: A. baumannii (ADE, multidrug), P. aeruginosa (Mex, multidrug), K. pneumonia (MDT, aminocoumarin; EMR, fluoroquinolone), S. maltophilia (SME, multidrug) and C. striatum (carA, MLSB) (Table S6, red/blue font). The ARB had high drug resistance to antibiotics and multidrug resistance. Multidrug resistance of ARB to antibiotics was also more severe in BALF and sputum samples (Fig. 3, bottom right plot).
Identification of the predominant ARB in pulmonary infection patient samples
Since combination with two approaches were more effective and accurate for ARB detection, we next aimed to identify the predominant ARB of the pulmonary infection patient samples. In this study, 48 predominant ARB were identified by both methods and only by the assembly method. The ARB and its corresponding drug resistance are also shown in Fig. 4. To identify a predominant ARB in the pulmonary infection patient samples, we choice features according to their ranking (detection frequency based the number of samples). First, pathogens detected high frequency by both method, that is, A. baumannii (75.0%, 48.0%), P. aeruginosa (65.4%, 11.5%), S. maltophilia (71.2%, 11.5%), K. pneumoniae (63.5%, 11.5%) were among the top four most predominant ARB.
Additionally, pathogens detected high frequency (> 50%) by the alignment method, including Pseudomonas fluorescens (59.6%), Staphylococcus aureus (57.7%), Enterococcus faecium (51.9%), and Staphylococcus epidermidis (51.9%) (Fig. 4, blue font) also contributed to the predictability of the ARB. Meanwhile, pathogens detected high frequency by the assembly method, such as C. striatum (38.5%), Streptococcus oralis (9.6%), N. sicca (9.6%), S. mitis (9.6%), were also identified as predominant ARB (Fig. 4, red font). In addition, we also identified the main antibiotic drug class according to CARD. Multidrug resistances were dominant among most common bacteria. In A. baumannii, P. aeruginosa and S. maltophilia, most ARGs were resistance-nodulation-division (RND)-type efflux pump genes, which conferred resistance to multiple drugs. Antimicrobial agents should be used rationally to decrease multiple antibiotic resistant strains. Altogether, this comprehensive analysis validated the popular ARB and its drug class in pulmonary infection patient samples from this study across a combination of the mNGS alignment and assembly methods.
Conditions of mNGS alignment and assembly methods in application for the clinical antibiotic resistant detection.
We compared the ARG positive rates of 151 clinical samples based on the alignment and assembly methods. In this study, the ARG positive rates referred to the ARGs detected in the sample alignment method and assembly method. The ARG positive rates were much higher by the alignment method than by the assembly method (75.5%, 114/151 vs. 34.4%, 52/151) (Fig. 5A).
We also analyzed the ARG positive rates among the samples with varied amounts of effective reads and different filtering conditions based on the two approaches. First, in the BALF and sputum samples, when the effective reads were approximately 107 counts, the positive rate was 100% both the alignment and assembly methods. The ARG positive rate was higher with the alignment than in the assembly method, with approximately 106 counts (BALF: 95.5% > 49.3%; sputum: 100% > 50%) and approximately 105 counts (BALF: 71.4% > 7.1%; sputum: 66.7% > 33.3%), respectively (Fig. 5B). Herein, setting a cutoff over 107 effective reads may have been more suitable for both methods to detect ARGs in BALF and sputum samples. Generally, the ARG positive rates relied more on sufficient effective reads (over 107 counts) in these sample types.
In terms of filtering conditions, a gradual decrease in ARG positive rates by the alignment method was observed when the cutoff values of unique reads increased as the key filtering conditions. When unique reads = 1, the ARG positive rates of BALF and sputum samples were 77.6% and 58.8%, respectively. When unique reads = 8, it fells to 57.1% and 50.4%, respectively (Fig. 5C). Regarding effective reads, when unique reads = 1, the positive rates were 100%, 90%, 70% and 0% for the effective reads of 107, 106, 105 and 104 counts, respectively (Fig. 5D). Henceforth, unique reads = 1 could be introduced as the minimum filtering condition for ARG detection in the case of insufficient effective reads (≤ 107).
Moreover, we calculated the CPU time occupied based on assembly and alignment. As shown in Fig. 5E, when the effective reads were approximately 107 counts, the average times for alignment, assembly, and the combination of these two approaches were approximately 152.5 s, 567.8 s and 600 s, respectively. The data running time of the two approaches could still meet the clinical report requirement (48–72 h). Finally, we proposed a combined condition of unique reads ≥ 1 and effective reads ≥ 107, which in most cases met the requirement of ARG detection for BALF and sputum samples.