• Population diversity reveals the presence of multiple clones
Our collection comprised 194 strains isolated over a two-year period in hospitals. Our analysis reveals a diverse population, as reflected by the number of STs (n = 43), spa (n = 23), agr (n = 23), and SCCmec (n = 24) types. The population was found to be non-typable for ST, spa, and agr in 9%, 2%, and 1% of the strains, respectively. The concordance level between ST and agr and spa typing was high, in contrast to the SCCmec typing of the STs, which showed variation in the locus belonging to the same STs (Fig. 1). We identified six dominant clones represented by more than ten strains. These clones correspond to the six STs in the population: BAPS1/ST5 (n = 33), BAPS2/ST672 (n = 36), BAPS3/ST97 (n = 14), BAPS4/ST6 (n = 15), BAPS5/ST88 (n = 19), and BAPS8/ST8 (n = 27) (Fig. 1). Except for ST672, other clones were recently reported as CA-MRSA strains in hospital settings in Riyadh with microarrays. [12, 13]. In line with this, a community mode of infection acquisition was confirmed for 25% up to 75% across all dominant clones (Figure S1A). Typing based on the agr gene, a key regulatory element in the virulence mechanism, revealed the presence of three types of gp genes: gpI (in BAPS2/ST672, BAPS3/ST97, BAPS4/ST6, and BAPS8/ST8 clones), gpII (in BAPS1/ST5 clone), and gpIII (in BAPS5/ST88) types (one isolate contained gpV type), pointing to the diversity of virulence mechanisms for the major clones. Moreover, we conducted typing based on the variation of the spa gene, which encodes the Staphylococcus protein A triggering B cell proliferation. [33]. The spa gene is commonly used for epidemiological surveillance of S. aureus because of its high variation, offering a higher discrimination typing compared to MLST and agr methods [34]. The analysis identified a total of 63 different known spa types in 188 genomes (Fig. 1, Supplementary Table S1). Six types (t690 (n = 12), t304 (n = 14), t008 (n = 18), t311 (n = 18), t3841 (n = 31)) were the most prevalent in the population and together were detected in 51.5% (100/194) of the genomes (Fig. 1). Among these types, t311, t3841, t304, t690, and t008 were the most prevalent in BAPS1/ST5 (n = 18/33), BAPS2/ST672 (n = 29/36), BAPS4/ST6 (n = 14/15), BAPS5/ST88 (n = 12/19), and BAPS8/ST8 (n = 18/19) clones, respectively. The spa types t008, t304, and t311 were reported predominantly from Europe, America, and to a lesser extent in Africa. [34]. However, t690, t3841 were not reported as prevalent types before and appear to be exclusive to the current collection. The BAPS3/ST97 group consisted of six spa types, including t521 (n = 2/14), t359 (n = 3/14), t2770 (n = 2/14), t267 (n = 4/14), t2297 (n = 2/14) and t1544 (n = 1/14), showing the high level of genomic dynamics of the clone. The contextualization of the genomes with the global genomic collection of S. aureus identified a low level of mixing with the global genomic collection, with only two clusters (SNP clusters) with external genomes. This included the grouping of ST8 strains with a clinical sample from Denmark (SNP cluster PDS000144949.1) and two strains of ST1535, which belonged to a cluster (PDS000069007.8) comprising clinical strains from Denmark and the Netherlands, as well as two environmental samples from meat retailers from unpublished genomes in Saudi Arabia, indicating a link with non-clinical settings. These findings suggest the occurrence of community-associated clones, some of which were specific to the study setting and had links with the environment.
• SCCmec typing indicated the presence of highly dynamics locus
SCCmec typing indicated the presence of a highly dynamic locus. Our analysis showed mec genes in all strains, although not all genes were found in the context of known SCCmec elements. Among the strains, 67 exhibited a complete match to one of the known SCCmec elements (Fig. 1, Figure S2). For the rest, a partial match to an element with a sequence (Hamming) distance of > 0 and < 5 was observed (Figure S2). All strains were found to contain the mecA gene, although 25 harbored divergent copies of the gene not situated in an SCCmec element. For the copies of mecA in an SCCmec element, a strong association with resistance to beta-lactams of oxacillin and cephalosporins was identified (GWAS p-value < 0.05 after accounting for lineage effect). HA-MRSA is recognized to carry SCCmec types I, II, or III, while CA-MRSA carries smaller SCCmec types IV, V, or VI. [35]. In our collection, except for one ST8 sample that carried an SCCmec type III locus, the other strains harbored SCCmec loci associated with CA-MRSA strains (Figure S2). The loss of SCCmec occurred in 22 lineages across the phylogenetic tree, including two times within the BAPS1/ST5 clones and three times in BAPS2/ST672, BAPS5/ST88, and BAPS8/ST88 groups (Figure S2). Besides the loss of SCCmec elements, switching between SCCmec types was detected within identified clones. Specifically, we observed a switching in SCCmec types within the BAPS2/ST672 clone, shifting from type V/VII to type IVd. Similarly, the BAPS1/ST5 clone exhibited a switch between types V/VII and type VI (observed in two genomes). The BAPS8/ST88 clone included switches between SCCmec types, changing from type VI to type IIIa, and types V/VII were observed in four genomes. The observation demonstrates significant dynamics of SCCmec on epidemiological time scales. This finding aligns with the reported high mobility of the CA-MRSA-associated SCCmec types of IV and V, especially the type IV element, which is recognized to be transferred to methicillin-susceptible backgrounds frequently. [36, 37].
• Presence of hallmark genes for CA-MRSA virulence across independent lineages
In addition to the SCCmec types, the clones in the population included multiple virulence genes associated with CA-MRSA strains (Fig. 1). Most notably, multiple strains contained PVL genes, lukF-PV and lukS-PV, encoding a two-component S. aureus pore-forming protein well-characterized as causative of community-acquired invasive diseases. [38]. The PVL genes were found in 31 genomes, which included multiple lineages: 60% (12/20) of strains in BAPS8/ST8 clones harbored the PVL locus, while these genes occurred less frequently in other major clones, i.e., one genome each in BAPS3/ST97 and BAPS5/ST88. In other minor clones, the locus was detected in the clones of ST80 (n = 4), ST22 (n = 3), and ST30 (n = 3), all of which were described as CA-MRSA strains (Fig. 1, Supplementary Table S1). Another virulence factor for CA-MRSA strains is the Toxic Shock Syndrome Toxin (tsst) gene, encoding a toxin associated with toxic shock syndrome. [39]. The gene was observed in 15 genomes in our collection, belonging to ST22 (n = 8), ST361 (n = 3), and one genome from each of ST8, ST30, and ST672. For the three strains characterized by BAPS9/ST22/t223, BAPS9/ST22/t223, and ST30/t233, the gene co-occurred with PVL genes within the same genomes. This co-occurrence renders these strains clinically significant, exhibiting a dual virulence phenotype. The results show the independent occurrence of some known hallmark genes for CA-MRSA in the collection.
• Distinctive pattern of virulence gene across the clones
As the virulence mechanisms of CA-MRSA are not fully characterized, we screened the collection for the presence of other virulence genes, specifically focusing on 29 genes that were variably present in the major clones and the background population (Fig. 2, Supplementary Table S1). For a set of secretion protein genes, including essC and esxBC, toxin genes like esxD, eap/map, and adhesion genes such as set22 and esaD, the major clones more frequently contained the virulence genes compared to the background population, indicating the potential higher virulence of these clones (p-value from ANOVA test < 0.05) (Fig. 2A). Moreover, the screening revealed that some clones contained more genes encoding superantigens (SAg) [39, 40], which are potent immunostimulatory exotoxins causing T cell proliferation and cytokine release, than the rest of the population (p-value from ANOVA test < 0.05). The gene list included selq (55%) and selk (55%) in BAPS8/ST8, set21 in all BAPS5/ST88, set16 in all BAPS2/ST672 (Fig. 2A). Some BAPS groups contained virulence factors that were significantly more frequent than other clones, i.e., adhesion clf gene in BAPS3/ST97 (100% of strains), adhesion cna (100% of strains), and enterotoxin sea gene (93% of strains) in BAPS4/ST6, hyaluronidase hysA, and human-specific immune evasion cluster chp gene (84% of strains) in BAPS5/ST88, and adhesion sdrD (82%) in BAPS8/ST8 (Fig. 2A). Although the specific role of these genes in virulence requires further functional analysis, the variation of the virulence gene profiles across major clones points to the complex and diverse virulence mechanisms in CA-MRSA.
• Population has high resistance level somewhat reflecting the prescriptions
The population exhibits a high resistance level against beta-lactams such as penicillin and second-generation cephalosporins, as well as fusidic acid, with resistance levels of 98%, 90%, 90%, and 71% for benzylpenicillin, oxacillin, cefoxitin, and fusidic acid, respectively (Fig. 1, Fig. 2B). The resistance levels for fluoroquinolones were moderate (moxifloxacin 31% and levofloxacin 46%). For aminoglycosides (tobramycin and gentamicin), macrolides (erythromycin and clindamycin), tetracycline, and trimethoprim, low resistance levels of 11%, 13%, 23%, 9%, 15%, and 20%, respectively, were observed. No resistance to vancomycin, linezolid, or tigecycline was detectable in the collection. Moderate to strong cross-resistance was prevalent for fluoroquinolones, fusidic acid, and beta-lactams (see Fig. 1), while for macrolides, tetracycline, and trimethoprim, there was a weaker correlation of resistance with other groups. This pattern somewhat reflects the prescription pattern in the window of six months before sampling: the most prescribed antimicrobial classes in the six months prior to admission and at the time of diagnosis were vancomycin, beta-lactams (piperacillin/tazobactam and carbapenems) (43% of prescriptions), fluoroquinolones (46% of prescriptions), and fusidic acids (37% of prescriptions), while tetracyclines and aminoglycosides were less commonly prescribed. Vancomycin was still the most used antimicrobial at the time of diagnosis and following antimicrobial susceptibility, although no resistance had yet arisen (Supplemental Table S1). For other antimicrobials, resistance levels varied across the clones: BAPS3/ST97 strains exhibited higher resistance to aminoglycosides (gentamicin and tobramycin), while for fluoroquinolones, the ST672 and ST6 clones were found to be more resistant compared to the other major clones (Fig. 2B) (p-value from proportion test < 0.05). Some level of clindamycin resistance was observed for all major clones except BAPS4/ST6 and BAPS3/ST97 (Fig. 2B). For tetracycline, BAPS5/ST88 and BAPS1/ST5 were found to be relatively more resistant.
• Resistance genes profiles differ across clones with ST8 and ST5 harboring more resistance genes
The collection exhibited a diverse array of antimicrobial resistance genes for two databases of known resistance genes, with varying average counts observed across major BAPS clones: BAPS8/ST8 (ResFinder: 6.75, CARD: 6.1), BAPS4/ST6 (ResFinder: 5.13, CARD: 4.0), BAPS2/ST672 (ResFinder: 5.41, CARD: 4.61), BAPS5/ST88 (ResFinder: 5.84, CARD: 4.61), BAPS1/ST5 (ResFinder: 6.67, CARD: 3.84), and BAPS3/ST97 (ResFinder: 5.79, CARD: 5.48) (Fig. 2C). The BAPS8/ST8 group strain had higher resistance genes except BAPS1/ST5 (p-value from one-sided Wilcoxon signed-rank test < 0.05). On the next level, BAPS3/ST97 and BAPS5/ST88 were more resistant than BAPS4/ST6 and BAPS2/ST672, although they had comparable resistant gene counts as the other minor STs altogether (Fig. 2C). Both ST8 and ST5 lineages belong to the significant clonal complexes of S. aureus (CC5 and CC8), associated with rapid dissemination of methicillin and multidrug resistance in both hospital and community settings [3, 41]. The distribution of resistance genes revealed a consistent presence of tetracycline resistance gene blaZ across all clones, highlighting a higher count of macrolide resistance genes msrA and mphC in BAPS8/ST8 and BAPS1/ST5 (Fig. 1, Fig. 2C). Further exploration into genes and mutations strongly associated with resistance demonstrated variable presence across major clones. The mecA gene as mentioned above strongly linked with beta-lactam resistance and was found lost in some lineages. The bifunctional enzyme aacA-aphD gene (tetracycline efflux pump gene tetK), significantly linked with aminoglycoside resistance genes, emerged frequently in the BAPS3/ST97 (BAPS5/ST88) clone (GWAS p-value < 0.05 after accounting for lineage effect). Macrolide resistance genes msrA and ermC underlie higher resistance in BAPS8/ST8 and BAPS1/ST5, respectively, and emerged across multiple lineages (GWAS p-value < 0.05) (Fig. 1). The fusc gene, linked with fusidic acid resistance, was prevalent in all major clones except BAPS4/ST6. Among the resistance mutations for fluoroquinolones, the S84L mutation in gyrA was found significantly linked with resistance phenotypes for moxifloxacin and levofloxacin (GWAS p-value < 0.05). The gene diversity highlights variety of emerging resistance trends, occurring both among different clones and more recently within individual clones.
• Similar plasmids carrying antimicrobial resistance genes are shared between lineages
To contextualize the resistance genes, we conducted long-read sequencing for strains associated with bloodstream infections. We retrieved full genome sequence of plasmids from on isolate (ID00734) (plasmid 1) belonging to BAPS5/ST88 from bloodstream infection and one from (ID00777) (plasmid 2) BAPS8/ST8 (Fig. 3A). Both strains contained a mosaic plasmid of size 28K. The search for similar plasmid backbones in the database did not pinpoint any previously reported backbone. The highest coverage of 69% was found with a plasmid with accession CP094779.1 recovered from the non-human sources [42]. Plasmid ID00734 is predominantly limited to the BAPS5/ST88 clone and occurred in 6 strains and one isolate and one strain from untyped ST (Fig. 3A). The plasmid is a mosaic plasmid with a replicon reported in previous clinical isolates consisting of a blaZ and blaI genes surrounded by mobile genetic elements. The same plasmid also accommodates the macrolide resistance gene of msr(A) (Fig. 3B). Three aminoglycoside resistance genes of aph(A), sat(A) and aad(K) were linked with an IS6 element. The plasmid also contained a cadC gene for cadmium resistance. The size and lack of conjugative transfer genes suggest that the plasmid belongs to low-copy non-conjugative theta replicating plasmids [43]. The plasmid from BAPS8/ ST8 isolate somewhat resembled the plasmid from BAPS5/ST88 (76% sequence identity) in that it has similar size and the same bla cassette. However, the plasmid lacks msr(A) and a tet(K) gene linked with IS6 plasmid has replaced the aminoglycoside carrying plasmid. Moreover, the plasmid has a broader phylogenetic distribution and occurred in strains from ST67, ST1482 and ST1 strains in which more than 95% of the plasmid contigs were covered by the reads of these strains (Fig. 3A). The sharing of the plasmids and variation in the arrangement of antimicrobial resistance genes suggest a high rate of horizontal gene transfer occurring at plasmid and gene levels.
• Concurrent expansion of clones over the past few decades, with steady population increase
The comparison of dynamics among the clones within the hospitals also indicates marked differences between them. The BAPS1/ST5, BAPS5/ST88, and BAPS8/ST8, belonging to globally known circulating clones, emerged 30 to 50 years ago, while the other three BAPS groups emerged more recently (Fig. 4). The higher clock rate for BAPS3/ST97 and BAPS4/ST6 points to recent expansions of these clones, indicating that the emerging mutations have not yet been purged by selection pressure (Fig. 4). The population dynamics of the clones in the hospital indicate an increase in the effective population size for BAPS/ST15, BAPS2/ST672, BAPS3/ST97, and BAPS4/ST6 over the last decade, despite a slight recent decrease (Fig. 5). For the more long-standing BAPS5/ST88 and BAPS8/ST8 clones, the populations remained constant after an initial increase (Fig. 5). Further examination of the Bayesian phylodynamic trees reveals signatures of multiple independent introductions of the clones from the community into the hospital wards (Fig. 6). For the strains with no evidence of community acquisition routes for the infection, transmission appeared to have occurred uniformly across the hospital wards. Specifically, for BAPS2/ST672, five incidents of potential recent transmission (i.e., divergence in the past one year) involving patient pairs ID00707/D00696, ID00647/D00642, ID00792/D00791, ID00624/D00765 were observed. For BAPS5/ST88, two incidents (ID00832/D00828, ID00646/D00718), for BAPS8/ST8, one incident (ID00819/D00783), for BAPS1/ST5, one incident (ID00726/D00727), and for BAPS4/ST6 (ID00774/D00721) were observed. Except for two incidents, the rest of the transmission involved the same body sites, and only two occurred between patients with the same admission ward (Fig. 6). On one occasion, i.e., BAPS4/ST6 (ID00774/D00721), the sampling times for the patients involved in transmission were five months apart, showing the persistence of the clone in the hospital environment. Furthermore, for two patients in BAPS1/ST5 and BAPS8/ST8, distinct strains were found to colonize the same patients, suggesting repeated mixed infections [44]. Within-patient diversity indicated seven incidents in the BAPS groups (the shaded boxes in Figure), three of which in BAPS2/ST672, BAPS3/ST97, and BAPS4/ST6 happened between strains in the blood and other sites, pointing to the invasiveness of the strains. Altogether, these findings indicate the multiple introductions of strains into the hospital and the circulation of the strains across wards and the same patient's sides happening on epidemiological time scales.
• Diverse clinical manifestation amongst the major clones
We finally examined the clinical features and manifestations of the strains to dissect the differences in virulence among the major clones. All major clones were extracted from bodily fluids, including nasal secretions, sputum, and wound exudate (fluid discharged from a wound). Except for BAPS8/ST8, other STs contained strains from bloodstream infections. Bloodstream infection did not occur in the rest of the population (Figure S1B). Except for BAPS8/ST8, other clones included bloodstream invasive strains. While clones exhibited a uniform distribution across ages, the hospital duration for the colonized patients showed slight variations among the major clones (Figure S1C). Patients with BAPS3/ST97 and BAPS/ST88 showed a significantly longer hospitalization period (p-value from Wilcoxon signed-rank test < 0.05) (Figure S1D). The integration of in-hospital morality data and the colonized ST confirmed a hazard ratio greater than one corresponding to the increased hazard of death for all major clones compared with the other clones in the population. However, only for BAPS3/ST97 was a significantly higher death odds-ratio than one confirmed (Figure S3A) (95% CI greater than one). After accounting for the impact of demographic factors and other comorbidities, only the colonization by BAPS3/ST97 strains was found significantly linked with in-hospital mortality (Figure S4B). We further linked the colonization of STs with the importance of the terms extracted from the diagnostic notes for the patients. Figure S4 shows the top ten important terms for the corpus for each ST, which are highly variable across corpora for different STs. The terms include comorbidities, e.g., cancers related terms of "carcinoma," "glioblastoma," and "neuroblastoma" for BAPS2/ST672, BAPS3/ST97, and BAPS5/ST88, as well as infection types (Figure S5). These might be due to the presence of long-stay patients and frequent visitors to the clinic who are immunocompromised. For patients with BAPS3/ST97, the bloodstream infection term of "sepsis" was found to be the most important. For BAPS5/ST88, similar terms, i.e., "septic" and "shock," still ranked among the top ten most important terms. These critical clinical circumstances for patients carrying ST97 and ST88 are in concordance with the higher in-hospital mortality for these patients. Underscoring the clinical significance of these two clones (see Discussion), the evidence also shows the broad infection sites and patients’ circumstances linked with the infection/colonization of each of CA-MRSA clone.