Study Overview and Population Demographics
In this study, paired dental plaque samples were collected from caries and non-caries lesions from AA (n = 12) and LAH (n = 7) subjects with ECC (Supplemental Table S1–ECC cohort). This cohort (n = 19) was 58% female and 42% male, with a mean age of 4.4 years. The caries status of the subjects was determined by the total number of primary teeth (dmft) and tooth surfaces (dmfs) that were decayed, missing (due to decay), and filled. The mean dmft and dmfs among AA and LAH subjects were 8.2 and 18.9, respectively. To establish a non-disease baseline, plaque samples were collected from age-matched, caries-free AA (n = 9) and LAH (n = 3) children (Supplemental Table S1–Caries-free cohort). This cohort was 58% female and 42% male, with a mean age of 3.9 years. Both ECC and caries-free children had similar low socioeconomic backgrounds (≤ 322% FPL).
Sequencing Depth and Quality
The total RNA extracted from plaque samples ranged from 100 ng to 1000 ng, and high sequencing coverage was achieved with low adaptor contamination (Supplemental Table S2). Species accumulation curves for the ECC and caries-free cohorts (Supplemental Figure S1), as well as the AA and LAH (Supplemental Figure S2) indicate that the number of bacterial organisms reached a point of diminishing returns (at approximately 32 samples in the ECC cohort and at 22 samples in the caries-free cohort) which suggests that the sequencing depth and the number of samples we collected sufficiently captured most dominant taxa.
Bacterial Species Abundance
The bacterial species composition of each sample was determined by alignment-based taxonomic classification of sequence reads using MTSv software package (43). Supplemental Table S3 shows the species with the most significant fold change (differences in gene expression between the paired plaque samples within the ECC cohort). In 13 of 19 children, the median fold change for Streptococcus mutans reads was nearly 500-fold higher in caries plaque. Other classical cariogenic bacterial species, including Lactobacillus rhamnosus and Streptococcus parasanguinis, were also identified in higher abundance in caries plaque. Interestingly, there were non-classical cariogenic bacterial species, including Parascardovia denticolens, Olsenella sp. oral taxon 807, Olsenella uli, Selenomonas sputigena, and Prevotella denticola, consistently more abundant in caries plaque of at least 7 ECC subjects.
Principle Component Analysis (PCA) was completed to identify the similarity of bacterial species composition of each sample type (caries plaque, non-caries plaque, and caries-free plaque). The non-caries plaque and the caries-free plaque samples generally clustered together, although two samples from a caries-free subject clustered near the caries plaque samples, suggesting a potential pre-caries stage (Supplemental Figure S3).
Shifting Metabolic Functions and Contributing Bacterial Species in Early Childhood Caries
To identify the changes in gene expression and bacterial species contributing to caries development, we further assigned the reads by gene and species using the same alignment-based approach (43). Differential gene expression analysis was carried out between caries lesions and matched non-caries plaque samples in the ECC cohort, as well as healthy, caries-free plaque samples in the caries-free cohort. Due to the similarities between the caries-free cohort samples and the non-caries plaque samples in the ECC cohort (Supplemental Figure S3), we primarily focused on the genes that were significantly differentially expressed (SDE) between the matched caries and non-caries plaque samples to elucidate the contributing taxonomic and putative metabolic mechanisms of cavity formation within the same mouth and to capture better the differences between the African American and Hispanic groups.
Metatranscriptomic analysis showed that core microbes contribute to the SDE genes shared among all ECC subjects, supporting the well-established drivers of ECC. We identified 7,763 SDE genes among all subjects with caries (Fig. 1), representing 2,186 KEGG orthologues (KOs). Of these genes, 6,967 were upregulated, and 676 were downregulated. Streptococcus mutans and Lactobacillus rhamnosus were associated with many genes that were SDE in both the AA and LAH groups; however, we also identified SDE genes from Veillanella parvula and Propionibacterium acidifaciens, which are more rarely associated with ECC. Genes that were upregulated in both groups (Fig. 2 – shared) were associated with S. mutans (15%), Veillonella parvula (15%), Propionibacterium acidifaciens (9%), lactobacilli (Lactobacillus rhamnosus–9%, L. gasseri–6%, L. oris–4%), Olsenella sp. (8%), and Bifidobacterium longum (5%).
The top functions of the shared up and downregulated genes for each species are shown in Fig. 3. Upregulated SDE genes were associated with more diverse bacterial species, whereas the downregulated SDE genes were predominantly associated with three bacterial species: Pseudopropionibacterium propionicum (51%), Abiotrophia defective (30%), and Streptococcus sanguinis (9%). Many upregulated porphyrin metabolism genes and lipopolysaccharide biosynthesis genes were associated with V. parvula (Fig. 3A). Most of the shared downregulated genes were associated with Pseudopropionibacterium propionicum and were related to oxidative phosphorylation and propanoate metabolism, while downregulated genes in S. sanguinis were associated with alkali-generating pathways (Fig. 3B).
Differences in active organisms and functions between African American and Latin American Hispanic children with ECC
Apart from the shared SDE genes, the AA and LAH groups presented a significant difference in the organisms and gene functions that were SDE in caries plaque. We identified 4,970 genes (from 2,106 KOs) that were uniquely SDE in AA ECC children and 6,519 genes (from 2,236 KOs) uniquely SDE in LAH ECC children (Fig. 1). Genes unique in AA children were primarily associated with two bacterial species, Pseudopropionibacterium propionicum and Cardiobacterium hominis, and most genes (3,224/4,970–65%) were downregulated (Figs. 2, 4, and 5), suggesting that these species were not as prevalent or active in the disease (caries) status. In addition, other studies have linked both species to healthy caries-free status (45, 48). The upregulated genes in the AA children included several acid-utilizing and/or producing species that are commonly associated with caries such as Campylobacter curvus, Streptococcus sp., lactobacilli, B. longum, and Selenomonas sputigena (26, 49, 50), as well as Streptococcus parasanguis which has been shown to help buffer the pH of the biofilm through alkali production (47).
Genes that were only SDE in the LAH ECC children were associated with a more diverse group of microorganisms (Propionibacterium acidifaciens, Selenomonas sp., Rothia dentocariosa, Atopobium parvulum, and Streptococcus sanguinis) and most of the genes (5,683/6,519–87%) were upregulated (Figs. 2, 4, and 5). Figure 4 illustrates a distinctive pattern of upregulated genes in the LAH children, with the predominant species differing from those observed in the AA children. The predominant contributors were S. sanguinis and A. defectiva, exhibiting notably fewer downregulated genes, although some downregulated genes were associated with Escherichia coli.
Overall, our data showed that there were major differences in gene expression and the active organisms between the AA and LAH children in caries plaque. There were 795 KOs that were SDE expressed in only one of the groups: 413 (52%) in the AA group and 382 (48%) in the LAH group (Supplemental Figure S4). This suggests that unique functions or pathways are specific to each group that are active in caries development. Additionally, even when the same KOs are SDE in both groups, there is evidence that different organisms fill these functional roles. Figure 6 shows similar changes in expression among orthologous KOs but in genes associated with a different set of species in each group. This evidence underscores an ecological shift in the contributing bacterial species despite maintaining a consistent functional profile.