General clinical data
We collected fresh fecal samples from 15 patients, a total of 30 samples, 15 of which were preoperative specimens and 15 postoperative specimens. We compared E2, AMH, and FSH in the RS1 group and RS2 group. The AMH level of the RS1 group was lower than that of the RS2 group, but the difference was not statistically significant. The FSH level was higher in the RS1 group than the RS2 group, but the difference was not statistically significant. The E2 level was significant higher in the RS1 group. Therefore, we believe that ovarian function was greater in the RS1 group compared to the RS2 group (Table 1).
Study of the structure of intestinal flora in patients with undergoing transabdominal hysterectomy
A total of 2,083,203 sequences were obtained from 30 samples from the 2 groups, with an average of 73,170 sequences per sample. After quality control, 69,440 valid data were obtained, and the quality control efficiency was 94.96%. The mean number of sequences in the RS1 group was 70,122.73±54.71, and the mean number of sequences in the RS2 group was 68,755.47±5159.97, where the difference between the two groups was not statistically significant (P=0.314). The sequence was clustered into OTUs (operational taxonomic units) with 97% identity. A total of 6651 OTUs were obtained, with an average of 221 OTUs per sample. Among them, the RS1 group had 5949 OTUs and the RS2 group had 5290 OTUs. Comparing the number of unique and common OTUs in the two groups according to the Veen chart, the OTU number composition and similar situation of the sample could be compared. The number of OTUs shared by the RS1 and RS2 groups was 4588, and the number of unique OTUs was 1361 in the RS1 group and 702 in the RS2 group. The Veen diagram showed that the diversity of RS1 was significantly higher than that of RS2. The petal plot indicated the number of OTUs contained in each sample of the RS1 and RS2 groups (Figure 1A, 1B).
In this study, the statistical analysis of the sample at 97% similarity level produced a rarefaction curve, and a significant plateau appeared on the starting curve of 7737 sequences, indicating that the sequencing depth was close to saturation, increasing the sequencing depth at 97% similarity. No more bacterial species could be found at the top. The combination of the rarefaction curve and the Shannon diversity curve indicated that the amount of data in this study was reasonable, the sequencing depth was sufficient; the detection rate of the bacterial species of the sample was close to saturation, meeting the requirements of subsequent bioinformatics analysis (Figure 1C, 1D, 1E). Second, by analyzing Good’s coverage index, the sequencing coverage of each sample was over 98%. 16S rRNA gene sequencing was effective in this study and represented more than 98% of the bacterial species in each sample, and the coverage of the bacterial species was good. The rank-abundance curve was steep, indicating that sample distribution was uneven, and there could have been a dominant flora; the curve span was large, indicating that the abundance of the species was high. As shown in(Figure 1F), the RS1 curve had a wide and flat span on the horizontal axis, indicating that species richness and uniformity of the RS1 group were better.
The analytical indices included ACE, Chao1, Simpson, and Shannon. ACE and Chao1 are indices for evaluating the number of OTUs contained in the sample. Simpson and Shannon indices were used to reflect the diversity of the sample population: the larger the Simpson, the lower the diversity of the flora, and the larger the Shannon, the higher the diversity of the flora. The Wilcoxon rank sum test found that there was no significant difference between the RS1 group and RS2 group, P=0.2328: Shannon index, P=0.2169; Simpson index, P=0.2017; ACE, P=0.3669; and Chao1, P=0.5125. There was no significant difference in diversity between RS1 and RS2 (Table 2).
Changes in microbial community composition after hysterectomy
The relevant bacterial composition of the patient was analyzed from the perspective of phylogeny (domain/phylum/class/order/family/genus). In the human microbiota, the bacterial group is quite conservative, which can directly reflect the heterogeneity of bacterial community structure in different human body parts. Therefore, when analyzing the composition of common human microorganisms, we must first explain the relative abundance of bacterial phyla. The law of variation, in the bacterial classification unit, the phylum can be called the highest classification unit. Species annotations were made by comparison with the Silva 132 database, and statistics were analyzed at different classification levels: there were 6651 OTUs, of which all could be annotated to the database (100.00%), and the proportion of annotations to the boundary level was 100.00%. The ratio of the phylum level is 91.35%, the ratio of the class level is 81.01%, the ratio of the order level is 69.59%, the proportion of the family level is 58.39%, the proportion of the genus level is 33.82%. According to the results of the species annotation, each species or group is selected in the top 10 species of the highest abundance in the horizontal phylum, class, order, family, genus, and the relative abundance of the species is generated. A cylindrical cumulative graph to visually view the species and their proportions of the relatively abundant abundance of each sample at different classification levels.
Analysis at the phylum level
At the phylum level, the top ten strains of the RS1 and RS2 groups ranked as the most abundant. In the RS1 group, they were: Bacteroidetes, Proteobacteria, Firmicutes, Acidobacteria, Actinobacteria, Gemmatimonadetes, Planctomycetes, Chloroflexi, Verrucomicrobia, Tenericutes, and bacteria that could not be classified (Figure 2A). In the RS2 group, they were: Bacteroidetes, Proteobacteria, Firmicutes, Melainabacteria, Acidobacteria, Cyanobacteria, Gemmatimonadetes, Actinobacteria, Verrucomicrobia, Planctomycetes, and bacteria that could not be classified (Figure 2B).At the phylum level,the dominant bacteria were basically the same, and the three dominant bacteria, Bacteroidetes, Proteobacteria and Firmicutes, accounted for more than 75% of the intestinal flora (Figure 2C). Significant individual differences occurred between the samples. The proportion of Bacteroidetes in each sample ranged from 2.85 to 78.77%, and the proportion of Proteobacteria in each sample ranged from 3.15 to 92.13%. Firmicutes were present in each sample at a proportion of 2.31 to 66.03%, and the relative abundance of Bacteroidetes was higher in the RS1 group than RS2 group, P=0.003. After surgery, Bacteroidetes decreased significantly, and the relative abundance of Proteobacteria was significantly lower in the RS1 group, P=0.016, and thus increased after surgery. The relative abundance of Firmicutes was lower in the RS1 group but not significantly, P=0.926 (Table 3); combined with UPGMA (Unweighted Pair-group Method with Arithmetic Means) clustering tree, in environmental biology, UPGMA is a commonly used clustering analysis method. It is the earliest method used to solve a classification problem. The UPGMA clustering analysis was performed with the Weighted UniFrac distance matrix and the Unweighted UniFrac distance matrix, and the clustering results were integrated with the relative abundance of the species at the phylum level (Figure 2D, 2E),suggesting that the structure of the two groups of bacteria is not significantly different.
Analysis at the class level
At the class level, the top ten strains in the RS1 and RS2 groups were selected. The RS1 group included: Bacteroidia, Gammaproteobacteria, Clostridia, Bacilli, Alphaproteobacteria, unidentified_Actinobacteria, Negativicutes, unidentified_Acidobacteria, Deltaproteobacteria, unidentified_Gemmatimonadetes, and bacteria that could not be classified (Figure 3A).The RS2 group included: Gammaproteobacteria, Clostridia, Bacteroidia, unidentified_Melainabacteria, Bacilli, Negativicutes, unidentified_Cyanobacteria, Alphaproteobacteria, unidentified_Acidobacteria, Deltaproteobacteria, and bacteria that could not be classified (Figure 3B).The dominant species at the class level were Gammaproteobacteria, Bacteroidia and Clostridia, which accounted for more than 50% of the intestinal flora (Figure 3C).Significant individual differences occurred between the samples. Gammaproteobacteria accounted for 2.04-91.43% of each sample, and Bacteroidia accounted for 2.85-78.77% of each sample, while Clostridia accounted for 0.37-64.80% of each sample.
Analysis at the order level
At the order level, the top ten strains in the RS1 and RS2 groups were selected. The RS1 group included: Bacteroidales, Enterobacteriales, Xanthomonadales, Clostridiales, Lactobacillales, Flavobacteriales, Bifidobacteriales, unidentified_Gammaproteobacteria, Selenomonadales, unidentified_Acidobacteria, and other bacteria that could not be classified (Figure 3D).The RS2 group included: Enterobacteriales, Clostridiales, Bacteroidales, unidentified_Melainabacteria, Lactobacillales, unidentified_Gammaproteobacteria, Selenomonadales, unidentified_Cyanobacteria, unidentified_Acidobacteria, Aeromonadales, and other bacteria that could not be classified (Figure 3E). The dominant species at the order level were Enterobacteriales, Bacteroidales and Clostridiales (Figure 3F).
Analysis of species differences and differences between species
The weighted UniFrac distance and the unweighted UniFrac distance were used to measure the difference coefficient between the two samples (Figure 4A).The unweighted UniFrac distance was tested by the Wilcoxon rank sum test, P=0.4646, and the weighted UniFrac distance was also tested by the Wilcoxon rank sum test, P=0.1083, indicating that there was no significant difference in species diversity between the two groups.
NMDS (Non-Metric Multi-Dimensional Scaling) statistics is a sorting method suitable for ecological research. The smaller the stress (<0.2), the more accurately . Stress=0.156 indicated that NMDS accurately reflected the degree of difference between samples (Figure 4B).
MRPP (Multi Response Permutation Procedure) analysis was used to analyze whether the differences in microbial community structure between groups were significant. A value of less than 0.05 indicates a significant difference. Table 4 showed that the differences between the two groups were significant.
To find the differential species between the groups at each classification level (phylum, class, order), a t-test test between the groups was performed to determine the species with significant differences (P<0.05).
At the phylum level, the species difference analysis between the t-test groups was obtained. Proteobacteria showed a significant difference between the two groups. The average abundance of the RS1 group was 34.36%, and the average abundance of the RS2 group was 54.04%, P<0.05 (Figure 4C).
At the class level, the difference in species between the t-test groups was obtained. Gammaproteobacteria (Proteobacteria) showed a significant difference between the two groups; the average abundance of RS1 was 22.74%, and the average abundance of RS2 was 48.89%, P<0.05. There was a significant difference between the groups for Alphaproteobacteria (Proteobacteria); the average abundance of RS1 was 7.56%, and the average abundance of RS2 was 3.20%, P<0.05 (Figure 4D).
At the order level, the difference in species between the t-test groups was determined. For Enterobacteriales (p__Proteobacteria; c__Gammaproteobacteria), the average abundance of RS1 was 9.44%, and the average abundance of RS2 was 42.05%, P<0.05, indicating a significant difference between the two groups. For Rhizobiales (p__Proteobacteria; c__Alphaproteobacteria), the average abundance of RS1 was 2.85%, and the average abundance of RS2 was 1.10%, P <0.05, indicating a significant difference between the two groups. For Caulobacterales (p__Proteobacteria; c__Alphaproteobacteria), the average abundance of RS1 was 0.58%, and the average abundance of RS2 was 0.13%, P<0.05, indicating a significant difference between the two groups. The average abundance of RS1 for Chthoniobacterales (p__Verrucomicrobia; c__Verrucomicrobiae) was 0.13%, and the average abundance of RS2 group was 0.04%, P<0.05, indicating a significant difference between the two groups (Figure 4E).
LEfSe (LDA(Linear Discriminant Analysis) effect size) is used to compare two or more groups. The LDA value distribution histogram shows the species with an LDA score greater than the set value (the default setting is 4), i.e., the biomarker with statistical differences between the groups. This study showed that the species with significant differences in abundance in the different groups were c_Gammaproteobacteria, f_Xanthomonadaceae, and o_Xanthomonadales, and the length of the histogram bar represents the size of the difference species (i.e., LDA score). C_Gammaproteobacteria was enriched in the RS2 group, and f_Xanthomonadaceae and o_Xanthomonadales were enriched in the RS1 group; the LDA score showed f_Xanthomonadaceae>o_Xanthomonadales, indicating great influence of f_Xanthomonadaceae in the RS1 group (Figure 4F, 4G).