In our effort to describe tumour microbial landscape, we explored the differences in microbiome abundance, diversity, the presence/absence of the species and the proportion of samples with the respective genera in different sample types across patient groups defined by clinical variables (Table 1).
Table 1
Table of clinical variables and their distribution in the full set of 186 patients and subset of 133patients and CRC tumour microbial subtypes. (For categorical variable Fisher exact test was performed and for continuous data Kruskal-Wallis test was used.)
|
Data subset comparison
|
Tumour microbiome subtypes
|
|
all tumours (N = 186)
|
triplets (N = 133)
|
p-value
|
TS1
(N = 54)
|
TS2
(N = 32)
|
TS3
(N = 100)
|
p-value
|
age at dg
|
mean (SD)
|
mean (SD)
|
0.769
|
mean (SD)
|
mean (SD)
|
mean (SD)
|
0.467
|
|
67.03 (10.65)
|
66.68 (10.65)
|
|
67.24 (9.48)
|
64.94 (12.31)
|
67.59 (10.71)
|
|
gender
|
N (%)
|
N (%)
|
1
|
N (%)
|
N (%)
|
N (%)
|
0.7463
|
male
|
104 (55.9)
|
74 (55.6)
|
|
30 (55.6)
|
16 (50.0)
|
58 (58.0)
|
|
female
|
82 (44.1)
|
59 (44.4)
|
|
24 (44.4)
|
16 (50.0)
|
42 (42.0)
|
|
tumour location
|
N (%)
|
N (%)
|
0.650
|
N (%)
|
N (%)
|
N (%)
|
< 0.001
|
right
|
66 (35.5)
|
50 (37.6)
|
|
32 (59.3)
|
3 ( 9.4)
|
31 (31.0)
|
|
transversum
|
19 ( 10.2)
|
13 ( 9.8)
|
|
7 (13.0)
|
2 ( 6.2)
|
10 (10.0)
|
|
left
|
47 (25.3)
|
38 (28.6)
|
|
5 ( 9.3)
|
9 (28.1)
|
33 (33.0)
|
|
rectosigmoideum
|
32 (17.2)
|
23 (17.3)
|
|
5 ( 9.3)
|
8 (25.0)
|
19 (19.0)
|
|
rectum
|
22 (11.8)
|
9 ( 6.8)
|
|
5 ( 9.3)
|
10 (31.2)
|
7 ( 7.0)
|
|
grade
|
N (%)
|
N (%)
|
0.994
|
N (%)
|
N (%)
|
N (%)
|
< 0.001
|
NA, in situ
|
8 ( 4.3)
|
6 ( 4.5)
|
|
0 ( 0.0)
|
1 ( 3.1)
|
7 ( 7.0)
|
|
1
|
17 ( 9.1)
|
11 ( 8.3)
|
|
0 ( 0.0)
|
2 ( 6.2)
|
15 (15.0)
|
|
2
|
108 (58.1)
|
78 (58.6)
|
|
26 (48.1)
|
24 (75.0)
|
58 (58.0)
|
|
3
|
53 (28.5)
|
38 (28.6)
|
|
28 (51.9)
|
5 (15.6)
|
20 (20.0)
|
|
stage
|
N (%)
|
N (%)
|
0.963
|
N (%)
|
N (%)
|
N (%)
|
0.02702
|
0
|
8 ( 4.3)
|
6 ( 4.5)
|
|
0 ( 0.0)
|
1 ( 3.1)
|
7 ( 7.0)
|
|
I
|
35 (18.8)
|
29 (21.8)
|
|
3 ( 5.6)
|
8 (25.0)
|
24 (24.0)
|
|
II
|
69 (37.1)
|
48 (36.1)
|
|
27 (50.0)
|
12 (37.5)
|
30 (30.0)
|
|
III
|
48 (25.8)
|
34 (25.6)
|
|
15 (27.8)
|
7 (21.9)
|
26 (26.0)
|
|
IV
|
26 (14.0)
|
16 (12.0)
|
|
9 (16.7)
|
4 (12.5)
|
13 (13.0)
|
|
tumour pathologic
|
N (%)
|
N (%)
|
0.987
|
N (%)
|
N (%)
|
N (%)
|
< 0.001
|
pTis
|
8 ( 4.3)
|
6 ( 4.5)
|
|
0 ( 0.0)
|
1 ( 3.1)
|
7 ( 7.0)
|
|
pT1
|
11 ( 5.9)
|
10 ( 7.5)
|
|
0 ( 0.0)
|
3 ( 9.4)
|
8 ( 8.0)
|
|
pT2
|
36 (19.4)
|
27 (20.3)
|
|
3 ( 5.6)
|
8 (25.0)
|
25 (25.0)
|
|
pT3
|
117 (62.9)
|
81 (60.9)
|
|
46 (85.2)
|
16 (50.0)
|
55 (55.0)
|
|
pT4a
|
9 ( 4.8)
|
5 ( 3.8)
|
|
4 ( 7.4)
|
1 ( 3.1)
|
4 ( 4.0)
|
|
pT4b
|
5 ( 2.7)
|
4 ( 3.0)
|
|
1 ( 1.9)
|
3 ( 9.4)
|
1 ( 1.0)
|
|
node pathologic
|
N (%)
|
N (%)
|
0.827
|
N (%)
|
N (%)
|
N (%)
|
0.1382
|
pN0
|
114 (61.3)
|
84 (63.2)
|
|
30 (55.6)
|
21 (65.6)
|
63 (63.0)
|
|
pN1a
|
25 (13.4)
|
20 (15.0)
|
|
7 (13.0)
|
1 ( 3.1)
|
17 (17.0)
|
|
pN1b
|
21 (11.3)
|
16 (12.0)
|
|
5 ( 9.3)
|
5 (15.6)
|
11 (11.0)
|
|
pN1c
|
0 ( 0.0)
|
0 ( 0.0)
|
|
0 ( 0.0)
|
0 ( 0.0)
|
0 ( 0.0)
|
|
pN2
|
0 ( 0.0)
|
0 ( 0.0)
|
|
0 ( 0.0)
|
0 ( 0.0)
|
0 ( 0.0)
|
|
pN2a
|
10 ( 5.4)
|
7 ( 5.3)
|
|
4 ( 7.4)
|
2 ( 6.2)
|
4 ( 4.0)
|
|
pN2b
|
14 ( 7.5)
|
5 ( 3.8)
|
|
8 (14.8)
|
3 ( 9.4)
|
3 ( 3.0)
|
|
pNX
|
2 ( 1.1)
|
1 ( 0.8)
|
|
0 ( 0.0)
|
0 ( 0.0)
|
2 ( 2.0)
|
|
metastasis
|
N (%)
|
N (%)
|
0.737
|
N (%)
|
N (%)
|
N (%)
|
0.843
|
M0
|
160 (86.0)
|
117 (88.0)
|
|
45 (83.3)
|
28 (87.5)
|
87 (87.0)
|
|
M1
|
26 (14.0)
|
16 (12.0)
|
|
9 (16.7)
|
4 (12.5)
|
13 (13.0)
|
|
β-diversity analysis by NMDS performed on all sample types showed that tumor location was the factor with the highest influence on total microbiome composition for all sample types, while tumour histological grade affected only tumour and visually-normal samples, which had similar microbial profiles (Text S1, Figure S2, Figure S3).
Microbial categorization according to sample type
There was no significant difference between the read counts across different sample types (paired analysis of sample triplets, see Methods).
Overall, in all the 505 samples we identified 5553 ASVs, of these, 4920 ASVs in the 133 triplet samples. The QIIME assigned species only to 50 ASVs, hence we also performed manual BLAST search to the SILVA database (Table S3).
For further analysis, however, we mainly operated on higher taxonomic levels. After the taxa filtering step (Table S2) 14 phyla, 26 classes, 44 orders, 77 families and 268 genera were identified in the 133 triplets, most of which in all three sample types (Table S4). Inclusion of the additional 53 duplets (tumour mucosa and visually-normal mucosa swabs) resulted only in slight differences at the genus level – the identified taxa remained the same, what changed was their unique presence in some sample types (Text S1).
While most of the genera were found in all three sample types, their incidence and abundance across sample types varied greatly, mainly between mucosal samples and stool, both in overall and pairwise comparisons (Text S1). 14 genera (Stomatobaculum, Pseudoramibacter, Pelomonas, Pasteurella, Mycoplasma, Kingella, Johnsonella, Helicobacter, Deinococcus, Centipeda, Bergeyella, Actinobacillus, Abiotrophia and an unassigned genus from order Comamonadaceae) were detected only in mucosal (tumour and visually-normal) samples (Figure S4).
We further analysed the pairwise incidence of the 268 genera across sample types using Cochran’s Q test and subsequent pairwise McNemar’s tests and found that 128 genera varied significantly across sample types (analysis of 133 triplets, Text S1, Fig. 2A, Table S5).
To categorize the microbial genera based on their preferred environment we compared their abundance across sample types using Friedman rank sum test. Out of the 268 genera, 104 differed significantly in abundance across the sample types (Table 2, Fig. 1). Based on these results, we defined five microbial categories. The first is based solely on the results of tumour vs stool comparison: tumour genera (46 genera, more abundant in tumours compared to stool). Additionally, within the category of tumour genera, we defined mucosa genera (41 genera, enriched also in visually-normal mucosa compared to stool) and tumour-specific genera (18 genera of tumour category, additionally enriched in tumours compared to visually-normal mucosa). Fifty genera were significantly more abundant in stool compared to tumours and visually-normal mucosa form the group of stool genera. The fifth category was defined as the no-difference genera (164 genera, no difference across any of the sample types) (Text S1).
Table 2
Total counts of genera found significantly differentially abundant across the three sample types (133 triplets), divided into categories according to their enrichment in different sample types and top 10 significant genera for each category (see Table S6 for all genera). Abbreviations: “TtoS”: tumour to stool, “VNtoS”: visually-normal to stool, “TtoVN”: tumour to visually-normal.
Category I
|
Freq
|
Category II
|
Freq
|
Category III
|
Freq
|
T to S
|
VN to S
|
T to VN
|
N
|
top 10 significant genera
|
Tumour genera (enriched in tumours compared to stool)
|
46
|
Mucosa genera (enriched in both mucosal samples compared to stool)
|
41
|
|
|
↑
|
↑
|
↓
|
5
|
G_Prevotella 1, G_Romboutsia, F_Peptostreptococcaceae; G_Unassigned, G_Faecalitalea, G_Massil
|
|
|
↑
|
↑
|
-
|
23
|
G_Haemophilus, G_Bacteroides, G_Veillonella, G_Pseudomonas, G_Escherichia-Shigella, G_Aggregatibacter, G_Actinobacillus, G_Neisseria, G_Coprobacter, G_Selenomonas 4
|
Tumour-specific genera (enriched in tumours compared to both stool and normal mucosa)
|
18
|
↑
|
↑
|
↑
|
13
|
G_Fusobacterium, G_Campylobacter, G_Gemella, G_Leptotrichia, G_Parvimonas, G_Granulicatella, G_Selenomonas 3, G_Eikenella, G_Selenomonas, G_Hungatella
|
|
|
↑
|
-
|
↑
|
4
|
G_Howardella, G_Solobacterium, G_Streptococcus, G_Slackia
|
|
|
↑
|
↓
|
↑
|
1
|
G_Defluviitaleaceae UCG-011
|
No difference between stool and tumours
|
170
|
|
|
Enriched only in visually normal mucosa compared to stool
|
4
|
-
|
↑
|
↓
|
4
|
O_Clostridiales, G_Odoribacter, G_Parabacteroides, G_Ruminococcaceae UCG-004
|
|
|
No-difference genera between sample types
|
164
|
-
|
-
|
-
|
164
|
|
|
|
Depleted only in normal mucosa compared to stool
|
|
-
|
↓
|
-
|
1
|
G_Methanobrevibacter
|
|
|
|
-
|
↓
|
↑
|
1
|
G_Cloacibacillus
|
Depleted in tumours compared to stool
|
52
|
|
|
|
|
↓
|
-
|
-
|
1
|
G_Pseudobutyrivibrio
|
|
|
Depleted in tumours both compared to stool and normal mucosa
|
4
|
↓
|
↑
|
↓
|
1
|
G_Coprococcus 2
|
Stool genera (depleted in both mucosal samples compared to tool)
|
50
|
↓
|
↑
|
↓
|
3
|
G_Ruminococcaceae UCG-013, G_Lachnospiraceae ND3007 group, G_Alistipes
|
|
|
↓
|
↓
|
↑
|
3
|
G_Bifidobacterium, G_Acidaminococcus, G_Fusicatenibacter
|
|
|
↓
|
↓
|
-
|
44
|
G_[Eubacterium] coprostanoligenes group, G_Ruminococcaceae UCG-002, F_Lachnospiraceae; G_Unassigned, G_Christensenellaceae R-7 group, G_Ruminococcaceae NK4A214 group, G_Lachnospira, G_Ruminococcaceae UCG-005, G_Ruminococcus 2, G_Ruminococcaceae UCG-003, F_Clostridiales vadinBB60 group; G_uncultured bacterium
|
Overall
|
268
|
|
|
|
|
|
|
|
268
|
|
The landscape of CRC tumour microbiome
For the description of tumour mucosa microbial heterogeneity without possible stool contaminants, we only considered species that were statistically significantly enriched in tumour mucosa compared to stool. We hence investigated the group of 46 tumour genera with special focus on the subgroup of 18 tumour-specific genera (Bergeyella, Gemella, Granulicatella, Parvimonas, Hungatella, Peptoclostridium, Flavonifractor, Selenomonas, Selenomonas 3, Fusobacterium, Leptotrichia, Eikenella, Campylobacter, Slackia, Streptococcus, Howardella, Solobacterium, Defluviitaleaceae UCG-011, Fig. 2A).
19 tumour genera (of which 14 tumour-specific), co-occurred in tumour mucosa significantly more often: Eikenella, Campylobacter, Solobacterium, Gemella, Leptotrichia, Parvimonas, Bergeyella, Peptoclostridium, Howardella, Slackia, Selenomonas 3, Granulicatella, Selenomonas, Fusobacterium, Aggregatibacter, Alloprevotella, Treponema 2, Selenomonas 4 and Veilonella (Figure S5, Table S7). On the other hand, we observed significantly decreased co-occurrence of the following pairs of genera: Selenomonas and Tyzzerella 4, Pseudomonas and Massillias; Selenomonas 3 and Morganella, Coprobacter and Treponema 2 and Coprococcus and Faecalitalea.
Tumour genera incidence ranged from 3.2–99.4% (median 34.4%) of tumours with the median abundance of the individual genera in the samples ranging from 0.015–28.8% (median 0.23%) (Fig. 2A). Overall, tumour genera constituted 10.8–96.7% (median 58.6%) while the tumour-specific genera constituted between 0.0%-63.7% (median 2.6%) of tumour mucosa microbiome (Fig. 2B).
We performed detailed literature search which revealed that tumour genera consisted predominantly of oral bacteria, many known as oral pathogens. The presence of some genera on tumor mucosa was never reported before nor associated with CRC (Table S8).
Microbiome and clinical variables
In the next step, we assessed the association of microbiome abundance with the clinical parameters and interpreted the results based on our microbial categorization. The results for each clinical variable are summarized in Table 3.
Table 3
Summary of rank regression results (p < 0.05) associating microbiome of the three different sample types with the clinical variables. Bold text denotes genera significant at FDR < 0.1. Up and down arrows denote increase or decrease in abundance, respectively.
Regression covariate
|
Effect/Contrast
|
Tumour mucosa
|
Visually-normal mucosa
|
Stool
|
grade
|
increasing grade
|
↑ Fusobacterium, Campylobacter, Mobigacterium, Leptotrichia
|
|
|
location
|
right-sided/ transverse vs left-sided and rectum/rectosigmoid
|
↑ Erysipelatoclostridium, Holdemania, Selenomonas 3, Selenomonas 4, Fretibacterium
|
↑ Fretibacterium, Selenomonas
|
|
left-sided vs right-sided/ transverse
|
↑ Fusicatenibacter, Christensenellaceae R-7 group, Ruminococcaceae UCG − 013, Coprococcus 1, Family XIII AD3011 group
↓ Campylobacter
|
↑ Bifidobacterium, Lachnospira
|
|
rectosigmoid/rectum vs right-sided/transverse
|
↑ Fusicatenibacter, Christensenellaceae R-7 group, Ruminococcaceae UCG − 013, Bifidobacterium
|
↑ Bifidobacterium, Lachnospira, [Eubacterium] ventriosum group
|
|
interaction between
grade and location
|
low-graded; left-sided
|
↑ Lachnospira, Ruminoclostridium 6, Gemella, [Eubacterium] ventriosum group, Methanobrevibacter, an uncultured bacterium from Opitutae vadinBB60 group family, Ruminococcaceae UCG-010, Victivallis,
an uncultured species and an Incertae Sedis genus from Lachnospiraceae family
|
↑ Gemella
|
↑ Ruminoclostridium 6
|
grade 2; right-sided
|
↑ Lachnoclostridium
|
|
↑ Ruminoclostridium 6
|
grade 2; transverse, left-sided
|
↑ Lachnoclostridium
|
|
|
grade 3; right-sided
|
↑ Prevotella, Selenomonas
|
|
|
grade3; rectosigmoid/rectum
|
↑ Lachnospira
|
|
↑ Prevotella
|
stage
|
III-IV vs 0-II
|
↑ Peptoclostridium, Fusobacterium
↓ Parabacteroides, Lachnospiraceae FCS020 group, Tyzzerella 4
|
↑ Peptoclostridium
|
↑ Streptococcus, Peptococcus, Akkermansia
↓ Dorea, Ruminiclostridium
|
T
|
pT 3–4 vs pTis-2
|
↑ Fusobacterium, Campylobacter, Selenomonas, Holdemania, Anaerotruncus, Peptoclostridium, Mogibacterium, Parvimonas, Family XIII UCG-001, one unassigned genera from Ruminococcaceae family
↓ Parabacteroides, Prevotellaceae NK3B31 group, Coprococcus 2, Lachnospiraceae FCS020 group, Tyzzerella 4, [Ruminococcus] gauvreauii group, Sutterella, Neisseria, Enterococcus, Actinobacillus, Staphylococcus, Peptostreptococcus
|
↑ Peptoclostridium, Selenomonas, Gemella
↓ Corynebacterium, Coprobacter, Parabacteroides, Anaerostipes, Tyzzerella 3, [Ruminococcus] gauvreauii group, Lachnospiraceae FCS020 group, Lachnospiraceae ND3007 group, Lachnospiraceae UCG-001, Haemophilus
|
↑ Fusobacterium, Anaerotruncus, Ruminococcaceae NK4A214 group
↓ Lachnospira, Coprococcus 2, Lachnospiraceae UCG-001, Hydrogenoanaerobacterium
|
N
|
N1-2 vs N0
|
↑ Peptoclostridium
↓ Prevotellaceae UCG-001
|
↑ Selenomonas
↓ [Eubacterium] hallii group
|
↑ Coprobacter, Peptococcus Campylobacter, Akkermansia, Selenomonas, Porphyromonas, Streptococcus
↓ Faecalibacterium, Ruminiclostridium, Dorea, uncultured bacterium from Bacteroidales S24-7 group family
|
M
|
present vs absent
|
↑ Porphyromonas, Streptococcus,
Ruminococcaceae UCG-005
|
↑ Akkermansia
↓ Gelria, [Eubacterium] brachy group,
uncultured genera from Christensenellaceae family
|
↑ uncultured genus from Erysipelotrichaceae family, Akkermansia, Coprococcus 1, Solobacterium
↓ Metanosphaera
|
age
|
increasing age
|
↓ Lachnospira, Pseudobutyvibrio,
Erysipelotrichaceae UCG-003
|
↑ uncultured species from the Ruminococcaceae
family
↓ Lachnospira, Erysipelotrichaceae UCG − 003
|
↑ Ruminiclostridium 5
↓ Pseudobutyrivibrio
|
gender
|
men vs women
|
↑ Mogibacterium, Dorea, Roseburia, Bacteriodes,
Paraprevotella, Eubacterium hallii group,
Subdoligranulum, Sutterella
↓ Fretibacterium, [Eubacterium] nodatum group,
Aeromonas
|
↑ Mogibacterium, Dorea, Roseburia, Collinsella,
Howardella, Lachnospira, Ruminiclostridium, Acidaminococcus, Haemophilus
↓ Ruminiclostridium 5
|
↑ Acidaminococcus, Sutterella, Mogibacterium
↓ Actinomyces, Hungatella, Oscillospira,
Ruminiclostridium 9, Erysipelatoclostridium,
Tyzzerella, uncultured genus from family
Erysipelotrichaceae, Eggerthella
|
Tumour grade, location and microbiome
In the tumour mucosa samples, 35 genera were found significantly associated with grade or location, of which 18 with significant interaction (Table S9, Figure S6, Figure S7, Figure S8, Figure S9). Some of these associations were found also in visually-normal mucosa (analysis of 186 samples).
In tumour mucosa, three genera, Fusobacterium, Campylobacter and Mobigacterium, significantly increased with tumour grade regardless of primary tumour side. Campylobacter was also associated with lower abundance in the mucosa of the left sided tumours. Of these, in the visually-normal mucosa adjacent to these tumours, none remained associated with high tumour grade.
Leptotrichia was significantly increased in advanced grade tumours (2 or 3 compared to 1); while the interactional model was chosen according to the drop dispersion test, none of interactional coefficients were significant. Prevotella and Selenomonas both had significantly increased abundances in the grade 3 right sided tumours. Prevotella was also increased in the stool of patients with grade 3 rectosigmoid/rectum tumours. Lachnoclostridium was associated with grade 2 in all tumours except for rectosigmoideum/rectum.
In contrast, Lachnospira, Ruminoclostridium 6, Gemella, [Eubacterium] ventriosum group, Methanobrevibacter, an uncultured bacterium from Opitutae vadinBB60 group family, Ruminococcaceae UCG-010, Victivallis and an uncultured species and an Incertae Sedis genus from Lachnospiraceae family were significantly enriched mainly in left-sided (for some including rectosigmoid/rectum) low-grade tumours. Lachnospira increased in abundance in grade 3 tumours of the rectosigmoid/rectum. In visually-normal mucosa, the same association was observed for Gemella, while Lachnospira was only significantly enriched in left, rectosigmoid and rectum location and [Eubacterium] ventriosum group only in rectum. Ruminoclostridium 6 remained enrichedin the stool of patients with low-grade left-sided tumours and grade 2 right-sided tumors.
Erysipelatoclostridium, Holdemania, Selenomonas 3 and Selenomonas 4 and Fretibacterium were increased in mucosa of the right-sided and transverse tumours. The same associations were found only for Fretibacterium and Selenomonas in the visually-normal mucosa. Fusicatenibacter, Christensenellaceae R-7 group, Ruminococcaceae UCG − 013 were increased in mucosa of the left-sided, rectosigmoid and rectal tumours, Coprococcus 1 and Family XIII AD3011 group in the mucosa of left-sided tumours and Bifidobacterium in the mucosa of rectosigmoid and rectal tumours. Similar associations were found for visually-normal mucosa for the Bifidobacterium.
For Flavonifractor and Odoribacter, although the model itself was significant in the drop dispersion test, none of the coefficients was significant alone.
Tumour stage and microbiome
When comparing early (0-II) and advanced (III-IV) stages, we identified increased abundance of Peptoclostridium and Fusobacterium associated with advanced stage and increased abundance of Parabacteroides, Lachnospiraceae FCS020 group and Tyzzerella 4 associated with early stage tumours. Of these, only Peptoclostridium was associated with an advanced stage in visually-normal tissue. None of these genera were significant after the FDR correction (p < 0.05, FDR > 0.1). In stool, we found a different set of genera associated with advanced tumour stage: increased Streptococcus, Peptococcus and Akkermansia and decreased Dorea and Ruminiclostridium (p < 0.05, FDR > 0.1).
Patients with advanced T stages (pT 3–4) were characterized by significant increase in abundance of Fusobacterium and Campylobacter (FDR < 0.1) as well as Selenomonas, Holdemania, Anaerotruncus, Peptoclostridium, Mogibacterium, Parvimonas, Family XIII UCG-001 and one unassigned genera from Ruminococcaceae family (p < 0.05, FDR > 0.1) in tumour mucosa, increased Peptoclostridium, Selenomonas and Gemella in visually-normal tissue, and increased Fusobacterium, Anaerotruncus and Ruminococcaceae NK4A214 group in stool.
Early T stage tumours (pTis-2) were associated with increase in Parabacteroides, Prevotellaceae NK3B31 group, Coprococcus 2, Lachnospiraceae FCS020 group, Tyzzerella 4, [Ruminococcus] gauvreauii group, Sutterella, Neisseria, Enterococcus, Actinobacillus, Staphylococcus and Peptostreptococcus (p < 0.05, FDR > 0.1) in tumour mucosa, increased Corynebacterium, Coprobacter, Parabacteroides, Anaerostipes, Tyzzerella 3, [Ruminococcus] gauvreauii group, Lachnospiraceae FCS020 group, Lachnospiraceae ND3007 group, Lachnospiraceae UCG-001 and Haemophilus in visually-normal tissue and Lachnospira, Coprococcus 2, Lachnospiraceae UCG-001, or Hydrogenoanaerobacterium, but none of these associations was significant after FDR correction (Table S9, Figure S10, Figure S11).
The presence of metastases (local or distant) at the time of diagnosis was predominantly associated with changes in stool microbiome. Except for uncultured genus from the Erysipelotrichaceae family, none of these associations were significant after FDR correction.
Lymph-node metastases (N1-2) were associated with increased abundance of Peptoclostridium on tumour mucosa, Selenomonas on visually-normal mucosa and increased abundance of Coprobacter, Peptococcus Campylobacter, Akkermansia, Selenomonas, Porphyromonas and Streptococcus in stool (p < 0.05, FDR > 0.1). Tumours without lymph-node metastases showed increased abundance of Prevotellaceae UCG-001 in tumour mucosa, [Eubacterium] hallii group on visually-normal mucosa and Faecalibacterium, Ruminiclostridium, Dorea and uncultured bacterium from Bacteroidales S24-7 group family in stool (p < 0.05, FDR > 0.1).
The presence of distant metastases was associated with increased abundance of Porphyromonas, Streptococcus and Ruminococcaceae UCG-005 on tumour mucosa, Akkermansia in visually-normal tissue (p < 0.05, FDR > 0.1) and an uncultured genus from Erysipelotrichaceae family (FDR < 0.1), Akkermansia, Coprococcus 1 and Solobacterium (p < 0.05, FDR > 0.1) in stool.
Patients without distant metastases had increased Gelria, [Eubacterium] brachy group and uncultured genera from Christensenellaceae family in visually-normal tissue and increased abundance of Metanosphaera in stool (p < 0.05, FDR > 0.1) (Table S9, Figure S10, Figure S12).
Gender, age and microbiome
We found three genera from tumour mucosa with abundances decreasing with increasing patients’ age: Lachnospira, Pseudobutyvibrio (p < 0.01, FDR > 0.1) and Erysipelotrichaceae UCG-003 (FDR < 0.1). Lachnospira was decreasing with age also on visually-normal mucosa, together with Erysipelotrichaceae UCG − 003 (p < 0.01, FDR > 0.1). An uncultured species from the Ruminococcaceae family increased in abundance with increasing age in visually-normal mucosa (p < 0.01, FDR > 0.1). In stool, Pseudobutyrivibrio decreased and Ruminiclostridium 5 increased with age (p < 0.01, FDR > 0.1). (Table S9, Figure S10, Figure S13).
No significant (FDR < 0.1) associations between patients' gender and microorganisms’ abundance were found for any sample type. Non-adjusted results (p < 0.01, FDR > 0.1), however, pointed to increased Mogibacterium, Dorea, Roseburia in both tumour and visually-normal mucosa in men, Bacteriodes, Paraprevotella, Eubacterium hallii group, Subdoligranulum, Sutterella in tumour mucosa of men, Collinsella, Howardella, Lachnospira, Ruminiclostridium, Acidaminococcus and Haemophilus in visually-normal mucosa of men and Acidaminococcus, Sutterella and Mogibacterium in stool of men with CRC. In contrast, women with CRC had increased Fretibacterium, [Eubacterium] nodatum group and Aeromonas in tumour mucosa, Ruminiclostridium 5 in visually-normal mucosa and Actinomyces, Hungatella, Oscillospira, Ruminiclostridium 9, Erysipelatoclostridium, Tyzzerella, uncultured genus from family Erysipelotrichaceae and Eggerthella in stool. (Table S9, Figure S10, Figure S14).
Tumour CRC microbial subtypes
We performed hierarchical clustering of patients based on the relative abundance of the 46 tumour genera in the tumour mucosa samples. As a distance measure between patients we used Aitchinson’s distance, and Euclidean distance for the clustering of genera (see Methods).
Based on the tumour-mucosa microbial composition we observed three major subtypes of tumours (TS1-TS3), that could further be divided into seven groups (a-h) (Fig. 3). The bacteria clustered into six groups M1-M6 (Table S8). The seven minor groups a-h are reflecting profiles of certain individual species, such as Sutterella, Peptoclostridium, Flavonifractor, Coprobacter, Aggregatibacter, Granulicatella, Hungatella, Alloprevotella or Slackia. We associated these subtypes with clinical variables (Table 1).
The M1 group and M2 group are represented by typical gut microbiome members. The M1 group consists of five most common and most abundant genera Fusobacterium, Lachnoclostridium, Bacteroides, Escherichia-Shigella and one uncultured genus from family Lachnospiraceae. All tumours contain at least three of these bacteria, most tumours (78,5%) all five. These bacteria have high co-occurrence across sample types (Fig. 2A, fourth panel), except for Fusobacterium. The M3, M4 and M5 groups contain predominantly oral microbiome genera. These genera have significantly different incidence across sample types, with 44.0-94.6% patients missing these genera in stool if present on tumour mucosa. Group M5 is more heterogeneous and with respect to the tumour microbial subtypes can be further divided into two subgroups: (i) Alloprevotella, Howardella, Slackia and (ii) Treponema 2, Selenomonas, Selenomonas 3, Selenomonas 4. Group M6 consists of sixteen less common species.
Tumour microbial subtype 1 (TS1) represents 29% (54) of tumours and is defined by presence of M1-3 and M5 microbial groups and overall contains most of the high-grade associated genera (Fusobacterium, Campylobacter, Leptotrichia and Selenomonas). This subtype is enriched in right-sided tumours (59.3%), grade 3 tumours (51.9%), tumours with pT3 or pT4 stage (94.4%) and is depleted of stage 0 and stage I tumours (0% and 5.6%, respectively). Tumour microbial subtype 2 (TS2) represents 17% (32) of tumours and is defined mainly by the presence of M1-M3 bacteria. 84% of TS2 tumours are from the left side, rectosigmoid or rectum and mostly of grade 2 (75%). Finally, the largest subtype 3 (TS3) represents 54% (100) of tumours and is mostly missing the M3-5 bacterial groups as well as most of the high-grade related species, containing in median TS3 is thus characteristic by increased proportion of low grade tumours (15% grade 1). In TS3 microbial subtype, right-sided and left-sided tumours are equally represented.