Priority effects and microbial cross-feeding shape zoonotic agent spread in broiler chickens

doi:10.21203/rs.3.rs-3588367/v1

Download PDF

Research Article

Priority effects and microbial cross-feeding shape zoonotic agent spread in broiler chickens

https://doi.org/10.21203/rs.3.rs-3588367/v1

This work is licensed under a CC BY 4.0 License

You are reading this latest preprint version

Unravelling the colonisation dynamics and physiological effects of zoonotic bacteria such as Campylobacter is imperative to prevent foodborne diseases. We employed a hologenomic approach to jointly analyse metabolic networks and gene expression of the caecal microbiota, with the intestinal gene expression of 613 broiler chickens that did and did not undergo an opportunistic Campylobacter colonisation. We report that an early development of a distinct microbial enterotype enriched with Bacteroides fragilis_A, changed the community to a functional profile that likely benefited Campylobacter through production of key metabolites. The resulting enterotype was not associated with a host immune response, but exhibited an enriched and energetically more demanding functional repertoire compared to the standard enterotype, which could have caused the growth decline observed in Campylobacter-colonised animals. We provide unique insights into microbe-microbe and host-microbe interactions, which point to the early-stage microbiota-development as a relevant factor for later Campylobacter spread in broiler chickens.

Zoonotic bacteria responsible for foodborne diseases represent a significant global concern due to their public health and economic implications [1]. However, the efficacy of prevention strategies is often hindered by a limited understanding of the precise colonisation dynamics and the physiological effects within vector hosts [2, 3]. Fortunately, the emergence of multi-omic technologies has greatly enhanced our ability to bridge this knowledge gap and comprehend the ecological dynamics of zoonotic agents in source animals [4]. By facilitating the study of functional interactions among zoonotic bacteria, the animal host, and its associated microbial community, multi-omics offer invaluable insights crucial for the development of effective strategies to mitigate the impact of these diseases on human populations [5].

Campylobacteriosis is the most frequently reported zoonosis in the EU due to the presence of Campylobacteraceae strains in broiler chickens [6]. These bacteria are typically detected in chickens during the third week of their life, a period when the gut microbiota begins to stabilise after an initial rapid development characterised by significant species turnover [7, 8]. While Campylobacter has been considered a causative agent of microbiota rearrangements [9–11], the drivers of Campylobacter colonisation remain unclear [12]. The metabolic auxotrophies of Campylobacter render it a potential scavenger of metabolic by-products produced by other bacteria [13, 14]. Thus, prior alterations in the biochemical conditions during initial microbial succession could drive spread of Campylobacteraceae at a later stage [7, 9, 11, 15, 16]. The construction of metabolic networks using metagenome-assembled genomes now allows for the study of such interdependencies [17], enabling investigations into whether different paths of microbiota development can either facilitate or hinder the establishment and transmission of zoonotic strains.

The physiological effects of Campylobacteraceae on the host are also a subject of ongoing debate, as the literature presents contradictory findings regarding the relationship between Campylobacter and animal growth performance. While certain studies report no significant impact on chicken body weight [10, 18], others describe weight loss after Campylobacter colonisation [19, 20]. There is also considerable variability in the host immune response to the presence of Campylobacter [10, 21–23]. Furthermore, it remains unresolved whether these patterns are directly attributable to the action of Campylobacter itself or to the microbial community as a whole. The application of host and microbial (meta)transcriptomics enables us to delve into the actual interactions occurring between the two domains during different developmental stages.

In the H2020 project HoloFood [24], we conducted three five-week-long experimental replicates to understand the effect of host-microbiota interactions in broiler chicken production. In the last of the three trials, we detected an opportunistic colonisation of Campylobacter spp. in almost all chickens slaughtered after the third week [25]. While the first two experimental trials allowed us to characterise the functional dynamics of the caecal microbiota development [26], the third trial presented an exceptional opportunity to study the microbial and host alterations associated with Campylobacter colonisation with unprecedented resolution. We generated multiple omic data sets that encompassed host and microbial domains of the hologenomic landscape [27], namely genome-resolved metagenomics, microbial metatranscriptomics and chicken intestinal transcriptomics. By integrating those omic layers we compared the temporal development of the functional capabilities and activity of the caecal microbiota, along with the intestinal gene expression, of broiler chickens that experienced opportunistic Campylobacter colonisation and those that did not. We first analysed the alterations preceding Campylobacter spread using genome-scale metabolic networks (GSMNs), followed by host and microbial gene expression variation associated with the presence of Campylobacter and related bacteria.

A distinct microbiota development precedes the spread of Campylobacteraceae bacteria

We analysed the microbial communities of 7-, 21- and 35-day-old chickens from three experimental replicates (trials A, B and C) by mapping 613 metagenomic datasets generated from caecum content samples to a bacterial genome catalogue generated from the same pool of animals [26]. Our metagenomic analysis confirmed the initial detection of Campylobacter spp. through PCR screening [25]. All animals in trial C from day 21 onwards got colonised by at least one of the two Campylobacteraceae species, namely C. jejuni and C. coli. In addition, we detected species bacteria belonging to the Campylobacterales order, Helicobacter pullorum (Fig. 1a). C. jejuni and C. coli represent prominent zoonotic agents responsible for human diarrheal diseases in industrialised and developing countries [28], while H. pullorum is an emerging zoonotic pathogen linked to colitis and hepatitis in humans [29]. Although chickens are predominantly colonised by C. jejuni, co-colonisation with C. coli and Helicobacter strains is often detected, which can lead to either commensalism or competition between them [15, 30].

Community-level analyses revealed a robust association between the propagation of Campylobacterales strains in trial C and a distinctive microbiota development, contrasting with the trajectories observed in trials A and B (Fig. 1b). The distinct enterotype became evident at an early stage, manifesting noticeable differences in community composition (R² = 0.01, F-value = 2.02, p-value = 0.018) and functional profile (R² = 0.14, F-value = 49.05, p-value = 0.001) by the first sampling point at 7 days of age. This enterotype was characterised by an atypical dominance of Bacteroidota, primarily driven by Bacteroides fragilis_A according to Dirichlet Multinomial Mixture models (Fig. S1). B. fragilis_A was the most abundant bacteria in approximately half of the day-7 animals in trial C (Fig. 1c), in contrast to the standard microbiota development observed in trials A and B (Fig. S2). In those trials, Bacteroidales proliferated in the third week of age, after an initial period dominated by Lactobacillus and Lachnospiraceae clades, as previously reported in the literature [8]. B. fragilis_A is a non-spore-forming obligate anaerobe that usually colonises the chicken intestine at a later stage, likely due to its lower colonisation ability compared to spore-forming bacteria such as Lachnospiraceae, whose spores are widespread in the farm environment [8]. Our observations are therefore in line with previous studies that reported a link between Campylobacter and obligate anaerobes within orders Clostridiales and Bacteroidales [9, 11]

To delve into the potential causes of the development of a distinct enterotype, we studied how the abundance of B. fragilis_A distributed across experimental groups at day 7. We found that B. fragilis_A was not randomly distributed, but aggregated in pens, as indicated by the 79% of the variance explained by the random effect of pens. However, the distribution was not associated with pen-specific chicken characteristics such as genetic line (t-value = -0.91, p-value = 0.37) and sex (t-value = 0.24, p-value = 0.81). These observations suggest that the development of the distinct enterotype in trial C animals was primarily driven by the expedited colonisation of B. fragilis_A in a few animals, likely occurring before pen allocation, followed by subsequent transmission to pen-mates.

Bacteroides fragilis as a facilitator of Campylobacter colonisation

Despite inherent limitations in establishing causal associations from observational data, our investigation into bacterial metabolic dependencies unveiled several molecular mechanisms that shed light on how the early development of a Bacteroides-enriched enterotype might promote the subsequent proliferation of Campylobacter strains [11, 15]. To quantify these metabolic dependencies, we employed 822 genome-scale metabolic networks (GSMNs) [31] constructed using Pathway Tools [32], and grounded on EggNOG annotations [33]. We categorised each metabolite for each bacterium as a source, transit, or sink metabolite based on its capacity for utilisation or production. Source metabolites are those that a bacterium can utilise but not produce, transit metabolites can be both produced and consumed, and sink metabolites are produced but not used (Fig. 2a).

We determined the existence of 385 and 395 source metabolites for C. jejuni and C. coli, respectively, of which 32.2% and 32.7% could be produced by other members of the community. Both strains shared 95% of their metabolic networks, diverging only in C. coli's superior capability to metabolise some by-products such as glutathione, succinate, and oxaloacetate (Fig. S3). To assess whether the distinct enterotype conferred metabolic advantages to Campylobacter, we calculated the weighted capacity of each enterotype to produce those source metabolites at day 7. Our findings revealed that the distinct enterotype exhibited a higher capacity compared to the standard one (LMM, estimate = 2.23, t-value = 5.43, p-value < 0.01) in producing source metabolites that Campylobacter cannot synthesise on its own (Fig. 2b). Broadening our analysis, we observed that 1,043 out of the 4,533 identified metabolites were overrepresented in the distinct enterotype, encompassing 31 and 35 source metabolites for C. jejuni and C. coli, respectively, which can be potentially produced by the rest of the community (Table S1).

In vitro assays suggested Bacteroides as a potential facilitator of Campylobacter colonisation via the provision of free sugars and short-chain fatty acids (SCFAs) [13, 14, 34]. Our joint GSMN of B. fragilis_A and C. jejuni highlighted numerous ways in which Bacteroides could contribute to Campylobacter through the production of relevant metabolic by-products (Fig. 2c). However, we found no evidence of an enhanced genomic capacity of the distinct enterotype for polysaccharide degradation (Fig. 2h) or SCFA production (Fig. 2i). As the dominant taxa in the standard enterotype (e.g., Lachnospirales and Oscillospirales) also possess these metabolic attributes [35], it is unlikely that Campylobacter colonisation was primarily linked to these metabolites.

Nonetheless, we identified other metabolites that were likely to play pivotal roles in the Bacteroides-Campylobacter interaction. The two most relevant source metabolites for Campylobacter were coproporphyrin III and (R)-citramalate, as they stood out due to their pronounced differences between enterotypes (Figs. 2d, 2e) and their classification as sink metabolites for B. fragilis_A. This indicates that B. fragilis_A likely overproduces these metabolites, which may become available for Campylobacter. Coproporphyrin III is an essential component of one of the three haem biosynthesis pathways. Haem is an iron-chelated modified tetrapyrrole and is a key compound for proteins involved in several essential cellular processes [36]. (R)-citramalate is a metabolic intermediate that participates in the synthesis of tricarboxylic acids [37] and is known to be a substrate of the alternative threonine-independent isoleucine synthesis pathway [38]. We quantified gene expression and verified the utilisation of both metabolites by Campylobacter strains (EC:4.99.1.9 and EC:4.2.1.35) and the production of coproporphyrin III by B. fragilis_A (EC:1.3.3.15).

Two other source metabolites for Campylobacter, MOCS3-Cysteine and sulphate, were also disproportionately prevalent in the distinct enterotype (Fig. 2f, 2g). However, unlike the previously mentioned metabolites, B. fragilis_A has the capacity to utilise them. MOCS3-Cysteine, a sulphur transferase enzyme crucial for molybdopterin biosynthesis, plays a pivotal role in the formation of redox enzymes [39]. Sulphate can be reduced to hydrogen sulphide, required for cysteine synthesis [40]. Despite reported auxotrophies related to sulphate assimilation in Campylobacter [41], our strains exhibited gene expression for the enzymes responsible for consuming these metabolites (EC:2.7.7.4 and EC:2.8.1.7, respectively). Our results therefore suggest that priority effects, whereby the order of microbial species colonisation influences longer term microbiome composition, likely play a central role in shaping temporal Campylobacter dynamics. Specifically, the Bacteroides-dominated enterotype creates a metabolically favourable environment for Campylobacter establishment and colonisation, likely facilitated by acquisition of compounds involved in central metabolic processes.

The distinct enterotype correlates with host body weight

Chickens from trial C not only underwent a distinct enterotype development followed by the spread of Campylobacterales strains, but also exhibited a significantly reduced growth performance as compared to chickens from trials A and B (Fig. S4). Unlike in humans, Campylobacterales strains do not cause disease symptoms in chickens [42]. However, impaired performance has been observed in multiple trials, which has fueled discussions about strain-specific mechanisms by which Campylobacter could affect chicken growth [12, 42]. In light of this, we posed two non-exclusive hypotheses on how microbe-host interactions could have contributed to the reduction of animal growth: i) colonisation by Campylobacterales triggers an inflammatory response, which hinders the correct functioning of the intestine and deviates energy from growth to immunity [21]; ii) the distinct enterotype is functionally different, which affects host energy balance [26].

To assess whether the distinct enterotype triggered a persistent pro-inflammatory response in the host's intestine, we performed differential expression analyses between the two enterotypes. The study of 169 host transcriptomic datasets derived from caecal mucus samples collected at three distinct time points, revealed no substantial differences in the expression profiles between animals hosting each enterotype. Although we detected the largest difference at day 35, with 36 differentially expressed genes (Fig. S5, Table S2), no clear Gene Ontology or KEGG pathway enrichment could be observed, thus yielding no evidence of an inflammatory response from the host. The critical window in the immune cell development is identified between days 14 and 28, in which certain bacteria play a key role in their maturation process [43]. Campylobacter is recognised by Toll-like receptors and can induce an inflammatory response by increasing expression of cytokines and immune-associated genes [10, 21, 22]. In fact, enzyme immunoassay conducted in blood did detect a significant peak of C-reactive protein in the distinct enterotype at day 21 [25]. This time point coincides with the initial detection of Campylobacter, which pointed towards a possible response from the host. Nonetheless, neither the rest of inflammation (haptoglobin-like protein) nor stress (corticosterone) biomarkers analysed in the same animals pointed towards a significant inflammatory response [25]. We therefore deem unlikely that the observed growth deceleration in chickens from trial C was due to an immune response towards Campylobacteraceae bacteria.

Instead, we hypothesised that the lower body weights associated with the distinct enterotype could result from a heightened metabolic capacity of the microbial communities. An increased metabolic demand of the caecal microbiota might cause the microbial community to compete for resources with the host, restricting the host’s absorption of nutrients in the proximal part of the caecum [26]. Once validated that the distinct enterotype had higher metabolic capacities than the standard in the first weeks of the trials (Fig. S6), we explored whether these capacities were actually realised. For this purpose, we compared the metabolic activity of both enterotypes across the three time points. We distilled microbial gene expression data from 125 microbial metatranscriptomic datasets into 170 quantitative Genome-Inferred Functional Traits (GIFTs) (Table S3) per genome, by pondering gene expression values according to the weight of each gene in each metabolic pathway. Community level analyses showed that the distinct enterotype tended to overexpress genes involved in organic anion biosynthesis (B06) and, particularly, nitrogen compound degradation (D06) already from day 7, but exacerbated at days 21 and 35 (Figs. 3a, 3b, 3c). Bacteroidales and Campylobacterales were the main contributors to the B06 and D06 function groups (Fig. 3d), as different strains of the phylum Bacteroidota emerged at days 21 and 35 (Fig. S2). In addition, the decline of Oscillospirales and Lachnospirales clades in the distinct enterotype caused a reduction of amino acid derivative biosynthesis (B03) and lipid degradation (D01). The D06 function group consists mainly in nitrate, urate, taurine and hypotaurine degradation. B06, although comprising biosynthesis pathways, is derived mainly from degradation of lipids, proteins and carbohydrates through Krebs cycle and other processes to produce succinate, fumarate and citrate, which together with D06 points towards a higher catalytic activity of microbes. Our results are in line with previous studies which reported that the reduced body weight gain in Campylobacter-colonised chickens could be due to the extensive amino acid utilisation by Campylobacter, which caused lower concentrations of amino acids in the ileum, and a reduced expression of peptide and amino acid transporters in the caecum [19, 22]. Nevertheless, our data suggests that such heightened energy utilisation should not only be attributed to Campylobacter, but also extended to Bacteroidalesstrains, which collectively contributed to the increased metabolic action of the distinct enterotype.

Foodborne zoonotic bacteria not only give rise to infectious diseases in humans and have environmental implications, but also impose significant economic and resource burdens on the meat industry [44]. Nevertheless, current methods for prevention and early detection of zoonotic agents often fall short, partly due to our limited knowledge about their interactions with the rest of the gut microorganisms and host animals. Our multi-omic study, focusing on the temporal development of functional capacities and activity within Campylobacterales bacteria and the rest of the microbiota, unveiled numerous novel insights into these intricate interactions. We observed that the distinct enterotype preceding the widespread emergence of Campylobacterales demonstrated a heightened ability to meet the metabolic demands of Campylobacter spp., in contrast to the standard enterotype associated with Campylobacter-free animals. While it is worth noting that the enterotypes presented in this study are likely only two of many possible microbiota development trajectories [9, 11, 15], our findings suggest that metabolic interdependencies and priority effects significantly influence the likelihood of Campylobacter colonisation within chicken gut environments. Notably, the emergence of this distinct enterotype primarily led by Bacteroides fragilis_A opens the possibility of using the early presence of this bacteria as a biomarker for subsequent Campylobacter colonisation. While manipulation of the early-life microbiota followed by experimental infection with zoonotic strains will be necessary to validate our results, these findings pave the way for exploring strategies to manipulate early-life microbiota compositions that minimise metabolic advantages for Campylobacter.

The colonisation of Campylobacter spp. was associated with a marked reduction of body weight in the studied chickens. However, neither the gene expression analyses conducted on intestinal mucosal samples, nor the examination of numerous complementary markers, revealed clear indications of a systemic immune response to the presence of Campylobacter spp. or the distinct enterotype. We nevertheless observed notable disparities in the metabolic capacities and activities of bacteria in both enterotypes, with the distinct enterotype exhibiting enhanced activity across most metabolic domains, with a particular emphasis on nitrogen compound utilisation. These results align with previous observations that indicated a negative impact of increased metabolic activity on animal growth, likely stemming from increased competition for nutritional resources [26]. Therefore, observed correlations between the presence of Campylobacter spp. and reduced body weight, in the absence of an inflammatory response, may be attributed to a distinct metabolic activity of the associated microbiota.

In summary, our study underscores the importance of studying zoonotic bacteria, their accompanying microbiota and the host organism in combination, all while harnessing the power of multi-omic technologies. Only through the high-resolution functional analysis of the three mentioned elements will we be able to resolve the complex tripartite interactions between them, and in doing so gain knowledge to develop novel sustainable strategies to improve safety and sustainability of poultry production.

Animal experiments

A total of 613 animals were sampled in the three experimental replicates (trials A, B, and C) performed in 2019 within the H2020 project HoloFood [24]; 205, 182 and 226 birds from A, B and C trials respectively. Broiler chickens from two genetic lines (Cobb 500 and Ross 308) and both sexes were reared in intensive farm conditions for 35-37 days. Each trial comprised 24 pens (12 groups replicated twice distributed in 3 treatments x 2 genetic lines x 2 sexes), each pen containing 40 animals. More details about the experimental design, diet and performance results are available in Tous et al. [25]. Chickens were euthanized, weighed, and sampled at days 7–8, 21–22 and 35–37 (multiple days were necessary due to workload, and these differences have been accounted for in the statistical analyses), hereafter simplified to three time points (days 7, 21 and 35). Caecal pathogens detection (Salmonella spp., Campylobacter spp., and Clostridium spp.) procedures using PCR are explained in detail in Tous et al. [25]. Molecular data was obtained from three different sections of the caecum. In short, the end of one of the caecum bags was isolated and longitudinally opened to gently collect ca. 100 mg of digesta for each metagenomic and metatranscriptomic analysis. After carefully washing the intestinal surface with saline solution, the mucosal layer was scraped and ca. 100 mg of mucosa were collected for host transcriptomic analyses. All types of samples were preserved in DNA/RNA Shield buffer (Zymo) and stored at -20 ºC until nucleic acid extractions.

Data generation

DNA and RNA extraction

A total of 613 metagenomic, 125 metatranscriptomic and 169 host transcriptomic datasets were generated. Both nucleic acids were extracted using a custom purification method optimised for samples preserved on DNA/RNA Shield buffer [45]. The protocol consisted in a bead-beating for tissue disruption, followed by digestion, nucleic acid separation (DNA and RNA) and purification steps. Samples were processed in batches of 90 samples, along with 6 extraction, library preparation, and library indexing blanks (2x2x2). Samples within each batch were randomised using a custom script, but different sample types were not mixed to minimise the risk of cross-contamination due to DNA concentration differences.

Library preparation of metagenomic DNA

The nucleic acids extracted were fragmented to obtain an average length of 400 bp using a Covaris LE220 ultrasonication device. A standard amount of 200 ng of DNA was used for the library preparation. We used the BEST [46] ligation-based library preparation protocol to prepare sequencing libraries. In order to evaluate the success of the libraries, we conducted quality controls using qPCR assays. The optimal number of cycles was estimated to achieve the desired DNA molarity while reducing clonality. Any libraries that exceeded 12 cycles were repeated for library preparation due to potential technical biases. Subsequently, libraries were indexed using unique dual tags, along with the necessary PCR cycles. Before final quality-checks were performed with a DNA Fragment Analyser (Agilent), bead purification was carried out. Libraries with expected fragment-size distributions and molarities were equimolarly pooled for sequencing. Libraries that showed too low molarities were re-indexed to achieve the desired molarity, and the ones exhibiting unusual fragment distributions and large adaptor dimers were re-built. Sequencing was performed in multiple BGIseq runs with 150bp paired-end chemistry. Sequencing effort per sample typically varied between 8GB and 16GB, equivalent to 26 and 52 million reads.

Library preparation of metatranscriptomic RNA

rRNA depletion was performed using TIANSeq rRNA Depletion Kit (Animal) (Cat.No. NR101-T1), and the remaining RNA was fragmented into 250-300bp, to finally reverse-transcribe into double stranded cDNA with random hexamers. Total cDNA was sent to Novogene for library preparation and sequencing. In short, cDNA libraries were constructed using Novogene NGS RNA Library Prep Set (PT042), which comprises end repair, A-tailing and adapter ligation steps. The libraries were checked with Qubit and real-time PCR for quantification and Bioanalyzer (Agilent) for size distribution detection. Quantified libraries were pooled and sequenced on an Illumina NovaSeq 6000 platform with 150bp paired-end chemistry, aiming for 5GB of protein-coding gene data.

Library preparation of chicken transcriptomic RNA

Total RNA was quantified using Nanodrop (Thermo Scientific) and Bioanalyzer 2100 (Agilent), as well as analysed for integrity (Agilent 2100) and purity (agarose electrophoresis and Nanodrop). Samples were subjected to rRNA removal step by poly-A enrichment, using poly-T oligo-attached magnetic beads. After fragmentation, the first strand cDNA was synthesised using random hexamer primers. During the second strand cDNA synthesis, dUTPs were replaced with dTTPs in the reaction buffer. Total cDNA was sent to Novogene for library preparation and sequencing. The directional libraries were ready after end repair, A-tailing, adapter ligation, size selection, USER enzyme digestion, amplification, and purification. Libraries were checked with Qubit and real-time PCR for quantification, as well as with bioanalyzer for size distribution detection. Quantified libraries were pooled and sequenced on NovaSeq 6000 (Illumina), according to effective library concentration and data amount required.

Bioinformatic data processing

Generation of the MAG catalogue

Details on the procedures employed to create the MAG catalogue used in this study will be published, and code can also be accessed at Workflowhub (https://workflowhub.eu/programmes/28). In short, data from 261 caecal metagenomic samples collected from chickens from the three experiments were used to generate the caecal MAG catalogue. We performed de novo metagenomic assemblies using the MGnify assembly pipeline [47]. The assembly tool MetaSPAdes [48] was used preferentially for single-run assemblies, while MEGAHIT [49] was used for co-assemblies when memory requirements for MetaSPAdes were too high. Samples prioritised for co-assembly were selected by hierarchical clustering based on Jaccard distance between low-quality bins generated by single assembly. Contigs shorter than 500 base pairs were excluded, and further host, human and PhiX decontamination was performed post-assembly with blastn [50]. Contig binning was performed using ‘binning’ and ‘bin_refinement’ modules of metaWRAP’s. Genome quality assessment was conducted using checkM [51], with the criteria of retaining genomes with completeness >50%, contamination <5%, and a quality score (QS) >50 (where QS = completeness - 5*contamination). Genomes were de-replicated using an Average Nucleotide Identity (ANI) of 95%, and 30% alignment fraction to generate species-level clusters using dRep [52]. Lastly, GUNC [53] was employed to identify potentially chimeric genomes for subsequent removal, utilising specific parameters that included a clade separation score >0.45, contamination >0.05, and reference representation score >0.5.

Functional annotation and distillation of MAG catalogue

Taxonomy annotation and phylogenetic tree construction was carried out using GTDB-Tk [54]. Functional annotation of the MAGs was performed through an ensemble approach implemented in DRAM [55]. This approach incorporates data from various databases, including Pfam [56], KEGG [57], UniProt [58], CAZY [59] and MEROPS [60]. To distil these annotations into quantitative genome-inferred functional traits (GIFTs) representing metabolic capacities provided by the microbiota to its host, we used the R package DistillR, which can be found at the following link: (https://github.com/anttonalberdi/distillR). DistillR contains a set of >300 metabolic curated metabolic pathways and modules derived from KEGG and MetaCyc [61] databases, which are used to obtain quantitative estimates of the metabolic capacities of microorganisms through quantifying the relative representation of genes required for accomplishing a metabolic task. GIFTs range between 0-1, the zero indicating none of the genes defined in the pathway are present in the genome and one indicating that all genes are present. In cases where a step within a pathway requires the presence of two Identifiers, the step is considered full if both Identifiers are present, half full if one is present, and empty if none is present. We quantified 170 GIFTs per genome (complete detailed list can be found in Supplementary Table S1), whose values were first corrected by MAG genome completeness to reduce functional biases [62], and then averaged to obtain a genome-level overall metabolic capacity metric, hereafter referred to as Metabolic Capacity Index (MCI). We also distilled microbial gene expression data into 170 GIFTs per genome, by weighing gene expression values according to the weight of each gene in each metabolic pathway.

Genome-Scale Metabolic Networks

A genome-scale metabolic network (GSMN) is a comprehensive representation of all the metabolic reactions that occur within an organism. It is constructed based on the genomic information of the organism and integrates biochemical and genetic knowledge to capture the complexity of the organism's metabolism. We employed the software metage2metabo [17], which in turns relies on Pathway Tools [63] to reconstruct the GSMNs of every bacterial genome in our study using a custom snakemake [64] pipeline. Shortly, due to software dependencies, bacterial genomes were re-annotated using eggnog-mapper2 [65] against the eggNOG 5.0 database [33]. The annotation files were transformed into Genbank annotation files (gbk) using ‘emapper2gbk’ and SBML files generated using ‘m2m recon’, as implemented in metage2metabo. SBML files were analysed using the package COBRApy [66] and custom python scripts to quantify source, transit and sink metabolites as explained below.

Metagenomic data processing and read mapping

Sequencing adapters and identical duplicates were filtered out using AdapterRemoval 2.2.4 [67] and seqkit 0.7.1 [68]. Sequences were mapped to the latest chicken reference genome (galGal6, NCBI Assembly accession GCF_000002315.6) using bwa [69] increasing the minimum seed length to 25 to minimise the likelihood of incorrect read pair alignments from the metagenomic fraction. To evaluate the quality of the alignment, mapping statistics including depth and breadth of coverage, and percentage of mapped reads were calculated using SAMtools 1.11 [70]. Aligned reads were sorted and the metagenomic fraction was isolated using SAMtools. Metagenomic reads were mapped to the MAG catalogue using bwa at 90% identity and 60% coverage threshold and further summarised with samtools. Read-mapping counts resulting in < 30% genome coverage per sample were removed from further analysis. Retained read mapping counts were divided by the total number of paired-reads per sample, and multiplied by 100 to give the percentage of reads mapped to the MAG catalogue for each sample. Relative abundance was estimated by adapting the RPKG (Reads Per Kilobase per Genome equivalent) formula provided by Nayfach and Pollard. It is referred to as RPMM (Reads Per Million bases of genome, per Million mapped reads), as reads mapped to MAGs were normalised both by genome length (divided by 1M) and by read length (divided by 1M).

Metatranscriptomic data processing and read mapping

We employed a custom snakemake pipeline for preprocessing metatranscriptomic data (https://github.com/anttonalberdi/holoflow/tree/EisenRa/workflows/metatranscriptomics). In short, reads were trimmed and quality controlled using fastp [71], keeping reads >60 bp and with Phred scores >20. Processed reads were then mapped against the host genome (galGal6) using STAR [72]. The unmapped reads were subsequently mapped to a combined database containing SILVA 16S rRNA SSU and LSU NR 99 [73], as well as 5SRNAdb [74] using Bowtie2 [75] with default parameters. Unmapped reads were then mapped to the MAG catalogue genes (outputted from DRAM; genes.fna.gz) using Bowtie2 with default parameters. Finally, gene read counts were calculated using CoverM (https://github.com/wwood/CoverM), requiring both pairs of reads to hit the gene (--proper-pairs-only flag).

Chicken transcriptomic data processing and read mapping

Raw transcriptomic reads were quality-filtered using fastp, mapped against the host reference genome using STAR, and gene count data extracted using the gene count option. Each sample yielded on average 12.5±3.2 million reads against the 24,131 genes annotated in the chicken reference genome (galGal6).

Data analysis

Metagenomic data analysis

Metagenomic counts were standardised by MAG length and sequencing depth. Dirichlet Multinomial Mixtures (DMM) [76] were utilised to profile and identify enterotypes in chicken microbial communities. Models were run by setting the maximum allowed number of community types to 5. A total of three runs were performed, one for each sampling day. At day 7, two community profiles were defined, where half of individuals from trial C formed an enterotype, and the rest of individuals from trials C grouped together with animals from trials A and B, forming another enterotype. At day 21, the model detected two clearly defined enterotypes, where in one of them all animals from trials A and B clustered together, and in the other enterotype all animals from trial C. At day 35, three enterotypes that were consistent with trials were detected. Thus, we defined microbial enterotypes as the distinct and standard enterotypes. The distinct enterotype comprised chickens from trial C that grouped separately from the rest at day 7, and the rest of the chickens from trials C at days 21 and 35. The rest of enterotypes were grouped together under the standard enterotype. Top community driver bacteria (i.e. MAGs with highest contribution to discriminate between enterotypes) were identified by selecting the 3% of MAGs with the highest posterior probabilities in DMM analysis.

To assess the temporal development of the composition of microbial communities across time, the MAG sequence count table was transformed using centred log-ratio (CLR) [77] and submitted to the constrained ordination from R package vegan [78]. MAG sequence count table was constrained by the factor trial (categorical variable with three levels: trials A. B and C), sampling time (categorical variable with three levels: days 7, 21 and 35) and their interaction. The significance of the constraining variables was tested using 999 permutations. To test the null hypothesis of no differences between enterotypes in microbial composition and function at day 7, PERMANOVAs were fitted through the ‘adonis2’ function of R package vegan. Euclidean distance matrices of CLR-transformed microbial abundances and functional profiles were included as responses in the PERMANOVAs and trial, enterotype, chicken age, sex, genetic line and treatment were included as explanatory variables. P-values were generated with 999 permutations.

A cladogram derived from the GTDB [79] tree constructed by GTDB-tk for taxonomic annotation was built with R package ggtree [80]. Tips of reference genomes were pruned using the ‘keep.tips’ function included in the R package ape [81]. Relative abundances of each bacterial genome of the catalogue for both enterotypes at each sampling time were illustrated with barplots after counts were standardised by MAG length and sequencing depth.The order of the bacteria was based on the tree obtained with GTDB.

To assess the drivers of the relative abundance of B. fragilis_A (the main indicator of the distinct enterotype at day 7) on trial C and day 7 we used linear mixed effect models as implemented in the R package lme4 [82]. P-values for the fixed effects were computed with the R package lmerTest [83]. CLR-transformed abundance of B. fragilis_A was used as response variable and trial, treatment, chicken age, sex and genetic line were included as fixed explanatory variables. To account for the fact that chickens were nested within pens we included a pen-level random intercept (1|pen). Then, we calculated the marginal and conditional R² using the R package MuMIn [84]: marginal R² captures the variance explained by fixed effects whereas the conditional R² quantifies the variance explained by fixed and random effects together. Therefore, the variance associated with random effects (between-pen variance in relative abundance of B. fragilis_A) was calculated by subtracting the marginal from the conditional R².

Genome-Scale Metabolic Network analysis

General statistics of metabolic properties were calculated for each bacterial genome using custom python functions. These included listing and quantifying source, transit and sink metabolites. Source metabolites were defined as metabolites that a given bacteria is able to use as reactant in at least one metabolic reaction inferred from the genomic information, but that the bacteria is unable to produce itself. Therefore, source metabolites have to be acquired from elsewhere, and can potentially be provided by other bacteria. Transit metabolites were defined as any metabolite that a bacteria can produce and utilise, while sink metabolites are a special case of transit metabolites, defined as metabolites that a bacteria can produce but is unable to use itself. Sinkmetabolites and, potentially transit metabolites, are therefore metabolic by-products that may become available for other bacteria.

Using the GSMNs of all 882 characterised bacteria, we calculated the community-weighted average number of source metabolites for Campylobacter that the microbial community associated with each chicken at day 7 could potentially produce. Additionally, we calculated the capacity of each enterotype to produce specific source metabolites for Campylobacter. We did so by calculating the cumulative relative abundance of bacteria in each sample collected at day 7, capable of producing each metabolite. First, to test the null hypothesis of no difference between enterotypes to produce source metabolites for Campylobacter,we used linear mixed models as implemented in the R package lme4. The capacity of each enterotype to produce source metabolites was used as response variable and enterotype, trial, treatment, chicken age, sex and genetic line were included as fixed explanatory variables. A pen-level random intercept (1|pen) was used to account for the nested design. The assumptions of homoscedasticity and normal distribution of errors were verified with visual inspection of residual plots. Then, the null hypothesis of no difference in capability of producing specific source metabolites for Campylobacter between enterotypes was tested using generalised linear mixed models using the function glmmPQL() of R package MASS [85]. In this case, the response variables were fractional (i.e. they take values between 0 and 1), thus we used a quasibinomial distribution with logit link function, which allowed us modelling the fractional response variables while accounting for under- or overdispersion and obtaining robust standard errors [86]. The set of fixed explanatory variables and random effects were the same as in above linear mixed models. Since multiple metabolites were tested consecutively, P-values were corrected for multiple testing using the Benjamini-Hochberg false discovery rate procedure [87].

Metatranscriptomic data analysis

Quantitative GIFTs were calculated with ‘distillq’ function using the R package distillR. To explore the temporal development of the functional expression profile of the standard and distinct chicken caecum enterotypes, the CLR-transformed community level quantitative GIFT profiles were ordinated with the constrained ordination RDA using ‘rda’ command from R package vegan. The ordination was constrained with the continuous variable chicken age, the categorical variable enterotype and their interaction. The significance of the factors was assessed using 999 permutations.

To identify specific quantitative GIFTs differentially expressed between standard and distinct chickens at different time points, linear mixed effect models were used with the R package lme4. CLR-transformed community level quantitative GIFTs were used as response variables in linear mixed models. As fixed explanatory variables in the models we used the enterotype, chicken age, sex, genetic line and treatment, and a pen-level random intercept (1|pen) was included to account for the nested design. P-values were corrected for multiple testing using the Benjamini-Hochberg false discovery rate procedure.

Lastly, to assess which bacterial strains were contributing the most to the expression of specific functions at different time points and enterotypes, we calculated the average relative expressions (given as gene counts per million) of specific functions for each MAG, at different combinations of enterotype and sampling time.

Chicken transcriptomic data analysis

Gene counts were processed with the tidybulk R package [88]. Briefly, gene counts were imported with the tidyverse metapackage [89]. Then, counts were normalised using the TMM method from edgeR [90]. Samples were clearly differentiated by sex and age. Thus, sex was a controlled variable for subsequent analyses. No statistical differentiation between breeds could be observed. We set as confounding variables the sex, breed, laboratory (two sample extraction batches) and sequencing batch (three sequencing batches). Then, samples were compared for differential expression between trials A and B versus C at the three sampling times. The method chosen for differential expression was the one implemented in edgeR. p-values were corrected for multiple testing using the Benjamini-Hochberg method [87]. Analysis of overrepresented Gene Ontologies [91] and KEGG pathways was done with clusterProfiler [92].

Data availability

All raw DNA and RNA sequences, and the MAG catalogues are available under HoloFood’s umbrella project on ENA (Project ID: PRJEB43192) and displayed in the HoloFood Data Portal (www.holofooddata.org). Bioinformatic resources including ENA accession numbers, scripts, data matrixes and files have been archived in Zenodo with the DOI 10.5281/zenodo.8429925, as a release of the following Github repository: https://github.com/holochicken/priority_effects.

Acknowledgments

This research was funded by the European Union’s Horizon Research and Innovation Programme under grant agreement No. 817729 (HoloFood, Holistic solution to improve animal food production through deconstructing the biomolecular interactions between feed, gut microorganisms, and animals in relation to performance parameters), as well as the Danish National Research Foundation grant DNRF143. The work of S.M. was supported by the Basque Government doctoral fellowship. The visit of E.S. was funded by TUBITAK 2214-A International Research Scholarship.

Abebe E, Gugsa G, Ahmed M. Review on Major Food-Borne Zoonotic Bacterial Pathogens. J Trop Med. 2020;2020:4674235.
Sahoo M, Panigrahi C, Aradwad P. Management strategies emphasizing advanced food processing approaches to mitigate food borne zoonotic pathogens in food system. Food Front. 2022;3:641–65.
Abd El-Hack ME, El-Saadony MT, Shehata AM, Arif M, Paswan VK, Batiha GE-S, et al. Approaches to prevent and control Campylobacter spp. colonization in broiler chickens: a review. Environ Sci Pollut Res Int. 2021;28:4989–5004.
Salmon-Divon M, He YO, Kornspan D, Wen ZT. Editorial: Omics approach to study the biology and virulence of microorganisms causing zoonotic diseases. Front Microbiol. 2022;13.
Bäumler AJ, Sperandio V. Interactions between the microbiota and pathogenic bacteria in the gut. Nature. 2016;535:85–93.
Food Safety Authority E. The European Union one health 2020 zoonoses report. EFSA. 2021.
Ijaz UZ, Sivaloganathan L, McKenna A, Richmond A, Kelly C, Linton M, et al. Comprehensive Longitudinal Microbiome Analysis of the Chicken Cecum Reveals a Shift From Competitive to Environmental Drivers and a Window of Opportunity for Campylobacter. Front Microbiol. 2018;9:2452.
Rychlik I. Composition and Function of Chicken Gut Microbiota. Animals. 2020.
Thibodeau A, Fravalo P, Yergeau É, Arsenault J, Lahaye L, Letellier A. Chicken Caecal Microbiome Modifications Induced by Campylobacter jejuni Colonization and by a Non-Antibiotic Feed Additive. PLoS One. 2015;10:e0131978.
Connerton PL, Richards PJ, Lafontaine GM, O’Kane PM, Ghaffar N, Cummings NJ, et al. The effect of the timing of exposure to Campylobacter jejuni on the gut microbiome and inflammatory responses of broiler chickens. Microbiome. 2018;6:88.
Yan W, Zhou Q, Yuan Z, Fu L, Wen C, Yang N, et al. Impact of the gut microecology on Campylobacter presence revealed by comparisons of the gut microbiota from chickens raised on litter or in individual cages. BMC Microbiol. 2021;21:290.
Awad WA, Hess C, Hess M. Re-thinking the chicken-Campylobacter jejuni interaction: a review. Avian Pathol. 2018;47:352–63.
Garber JM, Nothaft H, Pluvinage B, Stahl M, Bian X, Porfirio S, et al. The gastrointestinal pathogen Campylobacter jejuni metabolizes sugars with potential help from commensal Bacteroides vulgatus. Commun Biol. 2020;3:2.
Luijkx YMCA, Bleumink NMC, Jiang J, Overkleeft HS, Wösten MMSM, Strijbis K, et al. Bacteroides fragilis fucosidases facilitate growth and invasion of Campylobacter jejuni in the presence of mucins. Cell Microbiol. 2020;22:e13252.
Kaakoush NO, Sodhi N, Chenu JW, Cox JM, Riordan SM, Mitchell HM. The interplay between Campylobacter and Helicobacter species and other gastrointestinal microbiota of commercial broiler chickens. Gut Pathog. 2014;6:18.
Han Z, Willer T, Li L, Pielsticker C, Rychlik I, Velge P, et al. Influence of the Gut Microbiota Composition on Campylobacter jejuni Colonization in Chickens. Infect Immun. 2017;85.
Belcour A, Frioux C, Aite M, Bretaudeau A, Hildebrand F, Siegel A. Metage2Metabo, microbiota-scale metabolic complementarity for the identification of key species. Elife. 2020;9:e61968.
Munoz LR, Bailey MA, Krehling JT, Bourassa DV, Hauck R, Pacheco WJ, et al. Effects of dietary yeast cell wall supplementation on growth performance, intestinal Campylobacter jejuni colonization, innate immune response, villus height, crypt depth, and slaughter characteristics of broiler chickens inoculated with Campylobacter jejuni at d 21. Poult Sci. 2023;102:102609.
Awad WA, Smorodchenko A, Hess C, Aschenbach JR, Molnár A, Dublecz K, et al. Increased intracellular calcium level and impaired nutrient absorption are important pathogenicity traits in the chicken intestinal epithelium during Campylobacter jejuni colonization. Appl Microbiol Biotechnol. 2015;99:6431–41.
Kollarcikova M, Kubasova T, Karasova D, Crhanova M, Cejkova D, Sisak F, et al. Use of 16S rRNA gene sequencing for prediction of new opportunistic pathogens in chicken ileal and cecal microbiota. Poult Sci. 2019;98:2347–53.
Humphrey S, Chaloner G, Kemmett K, Davidson N, Williams N, Kipar A, et al. Campylobacter jejuni is not merely a commensal in commercial broiler chickens and affects bird welfare. MBio. 2014;5:e01364–14.
Awad WA, Aschenbach JR, Ghareeb K, Khayal B, Hess C, Hess M. Campylobacter jejuni influences the expression of nutrient transporter genes in the intestine of chickens. Vet Microbiol. 2014;172:195–201.
Han Z, Willer T, Pielsticker C, Gerzova L, Rychlik I, Rautenschlein S. Differences in host breed and diet influence colonization by Campylobacter jejuni and induction of local immune responses in chicken. Gut Pathog. 2016;8:56.
HoloFood. CORDIS. https://cordis.europa.eu/project/rcn/218793/factsheet/en.
Tous N, Marcos S, Boroojeni F, Pérez de Rozas A, Zentek J, Estonba A, et al. Novel Strategies to Improve Chicken Performance and Welfare by Unveiling Host-Microbiota Interactions through Hologenomics. Front Physiol. 2022. https://doi.org/10.3389/fphys.2022.884925.
Marcos S, Odriozola I, Eisenhofer R, Aizpurua O, Tarradas J, Martin G, et al. Reduced metabolic capacity of the gut microbiota associates with host growth in broiler chickens. ResearchSquare. 2023.
Nyholm L, Koziol A, Marcos S, Botnen AB, Aizpurua O, Gopalakrishnan S, et al. Holo-Omics: Integrated Host-Microbiota Multi-omics for Basic and Applied Biological Research. iScience. 2020;23:101414.
Kaakoush NO, Castaño-Rodríguez N, Mitchell HM, Man SM. Global Epidemiology of Campylobacter Infection. Clin Microbiol Rev. 2015;28:687–720.
Javed S, Gul F, Javed K, Bokhari H. Helicobacter pullorum: An Emerging Zoonotic Pathogen. Front Microbiol. 2017;8:604.
Rzeznitzeck J, Breves G, Rychlik I, Hoerr FJ, von Altrock A, Rath A, et al. The effect of Campylobacter jejuni and Campylobacter coli colonization on the gut morphology, functional integrity, and microbiota composition of female turkeys. Gut Pathog. 2022;14:33.
Burgard AP, Nikolaev EV, Schilling CH, Maranas CD. Flux coupling analysis of genome-scale metabolic network reconstructions. Genome Res. 2004;14:301–12.
Karp PD, Midford PE, Billington R, Kothari A, Krummenacker M, Latendresse M, et al. Pathway Tools version 23.0 update: software for pathway/genome informatics and systems biology. Brief Bioinform. 2021;22:109–26.
Huerta-Cepas J, Szklarczyk D, Heller D, Hernández-Plaza A, Forslund SK, Cook H, et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res. 2019;47:D309–14.
Fan Y, Ju T, Bhardwaj T, Korver DR, Willing BP. Week-Old Chicks with High Bacteroides Abundance Have Increased Short-Chain Fatty Acids and Reduced Markers of Gut Inflammation. Microbiol Spectr. 2023;:e0361622.
Vacca M, Celano G, Calabrese FM, Portincasa P, Gobbetti M, De Angelis M. The Controversial Role of Human Gut Lachnospiraceae. Microorganisms. 2020;8.
Zamarreño Beas J, Videira MAM, Saraiva LM. Regulation of bacterial haem biosynthesis. Coord Chem Rev. 2022;452:214286.
Petushkova E, Mayorova E, Tsygankov A. TCA Cycle Replenishing Pathways in Photosynthetic Purple Non-Sulfur Bacteria Growing with Acetate. Life. 2021;11.
Risso C, Van Dien SJ, Orloff A, Lovley DR, Coppi MV. Elucidation of an alternate isoleucine biosynthesis pathway in Geobacter sulfurreducens. J Bacteriol. 2008;190:2266–74.
Mendel RR, Leimkühler S. The biosynthesis of the molybdenum cofactors. J Biol Inorg Chem. 2015;20:337–47.
Kredich NM. The molecular basis for positive regulation of cys promoters in Salmonella typhimurium and Escherichia coli. Mol Microbiol. 1992;6:2747–53.
Man L, Dale AL, Klare WP, Cain JA, Sumer-Bayraktar Z, Niewold P, et al. Proteomics of Campylobacter jejuni Growth in Deoxycholate Reveals Cj0025c as a Cystine Transport Protein Required for Wild-type Human Infection Phenotypes. Mol Cell Proteomics. 2020;19:1263–80.
Wyszyńska AK, Godlewska R. Lactic Acid Bacteria - A Promising Tool for Controlling Chicken Campylobacter Infection. Front Microbiol. 2021;12:703441.
Liu Y, Feng Y, Yang X, Lv Z, Li P, Zhang M, et al. Mining chicken ileal microbiota for immunomodulatory microorganisms. ISME J. 2023;17:758–74.
Smith KM, Machalaba CC, Seifman R, Feferholtz Y, Karesh WB. Infectious disease and economics: The case for considering multi-sectoral impacts. One Health. 2019;7:100080.
Bozzi D, Rasmussen JA, Carøe C, Sveier H, Nordøy K, Gilbert MTP, et al. Salmon gut microbiota correlates with disease infection status: potential for monitoring health in farmed animals. Anim Microbiome. 2021;3:30.
Carøe C, Gopalakrishnan S, Vinner L, Mak SST, Sinding MHS, Samaniego JA, et al. Single-tube library preparation for degraded DNA. Methods Ecol Evol. 2018;9:410–9.
Richardson L, Allen B, Baldi G, Beracochea M, Bileschi ML, Burdett T, et al. MGnify: the microbiome sequence data analysis resource in 2023. Nucleic Acids Res. 2023;51:D753–9.
Nurk S, Meleshko D, Korobeynikov A, Pevzner PA. metaSPAdes: a new versatile metagenomic assembler. Genome Res. 2017;27:824–34.
Li D, Liu C-M, Luo R, Sadakane K, Lam T-W. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics. 2015;31:1674–6.
Chen Y, Ye W, Zhang Y, Xu Y. High speed BLASTN: an accelerated MegaBLAST search tool. Nucleic Acids Res. 2015;43:7762–8.
Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25:1043–55.
Olm MR, Brown CT, Brooks B, Banfield JF. dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication. ISME J. 2017;11:2864–8.
Orakov A, Fullam A, Coelho LP, Khedkar S, Szklarczyk D, Mende DR, et al. GUNC: detection of chimerism and contamination in prokaryotic genomes. Genome Biol. 2021;22:178.
Chaumeil P-A, Mussig AJ, Hugenholtz P, Parks DH. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics. 2019. https://doi.org/10.1093/bioinformatics/btz848.
Shaffer M, Borton MA, McGivern BB, Zayed AA, La Rosa SL, Solden LM, et al. DRAM for distilling microbial metabolism to automate the curation of microbiome function. Nucleic Acids Res. 2020;48:8883–900.
Mistry J, Chuguransky S, Williams L, Qureshi M, Salazar GA, Sonnhammer ELL, et al. Pfam: The protein families database in 2021. Nucleic Acids Res. 2021;49:D412–9.
Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28:27–30.
UniProt Consortium. UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res. 2019;47:D506–15.
Cantarel BL, Coutinho PM, Rancurel C, Bernard T, Lombard V, Henrissat B. The Carbohydrate-Active EnZymes database (CAZy): an expert resource for Glycogenomics. Nucleic Acids Res. 2009;37 Database issue:D233–8.
Rawlings ND, Barrett AJ, Bateman A. MEROPS: the peptidase database. Nucleic Acids Res. 2010;38 Database issue:D227–33.
Karp PD, Riley M, Paley SM, Pellegrini-Toole A. The MetaCyc Database. Nucleic Acids Res. 2002;30:59–61.
Eisenhofer R, Odriozola I, Alberdi A. Impact of microbial genome completeness on metagenomic functional inference. ISME Commun. 2023;3:12.
Karp PD, Paley S, Romero P. The Pathway Tools software. Bioinformatics. 2002;18 Suppl 1:S225–32.
Mölder F, Jablonski KP, Letcher B, Hall MB, Tomkins-Tinch CH, Sochat V, et al. Sustainable data analysis with Snakemake. F1000Res. 2021;10:33.
Cantalapiedra CP, Hernández-Plaza A, Letunic I, Bork P, Huerta-Cepas J. eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale. Mol Biol Evol. 2021;38:5825–9.
Ebrahim A, Lerman JA, Palsson BO, Hyduke DR. COBRApy: COnstraints-Based Reconstruction and Analysis for Python. BMC Syst Biol. 2013;7:74.
Schubert M, Lindgreen S, Orlando L. AdapterRemoval v2: rapid adapter trimming, identification, and read merging. BMC Res Notes. 2016;9:88.
Shen W, Le S, Li Y, Hu F. SeqKit: A Cross-Platform and Ultrafast Toolkit for FASTA/Q File Manipulation. PLoS One. 2016;11:e0163962.
Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–60.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–9.
Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34:i884–90.
Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21.
Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, et al. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res. 2013;41 Database issue:D590–6.
Szymanski M, Zielezinski A, Barciszewski J, Erdmann VA, Karlowski WM. 5SRNAdb: an information resource for 5S ribosomal RNAs. Nucleic Acids Res. 2016;44:D180–3.
Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–9.
Holmes I, Harris K, Quince C. Dirichlet multinomial mixtures: generative models for microbial metagenomics. PLoS One. 2012;7:e30126.
Lahti L, Shetty S, Blake T, Salojarvi J. Tools for microbiome analysis in R. Version. 2017.
Oksanen J, Blanchet FG, Friendly M, Kindt R, Legendre P, McGlinn D, et al. vegan: Community Ecology Package. R package version 2.5–7. 2020. 2022.
Parks DH, Chuvochina M, Rinke C, Mussig AJ, Chaumeil P-A, Hugenholtz P. GTDB: an ongoing census of bacterial and archaeal diversity through a phylogenetically consistent, rank normalized and complete genome-based taxonomy. Nucleic Acids Res. 2022;50:D785–94.
Yu G, Smith DK, Zhu H, Guan Y, Lam TT-Y. Ggtree: An r package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods Ecol Evol. 2017;8:28–36.
Paradis E, Claude J, Strimmer K. APE: Analyses of Phylogenetics and Evolution in R language. Bioinformatics. 2004;20:289–90.
Bates D, Mächler M, Bolker B, Walker S. Fitting Linear Mixed-Effects Models using lme4. arXiv [stat.CO]. 2014.
Kuznetsova A, Brockhoff PB, Christensen RHB. lmerTest Package: Tests in Linear Mixed Effects Models. J Stat Softw. 2017;82:1–26.
Barton K. MuMIn: multi-model inference. http://r-forge.r-project.org/projects/mumin/. 2009.
Venables WN, Ripley BD. Modern Applied Statistics with S, Springer, New York: ISBN 0-387-95457-0. 2002.
Papke LE, Wooldridge JM. Econometric methods for fractional response variables with an application to 401(k) plan participation rates. J Appl Econ. 1996;11:619–32.
Benjamini Y, Hochberg Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J R Stat Soc. 1995;57:289–300.
Mangiola S, Molania R, Dong R, Doyle MA, Papenfuss AT. tidybulk: an R tidy framework for modular transcriptomic data analysis. Genome Biol. 2021;22:42.
Wickham H, Averick M, Bryan J, Chang W, McGowan L, François R, et al. Welcome to the tidyverse. J Open Source Softw. 2019;4:1686.
Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–40.
Gene Ontology Consortium. The Gene Ontology resource: enriching a GOld mine. Nucleic Acids Res. 2021;49:D325–34.
Wu T, Hu E, Xu S, Chen M, Guo P, Dai Z, et al. clusterProfiler 4.0: A universal enrichment tool for interpreting omics data. Innovation (Camb). 2021;2:100141.

No competing interests reported.

Campylobactermanuscriptsupplementaryinformation.docx

Download PDF

Editorial decision: Revision requested
19 Dec, 2023
Reviews received at journal
06 Dec, 2023
Reviewers agreed at journal
24 Nov, 2023
Reviewers agreed at journal
23 Nov, 2023
Reviewers invited by journal
23 Nov, 2023
Editor assigned by journal
23 Nov, 2023
Submission checks completed at journal
14 Nov, 2023
First submitted to journal
09 Nov, 2023

You are reading this latest preprint version

Priority effects and microbial cross-feeding shape zoonotic agent spread in broiler chickens

Status:

Version 1

Abstract

Figures

Introduction

Results

Discussion

Methods

Animal experiments

Data generation

DNA and RNA extraction

Library preparation of metagenomic DNA

Library preparation of chicken transcriptomic RNA

Functional annotation and distillation of MAG catalogue

Genome-Scale Metabolic Networks

Metagenomic data processing and read mapping

Chicken transcriptomic data processing and read mapping

Genome-Scale Metabolic Network analysis

Metatranscriptomic data analysis

Chicken transcriptomic data analysis

Declarations

References

Additional Declarations

Supplementary Files

Status:

Version 1