Study Area
The sampling was carried out at three study savannah dominant sites in Tanzania: the Serengeti National Park (Serengeti), the Ruaha National Park (Ruaha), and the Selous Game Reserve (Selous). The ~25,000 km2 Serengeti ecosystem is located in northern Tanzania, bordering southern parts of Kenya [26]. The ecosystem in Tanzania consists of several protected areas including the Serengeti National Park (14,760 km2, between 1º and 3º30S and 34º and 36ºE) and adjacent Ngorongoro Conservation Area, Game Reserves and Game Controlled Areas. The climate of the Serengeti has a bimodal rainfall pattern with two rainy and two dry seasons. Typically, the short rains are in November-December, and the long rains in March-May [27, 28]. The Ruaha ecosystem (7°30′S 35°00′E), located in the southern region of Tanzania and at 20,226 km2 (7,809 sq mi), is comprised of Ruaha National Park and surrounding Game Reserves and Wildlife Management Areas. Similar to the Serengeti, it has a biannual rainfall pattern with a short rainy season (November-February) a long rainy season (March-April), and a dry season (June-October) [29]. For the purpose of this study, dry season includes the months of June to October while rainy season spans from the months of March to May. The Selous Game Reserve (9°0′S 37°24′E) 50,000 km2 (partitioned to form Nyerere National Park, a largest Park in the country) is another protected area in Southern Tanzania. It is contiguous with the Mikumi National Park and Udzungwa Mountains National Park and contains one third of the abundant wildlife of Tanzania. The climate is dry sub-humid influenced by the prevailing southeasterly winds which bring rainfall to the reserve.
Bushmeat Sample Collection
Sample collection occurred over a two-year period between July 2016 to July 2018, from both dry and rainy seasons. Samples were identified as either “fresh” or “processed”. Fresh samples were those that appeared to be recently harvested while the processed samples included those that were either heavily salted, sun or air-dried, semi-boiled, or a combination of methods that resulted in dehydrated meat samples [30]. The samples were collected by teams of veterinarians, anti-poaching units, and local enumerator networks from villages surrounding the protected areas, where bushmeat is sold [31]. All samples were collected through incidental sampling after obtaining requisite permits from national authorities and in consultation with village leaders. No wildlife species were harmed or directly surveyed as part of this investigation.
Bushmeat samples (~ 250-500g) were collected in double zip lock freezer bags containing sterile non-toxic Silica Gel desiccant moisture absorbers/dehumidifiers. The samples were stored in -20°C vehicle freezers prior to transferring to -20°C solar freezers in the field. The samples were then transported to the Nelson Mandela African Institution of Science and Technology (NM-AIST) in Arusha via vehicle freezers, where they were stored at -80°C until processing. Cold chain maintenance was ensured through temperature trackers included with the samples throughout the entire process. Sample identification sheets were completed for all collected samples and the metadata, including information regarding sample condition, wildlife species, the collection site (using GPS coordinates), and seasonality.
Nucleic Acid Extraction
To avoid potential issues associated with surface contamination, three replicates of ~ 120 mg each, were obtained from the muscle tissue using sterile, disposable Rapid Core punch (World Precision Instruments, Sarasota, FL) and sterile disposable safety scalpels (VWR, Bridgeport, NJ). The samples were placed in the MagMAX™ Lysis/Binding Solution Concentrate buffer (Thermo Fisher Scientific, Waltham, Massachusetts). The homogenization protocol was optimized and the settings with the best DNA yield for both fresh and processed samples can be found in the Appendix 1.
Nucleic acid extractions were performed using MagMAX™ 96 DNA Multi-Sample Kit (Thermo Fisher Scientific, Grand Island, NY) and KingFisher Flex automated DNA purification system (Thermo Fisher Scientific, Grand Island, NY) per manufacturer’s instructions with minor modifications. If the yield and purity of extractions were not adequate for a specific sample, such as the DNA concentration was less than 4 ng/µl and/or if the quality of the amplicons analyzed by the Bioanalyzer did not pass the quality control, DNA was extracted manually using DNeasy PowerSoil Kit (Qiagen, Hilden, Germany) per manufacturer’s instructions. Extracted DNA was quantified using Qubit™ 3.0 Fluorometer (Thermo Fisher scientific, Grand Island, NY). To ensure purified DNA was of high-quality, DNA was also visualized through agarose gel electrophoresis.
Real Time PCR
Real-time PCR for the detection of B. anthracis, Brucella and Coxiella were carried in 20 µl reactions with both positive and negative controls internal controls (Appendix 1; Supplemental Table 1) [32]. Samples were considered positive for B. anthracis when both plasmids, pXO1 and pXO2, were detected through RT-PCR. All positive samples were verified through repeating the nucleic acid extraction from the original sample and RT-PCRs were performed in technical triplicates.
Microbiome Sequencing
The purified DNA was transported to the Biosciences eastern and central Africa-International Livestock Research Institute (BecA-ILRI) Hub in Nairobi, Kenya for the microbiome sequencing. The V3-V4 hypervariable region of the 16S rRNA gene sequenced on the Illumina MiSeq System generating 2x300 bp paired-end reads (see methods in Appendix 1). Sequence datasets are deposited in NCBI’s Sequence Read Archive (SRA) repository (Bioproject PRJNA477349).
Bioinformatics for Microbiota Composition
To guarantee a high quality data set, microbiome analyses were performed using an open source platform, Empowering the Development of Genomics Expertise (EDGE) [33]. The EDGE Bioinformatics platform (v2.3.0) implementation is based on QIIME v1.9.1 and includes demultiplexing and quality filtering, operational taxonomic unit (OTU) clustering, taxonomic assignment, phylogenetic reconstruction, diversity analyses, and visualizations of each of these analyses. [33]. QIIME started the pre-processing step with 43,095,486 paired-end reads (in FASTQ format) by removing reads with a PHRED quality score lower than Q20. Data were further filtered to remove reads with more than one ambiguous base. If less than 50% of a read contained consecutive high-quality bases, the read was discarded. The forward and reverse reads were merged before analysis. A 94% nucleotide identity sequence clustering threshold (approximately genus level) was used to generate OTUs. The representative sequence for each OTU was then queried against the Greengenes database [34] for taxonomic classification assignment. The depth filter was set to 1000 reads – any barcoded sample that had less than 1000 assembled reads did not undergo taxonomic classification or further alpha- and beta-diversity analyses. Next, the integrated phylotype command generated the consensus taxonomy using phylotype-based approach, where taxonomy-linked sequences were assigned to OTUs based on similarity.
Statistical Analysis
All the statistical analyses were performed using the software R (v1.2.1335, RStudio, Inc), if not otherwise stated. The proportion of positive samples by season (dry, rainy), sampling region (Serengeti, Ruaha, Selous) and meat condition (fresh, processed) was estimated using prop.test with continuity correction function in the ‘stats’ package [36]. The relative risk of a sample being positive for one or more of the select pathogens was calculated using the risk-ratio function of the ‘epitools’ package [35]. To investigate the additive effects of all three categories on the outcome of a sample being positive for any of the pathogens, a multivariate analysis was performed using logistic Generalized Linear Model (GLM with logit link) with region, season and meat condition included as categorical variables [36]. To examine the potential for sample size biases in the microbiota sequencing, a rarefaction analysis was conducted where the fraction of taxa was captured for each sample [37]. Results show that we were able to consistently identify all the detectable OTUs present in every sample with a minimum reading depth of 14,050 sequences per sample. This suggests that the variation of the microbiota among samples is not likely caused by low sequence coverage and that further sequencing would not detect additional genera (Supplemental Figure 1). The microbiome alpha diversity (species richness) was calculated using the Shannon Diversity Index in the package ‘Vegan’ [38]. Statistical analysis for the microbial diversity between the groups was performed using Welch’s t-test [36]. Hierarchical clustering of the samples based on relative abundance of phyla was performed using the package ‘pvclust’, with 1000 bootstrapped replications of the clusters [39]. To examine the beta diversity of the samples, Principal Coordinate Analysis (PCoA) was performed using the Bray-Curtis Matrix at the phylum level using the package ‘Vegan’ [38]. Groups were clustered graphing ellipses estimating the 95% confidence interval (CI) around the average value, for each variable.