Reference bacterial species
For this study, three biosafety level 1 (BSL-1) ATCC strains of Gram-positive and Gram-negative bacteria with varying GC content and genome sizes were selected: Bacillus cereus ATCC 14579 (B. cereus) (35% GC), Staphylococcus epidermidis ATCC 12228 (S. epidermidis) (32% GC) and Enterobacter cloacae ATCC 13047 (E. cloacae) (54% GC) (Table 1).
Table 1. Key bacterial features of species used in the study.
Bacterial Strain
|
ATCC number
|
ATCC Genomic Element
|
ATCC size
|
ATCC GC content (%)
|
NCBI Reference Sequence
|
NCBI Genomic Element
|
B. cereus
|
14579
|
Chromosome
|
5 416 622
|
35.3
|
NC_004722.1
|
Chromosome
|
Plasmid 1
|
15200
|
38.0
|
NC_004721
|
pBClin15
|
E. cloacae
|
13047
|
Chromosome
|
5 317 012
|
54.8
|
NC_014121.1
|
Chromosome
|
Plasmid 1
|
199 566
|
52.5
|
NC_014107
|
pECL_A
|
Plasmid 2
|
86 314
|
47.4
|
NC_014108
|
pECL_B
|
Plasmid 3
|
5 128
|
37.0
|
No equivalence
|
No equivalence
|
S. epidermidis
|
12228
|
Chromosome
|
2 503 245
|
32.1
|
NC_004461
|
Chromosome
|
Plasmid 1
|
27017
|
29.1
|
NC_005004
|
pSE-12228-05
|
Plasmid 2
|
21979
|
28.9
|
NC_005005
|
pSE-12228-04
|
Plasmid 3
|
8007
|
35.6
|
NC_005006
|
pSE-12228-03
|
Plasmid 4
|
6585
|
31.5
|
NC_005003
|
pSE-12228-06
|
Plasmid 5
|
4679
|
31.8
|
NC_005007
|
pSE-12228-02
|
Plasmid 6
|
4439
|
30.1
|
NC_005008
|
pSE-12228-01
|
Bacterial cultures and DNA extraction
B. cereus and E. cloacae were grown on LB-agar medium, while S. epidermidis was cultured on TSA medium, all overnight at 37°C. Petri dishes with bacterial colonies were then kept at 4°C.
Four methods of DNA preparation or extraction were performed on single bacterial colonies. Two methods involved rapid bacterial DNA lysis performed without prior DNA extraction: heat shock lysis (HS) and glass bead disruption (GBD). Additionally, two purified DNA preparations were obtained using an automatic nucleic acid extractor (EZ1 Advanced, Qiagen, Hilden, Germany). These were processed either as a one-step process (EZ1) or as a two-step process (EZ1-AMP), which included additional purification with AMPure® beads.
Heat shock bacterial lysis (HS)
A single colony was resuspended in a tube with 100 µL of molecular grade water. The suspension was heated in a water bath at 100°C for 10 minutes and then immediately placed on ice. Subsequently, the suspension was centrifuged at 18,500 x g for 5 minutes, and the supernatant containing the lysate was collected and stored at -20°C until use.
Bacterial lysis using glass bead disruption (GBD)
Mechanical disruption with glass beads was used to extract DNA as previously described by Köser et al., 2014 [11]. Briefly, a single colony was resuspended in a tube containing a volume of 8.33 µL of glass beads (425-600 µm) and 25 µL of DNA-free water for a final bead-to-water ratio of 1:3. The sample was vortexed at speed 6 on a Vortex Multi Reax (Heidolph Instruments, Schwabach, Germany) for 5 min and subsequently centrifuged in a benchtop centrifuge for 2 min at 18,500 x g. Finally, 10 µL of the supernatant was collected and stored at -20°C until use.
Bacterial purification using automatic nucleic acid EZ1 extractor (EZ1)
From a single colony, an overnight liquid culture at 37°C was performed on LB broth for B. cereus and E. cloacae, and in TSB for S. epidermidis. The culture was then centrifuged at 1,900 x g at room temperature for 10 minutes, and the supernatant was discarded. For Gram-positive bacteria, the pellet was resuspended in a solution containing 170 µL of buffer G2 (Qiagen, Hilden, Germany), 20 µL of lysozyme 50 mg/ml (Roche Diagnostics GmbH, Mannheim, Germany) and 10 µl of lysostaphin (5 mg/ml) (ProSpec-Tany TechnoGene Ltd., Ness-Ziona, Israel), and incubated at 37°C for 30 min with shaking at 300 rpm. For Gram-negative bacteria, the pellet was resuspended in 200 µL of buffer G2. The lysate was then purified on the EZ1 Advanced XL apparatus (Qiagen, Hilden, Germany) using the pre-programmed DNA Bacteria Card protocol and the EZ1 DNA Tissue Kit (Qiagen, Hilden, Germany) following the manufacturer’s instructions. The elution volume was set at 100 µL, and the eluate was stored at -20°C until use.
AMPure® bead purification (EZ1-AMP)
Fifty µL of DNA extracted according to the EZ1 protocol was purified at a ratio of 1.8:1 of AMPure® beads (Agencourt® AMPure®, Beckman Coulter, MA, USA) relative to DNA, aiming to remove fragments smaller than 100 bp. The elution volume was adjusted at 50 µL, and the eluate was stored at -20°C until use.
DNA quantification and dilution
DNA quantification was performed using the Qubit 1X dsDNA High Sensitivity Assay Kit (Invitrogen Life Sciences, Merelbeke, Belgium) on a Qubit 4 Fluorometer (Invitrogen Life Sciences, Merelbeke, Belgium), following the manufacturer’s instructions with a 2 μL sample volume. Based on the DNA concentration stocks, dilutions were made in DNA-free Tris-HCl 10 mM to adjust the working solution to 0.333 ng/μL. Four aliquots of each DNA solution were prepared for use in library preparation and stored at -20°C.
Library preparation for WGS
Four library preparation kits using different fragmentation methods were used for this study: Nextera XT DNA Library Preparation Kit (Illumina, San Diego, CA, USA) (XT), DNA Prep Kit (Illumina, San Diego, CA, USA) (DP), KAPA HyperPlus Kit (Roche Diagnostics GmbH, Mannheim, Germany) (KP), and NEBNext® Ultra™ II FS DNA Library Prep Kit (Bioké, Leiden, The Netherlands) (NN). Each library was prepared using a minimal starting quantity of DNA (1 ng), with amplification conditions adapted according to the manufacturer’s instructions (Table 2).
Table 2. Library preparation kits and fragmentation methods
Library preparation kits
|
Acronym
|
Company*
|
Fragmentation Methods
|
Amplification cycles (PCR)
|
Illumina DNA Prep
|
DP
|
Illumina
|
Tagmentation by transposome
|
12
|
Nextera XT DNA Library Prep
|
XT
|
Illumina
|
Tagmentation by transposome
|
12
|
KAPA HyperPlus
|
KP
|
Roche
|
Enzymes
|
14
|
NEBNext® Ultra™ II FS DNA Library Prep
|
NN
|
New England Biolabs
|
Enzymes
|
9
|
Caption: Fragmentation methods and protocol adaptations based on the manufacturer's instructions for the number of PCR cycles using 1 ng of DNA. *Details of the companies are provided in the section Material and Methods.
The library preparations and WGS sequencing were carried out four times for each DNA sample. The quantity and quality of each individual library were assessed using the Qubit 1X dsDNA High Sensitivity Assay Kit on a Qubit 4 Fluorometer and a High Sensitivity DNA Assay kit on a 2100 Expert Bioanalyzer apparatus, following the manufacturer’s instructions. Subsequently, the libraries were equimolarly pooled, and the quantity and the quality of each pool were assessed again using the Qubit and Bioanalyzer, respectively. Finally, each pool was loaded onto a MiSeq for a paired-end 2 x 300 bp sequencing run using the MiSeq reagent kit V3 (600 cycles).
Bioinformatic analysis
To evaluate the impact of different DNA preparation methods and library preparation kits on WGS results, we compared key performance indicators (KPIs) from two bioinformatics pipelines: reference-based mapping and de novo assembly. The KPIs included sequencing depth evenness across chromosomes and plasmids (accounting for GC bias), the number of contigs, N50, genome fraction, and percentage of mismatches.
Reference-based mapping:
Raw sequence reads were aligned to the ATCC reference genomes of the three target bacteria (Table 1) using Minimap2 v.2.17. The resulting SAM files were sorted and indexed using SAMtools v.1.6. Sequencing depth was calculated with GenomicAlignments v.1.34.1. The sequencing depth of plasmids was normalised using the average chromosomal sequencing depth as a baseline. To assess potential GC content bias, the genome was segmented into 200 bp fragments. For each segment, the local-to-average sequencing depth ratio was calculated and plotted against the corresponding GC content.
De novo assembly:
After down-sampling reads to 150.000/sample, filtered FASTQ files were assembled into contigs using SPAdes v.4.0.0. Contigs shorter than 1000 bp were discarded.to enhance assembly quality. The quality of the assemblies (i.e. N50, number of contigs, genome fraction compared to ATCC reference genomes, and the percentage of mismatches between the assemblies and the ATCC reference genomes) was evaluated using QUAST 5.0.2.
We used the ggplot2 R package to visualise and compare quality metrics across different DNA extraction and library preparation conditions.