Dissection of the mutation accumulation process during bacterial range expansions

doi:10.21203/rs.2.20228/v1

Research article

This work is licensed under a CC BY 4.0 License

Journal Publication

published 23 Mar, 2020

Read the published version in BMC Genomics →

Background

Recent experimental work has shown that the evolutionary dynamics of bacteria expanding across space can differ dramatically from what we expect under well-mixed conditions. During spatial expansion, deleterious mutations can accumulate due to inefficient selection on the expansion front, potentially interfering with and modifying adaptive evolutionary processes.

Results

We used whole genome sequencing to follow the genomic evolution of 10 mutator Escherichia coli lines during 39 days (∼1650 generations) of a spatial expansion, which allowed us to gain a temporal perspective on the interaction of adaptive and non-adaptive evolutionary processes during range expansions. We used elastic net regression to infer the positive or negative effects of mutations on colony growth. The colony size, measured after three days of growth, decreased at the end of the experiment in all 10 lines, and mutations accumulated at a nearly constant rate over the whole experiment. Nevertheless, we find evidence that beneficial mutations accumulate primarily at an early stage of the experiment, leading to a non-linear change of colony size over time. Indeed, colony size remains almost constant at the beginning of the experiment and then decreases after ∼12 days of evolution. We also find that beneficial mutations are enriched in flagella genes, genes encoding transport proteins, and genes coding for the membrane structure, whereas deleterious mutations show no enrichment for any biological process.

Conclusions

Our experiment shows that beneficial mutations target specific biological functions mostly involved in inter- or extra-membrane processes, whereas deleterious mutations are randomly distributed over the whole genome. It thus appears that the interaction between genetic drift and the availability or depletion of beneficial mutations determines the change in fitness of bacterial populations during range expansion.

Epigenetics & Genomics

experimental evolution

range expansion

mutation load

Many populations expanded or shifted their range in their evolutionary history, for instance during the invasion of new habitats or in response to environmental changes (1-3). Understanding the impact of dynamic species range margins on the evolutionary forces driving genomic and phenotypic evolution has become an important question in evolutionary biology, for example in the context of the evolution of dispersal (4), genetic diversity (5) or the structure of biodiversity (6). Recent theoretical and empirical studies show that new mutations occurring at the edge of an expanding population can increase in frequency and spread over a large proportion of newly colonized territories. This process has been called gene surfing (7) and results from stochastic evolutionary processes at the wave front where population density is low and genetic drift is strong (8-10). Theoretical studies have predicted that deleterious mutations can accumulate during range expansion (11) and create an expansion load (12). This prediction could be confirmed experimentally with expanding Escherichia coli populations (13).

Although the theory predicts that the fitness of spatially expanding populations of bacteria should decrease over time, there is evidence that populations that expand their range can evolve greater expansion speed (14-16), which can be a result of spatial sorting (4). It remains unclear, however, if and how the various evolutionary forces vary over time and space in populations that are expanding their range. Recently, microbial evolution experiments in liquid media using time-resolved sequencing have revealed complex dynamics characterized by rapid adaptation, competition between beneficial mutations, epistasis, and genetic parallelism (17-20). It is possible that adaptation is mainly due to constant selection acting on mutations of small effect, which would lead to a gradual change in fitness. Alternatively, evolution on rugged fitness landscapes could lead to alternating periods of rapid phenotypic evolution and more static periods of evolution (21). This variation in the rate of adaptation can be caused by changes in the environment, opportunities for improvement after key innovations, and invasion of new habitats (22, 23).

In this study, we investigate the rate at which mutations accumulate during range expansion by performing evolution experiments with populations of the bacterium Escherichia coli. We selected 12 populations from our previous experiment that expanded their range on the solid surface of agar plates for a total of 39 days (13). We sequenced 6 lines at 13 time points and 6 lines at 5 time points within the 39 days of expansion to determine for each line how many mutations accumulate over time. Additionally, we used measurements of the expansion speed of the lines during the experiment to determine the effect of these mutations on the expansion speed and how these effects change over time.

Linear increase in number of mutations and decrease of colony size over time

We sequenced the genomes of 12 lines of Escherichia coli sampled at three-day intervals over a total of 39 days of radial expansion on agar plates. In total, we collected 108 DNA samples of the 12 lines during the 39 days of expansion (see Methods). Two lines were excluded after DNA sequence analysis due to contamination during the experiment. We thus used 90 sequences from 10 lines for all further analyses. The colony size was also measured after every growth period of 3 days.

We used a linear mixed effect regression model to predict expansion speed over time, and, separately, the number of accumulated mutations. In the first mixed effect model, used to predict expansion speed, we estimated an individual random effect for the intercept and the slope of the linear model (to account for the dependence of the measurements over time for each line). On average, colony size, measured as the radius at the end of a 3-day expansion period, decreased at a rate of 95 µm per day (95% CI: [–129, –62]; p < 2.2 × 10^−16) over the course of the experiment (Fig. 1). The lines accumulated on average 3.1 mutations per day (95% CI: [2.45, 3.71], Fig. 1). For the colony size data, the linear model explains about 67% of the variation (R_c² = 0.67), indicating that there is still considerable variation that this simple model cannot explain. This is not surprising since several unaccounted factors potentially have an impact on colony size, e.g. variation of mutation effect size, temperature, humidity, agar concentration, and fluctuations in nutrient composition. In contrast, the model used to predict the number of mutations explains 95% of the variation in the data (R_m² = 0.95), suggesting that mutations accumulate almost linearly over time. The linear accumulation of mutations suggests that the mutation rate and the generation time remained largely constant over the course of the experiment and shows that evolutionary changes in colony size did not impact the rate at which mutations accumulate.

If the colony size data are split into four periods and the mixed effect model is used to analyze the time periods separately, the slope is not significantly different from 0 in the periods 0–12 days (p = 0.5391), 21–30 days (p = 0.4352), and 30–39 days (p = 0.0529). However, there is a significantly negative slope in the period 12–21 days (p = 0.0142), suggesting that colony size decreased significantly only in the second period (days 12–21) and did not vary significantly in the other periods.
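To make the notion of a per-line slope concrete, the following minimal sketch fits an ordinary least-squares slope to each line separately and then averages the slopes. This two-stage summary is a simplified stand-in, not the mixed effect model used in the study, and the line names and colony radii are hypothetical values chosen only for illustration.

```python
from statistics import mean

def ols_slope(days, sizes):
    """Least-squares slope of colony size regressed on time."""
    mx, my = mean(days), mean(sizes)
    sxx = sum((x - mx) ** 2 for x in days)
    sxy = sum((x - mx) * (y - my) for x, y in zip(days, sizes))
    return sxy / sxx

def mean_slope(lines):
    """Fit one slope per line, then average them across lines."""
    slopes = {name: ols_slope(d, s) for name, (d, s) in lines.items()}
    return slopes, mean(list(slopes.values()))

# Hypothetical colony radii in µm (NOT data from the study).
toy = {
    "line_A": ([3, 6, 9, 12], [9000, 8700, 8400, 8100]),
    "line_B": ([3, 6, 9, 12], [9500, 9230, 8960, 8690]),
}
slopes, avg = mean_slope(toy)
print(slopes, avg)  # per-line slopes in µm per day and their average
```

A mixed effect model goes further than this sketch: it estimates the common slope and the between-line variance jointly, so lines with few or noisy measurements are pulled towards the overall trend.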

dN/dS ratio decreases over time

We analyzed the mutations in four consecutive time periods: mutations that occurred in days 0–12, days 12–21, days 21–30, and days 30–39, respectively (Fig. 2A). The change of the dN/dS ratio over time suggests that there is a larger proportion of non-synonymous than synonymous mutations at the beginning of the experiment (dN/dS = 1.4754, p = 0.0041; Fig. 2B and Table 1), indicative of positive selection during this early phase. The dN/dS ratio is not significantly different from 1 in the later periods of the evolution experiment (Table 1), indicating that non-synonymous and synonymous mutations accumulate randomly at later stages. The dN/dS ratio is significantly different between days 0–12 and days 30–39 (p = 0.039). None of the other pairwise comparisons between time periods is significant.

Table 1

dN/dS ratio calculated for mutations occurring in four time periods (0–12, 12–21, 21–30, and 30–39 days). Reported p-values were obtained by a permutation test.
	day 0–12	day 12–21	day 21–30	day 30–39
dN/dS	1.4754	1.3516	1.2463	0.9909
p value	0.0041	0.1348	0.0823	0.9445

The effects of mutations on colony size become more negative over time

We used an Elastic Net (EN) regression, which performs both variable selection and regularization, to determine the subset of genes that have the largest effect on colony size by analyzing non-synonymous and loss of function (LOF) mutations. This analysis estimates for each gene the effect a mutation has on colony size. Positive values indicate that a mutation causes an increase in colony size and negative values indicate a decrease in colony size. For the EN analysis, we used the change in colony size between two sampling points and the list of genes that acquired new mutations between these two sampling points. There were 6 genes remaining in the model associated with an increased colony size and 34 genes associated with a colony size reduction (Table 2). Of these 34 genes, 15 are involved in metabolic processes, 14 are connected to the formation of the cell membrane, transporter proteins, and motility, and 5 are involved in the control of gene expression and DNA structure.

We additionally estimated mutation effects on colony growth by analyzing non-synonymous and loss of function (LOF) mutations with ridge regression, which performs only regularization without variable selection. We estimated an effect for each gene, and took it into account even if it was close to zero. We could therefore investigate the distribution of the effects of all genes. Ridge regression was done in four time periods to detect any potential change in the distribution of the effects over time (A: 0–12 days, B: 12–21 days, C: 21–30 days, and D: 30–39 days) (Fig. 3). The estimated mean mutation effect does not differ significantly from 0 in the first 12 days and after day 21 (3–12 days: p = 0.7858; 21–30 days: p = 0.0627; 30–39 days: p = 0.1125). In contrast, between days 12 and 21, we observe a significantly negative mean effect of a new mutation (p < 2.2 × 10^−16) (Fig. 3). This result implies that there is either a shift to more deleterious mutations in the second period or that there are more beneficial mutations at the beginning of the experiment. The latter explanation is in line with the observed dN/dS ratio, which is significantly larger than 1 during the first period.

Table 2

Effects of non-synonymous and loss of function mutations on colony size, as inferred by Elastic Net regression. Effect sizes are relative to the initial colony size. The functional units were defined using Ecocyc (24).
Name	Gene description	Pos. Coef.	Neg. Coef.	Function unit
croE	RNA polymerase assembly factor	0.867		DNA or RNA process
livM	Transporter	0.705		Transporter
ybiO	Transporter	0.243		Transporter
ycfQ	Transcriptional repressor	0.679		Regulator
fdoG	Formate dehydrogenase	0.627		Metabolic process
ybdH	Swarming motility	0.066		Motility
yheT	Predicted hydrolase		-3.766	Metabolic process
frlD	Phosphorylation		-0.695	Metabolic process
metL	Amino acid biosynthesis		-0.686	Metabolic process
pdxJ	Metabolic process		-0.596	Metabolic process
fixC	Flavoprotein		-0.593	Metabolic process
glnE	Glutamine synthesis		-0.533	Metabolic process
yphB	Conserved protein		-0.508	Metabolic process
yfeS	Conserved protein		-0.381	Metabolic process
ybhJ	Metabolic process		-0.181	Metabolic process
elbB	Lycopene biosynthesis		-0.177	Metabolic process
panC	Biosynthetic process		-0.104	Metabolic process
msyB	Heat sensitivity		-0.081	Metabolic process
gtrB	Prophage		-0.076	Metabolic process
hpc	Nitrate metabolism		-0.044	Metabolic process
dmlA	D-malate dehydrogenase		-0.032	Metabolic process
yfiL	Lipoprotein		-1.484	Membrane
wcaL	Colanic acid synthesis		-0.507	Membrane
lnt	Lipoprotein		-0.228	Membrane
yfjD	Inner membrane protein		-0.124	Membrane
yciM	Lipopolysaccharide assembly		-0.072	Membrane
ddpA	Peptide ABC transporter		-0.904	Transporter
fecC	Transporter		-0.751	Transporter
yqcE	Transporter		-0.282	Transporter
pheP	Phenylalanine transporter		-0.103	Transporter
alsA	Transporter		-0.081	Transporter
ccmB	Transporter		-0.073	Transporter
uidB	Glucuronide transporter		-0.045	Transporter
paaX	Regulator		-0.784	Regulator
rssB	Regulator of RpoS		-0.649	Regulator
preA	Swarming motility		-0.497	Motility
yeaJ	Motility		-0.011	Motility
recG	DNA repair		-0.245	DNA or RNA process
der	Ribosomal stability factor		-0.238	DNA or RNA process
leuP	tRNA		-0.188	DNA or RNA process

GO enrichment analysis

We investigated whether the non-synonymous and LOF mutations found to have an effect on colony size by our EN method (see Table 2) were significantly enriched in gene ontology (GO) terms, both for the four time periods considered above and over the whole experiment. For this analysis, we used all genes irrespective of whether they had been affected by positive or negative mutations, since there were not enough mutations in each of these separate categories. We found two significantly enriched GO terms using data from the entire experiment: organelle inner membrane (GO:0019866; q = 0.00017) and peptidoglycan-based cell wall (GO:0009274; q = 0.00202) (Fig. 4). Note that bacteria do not possess organelles, but genes in this GO term are defined as membrane-bounded structures with a specified protein content and specified biochemical output (25). We find the same two significant GO terms in the first period (days 0–12): organelle inner membrane (GO:0019866; q = 0.01725) and peptidoglycan-based cell wall (GO:0009274; q = 0.01725). There were no significant GO terms after 12 days until the end of the experiment. The genes that are mutated in the two GO terms (GO:0019866, GO:0009274) can be further divided into four functional groups using Ecocyc (24): flagella assembly, transporter proteins and signaling proteins at the inner membrane, and peptidoglycan assembly of the cell wall (Fig. 4).

We investigated here the accumulation of mutations in 10 Escherichia coli lines over 39 days of expansion on agar plates. We analyzed the temporal dynamics of the effect of mutations on the speed of expansion of bacterial colonies on an agar plate. The focus was to identify the temporal dynamics of the interactions between selection and genetic drift during range expansions. We do not find evidence of a constant decrease in fitness over time. Rather, the dynamics of fitness change are more complex, with a mixture of positively and negatively selected mutations occurring at all stages, even though their relative proportions and effects vary over time (Figs. 2 and 3).

We find evidence of positive selection acting on non-synonymous mutations in the first 12 days, as attested by a dN/dS ratio significantly larger than 1 (dN/dS = 1.48, p = 0.0041, Table 1). However, the estimated average effect of non-synonymous and LOF mutations on colony size is not significantly different from 0 in the first quarter of the experiment (Fig. 3). This suggests that beneficial mutations arising in the first 12 days of the experiment compensate for the effect of deleterious mutations, resulting in no net effect on fitness. There is then a significant decrease in fitness between days 12 and 21, although the dN/dS ratio does not deviate significantly from 1 in this period. The observation of a constant fitness at the beginning of the experiment and of a decreasing fitness at a later stage could be due to a limited number of mutations that can lead to an increase in colony size (26). After the reservoir of potential positive mutations is exhausted or becomes too small, we would indeed mainly see the effect of a constant accumulation of deleterious mutations, leading to a progressive decrease in the fitness of the bacteria on the front. Note that the rate of fitness gain also declines over time in well-mixed (liquid-grown) bacterial populations (27), but in contrast to populations expanding on a two-dimensional surface, their molecular evolution is characterized by signatures of rapid adaptation during the experiment (27). After 21 days, the mutational effects are not significantly different from 0 (Fig. 3), which is in line with the predictions of a Fisher Geometric Model where the proportion of beneficial mutations increases when a population gets further away from its optimum (28). Under this line of reasoning, the accumulation of deleterious mutations during days 12 to 21 would have moved the lines away from their optimum, therefore allowing for a higher influx of beneficial mutations after 21 days. However, the effect is either not strong enough to produce a dN/dS ratio significantly different from 1 after 21 days in our experiment, or it is mainly driven by LOF mutations.

In this study, we focused on the average effect of mutations among all 10 lines, but mutations occurring in an individual line can show a large deviation from this average effect. There is indeed quite a high variability in the fitness trajectories among different lines (Fig. 1), as the fitness of some lines continues to decrease after 21 days. The fact that the mean effect of the mutations is not significantly different from zero after day 21 in Fig. 3 is also potentially due to our limited sample size. A larger study performed over a longer time period would be useful to draw more definitive conclusions. The fact that the number of mutations per line increases linearly over time suggests that mutations occur at a constant rate, which is in line with previous studies of Escherichia coli lines in liquid medium (27, 29), where the rate of genomic evolution was nearly constant. However, in the previous evolution experiments in liquid culture, the dN/dS ratio was significantly larger than one (29) and fitness increased after a short time period relative to the ancestor (19, 27, 30). Our observation that fitness decreases in the second period of the experiment (days 12–21) is in line with the theoretical predictions that natural selection is inefficient during range expansions due to the low effective population size at the expanding front, leading to an inefficient purging of deleterious mutations (31, 32). Expansion speed generally depends on dispersal and growth rate, but mutations can have a different impact on these two mechanisms, and these two traits tend to interact and co-evolve (11, 33). Interestingly, an increase in colony size has been predicted for expanding motile bacteria where faster dispersal can evolve (16). Therefore, the relative strength of drift and selection might change over time (34).

The GO enrichment analysis performed on non-synonymous and LOF mutations revealed two significant GO terms in the total data set as well as in the first 12 days of evolution: organelle inner membrane (GO:0019866) and peptidoglycan-based cell wall (GO:0009274). The mutated genes belonging to these GO terms code for proteins functionally connected to the cell membrane and potentially involved in the surface structure of the cell (Fig. 4). There is evidence that structural changes of surface proteins can lead to bacterial cell sorting, for example by reducing drag and thus allowing cells to move more easily to the front of the expansion (35). Changes on the cell surface also potentially have an impact on the stability of the edge of the colony (36, 37). If the stability of the colony is weakened, the same number of bacteria could spread over a larger area and form a thinner colony (15), since they would be less densely packed. Our results thus strongly suggest that some non-synonymous mutations in membrane protein genes occurring early during the experiment lead to an increase in colony size and are therefore positively selected. Previous estimates of the distribution of fitness effects (DFE) over the whole experiment suggest that, on average, more deleterious mutations accumulate during a long period of range expansion on agar plates (13), but the DFE results suggested that there were also many potentially positively selected mutations occurring during these expansions, even though it was not possible to identify them individually. Due to the relatively small sample size (10 lines) and the smaller number of mutations observed in each time period, it was not possible to infer period-specific DFEs, but we nevertheless show that these beneficial mutations accumulated early during the experiment. The study of a much larger number of strains could enable one to examine if and how DFEs change over the course of the experiment.

Our results highlight the importance of considering the spatially explicit process of bacterial growth when studying bacterial adaptation and evolution, as functional constraints imposed by range expansions could seriously limit the ability of bacteria to cope with environmental changes (38). The complex adaptive processes demonstrated here in bacteria could also occur during the expansion of other populations, including humans, and during the growth of solid tissues in eukaryotes. The analogy between the evolution of bacterial communities and the growth of eukaryotic tissue has recently been highlighted, in particular in cancer (39). Like bacteria, solid cancers evolve by a process of clonal expansion, exploring the adaptive landscapes of tissue ecosystems (40). Expansion load theory in non-recombining organisms could therefore also explain phenomena such as spontaneous tumor recession, irregular growth patterns, or extremely high clonal diversity in tumors (41–43). In addition to having triggered the development of specific life-history traits in most organisms (reviewed in (44)), the negative impact of deleterious mutations could have led to the development of specific cellular mechanisms preventing their accumulation during tissue growth, and apoptosis could be such an example.

Bacterial Strain

We used Escherichia coli K12 MG1655 strains in which the expression of the mutS gene is directly controlled by the arabinose promoter pBAD inserted in front of the mutS gene. In the absence of arabinose, mutS is not expressed, leading to a higher spontaneous mutation rate due to the inactivation of the methyl-directed mismatch repair system (MMR; (45)). Additionally, our strain had a GFP marker located in the lac operon, which can be induced by IPTG (isopropyl β-D-1-thiogalactopyranoside).

Experimental setup

Twelve bacterial strains were propagated on LB agar plates at 37°C for a total duration of 39 days. The strains were transferred to new agar plates every 3 days (Figure 5). An image of the colony was taken before transferring the strains to a new plate. The location of the sampling point of each transfer was chosen at random on the periphery of the colony. At each transfer, a sample containing about 100 million cells was collected from the colony front using a sterile pipette tip and resuspended in 100 µl 0.85% NaCl solution. About one million cells were then used to inoculate a new plate (Figure 5B). This expansion experiment on several plates aims at mimicking a continuous expansion for 39 days or ∼1650 generations (Figure 5C). We extracted DNA from six lines at each of the 13 transfers, and for six other lines, we extracted DNA at days 3, 12, 21, 30, and 39. We thus analysed a total of 108 DNA samples from the 12 lines (Figure 5A).
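The sampling design and time scale described above can be checked with a few lines of arithmetic. The sketch below only recombines numbers stated in the text; the variable names are chosen here for illustration, and the generation count is the approximate figure quoted above.

```python
# Six lines sampled at all 13 transfers, six lines at 5 time points.
full_series = 6 * 13
sparse_series = 6 * 5
total_samples = full_series + sparse_series   # DNA samples collected

days = 39
generations = 1650                            # approximate value from the text
transfers = days // 3                         # number of 3-day growth periods

gen_per_day = generations / days              # roughly 42 generations per day
gen_per_transfer = generations / transfers    # roughly 127 generations per plate

# About 1 million of the ~100 million sampled cells inoculate the next plate.
inoculum_fraction = 1_000_000 / 100_000_000

print(total_samples, transfers, round(gen_per_day, 1), round(gen_per_transfer, 1))
```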

DNA extraction

After the range expansion experiment on agar, one million cells from the wave front were streaked out on an LB agar plate containing 0.5% arabinose and incubated for 24 h at 37°C to isolate single clones. A single colony was dissolved in 100 µl dilution solution (0.85% NaCl) and 1 µl was transferred to a new LB agar plate containing 0.5% arabinose. The plate was then incubated for 24 h at 37°C. Then, the entire colony was removed from the agar plate and resuspended in 1 ml dilution solution. Genomic DNA was extracted using the Wizard Genomic DNA Purification Kit (Promega) following the manufacturer's protocol. The integrity of the DNA was checked by gel electrophoresis. The DNA concentration was determined by fluorometric quantification (Qubit 2.0).

Whole genome sequencing and variant calling

The 108 DNA samples of the 12 lines were sequenced using a TruSeq DNA PCR-Free library (Illumina) on a HiSeq 3000 platform (Illumina), from which we obtained 100 bp paired-end reads for all samples. Trimmomatic 0.32 (46) was used to remove the adapter sequences from the reads and for quality trimming. Leading and trailing bases with quality below 3 were removed. The reads were scanned with a 4 bp sliding window and cut if the average quality per base was below 15. Reads with a length below 36 bp were excluded from the analysis. Variants were identified using BRESEQ (version 0.27.2), a computational tool for analyzing short-read DNA data (47). BRESEQ uses Bowtie2 (Langmead, et al. 2009) to map reads to the Escherichia coli K12 MG1655 (NC_000913.3) reference genome. As a first step, it identifies potential new junctions between disjoint regions of the reference sequence using all available reads. BRESEQ then uses an empirical error model for base quality re-calibration considering the identity of the reference base, the identity of the mismatch base, the base position within the read, and the neighboring base identities. At each alignment position, BRESEQ calculates the posterior probability of each nucleotide given the observed aligned reads. If the nucleotide with the highest posterior probability is different from the reference, BRESEQ records read alignment evidence. The top/bottom strand distribution of reads supporting the major base is compared to the top/bottom strand distribution of reads supporting the minor base using Fisher's exact test, to avoid false-positive polymorphism predictions due to sequencing-error hotspots in reads on one strand. A one-sided Kolmogorov-Smirnov test was used to test whether the base quality scores supporting the minor variant are suspiciously lower than the base quality scores supporting the major variant. We excluded two samples after analyzing the DNA sequences due to potential contamination.
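The quality-trimming rules described above (leading and trailing bases below quality 3, a 4 bp sliding window with a mean quality threshold of 15, and a minimum length of 36) can be illustrated with a small sketch. This is a simplified re-implementation written for illustration, not Trimmomatic's own code, and it may differ from the tool in edge cases such as windows near the read end. The function name `trim_read` and the toy quality scores are hypothetical.

```python
def trim_read(quals, lead=3, trail=3, window=4, min_avg=15, min_len=36):
    """Return (start, end) of the retained part of a read, or None if dropped."""
    start, end = 0, len(quals)
    # Remove low-quality bases from the start and the end of the read.
    while start < end and quals[start] < lead:
        start += 1
    while end > start and quals[end - 1] < trail:
        end -= 1
    # Cut the read at the first window whose mean quality is below the threshold.
    for i in range(start, end - window + 1):
        if sum(quals[i:i + window]) / window < min_avg:
            end = i
            break
    # Discard reads that are too short after trimming.
    return (start, end) if end - start >= min_len else None

# Toy read: two bad leading bases, a low-quality stretch, one bad trailing base.
quals = [2, 2] + [30] * 40 + [10] * 4 + [30] * 10 + [2]
print(trim_read(quals))      # (2, 42)
print(trim_read([30] * 20))  # None, shorter than 36 bases
```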

Estimation of dN/dS ratio

The numbers of synonymous and non-synonymous substitutions were computed in each line. The dN/dS ratio was then estimated by taking into account the numbers of synonymous and non-synonymous substitutions expected if all codon positions in the reference genome had mutated. We used a bootstrap approach to test whether the dN/dS ratio is significantly different from 1. dN/dS was computed using randomized data sets in which the mutations were randomly sampled with replacement among six types of non-synonymous and six types of synonymous mutations (two possible transitions and four possible transversions).
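As a rough illustration of this kind of test, the sketch below computes dN/dS from observed counts and expected numbers of sites, and derives a p-value by resampling mutations under the neutral expectation. It is a deliberately simplified version that assumes a single mutation class; it does not distinguish the six mutation types used in the study. The function names, arguments, and toy counts are hypothetical.

```python
import random

def dn_ds(n_nonsyn, n_syn, sites_nonsyn, sites_syn):
    """Non-synonymous over synonymous substitutions, each per available site."""
    return (n_nonsyn / sites_nonsyn) / (n_syn / sites_syn)

def resampling_p_value(n_nonsyn, n_syn, sites_nonsyn, sites_syn,
                       n_rep=2000, seed=1):
    """Two-sided p-value for dN/dS = 1, resampling mutations under neutrality."""
    rng = random.Random(seed)
    n_total = n_nonsyn + n_syn
    # Under neutrality a mutation is non-synonymous in proportion to the sites.
    p_nonsyn = sites_nonsyn / (sites_nonsyn + sites_syn)
    observed = abs(n_nonsyn - n_total * p_nonsyn)
    extreme = 0
    for _ in range(n_rep):
        sim = sum(rng.random() < p_nonsyn for _ in range(n_total))
        if abs(sim - n_total * p_nonsyn) >= observed:
            extreme += 1
    return extreme / n_rep

# Toy counts (NOT from the study): 30 non-synonymous and 10 synonymous
# mutations with three times as many non-synonymous sites gives dN/dS = 1.
print(dn_ds(30, 10, 3.0, 1.0))
print(resampling_p_value(30, 10, 3.0, 1.0))
```

The deviation of the non-synonymous count from its expectation is used as the test statistic here, which avoids dividing by zero in resampled data sets that happen to contain no synonymous mutations.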

Analysis of colony size and number of mutations

We determined for each time point (Figure 5A) the number of mutations that have accumulated in each of the 12 lines, as well as the corresponding colony size. After exclusion of one line due to contamination we were left with 103 measurements of 11 lines between 3 and 39 days. We determined the change in colony size and the change in the number of mutations over time by fitting a mixed-effect linear model to the data. We fit a fixed effect slope to the data that describes the effects common to all lines, and the model also considers line-specific variability by including random effects b_i for the intercept and slope of the i-th line:

y_i = X_i β + Z_i b_i + ε_i,   b_i ∼ N(0, Ψ),   ε_i ∼ N(0, σ² I)

where y_i is the vector of measurements for the i-th line, β is the vector of fixed effects, X_i and Z_i are known fixed effect and random effect regressor matrices, ε_i is the within-group error with a spherical Gaussian distribution, and Ψ is the variance-covariance matrix of the random effects.

Two types of determination coefficients (R²) can be calculated for mixed effect regression models. The marginal R_m² represents the variance explained by the fixed effects of the model, whereas the conditional R_c² represents the variance explained by the entire model (with both fixed and random effects). The r.squaredGLMM function of the R package MuMIn was used to calculate R_m² and R_c². In both the model for the change in colony size over time and the model for the change in the number of mutations, the addition of a random effect for the slope significantly improves the fit compared to a model with only a random effect for the intercept (likelihood ratio tests; colony size model: p = 0.0017; number of mutations model: p < 0.0001).
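For orientation, the marginal and conditional coefficients are commonly written in terms of variance components as below. The symbols are notation introduced here for illustration and do not appear in the original text: σ_f² is the variance of the fixed-effect predictions, σ_r² the variance attributed to the random effects (averaged over observations when random slopes are present), and σ_ε² the residual variance.

```latex
R_m^2 = \frac{\sigma_f^2}{\sigma_f^2 + \sigma_r^2 + \sigma_\varepsilon^2},
\qquad
R_c^2 = \frac{\sigma_f^2 + \sigma_r^2}{\sigma_f^2 + \sigma_r^2 + \sigma_\varepsilon^2}
```

Because σ_r² cannot be negative, the conditional coefficient is always at least as large as the marginal one for the same model.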

Effect of mutations on the colony size

The difference in colony size (Δc) of two consecutive expansions on agar plates, each of these expansions lasting for three days, was calculated for all lines and the mutations that accumulated during this period were determined. Only non-synonymous, frameshift, and non-sense mutations were considered, and for each Δc, the number of mutations (M) in every gene was determined. M has the same number of rows as the change of colony size Δc and 888 columns, one for every gene that had at least one mutation during the experiment. We used a regression approach to model the change in colony size Δc with the number of mutations in the genes M:

Δc = Mβ + ε

where β is the vector of per-gene mutation effects and ε is the vector of residuals.

To avoid overfitting due to the high dimensionality of M, ridge regression was used to estimate the effect of a mutation on colony size in a given gene. If a mutation in a gene has no effect on the colony size, ridge regression shrinks the coefficient close to zero. Positive coefficients indicate an increase of colony size and negative coefficients indicate a decrease. The shrinking of the parameters is controlled by the regularization parameter λ, whose value was chosen by 3-fold cross-validation using the cv.glmnet function of the glmnet package.
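The shrinkage behaviour of ridge regression can be seen in a small self-contained sketch. It minimises the standard objective ||Δc − Mβ||² + λ||β||² through the normal equations, without an intercept or standardisation, so its λ is not on the same scale as the one reported by glmnet, and no cross-validation is performed. The helper names and the toy matrix are hypothetical and are not data from the study.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x

def ridge(M, dc, lam):
    """Ridge estimate: solve (M'M + lam * I) beta = M'dc."""
    p = len(M[0])
    gram = [[sum(row[i] * row[j] for row in M) + (lam if i == j else 0.0)
             for j in range(p)] for i in range(p)]
    rhs = [sum(row[i] * y for row, y in zip(M, dc)) for i in range(p)]
    return solve(gram, rhs)

# Toy design: 4 growth intervals (rows) x 3 genes (columns), mutation counts.
M = [[1, 0, 0],
     [0, 1, 0],
     [1, 1, 0],
     [0, 0, 1]]
dc = [-0.5, 0.2, -0.3, 0.0]             # made-up changes in colony size

beta_ols = ridge(M, dc, lam=0.0)        # no penalty: ordinary least squares
beta_strong = ridge(M, dc, lam=100.0)   # strong penalty: effects shrink to ~0
print(beta_ols, beta_strong)
```

In this toy example the third gene has no effect on Δc, so its coefficient is zero with or without the penalty, while the other two coefficients shrink as λ grows.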

Gene ontology enrichment test

We tested if there was a signal of adaptation during different periods of the experiment by using a gene ontology (GO) enrichment analysis where we only used non-synonymous, frameshift and nonsense mutations in each time period. The test was performed with the topGO package for R (48) on the genes that were detected to have a positive coefficient in the ridge regression. The resulting list of genes was used separately to perform a Fisher's exact test to determine significantly over-represented GO terms. The weight01 algorithm used in the topGO analysis iteratively removes the genes mapped to significant GO terms from higher-level GO terms, and the significance scores of connected nodes are compared to detect the locally most significant terms in the GO graph by down-weighting genes in less significant neighbors. The GO enrichment was applied separately to the following time periods: days 0–12, days 12–21, days 21–30, and days 30–39.
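The core of such an enrichment test is a one-sided Fisher's exact test on the overlap between the selected genes and the genes annotated to a GO term. The sketch below shows only this classic test as a hypergeometric tail probability. It does not reproduce the weight01 re-weighting of topGO, and the function name and toy numbers are hypothetical.

```python
from math import comb

def enrichment_p_value(n_genes, n_in_term, n_selected, n_overlap):
    """P(overlap >= n_overlap) when n_selected genes are drawn at random."""
    upper = min(n_in_term, n_selected)
    total = comb(n_genes, n_selected)
    tail = sum(comb(n_in_term, k) * comb(n_genes - n_in_term, n_selected - k)
               for k in range(n_overlap, upper + 1))
    return tail / total

# Toy example: 10 genes in total, 5 annotated to the term, 5 selected genes,
# all 5 of which fall in the term.
print(enrichment_p_value(10, 5, 5, 5))   # 1/252, about 0.004
```

When many GO terms are tested, the resulting p-values need a multiple-testing correction, which is why q-values rather than raw p-values are reported for enriched terms.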

Parmesan C, Gaines S, Gonzalez L, Kaufman DM, Kingsolver J, Townsend Peterson A, et al. Empirical perspectives on species borders: from traditional biogeography to global change. Oikos. 2005;108(1):58-75.
Thomas CD, Bodsworth EJ, Wilson RJ, Simmons AD, Davies ZG, Musche M, et al. Ecological and evolutionary processes at expanding range margins. Nature. 2001;411(6837):577-81.
Pateman RM, Hill JK, Roy DB, Fox R, Thomas CD. Temperature-Dependent Alterations in Host Use Drive Rapid Range Expansion in a Butterfly. Science. 2012;336(6084):1028-30.
Shine R, Brown GP, Phillips BL. An evolutionary process that assembles phenotypes through space rather than through time. Proceedings of the National Academy of Sciences of the United States of America. 2011;108(14):5708-11.
Excoffier L, Foll M, Petit RJ. Genetic Consequences of Range Expansions. Annual Review of Ecology Evolution and Systematics. 2009;40:481-501.
Waters JM, Fraser CI, Hewitt GM. Founder takes all: density-dependent processes structure biodiversity. Trends Ecol Evol. 2013;28(2):78-85.
Klopfstein S, Currat M, Excoffier L. The fate of mutations surfing on the wave of a range expansion. Mol Biol Evol. 2006;23(3):482-90.
Hallatschek O, Nelson DR. Life at the Front of an Expanding Population. Evolution. 2010;64(1):193-206.
Hallatschek O, Nelson DR. Gene surfing in expanding populations. Theor Popul Biol. 2008;73(1):158-70.
Hallatschek O, Hersen P, Ramanathan S, Nelson DR. Genetic drift at expanding frontiers promotes gene segregation. P Natl Acad Sci USA. 2007;104(50):19926-30.
Travis JMJ, Munkemuller T, Burton OJ, Best A, Dytham C, Johst K. Deleterious mutations can surf to high densities on the wave front of an expanding population. Molecular Biology and Evolution. 2007;24(10):2334-43.
Peischl S, Dupanloup I, Kirkpatrick M, Excoffier L. On the accumulation of deleterious mutations during range expansions. Mol Ecol. 2013;22(24):5972-82.
Bosshard L, Dupanloup I, Tenaillon O, Bruggmann R, Ackermann M, Peischl S, et al. Accumulation of Deleterious Mutations During Bacterial Range Expansions. Genetics. 2017;207(2):669-84.
Phillips BL, Brown GP, Webb JK, Shine R. Invasion and the evolution of speed in toads. Nature. 2006;439(7078):803.
Bosshard L, Peischl S, Ackermann M, Excoffier L. Mutational and Selective Processes Involved in Evolution during Bacterial Range Expansions. Mol Biol Evol. 2019;36(10):2313-27.
Deforet M, Carmona-Fontaine C, Korolev KS, Xavier JB. Evolution at the Edge of Expanding Populations. American Naturalist. 2019;194(3):291-305.
Lang GI, Rice DP, Hickman MJ, Sodergren E, Weinstock GM, Botstein D, et al. Pervasive genetic hitchhiking and clonal interference in forty evolving yeast populations. Nature. 2013;500(7464):571-+.
Kvitek DJ, Sherlock G. Whole Genome, Whole Population Sequencing Reveals That Loss of Signaling Networks Is the Major Adaptive Strategy in a Constant Environment. Plos Genet. 2013;9(11).
Tenaillon O, Barrick JE, Ribeck N, Deatherage DE, Blanchard JL, Dasgupta A, et al. Tempo and mode of genome evolution in a 50,000-generation experiment. Nature. 2016;536(7615):165-+.
Miller CR, Joyce P, Wichman HA. Mutational Effects and Population Dynamics During Viral Adaptation Challenge Current Models. Genetics. 2011;187(1):185-202.
Eldredge N, Thompson JN, Brakefield PM, Gavrilets S, Jablonski D, Jackson JBC, et al. The dynamics of evolutionary stasis. Paleobiology. 2005;31(2):133-45.
Blount ZD, Borland CZ, Lenski RE. Historical contingency and the evolution of a key innovation in an experimental population of Escherichia coli. P Natl Acad Sci USA. 2008;105(23):7899-906.
Pagel M, Venditti C, Meade A. Large punctuational contribution of speciation to evolutionary divergence at the molecular level. Science. 2006;314(5796):119-21.
Keseler IM, Collado-Vides J, Santos-Zavaleta A, Peralta-Gil M, Gama-Castro S, Muniz-Rascado L, et al. EcoCyc: a comprehensive database of Escherichia coli biology. Nucleic Acids Res. 2011;39(Database issue):D583-90.
Grant CR, Wan J, Komeili A. Organelle Formation in Bacteria and Archaea. Annu Rev Cell Dev Biol. 2018;34:217-38.
Elena SF, Lenski RE. Evolution experiments with microorganisms: The dynamics and genetic bases of adaptation. Nat Rev Genet. 2003;4(6):457-69.
Good BH, McDonald MJ, Barrick JE, Lenski RE, Desai MM. The dynamics of molecular evolution over 60,000 generations. Nature. 2017;551(7678):45-50.
Silander OK, Tenaillon O, Chao L. Understanding the evolutionary fate of finite populations: the dynamics of mutational effects. PLoS Biol. 2007;5(4):e94.
Barrick JE, Yu DS, Yoon SH, Jeong H, Oh TK, Schneider D, et al. Genome evolution and adaptation in a long-term experiment with Escherichia coli. Nature. 2009;461(7268):1243-7.
Barrick JE, Lenski RE. Genome dynamics during experimental evolution. Nat Rev Genet. 2013;14(12):827-39.
Peischl S, Dupanloup I, Kirkpatrick M, Excoffier L. On the accumulation of deleterious mutations during range expansions. Mol Ecol. 2013;22(24):5972-82.
Peischl S, Kirkpatrick M, Excoffier L. Expansion load and the evolutionary dynamics of a species range. Am Nat. 2015;185(4):E81-93.
Travis JM, Munkemuller T, Burton OJ, Best A, Dytham C, Johst K. Deleterious mutations can surf to high densities on the wave front of an expanding population. Mol Biol Evol. 2007;24(10):2334-43.
Peischl S, Gilbert KJ. Evolution of Dispersal Can Rescue Populations from Expansion Load. The American Naturalist.0(0):000-.
Oldewurtel ER, Kouzel N, Dewenter L, Henseler K, Maier B. Differential interaction forces govern bacterial sorting in early biofilms. Elife. 2015;4.
Serra DO, Richter AM, Klauck G, Mika F, Hengge R. Microanatomy at cellular resolution and spatial order of physiological differentiation in a bacterial biofilm. MBio. 2013;4(2):e00103-13.
Hobley L, Harkins C, MacPhee CE, Stanley-Wall NR. Giving structure to the biofilm matrix: an overview of individual strategies and emerging common themes. Fems Microbiol Rev. 2015;39(5):649-69.
Ferenci T. Trade-off Mechanisms Shaping the Diversity of Bacteria. Trends Microbiol. 2016;24(3):209-23.
Lambert G, Estevez-Salmeron L, Oh S, Liao D, Emerson BM, Tlsty TD, et al. An analogy between the evolution of drug resistance in bacterial communities and malignant tissues. Nat Rev Cancer. 2011;11(5):375-U138.
Greaves M, Maley CC. Clonal evolution in cancer. Nature. 2012;481(7381):306-13.
Ling S, Hu Z, Yang Z, Yang F, Li Y, Lin P, et al. Extremely high genetic diversity in a single tumor points to prevalence of non-Darwinian cell evolution. Proc Natl Acad Sci U S A. 2015;112(47):E6496-505.
Ibrahim-Hashim A, Robertson-Tessi M, Enriquez-Navas PM, Damaghi M, Balagurunathan Y, Wojtkowiak JW, et al. Defining Cancer Subpopulations by Adaptive Strategies Rather Than Molecular Properties Provides Novel Insights into Intratumoral Evolution. Cancer Res. 2017;77(9):2242-54.
Lorenzi T, Venkataraman C, Lorz A, Chaplain MAJ. The role of spatial variations of abiotic factors in mediating intratumour phenotypic heterogeneity. J Theor Biol. 2018;451:101-10.
Foster PL, Hanson AJ, Lee H, Popodi EM, Tang H. On the mutational topology of the bacterial genome. G3 (Bethesda). 2013;3(3):399-407.
Yang W. Structure and function of mismatch repair proteins. Mutat Res-DNA Repair. 2000;460(3-4):245-56.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114-20.
Deatherage DE, Barrick JE. Identification of mutations in laboratory-evolved microbes from next-generation sequencing data using breseq. Methods Mol Biol. 2014;1151:165-88.
Alexa A, Rahnenfuhrer J, Lengauer T. Improved scoring of functional groups from gene expression data by decorrelating GO graph structure. Bioinformatics. 2006;22(13):1600-7.

Ethical approval and consent to participate

Not applicable

Consent for publication

Not applicable

Availability of data and material

The datasets used and analyzed during the current study are available from the corresponding author on reasonable request.

Competing interests

The authors have no competing financial interests.

Funding

LB was supported by a Swiss NSF grant No. 310030B-166605 to LE.

Authors’ contributions

L.B., S.P., M.A. and L.E. designed the study. L.B. performed laboratory experiments. L.B., S.P., and L.E. analyzed the data. L.B., S.P., M.A., and L.E. prepared the manuscript.

Acknowledgements

We are grateful to Tosso Leeb, Cord Drögemüller and the NGS core facility of the University of Berne for their support.

BosshardetalSI.docx

Editorial decision: Minor revision
30 Jan, 2020
Review #1 received at journal
28 Jan, 2020
Review #2 received at journal
28 Jan, 2020
Reviewer #1 agreed at journal
12 Jan, 2020
Reviewer #2 agreed at journal
12 Jan, 2020
Reviewers invited by journal
09 Jan, 2020
Editor assigned by journal
23 Dec, 2019
Submission checks completed at journal
22 Dec, 2019
Editor invited by journal
22 Dec, 2019
First submitted to journal
20 Dec, 2019
