The ghost of oysters past: museomics reveals isolation, low diversity and adaptive signatures of an extinct oyster population

doi:10.21203/rs.3.rs-3873137/v1

This work is licensed under a CC BY 4.0 License

Understanding the factors that predispose species and populations to decline and extinction is a major challenge of biodiversity research. In the present study, we investigated the historical population genomics of an extinct oyster population from the Wadden Sea collected between 1868 and 1888, and compared it to French and British populations sampled at the same time. The Wadden Sea is a unique habitat on the northern edge of the European oyster distribution. Our museomic results indicate that the now-extinct population was genetically isolated and had a lower nuclear genomic diversity than the examined French and British populations. Furthermore, genome scans revealed signatures of local adaptation, and population-specific divergence in several loci linked to fitness-relevant traits. The Wadden Sea oysters may have been predisposed for extinction because they were not naturally replenished from other populations, and the small population size did not allow them to adapt to anthropogenically-driven environmental change. In addition, anthropogenic translocations could not successfully replenish or replace this population because these foreign oysters may have been unable to reproduce in the unique Wadden Sea habitat. In summary, the Wadden Sea oysters exhibited all intrinsic drivers expected in a population predisposed for extinction.

Biological sciences/Evolution/Evolutionary genetics

Biological sciences/Biological techniques/Sequencing/Next generation sequencing

Biological sciences/Ecology/Conservation

Earth and environmental sciences/Ecology/Molecular ecology

Throughout the anthropocene, biodiversity has been lost at a rapid rate, ultimately leading to the extinction of thousands of species ^1–3. Before species go extinct globally, local populations are lost over time ^3–5. Identifying factors that predispose populations to extinction will benefit conservation biology and aid in the identification of at-risk biodiversity ⁶. Main anthropogenic drivers of biodiversity loss are climate change, overexploitation, habitat degradation, increasing disease prevalence and non-native species ^3,7,8. Intrinsic factors that are considered to expedite local extinction are small population size, genetic isolation and local adaptation ^9,10. Isolated populations have lower genetic diversity than well-connected meta-populations, and lack an inflow of potentially beneficial variants ^11,12. Low genetic diversity limits the potential for the timely adaptation to rapid environmental changes, or facilitates inbreeding depression ^12–14. The same factors may prevent recolonization because propagules are not dispersed to isolated habitats, and/or do not have the genetic adaptations required for successful recolonization ⁹.

Population genomic comparisons of extinct and extant populations can identify likely intrinsic drivers of extinction without the confounding effects of evolutionary history that hamper interspecies comparisons. Recent advances in ancient and historical DNA sequencing and data analysis now make such comparisons feasible, but the few existing studies do not confirm the expected population genetic patterns. Two extinct populations of the Pookila had genomic diversity comparable to contemporary extant populations, though overall population connectivity is limited ¹⁵. Similarly, a microsatellite analysis of herbarium specimens revealed similar genetic diversity in an extinct and a still-extant mustard population that were well connected by gene flow, though the extinct population declined in diversity prior to extinction ¹⁶.

The decline of keystone species and ecosystem engineers in particular has large ecological and economic impacts, altering ecosystem functions and services substantially ¹⁷. One such group of declining ecosystem engineers is the oysters ¹⁸. Like many other oyster species, the European oyster Ostrea edulis L. has decreased in abundance throughout much of its range in the past 150 years ^19,20. Most drastically, it went locally extinct in the German and Danish Wadden Sea around 1930, presumably due to a combination of overfishing, harsh winters and disease ²¹. Curiously, it has not yet recolonized this habitat, even though harvesting of oysters ceased decades ago, harsh winters have become the exception and much of the Wadden Sea is now a protected nature reserve.

Genetic isolation of the Wadden Sea population could have played a role in its demise. Analyses of mitochondrial genomes extracted from 150-year-old European oyster shells, including shells from the now-extinct Wadden Sea population, showed that British and French oysters were not differentiated from each other, whereas Wadden Sea oysters had a distinct genetic makeup with one private haplogroup (Fig. 1A) ²². As mitochondrial genomes presumably evolve neutrally, this indicates limited connectivity of the Wadden Sea population to the French-British populations. The extant populations are split into five geographic groups (Fig. 1B): a Scandinavian-Dutch group in the North Sea, a French-British group, a northern Iberian group, a southern Iberian-Mediterranean group and a Black Sea group ^20,23–27. The now-extinct Wadden Sea population was in the geographic center of the North Sea group, and could have belonged to this group. The five groups are delimited by oceanographic fronts and biogeographic barriers, which likely limit larval dispersal ²⁶. At the same time, this does not preclude the possibility of local adaptation at certain nuclear loci.

One example indicating local adaptation in this species is the allozyme locus arginine kinase (ARK), which shows a strong geographic cline throughout the range of European oysters ^26,27. In another example, several SNPs had similar allele frequencies in the North Sea and the Black Sea, but differed strongly from the allele frequencies observed in the Atlantic and Eastern Mediterranean ²⁵. Two hypotheses may explain this pattern: ancient gene flow between the Black Sea and the North Sea populations, and/or convergent adaptation at these or linked loci.

Understanding the genetic diversity, the potential levels of local adaptation and the geographic isolation of the now-extinct Wadden Sea population in the context of populations that survived until today is challenging, as it requires one to “travel back in time”. Natural history collections represent one of the few resources to investigate extinct and historical biodiversity; they house historical specimens of many extinct and endangered species ^28,29. A large collection of historical European oyster shells from across Europe is housed at the Zoological Museum in Kiel, Germany ^30,31. These shells were collected by the German zoologist Karl-August Möbius between 1868 and 1888, prior to the large-scale decline and local extinction of the European oyster. The dawn of museomics makes it possible to extract genome-scale data from these specimens ²². In the present study, we expanded these data to infer historical nuclear genomic diversity, population differentiation, and signatures of local adaptation ^32–35.

Genome assembly and annotation

The 10x Genomics Chromium sequencing run generated a total of 623 million barcoded reads. The final O. edulis draft assembly has a size of 967 Mbp, with 14,796 scaffolds larger than 10 Kbp and a scaffold N50 of 77.76 Kbp. The final annotation comprises a total of 16,615 gene models and 45,064 predicted transcripts. A search using BUSCO v5 ³⁶ (metazoa_odb10) indicated that this gene set contains 91.2% of the single-copy universal metazoan genes (78.5% complete and 12.7% fragmented).
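As a reminder of what the scaffold N50 reported above measures, the following is a minimal sketch that assumes the usual definition: the length of the scaffold at which the cumulative length, counted from the longest scaffold downwards, first reaches half of the total assembly size. The function name `n50` and the toy lengths are illustrative only and are not taken from the assembly pipeline.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold lengths (in bp)."""
    if not lengths:
        raise ValueError("need at least one scaffold length")
    half_total = sum(lengths) / 2
    running = 0
    # Walk from the longest scaffold downwards until half of the
    # total assembly length has been accumulated.
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= half_total:
            return length


# Toy example: total length 20, so the threshold of 10 is reached at 6 + 5.
toy_n50 = n50([2, 3, 4, 5, 6])
```

Because N50 depends only on the length distribution, it says nothing about assembly correctness; it is best read together with the BUSCO completeness figures.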

Whole genome shotgun sequencing, quality control and mapping

The two sequencing efforts for each of the 26 oysters collected between 1868 and 1888 across Europe generated 6,289,842 to 75,541,394 reads per run (Supplementary Table S1 for sampling information). Between 9.357% and 83.352% (median = 36.488%) of the reads mapped to the draft nuclear genome of O. edulis. After combining the runs for each sample, median genome coverage was 0.4478x, and ranged from 0.1008 to 2.5337x (Supplementary Table S2). A total of 243,005 sites passed our genotype filters and were used in the subsequent analyses. The historical mitochondrial genomes had a median coverage of 24.24x (range: 0.86–549.38), and 19 samples were covered with at least 4x across more than 90% of the genome.

Neutral genetic structure

Omitting samples with low coverage or the genomic scaffolds with adaptive signatures did not change results of the principal component analysis (PCA) qualitatively (Supplementary Fig. S1), and we thus present the results for all 26 samples and all genomic regions. Low and ultra-low coverage genomes have previously emerged as a resource for trait association mapping, population genomics and the identification of adaptation ^37–40. In the PCA, genomes of the now-extinct Wadden Sea population clustered tightly with each other, while the British and French oysters were less differentiated (Fig. 2A). The two Dutch oysters clustered with the Wadden Sea and French-British populations, respectively (Fig. 2A). The present-day oyster from the Limfjord collected in 2018 had an intermediate position between the Wadden Sea oysters and the French-British oysters. For the admixture analysis, K = 3 had the highest delta K value (1135.6372), followed by K = 5 (927.8341), K = 2 (912.6502) and K = 4 (895.1018). The admixture results were concordant with the PCA results: the Wadden Sea oysters belonged predominantly to the first ancestral population, the British oysters to the second and third ancestral populations, and the French oysters to the third ancestral population (Fig. 2B,C). We tested for Hardy-Weinberg equilibrium (HWE) and inbreeding in the Wadden Sea and French-British populations separately. Of 634,136 SNPs, 29,481 (4.65%) had a significant inbreeding coefficient F and departure from HWE in the French-British population, and 18,875 (2.97%) in the Wadden Sea population. Only four of these SNPs deviated from HWE in both populations.
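To make the per-site inbreeding coefficient F more concrete, here is a minimal sketch based on hard genotype counts at one bi-allelic site, using the textbook definition F = 1 − H_obs / H_exp. This is an illustration only: the analysis described here works on genotype likelihoods from low-coverage data rather than called genotypes, and the function name and arguments are hypothetical.

```python
def inbreeding_coefficient(n_hom_ref, n_het, n_hom_alt):
    """Per-site F = 1 - H_obs / H_exp from genotype counts.

    Returns None for monomorphic sites, where F is undefined.
    """
    n = n_hom_ref + n_het + n_hom_alt
    if n == 0:
        raise ValueError("no genotypes at this site")
    # Frequency of the reference allele among the 2n sampled alleles.
    p = (2 * n_hom_ref + n_het) / (2 * n)
    h_exp = 2 * p * (1 - p)   # heterozygosity expected under HWE
    if h_exp == 0:
        return None
    h_obs = n_het / n         # observed heterozygosity
    return 1 - h_obs / h_exp
```

Positive values indicate a deficit of heterozygotes relative to Hardy-Weinberg expectations (as expected under inbreeding), while negative values indicate an excess.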

The phylogenetic tree reconstructed for the 19 mitochondrial genomes that passed our quality checks was congruent with the reconstruction of Hayer et al. ²². Most of the mitochondrial genomes fell into one of three distinct clades: one clade contained only Wadden Sea oysters (the “WS” clade of Hayer et al. 2021), one only French-British oysters (“NEA” clade), and the third clade contained British, Wadden Sea and Dutch oysters (“NS” clade) (Fig. 3A). The mitochondrial nucleotide diversity of the Wadden Sea population (0.010500, n = 7) was not significantly different from the British population (0.010613, n = 6), or the combined French-British population (0.007619, n = 10) (Fig. 3B). The French population alone had a lower nucleotide diversity (0.003459, n = 4), as it did not contain any haplotypes of the most divergent NS clade (Fig. 3A). The Dutch population had the highest nucleotide diversity (0.022170, n = 2) but the confidence in this estimate is low given the small sample size (Fig. 3B).

Genomic signatures of selection

For all genome scans, we compared the Wadden Sea population to the combined French-British population based on the results of the admixture analysis. The genome-wide median of Tajima’s D was −0.425 for the Wadden Sea population and −1.013 for the French-British population (Fig. 4A). Plotting Tajima’s D values for each population against each other revealed a cluster of 818 values that were either relatively low in the Wadden Sea population, or relatively high in the French-British population (Fig. 4A). These outliers clustered on 46 genomic scaffolds. Genome-wide Watterson’s theta was 324.74 for the French-British population, and 209.03 for the Wadden Sea population. Plotting Watterson’s theta values between populations revealed a similar cluster of outliers as for Tajima’s D, but with a less clear separation from the overall distribution (Fig. 4B).
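For readers less familiar with the two statistics, the sketch below computes Watterson's theta and Tajima's D from their classical definitions (Watterson 1975; Tajima 1989), given the number of sampled sequences, the number of segregating sites and the mean number of pairwise differences. It assumes known counts from called genotypes; the values reported here were instead estimated from genotype likelihoods, so this is an illustration of the formulas and not of the software used. Function and argument names are hypothetical.

```python
from math import sqrt


def watterson_theta(n, seg_sites):
    """Watterson's estimator: S divided by the harmonic number a1."""
    a1 = sum(1 / i for i in range(1, n))
    return seg_sites / a1


def tajimas_d(n, seg_sites, mean_pairwise_diff):
    """Tajima's D for n sequences, S segregating sites and pairwise diversity."""
    if n < 2 or seg_sites == 0:
        return None  # D is undefined without variation
    a1 = sum(1 / i for i in range(1, n))
    a2 = sum(1 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n ** 2 + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    # Difference between the two theta estimators, scaled by its
    # estimated standard deviation.
    numerator = mean_pairwise_diff - seg_sites / a1
    return numerator / sqrt(e1 * seg_sites + e2 * seg_sites * (seg_sites - 1))
```

A negative D means that pairwise diversity is lower than expected from the number of segregating sites, that is, an excess of rare variants, which is the pattern expected after a selective sweep or a population expansion.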

The genome-wide unweighted Fst was 0.030766, and the weighted Fst was 0.052661. The distribution of sliding window Fst values was bimodal, with the majority of values centering around the genome-wide median (Fig. 4E). However, of the 47,703 sliding windows, 1,197 had Fst values that were significant outliers, centering around 0.22 (Fig. 4E). Of the 14,796 genomic fragments that make up the draft genome, the high Fst values clustered on 49 of these fragments. The outliers were exclusively found on fragments of intermediate genetic diversity in the Wadden Sea population (Fig. 4C) and relatively high genetic diversity in the French-British populations (Fig. 4D). This means that the genetic diversity surrounding these highly differentiated regions was lower in the Wadden Sea population than in the French-British population, indicative of selective sweeps in the Wadden Sea population. Outliers from Fst, Tajima’s D and theta occurred concomitantly on 40 genomic scaffolds that were between 70,975 and 1,041,195 bp long. These scaffolds contain the strongest candidates for local adaptation.
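The difference between the unweighted and the weighted Fst can be illustrated with a small sketch. It assumes that "unweighted" means the average of per-site ratios and "weighted" means the ratio of summed numerators to summed denominators, which is the common convention. For simplicity the sketch uses Hudson's estimator on allele frequencies (as formulated by Bhatia et al. 2013), a stand-in for the likelihood-based estimator actually used in this study; all names are hypothetical.

```python
def hudson_components(p1, p2, n1, n2):
    """Numerator and denominator of Hudson's Fst at one bi-allelic site.

    p1, p2: allele frequencies in the two populations
    n1, n2: number of sampled chromosomes in each population
    """
    num = ((p1 - p2) ** 2
           - p1 * (1 - p1) / (n1 - 1)
           - p2 * (1 - p2) / (n2 - 1))
    den = p1 * (1 - p2) + p2 * (1 - p1)
    return num, den


def fst_summary(freqs1, freqs2, n1, n2):
    """Return (unweighted, weighted) Fst across sites."""
    parts = [hudson_components(a, b, n1, n2) for a, b in zip(freqs1, freqs2)]
    parts = [(num, den) for num, den in parts if den > 0]
    # Unweighted: average of per-site ratios.
    unweighted = sum(num / den for num, den in parts) / len(parts)
    # Weighted: ratio of sums, so variable sites contribute more.
    weighted = sum(num for num, _ in parts) / sum(den for _, den in parts)
    return unweighted, weighted
```

The weighted form is less sensitive to sites with very low diversity, which is one reason the two genome-wide values can differ.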

A total of 524 mRNAs were functionally annotated within the 40 outlier regions. These annotations belonged to 297 different genes. Summarizing these genes by molecular function revealed that the majority had catalytic or binding activity (Fig. 5A). The most commonly discerned biological processes were cellular and metabolic but the outlier regions also contained genes responsible for immune system processes, biological adhesion, interspecies interaction and reproduction, all of which could be relevant for local adaptation (Fig. 5B). Lastly, the outlier regions contained genes for a large number of pathways, including stress and immune response pathways, and pathways related to disease in humans (Alzheimer, Parkinson, Huntington) (Fig. 5C).

The evolutionary history and demography of extinct biodiversity can elucidate processes and mechanisms that drive extinctions and inform us about the most vulnerable parts of biodiversity. We investigated the population genomic and evolutionary makeup of an extinct oyster population in the Wadden Sea, and compared it to historical populations sampled at the same time in France, England and the Netherlands, where oysters are still extant.

The now-extinct Wadden Sea population was small and isolated

The historical nuclear genomes clustered by marine basins, indicative of limited gene flow between them. The Wadden Sea oysters were highly divergent from the remaining oysters, and clustered tightly with each other. Limited hydrographic connectivity to Britain and France likely isolated the Wadden Sea population ⁴¹, concordant with results for historical mitochondrial genomes of the European oyster ²², and other marine invertebrate species ^42–46. Only one oyster from the Wadden Sea with an admixed genomic makeup hints at natural or anthropogenic dispersal of English oysters into the Wadden Sea. Few translocations have been reported into the Wadden Sea ^19,47–49, thus we assume natural dispersal until data on Wadden Sea translocations become available. The Wadden Sea diversity may also be native to the Netherlands given its geography, and the Dutch oyster that fell into the French-British cluster could be the result of massive restocking efforts with British oysters ^47,50. The French-British oysters were less well differentiated. Whether this is the result of natural dispersal or due to anthropogenic translocations remains unknown. Translocations of oysters have been carried out frequently from the 19th century on ⁴⁷, and possibly as early as Roman times ⁵¹. They could have already altered the genomic makeup of the oysters we investigated. On the other hand, a western and a northern European group are present in various other marine species, including eel, algae and crabs ^45,52,53.

The Wadden Sea population had lower nuclear genetic diversity than the French-British population and an overall more positive (though still negative) Tajima’s D. Low genetic diversity is correlated with higher extinction risk ^13,54, as populations with low genetic diversity may exhibit inbreeding depression, or may be unable to adapt to changing environmental conditions or emerging diseases. Inbreeding depression is unlikely to have played a role in the Wadden Sea oysters, as the French-British population had about 1.6 times as many SNPs with signs of inbreeding as the Wadden Sea population (29,481 versus 18,875). In other words, the French-British population appears to be more inbred than the Wadden Sea population. If the lower diversity of the Wadden Sea population played a role in its extinction, the likely process was the inability of this population to adapt to the rapidly changing environmental conditions of the Anthropocene, which include climate change, the introduction of novel pathogens and pollution.

The Wadden Sea population conveyed signatures of local adaptation

To understand the contribution of adaptive processes to the differentiation of the Wadden Sea population and the lack of recolonization, we conducted genome scans for signatures of selection. Parts of the genome have undergone more rapid divergence than the remainder of the genome, indicative of local adaptation. The relatively low Tajima's D and Watterson’s theta values in these same regions further point towards selective sweeps in the Wadden Sea population. Thus, local adaptation likely also contributed to the functional differentiation of the Wadden Sea population from the French-British population, and helped keep them isolated. In support of this finding, restocking efforts of the Wadden Sea population with oysters from France, Britain and the Netherlands in the 1930s could not revive the population ^19,55. This could have been caused by the inability of oysters from outside of the Wadden Sea to survive and reproduce in the Wadden Sea with its colder winters.

The outlier regions contained genes with diverse functions. Which of these genes were the actual targets of selection cannot be ascertained. Commonly identified pathways and biological processes with a possible role in local adaptation were related to immune system and stress response, adhesion and reproduction. Genes implicated in stress response, disease and immune system processes may be relevant for local adaptation and extinction of the Wadden Sea oysters. A disease outbreak could have caused the extinction of the Wadden Sea oysters but did not have the same effect on other populations, which responded differently to the disease. Adhesion of oyster larvae to their final substrate is a crucial step in the life history of oysters. Local adaptation of relevant genes could point towards differences in the adhesion process driven by differences in habitat. Given the more challenging, stressful conditions of the Wadden Sea with higher variance in temperature and desiccation, local adaptations to these conditions may be encoded in stress response genes. Ideally, these genome scans should be coupled with functional experiments that test the fitness of oysters with different genomic variants. The extinction of the Wadden Sea oysters prevents such studies.

Ancient gene flow as an alternative to adaptation?

Though the contemporary oyster from the Limfjord does not have the same genomic makeup as the Wadden Sea oysters, we may assume that the now-extinct Wadden Sea oysters belonged to the same genetic cluster as present-day oysters from Scandinavia and the Netherlands. In this case, our findings match the results of Lapègue et al. ²⁵. They identified a number of genomic SNPs that are highly differentiated between the North Sea and English Channel/Atlantic populations. Curiously, the same outlier regions with similar allele frequencies were also present in oysters from the Black Sea. In addition to convergent adaptation of the North Sea and Black Sea populations, they propose an alternative explanation for this genetic parallelism: historical gene flow between the Black Sea and the North Sea. Chromosomal inversions could have preserved the allele frequencies in the Wadden Sea population. If so, the presence of genomic outlier regions may not necessarily point towards adaptation, though adaptation certainly remains a possibility even in the face of ancient gene flow. The mitochondrial genome diversity could also point to such an old introgression event: we find one highly divergent clade that contains oysters from the Wadden Sea, England and the Netherlands. Given the high degree of divergence, this clade could have evolved in the Black Sea and been introduced into the North Sea during an ancient gene flow event. Further studies including additional historical Scandinavian and Black Sea populations are necessary to test this alternative hypothesis.

Reasons for the lack of recolonization of the Wadden Sea habitat

It appears curious that the Wadden Sea has never been recolonized from extant populations in its vicinity. We conclude that adaptations specific to the Wadden Sea population appear to have existed, preventing oysters from other populations from becoming established. The genomic dissimilarity between the Limfjord oyster and the historical Wadden Sea oysters suggests this. Alternatively, the larvae from surrounding populations simply never reached the Wadden Sea in sufficient quantities. The overall genetic isolation of the Wadden Sea oysters is an indication of this and is further corroborated by the currents in the region. The Limfjord has only a narrow opening to the North Sea and currents move predominantly northward from the German to the Danish Wadden Sea, thus the Danish Limfjord population is unlikely to be a natural donor of oyster larvae for the Wadden Sea ^56,57. Dutch oysters were also not available as larval donors: natural Dutch populations went extinct in the 1950s, and Dutch oyster fisheries have been sustained by means of seed oysters in areas that were closed off from the North Sea. Only very recently have European oysters been reported from Danish and Dutch offshore wind farms, hinting at a possible recolonization of the North Sea ^58,59.

Our analyses shed light on the historically present diversity of an extinct oyster population from the Wadden Sea, and compared it to historical diversity of French and British oysters. The Wadden Sea oysters appear to have been adapted to the challenging conditions of the Wadden Sea, which explains why this region could not be recolonized by oysters from other regions after the local extinction event. Furthermore, foreign oysters may not have been able to reach the Wadden Sea, as indicated by the genetic isolation of its now-extinct population. The extinction itself may have been facilitated by the small population size of the Wadden Sea population, which prevented this population from responding to the rapidly changing environmental conditions of the anthropocene. In summary, the Wadden Sea population fulfills all genetic expectations for populations that are most at risk for extinction.

Genome assembly and annotation

At the beginning of this work, a reference genome for the European oyster was not yet available, and our initial aim was the production of a nuclear genome assembly for O. edulis. In October 2018, we purchased a fresh individual at Limfjord (Denmark) and extracted DNA from 25 mg of soft tissue with the MagAttract HMW DNA Kit (Qiagen) following the manufacturer's protocol. Fragment length, which needed to exceed 50 Kbp, was verified on an Agilent TapeStation 4200, and one 10x Genomics Chromium library was prepared and sequenced on one Illumina HiSeq 4000 lane (Illumina, San Diego, CA, USA) ⁶⁰. The resulting reads were used to assemble a draft genome using Supernova v2.1.1 ⁶¹. In order to annotate this draft genome assembly, different evidence datasets were obtained from public repositories and used in a Nextflow-based pipeline developed in-house (https://github.com/ikmb/esga) ⁶². A thorough description of this pipeline can be found on the GitHub page, but in summary, the genome draft was initially repeat-masked using RepeatMasker v2.4.1 ^63,64. Next, C. gigas annotated proteins were downloaded from Ensembl (GCA902806645v1) ⁶⁵ and mapped against the repeat-masked genome using Spaln v2.4.6 and the taxon model “Echinode” ⁶⁶. The resulting models were considered high-confidence gene models. In parallel, all reviewed crustacean proteins were downloaded from UniProt ⁶⁷ and mapped using Spaln, while an O. edulis EST dataset (SRR3954443) was downloaded from NCBI and mapped using Minimap2 v2.22 ⁶⁸. These two sets of alignments were used as evidence-based hints to run Augustus v3.4.0 ^69,70. Finally, the resulting predicted gene models, together with the previously described high-confidence gene models, were passed to EVidenceModeler v1.1.1 ⁷¹ to obtain the final set of annotated gene structures.

Whole genome shotgun sequencing and mapping

DNA extractions in an ultra-clean facility, library preparation and initial shotgun sequencing were carried out by Hayer et al. ²² from historical oyster shells housed at the Zoological Museum in Kiel, Germany. These oysters were sampled alive along European coasts between 1868 and 1888, prior to the local extinction of the Wadden Sea population ³¹. Initially, the libraries were shotgun sequenced on 1/50th of an Illumina HiSeq lane each ²². We re-sequenced the 26 libraries with the highest endogenous DNA content on 1/20th of a lane each on the Illumina HiSeq 4000 platform (2 × 75 cycles) to generate higher genome coverage for nuclear genome analysis. The collection information for all analyzed specimens can be found in Supplementary Table S1.

De-multiplexing was performed by sorting reads according to their p7 and p5 combinations using the bcl2fastq software (Illumina, Inc.). Reads of the initial and the deeper sequencing runs were processed according to published protocols specific for aDNA using the EAGER pipeline ⁷². They were then mapped against the mitochondrial genome (GenBank acc. no. MT663266) and our newly assembled draft reference nuclear genome using CircularMapper and BWA in the EAGER pipeline with the default settings for aDNA reads ⁷². All duplicate reads were removed using DeDup version 0.12.2, part of the EAGER pipeline, with the default options. To verify the aDNA data sets, we evaluated the presence of postmortem DNA damage signatures from read alignments using mapDamage version 2.0.8 ³³.

In addition, we mapped the same Illumina reads that were used in the de novo assembly of the reference genome back to the reference nuclear genome and included this sample in the PCA. The aim was to understand whether the present-day Limfjord population resembles the historical Wadden Sea population. We used ANGSD version 0.929 ⁷³ for all subsequent nuclear genome analyses. For all analyses, we included only genotype likelihoods with p-values < 1e-6, mapping quality > 30, minQ > 25, and a minor allele frequency > 0.05 that were present in 20 or more individuals (option “minInd 20”). Results were visualized in the R environment ⁷⁴ using base functions and ggplot2 ⁷⁵.

Genetic structure

We estimated individual genetic distances using the single read sampling approach and conducted a principal component analysis (PCA) with the ‘eigen’ function in R based on the resulting distance matrix. We also estimated individual admixture proportions for 2 to 5 ancestral populations (K) using NGSAdmix ⁷⁶, excluding the present-day oyster from the Limfjord. To infer the most likely value of K, we replicated the analysis 10 times and estimated delta K ⁷⁷ using R scripts available in the online version of the Marine Genomics course of the University of California Davis (https://baylab.github.io/MarineGenomics/week-9--population-structure-using-ngsadmix.html#week-9--population-structure-using-ngsadmix). We plotted the results of the replicate with the highest log likelihood. We tested each bi-allelic site for Hardy-Weinberg equilibrium (HWE) and calculated the inbreeding coefficient F in each population separately.
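The delta K statistic can be sketched as follows. This is a minimal illustration of the Evanno-style calculation in its commonly used form (absolute second-order difference of the mean log likelihoods, divided by the standard deviation across replicates at that K); it is not the course R script referenced above, and the function name and input layout are assumptions. Note that in this form delta K is only defined for values of K that have both a lower and a higher neighbour among the tested values.

```python
from statistics import mean, stdev


def delta_k(loglik):
    """Evanno-style delta K.

    loglik maps each tested K to a list of replicate log likelihoods.
    Returns a dict {K: delta K} for every K with neighbours K-1 and K+1.
    """
    result = {}
    for k in sorted(loglik):
        if k - 1 not in loglik or k + 1 not in loglik:
            continue  # second-order difference needs both neighbours
        sd = stdev(loglik[k])
        if sd == 0:
            continue  # identical replicates: delta K is undefined
        second_diff = abs(mean(loglik[k + 1])
                          - 2 * mean(loglik[k])
                          + mean(loglik[k - 1]))
        result[k] = second_diff / sd
    return result
```

The K with the largest value marks the strongest change in the slope of the likelihood curve; `max(result, key=result.get)` would select it from the returned dictionary.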

For the mitochondrial genomes, we generated consensus sequences for all sites that were covered by more than four reads in Geneious Prime ⁷⁸. We excluded samples with more than 10% missing data across their mitochondrial genome. We aligned the consensus sequences to the reference genome using the “Map to reference” option, removed regions that were poorly covered by the majority of sequences as well as sequences that had no base call at more than 200 sites. We exported the final alignment as a fasta file. In R, we calculated nucleotide diversity with the function ‘nucleo.div’ and reconstructed a neighbor joining tree based on genetic distances (‘dist.dna’ of package ‘ape’) ⁷⁹.
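Nucleotide diversity can be illustrated with a short sketch: it is the average proportion of differing sites over all pairs of aligned sequences. The version below assumes that sites with a missing or ambiguous base in either sequence are skipped pair by pair (pairwise deletion), which is one of several possible ways to handle missing data and not necessarily the setting used for the estimates reported here. Function names are hypothetical.

```python
from itertools import combinations


def pairwise_difference(seq_a, seq_b, valid="ACGT"):
    """Proportion of differing sites, skipping sites without a base call."""
    compared = 0
    differences = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in valid and b in valid:
            compared += 1
            differences += a != b
    return differences / compared if compared else None


def nucleotide_diversity(sequences):
    """Mean pairwise proportion of differences across aligned sequences."""
    values = [pairwise_difference(a, b) for a, b in combinations(sequences, 2)]
    values = [v for v in values if v is not None]
    if not values:
        raise ValueError("no comparable sequence pairs")
    return sum(values) / len(values)
```

With only two sequences, as for the Dutch sample, the estimate rests on a single pairwise comparison, which is why such values carry little confidence.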

Genomic signatures of adaptation

To detect regions under potential local adaptation, we compared the Wadden Sea population with a combined French-British population. We combined French and British samples based on the results of the PCA. First, we calculated sliding window values for theta and Tajima's D for each population. Genomic regions of particularly low theta and negative Tajima’s D values in the Wadden Sea population that do not show the same pattern in the French-British population may have undergone recent selective sweeps ⁸⁰. Windows were 50,000 bp long for all calculations, shifting in steps of 10,000 bp. However, as the genome coverage was low and not even throughout the alignment, the number of sites that were available for the calculation of the different statistics varied between sliding windows. We omitted windows with fewer than 10,000 usable sites. Outlier values were determined by linear regression between Tajima’s D values from each population and calculation of Cook’s distance in R (functions ‘lm’ and ‘cooksd’). The same approach was used for theta. Furthermore, we calculated sliding window Fst between the two populations. In such genome scans, high Fst values indicate potential regions that are under local selection. Significant outliers were identified with Rosner's generalized extreme Studentized deviate test ⁸¹, where the p-value was Bonferroni corrected for the total number of Fst values tested. We then identified the genomic scaffolds that contained high Fst, low theta as well as negative Tajima's D values as the best candidate targets of local selection.
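Two steps of this procedure lend themselves to a compact illustration: generating the 50,000 bp windows in 10,000 bp steps, and flagging outliers by Cook's distance in a simple linear regression of one population's values on the other's. The sketch below makes several assumptions that are not stated in the text: only full-length windows are emitted, coordinates are zero-based and half-open, and no particular Cook's distance cut-off is implied. All function names are hypothetical, and the regression is written out by hand instead of calling R.

```python
def sliding_windows(scaffold_length, size=50_000, step=10_000):
    """Yield (start, end) for every full window on a scaffold."""
    for start in range(0, scaffold_length - size + 1, step):
        yield start, start + size


def cooks_distances(x, y):
    """Cook's distance of each point in a simple linear regression y ~ x."""
    n = len(x)
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    residuals = [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]
    n_params = 2  # intercept and slope
    mse = sum(r ** 2 for r in residuals) / (n - n_params)
    if mse == 0:
        return [0.0] * n  # perfect fit: no point is influential
    distances = []
    for xi, r in zip(x, residuals):
        leverage = 1 / n + (xi - x_mean) ** 2 / sxx
        distances.append(r ** 2 / (n_params * mse)
                         * leverage / (1 - leverage) ** 2)
    return distances
```

For example, `cooks_distances` applied to the per-window Tajima's D values of the two populations returns one value per window; windows with unusually large values are the candidates that deviate from the genome-wide relationship between the populations.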

We searched the draft annotation of the Ostrea edulis genome for genes that were located on the genomic regions under potential selection. We extracted the preferred gene names from the .gff file and summarized the gene functions in the Panther web interface ^82,83 into categories relating to their molecular functions, biological processes, pathways and protein classes. Homo sapiens was the reference database to map the preferred names against. We chose H. sapiens because no mollusc species was available as reference, several of our annotations were taken from the human genome, and it is one of the most extensive databases.

Acknowledgements

The authors are indebted to Alexander Flache, Johanna Kanwisch, Rexford Dumevi and Magdalena Haller, who supported us with processing the oyster samples in the lab. This study was financially supported by a grant from the German Federal Ministry of Education and Research (Bundesministerium für Bildung und Forschung, project number 01UQ1711), by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) through the project 390870439 (EXC 2150 ROOTS) to B.K.-K., and by an Evolutionary, Ecological, and Conservation Genomics Research Award of the American Genetics Society granted to C.E.

Author contributions

C.E., B.K.-K. and D.B. designed the study. S.H. and A.I. generated the data. C.E., N.d.S., Z.M., J.S. and M.S.-T. analysed and interpreted the data. C.E. wrote the first draft. All authors revised the first draft substantially. All authors have approved the submitted version. All authors have agreed both to be personally accountable for the author's own contributions and to ensure that questions related to the accuracy or integrity of any part of the work, even ones in which the author was not personally involved, are appropriately investigated, resolved, and the resolution documented in the literature.

Data availability statement

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request. The raw sequences and the assembled, annotated draft reference genome will be deposited upon publication in NCBI SRA and Assemblies, respectively. The genomic data from the historical oyster shells are being analysed for other metagenomic components and will be publicly available upon completion of those analyses.

Competing Interests Statement

We declare no competing interests.

Biosafety Unit. Status and Trends of Global Biodiversity. in Global Biodiversity Outlook (Secretariat of the Convention on Biological Diversity, 2001).
Cardinale, B. J. et al. Biodiversity loss and its impact on humanity. Nature 486, 59–67 (2012).
Dirzo, R. et al. Defaunation in the Anthropocene. Science 345, 401–406 (2014).
Ceballos, G. & Ehrlich, P. R. Mammal population losses and the extinction crisis. Science 296, 904–907 (2002).
Collen, B. et al. Monitoring change in vertebrate abundance: the Living Planet Index. Conserv. Biol. 23, 317–327 (2009).
Collen, B. et al. Predicting how populations decline to extinction. Philos. Trans. R. Soc. B Biol. Sci. 366, 2577–2586 (2011).
Turvey, S. T., Crees, J. J., Li, Z., Bielby, J. & Yuan, J. Long-term archives reveal shifting extinction selectivity in China’s postglacial mammal fauna. Proc. R. Soc. B Biol. Sci. 284, 20171979 (2017).
Wan, X. et al. Historical records reveal the distinctive associations of human disturbance and extreme climate change with local extinction of mammals. Proc. Natl. Acad. Sci. 116, 19001–19008 (2019).
Johannesson, K., Smolarz, K., Grahn, M. & André, C. The Future of Baltic Sea Populations: Local Extinction or Evolutionary Rescue? AMBIO 40, 179–190 (2011).
Orr, H. A. & Unckless, R. L. Population extinction and the genetics of adaptation. Am. Nat. 172, 160–169 (2008).
Harrison, S. Local extinction in a metapopulation context: an empirical evaluation. Biol. J. Linn. Soc. 42, 73–88 (1991).
Robinson, J. A. et al. Genomic signatures of extensive inbreeding in Isle Royale wolves, a population on the threshold of extinction. Sci. Adv. 5, eaau0757 (2019).
Frankham, R. Genetics and extinction. Biol. Conserv. 126, 131–140 (2005).
Newman, D. & Pilson, D. Increased probability of extinction due to decreased genetic effective population size: experimental populations of Clarkia pulchella. Evolution 51, 354–362 (1997).
Burns, P. A., Rowe, K. C., Parrott, M. L. & Roycroft, E. Population genomics of decline and local extinction in the endangered Australian Pookila. Biol. Conserv. 284, 110183 (2023).
Rosche, C. et al. Tracking population genetic signatures of local extinction with herbarium specimens. Ann. Bot. 129, 857–868 (2022).
Delibes-Mateos, M., Smith, A. T., Slobodchikoff, C. N. & Swenson, J. E. The paradox of keystone species persecuted as pests: a call for the conservation of abundant small mammals in their native range. Biol. Conserv. 144, 1335–1346 (2011).
Beck, M. W. et al. Oyster reefs at risk and recommendations for conservation, restoration, and management. Bioscience 61, 107–116 (2011).
Gercken, J. & Schmidt, A. Current status of the European Oyster (Ostrea edulis) and possibilities for restoration in the German North Sea. Neu Broderstorf (2014).
Vera, M. et al. Current genetic status, temporal stability and structure of the remnant wild European flat oyster populations: conservation and restoring implications. Mar. Biol. 163, 239 (2016).
Yonge, C. M. Oysters. vol. 18 (Collins, 1960).
Hayer, S. et al. Phylogeography in an “oyster” shell provides first insights into the genetic structure of an extinct Ostrea edulis population. Sci. Rep. 11, 1–10 (2021).
Diaz-Almela, E., Boudry, P., Launey, S., Bonhomme, F. & Lapegue, S. Reduced female gene flow in the European flat oyster Ostrea edulis. J. Hered. 95, 510–516 (2004).
Gutierrez, A. P. et al. Development of a Medium Density Combined-Species SNP Array for Pacific and European Oysters (Crassostrea gigas and Ostrea edulis). G3 Genes Genomes Genetics 7, 2209 (2017).
Lapègue, S., Reisser, C., Harrang, E., Heurtebise, S. & Bierne, N. Genetic parallelism between European flat oyster populations at the edge of their natural range. Evol. Appl. (2022).
Launey, S., Ledu, C., Boudry, P., Bonhomme, F. & Naciri-Graven, Y. Geographic Structure in the European Flat Oyster (Ostrea edulis L.) as Revealed by Microsatellite Polymorphism. J. Hered. 93, 331–351 (2002).
Saavedra, C., Zapata, C. & Alvarez, G. Geographical patterns of variability at allozyme loci in the European oyster Ostrea edulis. Mar. Biol. 122, 95–104 (1995).
Cameron, S. A. et al. Patterns of widespread decline in North American bumble bees. Proc. Natl. Acad. Sci. 108, 662 (2011).
Cavanaugh, K. C. et al. Climate-driven regime shifts in a mangrove–salt marsh ecotone over the past 250 years. Proc. Natl. Acad. Sci. 201902181 (2019) doi:10.1073/pnas.1902181116.
Hayer, S. et al. Coming and going – Historical distributions of the European oyster Ostrea edulis Linnaeus, 1758 and the introduced slipper limpet Crepidula fornicata Linnaeus, 1758 in the North Sea. PLOS ONE 14, e0224249 (2019).
Möbius, K. A. Die Auster und die Austernwirthschaft. (Parey, 1877).
Bi, K. et al. Unlocking the vault: next-generation museum population genomics. Mol. Ecol. 22, 6018–6032 (2013).
Habel, J. C., Husemann, M., Finger, A., Danley, P. D. & Zachos, F. E. The relevance of time series in molecular ecology and conservation biology. Biol. Rev. 89, 484–492 (2014).
Raxworthy, C. J. & Smith, B. T. Mining museums for historical DNA: advances and challenges in museomics. Trends Ecol. Evol. (2021).
Toussaint, E. F. et al. HyRAD-X exome capture museomics unravels giant ground beetle evolution. Genome Biol. Evol. (2021).
Manni, M., Berkeley, M. R., Seppey, M. & Zdobnov, E. M. BUSCO: assessing genomic data quality and beyond. Curr. Protoc. 1, e323 (2021).
Homburger, J. R. et al. Low coverage whole genome sequencing enables accurate assessment of common variants and calculation of genome-wide polygenic scores. Genome Med. 11, 1–12 (2019).
Li, S. et al. Ultra-low-coverage genome-wide association study—insights into gestational age using 17,844 embryo samples with preimplantation genetic testing. Genome Med. 15, 10 (2023).
Lou, R. N., Jacobs, A., Wilder, A. P. & Therkildsen, N. O. A beginner’s guide to low‐coverage whole genome sequencing for population genomics. Mol. Ecol. 30, 5966–5993 (2021).
Meisner, J., Albrechtsen, A. & Hanghøj, K. Detecting selection in low-coverage high-throughput sequencing data using principal component analysis. BMC Bioinformatics 22, 1–13 (2021).
Brown, J., Hill, A., Fernand, L. & Horsburgh, K. Observations of a seasonal jet-like circulation at the central North Sea cold pool margin. Estuar. Coast. Shelf Sci. 48, 343–355 (1999).
Geburzi, J. C. et al. An environmental gradient dominates ecological and genetic differentiation of marine invertebrates between the North and Baltic Sea. Ecol. Evol. 12, e8868 (2022).
Luttikhuizen, P. C., Drent, J. & Baker, A. J. Disjunct distribution of highly diverged mitochondrial lineage clade and population subdivision in a marine bivalve with pelagic larval dispersal. Mol. Ecol. 12, 2215–2229 (2003).
Maggs, C. A. et al. Evaluating signatures of glacial refugia for North Atlantic benthic marine taxa. Ecology 89, S108–S122 (2008).
Roman, J. O. E. & Palumbi, S. R. A global invader at home: population structure of the green crab, Carcinus maenas, in Europe. Mol. Ecol. 13, 2891–2898 (2004).
Tarnowska, K. et al. Comparative phylogeography of two sister (congeneric) species of cardiid bivalves: Strong influence of habitat, life history and post-glacial history. Estuar. Coast. Shelf Sci. 107, 150–158 (2012).
Bromley, C., McGonigle, C., Ashton, E. C. & Roberts, D. Bad moves: Pros and cons of moving oysters – A case study of global translocations of Ostrea edulis Linnaeus, 1758 (Mollusca: Bivalvia). Ocean Coast. Manag. 122, 103–115 (2016).
Neudecker, T. Zur Qualität von Austern aus der Flensburger Förde. Informationen Für Fischwirtsch. 26, 142–143 (1979).
Neudecker, T. Genutzte Muscheln und Schnecken (Exploited Bivalves and Snails). in Warnsignale aus der Nordsee 431 (Paul Parey, 1990).
Möbius, K. A. Ueber Austern-und Miesmuschelzucht und die Hebung derselben an den norddeutschen Küsten. (Verlag von Wiegandt und Hempel, 1870).
Eyton, T. C. A history of the oyster and the oyster fisheries. (John van Voorst, Paternoster Row, 1858).
Adey, W. H. & Steneck, R. S. Thermogeography over time creates biogeographic regions: a temperature/space/time‐integrated model and an abundance‐weighted test for benthic marine algae. J. Phycol. 37, 677–698 (2001).
Maes, G. & Volckaert, F. Clinal genetic variation and isolation by distance in the European eel Anguilla anguilla (L.). Biol. J. Linn. Soc. 77, 509–521 (2002).
Garner, A., Rachlow, J. L. & Hicks, J. F. Patterns of genetic diversity and its loss in mammalian populations. Conserv. Biol. 19, 1215–1221 (2005).
Hagmeier, A. Vorläufiger Bericht über die vorbereitenden Untersuchungen der Bodenfauna der Deutschen Bucht mit dem Petersen-Bodengreifer. Ber Dt Wiss Kommn Meeresforsch 1, (1925).
Meyerjürgens, J., Badewien, T. H., Garaba, S. P., Wolff, J.-O. & Zielinski, O. A state-of-the-art compact surface drifter reveals pathways of floating marine litter in the German bight. Front. Mar. Sci. 6, 58 (2019).
Otto, L. et al. Review of the physical oceanography of the North Sea. Neth. J. Sea Res. 26, 161–238 (1990).
Bouma, S. & Lengkeek, W. Benthic communities on hard substrates of the offshore wind farm. Rep Bur Waardenbg Bv Noordzeewind 84, (2012).
DONG Energy, A., Vattenfall, The Danish Energy Authority & The Danish Forest and Nature Agency. Danish offshore wind-key environmental issues. https://tethys.pnnl.gov/sites/default/files/publications/Danish_Offshore_Wind_Key_Environmental_Issues.pdf (2006).
Zheng, G. X. et al. Haplotyping germline and cancer genomes with high-throughput linked-read sequencing. Nat. Biotechnol. 34, 303–311 (2016).
Weisenfeld, N. I., Kumar, V., Shah, P., Church, D. M. & Jaffe, D. B. Direct determination of diploid genome sequences. Genome Res. 27, 757–767 (2017).
Di Tommaso, P. et al. Nextflow enables reproducible computational workflows. Nat. Biotechnol. 35, 316–319 (2017).
Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. in Current protocols in bioinformatics (2004).
Smit, A., Hubley, R. & Green, P. RepeatMasker Open-4.0. (2013).
Cunningham, F. et al. Ensembl 2022. Nucleic Acids Res. 50, D988–D995 (2022).
Gotoh, O. Direct mapping and alignment of protein sequences onto genomic sequence. Bioinformatics 24, 2438–2444 (2008).
UniProt Consortium. UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res. 49, D480–D489 (2021).
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Stanke, M. et al. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res 34, (2006).
Stanke, M. & Morgenstern, B. AUGUSTUS: a web server for gene prediction in eukaryotes that allows user-defined constraints. Nucleic Acids Res 33, (2005).
Haas, B. J. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol. 9, 1–22 (2008).
Peltzer, A. et al. EAGER: efficient ancient genome reconstruction. Genome Biol. 17, 1–14 (2016).
Korneliussen, T. S., Albrechtsen, A. & Nielsen, R. ANGSD: Analysis of Next Generation Sequencing Data. BMC Bioinformatics 15, 356 (2014).
R Core Team. R: A Language and Environment for Statistical Computing. (2019).
Wickham, H. The split-apply-combine strategy for data analysis. J. Stat. Softw. 40, 1–29 (2011).
Skotte, L., Korneliussen, T. S. & Albrechtsen, A. Estimating individual admixture proportions from next generation sequencing data. Genetics 195, 693–702 (2013).
Evanno, G., Regnaut, S. & Goudet, J. Detecting the number of clusters of individuals using the software structure: a simulation study. Mol. Ecol. 14, 2611–2620 (2005).
Kearse, M. et al. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data (version 8.0.3). Bioinformatics 28, 1647–1649 (2012).
Paradis, E., Claude, J. & Strimmer, K. APE: analyses of phylogenetics and evolution in R language. Bioinformatics 20, 289–290 (2004).
Tajima, F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123, 585–595 (1989).
Rosner, B. On the detection of many outliers. Technometrics 17, 221–227 (1975).
Mi, H., Muruganujan, A. & Thomas, P. D. PANTHER in 2013: modeling the evolution of gene function, and other gene attributes, in the context of phylogenetic trees. Nucleic Acids Res. 41, D377–D386 (2012).
Thomas, P. D. et al. PANTHER: Making genome‐scale phylogenetics accessible to all. Protein Sci. 31, 8–22 (2022).

Supplementarymaterial.pdf

Editorial decision: Revision requested (18 Feb, 2024)
Reviews received at journal (14 Feb, 2024)
Reviewers agreed at journal (06 Feb, 2024)
Reviewers agreed at journal (05 Feb, 2024)
Reviewers invited by journal (05 Feb, 2024)
Editor assigned by journal (05 Feb, 2024)
Editor invited by journal (04 Feb, 2024)
Submission checks completed at journal (04 Feb, 2024)
First submitted to journal (17 Jan, 2024)
