In this study, we used DNA extraction and hybridization capture enrichment techniques to recover aDNA from medieval millet grains found in Areni-1, a cave site located in the Southern Caucasus a region that has an important history in the domestication and dispersal of many crops. Despite variable quality of aDNA, we were able to recover three chloroplast genomes from five Areni-1 grains assayed.
To date, excavations at Areni-1 have only produced millet grains from the medieval occupation. The Chalcolithic occupation of the cave occurred prior to the introduction of millet into the Southern Caucasus in the 2nd millennium BCE 23,36, so grains would not be expected to be present in excavated material from this period. As for the Bronze and Iron Ages, the occupation of Areni-1 during these times was brief and did not yield substantial plant deposits. The medieval occupants of Areni-1utilized a wide range of plants as food including various fruits (apricots, plums, and grapes) and cereals (wheat, barley, and sorghum) 23. Inclusion of millet in the suite of plants grown may have been a crop rotation strategy that the medieval occupants of Areni-1 adopted to maximize the productivity of agricultural land. Drought resistant millet may have been grown in the arid summers experienced in the Southern Caucasus, while wheat and barley could have been grown in the cooler/wetter seasons of the year 2. Millet may have been grown as food and fodder, but also used to produce alcoholic beverages. In his travel log of the 4th century BCE, the Greek mercenary and historian, Xenophon of Athens, described a wine made by the local inhabitants of the Armenian highlands which was strengthened with barley 37 and may have also contained millet.
Despite a similar well-preserved physical appearance, the endogenous DNA content varied among the Areni-1 grains used in this study as did the quality of the aDNA. For example, in the millet enriched for cpDNA, grain 11295a contained the highest level of endogenous DNA and the longest read length, while grain 11294a contained the least amount of endogenous DNA with the shortest read length (Table 2, Fig. 3a). The environment of the cave appears to be suited to prevent decomposition of biological material and macromolecules as excavations at Areni-1 have unearthed vertebrate soft tissues including a mummified goat 38 and brain tissue in a human skull 22, demonstrating that the conditions in the cave can preserve ephemeral biological material. It is not clear why the Areni-1 millet exhibited different levels of DNA preservation. Grains with poor preservation may have undergone different harvesting, threshing, or other anthropogenic treatments that influenced DNA stability. Post excavation factors, such as storage or handling conditions, may have also influenced the different levels of DNA preservation. Further, the micro-environment has been shown to impact the survival of DNA in mammalian bones39 and similar processes may have influenced the decay of aDNA in the Areni-1 millet grains. Local variations in pH, ion concentration, moisture, or other abiotic factors could have effected DNA stability in the Areni-1 millet 40. Variable preservation of aDNA from grains is not unique to the Areni-1 millet as a study of 6,000 year old barley found large differences in recoverable endogenous DNA in grains excavated from the same occupation layer of a cave 41. Regardless of the varying integrity of the aDNA, the recovery of chloroplast genomes from the Areni-1 samples demonstrates that broomcorn millet grains can, under the right conditions, preserve DNA for at least a 1,000 years. Even with a poorly preserved sample (11294a), hybridization capture enrichment was able to recover sufficient cpDNA to perform a chloroplast phylogenetic and haplotype analysis.
In the Paniceae grass chloroplast phylogeny, the Areni-1 and broomcorn millet reference clustered with several species native to North America (Panicum virgatum, switchgrass; Panicum capillare, witchgrass) and Whiteochloa capillipes, which is from Australia, New Guinea, and Indonesia. Panicum sumatrense (little millet), a cereal crop originally from India also clustered with the Areni-1/broomcorn millet. Panicum is the largest genera of the grass family Paniceae and has a cosmopolitan distribution so the clustering of the Areni-1/broomcorn millet with widely distributed species is not unusual 42. While it is unlikely that the specimens from Areni-1 are directly related to these species, it is possible that some of them share a common ancestor from Southeast Asia, further supporting the east to west introduction of P. miliaceum in Eurasia. It should be noted that the chloroplast phylogeny indicates a close relationship between broomcorn millet and P. capillare (bootstrap support = 100) an observation that supports a previous study of Panicum nuclear DNA, which found that diploid P. capillare or a close relative is the likely maternal genome donor to tetraploid broomcorn millet 43.
In an attempt to investigate the demographic history of broomcorn millet, chloroplast phylogenetic and haplotype analyses were performed using the Areni-1 samples with modern accessions of broomcorn millet. In these analyses only 63 SNPs were identified in a group that included 48 modern domesticated accessions, two wild accessions, and the three medieval millet samples from Areni-1 cave. Accordingly, there was little differences between these millet genomes which was reflected in the phylogenetic and haplotype analyses. In the phylogeny there were low bootstrap support values through much of the tree. In the haplotype network, the vast majority of the haplotypes were separated by 1 to 5 SNPs, which did not allow for any conclusions to be drawn on the demographics of broomcorn millet (Fig. 5a and 5b). The inability of chloroplast genomes to resolve intraspecies relationships is not unique to the current study. Research on the genera Amaranthus and Euphrasia found that whole chloroplast genomes were highly conserved and lacked the power to discriminate between some intraspecies accessions 44,45.
These chloroplast analyses did allow other observations to be made about the phylogenetics of broomcorn millet. In the chloroplast phylogeny, the domesticated accessions were split into two clades with no geographical structure and each clade containing accessions from locations throughout Eurasia. These results are similar to a study by Hunt et. al. 201146 which used 16 microsatellite loci across 98 broomcorn landraces and divided modern broomcorn millet into two clades, one composed solely of Chinese accessions and the other clade contained accessions from throughout Eurasia. It is not clear why the chloroplast phylogeny in the current study lacked a clade of solely Chinese accessions as observed by Hunt et. al. 2011. Previous studies have reported incongruent chloroplast and nuclear phylogenies and have attributed these differences to incomplete lineage sorting and introgression/hybridization 47. Similar processes may be involved with the discrepancies between the broomcorn nuclear and chloroplast genomes, however the low levels of diversity in the chloroplast genomes are likely the main contributing factor for these differences. The broomcorn chloroplast genomes contained sufficient information for gross scale separation of the different accessions but was insufficient to accurately resolve fine scale relationships.
In the haplotype analysis, the wild broomcorn millet contained a relatively high number of variants, 11 SNPs, that were not present in the domesticated accessions. Several scenarios that are not mutually exclusive could explain this observation. One possibility is that the wild accessions sampled may not be directly related to the broomcorn lineages that were domesticated. Another possible scenario is the SNPs observed in the wild accessions may represent diversity lost in the chloroplast genome during a domestication bottle neck. Such a loss of diversity has been reported in a study by Leigh et. al. 2013 48, which described a reduction in both alleles and haplotypes in the chloroplast genomes of different wheat species after domestication. The current study would not be able to capture pre-domestication diversity because the domestication process can happen relatively quickly as exemplified by barley. Although barley was domesticated approximately 10,000 years ago, the nuclear genomes from 6,000 year old barley grains from Israel show striking similarity to the genomes of modern local landraces 41, indicating the domestication process for barley was largely finished after 4,000 years. If following a similar timeline, the domestication of broomcorn millet would have been completed well before the medieval era and it would not be expected for the Areni-1 millet to contain significant levels of pre-domestication diversity. Resolving the reasons for the higher diversity in the wild accessions of the current study will require additional sampling of undomesticated and ancient chloroplast genomes.
It is possible that mapping of promiscuous DNA that has broken off the chloroplast genome and become incorporated in either the nuclear or mitochondrial genome has influenced the Areni-1 chloroplast results 49,50. Effort was made to minimize the impact of promiscuous cpDNA on the Areni-1 results by requiring a relatively high read depth (≥ 10) and at least 95% of the reads to call a mutation as an alternative allele. However, as it will be difficult to distinguish between promiscuous cpDNA and DNA from the chloroplast, proving that promiscuous cpDNA has not influenced the current results is extremely challenging. Regardless, the agreement of the visual identification of the Areni-1 millet as broomcorn millet and the placement of the Areni-1 millet with broomcorn millet in the Paniceae chloroplast phylogeny (Fig. 4) indicate that promiscuous cpDNA did not have a significant impact on the interspecies analysis.
In conclusion, here we generated 53 broomcorn millet chloroplast genomes, which includes three dating to the medieval era that can be used as resources to study this cereal crop and other C4 plants. A phylogeny generated from these chloroplast genomes divided the domesticated forms of broomcorn millet into two clades, an observation that parallels reports for using nuclear DNA. The wild accessions sampled in this study contained SNPs that were outside the diversity of the domesticated forms of broomcorn millet and the significance of these differences will require further sampling of chloroplast genomes to resolve. Lastly, this study demonstrates that broomcorn millet grains can preserve significant levels of DNA for hundreds if not thousands of years and can serve as an important genetic resource to investigate the evolution and domestication of this cereal.