Common vetch (Vicia sativa subsp. sativa) is a self-pollinated, annual and diploid leguminous forage that can adapt to different soil and climate and fix atmospheric nitrogen through its symbiotic relationship with rhizobia to improve soil structure (Chooi 1971; Chung et al. 2013b). As an important forage legume crop, common vetch contains high level of protein, starch, and oil and has been commonly used as green manure, pasture, silage, and hay (Chung et al. 2013a; Mirzapour et al. 2013). In addition to these qualities, the common vetch also has been widely used as good quality animal feedstock and health-promoting foods for human consumption (Uzun et al. 2011). In Turkey, Australia, New Zealand, China and other regions of the world, the common vetch is widely planted and used for agricultural production (Camas and Esendal 2006; Liu et al. 2014).
The population structure and genetic diversity of germplasm are considered critical factors for the discovery of new germplasm characteristics to develop and utilize germplasm resources for plant improvement (Istvanek et al. 2017). At the same time, the studies of genetic diversity can mine new gene resources, improve existing breeding materials and reflect the breeding level of cultivated species (Gowda et al. 2013). DNA-based molecular markers, which are multi-allelic and locus-specific, have applications in marker-assisted selection breeding and genetic diversity studies. In the last decade, the development and use of DNA-based molecular markers have increased remarkably. The variety of molecular marker techniques can be divided into two types: one type is non-polymerase chain reaction (PCR)-based markers (RFLP, restriction fragment length polymorphism) (Williams et al. 1990) and the other is PCR-based markers includes simple sequence repeat polymorphism (SSR) (Tautz and Renz 1984), random amplified polymorphic DNA (RAPD) (Welsh and Mcclelland 1990), amplified fragment length polymorphism (AFLP) (Vos et al. 1995) and single nucleotide polymorphisms (SNP). These markers have been widely applied in cultivar identification, evolution, linkage mapping, QTL mapping, and comparative genomics for various crop plants (Yang et al. 2015).
The majority of eukaryotic genes have been found to possess abundant, variable and widespread introns (Deutsch and Long 1999). As non-coding regions of a gene, introns are transcribed into mRNA but not translated since they are spliced out during pre-mRNA processing. Introns are less conserved than exon regions and accumulated a larger number of mutations, which can be exploited as genetic markers (Presgraves 2006) such as length and SNPs (Wang et al. 2014). Among the polymorphisms, intron length polymorphism (ILP) is an easily recognizable type due to its easy detection by the PCR method, namely, exon-primed intron-crossing PCR (EPIC-PCR) where primers are designed in exonic regions flanking the introns. Besides,
Compared with other previously reported DNA markers, ILP makers have a lot of advantages such as neutral, co-dominant,convenient, hypervariable, reliable and exhibiting high transferability rates between plant species (Yang et al. 2007). Due to the exon sequences are relatively more conserved, markers developed by this approach may be useful for more extensive applications than those designed in non-coding sequences. Until now, ILP molecular markers have been developed only in a few plants with genome sequences released. (Braglia et al. 2010; Chen et al. 2011; Galasso et al. 2011; Poczai et al. 2010; Shang et al. 2009; Tamura et al. 2012; Wei et al. 2015; Xia et al. 2012), such as rice (Wang et al. 2005), soybean (Shu et al. 2010), Dasypyrum villosum (Zhang et al. 2017a), and Daucus carota (Stelmach et al. 2017). For most species lacking genome-wide data, the characteristics of the exon-intron structure in homologous genes of different species can be used to infer the intron position of cDNA/EST based on homologous genes from related model organisms (Yang et al. 2007). Recently, with the rapid development of high-throughput transcriptome sequencing, collecting large numbers of nucleotide sequencing reads at the transcription level and having thus made the development of ILP markers more cost-effective and easier (Dong et al. 2017; Liu et al. 2014). For example, 502 ILP markers were successfully developed in Medicago sativa based on the alfalfa unigene sequences (Zhang et al. 2017b).
Due to the lack of genome-wide data, the current research on the relationship between species and genetic diversity of common vetch is concentrated on the morphological level (van de Wouw et al. 2003), and the related research at the molecular level remains limited. With the development of high-throughput transcriptome sequencing, using transcriptome data to develop the molecular markers has been made possible in common vetch. In the work described here, we took a large-scale search and developed a set of ILP markers based on the available transcriptome data of common vetch, and the genetic diversity assessments and potential for cross-species transferability of these markers in common vetch accessions were further analyzed.