Information about the genetic diversity of PGRs, which provide useful alleles associated with plant development and improvement, is very important for both the conservation and the utilization of germplasm that has been collected in a genebank34,35. With the development of molecular biology, DNA molecular marker technology provides useful information for the analysis of genetic diversity, genetic relationships, population structure, and core collections in the germplasm of many crop species4,9,16,21,22,23,24.
In the case of Perilla crop, much analysis has been performed on genetic diversity, genetic relationships, and population structure using amplified fragment length polymorphisms (AFLP)29,30, random amplification of polymorphic DNAs (RAPD)26,37, and SSR markers38,39,40,41. Unfortunately, and in contrast with other major crop species, other molecular marker technologies in Perilla species have not yet been developed. Among these marker systems, as already explained in the Introduction, SSR marker technology is highly polymorphic and reproducible, generally co-dominant and abundant in the plant genome, and it has provided useful information for the analysis of genetic diversity, genetic relationships, population structure, etc. in the germplasm of Perilla species38,42,43,44,45. Recently, SSR primer sets have been developed for Perilla crop by many researchers38,44,45,46,47 and used successfully for the analysis of genetic diversity, genetic relationships, population structure, and association mapping among the accessions of cultivated and weedy types of Perilla crop28,39,40,41,44,48,49,50,51. Therefore, this study used SSR markers to identify genetic diversity and relationships, population structure, and a core collection of RDA-Genebank Perilla germplasm. The GD and PIC values determined in this study of 0.567 and 0.522, respectively (Table 1), were compared with those of previous studies of Park et al.40 and Ma et al.41 that contained more weedy Perilla accessions and showed values of 0.577 and 0.625 for GD and 0.537 and 0.582 for PIC with 21 and 25 SSR markers, respectively. These findings reveal a lower level of genetic diversity in the collection of this study, which mainly consisted of cultivated P. frutescens var. frutescens. Although wild species have not yet been found in Perilla crop, many accessions of the weedy type of Perilla crop have been reported in East Asia, particularly in South Korea and China, and they show higher genetic diversity than accessions of the cultivated type of Perilla crop25,26,29,30,39,40,41,44,49.
Meanwhile, polymorphism of loci can be considered high, medium, or low with GDI > 0.5, GDI < 0.5 and > 0.25, or GDI < 0.25, respectively, according to a report by Vaiman et al.52. The population in the current study consisting of the 400 accessions of cultivated P. frutescens var. frutescens has average GD and PIC values of over 0.5 in the 22 SSR markers, indicating that this population has a relatively high genetic diversity (Table 1). Moreover, the 14 (based on GD) and 12 (based on PIC) SSR markers among the entire SSR markers showed a high level of polymorphism based on GDI (each > 0.5) (Table 1). The SSR markers named KNUPF used in this study were recently developed by our previous studies for Perilla crop44,45,47. Although only cultivated P. frutescens var. frutescens of Perilla crop, which has relatively lower diversity, was used as material in this study, many SSR markers showed relatively high GDI29,39,40,41,50,51. Therefore, these SSR markers were considered useful for identifying genetic diversity and population structure and for selecting a core collection for accessions of Perilla crop. Furthermore, these SSR markers will be very useful for genome-wide association study (GWAS) or quantitative traits loci (QTL) analysis, because these markers were developed by using the results of transcriptome analysis for Perilla crop44,45,46,52,53.
In East Asia, although China is considered the origin of Perilla crop, South Korea is assumed to be the secondary center of biodiversity of Perilla crop because of extensive cultivation, various uses, and high morphological and genetic diversity as well as the existence of weedy types25,26,29,30,49. Recently, cultivated P. frutescens var. frutescens of Perilla crop has become a cash crop in South Korea, and the cultivation area has expanded significantly. To maximize the use of genetic resources of cultivated P. frutescens var. frutescens of Perilla crop preserved in the RDA-Genebank, the genetic characteristics of the collected resources should be analyzed for efficient conservation and utilization in South Korea. Therefore, this study compared the average allele numbers and GDI values among the central (Group I, 148 accessions) and southern region (Group II, 211 accessions) accessions of South Korea and foreign or unknown (Group III, 41 accessions) accessions (Table 2). The highest allele number was revealed in Group II, followed by Group I and Group III, while the highest GDI values were confirmed in Group III, followed by Group I and Group II (Table 2). Although Group II had the highest allele number, genetic diversity for Group II was lower than that of Group I and II. This result suggests that Group II has the highest number of accessions with more Group II-specific alleles, while this group consists of more accessions with similar genetic characteristics than the other two groups. Moreover, when comparing the South Korea (Group I and II) and foreign accessions (Group III), the South Korea accessions show lower genetic diversity than the foreign accessions (Table 2). This result indicates that the cultivated type of var. frutescens of South Korea has a narrower genetic diversity than the foreign accessions, even though South Korea is the secondary center of Perilla crop. It may be that the environmental variation in the central and southern regions of South Korea is not severe and that farmers for Perilla crop want uniform properties for cultivated type of var. frutescens, such as green leaves and seeds with higher oil yield. However, in South Korea, many accessions of weedy type of var. frutescens with high genetic and phenotypic diversity were found throughout the region25,29,30,39.
Although numerous PGRs are currently conserved in genebanks around the world, the large amount of PGRs makes their accessibility and application difficult3,8. Moreover, management of these PGRs requires significant effort and expense. It is essential to select and manage a core collection, which can represent the entire collection. The evaluation of genetic distance or population structure among genotypes helps in the selection of parental combinations for generating new segregating populations, which preserves genetic diversity in breeding programs54. The identification of the genetic relationships and population structures of an entire collection may provide useful information for core collection selection and the management of PGRs. In this study, to understand the genetic relationships and population structure of 400 accessions of cultivated type of var. frutescens from central and southern regions of South Korea, we used two different methods: a model-based approach with STRUCTURE and a distance-based approach with a UPGMA dendrogram (Fig. 3, Figure S1). The STRUCTURE results revealed that the 400 accessions of cultivated var. frutescens could be divided into two major groups and an admixed group at K = 2 (Figs. 3, 4), while the UPGMA dendrogram results showed that the 400 accessions of cultivated var. frutescens were divided into ten major groups with 45.7% of genetic similarity. As mentioned above in the Results section, there was no clear geographical classification by STRUCTURE and UPGMA analysis among the 400 accessions of cultivated var. frutescens from the central and southern regions of South Korea and foreign regions (Fig. 3, Figures. S1, S2). Xie et al.55 mentioned that population structure and genetic relationship patterns of many accessions are affected by many factors, such as gene flow, selection by environment or human, and breeding systems. Perilla crop has a long history of cultivation in East Asia including in South Korea. In South Korea, because many native landraces of Perilla crop are still widespread, these seeds might be frequently exchanged between diverse regions by farmers or animals and birds, as previously reported by Lee et al.29,30.
Meanwhile, because of failure to select a core set by model and distance-based methods, this study utilized PowerCore to construct a core collection with maximum genetic diversity from the entire initial collection and with a minimal number of germplasm resources. In particular, this study used molecular data rather than phenotypic data to construct the core collection. This is because molecular data using molecular markers is more accurate for ensuring genetic diversity of the initial collection and preventing missing data or environmental interactions that typically exist in phenotypic data56. PowerCore in this study captured 100% of the alleles with a sampling percentage of 11%, based on 22 SSR markers throughout the collection (Table 4). The percentage of selected samples identified in this study was similar to the percentage (~ 10%) proposed by Brown57, while lower than the suggested percentage (20 ~ 30%) by Yonezawa et al.7. Sh.W. and Nei diversity indices were used for validation of the core collection, and the averages of the Sh.W. index (1.306) and Nei index (0.639) of the Perilla core collection were higher than those of the entire initial collection (Sh.W. = 1.059, Nei = 0.569), indicating increased genetic diversity of the core collection. This may be because of the removal of genetic redundancy in the core collection compared with the entire initial collection. It is obvious that this core collection is an exact representation of the diversity of the entire collection (Fig. 4, Table. 4). Although some accessions were absent and only a small number of traits were investigated in the PCA analysis, the accessions of the core collection selected by SSR markers were well reflected in three clusters based on the first axis of the PCA scatter plot by six agronomic traits (Fig. 5). In detail, the morphological analysis revealed all types of color of leaf surface (QL1) and stem color (QL3) in the core collection. Although no accession with light purple for color of reverse side leaf (QL2) was included in the core collection, the remaining three types were contained in the core collection (Table S1, Table 3). All three types of leaf shape (QL4) were included amongst the 372 accessions in the core collection. Moreover, all four types of degree of pubescence (QL5) were included in the 372 accessions of the core collection. In the case of flowering time (QL6), there were no accessions for early flowering type in the morphological evaluation of the 372 accessions, but the intermediate and late flowering types were contained in the core collection (Table 3).
This study constructed the first core collection of Korean Perilla accessions and maintained allelic richness. It can be considered as germplasm for identifying useful genes for important agricultural traits. Further modification of the core collection is expected by the continuous addition of new Perilla accessions, such as accessions of the two cultivated types of Perilla crop and their weedy types. Further analysis of phenotypic and agronomic traits for the core collection is necessary to provide more valuable information for the development and utilization of Perilla accessions in breeding programs.