Assessment of genetic diversity by GBS
We genotyped 199 Triticosecale spp. cultivars from the National Plant Germplasm System of America and our own collections using the GBS method. Triticale usually has an unstable ploidy. We therefore developed a method to analyze ploidy levels using reads mapped to chromosomes A, B, D, and R. The 199 accessions could be easily classified into five ploidy species based on reads coverage: 22 octaploid triticale (AABBDDRR), 153 hexaploid triticale (AABBRR), 20 hexaploid wheat (AABBDD), three tetraploid wheat (AABB), and one rye (RR) (Fig. 1a-e; Supplementary Table S1). Cytological analyses further supported the results (Fig. S1). The read mapping also revealed the deletion and addition of whole chromosomes or chromosome segments. The whole 2R chromosome was frequently lost, which we found in eight cultivars of octaploid triticale and 38 cultivars of hexaploid triticale (Supplementary Table S1). Chromosome 2D was added in 38 hexaploid triticales, with deletion of the 2R chromosome except in one cultivar (Supplementary Table S1). The 2D chromosome was also added in five hexaploid triticale cultivars, without deletion of chromosome 2R (Supplementary Table S1). Chromosome 6A was replaced by chromosome 6D in two hexaploid triticales (Supplementary Table S1). Although 20 cultivars were annotated as hexaploid wheat, one cultivar carried an additional chromosome 2R and one cultivar had chromosomes 6D and 1D replaced by 6R and 1R, indicating that these two cultivars were derived from octaploid triticale (Supplementary Table S1).
Some chromosome segments were also abnormal in these cultivars. For example, the short arm of 5R was partially lost in seven cultivars of octaploid triticale, with a similar segment length in all cultivars (Fig. S2). Fourteen cultivars lost part of the long arm of chromosome 2D, and the lost segment was shorter than the 5RS chromosome (Fig. 1f). The length of the deleted segments was different in the 14 cultivars, which could be divided into six types: two octaploid triticale, seven hexaploid triticale, and five hexaploid wheat (Fig. 1f; Supplementary Table S1). Obvious chromosomal translocations were observed on the long arm of 7D/7A, the short arm of 6D/6A, and the long arm of 1D/1A in three, two, and two cultivars, respectively (Fig. S3). Moreover, four events of segment insertion, and three events of segment deletion occurred in only one cultivar (Fig. S4). The location of the segment insertion remains unknown.
Single nucleotide polymorphism and phenotypic variation of hexaploid triticales
Because hexaploid triticale is also the main cultivated type (Lukaszewski and Gustafson 1987; Cheng and Murata 2002; Fox et al. 1990), we chose 153 hexaploid triticale cultivars for further SNP calling, phenotype evaluation, and GWAS analysis. A total of 434,304 reliable SNP markers were obtained in 153 hexaploid triticale cultivars. These SNPs were not evenly distributed among sub-genomes A, B, and R and 28.13 SNPs/Mb were detected for the entire genome (Fig. S5; Supplementary Table S2). Sub-genome B harbored the most SNPs (212,218) with the highest SNP density (24.56 kb/SNP), followed by A (187,518, 27.92 kb/SNP) and R (34,568, 3001.60 kb/SNP) (Supplementary Table S2). The ratio of the number of SNPs in the B or A genome to those in the R genome was 6.14 and 5.42, respectively. Sub-genome R harbored the least SNPs and the lowest SNP density compared with A and B, and the number of SNPs in the A genome was only slightly lower than that in the B genome. Chromosome 7A had the most SNPs (36,228) with the highest SNP density (20.32 kb/SNP), and 2R had the least SNPs (44) with the lowest SNP density (20015.53 kb/SNP) (Supplementary Table S2). To further reveal the genetic relationships of the genotypes, we estimated the optimal number of genetic clusters determined (K), ranging from one to 20 (Fig. 2a). The cross-validation (CV) error was minimized at k = 9 (Fig. 2a). Consistent with the neighbor-joining clustering result (Fig. 2b), the 153 hexaploid triticale genotypes could be classified into nine genetically distinct population groups (POP I–IV) (Fig. 2b, 3c). The average decay distance of linkage disequilibrium (LD) was about 25.4 kb at the threshold of r2 = 0.45 across all chromosomes (Fig. 2d).
Phenotypes of 153 hexaploid triticale were investigated in three environments to analyze the distribution, variation, and correlation of 10 agronomic traits. All agronomic traits showed a normal distribution (Fig. S6). The maximum values of PH, SL, uppermost internode length (UIL), SNS, GNS, SSP, TGW, grain area (GA), grain width (GW), and grain length (GL) were 200.60 cm, 15.72 cm, 45.50 cm, 31.52, 86.82, 82.47%, 57.95 g, 28.87 mm2, 3.70 mm, and 9.58 cm, respectively, while the minimum values were 95.97 cm, 8.43 cm, 16.10 cm, 16.32, 25.05, 46.68%, 27.27 g, 17.70 mm2, 2.66 mm, and 6.86 cm, respectively (Supplementary Table S3). The maximum values of all agronomic traits except for GA, GW, and GL were more than twice those of the matching minimum values (Supplementary Table S3). The GNS showed the highest coefficient of variation (CV) (20.02%), while GW had the lowest CV (5.80%) (Supplementary Table S3). Broad-sense heritability (H2) was calculated for the 10 agronomic traits, all traits showed high H2 except for SSP (H2 = 0.37) and UIL (H2 = 0.47) (Supplementary Table S3). This suggested that genetic components caused the variation in these accessions.
GWAS of agronomic traits in hexaploid triticale
GWAS is an effective method that uses natural populations to identify loci controlling complex traits. Grain yield is the most important index for triticale breeding, and the 10 agronomic traits observed in this research are closely associated with grain yield. In association analysis, we detected 253 marker-trait associations (MTAs) in three environments using the FarmCPU model at a significance value of -log10(P) > 6.90 (Supplementary Table S4). The number of MTAs for PH, SL, UIL, SNS, GNS, SSP, TGW, GA, GW, and GL were 85, 26, 9, 52, 6, 8, 16, 2, 2, and 47, respectively. The number of MTAs reflects the complexity of the traits to a certain extent. These MTAs were distributed across 20 chromosomes, except for 7R, and explained 0.09%-31.46% of the phenotypic variance. Of the MTAs, 105 were located on sub-genome A, 93 on sub-genome B, and 55 on sub-genome R. Chromosome 2A had the most MTAs (30 MTAs), while the least MTAs were present on chromosome 5R (five MTAs). We identified 21 reliable MTAs over two environments and three reliable MTAs over three environments (Fig. 3). Three loci were pleiotropic. 3Brs508522077 was associated with SL and SNS, 6Ars3762529 was associated with PH and UIL, and 1rs122212247 was associated with SNS and PH (Supplementary Table S4).
High quality wheat and rye references are useful resources for identification of candidate genes that underlie marker loci associated with agronomic traits in hexaploid triticale. We predicted genes associated significantly with MTAs for the annotation of these MTAs, and the functions of the predicted genes were annotated using a BLASTALL search. A total of 205 genes associated with 94 significant markers were predicted in the wheat and rye genomes (Supplementary Table S4). The functions of 16 genes were associated with the related traits (Table 1). Five genes were related to PH, and these encoded CASP-like protein, methyl jasmonate-inducible lipoxygenase 2, sugar transporter SWEET14-like, class I glutamine amidotransferase, and auxin transport protein BIG. The sugar transporter SWEET14-like was also associated with UIL. One stable MTA controlling SL was linked to transcripts TraesCS7A02G058900.1 and TraesCS7A02G059000.1, which were annotated as brassinosteroid LRR receptor kinase (Chono et al. 2003; Dockter et al. 2014) and DELLA protein DWARF8-like (Cassani et al. 2009), respectively. Two stable MTAs for SSP corresponded to transcripts TraesCS3B02G286500.1 and TraesCS7A02G301900.1, annotated as myb-related protein Hv33-like and DNA-directed RNA polymerase II subunit RPB1, respectively (Li et al. 2019; Li et al. 2020). The gene encoding GS3/putative transmembrane protein is a major gene for grain length and weight in rice (Fan et al. 2006). TraesCS4B02G196100.1 associated with TGW was annotated as a transmembrane protein and was homologous to the gene responsible for grain length and weight in rice (Fan et al. 2006). Ribosomal protein S6 kinase S6K plays an important role in cell proliferation (Henriques et al. 2013), and genes underlying cell proliferation can effectively increase grain size and yield (Brinton et al. 2017; Wang et al. 2012). TraesCS3A02G162200.1 and TraesCS3A02G162300.1 associated with GA encode ribosomal protein S6 kinase, which can promote cell proliferation and increase grain size and yield effectively. Two markers associated with GL were predicted as TraesCS1B02G285400.1, TraesCS1B02G285700.1, and TraesCS6A02G243900, which encode glucan endo-1,3-beta-glucosidase, GTPase, and xylan 1,4-beta-xylosidase, respectively.
Table 1 Potential candidate genes linked to significant MTAs
Trait
|
Environment
|
SNP
|
Chromosome
|
Candidate genes
|
Annotation
|
References
|
PH
|
E1/E3
|
4Brs411827353
|
4B
|
TraesCS4B02G189300.1
|
CASP-like protein
|
Bai et al. 2013; Li et al. 2011; Yang et al. 2015
|
PH
|
E1/E2/E3
|
5Ars29954289
|
5A
|
TraesCS5A02G032800.1
|
methyljasmonate-inducible lipoxygenase 2
|
Huber et al. 2005; Li et al. 2018
|
PH/
UIL
|
E1/E2/E3
|
6Ars3762529
|
6A
|
TraesCS6A02G009000.1
TraesCS6A02G009100.1
|
sugar transporter SWEET14-like
|
Kanno et al. 2016; Sun 2008; Wu et al. 2018
|
PH
|
E2/E3
|
NewChr1rs122212247
|
1R
|
Sc1Loc01952146.8
|
class I glutamine amidotransferase
|
Lou et al. 2021
|
PH
|
E1/E2/E3
|
NewChr5rs148736541
|
5R
|
Sc5Loc01715798.1
|
auxin transport protein BIG
|
Alheit et al. 2014; Börner et al. 1999; Würschum et al. 2014; Korzun et al. 1996
|
SL
|
E1/E2
|
7Ars28502318
|
7A
|
TraesCS7A02G058900.1
|
brassinosteroid LRR receptor kinase BRI1
|
Chono et al. 2003; Dockter et al. 2014
|
TraesCS7A02G059000.1
|
DELLA protein DWARF8-like
|
Cassani et al. 2009
|
SSP
|
E1/E2
|
3Brs458306736
|
3B
|
TraesCS3B02G286500.1
|
myb-related protein Hv33-like
|
Li et al. 2020
|
SSP
|
E1/E3
|
7Ars421807529
|
7A
|
TraesCS7A02G301900.1
|
DNA-directed RNA polymerase II subunit RPB1
|
Li et al. 2019
|
TGW
|
E1/E2
|
4Brs422552894
|
4B
|
TraesCS4B02G196100.1
|
transmembrane protein
|
Fan et al. 2006
|
GA
|
E1/E2
|
3Ars164571570
|
3A
|
TraesCS3A02G162200.1
TraesCS3A02G162300.1
|
ribosomal protein S6 kinase
|
Brinton et al. 2017; Wang et al. 2012
|
GL
|
E2/E3
|
1Brs496533861
|
1B
|
TraesCS1B02G285400.1
|
glucan endo-1,3-beta-glucosidase
|
Ayaad et al. 2021
|
TraesCS1B02G285700.1
|
GTPase
|
Zhang et al. 2019
|
GL
|
E2/E3
|
6Ars455341188
|
6A
|
TraesCS6A02G243900.1
|
xylan 1,4-beta-xylosidase
|
Yang et al. 2016
|