2.1 WEW Panel Geographic Distribution
The majority of WEW accessions were sourced from the regions of the Southern Levant (SL) and Southeastern (SE) Anatolia, with only 4 accessions from Iran and 18 from Iraq (Fig. 1, Supplementary Table S1). Additionally, 6 accessions had no passport information and were initially obtained from the other research institutions.
2.2 Genotyping
The average sequencing depth of accessions was 9.5x (Supplementary Figure S1), and the raw paired-end FASTQ files totaled 14 TB data, which were uploaded to the National Center for Biotechnology Information (NCBI) BioProject accession #PRJNA1007489. The number of filtered SNPs totaled 68 million, while in the selected 5% subset, there were 3,410,677 SNPs (Supplementary Table S3). The SNP density plot showed different densities of these variants across the 14 chromosomes (Supplementary Figure S2). For all chromosomes in the genome, there were at least 200K SNPs per chromosome (Supplementary Table S3).
2.3 Passport Data Curation
Allele matching revealed five pairs of WEW accessions with at least one duplicate (Supplementary Table S4). Most of the identified duplicate accessions (TA117 and TA1459; TA11202, TA11203 and TA119; TA131, TA132 and TA133; TA134 and TA135) were collected in nearby zones (Supplementary Table S1), particularly from Golan Heights and nearby regions. We also observed the need to update and ensure consistency in the ‘collection sites’ between the gene bank and the USDA GRIN database, particularly regarding changes in geopolitical maps.
2.4 Phylogenetic tree and PCA
The unrooted NJ phylogenetic tree revealed three major WEW clades (Fig. 2). Notably, a group of accessions near the Sea of Galilee (Lake Tiberias) formed a distinct clade, indicating significant genetic differentiation within this group, even among accessions from the same region around Lake Tiberias. A comparison of these accessions with the previously recorded spike morphology data of the accessions confirmed that these accessions represent the wide-spiked race judaicum [19, 20]. Accessions from Southeastern Anatolia (SE), Iran, and Iraq formed a separate clade, known as the Northern Population[14]. Other than judaicum, accessions from the Southern Levant (SL) constituted a single major clade, with some accessions from Lebanon and Syria forming a small subclade under the SL clade (Fig. 2). The northern and southern populations of WEW have been described elsewhere in the literature [14, 21, 22]; therefore, we named the genetically separated subgroups as the Northern Population, the Southern Levant (SL) Population and the judaicum. Additionally, four accessions (TA120, TA1468, TA11175 and TA11193) clustered at the base of the SL clade near the nodes where judaicum and the Northern Population separated. Interestingly, the two accessions, TA11208 and TA11218 from northern Syria also clustered with accessions from the northern Lebanon and the southern Syria (Fig. 2).
The division of WEW accessions in the PCA plot reflected their geographic origin. The judaicum accessions form a distinct subgroup, clearly differentiating them from the rest. The first principal component (PC1), explaining 29% of the total variation, separated the judaicum accessions from the other two groups (Fig. 3). The second principal component (PC2), which accounted for 16% of the variation, distinguished accessions between the Northern Population and the SL. Accessions without passport information were grouped with those from the Northern Population (Fig. 3). As in the phylogenetic tree, four accessions (TA120, TA1468, TA11175 and TA11193) were grouped between the judaicum and the other accessions clusters in the PCA plot.
2.5 Population Structure
The population structure analysis further confirmed the differentiation of the WEW collection into various subgroups and admixture groups, aligning well with the phylogenetic tree and PCA grouping. When testing K = 2 ancestral subpopulations, only the judaicum subgroup was differentiated from the rest of the accessions (Fig. 4). The Northern and the SL population accessions were separated when K was increased to 3 or more (Fig. 4). However, no solid differentiation of the subpopulation was observed after K = 3. The four accessions that were separated from the SL and judaicum clades in both the phylogenetic tree and PCA were identified as admixtures including ancestry of judaicum and SL subgroups (Fig. 4).
To evaluate the relationship between these WEW subpopulations, particularly between judaicum and domesticated emmer (T. turgidum subsp. dicoccum), concerning the point of domestication and the cultivated emmer's immediate ancestor, we utilized 31,277 SNPs generated by genotyping-by-sequencing (GBS) of both wild and cultivated emmer. The genetic clustering based on the GBS data revealed that domesticated emmer is more closely related to wild emmer accessions from the Northern Population (Supplementary Figure S3), suggesting that judaicum is not the immediate ancestor of domesticated emmer wheat.
2.6 Diversity and FST
The Nei’s diversity index and the number of segregating alleles computed indicated that the SL population has a higher Nei's index (0.17) compared to the Northern Population (0.15) and judaicum (0.06) (Supplementary Table S5). The pairwise FST calculations among the subpopulations revealed that judaicum is more differentiated than both the Northern Population and the SL. All three pairwise FST approaches (Weir and Cockerham, beta, and Nei's) displayed a similar pattern of values among subpopulations. Specifically, the Northern and SL populations exhibited the lowest FST value (Nei's FST = 0.05), while judaicum and the Northern Population showed the highest pairwise FST values (Nei's FST = 0.204) [Supplementary Table S6]. Thus, judaicum represents the most genetically distinct WEW subgroup compared to the Northern and SL populations.
2.7 Linkage Disequilibrium
In each subpopulation, the linkage decay was rapid over the first 250 Kb (Supplementary Figure S4) with the pattern varying slightly among the subpopulations. Specifically, in the judaicum subpopulation, the mean r2 dropped from 0.8 to 0.4 (half) over this interval, whereas in the Southern Levant and Northern Populations, the mean r2 dropped from 0.5 to 0.25 and from 0.7 to 0.35, respectively (Supplementary Figure S4).
2.8 Genetic Resources for Disease Resistance in WEW
With the stem rust evaluations, we found two accessions (TA11165 and TA11196) resistant or moderately resistant to four races of the stem rust pathogen (Supplementary Table S1), whereas four (1.4%) other accessions (TA1042, TA1045, TA11208, and TA1464) were resistant or moderately resistant to three races. Fourteen other accessions (4.8%) were resistant or moderately resistant to two races, and 28 accessions (9.6%) were resistant or moderately resistant to only one race (Supplementary Table S1). The largest number of accessions resistant to an individual stem rust pathogen race was 26 (9%) to race TTKSK, followed by 17 (5.8%) to TKTTF, 14 (4.8%) to TTTTF, 10 (3.4%) to JRCQC, and 9 (3.1%) to TTRTF.
For the leaf rust evaluations, we found four (1.4%) accessions (TA66, TA1014, TA1434, and TA1465) that were either resistant or moderately resistant to all five races of leaf rust pathogen (Supplementary Table S1 and Supplementary Figure S6). The accession TA11156, which was resistant across all races, exhibited no symptoms (score ‘0’) for any leaf rust pathogen. Unfortunately, TA11156 is not included in our genotyping list and could not be displayed in the phylogenetic tree. The accessions TA11204 and TA1474 were resistant or moderately resistant to four races of the leaf rust pathogen. Four (1.4%) other accessions (TA1389, TA1470, TA10667, and TA11208) were resistant or moderately resistant to three races. Similarly, 14 accessions (4.8%) were resistant or moderately resistant to two of the five races; and 33 accessions (11.3%) were resistant/moderately resistant to only one race (Supplementary Table S1 and Supplementary Figure S6). The largest number of accessions resistant/moderately resistant to an individual race was 41 (14.1%) to race TBBGS, followed by 19 (6.6%) to TNBGJ, 18 (6.2%) to BBBQD, 12 (4.2%) to TFBGQ, and 11 (3.8%) to MFPSB.
With the stripe rust evaluations, we found 59 accessions (20.3%) accessions were resistant or moderately resistant to all five races; 28 accessions (9.6%) were resistant or moderately resistant to four of the five races; 23 accessions (8%) were resistant or moderately resistant to three of the five races; 24 accessions (8.2%) were resistant to two of the five races; and 50 accessions (17.2%) were resistant to only one race (Supplementary Table S1 and Supplementary Figure S7). The largest number of accessions resistant to an individual race was 150 (51.5%) to race PSTv-37, followed by 117 (40.2%) to race PST5006, 106 (36.4%) to race PSTv-143, 105 (36.1%) to race PSTv14, and 96 (33%) to race PSTv-221.
We also identified accessions that were resistant to all three rust pathogens. Notably, accession TA11204 was resistant or moderately resistant to 10 out of 15 tested races and was resistant or moderately resistant to at least two races of all three rust pathogens (Supplementary Table S1). Two other accessions, TA1474 and TA11196, were resistant to nine races together of three pathogens. Interestingly, TA1474 was not resistant to any stem rust pathogen races, whereas TA11196 was not resistant to any leaf rust races, but both accessions were resistant or moderately resistant to all five races of stripe rust (Supplementary Table S1). Two other accessions, TA11165 and TA1470, were resistant or moderately resistant to at least eight races. Thirteen (4.5%) other accessions were resistant to seven of 15 tested races, whereas 17 (5.8%) other accessions were resistant or moderately resistant to six races.
2.9 Distributions of Resistant Accessions Across Geography
We compared the rust reaction of accessions to their geographic origin to discern patterns of concentration for this trait. Comparison of the rust reactions with their sites of origin revealed that that the SL population has a higher proportion of resistant accessions than the northern population and judaicum (Table 1, Fig. 5–6, Supplementary Figure S6 and S7). Additionally, the response of each rust pathogen varied across each subpopulation.
Table 1
Percentage of resistant or moderately resistant accessions per population group. Total accessions that were genetically grouped under SL,
Disease | Pathogen Races | % Resistant accessions per subgroup |
SL | Northern Population | judaicum |
Stem rust | TTKSK | 14.4 | 0 | 5.7 |
JRCQC | 6.1 | 0 | 0 |
TKTTF | 7.8 | 0 | 8.6 |
TTRTF | 4.2 | 0 | 2.9 |
TTTTF | 7.2 | 0 | 5.7 |
Leaf Rust | TBBGS | 19.2 | 4.1 | 8.6 |
BBBQD | 7.2 | 0 | 11.4 |
TFBGQ | 3.6 | 4.1 | 0 |
MFPSB | 3.6 | 2.6 | 0 |
TNBGJ | 7.8 | 2.6 | 5.7 |
Stripe Rust | PSTv-221 | 51.3 | 1.3 | 29.4 |
PSTv14 | 51.8 | 14.3 | 17.6 |
PSTv-143 | 47.6 | 11.7 | 40 |
PST5006 | 51.2 | 15.6 | 48.6 |
PSTv-37 | 65.9 | 21.8 | 54.3 |
2.9.1 Geographical distributions of stem rust resistance
For all stem rust races, the majority of resistant and moderately resistant accessions originated from the SL subpopulation (Table 1), particularly from the Haifa zone (Supplementary Table 1). Within the SL subgroup, the highest percentage of resistant accessions (14.4%) was observed against the race TTKSK, whereas the lowest percentage of resistant or moderately resistant accessions (4.2%) was observed against the race TTRTF (Table 1). Surprisingly, no resistant or moderately resistant accessions were observed in the Northern Population for any races of the stem rust pathogen. Within the judaicum subpopulation, the highest percentage of resistant accessions (8.6%) was observed against the race TKTTF, while no accessions were resistant to the race JRCQC (Table 1, Fig. 6).
2.9.2 Geographical distributions of leaf rust resistance
For all leaf rust pathogen races, the majority of resistant or moderately resistant accessions originated from the SL subpopulation (Table 1, Supplementary Table 1). Within the SL subgroup, the highest percentage of resistant accessions (19.2%) was observed against the race TBBGS, whereas the lowest percentage of resistant accessions (3.6%) was observed against the races TFBGQ and MFPSB (Table 1). Within the Northern Population, the highest percentage of resistant or moderately resistant accessions (4.2%) was observed against races like TBBGS and TFBGQ (Table 1), whereas no accessions resistant to BBBQD were observed in this subpopulation. Within the judaicum subpopulation, we observed resistant or moderately resistant accessions as follows: 11.4% to BBBQD, 8.6% to TBBGS, 5.7% to TNBGJ, and no resistant accessions to TFBGQ and MFPSB (Table 1, Supplementary Table 1).
2.9.3 Geographical distributions of stripe rust resistance
For the stripe rust pathogens, a higher number of resistant or moderately resistant accessions originated from the Southern Levant (SL) subpopulation (Table 1, Supplementary Table S1, and Supplementary Figure S7). Within the SL subgroup, the highest percentage of resistant accessions (65.9%) was observed against race PSTv-37, whereas the lowest percentage of resistant accessions (47.6%) was observed against race PSTv-143 (Table 1). This indicates that about 50% of accessions in the SL group are resistant to all five races of the stripe rust pathogen. Within the Northern Population, the highest percentage of resistant or moderately resistant accessions (21.8%) was observed against race PSTv-37, whereas the lowest percentage (1.3%) of resistant accessions was against PSTv-221 (Table 1). Within the judaicum subpopulation, the highest percentage (54.3%) of resistant or moderately resistant accessions was observed against race PSTv-37, while the lowest percentage (17.6%) of resistant accessions was observed against race PSTv-14 (Table 1, Supplementary Table S1, and Figure S7).
2.10 Genome-Wide Association Study (GWAS)
The GWAS revealed highly significant novel and some known loci that explain higher phenotypic variances (up to 60%) (Table 2; Supplementary Tables S7 and S8). Most of the GWAS models identified common causal regions whether we ran an analysis in the entire or only in the SL population.
Table 2
The genomic regions that are statistically significant (p < 0.05) after Bonferroni correction for the marker-trait association for stem rust (SR) resistance in the WEW. The phenotypic variance explained (PVE) in % and P-value was reported for the GWAS model represented by the bold characters (G, V, F, b). The Zavitan genome (v2.1) positions were used to track the annotated genes for the position. The loci naming at the second column follows chromosome_position.
Pathogen Races | Peak loci and Position | GWAS model | PVE % | P-value | Pop Group | Stem Rust QTL and functional genes on Zavitan [WEW_v2.1] around the GWAS peak marker region, and the literature cited for the known genes |
TTKSK (Ug99) | 1B_54289580 | G, V, F, b | 54 | 4.5e− 12 | To, SL | Trichome birefringence-like 10 (TBL)[23, 44] (54.29 Mb), and QTL at 1BS in durum [68] and WEW [23] |
4B_672753066 | G, b | 9.4 | 7.5e− 10 | To | Gibberellin 2-beta-dioxygenase 6-like at 672.9 Mb and zinc finger protein KNUCKLES-like (672.5 Mb) |
1B_94672107 | G, V, F | 7.5 | 1.5e− 9 | To, SL | Probable sugar phosphate/phosphate translocator At3g11320 (94.674 Mb) |
3B_380702044 | F | 3 | 2.4e− 12 | SL | Uncharacterized protein at 380.7 Mb; putative L-cysteine desulfhydrase 1, transcript variant X1 at 381.2 Mb |
5A_567843744 | G | 3 | 2.5e− 10 | SL | BTB/POZ domain-containing protein At4g01160-like (567.85 Mb) |
JRCQC | 1B_54289580 | G, V, F, b | 26.5 | 2.2e− 14 | To, SL | *** location overlapped with the loci for TTKSK at 1BS*** |
3A_55126148 | G, b, F | 25 | 7.9e− 11 | SL | JRCQC QTL at 3A of durum[69]; uncharacterized protein at 55.12 Mb; histone H2B.4-like at 55.14 Mb and probable LRR receptor-like serine/threonine-protein kinase At3g47570 at 54.95 |
5A_390655074 | G, b | 18.8 | 6.12e− 10 | To | QTL reported at 5AL of the WEW and durum wheat[68, 70]; myb-related protein 306-like at 390.55 Mb |
6B_709557414 | G, F, b | 12.1 | 3.1e− 12 | To, SL | i) Sr13[71] ortholog [MW033594.1 T. turgidum subsp. durum cultivar CAT-A1 Sr13] aligned (91%) at 717.02 Mb: RGA5-like, transcript variant X1 in the annotated file ii) Lysine-specific histone demethylase 1 homolog 3-like at 709.59 Mb iii) Putative disease resistance protein RGA3 and RGA4 at 715.7 Mb iv) Disease resistance protein PIK6-NP-like and G-type lectin S-receptor-like serine/threonine-protein kinase SD2-5 at 716 Mb v) Reported QTL at 6BL of durum and WEW [68, 70] |
TKTTF | 2B_714517390 | G, V, b | 14.7 | 1.1e− 14 | To, SL | Importin subunit beta-1-like, transcript variant X1 and QTL for the race in durum 2BL [72]; Wheat ortholog SR9E_H2 at vicinity (699.23 Mb) |
5A_597691058 | G, b | 12 | 1.e− 16 | To, SL | Probable WRKY transcription factor 57 at 597.69 Mb, role of WRKY in plant defense [73] *** |
1B_56521356 | b | 6.7 | 5.13e− 16 | SL | 5'-methylthioadenosine/S-adenosylhomocysteine nucleosidase-like, transcript variant X2 (56.52 Mb) |
3B_127728931 | F, b | 5 | 6.25e− 21 | SL | Acyl transferase 9-like at 127.64 Mb; glutamine–tRNA ligase-like at 127.5 Mb and berberine bridge enzyme-like 26 at 127.49 Mb |
3B_116165318 | G, b | 3 | 2.3e− 17 | To, SL | Serine/threonine-protein kinase Aurora-2 at 116.33 Mb; barley Rpg1 candidate gene [74] |
6B_165520546 | G, b | 3 | 5.8e− 12 | To | BOI-related E3 ubiquitin-protein ligase 1-like (165.41 Mb); putative homeobox-leucine zipper protein HO at 165.30 Mb and probable C-terminal domain small phosphatase at 165.77 Mb: plant immunity-related gene [75] |
TTRTF | 6A_534202576 | G, V | 41.3 | 1.1e− 8 | To, SL | QTL for TTRTF reported at 6AL in durum wheat[76]; bZIP transcription factor 2-like at 534.23 Mb and uncharacterized protein at 534.20 Mb; and eukaryotic translation initiation factor 5-like, transcript variant X1 at 534.23; Sr13 ortholog from durum mapped on 633.6 Mb |
3A_673490015 | F, b | 29.4 | 5.2e− 09 | SL, To | QTL at 3AL for this race in durum wheat[76]; putative disease resistance protein RGA3, transcript variant X3 at 673.56 Mb and exocyst complex component EXO70A1-like at 673.55 Mb |
5A_597691058 | b | 10 | 5.7e− 14 | To | *** location overlapped with the loci for TKTTF at 5AL*** |
2B_529637064 | G, b | 9 | 6.2e− 13 | To | Peptide methionine sulfoxide reductase A2-1-like at 529.62 Mb [77]; NAC transcription factor NAM-B2-like; and BTB/POZ and TAZ domain-containing protein 2-like at 529.9 Mb |
5B_359661332 | F, b | 5 | 9.9e− 10 | To | aminopeptidase M1-C-like at 360.63 Mb |
TTTTF | 4B_452141975 | G, F, b | 28.7 | 1.2e− 8 | To | QTL at 4BL for this race in durum panel [68]; putative clathrin assembly protein At2g25430 at 452.14 Mb; palmitoyltransferase akr1-like, transcript variant X1 at 452.15 Mb and E3 ubiquitin-protein ligase XB3-like, transcript variant X2 at 453.06 Mb |
1B_469154488 | b | 15.2 | 4.2e − 10 | To | Several QTL for stem rust race TTTTF at 1B in durum [68]; probable glycosyltransferase STELLO2 at 469.16 Mb and probable membrane-associated kinase regulator 1, transcript variant X1 at 469.01 Mb |
2A_608467259 | b | 14 | 4.6e− 18 | SL | CBS domain-containing protein CBSCBSPB3-like, transcript variant X6 at 608.37 Mb and putative E3 ubiquitin-protein ligase SINA-like 6 at 608.55 Mb |
5B_539977539 | b | 13.6 | 1.2e− 19 | SL | Cortical cell-delineating protein-like and T-complex protein 1 subunit beta-like at 539.95 at Mb |
5A_4511589 | F, b | 12.6 | 7.1e− 22 | To | Pentatricopeptide repeat-containing protein At1g18900-like at 4.51 Mb |
2A_288803558 | b | 7.4 | 4.6e− 18 | SL | Probable ribose-5-phosphate isomerase 3, chloroplastic at 289.0 Mb |
5A_263785125 | F, b | 4 | 1.9e− 11 | SL | Serine/threonine-protein phosphatase 7 long form homolog at 263.93 Mb and basic leucine zipper 19-like at 263.50 Mb, which is plant defense gene [78–80] |
4B_570338558 | F, b | 3 | 1.5e− 10 | SL | Protein IQ-DOMAIN 1-like at 570.32 Mb basic blue protein-like at 570.31 Mb and QTL at 4BL in durum [68] |
G = rrBLUP; V = CMLM; F = FarmCPU and b = Blink |
SL = Southern Levant population except for judaicum |
Pop = Population |
To = Total population in the panel |
2.10.1 GWAS for the stem rust
Across all five races of stem rust evaluated, we identified 28 genomic regions significantly associated with resistance to P. graminis f. sp. tritici based on Bonferroni-corrected P-values (Table 2). In response to race TTKSK, we found a highly significant locus at 1BS, at 54.289 Mb, explaining 54.3% of the phenotype variance. This region was identified by various GWAS models such as Blink, FarmCPU, CMLM, and rrBLUP (Table 2 and Fig. 7). Additional loci associated with resistance to this race were detected on chromosomes 3B (380 Mb), 4B (672 Mb), and 5A (567 Mb) (Table 2). While many of these associations are novel, some have been reported in earlier studies [23]. For the JRCQC stem rust race, we found a major association at the same location at 1BS (PVE = 26.5%) as with resistance to TTKSK (54.28 Mb) (Table 2). Other significant loci for this race were identified at 3A, 5A and 6B (Table 2).
For the TKTTF race, we observed two major associations at 2BL (PVE = 14.7%) and 5AL (PVE = 12%). Additional significant loci for resistance to this race were detected at 1BS, 3BS, and 6BS (Table 2). Similarly, to race TTRTF, the major genomic region associated with resistance was identified at 3AL, 5AL, 2BL, and 5BL (Table 2). Likewise, for the TTTTF race, a major QTL was detected at 4BL. Other significantly associated loci for this race were observed at 1BL, 2AL, 4BL, 5AS, and 5BL (Table 2).
2.10.2 GWAS for the leaf rust
Twenty-five (25) loci were found associated with resistance to the five races of Puccinia triticina. Major effect loci for resistance to race TBBGS were found at 1AS (PVE = 10%), 2A, 5A, 4B, 4A, and 3A (Supplementary Table S7). With race BBBQD, significant associations were found at 3BS, 6BL, 4BL, 3BL, 2AS, and 1AS (Supplementary Table S7). Perhaps due to the limited number of resistant accessions identified to race TFBGQ (Supplementary Figure S6), only two loci were identified: on chromosomes 3B and 6B, exclusively within the SL group (Supplementary Table S7). For race MFPSB, four different genomic regions were associated with resistance at 2AL, 2BL, 3BS, and 7BS (Supplementary Table S7). For race TNBGJ, significant associations (PVE = 46%) were observed around 257.9 Mb on chromosome 2A, as well as on 3BS (87 Mb), 2BL (498 Mb), 1BS (7 Mb), 2AS (31 Mb), and 2AL (733 Mb) (Supplementary Table S7).
2.10.3 GWAS for the stripe/yellow rust
For the five yellow/stripe rust races, we detected a total of 32 loci associated with resistance, ranging from 4 to 7 loci per race (Supplementary Table S8). A highly significant major effect locus for resistance to race PSTv-221 was positioned at 99 Mb on chromosome 1BS, explaining about 16% of the phenotypic variance (Supplementary Table S8). Other significant loci associated with resistance to this race were detected at 1A (321 Mb), 3A (633 Mb), 6B (444 Mb), 5A (563 Mb), and 7A (221 Mb) (Supplementary Table S8). For race PSTv-14, significantly associated loci were observed on chromosomes 1A, 1B, 2A, 4B, 5B, 7A, and 7B, contributing from 3–16.7% of phenotypic variance (Supplementary Table S8). For stripe rust race PSTv-143, significant marker-trait associations were found at 1A, 1B, 2A, 3B, 4A, and 5B, contributing up to 22.2% of phenotypic variance. Similarly, for race PST5006, the most significant association was observed at 6B at 155 Mb, while other loci detected for this race were on 1A, 1B, 2A, 4B, and 7A of WEW (Supplementary Table S8). For another race, PSTv-37, we obtained a highly influential loci at 3AL that contributed nearly 60% of PVE and was located around 716 Mb of the Zavitan (v2.1) genome. Another important association for this race was observed at 1BS, and other minor loci were also found at 1A, 2A, 4B, 6A, and 6B (Supplementary Table S8).
2.11 Known Candidate Genes and Novel Loci Identified by GWAS
The highly significant locus at 1BS associated with resistance to P. graminis f. sp. triticiraces TTKS and JRCQC was discovered near the ortholog of the trichome birefringence-like (TBL) family gene (Supplementary Table S9) on the Zavitan genome. The locus consistently emerged in GWAS conducted with the entire population or the SL, across multiple GWAS models, affirming the reliability of the identified loci. Another locus, '6B_709557414', found on chromosome 6B could be Sr13. Notably, when aligning the 2Kb coding sequence of durum Sr13, a high alignment rate (96%) was observed at 6AL, the expected position of Sr13, but also a substantial alignment (91%) at 709 Mb of 6B, suggesting a potential additional copy of Sr13 at 6BL (Supplementary Table S9). This could indicate the introgression of durum gene Sr13 into the Zavitan, given the significant introgression from durum observed in the wild emmer wheat [24]. Alternatively, the gene at 6B could be a novel gene. The region around 709 Mb at 6BL harbors several functional genes, including the disease-related gene RGA3 at 715 Mb (Table 2).
For leaf rust, the GWAS-identified loci were associated with several potential candidate genes. For instance, the locus identified for the TBBGS race on 4BS at 70 Mb likely represents the receptor protein kinase-like protein ZAR1, a gene associated with leaf rust resistance [25]. Another gene for the same race, located on the long arm of 4A (748 Mb), likely represents the ortholog of a leaf rust-related kinase, known as the Lr10-like gene (Supplementary Table S7). Additionally, a locus for MFPSB at 2A (2A_733147249) was found to be collocated with an ortholog of the RING-H2 finger protein ATL43-like gene, which plays a role in plant defense [26]. Similarly, another orthologous gene, 'NF-X1-type zinc finger NFXL2-like', collocated with the GWAS-identified locus at 3B (159 Mb) for TBBGS, has also been reported as related to leaf rust resistance [27]. Some other GWAS-identified loci are collocated with other functional genes and most likely represent novel loci (Supplementary Table S7).
For stripe rust resistance, some GWAS peaks we identified were collocated with the ortholog sequences of previously reported stripe rust resistance genes, such as Yr15, YrU1, and Yr36 (Supplementary Table S8-S9). A GWAS peak loci for races PSTv-221 and PSTv-14 (1B_99349879), located at 99 Mb on chromosome 1BS, was identified near the alignment (93%) position of the Yr15 at 97.9 Mb, while the GWAS locus for PSTv-14 (5A_563398244) was closer to the alignment (99%) position (Supplementary Table S9) of the urartu stripe rust gene YrU1 at 554 Mb of 5AL. Similarly, the GWAS peak at 6B (6B_155315249) was in the vicinity of the alignment (98%) position of an ortholog of the wheat stripe rust gene Yr36 (Supplementary Table S8). Although Yr36 is an adult plant resistance gene, when pyramiding with other adult plant resistance genes such as Yr18 and Yr28 also conferred seedling stage resistance[28]. Additionally, a GWAS-identified locus for PSTv-14 (7A_638642386) was found to collocate with an annotated gene, Ethylene-responsive transcription factor 3-like at 638.77 Mb on 7A. Similarly, the WRKY transcription factor was found to collocate with the GWAS loci at 1BS (1B_123847300) for the races PSTv-143 and PSTv-37. Both of these genes have been previously reported as related to stripe rust resistance [29, 30]. Some other previously known stripe rust QTL and the loci we identified here using GWAS were collocated or resided in the vicinity of the same chromosomes as described in Supplementary Table S8.