Genotyping of introgression lines
We genotyped three introgression lines (IL) that were generated using MAS procedure, for identifying the best female parent showing the shortest introgression size and lower heterozygosity, for the backcrossing (BC4) with Ruta. Of the 25K SNPs on the wheat array we identified ~ 5100 SNPs for both the A and B wheat genomes, with an average density of 0.5 SNPs/Mb. Filtering the data and ordering the SNPs according to their physical positions (Table S4) validated the presence of the target QGpc.huj.uh-5B.2 in the three ILs, however, the shortest introgression (58.37 Mb) was found in plant IL99-2 (further designated as IL99) compared to introgression size of 514 Mb in IL72 and IL73. A low level of retained alleles from the original tetraploid parent (LDN) was found in the three ILs, mainly on the short arm of chromosome (Chr) 2A, albeit with differing lengths and levels of heterozygosity. IL99 displayed a low proportion of the retained LDN allele, whereas IL72 and IL73 retained alleles of LDN or heterozygous regions on Chr 3B, and 4B and 5A. Notably, the genotyping also confirmed that all the 3 BC3 lines did not carry the Gpc-B1 functional allele of WEW, that was previously identified as a major QTL in other mapping populations (Uauy et al. 2006; Distelfeld et al. 2007). Taken together, the genotyping results showed that IL99-2 (hereafter IL99) retained a minimal % of the LDN allele, and no heterozygotes regions were found on Chr 5B; furthermore, it had a low percentage of heterozygosity (0.5%) at the whole-genome level (Table S5). Therefore, IL99 was selected for validation of GPC and GY in the field and was further used as a female parent for backcrossing with Ruta.
Trait validation of GPC and GY under three environments
Our assessment of IL99 and Ruta grown in the three environments showed that GPC and TKW in both genotypes were significantly lower in the Upper Galilee field, in which N was not applied at pre-sowing, compared to the two other fields that were treated with split N doses. The extremely low GPC values found for both genotypes in the Upper Galilee suggest that the field presented intense N deficiency that may have resulted from the absence of the pre-sowing N treatment, together with possible leaching of N due to heavy rains during the plant-establishment period. Furthermore, in this low-N field, a higher GPC (33%) was found in IL99 compared to Ruta (10.84 vs. 8.15, p = 0.007), whereas no differences in GPC were found between the two genotypes in the two other fields, both under high water availability, i.e., in Acre (14.14 vs. 14.24, p > 0.05), and in the semi-desert field in Reim with 220 mm rain (13.91 vs. 13.57, p > 0.05). In Reim, we found a significant difference (p = 0.045) in GPD between the average normalized residues for Ruta (-0.178) and IL99 (0.0008), indicating the latters ability to accumulate higher GPC than predicted from GY alone. In addition, we detected a significant negative correlation (r = -0.74, p < 0.05) between GPC and TKW in the field at Acre (Table S6). In the Upper Galilee and in Reim, these relationships were not a significant factor for the increased GPC in IL99. No significant differences were found in TKW between Ruta and IL99 in any of the fields, indicating that the high GPC was not negatively correlated with TKW. High variability was found in GY, and plant height was measured in only two environments (Acre and Upper Galilee), showing no significant difference between genotypes under these conditions (Fig. 2).
Phenotypic response to LN and FN using a semi-hydroponic system
To evaluate LN tolerance in IL99 as compared with Ruta and to establish a sensitive, reproducible, and accurate method for phenotyping under controlled conditions, sever N stress was applied using a semi-hydroponic system. Reduced leaf elongation rates compared with the optimal N level was found as a reliable trait for assessing N starvation at the seedling stage (Gioia et al. 2015). Furthermore, our measurements of SLL at 14 days were highly correlated with FSW and DSW (r = 0.76 and r = 0.81, p < 0.05) both in Ruta and IL99. SLL, NL, FSW, and DSW did not differ in IL99, or showed only a slight but non-significant change, in response to LN. Therefore, SLL measured after 14 days of N deficiency was used here as an indicator to evaluate the response to N stress for fine phenotyping of recombinants. Our results confirmed that severe N stress has a negative impact on the recurrent parent Ruta. Significant changes were found in Ruta in all the measured traits under LN compared to FN, while IL99 showed only small differences for most studied traits (Fig. 3; Table S7).
Fine mapping of QGpc.huj.uh-5B.2
For the fine-mapping procedure we designed 13 KASP markers, 2 flanking the QTL and 11 saturating the QTL interval, based on SNPs found between the parental lines (LDN and G18-16, RIL12 and Ruta). Exome capture data that produced more than 620,000 SNP/indels at the whole-genome level enabled to us to identify 13 SNPs evenly distributed across the QTL region of 28.28 Mb (from 27.05 to 55.33 Mb) (Fig. 4a). We further converted SNPs to KASP markers based on the Chinese Spring RefSeq v2.0 genome (Zhu et al., 2021) (Table S2). IL99 (BC3F4) was backcrossed with Ruta to produce the BC4F1 population. To confirm heterozygosity, all BC4F1 individuals were genotyped using the KASP markers flanking the QTL (P1 WTa_060365, and P13 WTa_060962). To select single- and then double-homozygous recombinants for fine mapping, we first genotyped 1416 BC4F2 with an additional KASP marker positioned in the middle of the QTL (P6 WTa_060520). This resulted in two recombinants at BC4F3: type I (homozygous at P1, and heterozygous at P6), which were genotyped in the next generation (BC4F4) with KASP markers P2 to P6; and type II (heterozygous at P6, and homozygous at P13) which were genotyped in the next generation (BC4F4) with KASP markers P6 to P12. In total, we genotyped 2500 BC4F4 plants using 13 KASP markers along the QTL interval. This procedure enabled us to identify 26 NILs homozygous recombinants, each representing recombination events along the QTL interval (Fig. 4b). These 26 NILs were classified into 16 haplotypes. For example, NILs 15, 20, 21, and 31 each carried a QTL of similar size (Fig. 4b).
The segregation of phenotypic response to LN found in all 26 BC4F5 NILs was conducted using the semi-hydroponic system. A reduction of SLL between the two conditions was recorded as a response to LN. Our results showed that 12 NILs (of 7 haplotype groups) (15, 20, 21, 31, 25, 29, 27, 3, 32, 33, 34, and 39) had no significant reduction in SLL, and were therefore regarded as having an IL99 phenotype (G phenotype; Fig. 4c). On the other hand, SLL of 14 NILs (1, 2, 4, 8, 9, 16, 18, 19, 40, 35, 38, 36, 43, and 44) was significantly reduced (233.67 vs. 167.78 mm, p < 0.05), and these were regarded as having the Ruta phenotype (R) (Fig. 4c).
Comparison of the relative reduction of SLL between the two alleles R and G (Fig. 5) confirmed that NILs with the R allele exhibit a notable (p < 0.05) reduction in leaf growth, with its frequency decreased by approximately 65.8%, whereas NILs of the WEW allele G decreased by a smaller margin, around 11.5%. Altogether, the fine mapping which integrated the genotyping and phenotyping of 26 NILs (Fig. 4b) showed that the 12 NILs showing tolerance to N stress share a sub-QTL haplotype derived from the WEW region. We anchored the KASP markers used for the fine mapping and determined that the physical size of the target QTL region is approximately 1.29 Mb.
Candidate gene identification and microcollinearity analysis
The delimited region 0f 1.29 Mb (from 36.36 to 37.65) was estimated based on WEW Zavitan reference genome. This size differed from the T. durum var. Svevo – 1.7 Mb (from 37.03 to 38.73), and the CS – 1.92 Mb (from 36.6 to 38.52) assemblies. In WEW genome we identified 13 high-confidence genes (listed in Table 1): 2 (TRIDC5BG005570 and TRIDC5BG005760) of unknown function (Gene Ontology [GO]), and 11 classified using GO functions, or from information in the literature, into different categories. The largest group consisted of 6 genes associated with abiotic stress (TRIDC5BG005540, TRIDC5BG005560, TRIDC5BG005580, TRIDC5BG005630, TRIDC5BG005710, and TRIDC5BG005770), encoding pentatricopeptide repeat 336, cold-regulated protein, histone-lysine N-methyltransferase (HKMT) ATXR6, serine protease HtrA, syntaxin-132 (SYP132), and a metallohydrolase/oxidoreductase (MHO) superfamily protein, respectively. The second group consisted of 3 genes that were involved in N transport or N metabolism, and N stress (TRIDC5BG005530, TRIDC5BG005550, and TRIDC5BG005600), encoding 15-cis-zeta-carotene isomerase (Z-ISO), UPS1, and importin subunit β1 (KPNB1), respectively. In addition, 1 gene was involved in signaling pathways and autophosphorylation (TRIDC5BG005520), encoding p21-activated protein kinase-interacting protein 1; and 1 gene (TRIDC5BG005640) encoded a disease resistance protein.
Table 1
Homologs of putative WEW candidate genes identified in the 1.29 Mb sub-QTL, compared with T. durum var. Svevo and CS assemblies.
# | T. dicoccoides Zavitan | Start, bp | End, bp | T. turgidum Svevo | T. aestivum CS | Gene annotation | GO function |
1 | TRIDC5BG005520 | 36360061 | 36361911 | TRITD5Bv1G013710 | TraesCS5B02G033700 | p21-ACTIVATED PROTEIN KINASE-INTERACTING PROTEIN 1 | Signaling pathways |
2 | TRIDC5BG005530 | 36365122 | 36372548 | TRITD5Bv1G013720 | NA | 15-cis-ZETA-CAROTENE ISOMERASE | Nitrogen stress |
3 | TRIDC5BG005540 | 36396013 | 36398271 | TRITD5Bv1G013670 | TraesCS5B02G033500 | PENTATRICOPEPTIDE REPEAT 336 | Stress |
4 | TRIDC5BG005550 | 36404321 | 36406308 | TRITD5Bv1G013770 | TraesCS5B02G033400 | UREIDE PERMEASE1 | Nitrogen stress |
5 | TRIDC5BG005560 | 36428596 | 36428820 | TRITD5Bv1G013610 | TraesCS5B02G033300 | Cold-regulated protein | Cold stress |
6 | TRIDC5BG005570 | 36428604 | 36428765 | NA | NA | Undescribed protein | NA |
7 | TRIDC5BG005580 | 36437689 | 36438391 | TRITD5Bv1G013570 | NA | HISTONE-LYSINE N-METHYLTRANSFERASE ATXR6 | Development and stress |
8 | TRIDC5BG005600 | 36531860 | 36537534 | TRITD5Bv1G013910 | TraesCS5B02G034200 | IMPORTIN SUBUNIT BETA-1 | Nitrogen stress |
9 | TRIDC5BG005630 | 36820431 | 36822867 | TRITD5Bv1G013990 | TraesCS5B02G034400 | SERINE PROTEASE HTRA | Stress |
10 | TRIDC5BG005640 | 36947849 | 36949190 | TRITD5Bv1G014030 | TraesCS5B02G034600 | Disease resistance protein | Biotic stress |
11 | TRIDC5BG005710 | 37234570 | 37237309 | NA | TraesCS5B02G035600 | SYNTAXIN-132 | Stress |
12 | TRIDC5BG005760 | 37446998 | 37447553 | TRITD5Bv1G018160 | TraesCS5B02G035500 | Unknown function | NA |
13 | TRIDC5BG005770 | 37645630 | 37648660 | TRITD5Bv1G014330 | TraesCS5B02G035300 | METALLOHYDROLASE/OXIDOREDUCTASE SUPERFAMILY PROTEIN | Stress |
Homology indicates sequence similarity due to a common ancestry, whereas collinearity indicates homology due to the linear arrangement of genes along a chromosome (Chen et al. 2020). Microcollinearity analysis of the 1.29 Mb sub-QTL found in QGpc.huj.uh-5B.2 between Zavitan (WEW), CS (T. aestivum), and Svevo (T. turgidum) was conducted to better understand structural variations, such as rearrangements, duplications, and inversions (Fig. 6). Notably, the microcollinearity tool developed by (Chen et al. 2020) uses V1 of the Zavitan genome as a reference; nevertheless, since for all of our previous analyses we used Zavitan v2, we first made sure that the order of our target genes in the analyzed sub-QTL is similar in these two versions. The homology and microcollinearity of the 13 candidate genes showed some differences between the three species and included insertions, inversions, deletions, and translocations. Notably, homologous genes observed in our analysis may or may not exhibit collinearity, depending on the evolutionary events that occurred after their divergence. We found three genes with no microcollinearity or homology to WEW, Svevo or CS (Fig. 6): one gene, TRIDC5BG005570, was not annotated, and was specific to WEW; the second gene (TRIDC5BG005710) encodes SYP132, and had a homolog in T. aestivum, but in a different location (38.35 Mb outside the sub-QTL), and the third gene (TRIDC5BG005760), which is unannotated, was homologous to both species but found in different locations, outside of the sub-QTL in the two genotypes (38.51 Mb in CS, and 51.18 Mb in Svevo).
Gene expression under N deficiency tested by RT-qPCR
We selected seven genes for analysis by RT-qPCR. The first gene encodes UPS1, resides within the 1.29 Mb QTL. The remaining six genes encoding auxin response factor 6 (ARF6), NRT1/PTR FAMILY 2.11 nitrogen transporter (NRT), ent-kaurenoic acid oxidase 1 (EKAO1), SEC1 family transport protein SLY1 (SEC1), 9-cis-epoxycarotenoid dioxygenase (NCED), and WD40 repeat-like protein (WD40) reside in the full QTL interval. UPS1 was significantly activated in response to N stress in IL99, showing a 2.39-fold increase (p < 0.05) compared to Ruta, and a 1.85-fold increase (p < 0.05) in NIL21 which showed tolerance to N stress (IL99 type), compared to NIL38 (Ruta type) under N stress (Fig. 7a). ARF6 was upregulated by 49.5% in IL99 compared to Ruta, and by 45.7% in NIL21 vs. NIL38. NRT expression was 2.36-fold higher (p < 0.05) in Ruta compared to IL99, and no significant differences were found between the NILs. In the RT-qPCR analysis, the remaining genes EKAO1, SEC1, NCED, and WD40, that are distributed across the entire QTL, exhibited no difference in expression between the parental lines or between NILs (not presented in Fig. 7). Notably, differential activation of candidate genes under N stress implies that they may—but not necessarily—have a causative effect on N tolerance (Peredo and Cardon 2020). These results suggest that the UPS1 allele of WEW may have an important role for improved response to N stress in IL99 and other NILs with this allele.