Phenotype characterization of the gpa1 mutant
Normally, the seedlings of gpa1 have an albino first leaf and display chlorosis at the tip of the second leaf (Fig. 1A). A striped pattern then develops with expansion of the second leaf (Figs. 1A and 1B). Striping occasionally occurs on the third and subsequent leaves. Consistent with the visual difference, chemical analysis also indicated that levels of chlorophyll a, b and total in striped leaves of BW397 were significantly lower than that of WT Bowman plants (Fig. 1C). An adult BW397 plant produces albino or striped flag leaf, spike, awns and even anthers (Fig. 1D and Additional Fig. 1), and the mutant is much shorter than WT (Fig. 1D). The mutants display reduced fertility with an average of 4 seeds/spike compared to 18 seeds/spike in WT (Fig. 1E). Seed size of BW397, measured as 100-seed weight, is only 3/4 that of the WT’s (Fig. 1F). In addition to the striped leaf phenotype, the gpa1 mutation causes a systemic effect on barley growth and development.
To confirm the striped phenotype is associated with defects in chloroplast biogenesis, we compared the chloroplast ultrastructure of Bowman and BW397 using transmission electron microscopy (TEM) (Fig. 2). As expected, numerous normal chloroplasts with well-organized thylakoids are developed in the WT cells (Figs. 2A and 2B). Thylakoids are arranged in grana stacks, which are connected by stroma lamellae (Fig. 2C). On the contrary, BW397 cells in the albino sections contain few undeveloped plastids (Fig. 2D). Lamellae, acting like the skeleton in a normal chloroplast, is not well developed in the mutant (Figs. 2D, 2E, and 2F). Stacked grana and thylakoid membranes are also missing in these undifferentiated plastids (Figs. 2E and 2F). Therefore, chloroplast biogenesis in the white stripes of BW397 is abolished by the gpa1 mutation.
Genetic mapping of Gpa1
A total of 510 F2 plants derived from the cross between Bowman and BW397 were used for genetic mapping of Gpa1. Of those, 133 were striped, while the remaining 377 F2 plants exhibited normal green leaf color. The segregation ratio of striped/green fits 1:3 (ꭓ2 = 0.316, df = 1, and P = 0.57), suggesting that the gpa1 mutation is monofactorial recessive. As an immediate strategy for gene localization and marker discovery, SNP array analysis was used to genotype 48 F2 plants (24 striped and 24 green) together with the parental lines. As the Gpa1 gene was anchored to 2H [20], we focused on the SNPs on 2H and identified 1013 polymorphic markers on this chromosome (Additional Table 1).
Initial genotype analysis of 48 F2 progeny revealed that the Gpa1 gene was located in ~ 5.56 Mb region flanked by two array SNPs, JHI-Hv50k-2016-139629 and JHI-Hv50k-2016-142540 (Additional Table 1). To increase the resolution of mapping, we enriched this region with SSR markers previously co-located to the stripped phenotype [20], known SNPs on 2H consensus map [26], and specific 50 k markers within this region (Table 1). The analysis of an additional 104 F2 progeny genotyped with these markers further delimited the Gpa1 gene between M4 and M6 (Fig. 3A). When we enlarged the segregating population to 510 F2 plants, the Gpa1 region was narrowed down to a 410-kb region by M19 and M8 (Fig. 3B), where one co-segregating SNP, M20, was identified.
Physical localization of Gpa1
Fifteen putative protein-coding genes were identified in the fine-mapping interval according to the reference genome assembly [27] (Table 2). Of them, three putatively encode acyl-protein thioesterase 1-like proteins (HORVU.MOREX.r2.2HG0177100/G1, HORVU.MOREX.r2.2HG0177110/G2, and HORVU.MOREX.r2.2HG0177120/G3), and one each for EH domain-containing protein 1 (HORVU.MOREX.r2.2HG0177170/G4), ethylene-responsive transcription factor (HORVU.MOREX.r2.2HG0177180/G6), FAD-binding Berberine family protein (HORVU.MOREX.r2.2HG0177170/G7), and choline transporter-related family protein (HORVU.MOREX.r2.2HG0177310/G15). A gene cluster functionally related to photosynthesis or organelle biogenesis was also identified in the Gpa1 region, including 4 blue copper genes with high similarity (HORVU.MOREX.r2.2HG0177210/G7, HORVU.MOREX.r2.2HG0177220/G9, HORVU.MOREX.r2.2HG0177230/G10, and HORVU.MOREX.r2.2HG0177250/G12), one DNA topoisomerase gene (HORVU.MOREX.r2.2HG0177240/G11), and 2 genes encoding putative PTOXs with homology to alternative oxidase (HORVU.MOREX.r2.2HG0177250/G13 and HORVU.MOREX.r2.2HG0177260/G14).
Blue copper protein function as an electron shuffler in electron transfer reactions, such as biological nitrogen fixation, respiration and photosynthesis. Structure analysis indicated the putative blue copper proteins, G7, G9, G10, and G12, in the Gpa1 region contain a domain identified in plastocyanin, the long-range electron carrier between photosystems II and I [28]. Alternative oxidase is involved in the regulation of redox state of the electron transport chain in organelles [8, 9, 18]. Particularly, the putative coding product of G14 shares high homology with IM (AT4G22260), the plastid terminal oxidase in Arabidopsis [8, 9]. Therefore, G14 was named HvPTOX hereafter. For the G11 gene, two different products, one PPR protein and one DNA gyrase, were predicted in the sense and antisense strand, respectively (Additional Fig. 2). The putative introns of the DNA gyrase gene contain coding exons in the reverse complementary strand for the predicted PRR gene, and vice versa (Additional Fig. 2). The EST match for the putative DNA gyrase was identified (FD525137), but we did not find ESTs for the predicted PPR from the available databases. DNA gyrase or topoisomerase has been linked to regulation of DNA replication and transcription during chloroplast biogenesis [29]. Although lacking EST matches, the predicted PPR protein is highly homologous to SVR7, one of the suppressors of the Arabidopsis var2 mutation [19]. Therefore, members of this functionally related gene cluster were selected for further analysis.
Selection of the Gpa1 candidate
The four blue copper proteins within this region share at least 75% sequence identity, and the coding products of G9 and G12 vary by only one amino acid (aa) substitution. We speculated that these blue copper proteins may function redundantly, and mutation on one gene may not result in apparent phenotype change. Moreover, the AOX encoded by G13 is incomplete, and was eliminated from further analysis. Using Bowman, BW397, Lyallpur and its isogenic mutant GSHO 519, we focused on identifying sequence polymorphism between the gene alleles in G11 and G14 (HvPTOX).
Although a few SNPs were detected between the Bowman and Morex alleles of G11, Bowman, BW327, GSHO 519 and Lyallpur share identical genomic sequences including the coding region (Additional Table 2). This suggested that G11 might not be one of the candidates for Gpa1.
Gene prediction and EST matches (DK626738, DK619131, BF626913, RUS39D06w and HB15J15r) showed that HvPTOX contains 10 coding exons and 9 introns (Fig. 4A). The full-length coding region was successfully amplified in Bowman and Lyallpur, but not in mutants of BW397 and GSHO519 (Fig. 4B, Additional Table 3). We only obtained the 3’ region of the coding sequence in the mutants (Fig. 4B, Additional Table 3). To capture the full length of HvPTOX, we conducted FPNI-PCR to acquire the coding sequence at the 5’ region. Sequencing of the product derived from FPNI-PCR indicated that the first three and almost half of the fourth exon were missing in the mutated allele (Additional Fig. 3). However, the sequence proximal to the fourth exon of HvPTOX in BW397 cannot be aligned to the assembled reference genome, and it is not homologous to any known protein coding sequences or transposable elements.
To determine if the gene mutation was caused by insertion or deletion, we designed more markers at the potential promoter and far-upstream regions (2.5 kb upstream of the start codon) of HvPTOX. Our result indicated that those markers were all missing in mutants (Fig. 4C). In line with the deletion of the promoter, RT-PCR confirmed that HvPTOX was not expressed in mutants (Fig. 4D). Its expression in mutants could not be detected even with 30 PCR cycles, in contrast to the expression abundance indicated by the 25 cycles in wild types (Fig. 4D). Although the size of the deletion was unknown, HvPTOX structure, expression and thus function was totally disrupted in mutants. Therefore, the HvPTOX gene (G4/HORVU.MOREX.r2.2HG0177270) was selected as a strong candidate. The identity of Gpa1 still needs to be further verified in homologous or heterologous systems.