Characterization of targeted endosperm fine tissues at early and middle storage phase
In rice caryopsis development, 7DAF represents the initial storage phase and 12DAF is characterized as the most active storage phase in terms of dry matter accumulation (Ishimaru et al. 2003). At 7DAF, length of caryopsis was already determined (Ishimaru et al. 2003; Fig. 1A), and cellularization of endosperm is almost completed (Ishimaru et al. 2003) with ample water content (Ishimaru et al. 2009; Fig. 1B). At 12DAF, width of caryopsis was already determined (Ishimaru et al. 2003; Fig. 1C), with most zones of the endosperm looking milky white (Fig. 1D) except for the slight transparency at the center zone of the starchy endosperm (Fig. 1E). At 15DAF, the transparent zone spread to the medium layer of the endosperm (Fig. 1F, G, and H). At maturity, the entire zone of endosperm became transparent (Fig. 1I and J).
According to the TEM fine tissues in the starchy endosperm, the center zone was distinguished with the presence of starch granules (CSE, Fig. 2A and E); while the dorsal side (DSE, Fig.2B and F) and lateral side (LSE, Fig. 2C and G) had less starch granules. Instead, AL contained aleurone particles, oil bodies, and mitochondria (Fig. 2H and J). Lipids are specifically localized at the aleurone cell layers (Ishimaru et al. 2015), and globulins are accumulated in those aleurone particles (Ogawa et al. 1979). On the other hand, starchy endosperm accumulates starch and storage proteins (prolamin and glutelin) with temporal and spatial differences. Starch granules were predominantly formed in the central zone of the endosperm tissues at 7DAF, and the accumulation of starch granules is extended in cells of DSE and LSE at 12DAF (Hoshikawa et al. 1968). Protein bodies (PB) I and PBII are predominantly observed in LSE (Fig. 2I), but hardly observed in DSE and CSE (Fig. 2A, B, E, and F).
In the present study, we targeted to identify the differential expression of genes in developing endosperm at 7DAF and 12DAF by isolating fine tissues of AL, CSE, DSE, and LSE with LM (Fig. 3A-D). The dynamic accumulations of storage compounds are spatially and temporally coincident with changes in water distribution in the developing rice endosperm (Horigane et al. 2001; Ishimaru et al. 2009). Overall microscopic observations showed distinct developmental and spatial accumulation patterns of starch granules, storage proteins (PBI, and PBII), and lipids among AL, CSE, DSE, and LSE.
Fine-tuned co-regulons preferentially expressed in dorsal aleurone layers (AL) and distinct zones of starchy endosperm tissues (CSE, DSE and LSE) during the onset of cellularization and peak of storage phase
So far, the genome-wide analysis using mRNA/proteins extracted from whole grains revealed the detailed temporal changes in metabolic pathway (Gao et al. 2013; Xu et al. 2010) and identified seed-specific regulation of transcription factors (Sharma et al. 2012; Xue et al. 2012) during seed development. However, until now the heterogeneous functions of distinct endosperm compartments regulating distinct seed storage events have not been deciphered. The present study further provides the opportunity to derive gene regulatory networks to link the spatial storage processes in the developing rice endosperm. Based on weighted gene correlation network analysis using WGCNA method (Langfelder and Horvath 2008) followed by the topological overlap matrix (TOM) similarity algorithm, we identified 60 modules (M1-M60) (Fig. 4). Of them, 18 modules depicted distinct fine-tuned co-expression patterns (co-regulons) of genes showing preferential or specific expression in distinct zones of fine tissues of starchy endosperm (CSE, DSE, and LSE) and in dorsal aleurone layers (AL) during 7 and 12DAF (Fig. 4).
The heatmap of 18 modules showed the distinct fine-tuned patterns of expression, suggesting that complex spatial and temporal expression patterns exist in developing rice endosperm at early and middle storage phase. The modules M2, M3, M7, M11, M12, M13, M14, M15, M16, M17, and M18 are preferentially or even specifically expressed in starchy endosperm fractions in comparison to AL tissue (Fig. 5A, B, and C). Interestingly, there are finer differences observed within these modules (Table 1). For instance, M2 module preferentially expressed in CSE during 7 DAF and its expression patterns were found in LSE and DSE during 12 DAF (Fig. 5A, the top panel). The module M3 genes are expressed preferentially in LSE and DSE during 12 DAF (Fig. 5A, the second panel from the top). MapMan-based enriched functional categories showed that genes involved in cell organization, cytoskeleton, and microtubulin were enriched in M2 and M3 (Fig. 6A and B), suggesting that those genes have a role at first to complete cellularization in the CSE zone during 7 DAF and in DSE and LSE regions during 12 DAF. Hoshikawa (1967) found that rapid cell elongation in starchy endosperm occurs around the central zone spread to the outer zone of starchy endosperm after the onset of starch accumulation. Expression profile in M2 and M3 depicts fine spatial and temporal regulation in the cell framework for developing endosperm, which supports the histological study of Hoshikawa (1967). Modules M11, M12 and M13 are preferentially expressed in various endosperm fractions during 7 DAF and are least expressed in CSE during 12 DAF (Fig. 5A, bottom three panels). The co-expressed regulons of M11 imply amyloplast development pathway (Fig. 6D), M12 and M13 modules with glycolysis and protein biosynthesis machinery (Fig. 6E and F). The M7 module genes are preferentially expressed in CSE during 7 DAF and 12 DAF and as well in LSE and DSE during 12 DAF (Fig. 5A, the third panel from the top), and enriched for starch metabolism (Fig. 6G-K). The co-expressed modules M14, M15, M16, M17 and M18 are preferentially or even specifically expressed in CSE with subtle differences between those modules (Fig. 5B). Critical gene regulatory network for accumulation of starch and storage protein, but not the accumulation of lipid, is hypothesized to exist in these six modules (M7, M14-18). Although the modules M4, M5, M6, M8, M9, and M10 are preferentially expressed in AL, the individual modules show temporal variation between 7 and 12 DAF (Fig. 5C). These gene regulatory networks which are preferentially expressed in AL are enriched for redox, antioxidants, various transporters and triacylglycerol biosynthesis (Fig. 8A-F). In the following subsections, we infer the in-depth analyses by co-expression gene regulatory network and MapMan-based enriched functional categories of genes in relation to the tissue- and stage-specific storage compound accumulation in each endosperm zone.
Unraveling the potential regulators of starch metabolism
Gene regulatory network showed that three modules (M2, M7 and M12) were preferentially enriched with starch metabolism genes (Fig. 6A, C, and E). Genes co-expressed in module M2 includes starch metabolism genes (invertase, hexokinase, phosphoglucomutase and ADP-glucose pyropohosphorylase (AGPase) large subunit, cell division and cytoskeleton organization, protein folding (HSP70, HSP20), G-proteins (Rab11C, OsRac3, RGP2, RAB5B) and transcription factors (bHLH, Jumonji, homeobox, S1FA, C2C2(Zn) GATA), which are preferentially expressed in all tissue types of endosperm during 7 and 12 DAF (except CSE at 12DAF) (Supplementary Table S3). Genes co-expressed in M12 includes starch metabolism genes (hexokinase, monosaccharide transporters, phosphoglucomutase and AGPase small subunit, isoamylase, alpha 1,4-glucan phosphorylase, trehalose-phosphate/synthase), cell division and cytoskeleton organization, protein synthesis, protein targeting, protein degradation (26S ubiquitin proteasome, E2 ubiquitin, alanine protease, aspartine protease, serine protease), protein folding (chaperonins of 10KDa, 20KDa, 60KDa, HSP70, HSP80, DnaJ), G-proteins, calmodulins and transcription factors (MYB, bZIP, Zn-finger(CCHC)), which are preferentially expressed in all tissue types of endosperm during 7 and 12 DAF (except CSE at 12DAF) (Supplementary Table S3). The M7 co-regulons encode sucrose synthase 1 (SUS1 and SUS2) and starch synthase 1 (SSI), glucose 6-phosphate (G6P) translocator from starch metabolism, depict distinct expression patterns with peak of expression in CSE during 7 DAF and expanding its spatial expression in LSE and DSE at 12 DAF. The starch accumulation is coincided with expression of two rate limiting genes such as sucrose cleavage genes (SUS1 and SUS2) which produces UDP-glucose and AGPase large and small subunits involved in producing ADP-glucose, a precursor for the production of amylose and amylopectin chains in CSE during 7 DAF and in DSE and LSE during 12 DAF. In addition, we noticed expression of G6P translocator and ADP-glucose translocator in the starchy endosperm, suggesting that both pathways of cytosolic and plastidic production of ADP-glucose are possible in the heterogeneous endosperm tissue (Toyota et al. 2006). According to the TEM observation, active accumulation of starch was almost completed in CSE at 12DAF (Fig. 2F). The expression pattern of genes for starch metabolism in M2, M7 and M12 was spatially and temporally coincident with starch accumulation in the endosperm.
Toc34 and OEP75 encode a translocon that facilitates the translocation of polypeptide from cytosol to plastid at outer envelope membrane (Andrès et al. 2010). Genes encoding chloroplastic outer envelope membrane protein (OEP75) (LOC_Os03g16440) and chloroplast protein import component Toc34 family protein (LOC_Os03g13730) were categorized into M7 (Supplementary Figure S1). A plastid-localized heat shock protein (LOC_Os12g14070) in M7 regulates the protein import into amyloplast associated with Tic complex (Andrès et al. 2010), thereby mutant lacking this gene shows the floury (chalky) phenotype with aberrant amyloplast development (Zhu et al. 2018; Tabassum et al. 2020). Regulation of starch accumulation by molecular chaperones in coordination with the Tic complex is noted in SE. Interestingly, a number of molecular chaperones HSP involved in protein folding were identified in four modules M2, M7, M12, and M13 (Fig. 6A, C, E, and F; Supplementary Figure S2). The effect of other heat shock proteins categorized into M2, M7, M12, and M13 on the phenotype in starchy endosperm is still elusive.
Co-expression gene regulatory network analysis showed that M14-18, which are characterized as high expression in CSE at 12DAF (Fig. 5B), involves lipid metabolism (Fig. 7A-E). Cells of CSE were almost occupied by starch granules at 12DAF (Fig. 2F), but not by lipids (Ishimaru et al. 2015). In fact, specific expression of CDP-alcohol phosphatidyltransferase (LOC_Os03g17520), glycerophosphoryl diester phosphodiesterase (LOC_Os02g37590), glycosyl transferase (LOC_Os03g15840), acyl-CoA synthetase (LOC_Os01g48910), MGDG synthase type A (LOC_Os02g55910), phosphatidate cytidylyltransferase (LOC_Os01g55360), coclaurine N-methyltransferase (LOC_Os06g37610), which are involved in glycerolipid metabolism, phospholipid biosynthesis, glycerophospholipid metabolism, was observed in CSE at 12DAF (Supplementary Figure S3). Genes for lipid metabolism showing specific expression in CSE at 12DAF (Supplementary Figure S3) seemed to be quite different from those for lipid metabolism in AL (Supplementary Figure S4). Recent genetic study with floury shrunken endosperm1 (fse1) revealed the involvement of phospholipid metabolism in amyloplast development in rice (Long et al. 2018). The fse1 mutant results in white-cored type of mild chalk phenotype (Long et al. 2018), suggesting a role of phospholipid biosynthesis in the amyloplast development particularly in the center zone of starchy endosperm (CSE). Zhou et al. (2016) proposed a model that enhancement of lipid metabolism and amylose biosynthesis pathway in the developing endosperm is essential to increase the resistant starch (RS) type five content. RS type five consists of amylose-lipid complexes (Raigond et al. 2015) contributes to glucose homeostasis, which is beneficial to counter type II diabetes because of the restriction of starch granules to swelling during cooking and thus contributes to slower digestion. RS content is positively correlated with amylose content in the rice grains (Yang et al. 2006). Our LM-based expression analysis revealed that maintained high expression of GBSSI (LOC_Os06g04200), which is responsible for amylose biosynthesis (Itoh et al. 2003), in CSE at 12DAF (Supplementary Figure S5). Amylose content is highest in CSE zone, followed by medium levels in DSE and LSE zones, and lowest in peripheral zones (AL) in the rice endosperm (Itani et al. 2002). Simultaneous high expression level of GBSSI (Supplementary Figure S5) and specific lipid metabolism (Supplementary Figure S3) occurred in CSE at the middle storage phase. The present LM-based expression data in CSE at 12DAF provided clues to elucidate the genes which are involved in the resistant starch biosynthetic pathway in rice.
Starch accumulation patterns in endosperm precedes non-degenerative programmed cell death (PCD) under desiccation tolerant mechanism
Kobayashi et al. (2013) demonstrated that PCD initiated in the CSE zone at middle storage phase, then spread to the peripheral zone of starchy endosperm. We hypothesized that our LM-based expression analysis could reveal the genes that played a critical role in the onset of PCD in the CSE zone at the middle storage phase. MapMan-based enriched functional categories of genes indicated the ubiquitin-dependent protein degradation in M14, M15, M16, M17, and M18 preferentially upregulated in CSE at 12DAF (Fig. 7A-E). PCD is largely used to describe the process of apoptosis and autophagy (Reape et al. 2008), and these genes are preferentially expressed in modules M14-18. Autophagy protein 8 (LOC_Os04g53240) and Autophagy protein Apg9 family protein (LOC_Os03g14380) were found in M15 and M17, respectively (Supplementary Figure S6). Furthermore, autophagy-related proteins, a UBA/THIF-type NAD/FAD binding fold domain containing protein (LOC_Os01g42850) and two WD40-like domain containing protein (LOC_Os01g57720 and LOC_Os05g33610) were found to be expressed in M17 and M18, respectively with very high expression only in CSE at 12DAF (Supplementary Figure S6). These results indicate a possibility that PCD through autophagy would occur in CSE at the middle stage. Our LM-based tissue specific expression analysis supports PCD events histologically demonstrated by Kobayashi et al. (2013). The CSE zone started transparency (Fig. 1D and E) after the completion of starch accumulation there at middle storage phase (Fig. 2E), meaning the drastic reduction in water content in CSE zone during the middle storage phase (Horigane et al. 2001; Ishimaru et al. 2009). It could be hypothesized that the CSE zone has a specific biological strategy to adapt the dehydration process toward maturation.
The genes involved in abiotic stress response were preferentially enriched in modules M14-M18. These includes, transcription factors, C-myb-like transcription factor (LOC_Os01g62410, designated as OsMYB3R-2, Dai et al. 2007), MADS27 transcription factor (LOC_Os02g36924, designated as OsMADS27, Chen et al. 2018) (Supplementary Figure S7) are suggested to regulate abiotic stress tolerance in rice plants. We found heat shock factor protein 3 (LOC_Os09g35790, HSF 3) and heat stress transcription factor Spl7 (LOC_Os04g48030) as potential candidates for abiotic stress tolerance transcription factor (Supplementary Figure S7). Specific expression of calcium-dependent protein kinase (LOC_Os03g03660, OsCPK7 or OsCDPK11), two calcium-binding EF-hand domain containing proteins (LOC_Os05g31620 and LOC_Os09g24580), Calmodulin 2/3/5 (LOC_Os01g72100) in M18 were observed in CSE at 12DAF (Supplementary Figure S8). MapMan-based enriched functional categories of genes indicated the critical involvement of calcium-mediated signaling in M17 (Fig. 7D) with specific expression of two calcium-dependent protein kinase (LOC_Os03g03660 and LOC_Os04g49510) and calcium-binding EF-hand domain containing protein (LOC_Os01g72080) in CSE at 12DAF (Supplementary Figure S8). These results suggest the calcium-dependent signal transduction pathway has an important role in CSE at 12DAF. Innermost layers corresponding to the CSE zone contained 45% calcium, when compared to the entire rice grains, thereby it is suggested that calcium is essential for cell development in the inner zone of endosperm during grain filling (Itani et al. 2002). We postulate that calcium-calmodulin-dependent protein kinase cascade has a crucial role in signal transduction during the dehydration process that started in CSE at middle storage phase. Very specifically, two carbonic anhydrase (CA) (LOC_Os01g45274 and LOC_Os09g28150 in M18), an enzyme which has main role in supplying CO2 for carbon fixation by Rubisco by catalyzing the reversible hydration of CO2 in leaves, were specifically expressed in CSE at 12DAF (Supplementary Figure S9). LOC_Os09g28150 is the α-type CA, whereas LOC_Os01g45274 is the β (chloroplast)-type CA. In the leaves of rapeseeds, Wang et al. (2016) suggested the involvement of phosphorylation of β-type CA for adaptation to drought stress. It is likely that β-type carbonic anhydrase may play an essential role in adaptation to the dehydration process in the amyloplast after completion of starch accumulation in CSE. Involvement of α-type carbonic anhydrase in dehydration adaptation remains an open question.
Co-expression gene regulatory network analysis indicated the involvement of plant hormones such as auxin (M15, M16, M17, and M18), abscisic acid (M18), ethylene (M16 and M17) (Fig. 7A-E). Those plant hormones may influence the abiotic stress tolerance during cellular dehydration process in CSE after the middle stage, possibly in coordination with transcription factors and genes related to stress tolerance described above. On the other hand, CSE predominantly accumulates starch (Fig. 2E). CSE is the first endosperm tissue to start dehydration (Fig. 1D and E) while other SE tissues are in the peak of starch accumulation. An abiotic stress tolerance mechanism in CSE at middle stage would contribute to preserving the starch granules as a reserve for the energy source at germination of the next generation.
Lateral starchy endosperm (LSE) gene regulatory network infers ongoing storage protein accumulation
Predominant localization of PBI (prolamin) and PBII (glutelin) in the LSE zone indicated large amounts of storage proteins there, according to the TEM observation (Fig. 2C, G, and I). In spite of such predominant localization of PBI and PBII in LSE zone, only two genes for glutelin were found in M11 and M15 (Supplementary Figure S10), whereas no gene for prolamin was categorized into any module (Supplementary Table S3). These results suggest that predominant localization of PBI and PBII in the LSE zone is determined by other regulators. Genes categorized into M11 and M13 showed that expression was almost absent in CSE at 12DAF although it was high in all endosperm tissues at 7DAF (Fig. 4; Table 1), suggesting that M11 and M13 would contain the important genes for protein synthesis and/or initial amyloplast development. Co-expression analysis of gene regulatory networks revealed that a transcriptional factor, Myb, DNA-binding domain containing protein (LOC_Os02g02370) was categorized into M11 (Fig. 6D) with Glu-B5 (LOC_Os02g14600). Glu-B5 mutant, W3660, drastically reduces glutelin content in the rice grain (Wang et al. 2005). M11 may possess the genes that are involved in synthesis of glutelin. Co-expression analysis of gene regulatory networks in M13 suggests the involvement of GRP94, plant hormones such as gibberellin, ethylene, and auxin (Fig. 6F). GRP94 (LOC_Os06g50300) is a molecular chaperone localized in endoplasmic reticulum (ER). Ishimaru et al. (2019) reported that expression of the genes for ER-localized molecular chaperones is thermo-sensitive, suggesting the regulation of prolamin content and initial amyloplast development in rice grain. Expression pattern of GRP94 showed that expression level was preferentially high in starchy endosperm (CSE, LSE, DSE) at 7DAF, then expression level maintained high at LSE at 12DAF (Supplementary Figure S10). In addition, M13 possesses the glutelin A-1 storage protein (LOC_Os02g25860), which built diverse network to the neighboring genes including gibberellin-induced MYB transcriptional factor (LOC_Os03g38210) (Fig. 6F; Supplementary Table S5). Co-regulons of M13 module encodes the expression of various sucrose cleaving enzymes (invertases and sucrose synthase), trigger of glycolysis, mitochondrial electron transport, TCA and amino acid metabolic pathways (Fig. 6F). There also found two gibberellin-regulated 60S ribosomal protein L9 (LOC_Os09g31180 and LOC_Os02g01340) in M13 (Fig. 6F). M13 may possess the genes that are involved in synthesis of prolamin and glutelin as well as the genes that are involved in starch accumulation. Whether gibberelin influences the protein synthesis and/or initial amyloplast development through the ribosomal protein needs to be further investigated.
Aleurone-specific gene regulatory networks characterized to be involved in oil deposition, assimilate transport, stress tolerant and defense mechanism, TCA cycle and reactive oxygen species scavenging
Dorsal aleurone layers (AL) contain large amounts of lipids as oil bodies (Fig. 2H, J; Ishimaru et al. 2015). AL plays a specific role in uptake of sucrose and amino acids from maternal tissues into developing endosperm until the late storage phase (Opakra et al. 1981). Cell morphology in AL was quite different from that in other starchy endosperm tissues, especially in terms of storage compounds (Fig. 2A-J). Elucidating the molecular mechanisms in the development of AL will be useful to understand assimilate transport, positional cell fate and morphogenesis, and oil deposition. Co-expression analysis showed that six modules (M4, M5, M6, M8, M9, and M10) possessed the AL-preferential expression (Fig. 5C). Co-expression analysis of gene regulatory networks indicated the involvement of genes for lipid metabolism in all six modules (Fig. 8A-F), suggesting that those genes are involved in the biological cellular event in AL for oil deposition. Evidence suggests that specific expression of NADH-dependent enoyl-ACP reductase (LOC_Os08g23810 in M5), Acetyl-coenzyme A carboxylase (LOC_Os05g22940 in M6), dihydrolipoamide S-acetyltransferase (LOC_Os12g08170 in M5; LOC_Os08g33440 in M8), Pyruvate dehydrogenase E1 beta subunit (LOC_Os03g44300 in M9), Pyruvate kinase isozyme G (LOC_Os01g47080 in M9), Long-chain-fatty-acid--CoA ligase 4 (LOC_Os11g06880 in M9), etc. in AL reinforces the participation of those genes in fatty acid biosynthesis and chain elongation in AL (Supplementary Figure S4). Co-expression analysis of gene regulatory networks and MapMan-based enriched functional categories of genes also indicated the involvement of genes for TCA cycle in M4, M5, M6, M8, and M9 (Fig. 8A-E). Ishimaru et al. (2015) previously demonstrated with LM that AL predominantly expresses the genes for TCA cycle and oxidative phosphorylation in the presence of oxygen and large numbers of mitochondria at early storage phase. Expression of genes for TCA cycle in M4, M5, M6, M8, and M9 (Fig. 8A-E) contributes to maintaining the redox status in AL to supply ATP for oil deposition through aerobic respiration during ripening.
MapMan-based enriched functional categories of genes indicated the unspecified development in M4 (Fig. 8A) with preferential expression of genes in AL at 12DAF (Fig. 5C, the top panel), suggesting that genes categorized into M4 have a particular role(s) in maintaining homeostasis at the middle storage phase. Co-expression analysis of gene regulatory networks indicated the genes for globulin were linked with the number of neighboring genes in M4 (Fig. 8A) as well as M9 and M10 (Fig. 8E and F). The genes are considered to be involved in the synthesis of 7S globulin, which is specifically accumulated in aleurone cells (Shewry, 1995). In soybean and wheat, it is known that 7S globulin is not a storage protein, but a multi-functional protein with abiotic stress responses (Hirano et al. 1992; Omi et al. 1996) and endoxylanase inhibitor for the defense mechanism from pathogenic bacteria and fungi (Fierens et al. 2003; Yoshizawa et al. 2011). AL accumulates 7S globulin possibly for the abiotic stress response and defense mechanism to pathogens in rice. Dehydrin and late embryogenesis abundant (LEA) II protein, whose genes were categorized into M10 in gene regulatory network (Fig. 8F), also confer the desiccation tolerance to seeds during maturation in cereals (See review, Kosava et al. 2014). M4, M9, and M10 possess many AL-specific transcription factors related to abiotic stress responses and pathogen defense (Supplementary Figure S11). A transcription factor, MADS-box domain containing protein (LOC_Os08g41950, designated as OsMADS7, Zhang et al. 2018), a heat shock transcription factor 29 (LOC_Os01g53220, designated as OsHsfC1b, Schmidt et al. 2012), a transcription factor MYBS3 (LOC_Os05g37060, designated as MID1, Guo et al. 2016), and a heat shock transcription factor 31 (LOC_Os01g39020, designated as OsHsfA7, Liu et al. 2013), are suggested to regulate abiotic stress tolerance in rice plants. On the other hands, WRKY transcription factor 71 (LOC_Os02g08440, designated as OsWRKY71, Liu et al. 2007), WRKY transcription factor 51 (LOC_Os04g21950, designated as OsWRKY51, Hwang et al. 2016), and Transcription factor jumonji, JmjN domain containing protein (LOC_Os01g67970, designated as JMJ705, Li et al. 2013) are suggested to regulate pathogen defense signaling pathway in rice plants in crosstalk with plant hormones. A set of different genes specifically expressed in AL suggests that AL may have a different stress adaptation strategy including desiccation tolerance from CSE during maturation.
In the gene regulatory network, M4, M8, M9, and M10 possessed various plant hormones such as abscisic acid, ethylene, auxin (Fig. 8A, D, E, and F), and ethylene-responsive transcription factor-like protein, AP2/EREBP, suggesting the relation of plant hormones in induction of genes involved in the stress tolerance and defense mechanism in AL. Among AP2/EREBP genes, the AP2 domain containing protein RAP2.6 (LOC_Os03g08470) was specifically expressed in AL at 12DAF in M4 (Supplementary Figure S12). Recently, Xiong et al. (2018) reported that alleles of LOC_Os03g08470, designated as OsLG3, from upland rice confers the drought tolerance in rice by inducing reactive oxygen species scavenging. According to MapMan-based enriched functional categories, genes involved in redox, ascorbate-glutathione cycle, detoxifies H2O2, were prominent in module M5 (Fig. 8B) and M6 (Fig. 8C). AL functions as a main route of assimilates into developing endosperm (Opakra et al. 1981), thereby AL must be active until the late storage phase (Fig. 1G and H) by maintaining moisture content at outer layers of endosperm (Horigane et al. 2001; Ishimaru et al. 2009). There is a possibility that an AP2/EREBP gene, OsLG3, activates the genes involved in redox, ascorbate-glutathione cycle, contributing to the maintenance of cell activity and function of AL until maturation. It should be noted, however, AP2/EREBP protein plays diverse roles in biological cellular processes. In the developing endosperm of barley, AP2/EREBP gene is highly expressed in developing endosperm transfer cells (Thiel et al. 2008), which corresponds to AL in the developing rice endosperm of this study. Because of influences of AP2/EREBP protein on seed size, grain weight, and storage compound accumulation in Arabidopsis (Jofuku et al. 2005; Ohto et al. 2005), Thiel et al. (2008) speculated that AP2/EREBP that is highly expressed in endosperm transfer cells has similar effect on barley seed development as demonstrated by Arabidopsis. Whether AP2/EREBP family proteins categorized into M4, M9, and M10 (Supplementary Table S3) have a similar influence on seed characters in rice as demonstrated by Arabidopsis seeds or not need to be further investigated.
Arabidopsis leafy cotyledon 1 (LEC1) is an embryo-specific NUCLEAR FACTOE Y (NF-Y) transcription factor that regulates embryogenesis (Lothan et al. 1998). In rice, OsNF-YB1 (LOC_Os02g49410) homologous to LEC1 is specifically expressed in aleurone cells of developing endosperm (Sun et al. 2014) and regulates the grain filling by controlling a sucrose transporter, OsSUT1 (LOC_Os03g07480, Hirose et al. 1997) interacting with an ERF transcription factor Os#ERF115 (LOC_Os08g41030) (Bai et al. 2016; Xu et al. 2016). Our LM-based expression analysis also confirmed the aleurone-specific expression of OsNF-YB1, OsERF115, and OsSUT1 in M8, M9, and M8, respectively (Supplementary Figure S13). Other transporters such as monosaccharide, amino acid, ion, inorganic transporters were found in M4, M9, and M10 (Supplementary Figure S13), supporting the essential role of AL for unloading of nutrients into developing endosperm. Transcription factors in four modules (M4, M8, M9, M10), which showed specific or preferential expression in AL (Fig. 5C), would structure the cascade with aleurone cell morphogenesis and transport-related genes during grain filling.