Unveiling eccDNA Dynamics in Rice: Insights into Adaptation to Nutritional Stress

doi:10.21203/rs.3.rs-4803624/v1

Download PDF

Article

Unveiling eccDNA Dynamics in Rice: Insights into Adaptation to Nutritional Stress

https://doi.org/10.21203/rs.3.rs-4803624/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Extrachromosomal circular DNAs (eccDNAs) have been identified in various eukaryotic organisms and play a crucial role in genomic plasticity. However, in crop plants, the role of eccDNAs in responses to environmental cues, particularly nutritional stresses, remains unexplored. Rice (Oryza sativa ssp. japonica), a vital crop for more than half the world's population and an excellent plant model for genomic studies, faces numerous environmental challenges during growth. Therefore, we conducted comprehensive studies investigating the distribution, sequence information, and potential responses of rice eccDNAs to nutritional stresses. We described the landscape of rice eccDNAs during optimal growth phase change and identified the specific induction on gene-overlapped eccDNAs (ecGenes), Transposable Element-overlapped eccDNAs (ecTEs), and the full-length repeat units-overlapped eccDNAs (full-length ecRepeatUnits) in response to nitrogen (N) and phosphorus (P) deficiency. Furthermore, we analyzed multiple-fragment eccDNAs and proposed a TE-mediated homologous recombination mechanism as the origin of rice multiple-fragment eccDNAs. Our studies provide direct evidence of the role of eccDNAs in rice genome plasticity under nutritional stresses and highlight the significance of their abundance and specificity.

Biological sciences/Plant sciences/Plant stress responses/Abiotic

Biological sciences/Plant sciences/Plant molecular biology

Exploring genomic plasticity has led to identifying extrachromosomal circular DNAs (eccDNAs) across a wide range of eukaryotes¹. Using light microscopy, large extrachromosomal circular molecules known as Double Minutes (DMs) were initially observed in mammalian cells and higher plants². These observations supported Franklin Stahl’s hypothesis that DNAs might be organized as circular molecules within chromosomes in higher organisms³. Subsequent analyses revealed that DMs carried chromatin bodies with chromosomal homologous sequences of megabase length but lacking telomeres and centromeres, linking them to oncogene amplification in tumors⁴. Further research highlighted the presence of eccDNAs in diverse eukaryotic cells, including yeast (Saccharomyces cerevisiae) and higher plants^5,6,7. Moreover, the chromosomal topology rearrangements caused by the formation of large eccDNAs can co-amplify enhancer elements and over-express oncogenes^8,9. Smaller eccDNAs can produce mature regulatory miRNA and modulate gene expression¹⁰. Additionally, eccDNAs have been shown to possess innate immunostimulatory activity, primarily due to their circularity, independent of their sequence content¹¹.

In yeast, eccDNAs harboring functional genes enhance environmental adaptability and drive genomic evolution. For instance, the copy number of the CUP1 gene, involved in copper resistance, increases via eccDNAs formation in response to environmental stress¹². While limiting nitrogen (N) supply enhances the expression levels of the yeast GAP1 gene by eccDNAs amplification (GAP1^circles), facilitating the transport of amino acids across the plasma membrane¹³. GAP1^circle are produced via Homologous Recombination (HR) between two Long Terminal Repeat (LTR) elements surrounding the chromosomal GAP1 gene, causing deletions and supporting the role of HR in eccDNA formation¹³. These studies highlight the crucial role of eccDNAs in environmental stresses responses in eukaryotic cells.

The presence of eccDNAs has been reported in several plant species¹⁴, including Arabidopsis (Arabidopsis thaliana)¹⁵, sugarcane (Saccharum officinarum)¹⁶, and rice (Oryza sativa)¹⁷. A notable example of the functional role of eccDNAs in plants is glyphosate resistance¹⁸ in the noxious weed Amaranthus palmeri, which is due to eccDNA increased copy number of the gene encoding the target enzyme of this herbicide, 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS)¹⁹. This eccDNA replicon, carrying EPSPS and 58 other genes, contributes to widespread glyphosate resistance in A. palmeri across America^20,21. Additionally, bioinformatic analysis of eccDNAs in cold-tolerant potato cultivars identified valuable markers for molecular breeding²². These findings suggest that eccDNAs in plants play a crucial role in rapid adaptation and evolution, highlighting their potential impact on plant biology and agricultural practices.

Analysis of eccDNA sequences can also aid in identifying active Long Terminal Repeats (LTRs) in different plant tissues and enhance the understanding of TE-driven evolution and genetic adaptation^22,23. High TE-derived eccDNA loads in Arabidopsis ddm1 mutants suggested a role of eccDNAs in genome instability by altering DNA repair pathways²³. Recently, dynamic analysis in rice also revealed important information on eccDNA origin¹⁷. Although increasing data on the functional role of eccDNAs on adaptive mechanisms and genome evolution has been recently reported, the role of eccDNAs in plant adaptation to environmental pressures, including nutritional stresses, remains largely unexplored⁷. To address this gap, we investigated the landscape of both single-fragment and multiple-fragment eccDNAs in rice (Oryza sativa) plants subjected to nitrogen (N) and phosphorus (P) stresses. We examined the differences in the sequence and abundance of genes and TEs in rice eccDNAs during optimal growth and in response to N and P stresses. Our results demonstrate that eccDNAs respond to fluctuations in nutritional supply and contribute to rice genomic plasticity.

eccDNAs are derived from diverse region across the rice genome

Rice (Oryza sativa ssp. japonica cv. Nipponbare) shoot tissues were collected from eight treatments with two biological replicates each to isolate eccDNAs. Optimal nutrient growth stages were set for 1 (Ctrl_D1), 3 (Ctrl_D3), 7 (Ctrl_D7), and 14 days (Ctrl_D14). Low N treatments were set for 3 (LN_D3) and 7 days (LN_D7), and low P treatments for 7 (LP_D7) and 14 days (LP_D14) (Supplementary Fig. 1a-b; see METHODS). After removing contaminant chromosomal DNA, extracted eccDNAs were purified and amplified by random Rolling Circle Amplification (rRCA) to generate high-quality libraries for Oxford Nanopore long-read sequencing (Supplementary Fig. 1c, see METHODS). Arbitrarily primed-alike PCR (apLike PCR) to confirm the total removal of linear DNA digestion and EcoR1 digestion of rRCA products to confirm the isolation and amplification of eccDNAs (Supplementary Fig. 2a-b). Samples of circular DNA lacking linear DNA were used forOxford Nanopore sequencing libraries. Nanopore sequencing data was quality controlled (Supplementary Data 1) and analyzed as described in the METHODS section and Supplementary Fig. 3a. After evaluating the bioinformatic tools CIDER-Seq2²⁴, ECCsplorer²⁵, ecc_caller²⁶ and ecc_finder²⁷ for eccDNA detection, we found that ecc_finder provides the best bioinformatic framework for the precise identification of eccDNAs from long-read Nanopore sequencing data. Long reads facilitate eccDNA characterization compared to Illumina short-read sequencing, which would need substantial improvement to identify eccDNAs from repeated loci of large genomes (Supplementary Fig. 3b).

Analysis with ecc_finder showed that the size distribution eccDNAs ranged from 200 bp to 37 kbp, with all eccDNAs having a minimum size of 200 bp due to ecc_finder's identification criteria (Fig. 1a; Supplementary Fig. 3b). Although a few eccDNAs were over 37 kb, the average size across all the treatments was approximately 500 bp. This size distribution was similar to that observed in other organisms, such as human cells¹¹, Arabidopsis¹⁵, and the short-read sequencing analysis from rice¹⁷ (Supplementary Fig. 4a). It is worth mentioning that there were no significant statistical differences in size distribution between the treatments (Supplementary Fig. 4b). Based on chromosomal distribution (Fig. 1b), eccDNAs were widely distributed across the rice genome, with the highest eccDNAs density in pericentromeric and centromeric regions (Fig. 1c). Furthermore, these high-density eccDNA regions also overlap with areas of high transposon elements (TEs) and DNA methylation density.

To assess the location of eccDNAs in the genome, we merged eccDNAs with overlaps of over 200 bp from each sample and assigned them a single ID number (Supplementary Fig. 5a). From the initial 140619 eccDNAs, we obtained 96757 merged eccDNAs with unique IDs (Supplementary Fig. 5b). This ID-eccDNAs were located into different genomic features, namely genes (5'UTR, CDS, 3'UTR), 2 kb upstream of the genes (up2kb), 2 kb downstream of the genes (down2kb) and intergenic regions. 30% of ID-eccDNAs overlapped gene regions (ecGenes) (Fig. 1d), of which 267 covered full-length genes. 15.8% ID-eccDNAs mapped to up2kb, 8.8% to down2kb, and 45.1% to intergenic regions. A more detailed analysis of genes showed that 40.1% overlapped to CDS regions (Fig. 1g), 18.9% to 3'UTRs, and 8% to 5'UTRs.

When mapping to transposon elements or non-coding RNAs, 51.2% ID-eccDNAs mapped to TEs (ecTEs) (Fig. 1e) and 40.3% to non-coding RNAs (ecNon-codingRNAs) (Fig. 1f). Although most eccDNAs mapped to single genomic features, 3504 originated from regions encompassing genes, TEs, and non-coding RNAs (Fig. 1h) and 5668 to regions covering up2kb, TEs, and non-coding RNAs (Fig. 1i). Additionally, we identified 2843 eccDNAs covering a region spanning from the down2kb, TEs and non-coding RNA regions (Fig. 1j).

All the data in Fig. 1 demonstrate that the rice eccDNAs landscape is widely spread across the rice genome, with ecTEs mainly originating from centromeric regions.

Gene-overlapped eccDNAs show dynamic changes during rice growth

To explore the dynamic changes of eccDNA production during rice growth, we investigated gene-overlapped eccDNAs (ecGenes) at three stages of rice development in optimal growth conditions: short-term (Ctrl_D1 vs. Ctrl_D3), mid-term (Ctrl_D1 vs. Ctrl_D7) and long-term (Ctrl_D1 vs. Ctrl_D14). After performing GMPR normalization and EMDomics differential analysis, we identified ecGenes as differentially enriched if their q-value < 0.05 (see METHODS). We further classified differential ecGenes as exclusive if they were present in both biological replicates of one treatment and absent in the two replicates of the other and as differential if the number of reads for one ecGene was higher in one treatment than the other. We classified differential and exclusive ecGenes based on analysis of Gene Ontology (GO) enrichment categories and summarized using REVIGO (see METHODS). When comparing GO results of differential + exclusive ecGenes with exclusive ecGenes alone, we found no substantial differences in the total counts or the GO enrichment categories (Supplementary Fig. 6; Supplementary Data 2), reflecting that exclusive ecGenes were the most statistically significant.

In the short-term optimal growth stage, we identified 863 exclusive ecGenes, of which 230 were unique to Ctrl_D1 and 633 to Ctrl_D3 (Fig. 2a). We performed GO enrichment of exclusive ecGenes from both samples to capture eccDNAs with positive and negative effects on specific processes. GO enrichment analysis on these 863 ecGenes (Fig. 2d) revealed enriched categories included reproductive system development (2.86-fold) and developmental process involved in reproduction (2.83-fold), as well as categories associated with metabolic processes, among which N compound metabolic process was significantly represented with 177 enriched ecGenes (Supplementary Data 3.2). Notably, there is significant enrichment in the regulation of short-day photoperiodism/flowering category, exhibiting the highest fold enrichment value of 20.19 (Supplementary Data 3.1). This category includes ecGenes mapping to OsCO3, OsMADS51, and SDG718, all of which play important roles in regulating short-day flowering in rice^28,29,30.

In the mid-term optimal growth phase, we identified 331 exclusive ecGenes, with 210 unique to Ctrl_D1 and 121 unique to Ctrl_D7 (Fig. 2b). GO enrichment analysis of these 331 ecGenes revealed four enriched categories (Fig. 2d). Cellular process category, was observed in both the short-term and mid-term growth phases. The lysyl-tRNA aminoacylation category had the highest fold enrichment value (> 100), which may indicate the role of eccDNAs in regulating the mechanisms of transferring activated amino acids to the 3'-OH group of lysine-accepting tRNA in rice (Supplementary Data 4). It is worth mentioning that unlike the developmental pathways enriched in the short-term growth phase, the flower development category was uniquely identified in the mid-term phase, suggesting that ecGenes dynamics respond to specific developmental processes along initial growth.

In the long-term optimal growth stage, we identified 404 exclusive ecGenes, with 182 unique to Ctrl_D1 and 222 to Ctrl_D14 (Fig. 2c). GO enrichment analysis of these 404 exclusive ecGenes (Fig. 2d) revealed that the N compound metabolic process category was enriched at this stage. However, it had fewer enriched ecGenes (81) than in the short-term growth stage (177). The DNA damage response pathway had the highest fold enrichment value (4.44), suggesting a potential link between the mechanisms by which eccDNAs originate and DNA damage during long-term growth phase³¹.

The number of exclusive ecGenes in different growth stages showed a significant fluctuation, decreasing sharply from the short-term to mid-term growth stage but then increasing from the mid-term to the long-term (Fig. 2a-c). Compared to the short-term growth stage, the number of significantly enriched pathways was less in the mid-term and long-term growth. However, specific pathways showed a higher fold enrichment value in the longer growth phases. Among the GO enrichment on cellular components, the cellular anatomical entity category was identified in all growth stages with similar fold enrichment values (1.31 in short-term growth, 1.36 in both mid-term and long-term growth) (Supplementary Fig. 7b). Moreover, in the long-term growth stage, transferase complex was observed as the category with the highest fold enrichment (3). Thus, our GO results showed that ecGenes are enriched specifically in different growth stages, suggesting a role of ecGenes on rice development.

We validated one ecGene detected in all growth stages by inverse PCR and SANGER sequencing (SANGERseq) in both 7-day and 14-day control samples (Fig. 2f-g). This ecGene originated from the Os06g0651100 locus region and covered most of the first intron (Fig. 2e).

Gene-overlapped eccDNAs respond to N stress

N deficiency is the most prevalent environmental stress affecting plants in natural ecosystems³². According to our previous research³³, N concentration in shoot tissue usually starts to decrease rapidly on the third day after LN treatment, and a stronger transcriptional response to N starvation is activated after seven days of treatment. To investigate the role of eccDNAs under N limitation, we conducted low nitrogen (LN) treatments on rice plants in a short treatment of 3 days (Ctrl_D3 vs. LN_D3) and a long-term of 7 days (Ctrl_D7 vs. LN_D7). As for the analysis of developmental stages, no statistical differences in GO-enriched categories were found when differential eccDNAS were added to exclusive ecGenes for LN treatments (Supplementary Fig. 8; Supplementary Data 5), suggesting that de novo formation rather than an increase in the abundance of ecGenes differentiate growth stages or responses to stress treatment.

During short-term LN treatment, we identified 2424 exclusive ecGenes, among which 639 were only detected in Ctrl_D3, and 1785 were specific to LN_D3 (Fig. 3a). GO enrichment analysis of these ecGenes showed that N compounds metabolic process category was significantly enriched with 443 ecGenes (Supplementary Data 6.1; Fig. 3c). When compared to the same category during short-term optimal growth, the count of enriched ecGenes increased from 177 in optimal growth to 443 in LN treatment. Furthermore, among these 443 enriched ecGenes, we highlight several with known functions related specifically to N stress (Supplementary Data 6.1). Interestingly, we also identified the N compound transport category (Supplementary Data 6.2). These GO enrichment results indicate that in response to LN stress, ecGenes abundance shifted from a potential developmental role to a more specific N starvation response. In addition to pathways related to N stress, we also found enriched categories in the phosphate-containing compound metabolic process, which is related to phosphorus metabolism and “response to stimulus” and “response to stress”, which suggested a relationship between ecGenes and a general response to abiotic stress.

In the long-term LN treatment, we identified 1033 exclusive ecGenes, among which 101 were specific to Ctrl_D7, and 932 for LN_D7 (Fig. 3b). GO enrichment analysis of these 1033 exclusive ecGenes showed that N compound transport category was still enriched (Fig. 3d; Supplementary Data 7), suggesting the roles of ecGenes in the long-term N stress. Interestingly, the chloroplast protein-transporting ATPase activity category was extremely enriched in molecular function with the highest fold enrichment value of 45.34 (Supplementary Fig. 9a). This indicated that ecGenes were involved in chloroplast-related molecular functions to respond to the relative long-term N stress.

Compared to the short-term LN treatment, the number of exclusive ecGenes in long-term samples decreased significantly. However, the proportion of exclusive ecGenes detected only in LN_D7 (932/1033) is much higher than in LN_D3 (1785/2424). Even though fewer categories from biological process were enriched during the long-term N treatment, the N compound transport was still statistically enriched, revealing that specific ecGenes are produced in response to LN along different growth phases.

We validated one ecGene that shows a significant induction in the long-term LN treatment, overlapping with OsNPF2.4 (Os03g0687000) (Fig. 3e), known of whose expression level in old leaves is induced by N starvation and plays an important role in low-affinity nitrate acquisition and long-distance transport³⁴. Inverse PCR and SANGERseq analyses showed that this ecGene is clearly detected in 7-day LN sample and had the same sequence as that originally obtained from our eccDNA sequencing (Fig. 3f).

Gene-overlapped eccDNAs respond to P stress

Besides N deficiency, plants commonly face P deficiency in most soils due to low P availability³⁵. Considering the lower demand of P than N at early tillering stages, rice takes more days to respond to low phosphorus (LP) treatment at the global transcription and phenotypic levels³⁶. According to transcriptome analysis, the initial significant response of major P starvation-related genes in rice was observed 7 days after treatment³⁶. Thus, the short-term LP treatment was set at 7 days (Ctrl_D7 vs. LP_D7), and the long-term LP treatment at 14 days (Ctrl_D14 vs. LP_D14).

In the short-term LP treatment, we identified 556 exclusive ecGenes, among which 91 were specific to Ctrl_D7 and 465 to LP_D7 (Fig. 4a). GO enrichment analysis (Fig. 4c) identified three highly enriched categories among those categorized as biological process: tissue development (fold enrichment value = 5.87), meristem maintenance (fold enrichment value = 8.52), and meristem development (fold enrichment value = 10.6). These categories highlight the potential role of ecGenes in the plant´s developmental response to P deficiency, which is known to alter root and shoot architecture³⁶. Additionally, we identified the N compound metabolic process category, encompassing 100 ecGenes (Supplementary Data 9.1). Interestingly, despite the absence of P-related categories in biological process, we highlight the monoatomic cation transmembrane transporter activity category, enriched in molecular function (Supplementary Fig. 11a). This finding suggests that eccDNAs may play a role in modulating membrane transporter activity in response to the short-term LP treatment (Supplementary Data 9.2).

In the long-term LP treatment, we identified 348 exclusive ecGenes, of which 206 were only detected in Ctrl_D14 and 142 in LP_D14 (Fig. 4b). GO enrichment analysis (Fig. 4c) identified again the N compound metabolic process category, encompassing 72 ecGenes (Supplementary Data 10.1). We identified the response to stimulus category as the general response to abiotic stress. Worth noticing is the highest fold enrichment for the regulation of macroautophagy category with a value over 100 (Supplementary Data 10.2). This category suggests an important role of genes in modulating the frequency, rate, or extent of macroautophagy during long-term LP treatment. Within the enrichment of cellular component, we identified a high fold enrichment (43.97) for phosphatidylinositol 3-kinase complex (Supplementary Fig. 11b). This indicates that during long-term LP treatment, ecGenes actively participate in modulating the expression of the phosphatidylinositol 3-kinase (PI3K) complex (Supplementary Data 10.3).

Comparing long-term LP treatment to short-term treatments, we found a decrease in the count and proportion of exclusive ecGenes detected only in LP samples. However, the number of categories in GO enrichment of biological process and cellular component did not change significantly. Most categories related to metabolic process were enriched in both short-term and long-term LP treatment. However, specific pathways were specifically enriched for each of the two low Pi treatments (Fig. 4c). Generally, the variation and abundance of ecGenes in response to LP treatment were not as pronounced as in LN, regardless of duration of the P-starvation treatments. Nevertheless, there is still a clear and specific ecGene response to limited P supply in rice. Moreover, based on the dynamics observed in GO enrichment results, we suggest that extending the duration of the stress treatment could alter the set of ecGenes produced as a response.

We performed inverse PCR, SANGER sequencing, and KBseq (based on Illumina platform) on one of the differential ecGenes identified in the long-term LP treatment spanning nearly the entire length of OsACP1 (Os01g0720400) (Fig. 4d), which has been previously reported to play a role in rice P starvation³⁷. Our results validated the presence of this ecGene in 14-day LP sample (Fig. 4e).

Nutritional stresses lead to variation in TE-overlapped eccDNAs in rice

The relationship between eccDNAs and transposon elements (TEs) has been recently studied in several organisms, especially plants, such as Arabidopsis and Potato^14,22,23. We quantitatively analyzed eccDNAs overlapped to transposable elements (ecTEs) and full-length repeat units (full-length ecRepeatUnits) across all treatments. We identified 35,485 ecTEs and 6,866 full-length ecRepeatUnits from the 16 samples spanning 8 treatments (Supplementary Fig. 12a). Among the ecTEs, 611 were common to all treatments (Supplementary Fig. 12b).

Over 70% of ecTEs were DNA transposons, with more than 60% belonging to the Zator and hAT families (Fig. 5a, b). The proportion of DNA transposons in ecTEs did not change significantly during short-term and mid-term optimal growth (Fig. 5b). However, under LN treatment, we highlight a substantial change in the ratio of DNA transposons to retrotransposons (Fig. 5b, d). The ratio of retrotransposons increased from 12% in unstressed controls to 21.27% in the short-term LN treatment and 26.72% in the long-term LN treatment. This increase was more prevalent for the Gypsy family of LTRs (Fig. 5c, d). A similar trend was also observed for the LP treatments, in which the proportion of retrotransposons among ecTCs increased from 12–17.47% in LP_D7 and 20.94% in LP_D14 (Fig. 5d). These suggest that both LN and LP treatments induce an increase in the ratio of retrotransposons in ecTEs, with greater induction in longer treatments.

Among the full-length ecRepeatUnits, 68 were common to all treatments (Supplementary Fig. 12c). Then we focused on the main DNA transposon families, especially Stowaway, which makes up about 2% of the rice genome^38,39. We characterized 364 full-length ecRepeatUnits from Stowaway family. The count of Stowaway- full-length ecRepeatUnits was higher in all nutrient-stressed samples (226) than in all controls (138) (Fig. 5e-g). However, prolonged nutritional stresses decreased the count of Stowaway units (from 134 in LN_D3 to 19 in LN_D7, and from 47 in LP_D7 to 4 in LP_D14). We identified 26 out of the 36 Stowaway subfamilies in the ecRepeatUnits from all treatments (Supplementary Fig. 12d). Both LN and LP treatments affected the number of subfamilies and the average reads per subfamily. Samples at 3 days had the highest count of subfamilies (20 in Ctrl_D3, 24 in LN_D3), followed by samples at 7 days (12 in Ctrl_D7, 13 in LN_D7, 16 in LP_D7), and 14 days (5 in Ctrl_D14, 3 in LP_D14), revealing a negative correlation between prolonged treatment or developmental stage with the number of Stowaway subfamilies (Fig. 5e-g).

In addition to Stowaway, we identified another DNA transposon family, Kiddo, which appeared exclusively in both LN and LP treatment samples at 3 and 7 days (Fig. 5e-g). Previous studies suggest Kiddo might still be active in rice genome⁴⁰. These results highlight the dynamic nature of full-length ecRepeatUnits linked to DNA transposon families in response to nutritional stresses and treatment duration, particularly emphasizing the prevalence and potential activity of the Stowaway and Kiddo elements in eccDNAs.

Identification and validation of multiple-fragment eccDNAs in rice

Single fragment is typically the predominant eccDNAs class found in all organisms examined to date11 and in this study (Fig. 6a). However, a fraction of eccDNAs have sequences derived from different genome regions, either with two or more non-contiguous fragments from the same chromosome or two or more sequences from different chromosomes. Here, we named this kind of eccDNAs multiple-fragment eccDNAs (MF-eccDNAs) as previously defined¹¹. To complement the comprehensive understanding of rice eccDNAs (Supplementary Fig. 3a, c), eccDNA_RCA_Nanopore⁴¹, a tool reported with good performance in MF-eccDNA detection in human cells¹¹, was used for the identification of rice MF-eccDNAs. eccDNA_RCA_Nanopore was specifically designed to detect cross-chromosomal or far-distance junctional reads from long-read sequencing, unlike other tools⁴¹. When comparing ecGenes identified by ecc_finder and eccDNA_RCA_Nanopore, 91.4% were detected by both pipelines, which validated the reliability of our eccDNA identification. Based on eccDNA_RNA_Nanopore analysis, we observed that 91% of eccDNAs consisted of only one fragment (Nfragment = 1) (Fig. 6a), while the remaining 9% were composed of two to ten non-contiguous DNA regions (Nfragment ≥ 2).

We mapped all MF-eccDNA regions to the rice genome (Fig. 6a and see METHODS) and grouped the results by core gene IDs and the number of fragments (Nfragment). The origin of sequences present in MF-eccDNA was widespread in the rice genome. One notable example is a set of MF-eccDNAs ranging from 3 to 5 fragments containing the common core gene ID Os04g034305 (cytochrome c oxidase subunit 2). This set of MF-eccDNAs had a high load in LN_D7 (Fig. 6c-d, h). The 5-fragment eccDNAs of this set were validated by inverse PCR and Whole Plasmid Sequencing for both 7-day LN and 14-day LP samples (Fig. 6f-g).

We also identified another set of MF-eccDNAs with a putative cytochrome P450 as the core gene (Os05g0372300), composed of 2 to 7 fragments, with 5-fragment being the most common variant (Fig. 6e), which was more prevalent in both LN_D7 and LP_D14 (Fig. 6c). It is also important to note that we did not identify MF-eccDNAs with Os04g0343050 in any Ctrl_D3 or Ctrl_D7. Similarly, MF-eccDNAs with Os05g0372300 were absent in our Ctrl_D1 sample. Interestingly, these two sets of MF-eccDNAs always had an LTR element on one side and a DNA transposon element on the other (Supplementary Data 11.1, 11.2), suggesting possible mechanisms for their origin based on the function of TEs in rice.

On the other hand, two sets of MF-eccDNAs with Os01g0791033 or Os12g0423313 as core genes appeared in all samples, predominantly composed of 2-fragment (Supplementary Fig. 13a-c). As shown in the IGV snapshots (Supplementary Fig. 13f-g), some of the two fragments were next to each other and surrounded by several LTR elements from the same subfamilies (Supplementary Data 11.3, 11.4), suggesting an LTR-related homologous recombination (HR) mechanism in MF-eccDNA formation¹³ (Supplementary Fig. 13d-e). An increased count of these MF-eccDNAs during longer growth (Supplementary Fig. 13a; Fig. 6c) suggests that an extended optimal growth phase leads to the accumulation of specific MF-eccDNAs. WE also observed an increase in the number of reads of these two MF-eccDNA sets in LN_D3 and LN_D7 (Supplementary Fig. 13a; Fig. 6c), being much higher in LN_D7. While not as pronounced as in LN stress, we observed that MF-eccDNAs repertoire and abundance are modulated by P status in rice (Fig. 6c). These results reveal nutrient stress leads to the accumulation of MF-eccDNAs, highlighting the dynamic rice genomic responses to nutritional signals.

ATAC-seq effectively validates high-density regions of rice eccDNAs

To further validate the high eccDNA density regions in rice, we performed the Transposase-Accessible Chromatin with high-throughput sequencing (ATACseq) on Ctrl_D7, LN_D7 and LP_D7. As reported in other organisms, libraries prepared for classic ATACseq may also cover circular DNA molecules⁴². However, due to the lack of enrichment for circular DNA molecules, ATACseq only allows the detection high-density regions of eccDNAs. Although ATACseq has been previously utilized for eccDNA detection in animal cells^43,44, its application in plants has not been explored.

After applying the short-read mode from ecc_finder on ATAC-seq data, 14 regions were identified in our analysis (Fig. 7a). These 14 regions were characterized as high-density regions of single-fragment eccDNAs (ecc_finder) or MF-eccDNAs (eccDNA_RCA_Nanopore). Several mapped inside or near centromeres, which have been according to our results are high-density eccDNA regions in rice (Fig. 6i; Fig. 1c). Seven regions did not exhibit significant differences between treatments (Fig. 6j). However, the read number of eccDNAs such as those overlapping with chr01:39110652–39128615 or chr11:11895489–11900602, increased substantially in LN_D7 when compared to Ctrl_D7 or LP_D7 (Fig. 7b).

Since Kumar et al.⁴² and Kang et al.⁴⁴ reported the potential of ATACseq to identify eccDNAs, this technique has been validated for human cancer cell lines and rice in our work. Thus, this innovative approach provides valuable insights into eccDNA dynamics in plants and opens new avenues for future research.

Here, we report a comprehensive analysis of single-fragment and multiple-fragment eccDNAs (MF-eccDNAs) in rice. We characterized the presence and abundance of eccDNAs throughout plant development under optimal growth conditions and their dynamics under both short-term and long-term N and P stress. Our findings showed that rice eccDNAs could contribute to plant development and responses to nutritional stresses by altering the abundance of ecGenes and ecTEs.

Recently, extensive research to characterize eccDNAs in various organisms has been conducted^6,11,15,17. Our Nanopore sequencing data uncover their size distribution, with most eccDNAs ranging between 200 bp and 600 bp (Fig. 1a), consistent with the study on rice eccDNAs using short Illumina reads¹⁷. However, likely due to the analysis of samples subjected to nutritional treatment and the long read sequencing platform used, we identified a 4-fold higher number of distinct eccDNAs in rice shoots (96757) than that previously reported (22183)¹⁷. Additionally, we observed a high percentage of eccDNAs mapping to CDS regions (Fig. 1), which differs from findings in human cancer cell lines, where most eccDNAs originate from 5'UTR regions³.

LTRs in eccDNAs dynamics assume hypothetical mechanism on origin

Research has shown a strong association between eccDNAs and TEs in rice¹⁴ and Arabidopsis²³. LTRs have garnered particular attention due to their higher activity and their ability to circularize through DNA repair mechanisms^14,22. Our study shows that nutritional stresses increase the abundance of LTRs in ecTEs (Fig. 5c-d). Although we did not directly assess the activity of these LTRs, our findings suggest the active participation of retrotransposons in the dynamics of eccDNAs formation under nutritional stresses.

Also, our analysis of MF-eccDNAs suggests a mechanism of homologous recombination (HR) based on retrotransposons or remnants of LTRs (Supplementary Fig. 13). According to the IGV snapshot (Supplementary Fig. 13f), the two regions of a unique set of MF-eccDNAs were flanked by several LTRs from the same family, Os4_05_6L. Moreover, the gap region between these two eccDNA fragments skipped a long LTR from Os4_05_6L, strongly suggesting a looping mechanism for the formation of eccDNAs in rice similar to what was previously reported for the formation of the GAP^circle in yeast¹³. Our finding adds that an LTR-mediated HR mechanism produces both single-fragment eccDNAs and MF-eccDNAs (Supplementary Fig. 14).

Potential roles of DNA transposons Stowaway and Kiddo in eccDNA dynamics

Unlike previous studies on eccDNA dynamics in rice¹⁷, our findings suggest a significant accumulation of DNA transposons rather than retrotransposons. We observed the widespread identification of the Stowaway family in full-length ecRepeatUnits (Fig. 5h). Although our classification in ecTEs did not reveal a significant number of miniature inverted-repeat transposable elements (MITEs), which includes the Stowaway family⁴⁵, we noted an enrichment of Tc1-Mariner elements across all treatments. Previous studies showed that Tc1-Mariner elements share similar terminal inverted repeats (TIR) and target site duplications (TSD) with Stowaway³⁹. A complex relationship between Tc1-Mariner and Stowaway elements was previously reported³⁸, which could clarify the discrepancy in classification. Despite the lack of detailed understanding of Stowaway's activity mechanisms in rice, it has been demonstrated that Stowaway elements interact with Mariner-like transposases to achieve transposable activity in rice³⁹. Given the accumulation of Tc1-Mariner in ecTEs and Stowaway elements in the full-length ecRepeatUnits, we propose that these Stowaway units are active and form circular molecules in rice.

An active transposition event of the MITE-type DNA transposon Kiddo into the rice ubiquitin2 promoter⁴⁰ was reported. This event not only indicated the activity of Kiddo itself but also demonstrated that MITE transcripts induced RNA-dependent DNA methylation⁴⁶. Our full-length ecRepeatUnits analysis found Kiddo in nutritionally stressed samples, suggesting that Kiddo might be activated by environmental signals such as nutrient availability (Fig. 5e-g). Although further research is needed to determine whether these Kiddo units participate in changes in DNA methylation in response to environmental factors, the observed higher load of TE-related eccDNAs in the ddm1 Arabidopsis mutant²³ supports a hypothesis about the potential role of rice ecTEs in DNA methylation.

Functional gene-overlapped eccDNAs are involved in rice growth and responses to N and P stresses

Our analysis underscores the critical functional roles of gene-overlapped eccDNAs (ecGenes) during rice development and its responses to nutritional stress. We found that the GO enrichment of biological process varies across growth stages (Fig. 2d). During mid-term growth, we observed significant categories related to lysyl-tRNA aminoacylation (biological process) and lysine-tRNA ligase activity (molecular function), with the same two differential ecGenes showing exceptionally high fold enrichment values (> 100) (Fig. 2d; Supplementary Fig. 7a). Lysyl-tRNA synthetases (LysRS) are thought to be nonhomologous duplications of aminoacyl-tRNA synthetases and, in some instances, assist in the aminoacylation of the rare amino acid pyrrolysine^47,48. In the long-term growth phase, we detected that the DNA damage response pathway had the highest fold enrichment value (4.44), suggesting that eccDNAs play a role in preventing DNA damage during the reproductive stage or as a response to environmental factors causing DNA damage during normal development. Our findings also indicate that differential ecGenes are more abundant during initial growth. Further research is needed to elucidate the detailed mechanisms of how these ecGenes influence rice growth and development.

When subjected to LN stress, the changes in ecGenes were more pronounced compared to those observed for optimal growth. During short-term LN treatment, N compound metabolic process showed a much higher accumulation of exclusive ecGenes (443) (Fig. 3c), among which several gene IDs were associated with LN stress (Supplementary Data 6.1). For instance, we identified ecGenes containing segments of OsTIR1, whose expression is regulated by the accumulation of OsmiR393. Nitrogen-induced expression of OsTIR1 promotes rice tillering⁴⁹. Additionally, we discovered ecGenes derived from OsGOGAT1, a gene encoding a central enzyme in ammonium assimilation in rice, which works alongside OsAMT1;2 to enhance rice survival under LN stress⁵⁰. We also detected ecGenes associated with OsNAR2.1, a component of the high-affinity nitrate transporter system OsNRT2.1/2.2 and OsNRT2.3⁵¹. OsNAR2.1 facilitates nitrate absorption across various concentration levels and regulates auxin transport from the shoot to the root as part of the rice nitrate signaling system^52,53. Moreover, we identified ecGenes related to OsNIT2, which is activated by OsNAR2.1 and may contribute to root growth maintenance under varied nitrogen forms and concentrations⁵⁴. These findings suggest that ecGenes play critical roles in the rice response to LN stress, highlighting their importance in nitrogen metabolism and signaling pathways essential for plant adaptation and survival under short-term nitrogen-limited conditions.

In addition to N compound metabolic process, we identified several enriched ecGenes in the N compound transport pathway during both short-term (Supplementary Data 6.2) and long-term (Supplementary Data 7) LN treatments. In short-term LN treatment, exclusive ecGenes enriched in N compound transport included Os06g0633800, a putative amino acid transporter. We also noted enriched ecGenes of the peptide transporter gene OsPTR3 (OsNPF5.5) in this category⁵⁵. During long-term, the profile of exclusive ecGenes enriched in N compound transport shifted. We identified another putative peptide transporter gene, PTR2 (Os05g0431700), and an amino acid transporter gene, OsGAT3, known for its role in gamma-aminobutyric acid transport⁵⁶. Additionally, we identified two amino acid permease genes, OsAAP11 (Os11g0195600) and OsAAP3, which significantly improve N use efficiency⁵⁷. These findings underscore the specific response of rice ecGenes to environmental LN pressure, highlighting the dynamic regulation of N compound transport and the potential crucial role of ecGenes in adaptation to nitrogen-limited conditions.

In both short-term (7 days) and long-term (14 days) LP treatments, we identified enrichment of the N compound metabolic process category. However, comparing enriched exclusive ecGenes in this category of LP to those of LN (Supplementary Data 9.1, 10.1, and 6.1) revealed distinct gene IDs, indicating differences in the ecGenes response to different nutritional stress. During the short-term LP treatment, GO enrichment analysis highlighted OsPI4K2 in the catalytic activity category. OsPI4K2, or β-type Phosphatidylinositol 4-kinase, is critical in membrane targeting and phospholipid binding, which is part of the lipid remodeling process during P starvation response⁵⁸. During the long term, we identified the PI3K complex category in cellular component (Supplementary Data 10.3). Although the functions of the genes enriched in this category have not been extensively studied, PI3K plays an important role in growth regulation and stress responses, functioning as a positive regulator of gibberellin (GA) signaling and ABA-induced hydrogen peroxide production in rice leaves^59,60,61. We also found ecGenes overlapping with OsPHT4;3, a member of the OsPHT4 family of phosphate transporters⁶². Thus, our study reveals that ecGenes play specific functions in the LP rescue response of rice. More detailed investigations are needed to uncover how these exclusive ecGenes could modulate the rice response to phosphorus deficiency.

In summary (Fig. 8), our investigation into rice eccDNAs provides crucial insights into their role in plants. Our findings reveal potential mechanisms linking ecGenes and ecTEs to growth, development, and adaptive strategies under nitrogen- and phosphorus-limiting conditions. Additionally, we established robust statistical analysis workflows for differential elements within eccDNAs, highlighting genomic plasticity in rice. This study marks a significant advancement in rice eccDNA research and opens new dimension to study plant responses to environmental stress.

Plant material and genomic DNA isolation

Rice (Oryza sativa ssp. japonica cv. Nipponbare) was performed using a hydroponic system with modified Yoshida solution (1976) under controlled conditions (30 ^oC for 14 hours during light/day and 26 ^oC for 10 hours at dark/night). After pre-germination in tap water for 2 days, seeds were transferred to optimal nutrient solution containing 0.3125 mM (NH₄)₂SO₄, 0.3125 mM Ca(NO₃)₂, 0.1 mM NaH₂PO₄, 0.2565 mM K₂SO₄, 0.1865 mM CaCl₂, 0.8215 mM MgSO₄, 4.5 µM MnCl₂, 0.0375 µM (NH₄)₆Mo₇O₂₄, 9.5 µM H₃BO₃, 0.076 µM ZnSO₄, 0.0775 µM CuSO₄ and 0.018 mM EDTA-Fe. Seedlings were grown for 8 days with the growth solution renewed every 2 days before transferring to nutrient stress treatments. For the low nitrogen (LN) treatment, seedlings were grown in media with NH₄⁺ and NO₃⁻ 0.0625 mM for 3 days (short-term LN) and 7 days (long-term LN) treatments. For the low phosphorus (LP) treatments, seedlings were grown in media containing 5 µM phosphate (H2PO4⁻) for 7 days (short-term LP) and 14 days (long-term LP). Both treatment and control media solutions were renewed daily during the whole treatment period. Shoot tissue for two biological replicates of each treatment was harvested for subsequent experiments.

Genomic DNA (gDNA) from rice shoots was isolated using the CTAB method to prepare Oxford Nanopore sequencing samples⁶³. RNAse A (omega BIO-TEK) treatment was required to maintain a higher quality gDNA. After the column purification with Genomic DNA Clean & Concentrator^TM-10 kit (ZYMO RESEARCH), gDNA quality and concentration were determined through agarose gel electrophoresis and Nanodrop™ One Spectrophotometer (Thermo Scientific™ Invitrogen™).

Linear DNA digestion

5 µg of purified gDNA from each sample was treated with ATP-dependent PlasmidSafe DNase (Lucigen) to remove linear DNAs following the midi-size reaction protocol. In brief, 10 µL of PlasmidSafe DNase and 10 µL of 10 mM ATP were added every 24 hours. After 4 days, products were purified using the Genomic DNA Clean & Concentrator™-10 kit and eluted with 24 µL of 0.3x TE Buffer. This was followed by a mini-size preparation with an additional 16 hours of digestion. The products were purified and eluted again. Then followed by a mini-size preparation with 16 hours of digestion once more. Finally, the pure eccDNAs after 128 hours digestion were concentrated in 15 µL of 0.3x TE Buffer for later enrichment.

Enrichment of extrachromosomal circular DNAs

The random Rolling Circle Amplification (rRCA) has been performed using the illustra™ TempliPhi 100 Amplification Kit (GE Healthcare Life Sciences) following the manufacturer’s instructions. In brief, 0.5 µL eccDNA template was mixed with 5 µL Sample Buffer then heated at 95 ^oC for 3 minutes. After cooling down the template to 4°C, 5 µL of Reaction Buffer and 0.2 µL of Enzyme Mix were added and incubated at 28°C for 65 hours for eccDNA amplification. After the enrichment, products were tested with EcolR1 digestion and compared to the gDNAs.

Debranching and polishing steps were required to prepare de-hyperbranched rRCA products for Nanopore sequencing. rRCA products were purified using magnetic beads (Omega BIO-TEK), and 200 µL samples from 5 rRCA reactions were mixed with 36 U DNA polymerase phi29 (New England Biolabs), reaction buffer, and BSA and incubated at 30°C for 2 hours. The reaction was inactivated by heating at 65°C for 5 minutes. 200 U S1 nuclease (Thermo Scientific™ Invitrogen™) digested the single-strand rRCA branches at 37°C for 30 minutes, followed by magnetic bead purification. For gap filling and polishing, 12 U T4 DNA polymerase and 40 U DNA polymerase I (New England Biolabs) were incubated with NEB buffer II at 25°C for 1 hour and inactivated at 75°C for 10 minutes. The final debranched and polished products were purified using magnetic beads and ready for library preparation.

Sequencing and data analysis

Libraries preparation for Nanopore sequencing platform performing using the SQK-LSK110 kit according to the manufacturers instructions and loaded into a FLO-PRO002 flow-cell for sequencing. After sequencing, raw data were basecalled using Guppy (version6.1.5). The parameters used for Guppy were as follows: --flowcell FLO-PRO002 --kit SQK-LSK110 --calib_detect --trim_barcodes --trim_strategy dna --disable_pings --device auto --num_callers 16. The output data in fastq format was further processed through porechop (version0.2.4) to remove adapters. The parameters used for porechop were as follows: --format fastq --extra_end_trim 0 --discard_middle. Clean reads were then quality control analyzed using LongQC (version1.2.0c). The parameters used were the following: sampleqc --x ont-ligation.

Identification and distribution analysis of eccDNAs

ecc_finder (version1.0.0) was used for eccDNA identification. ecc_finder excludes long reads originating from linear genomic repeats, removes alignments which are shorter than 200 bp and less than 2 repeat units or the divergence rate between repeat units exceeds 25%, at last keeps loci which covers > = 3 reads including at least 80% of reads length as a region where eccDNAs originate²⁷. The chromosomal distribution of eccDNAs across whole genome was plotted through karyoploteR (version1.20.3). The density of eccDNAs per 100 kb was calculated through samtools (version1.12) and bedtools (version2.30.0), then plotted through circlize (version0.4.16). The size distribution of eccDNAs together with the other visualization was plotted through ggplot2 (version3.3).

Differential ecGenes analysis

The overlapping between genes and eccDNAs was done by bedtools (version2.30.0), among which eccDNAs that overlapped with genes were named as ecGenes. A counting matrix of ecGenes detected in each treatment was built with their read number for differential analysis. Zero inflated poisson distribution of our count data was described in all counting matrices through the performance package (version0.10.9) from R. Considering the distribution as zero inflated, the GMPR⁶⁴ command from GUniFrac package (version1.8) was used for normalizing the counting matrix. Finally, EMDomics⁶⁵ (version2.24.0) was used for differential analysis of ecGenes, ecGene with a q-value was lower than 0.05 was considered as differential ecGene.

Gene Ontology enrichment and REVIGO analysis

The gene ontology analysis has been performed for both differential ecGenes and exclusive ecGenes through the web-page supported by PANTHER⁶⁶ (http://geneontology.org). To reduce the highly redundant categories and make more accurate interpret, both p-value from the GO enrichment analysis together with the IDs of GO categories were used for REVIGO⁶⁷ summary through online program REVIGO (version 1.8.1) (http://revigo.irb.hr). The filtering settings were adjusted as “Medium (pin = 0.7)” as recommended in their publication, the resulting lists from REVIGO were visualized as dot plots using ggplot2.

Transposon elements and repeat units analysis

Rice transposon elements database was annotated by TransposonUltimate⁶⁸ and downloaded from the O_SATIVA_JAP folder named Annotation on the web-page

(https://cellgeni.cog.sanger.ac.uk/browser.html?shared=transposonultimate/). TransposonUltimate, a newer and well-developed tool for TE detection and classification in rice, provides more comprehensive information on the classification. The overlap analysis between TEs and eccDNAs was done by bedtools. eccDNAs that overlapped to TEs were named as ecTEs. Based on the information of classification from TransposonUltimate, ecTEs have been sorted into DNA transposons and Retrotransposons. Then the detailed categories, for instance, Zator, hAT and Tc1-Mariner from DNA transposon as well as Gypsy and Copia from Long Terminal Repeat (LRT) retrotransposon have been described in visual for each treatment samples through ggplot2.

On the other hand, to have more information on unique family of repeat units that associate with rice eccDNAs, the overlapping between repeat units and eccDNAs was done by bedtools. The repeat units database was downloaded from the RAP-DB⁶⁹ (https://rapdb.dna.affrc.go.jp/download/irgsp1.html). Unique eccDNAs that fully covering the repeat units were named as full-length ecRepeatUnits. Besides the LTRs, insight on several possible active DNA transposons families, for instance, Stowaway and Kiddo, have also been taken to the sankey diagram with the use of sankeyNetwork command from the networkD3 package (version0.4).

ATAC-seq

ATAC-seq was performed on rice samples subjected to three different conditions: 7 days Low Phosphorus (LP_D7), 7 days Low Nitrogen (LN_D7), and 7 days Control (Ctrl_D7), to further validate the presence of eccDNAs in rice. Nuclei Isolation Buffer (NIB), PBS with BSA and RNAse inhibitor (PBSBR), and 60% Percoll solution were prepared and precooled for nuclei isolation. Approximately 0.2 grams of rice shoot tissue was chopped in 2 mL of NIB, then filtered through a 40 µm cell strainer. The nuclear suspension was centrifuged at 4°C, 2000 g for 15 minutes. The nuclei pellet was washed with PBS, centrifuged again, and then washed with 60% Percoll solution. After washing with Percoll, the suspension was centrifuged at 4°C, 1000 g for 10 minutes. The nuclei pellet was washed twice with PBSBR and resuspended in 200 µL of PBSBR. The nuclear suspension was quantified, and 10⁶ nuclei were used for subsequent Tn5 transposase digestion. Following the manufacturer's instructions for the TruePrep® DNA Library Prep Kit V2 for Illumina (Vazyme #TD501), the nuclei were digested with Tn5 transposase. The resulting products were purified, followed by PCR to add adapters. Fragments were size-selected between 250 bp and 1000 bp using magnetic beads. The ATAC-seq libraries were then ready for sequencing on the Illumina platform. ATAC-seq data was analyzed using the short-read model from ecc_finder for the detection of eccDNAs.

Multiple-fragment eccDNA characterization

eccDNA_RCA_Nanopore¹¹ was used for identification of multiple-fragment eccDNAs (MF-eccDNAs) (https://github.com/icebert/eccDNA_RCA_nanopore). After filtering with the parameters “Nfullpass” and “Nfragment” = 1, the resulting eccDNAs were considered as MF-eccDNAs and completed the overlapping with genes reference file through bedtools. The IGV browser was used to show more details among MF-eccDNAs, genes and TEs.

Inverse PCR validations

Genomic DNA was extracted from shoot tissue of rice through CTAB method as mentioned forward. After digestion with ATP-dependent PlasmidSafe DNase for 128 hours, rRCA was performed and purified with Genomic DNA Clean & Concentrator^TM-10 kit. gDNA, pure eccDNA products after PlasmidSafe DNase digestion as well as the rRCA products were used as the templates for inverse PCR validations. Inverse primers were designed and blasted through online tools of NCBI https://blast.ncbi.nlm.nih.gov/Blast.cgi. Both 2x Phanta Max Master Mix (Vazyme) and Taq DNA Polymerase with ThermoPol® Buffer (New England Biolabs) were used in PCR validation following their official recommendations. PCR products were checked through agarose gel electrophoresis first, and then purified through E.Z.N.A.® Gel Extraction Kit (omega BIO-TEK) for further validations on sequences from SANGER sequencing, KBseq based on the Illumina platform or the Whole Plasmid Sequencing based on the Nanopore platform.

Data availability

All sequencing data generated in this study have been deposited at National Center for Biotechnology Information (NCBI) under Bioproject accession-ID: PRJNA1136237. Source data are provided with this paper.

Acknowledgments

This study is supported by National Key Research and Development Program of China (2021YFF1000400), Jiangsu Seed Industry Revitalization Project (JBGS [2021] 011). We thank the technical supporting from the high-performance computing platform of Bioinformatics Center, Nanjing Agricultural University. We thank Zhitao Zhu (Bioinformatics Center, Nanjing Agricultural University) for his outstanding assistance on computing platform testing. We thank Dr. Shichao Wang (National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University), Dr. Gerardo Alejo Jacuinde (IGCAST, Texas Tech University), Dr. Francisco Perez Zavala (IGCAST, Texas Tech University), Dr. Gabriela Cabrales Orona (IGCAST, Texas Tech University), Benjamin Perez Sanchez (IGCAST, Texas Tech University) and Valeria Flores Tinoco (IGCAST, Texas Tech University) for their invaluable advice on this research.

Author contributions

Experimental design: H.N., L.Y.-V., G.X. D.L-A. and L.H.-E.; eccDNAs extraction, detection and validation: H.N., L.Y.-V. and M.G.; Nuclear extraction and ATACseq: M.C., L.G., L.Y.-V. and M.G.; data analysis: H.N., L.Y.-V., D.L.-A., G.X. and L.H.-E.; manuscript writing and revision: H.N., L.Y.-V., D.L.-A., G.X. and L.H.-E. All authors read and approved the manuscript.

Competing interests

The authors declare no competing interests.

Cao X, Wang S, Ge L, Zhang W, Huang J, Sun W (2021) Extrachromosomal Circular DNA: Category, Biogenesis, Recognition, and Functions. Front veterinary Sci 8:693641. https://doi.org/10.3389/fvets.2021.693641
HOTTA Y, BASSEL A, CIRCULARITY OF DNA IN CELLS OF MAMMALS AND HIGHER PLANTS (1965) Proc Natl Acad Sci USA 53(2):356–362. https://doi.org/10.1073/pnas.53.2.356. MOLECULAR SIZE AND
Paulsen T, Kumar P, Koseoglu MM, Dutta A (2018) Discoveries of Extrachromosomal Circles of DNA in Normal and Tumor Cells. Trends Genet 34(4):270–278. https://doi.org/10.1016/j.tig.2017.12.010
Wahl GM (1989) The importance of circular DNA in mammalian gene amplification. Cancer Res 49(6):1333–1340
Noer JB, Hørsdal OK, Xiang X, Luo Y, Regenberg B (2022) Extrachromosomal circular DNA in cancer: history, current knowledge, and methods. Trends Genet 38(7):766–781. https://doi.org/10.1016/j.tig.2022.02.007
Møller HD, Parsons L, Jørgensen TS, Botstein D, Regenberg B (2015) Extrachromosomal circular DNA is common in yeast. Proc Natl Acad Sci USA 112(24):E3114–E3122. https://doi.org/10.1073/pnas.1508825112
Peng H, Mirouze M, Bucher E (2022) Extrachromosomal circular DNA: A neglected nucleic acid molecule in plants. Curr Opin Plant Biol 69:102263. https://doi.org/10.1016/j.pbi.2022.102263
Wu S, Turner KM, Nguyen N, Raviram R, Erb M, Santini J, Luebeck J, Rajkumar U, Diao Y, Li B, Zhang W, Jameson N, Corces MR, Granja JM, Chen X, Coruh C, Abnousi A, Houston J, Ye Z, Hu R, Mischel PS (2019) Circular ecDNA promotes accessible chromatin and high oncogene expression. Nature 575(7784):699–703. https://doi.org/10.1038/s41586-019-1763-5
Morton AR, Dogan-Artun N, Faber ZJ, MacLeod G, Bartels CF, Piazza MS, Allan KC, Mack SC, Wang X, Gimple RC, Wu Q, Rubin BP, Shetty S, Angers S, Dirks PB, Sallari RC, Lupien M, Rich JN, Scacheri PC (2019) Functional Enhancers Shape Extrachromosomal Oncogene Amplifications. Cell 179(6):1330–1341e13. https://doi.org/10.1016/j.cell.2019.10.039
Paulsen T, Shibata Y, Kumar P, Dillon L, Dutta A (2019) Small extrachromosomal circular DNAs, microDNA, produce short regulatory RNAs that suppress gene expression independent of canonical promoters. Nucleic Acids Res 47(9):4586–4596. https://doi.org/10.1093/nar/gkz155
Wang Y, Wang M, Djekidel MN, Chen H, Liu D, Alt FW, Zhang Y (2021) eccDNAs are apoptotic products with high innate immunostimulatory activity. Nature 599(7884):308–314. https://doi.org/10.1038/s41586-021-04009-w
Hull RM, King M, Pizza G, Krueger F, Vergara X, Houseley J (2019) Transcription-induced formation of extrachromosomal DNA during yeast ageing. PLoS Biol 17(12):e3000471. https://doi.org/10.1371/journal.pbio.3000471
Gresham D, Usaite R, Germann SM, Lisby M, Botstein D, Regenberg B (2010) Adaptation to diverse nitrogen-limited environments by deletion or extrachromosomal element formation of the GAP1 locus. Proceedings of the National Academy of Sciences of the United States of America, 107(43), 18551–18556. https://doi.org/10.1073/pnas.1014023107
Lanciano S, Carpentier MC, Llauro C, Jobet E, Robakowska-Hyzorek D, Lasserre E, Ghesquière A, Panaud O, Mirouze M (2017) Sequencing the extrachromosomal circular mobilome reveals retrotransposon activity in plants. PLoS Genet 13(2):e1006630. https://doi.org/10.1371/journal.pgen.1006630
Wang K, Tian H, Wang L, Wang L, Tan Y, Zhang Z, Sun K, Yin M, Wei Q, Guo B, Han J, Zhang P, Li H, Liu Y, Zhao H, Sun X (2021) Deciphering extrachromosomal circular DNA in Arabidopsis. Computational and structural biotechnology journal. 19:1176–1183. https://doi.org/10.1016/j.csbj.2021.01.043
Huang Y, Ding W, Zhang M, Han J, Jing Y, Yao W, Hasterok R, Wang Z, Wang K (2021) The formation and evolution of centromeric satellite repeats in Saccharum species. Plant journal: cell Mol biology 106(3):616–629. https://doi.org/10.1111/tpj.15186
Zhuang J, Zhang Y, Zhou C, Fan D, Huang T, Feng Q, Lu Y, Zhao Y, Zhao Q, Han B, Lu T (2024) Dynamics of extrachromosomal circular DNA in rice. Nat Commun 15(1):2413. https://doi.org/10.1038/s41467-024-46691-0
Gaines TA, Zhang W, Wang D, Bukun B, Chisholm ST, Shaner DL, Nissen SJ, Patzoldt WL, Tranel PJ, Culpepper AS, Grey TL, Webster TM, Vencill WK, Sammons RD, Jiang J, Preston C, Leach JE, Westra P (2010) Gene amplification confers glyphosate resistance in Amaranthus palmeri. Proc Natl Acad Sci USA 107(3):1029–1034. https://doi.org/10.1073/pnas.0906649107
Koo DH, Molin WT, Saski CA, Jiang J, Putta K, Jugulam M, Friebe B, Gill BS (2018) Extrachromosomal circular DNA-based amplification and transmission of herbicide resistance in crop weed Amaranthus palmeri. Proc Natl Acad Sci USA 115(13):3332–3337. https://doi.org/10.1073/pnas.1719354115
Molin WT, Patterson EL, Saski CA (2020a) Homogeneity among glyphosate-resistant Amaranthus palmeri in geographically distant locations. PLoS ONE 15(9):e0233813. https://doi.org/10.1371/journal.pone.0233813
Molin WT, Yaguchi A, Blenner M, Saski CA (2020b) The EccDNA Replicon: A Heritable, Extranuclear Vehicle That Enables Gene Amplification and Glyphosate Resistance in Amaranthus palmeri. Plant Cell 32(7):2132–2140. https://doi.org/10.1105/tpc.20.00099
Esposito S, Barteri F, Casacuberta J, Mirouze M, Carputo D, Aversano R (2019) LTR-TEs abundance, timing and mobility in Solanum commersonii and S. tuberosum genomes following cold-stress conditions. Planta, 250(5), 1781–1787. https://doi.org/10.1007/s00425-019-03283-3
Zhang P, Mbodj A, Soundiramourtty A, Llauro C, Ghesquière A, Ingouff M, Slotkin K, Pontvianne R, Catoni F, M., Mirouze M (2023) Extrachromosomal circular DNA and structural variants highlight genome instability in Arabidopsis epigenetic mutants. Nat Commun 14(1):5236. https://doi.org/10.1038/s41467-023-41023-0
Mehta D, Cornet L, Hirsch-Hoffmann M, Zaidi SS, Vanderschuren H (2020) Nat Protoc 15(5):1673–1689. https://doi.org/10.1038/s41596-020-0301-0. Full-length sequencing of circular DNA viruses and extrachromosomal circular DNA using CIDER-Seq
Mann L, Seibt KM, Weber B, Heitkam T (2022) ECCsplorer: a pipeline to detect extrachromosomal circular DNA (eccDNA) from next-generation sequencing data. BMC Bioinformatics 23(1):40. https://doi.org/10.1186/s12859-021-04545-2
Joubert PM, Krasileva KV (2022) The extrachromosomal circular DNAs of the rice blast pathogen Magnaporthe oryzae contain a wide variety of LTR retrotransposons, genes, and effectors. BMC Biol 20(1):260. https://doi.org/10.1186/s12915-022-01457-2
Zhang P, Peng H, Llauro C, Bucher E, Mirouze M (2021) Front Plant Sci 12:743742. https://doi.org/10.3389/fpls.2021.743742. ecc_finder: A Robust and Accurate Tool for Detecting Extrachromosomal Circular DNA From Sequencing Data
Kim SK, Yun CH, Lee JH, Jang YH, Park HY, Kim JK (2008) OsCO3, a CONSTANS-LIKE gene, controls flowering by negatively regulating the expression of FT-like genes under SD conditions in rice. Planta 228(2):355–365. https://doi.org/10.1007/s00425-008-0742-0
Kim SL, Lee S, Kim HJ, Nam HG, An G (2007) OsMADS51 is a short-day flowering promoter that functions upstream of Ehd1, OsMADS14, and Hd3a. Plant physiology, 145(4), 1484–1494. https://doi.org/10.1104/pp.107.103291
Liu X, Zhou C, Zhao Y, Zhou S, Wang W, Zhou DX (2014) The rice enhancer of zeste [E(z)] genes SDG711 and SDG718 are respectively involved in long day and short day signaling to mediate the accurate photoperiod control of flowering time. Front Plant Sci 5:591. https://doi.org/10.3389/fpls.2014.00591
Yan Y, Guo G, Huang J, Gao M, Zhu Q, Zeng S, Gong Z, Xu Z (2020) Current understanding of extrachromosomal circular DNA in cancer pathogenesis and therapeutic resistance. J Hematol Oncol 13(1):124. https://doi.org/10.1186/s13045-020-00960-9
Xu G, Fan X, Miller AJ (2012) Plant nitrogen assimilation and use efficiency. Annu Rev Plant Biol 63:153–182. https://doi.org/10.1146/annurev-arplant-042811-105532
Li B, Xin W, Sun S, Shen Q, Xu G (2006) Physiological and molecular responses of nitrogen-starvedrice plants to re-supply of different nitrogen sources. Plant and Soil, 287, 145–159. https://DOI10.1007/s11104-006-9051-1
Xia X, Fan X, Wei J, Feng H, Qu H, Xie D, Miller AJ, Xu G (2015) Rice nitrate transporter OsNPF2.4 functions in low-affinity acquisition and long-distance transport. J Exp Bot 66(1):317–331. https://doi.org/10.1093/jxb/eru425
Chang MX, Gu M, Xia YW, Dai XL, Dai CR, Zhang J, Wang SC, Qu HY, Yamaji N, Ma F, J., Xu GH (2019) OsPHT1;3 Mediates Uptake, Translocation, and Remobilization of Phosphate under Extremely Low Phosphate Regimes. Plant Physiol 179(2):656–670. https://doi.org/10.1104/pp.18.01097
Secco D, Jabnoune M, Walker H, Shou H, Wu P, Poirier Y, Whelan J (2013) Spatio-temporal transcript profiling of rice roots and shoots in response to phosphate starvation and recovery. Plant Cell 25(11):4285–4304. https://doi.org/10.1105/tpc.113.117325
Hur YJ, Lee HG, Jeon EJ, Lee YY, Nam MH, Yi G, Eun MY, Nam J, Lee JH, Kim DH (2007) A phosphate starvation-induced acid phosphatase from Oryza sativa: phosphate regulation and transgenic expression. Biotechnol Lett 29(5):829–835. https://doi.org/10.1007/s10529-007-9318-5
Feschotte C, Swamy L, Wessler SR (2003) Genome-wide analysis of mariner-like transposable elements in rice reveals complex relationships with stowaway miniature inverted repeat transposable elements (MITEs). Genetics 163(2):747–758. https://doi.org/10.1093/genetics/163.2.747
Feschotte C, Osterlund MT, Peeler R, Wessler SR (2005) DNA-binding specificity of rice mariner-like transposases and interactions with Stowaway MITEs. Nucleic Acids Res 33(7):2153–2165. https://doi.org/10.1093/nar/gki509
Yang G, Dong J, Chandrasekharan MB, Hall TC (2001) Kiddo, a new transposable element family closely associated with rice genes. Mol Genet genomics: MGG 266(3):417–424. https://doi.org/10.1007/s004380100530
Wang Y, Wang M, Zhang Y (2023) Purification, full-length sequencing and genomic origin mapping of eccDNA. Nat Protoc 18(3):683–699. https://doi.org/10.1038/s41596-022-00783-7
Kumar P, Kiran S, Saha S, Su Z, Paulsen T, Chatrath A, Shibata Y, Shibata E, Dutta A (2020) ATAC-seq identifies thousands of extrachromosomal circular DNA in cancer and cell lines. Sci Adv 6(20):eaba2489. https://doi.org/10.1126/sciadv.aba2489
Su Z, Saha S, Paulsen T, Kumar P, Dutta A (2021) ATAC-Seq-based Identification of Extrachromosomal Circular DNA in Mammalian Cells and Its Validation Using Inverse PCR and FISH. Bio-protocol 11(9):e4003. https://doi.org/10.21769/BioProtoc.4003
Kang J, Dai Y, Li J, Fan H, Zhao Z (2023) Investigating cellular heterogeneity at the single-cell level by the flexible and mobile extrachromosomal circular DNA. Comput Struct Biotechnol J 21:1115–1121. https://doi.org/10.1016/j.csbj.2023.01.025
Bureau TE, Wessler SR (1994) Stowaway: a new family of inverted repeat elements associated with the genes of both monocotyledonous and dicotyledonous plants. Plant Cell 6(6):907–916. https://doi.org/10.1105/tpc.6.6.907
Yang G, Lee YH, Jiang Y, Shi X, Kertbundit S, Hall TC (2005) A two-edged role for the transposable element Kiddo in the rice ubiquitin2 promoter. Plant Cell 17(5):1559–1568. https://doi.org/10.1105/tpc.104.030528
Rubio Gomez MA, Ibba M (2020) Aminoacyl-tRNA synthetases, vol 26. RNA (New York, pp 910–936. 8https://doi.org/10.1261/rna.071720.119
Pang YL, Poruri K, Martinis SA (2014) tRNA synthetase: tRNA aminoacylation and beyond. Wiley interdisciplinary reviews. RNA 5(4):461–480. https://doi.org/10.1002/wrna.1224
Li X, Xia K, Liang Z, Chen K, Gao C, Zhang M (2016) MicroRNA393 is involved in nitrogen-promoted rice tillering through regulation of auxin signal transduction in axillary buds. Sci Rep 6:32158. https://doi.org/10.1038/srep32158
Lee S, Marmagne A, Park J, Fabien C, Yim Y, Kim SJ, Kim TH, Lim PO, Masclaux-Daubresse C, Nam HG (2020) Concurrent activation of OsAMT1;2 and OsGOGAT1 in rice leads to enhanced nitrogen use efficiency under nitrogen limitation. Plant journal: cell Mol biology 103(1):7–20. https://doi.org/10.1111/tpj.14794
Liu X, Huang D, Tao J, Miller AJ, Fan X, Xu G (2014) Identification and functional assay of the interaction motifs in the partner protein OsNAR2.1 of the two-component system for high-affinity nitrate transport. New Phytol 204(1):74–80. https://doi.org/10.1111/nph.12986
Yan M, Fan X, Feng H, Miller AJ, Shen Q, Xu G (2011) Rice OsNAR2.1 interacts with OsNRT2.1, OsNRT2.2 and OsNRT2.3a nitrate transporters to provide uptake over high and low concentration ranges. Plant Cell Environ 34(8):1360–1372. https://doi.org/10.1111/j.1365-3040.2011.02335.x
Huang S, Chen S, Liang Z, Zhang C, Yan M, Chen J, Xu G, Fan X, Zhang Y (2015) Knockdown of the partner protein OsNAR2.1 for high-affinity nitrate transport represses lateral root formation in a nitrate-dependent manner. Sci Rep 5:18192. https://doi.org/10.1038/srep18192
Song M, Fan X, Chen J, Qu H, Luo L, Xu G (2020) OsNAR2.1 Interaction with OsNIT1 and OsNIT2 Functions in Root-growth Responses to Nitrate and Ammonium. Plant physiology, 183(1), 289–303. https://doi.org/10.1104/pp.19.01364
Léran, S., Varala, K., Boyer, J. C., Chiurazzi, M., Crawford, N., Daniel-Vedele, F.,David, L., Dickstein, R., Fernandez, E., Forde, B., Gassmann, W., Geiger, D., Gojon,A., Gong, J. M., Halkier, B. A., Harris, J. M., Hedrich, R., Limami, A. M., Rentsch,D., Seo, M., … Lacombe, B. (2014). A unified nomenclature of NITRATE TRANSPORTER 1/PEPTIDE TRANSPORTER family members in plants. Trends in plant science, 19(1), 5–9. https://doi.org/10.1016/j.tplants.2013.08.008
Zhao H, Ma H, Yu L, Wang X, Zhao J (2012) Genome-wide survey and expression analysis of amino acid transporter gene family in rice (Oryza sativa L). PLoS ONE 7(11):e49210. https://doi.org/10.1371/journal.pone.0049210
Lu K, Wu B, Wang J, Zhu W, Nie H, Qian J, Huang W, Fang Z (2018) Blocking amino acid transporter OsAAP3 improves grain yield by promoting outgrowth buds and increasing tiller number in rice. Plant Biotechnol J 16(10):1710–1722. https://doi.org/10.1111/pbi.12907
Lou Y, Ma H, Lin WH, Chu ZQ, Mueller-Roeber B, Xu ZH, Xue HW (2006) The highly charged region of plant beta-type phosphatidylinositol 4-kinase is involved in membrane targeting and phospholipid binding. Plant Mol Biol 60(5):729–746. https://doi.org/10.1007/s11103-005-5548-x
Liu J, Zhou J, Xing D (2012) Phosphatidylinositol 3-kinase plays a vital role in regulation of rice seed vigor via altering NADPH oxidase activity. PLoS ONE 7(3):e33817. https://doi.org/10.1371/journal.pone.0033817
Day P, MacCleery S, Kim SH, Gilroy S (2006) Gibberellin signaling through phosphatidylinositol 3-kinase. Second Pan American Plant Membrane Biology Workshop, South Padre Island, TX
Hung K, Kao C (2005) Phosphatidylinositol 3-phosphate is required for abscisic acid-induced hydrogen peroxide production in rice leaves. Plant Growth Regul 45:95–101. https://doi.org/10.1007/s10725-005-1434-4
Li R, Wang JL, Xu L, Sun MH, Yi KK, Zhao HY (2020) Functional Analysis of Phosphate Transporter OsPHT4 Family Members in Rice. Rice Sci 27(6):493–503. https://doi.org/10.1016/j.rsci.2020.09.006
Healey A, Furtado A, Cooper T, Henry RJ (2014) Protocol: a simple method for extracting next-generation sequencing quality genomic DNA from recalcitrant plant species. Plant methods 10:21. https://doi.org/10.1186/1746-4811-10-21
Chen L, Reeve J, Zhang L, Huang S, Wang X, Chen J (2018) PeerJ 6:e4600. https://doi.org/10.7717/peerj.4600. GMPR: A robust normalization method for zero-inflated count data with application to microbiome sequencing data
Nabavi S, Schmolze D, Maitituoheti M, Malladi S, Beck AH (2016) EMDomics: a robust and powerful method for the identification of genes differentially expressed between heterogeneous classes. Bioinf (Oxford England) 32(4):533–541. https://doi.org/10.1093/bioinformatics/btv634
Mi H, Muruganujan A, Ebert D, Huang X, Thomas PD (2019) PANTHER version 14: more genomes, a new PANTHER GO-slim and improvements in enrichment analysis tools. Nucleic Acids Res 47(D1):D419–D426. https://doi.org/10.1093/nar/gky1038
Supek F, Bošnjak M, Škunca N, Šmuc T (2011) REVIGO summarizes and visualizes long lists of gene ontology terms. PLoS ONE 6(7):e21800. https://doi.org/10.1371/journal.pone.0021800
Riehl K, Riccio C, Miska EA, Hemberg M (2022) TransposonUltimate: software for transposon classification, annotation and detection. Nucleic Acids Res 50(11):e64. https://doi.org/10.1093/nar/gkac136
Kawahara, Y., de la Bastide, M., Hamilton, J. P., Kanamori, H., McCombie, W. R., Ouyang,S., Schwartz, D. C., Tanaka, T., Wu, J., Zhou, S., Childs, K. L., Davidson, R. M.,Lin, H., Quesada-Ocampo, L., Vaillancourt, B., Sakai, H., Lee, S. S., Kim, J., Numa,H., Itoh, T., … Matsumoto, T. (2013). Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data. Rice (New York,N.Y.), 6(1), 4. https://doi.org/10.1186/1939-8433-6-4

There is NO Competing Interest.

SupplementaryData0718.xlsx
Supplementary Data_0718.xlsx
SupplementaryFigures0718.pdf

Download PDF

Version 1

posted

You are reading this latest preprint version

Unveiling eccDNA Dynamics in Rice: Insights into Adaptation to Nutritional Stress

Status:

Version 1

Abstract

Figures

Introduction

Results

eccDNAs are derived from diverse region across the rice genome

Gene-overlapped eccDNAs show dynamic changes during rice growth

Gene-overlapped eccDNAs respond to N stress

Gene-overlapped eccDNAs respond to P stress

Nutritional stresses lead to variation in TE-overlapped eccDNAs in rice

Identification and validation of multiple-fragment eccDNAs in rice

ATAC-seq effectively validates high-density regions of rice eccDNAs

Discussion

LTRs in eccDNAs dynamics assume hypothetical mechanism on origin

Functional gene-overlapped eccDNAs are involved in rice growth and responses to N and P stresses

Methods

Plant material and genomic DNA isolation

Linear DNA digestion

Enrichment of extrachromosomal circular DNAs

Sequencing and data analysis

Identification and distribution analysis of eccDNAs

Gene Ontology enrichment and REVIGO analysis

Transposon elements and repeat units analysis

ATAC-seq

Multiple-fragment eccDNA characterization

Inverse PCR validations

Declarations

References

Additional Declarations

Supplementary Files

Status:

Version 1