The primordial germ cells (PGCs) of starved AMPK mutant L1 larvae have abnormally high levels of nuclear localized SET-2, an MLL/SET1-like H3K4 histone methyltransferase that is likely responsible for the observed increase in trimethylated histone H3 at lysine 4 (H3K4me3), resulting in the inappropriate activation of transcriptional elongation in some genes1,10. Since the F2 generation is the first generation that was not directly exposed to the starvation11, we performed ChIP-seq to generate genome-wide H3K4me3 maps from the F2 adult descendants of starved AMPK mutant L1 larvae12 (the first transgenerational brood) (Fig. 1a). Consistent with our previous findings, H3K4me3 levels are abnormally high in the F2 descendants of starved AMPK mutants, while additional "ectopic" sites of deposition that diverge from the wild-type pattern of R-loop-associated loci, result in a strikingly distinct H3K4me3 signature in the aak-1/2 mutant animals compared to WT adult hermaphrodites, with 31 additional ectopic sites (Supplementary Fig. 1a).
H3K4me3 is found primarily on chromatin near or around proximal promoters or transcription start sites (TSSs)13. Consistent with this feature, we noted a high proportion of the "ectopic" H3K4me3 peaks identified in the descendants of starved AMPK mutants at, or around, promoter regions (Fig. 1b and Supplementary Fig. 1b). Annotation and peak location statistics reveal that the F2 descendants of starved AMPK mutant hermaphrodites (aak-1/-2) harbour more H3K4me3 signals spanning TSSs when compared to WT adults (Fig. 1c,d and Supplementary Fig. 1c-e). Further global analysis of H3K4me3 signals indicated that ~ 88% of the peak calls in aak-1/2 mutants accumulate at promoter-TSSs, while only ~ 36% are found at such loci in WT animals (Fig. 1e).
Blocking the formation of H3K4me3 at TSS-proximal regions is critical for early embryonic development in C. elegans14. We identified that 40% of the total H3K4me3 was deposited at proximal regions (2 kb upstream of TSS) in aak-1/2 mutants compared to only ~ 24% in WT animals (Supplementary Fig. 2), suggesting that the “ectopic” deposition of these H3K4me3 marks could generate an abnormal transcription-activating signal at the TSS-proximal regions of these genes.
Transgenerational H3K4me3 accumulation at TSSs may thus result in robust, but inappropriate, gene expression in starved AMPK mutants. If the synthesis of these gene products becomes miscoordinated with post embryonic developmental progression, their aberrant expression could negatively impact cellular homeostasis and/or growth. To determine if this aberrant transcription contributes to the starvation-induced sterility that occurs in the absence of AMPK signalling through untimely protein synthesis, we treated the starved mutants with the translation inhibitor cycloheximide (CHX) during the diapause and evaluated sterility following recovery in replete growth conditions. Unlike the treatment with the specific transcription elongation inhibitor, 5,6-dichloro-1-β-D-ribofuranosylbenzimidazole (DRB)1, blocking translation had no differential effect on the starvation-induced sterility of these AMPK mutants (Fig. 2a, b). These results strongly suggest that the starvation-induced reproductive defects typical of AMPK mutants arise from the inappropriate transcription/elongation that occurs in the PGCs, or potentially from the RNA products that are generated by these animals during starvation.
Efficient transcriptional elongation only occurs following multiple changes within the post-initiation RNA Polymerase II complex that enhance RNA processing and processivity15,16. In the absence of these components the elongating complex can pause until these contingencies are satisfied17,18. In some cases the nascent preRNA that is associated with these stalled complexes can thread back into the transcription bubble, where it can base pair with the non-coding strand of DNA. These triplexes are referred to as R-loops, and they are abundant non-B DNA structures formed by co-transcriptional DNA-RNA hybrids that can occur during transcriptional pausing19 (Fig. 4g). Their presence in the genome is strongly selected against due to their propensity to cause mutations. R-loop occurrence is associated with replication/transcription conflicts that result in DNA damage, while they are also a source of genome instability resulting from single nucleotide changes catalyzed by DNA modifying enzymes such as cytidine deaminases that can access the vulnerable single stranded DNA corresponding to the coding strand of expressed genes20.
Because these structures can cause genome instability that could be transmitted throughout successive generations, we probed the germline DNA from the F2 descendants of starved AMPK mutants for the presence of R-loops. In parallel, we stained the germ line of a mutant that is deficient for the Tho-Trex complex (thoc-2) that results in an increased abundance of R-loops21. Our analysis indicated that the AMPK mutant (aak-1/2) germ lines displayed significantly higher levels of R-loop foci compared to WT (Fig. 2c), which could be partially alleviated by microinjecting RNaseH enzyme22 or rnh-1-expressing transgenes into the gonad (Supplementary Fig. 3a,c), or by inhibition of inappropriate transcriptional elongation with DRB during periods of starvation (Supplementary Fig. 3b). Like the H3K4me3 levels1, the abnormal abundance of R-loops was gradually resolved over multiple generations, whereby R-loop numbers were most abundant in the germ cells of animals following starvation in P0 adults > F2 descendants > F5 descendants (Fig. 2b).
To better understand if a specific class of genes were prone to R-loop formation in the absence of AMPK during starvation, high-throughput sequencing and analysis of genomic DNA-RNA hybrids by immunoprecipitation (DRIP-seq) was performed identically to the H3K4me3 ChIP-seq experiments, using genomic DNA obtained from the F2 generation of starved WT and aak-1/2 animals (Supplementary Fig. 4a). We noted a dramatic expansion in the number of R-loops at loci that had not previously been associated with these structures, but also in the overall abundance throughout the entire genome (Supplementary Fig. 4b) in the F2 descendants of starved aak-1/2 mutants compared to WT F2 controls by comparing both read coverage and depth (Fig. 2e, f), with high reproducibility between biological replicates (Supplementary Fig. 5a, b). The specificity of the S9.6 antibody that recognizes the DNA-RNA hybrids was corroborated by the decrease in germline signal detected upon RNaseH treatment, consistent with these RNaseH-sensitive loci being R-loops (Fig. 2e).
The locus-specific enrichment of R-loops was validated by DRIP-qPCR in which some (~ 50%, Supplementary Fig. 3b) of the enhanced R-loops in the F2 descendants of starved aak-1/2 mutants could be resolved by blocking elongation during the period of starvation, while others remained unchanged (cit-1.2; Fig. 2g).
Genome wide DRIP-seq allowed us to identify a significant proportion of the genome (3.1% and 4.7% in the F2 descendants of starved WT and AMPK mutants, respectively) involved in R-loop formation in C. elegans (Supplementary Fig. 5d). In both WT and aak-1/2 mutants, a high number of genomic regions that were associated with R-loops were conserved between the two genotypes (Supplementary Fig. 5c), suggesting the presence of a common R-loop forming feature in, or around these loci23. As has been reported in Saccharomyces cerevisiae24and human cells25, R-loops are not restricted to nuclear genes, since we also detect them in the mitochondrial genome of C. elegans (Supplementary Fig. 4b, c). In Drosophila, R-loops are highly enriched in repetitive DNA sequences such as satellite DNAs26. We also observed that a substantial number of R-loops arise within satellite DNA regions, preferentially at centromeric heterochromatin, in the F2 descendants of starved AMPK mutants (Supplementary Fig. 5e).
GC skew also contributes to R-loop formation in humans19,26. The genomic sequence present within the extracted R-loop peaks from 1 kb up- and downstream exhibit strong GC skew in C. elegans, especially in the regions adjacent to the R-loop peaks (Fig. 3a). Although AT skew27 was not revealed in our genome-wide analysis, we did note that there are a substantial number of R-loop-associated sequences with strong AT skew in C. elegans (Supplementary Fig. 6), especially in the F2 descendants of starved AMPK mutants (Fig. 3b). Two polypyrimidine sequences and two poly(A) tracts were identified by HOMER de novo motif analysis (Fig. 3b) in AMPK mutant versus WT DRIP-seq, suggesting the possibility that a subset of R-loops can form in sequences associated with trans-splicing and polyadenylation23,24, but only in the F2 descendants of previously starved animals that lack all AMPK signalling28,29.
Our genome-wide analyses indicate that overall R-loop intensity in C. elegans is most pronounced at the promoter-TSS region (Fig. 3c and Supplementary Fig. 7a-c). Notably, R-loops accumulating at the promoter-TSS region occupy ~ 2.4% of the genome in the F2 descendants of previously starved AMPK mutant animals, compared to only ~ 1.7% in WT. Besides the promoter-TSS regions, R-loops at transcription termination sites (TTSs) occupied nearly ~ 20% of the total peaks (Fig. 3c), which is comparable to estimates of prevalent and conserved R-loop formation at promoter and terminator regions of Pol II-dependent genes in human and mouse genomes30,31. Among those genes with common promoter-associated R-loops in WT and the F2 descendants of starved AMPK mutant animals, only ~ 5.6% share R-loops at the corresponding TTS regions (Supplementary Fig. 7d), suggesting that distinct mechanisms are involved in the regulation of R-loop dynamics at these two intragenic hotspots.
In contrast, the proportion of R-loop peaks in the intron and intergenic regions was relatively low in both samples (Fig. 3c and Supplementary Fig. 7a, b). Notably, genome-wide R-loop signal distribution was increased over intergenic regions following AMPK depletion (Fig. 3d), suggesting potential functions of AMPK in regulating R-loop formation potentially by restricting the expression of intergenic sequences32. Considering that our H3K4me3 ChIP-seq and DRIP-seq signals have similar distribution patterns at the promoter-TSS regions in the F2 descendants of starved AMPK mutant animals (Fig. 1e and Fig. 3c), and that genes with promoter-localized R-loops coincide with significantly higher transcription activity of the resident genes30, our data suggest that the increased abundance of conserved and ectopic H3K4me3 deposition that we observe in the F2 descendants of starved AMPK mutant animals positively correlates with the formation and/or maintenance of R-loops (Fig. 3e, f).
Persistent, unresolved R-loops can promote DNA damage and genomic instability26,33. Elevated R-loop levels strongly correlate with sites of replication-stress-induced DNA damage34. We noted that the F2 descendants of starved AMPK mutant animals were hypersensitive to hydroxyurea (HU) and the alkylating agent methyl methanesulfonate (MMS), both of which are genotoxic drugs that compromise DNA replication35 (Fig. 4a,b and Supplementary Fig. 8a). These findings suggest that starved AMPK mutant descendants are hypersensitive to DNA damage and/or replication stress, potentially due to the increased abundance of R-loops.
The RecA ortholgue RAD-51 recombinase plays a critical role in the recognition of damage sites and their eventual repair36,37. During meiosis, the DSBs typical of meiotic intermediates are also recognized and bound by RAD-51 in the C. elegans germ cells38,39. We observed considerably more RAD-51 foci (markers of DSB repair intermediates) in each zone of the adult germ line in the descendants of starved AMPK mutants compared with WT (Fig. 4b,c). Moreover, we noticed that nuclei with highly abundant RAD-51 foci often appeared in the AMPK mutant germ lines (Fig. 4c), suggesting that these nuclei may have irreparable damage and are destined for elimination. Indeed, we observed an increased abundance of CED-1::GFP positive cells indicating that the frequency of apoptotic cell death21 was significantly enhanced in the germ lines of the descendants of starved AMPK mutants (Fig. 4d and Supplementary Fig. 8b).
If the additional RAD-51 foci are due to R-loop-induced DNA damage, at least some of these additional RAD-51 foci may localize to R-loop containing loci. We therefore determined the physical overlap between RNA-DNA hybrids and RAD-51 in the germ lines of the descendants of starved AMPK mutants using a proximity ligation assay (PLA) that detects interactions to a limit 30 ~ 40 nm40. Our results indicate that RAD-51 is localized to R-loops at some of these supernumerary RAD-51 foci, while positive PLA signals were undetectable in WT germ lines (Fig. 4e and Supplementary Fig. 8c). These results suggest that DNA damage is likely induced at these R-loops, potentially due to transcription-replication conflicts that arise in the mitotic zone and persist, or alternatively, it may represent collateral sites of damage that arise during the process of R-loop resolution41. If RAD-51 is sequestered at these sites it could potentially limit is normal involvement in meiotic break resolution, potentially contributing to the sterility observed in the AMPK mutant adults that were previously starved as emergent larvae. To determine if RAD-51 might be limiting in the germ line due to its sequestration at R-loops, we drove the expression of an extrachromosomal array of RAD-51 in the germ cells of AMPK mutants. Providing additional copies of RAD-51 partially suppressed the sterility of AMPK mutant hermaphrodites that were subjected to acute starvation during the L1 stage (Supplementary Fig. 8d). Taken together, these data suggest that C. elegans larvae that lack all AMPK signalling ectopically deposit H3K4me3 marks that trigger aberrant transcriptional elongation resulting in an increased frequency of R-loop formation during periods of energy stress. These sites are transmissible along with their H3K4me3 marks, and are prone to DNA damage leading to RAD-51 sequestration, thus resulting in abnormal reproductive development, enhanced germline apoptosis, and ultimately culminating in sterility (Fig. 4g).