Roles of DNAme, H3K9me3 and H2AK119ub1 in repression of germline genes in pESCs. Three distinct epigenetic marks, including DNAme, H3K9me3 and H2AK119ub1, deposited by de novo DNMT3A/3B, SETDB1 and PRC1.6, respectively, have been previously implicated in silencing of germline genes in pESCs and/or somatic cells in mice (Fig. 1A). To determine whether the 137 DMS germline genes upregulated in E8.5 Dnmt3a/3b DKO embryos12 (Supplementary Data 1) are hypermethylated as well as bound by SETDB1 and PRC1.6 in pESCs, we reanalyzed a series of published whole-genome bisulphite sequencing (WGBS) and ChIP-seq datasets (Supplementary Data 2). As expected, the CGI promoter regions of most DMS germline genes, including all GGD genes18, are methylated in E8.5 embryos as well as pESCs (Fig. 1B-C and Supplementary Fig. 1A-B). However, not all of the 137 DMS germline genes are enriched for MGA/MAX and/or E2F6 (Fig. 1B- C, Supplementary Fig. 1C), consistent with the fact that 82 and 97 of these genes include E-box or E2F motifs, respectively, in their promoter regions, with only 61 including both motifs (Supplementary Fig. 1D and Supplementary Data 1). Manual curation of the function of the 40 genes in ESCs enriched for both MGA (RPKM > 2) and E2F6 (RPKM > 1) and showing > 30% DNAme in the promoter region revealed 18 associated with meiosis and 16 associated with genome-defence/piRNA biogenesis (Supplementary Data 3), including all GGD genes. Intriguingly, the DNAme level of these MGA and E2F6 bound genes is generally lower than that of the DMS germline genes that are not bound by these transcription factors (TFs), a trend also observed in E8.5 embryos (Fig. 1B and Supplementary Fig. 1B).
Many of the DMS germline and all GGD genes are also enriched for H3K9me3, SETDB1 and HP1β (Fig. 1C-D and Supplementary Fig. 1E), as well as H2AK119ub1 and PRC1.6 complex subunits PCGF6 and RING1B (Fig. 1C, Fig. 1E and Supplementary Fig. 1F). These regions are also enriched for H3K27me3 and the PRC2 complex subunit EED (Supplementary Fig. 1G), likely reflecting H2AK119ub1-dependent recruitment of PRC214,60. TRIM28, on the other hand, showed only low levels of enrichment at DMS germline gene promoters, including those with high H3K9me3 (Fig. 1C and Supplementary Fig. 1E), consistent with a non-canonical mechanism of SETDB1 recruitment to these loci. Indeed, co-enrichment of H2AK119ub1 and H3K9me3 is relatively rare at genic promoters (Supplementary Fig. 1H). Thus, PRC1.6 bound genes, including GGD genes, are marked by an unusual combination of repressive histone PTMs and intermediate levels of DNAme in pESCs.
To determine whether these repressive complexes play a role in silencing of DMS germline (and in particular GGD) genes in pESCs, we next reanalyzed a series of published RNA-seq data (Supplementary Data 2), where key subunits of each of the complexes described above were depleted. Strikingly, DMS GGD genes as well as other methylated germline genes bound by MGA and/or enriched for H3K9me3 or H2AK119ub1 were de-repressed upon KO or knock-down (KD) not only of Mga, Max and Dnmt3a/3b, but also of Setdb1, Cbx1/3/5 and Pcgf6 (Fig. 2A-D and Supplementary Fig. 2A-C). These observations are consistent with previous studies showing that a subset of the DMS germline genes, including GGD genes, are de-repressed in pESCs following KO of L3mbtl2, another core PRC1.6 subunit40,45,61. To determine whether H2AK119ub1 is required for silencing, we generated a Ring1b conditional (c)KO line. GGD genes in particular were upregulated following Ring1b deletion in pESCs (Fig. 2E), to levels similar to those observed in Pcgf6 cKO pESCs (Fig. 2D and Supplementary Fig. 2C). In contrast, only modest upregulation of GGD genes was observed in Trim28 cKO ESCs (Supplementary Fig. 2B), consistent with the absence of KAP1 binding at these genes (Fig. 1C and Supplementary Fig. 1D). Similarly, no upregulation of DMS germline genes was observed in Eed cKO ESCs (Supplementary Fig. 2C), indicating that H3K27me3, while enriched at PRC1.6 bound promoters, is not required for silencing of these genes. These results suggest that in pESCs, repression of a subset of DMS germline genes is dependent not only upon DNAme, but also on MGA/MAX, SETDB1 and the canonical PRC1.6 complex. However, as pESCs are a mixed population of cells at distinct differentiation states53–55, it is not possible to deduce either the temporal order in which these silencing complexes act during development, or their inter-dependencies in mediating such silencing.
H2AK119ub1, H3K9me3 and H3K27me3 are sequentially deposited prior to de novo DNAme at DMS germline genes To investigate the dynamics of these histone PTMs as well as DNAme at DMS germline genes during early embryonic/PGC development, we analyzed previously published genome-wide datasets from 2-cell, morula, inner cell mass (ICM), epiblast, and E9.5 PGCs and carried out H3K9me3 ULI-ChIP-seq62 for E9.5 PGCs (Fig. 3A). In the promoter regions of many DMS germline gene loci, H2AK119ub1, H3K9me3 and H3K27me3 are already present in the ICM or earlier stages (Fig. 3B, Supplementary Fig. 3A), with H2AK119ub1 deposited at DMS germline and GGD genes as early as the 2-cell stage, followed by H3K9me3 in morula and ICM and H3K27me3 in ICM and E6.5 epiblast63,64 (Supplementary Fig. 3A-B). In contrast, DNAme is not deposited at these loci until the epiblast stage, as expected (Fig. 3B). As observed in pESCs, MGA and E2F6 bound genes show a lower level of DNAme in E6.5 epiblast than most other DMS germline genes (Supplementary Fig. 3C). While H2AK119ub1, H3K9me3 and H3K27me3 persist at DMS germline genes following implantation, H3K9me3 and H3K27me3 progressively decrease at these genes in PGCs, coincident with their DNA demethylation and upregulation in E13.5 PGCs (Fig. 3B-C). Thus, H2AK119ub1 and H3K9me3 are deposited prior to de novo DNAme at many DMS germline gene promoters, including all GGD genes, implicating DNA methylation-independent repression of these loci during pre-implantation development.
To further characterize the roles of these chromatin marks, we employed the PGCLC culture system65, in which nESCs, EpiLCs, day (d)4–6 PGCLCs and d4 plus 7 days of culture (d4c7) PGCLCs represent the in vitro counterparts of E3.5 ICM, E6.5 epiblast, E9.5-11.5 PGCs and E13.5 PGCs, respectively (Fig. 3A). As expected, while DNAme levels in nESCs are relatively low in the promoter regions of DMS germline genes, H2AK119ub1, H3K9me3 and H3K27me3 are clearly enriched at a subset of these genes, including GGD genes (Fig. 3B and Supplementary Fig. 3A-B). Consistent with PRC1.6 binding at their promoter regions, the enrichment of H2AK119ub1 in nESCs and EpiLCs is positively correlated with the enrichment levels of MGA and E2F6, as measured in pESCs and nESCs, respectively (Supplementary Fig. 3C).
Following EpiLC induction, these loci are de novo methylated and retain each of these histone PTMs, with enrichment modestly increased relative to nESCs. Mirroring the E6.5 epiblast, DNAme levels in EpiLCs are still lower at MGA and E2F6 bound genes (including GGD genes) relative to most other genes, including DMS germline genes that are not bound by these TFs (Supplementary Fig. 3D). These observations indicate that PRC1.6 bound promoters are refractory to the wave of de novo DNAme that occurs during epiblast formation. H2AK119ub1, H3K9me3 and H3K27me3 are subsequently reduced in d4-6 PGCLCs, while DNAme persists (Fig. 3B and Supplementary Fig. 3A-B). Despite the increase in repressive marks in EpiLCs, expression of the majority of the GGD genes transiently increases in EpiLC relative to nESCs, perhaps due to the simultaneous upregulation of positive regulatory factors that bind to their CGI promoters (Fig. 3C). Regardless, these genes are then downregulated in d4 PGCLCs, before their upregulation again in d4c7 PGCLCs, as expected. Based on these observations, we subsequently employed the nESC/EpiLC/PGCLC system to model the relative importance of and crosstalk between each repressive pathway during pre- and post-implantation development.
DNAme plays a more important role in silencing of DMS germline genes in EpiLCs than nESCs. Analysis of previously generated WGBS data from DNMT deficient E6.5 epiblast cells66 revealed a dramatic reduction in DNAme at DMS germline genes in both Dnmt1 KO and Dnmt3a/3b DKO embryos (Fig. 4A-B), confirming the importance of de novo DNAme at this stage. As observed in the ICM to epiblast transition, EpiLCs also show much higher levels of DNAme in the promoter regions of DMS germline genes than the nESCs from which they are derived67,68. However, nESCs show a modestly higher level of DNAme in these regions than E3.5 ICM (Fig. 4A-B). Notably, pESCs express DNMT3A as well as TET1, which mediates oxidation of 5-methylcytosine (5mC) to 5-hydroxymethylcytosine (5hmC) in an active DNA demethylation pathway69. As bisulphite sequencing cannot discriminate between 5mC and 5hmC, we surmised that the low level of DNAme observed in DMS germline genes in nESCs may reflect the presence of both 5mC and 5hmC. Analysis of previously published 5hmCIP-seq data70 revealed that relative to all other genes in nESCs, the promoters of DMS germline genes, including GGD genes, are enriched for 5hmC (Supplementary Fig. 4A-B). Surprisingly, the levels of 5hmC enrichment are similar between nESCs and EpiLCs, and analysis of previously published ChIPseq data also reveals similar levels of TET1 enrichment between these cell types70,71. Taken together with the fact that WGBS indicates significantly higher levels of DNAme in EpiLCs, these data suggest that the ratio of 5hmC to 5mC is higher in nESCs. Closer inspection of these loci reveals that 5hmC and TET1 peaks overlap with each other and with MGA/MAX and E2F6 in the promoter region, corresponding to regions of reduced DNAme relative to flanking CpGs (Supplementary Fig. 4C). Thus, the low levels of DNAme observed in the promoter regions of GGD genes in nESCs represents a combination of 5mC and 5hmC, likely reflecting reiterative de novo DNAme followed by oxidation of this mark under these culture conditions.
To directly address the role of DNAme in silencing of DMS germline genes in nESCs, we employed a line harboring floxed alleles of Dnmt1, Dnmt3a and Dnmt3b72. Following the addition of 4-hydroxytamoxifen (4OHT) for four days, RNA-seq was conducted on control or Dnmt1/3a/3b conditional triple (cT)KO nESCs, or EpiLCs (isolated after two days of differentiation) (Fig. 4C). As expected, given the low levels of DNAme in nESCs, only 14/137 DMS germline genes showed > 30% promoter DNAme in control nESCs and were upregulated in Dnmt cTKO nESCs (Fig. 4D). The observed upregulation of the GGD genes Dazl and Piwil2, as well as the control gene Hoxb1 (Fig. 4D-E), is likely an indirect effect, as each of these genes show very low levels of DNAme in nESCs. In contrast, 74/137 DMS germline genes showed > 30% promoter DNAme in control EpiLCs and were upregulated in Dnmt cTKO EpiLCs (Fig. 4D). These include GGD genes Slc25a31, Taf7l and Tex19.1, which were only modestly upregulated in DNAme deficient nESCs (Fig. 4E). Intriguingly, the remaining GGD genes Ddx4, Mael and Mov10l1, which ranged in promoter DNAme levels from 5.5–21.3% and 19.5–67.3% in WT nESCs and EpiLCs, respectively, remained repressed following Dnmt cTKO in both cultures. Thus, consistent with the wave of global de novo DNAme that takes place with EpiLC induction, DNAme play a more important role in silencing of DMS germline genes in EpiLCs than nESCs, including several GGD genes. However, a subset of GGD genes is silenced independent of DNAme even in EpiLCs.
PRC1.6 plays a more important role than DNAme in silencing of GGD genes in nESCs. To investigate the potential roles of H3K9me3 and H2AK119ub1 in the regulation of DMS germline genes, we carried out ChIP-seq on a line that expresses a truncation mutant of MGA from the native locus that lacks the bHLHZ domain73, as well as Setdb1- and Pcgf6 cKO nESCs and EpiLCs. As conditional KO of Setdb1 in nESCs leads to a dramatic reduction in viability by day three in culture74, cKO lines were harvested following two days of 4OHT treatment (Supplementary Fig. 5A). Strikingly, H3K9me3 levels were reduced at DMS germline and GGD genes in both nESCs and EpiLCs of all three mutant lines (Fig. 5A), with the effect more pronounced for Mga-ΔHLH and Pcgf6 cKO in EpiLCs. Thus, both the bHLHZ DNA-binding domain of MGA and the PCGF6 core subunit of PRC1.6 are required for SETDB1 recruitment and/or activity at these loci. While H2AK119ub1 at DMS germline and GGD genes was also significantly reduced in Pcgf6 cKO nESCs and EpiLCs, as expected, this mark was unaffected in the Mga-ΔHLH and Setdb1 cKO lines (Fig. 5B). This suggests that neither MGA bHLHZ-dependent E-box-binding nor SETDB1 are required for persistence of H2AK119ub1 at these loci.
RNA-seq analyses of these KO nESCs and EpiLCs revealed varying levels of de-repression of DMS germline genes (Fig. 5C and Supplementary Data 4). Mga-ΔHLH showed the most consistent upregulation of GGD genes in nESCs and EpiLCs (Fig. 5D). Dnmt cTKO in nESCs resulted in only modest upregulation of most GGD genes when compared to Mga-ΔHLH and Pcgf6 cKO nESCs. For several of these genes, DNAme is apparently dispensable for silencing in EpiLCs as well, whereas PRC1.6 is required for transcriptional repression in both nESCs and EpiLCs. However, for 3 of the GGD genes, namely Slc25a31, Taf7l and Tex19.1, a more robust upregulation is clearly evident in Dnmt cTKO EpiLCs than nESCs. Consistent with this trend, DESeq2-based TCC (Tag Count Comparison) analysis75 of Dnmt cTKO EpiLCs yielded 70 upregulated DMS germline genes, versus only 28 in Dnmt cTKO nESCs (Supplementary Data 4). The increased level of DNAme in the CGI promoter regions of these genes (Fig. 4E) is likely responsible for the greater impact of Dnmt cTKO in EpiLCs. Furthermore, the engagement of DNAme-mediated repression may explain the reduced impact of Pcgf6 deletion on expression of all of the GGD genes in EpiLCs relative to nESCs (Fig. 5D).
As Pcgf6 deletion disrupted both H2AK119ub1 and H3K9me3 deposition, we next wished to determine the relative importance of these histone PTMs in transcriptional repression. We carried out RNA-seq on Setdb1 cKO, Ring1b cKO as well as Setdb1/Ring1b (Set/Ring) cDKO nESCs and EpiLCs, as above (Supplementary Fig. 5A). DESeq2-based TCC analysis yielded a similar number of upregulated DMS germline genes in Ring1b cKO versus Pcgf6 cKO nESCs or EpiLCs (Supplementary Fig. 5B and Supplementary Data 4). However, DMS germline and GGD genes showed a greater level of de-repression in Setdb1/Ring1b cDKO than individual Ring1b or Setdb1 cKO nESCs and EpiLCs (Fig. 5C-D and Supplementary Fig. 5C-D), with the fold-increase in expression for many GGD genes in Setdb1/Ring1b cDKO nESCs similar to that observed in Pcgf6 cKO nESCs (Fig. 5D and Supplementary Fig. 5D). These results suggest that RING1B and SETDB1 act in a combinatorial manner to repress PRC1.6 bound genes and are consistent with the observation that loss of PCGF6 disrupts both H2AK119ub1 and H3K9me3 deposition. However, unlike the Pcgf6 cKO, Setdb1/Ring1b cDKO EpiLCs show a level of de-repression that does not decrease relative to nESCs, indicating that SETDB1 is still essential for robust silencing of these genes even in the presence of a higher level of DNAme in their promoter regions.
Intriguingly, the fold-change in expression of most GGD genes in Mga-ΔHLH EpiLCs far exceeds that observed in EpiLCs generated from the other KO lines (Fig. 5D and Supplementary Fig. 5D), indicating that MGA/MAX may act at the nexus of PRC1.6, SETDB1 and de novo DNAme. To determine whether MGA is indeed required for de novo DNAme, we isolated genomic DNA from Mga-ΔHLH EpiLCs and conducted bisulphite sequencing on the CGI promoter regions of Mael and Mov10l1. Relative to control EpiLCs, DNAme levels decreased significantly at both genes, as observed in Dnmt cTKO EpiLCs (Fig. 5E, Supplementary Fig. 6A-B). Since these genes are not de-repressed in Dnmt cTKO EpiLCs (Fig. 4E, Fig. 5D), these observations indicate that even when present, DNAme is not necessarily required for silencing of GGD genes in EpiLCs. Only a moderate decrease or no change in DNAme was observed in these regions in Pcgf6-, Setdb1- or Ring1b cKO EpiLCs (Fig. 5D and Supplementary Fig. 6A-B), indicating that H3K9me3 and H2AK119ub1 may play redundant roles in potentiating de novo DNAme of these loci. To directly address whether PRC1.6 is indeed required for de novo DNAme of DMS germline gene promoter regions in vivo, we reanalyzed published WGBS data from Ring1b or L3mbtl2 KO E6.5 epiblast66. A subset of these DMS germline genes showed a modest decrease in DNAme in both KOs relative to controls, though not to the extent observed in Dnmt3a/3b DKO E6.5 epiblast (Fig. 5F). This includes the promoter regions of all GGD genes, with the exception of Dazl, which retains high levels of DNAme in both mutants (Fig. 5G). Taken together, these observations suggest that MGA/MAX binding potentiates de novo DNAme at DMS germline genes during epiblast formation, likely via establishment of a repressed chromatin state.