Optimization of CRISPR-Cas12f-based gene activation system
To construct an effective CRISPR activator to induce the expression of target genes inside cells, we connected the trans-activator domain VPR to the dead version of the CWCas12f (dCWCas12f, D354A) system15. The CRISPR-Cas12f system, in contrast to previously reported Cas12 effectors, has a relatively small size, which is pertinent for the core function of target DNA binding and cleavage within a single component19. Ideally, it recognizes the target DNA in a homodimeric form, allowing the recruitment of two copies of the VPR module to the upstream region of the target gene per Cas12f molecule (Fig. 1a). To optimize the operation of the dCWCas12f-VPR system within target DNA sequences, various configurations of linkers and nuclear localization signals (NLS) were employed to connect dCWCas12f and VPR (Fig. 1b; Supplementary Fig. S1). Additionally, plasmids inducing fluorescent signals within human-derived cells were co-delivered to compare their activities (Fig. 1c, d). When analyzing the activity of the dCWCas12f-VPR system in human-derived HEK293FT cells following transfection with plasmids harboring specific target sites (upstream sequence for TSS of NLRC4 and KLH29 genes) (Supplementary Table S1), normalizing the activity to GFP fluorescence intensity and comparing it to dTomato fluorescence intensity, it was observed that V3 exhibited relatively higher values (Mean fluorescence intensity 1.82 (24 h)–2.59 (48 h) for NLCR4, 0.70 (24 h)–1.60 (48 h) for KLH29), as shown in Supplementary Fig. S1. When different guide RNA versions (ge4.0 or ge4.1) were used, the ge4.1 version of the guide RNA yielded a relatively higher average fluorescence intensity (Fig. 1c, d). Therefore, we conducted additional follow-up experiments using the ge4.1 version of the guide RNA and the V3 version of the dCWCas12f-VPR system, which exhibited the highest value of average dTomato fluorescence for the various genes analyzed.
Further engineering of CRISPR-Cas12f system for enhanced regulation of gene expression
In the previous experiment, the optimized combination of linker types and NLS with the V3 version of the dCWCas12f-VPR system, along with the ge4.1 guide RNA, was used to improve the efficiency of gene expression activation on actual genomic sites in human-derived cells (Fig. 1e, f). The introduction of amino acid mutations in a similar Cas12f system led to an increase in the targeting efficiency of Cas12f toward the target DNA sequences18; therefore, identical mutations ((D171R, T175R, E179A) or (D171R, T175R, K358R, E556R)) were introduced at conserved amino acid positions within the dCWCas12f-VPR system to further enhance its efficiency (Supplementary Fig. S2a). Moreover, these mutant versions were further combined with the FUS-IDR domain28, which is known to recruit transcription initiation factors through phase separation, yielding four different versions (3.1–3.4) of the enhanced dCWCas12f-VPR system (Supplementary Fig. S2b). Using these four different versions, which were created via the introduction of various forms of mutations and FUS-IDR domain conjugation, gene expression activity in the upstream region of SOX10 in a human-derived cell line was compared and analyzed through tiling assays (Fig. 1e; Supplementary Fig. S3, Supplementary Table S1). When the 3.3 version of the dCWCas12f-VPR system was introduced into cells to induce gene expression, it consistently exhibited the highest average induction efficiency at various positions in the TSS upstream region (Fig. 1f). Additionally, this version exhibited the highest expression efficiency in the target site (sg3) of the TSS upstream region among the different versions. Notably, the enhanced 3.3 version of the dCWCas12f-VPR system (referred to as eCas12f-VPR) demonstrated an efficiency comparable to that of previously reported gene expression activation (CRISPRa) systems based on Cas9 or Cas12a (Fig. 1f).
Targeted activation of endogenous gene expression across multiple genes using Cas12f-based activator
We investigated whether the previously improved eCas12f-VPR system could universally induce gene expression at various genomic sites (CD2, CXCR4, HBB, IL1RN, ASCL1, and HBG) in human-derived cells (Fig. 2, Supplementary Table S1). To directly compare the regulatory effects of the CRISPR-Cas module on gene expression with existing CRISPR activators, we compared it with the well-characterized Cas12a system12. The previously reported LbCas12a-VPR system recognizes a T-rich PAM (TTTV) similar to the PAM (TTTR) of the eCas12f-VPR system, facilitating the direct comparison of gene expression efficiency at the same genomic sites (Supplementary Fig. S4, Supplementary Table S1). Across all tested gene loci, the eCas12f-VPR system exhibited an overall superior gene expression amplification efficiency compared to the LbCas12a-VPR system. Application of the LbCas12a-VPR and eCas12f-VPR systems resulted in an average of 29.72-fold and 965.46-fold induction of gene expression, respectively. When applying the eCas12f-VPR system, the gene expression amplification for CD2, CXCR4, HBB, IL1RN, ASCL1, and HBG increased by an average of approximately 29.61-, 1.46-, 5.08-, 1.99-, 3.95-, and 242.90-fold compared to that of the LbCas12a-VPR system (Fig. 2a-f, Supplementary Table S2). Notably, for the target genes HBB, ASCL1, and HBG (Fig. 2c, e, f), for which the existing LbCas12a-VPR system was unable to induce significant gene amplification, the eCas12f-VPR system was effective in inducing gene expression. Similar to previous reports, for both the LbCas12a-VPR and eCas12f-VPR systems, we observed significant variability in the gene expression efficiency for different target DNA sequences within the TSS upstream region of each gene. Consequently, when comparing gene expression induction for the same target DNA sequences within the same gene (Supplementary Fig. S4), the eCas12f-VPR system demonstrated more effective gene expression than the LbCas12a-VPR system. Both systems exhibited significant efficiency variations depending on the target DNA sequence.
Highly specific activation of gene expression using Cas12f-based activator
To confirm both the effective gene expression amplification characteristics and specificity for the target genes, we compared the on/off-target gene expression profiles (on/off-target mRNA fold enrichment ratio) of the eCas12f-VPR system to those of the LbCas12a-VPR system (Fig. 3). We searched for common targets and respective off-target candidate sequences based on the PAM nucleotide sequences for various gene loci within human-derived cells (HBB, ASCL1, HBG, CD2) (Fig. 3a, e, i, m; Supplementary Table S3). Subsequently, we verified the induction of gene expression in both on- and off-target sequences. When the LbCas12a-VPR system was used, we observed average on/off-target ratios of 5.42-fold for HBB, 5.49-fold for ASCL1, 135.00-fold for HBG, and 298.88-fold for CD2 (Fig. 3b, f, j, n). However, when the eCas12f-VPR system was applied, the average on/off-target ratios were 23.18-fold for HBB, 61.60-fold for ASCL1, 11257.15-fold for HBG, and 250.71-fold for CD2 (Fig. 3c, g, k, o). When using the eCas12f-VPR system to induce gene expression, we observed an overall increase in target specificity based on the on/off-target ratios, with an average improvement of 26.06-fold when compared to the LbCas12a-VPR system.