Engineering ABEmax with reduced RNA deaminase activities
Considering that cellular RNAs with wide range of aneuploidy copies have been discovered as RNA off-target substrates of ABEs [6–9], we reasoned that Mutect2, a GATK tool for sensitive detection of somatic point mutations in heterogeneous cancer samples [11], might be more appropriate than HaplotypeCaller (for detection of euploid germline SNPs) for detection of RNA edits [12] (Supplementary Fig. 1a). Thus, we reanalyzed ABEs- and their optimized variants (miniABEmax-V82G, ABE7.10-F148A, and ABEmaxAW)-induced RNA off-targets [6–9], and found that indeed, Mutect2 recovered 2.7-11-fold number of ABEs-induced RNA edits compared with that using HaplotypeCaller with similar editing signatures (endogenous A-to-I edits were deducted from control sequencing data), revealing these optimized ABE variants still retained a relatively large number of RNA edits (Supplementary Fig. 1b-g). Surprisingly, the overlapped RNA edits from HaplotypeCaller and Mutect2 were as few as 22–68% of HaplotypeCaller-calculated RNA off-targets (Supplementary Fig. 1h-m). We further performed Manhattan plotting for ABEmax-induced RNA off-targets [6] to show efficiency distributions of overlapped, HaplotypeCaller-specific, and Mutect2-specific RNA edits, respectively, demonstrating that the number of Mutect2-specific RNA edits was much more than HaplotypeCaller-specific edits, especially for those edits with 0–10% editing efficiency, which were ignored by HaplotypeCaller (Supplementary Fig. 1l). Meanwhile, lower overlapping ratio was discovered for the samples possessing fewer RNA edits (Supplementary Fig. 1m). Nine MuTect2-specific edits (with > 10% efficiency in RNA-seq data) were randomly selected for PCR validation with cDNAs subjected to RNA-seq experiment. Indeed, all of these amplicons were successfully detected with high or low efficiency of A-to-G mutations (Supplementary Fig. 1n), confirming the reliability of MuTect2-specific edits and suggesting that it is necessary to engineer ABEmax variants based on MuTect2 analysis.
Although an engineered TadA* has been evolved to be capable of deaminating DNA adenines [1, 2], both TadA* and wildtype TadA retain the ability to deaminate cellular RNAs [9]. We analyzed ABEmax-induced RNA edits in ABEmax-overexpressed HEK293T cells from a published RNA-seq dataset (Supplementary Fig. 2a-b) [6]. Meanwhile, we generated our data by co-transfecting HEK293T cells with a sgRNA to efficiently induce DNA A-to-G conversion within ABE site 8 (Supplementary Fig. 2c-e). The cells with highest 15% GFP signal were collected for on-target and off-target analysis. Higher overlapping ratios between two independent duplicates were observed for those A positions of RNA edits with higher editing efficiency, demonstrating the preferential affinity of ABEmax for highly edited RNAs (Supplementary Fig. 2a and 2d). Thus, we calculated the sequence logos for ABEmax-induced RNA edits with differential scope of editing efficiency, showing that higher-edited adenines preferentially located within a conserved motif being more close to UACGA (Supplementary Fig. 2b and 2e), which highly resembles the conserved loop region of tRNA substrate for ecTadA [3]. These data demonstrate that, consistent with a recent report [9], ABEmax induces efficient transcriptome-wide off-target RNA editing harboring core E.coli tRNA-like sequences.
Therefore, we hypothesized that disruption of the interaction between TadA/TadA* heterodimer and tRNA-loop structure may interfere the catalytic activities of ABEmax on RNA. Since there is no crystal structure information for the complexing between ecTadA and tRNA, we referred the co-crystal structure of Staphylococcus aureus TadA (saTadA) and tRNA as well as the alignment of the conserved amino acid sequences between ecTadA and saTadA with high similarity [3, 13], showing that the amino acids possibly responsible for interaction with tRNA are conserved between the two types of TadA (Fig. 1a and Supplementary Fig. 3a). Thus, we introduced a series of point mutations into either the TadA or TadA* monomer of ABEmax according to the interacting interface between homodimerized TadA and tRNA [3] to disrupt TadA/TadA* and tRNA interactions [3], and measured their RNA and DNA editing activities (Fig. 1b). To facilitate this test, we generated a robust reporter by cloning the E.coli Hokb (ecHokb) gene containing tRNA-like CTACGAA sequence, which has been reported to be highly edited by ecTadA at RNA levels [14], into a CMV promoter-driven vector. Then, this reporter was co-transfected with a sgRNA targeting HEK site 3 and ABEmax or its mutated variants, and the A-to-G editing efficiencies in ecHokb cDNA (reversely transcribed from mRNA) or genomic DNA (gDNA) were determined by deep sequencing on ecHokb cDNA or gDNA amplicons. It showed that both ABEmax and 2xTadA induced highly efficient RNA but not DNA editing within ecHokb locus. Notably, we identified three variants (N46A, H57A, and R153P) with substantially decreased RNA editing activities, especially R153P with most reduced RNA edits comparable to the negative Cas9n control (Fig. 1c and Supplementary Fig. 3b; the endogenous RNA A-to-I edits detected in native HEK293T cells were deducted). In addition, their DNA on-target editing activities were retained (Fig. 1d). Moreover, similar to ABEmax [2], all variants induced very few by-products and indels (Supplementary Fig. 3c-d). Three amino acids, including N46, R153, and the reported site E59 [6], were likely in close contact with tRNA near the enzymatic pocket in structural prediction (Supplementary Fig. 3e). Additionally, ABEmax-R153P variant exhibited comparable DNA on-target A-to-G editing activities for multiple target sites in human cells (HEK293T and U2OS cells) (Supplementary Fig. 4a-b). Thus, we identified three variants, especially ABEmax-R153P, with minimized RNA editing activities in the reporter assay.
Next, we performed RNA-seq analysis to detect transcriptome-wide RNA off-targets induced by ABEmax or R153 substituted variants (R153P, R153P/R153P*, and R153A/R153A*) (Supplementary Fig. 4c). Surprisingly, the total number of RNA off-targets induced by R153P/R153P*, and R153A/R153A* were comparable with ABEmax (Fig. 1e by MuTect2; a bit lower by HaplotypeCaller in Supplementary Fig. 4d), and R153P variant induced even more RNA edits, which was excluded in further test (Supplementary Fig. 4e). Actually, R153P/R153P* and R153A/R153A* was test after the observation of high RNA A-to-I edits induced by R153P variant. Sequence logos derived from highly edited adenines (> 20%) showed that the preference of ABEmax for a consensus TACG motif was diminished upon R153 substitution (Fig. 1f) and the number of RNA edits with > 40% efficiency for R153P/R153P* and R153A/R153A* was much fewer than ABEmax (Fig. 1e and Supplementary Fig. 4f), suggesting that R153P/R153P* and R153A/R153A* mainly affect highly edited RNA edits harboring conserved UACG motif. These data demonstrate that interfering R153 can minimize the RNA-editing catalytic activities of TadA on tRNA loop-like structures (Fig. 1e).
Deletion of Arginine 153 (del153) reduces RNA off-targeting activities in upgraded ABEs
Considering the importance of R153 for deamination (Fig. 1), we tried to generate upgraded ABEs with reduced RNA off-targets by deleting R153 from both TadA and/or TadA* within ABEmax (del153/del153*) or mini ABEmax (fused by TadA* and Cas9n [9] (mini del153). As expected, we demonstrated that compared with ABEmax or mini ABEmax, the RNA off-targets induced by del153/del153* and mini del153 were largely decreased, and there were as few as 291 (MuTect2) or 98 (HaplotypeCaller) RNA A-to-I edits for mini del153 group (Fig. 2a-c; Supplementary Fig. 5a), while both variants retained a relatively high DNA on-targeting activities (Fig. 2a). We also overlapped or merged the ABEmax-, del153/del153*-, or mini del153-induced RNA A-to-I edits using HaplotypeCaller and MuTect2, respectively. Compared with ABEmax, both del153/del153* and mini del153 induced remarkably decreased RNA edits of the overlapped, HaplotypeCaller-specific, MuTect2-specific, and merged edits (Supplementary Fig. 5b). Manhattan plots and histograms further confirmed that both the number and efficiency for del153/del153*- and mini del153-induced RNA A-to-I edits were strikingly decreased (Fig. 2b-c), accompanying with much lower mean frequencies throughout the transcriptome (Supplementary Fig. 5c).
Next, we compared our upgraded ABE (uABE) variants with reported variants possessing reduced RNA off-targeting activities, including ABEmaxAW (E59A/V106W*) [7], ABE-F148A/F148A* [8], and SECURE-ABEs (miniABEmax-K20A/R21A and miniABEmax-V82G (mini V82G)) [9]. The results showed that our upgraded variants, del153/del153* and mini del153 induced much fewer RNA off-targets by both MuTect2 and HaplotypeCaller tools under same experimental conditions (Fig. 2a; Supplementary Fig. 5a) (with much fewer RNA edits by using HaplotypeCaller).
We further characterize the DNA on-target editing activities of del153/del153* and mini del153 variants for another eight sites. It demonstrated that the DNA on-targeting activities of del153/del153* and mini del153 were highly similar to ABEmax for nearly all of detected sites, except for ABE site 12 with a bit lower but satisfactory editing efficiency (Fig. 2d; Supplementary Fig. 6a). Similar to SECURE-BE3-induced RNA C-to-U edits with perfect reducing effect [6], del153/del153* and mini del153 variants-induced RNA A-to-I edits were decreased to only dozens or hundreds of off-targets when using HaplotypeCaller (Supplementary Fig. 5a). Comprehensively considering the on-targeting and off-targeting activities of engineered variants, we designate del153/del153* and mini del153 as our best optimized ABE variants with minimized RNA editing activity. Recently reported ABE8e and ABE8s containing evolved mutations within TadA/TadA* possess increased DNA on-targeting activities as well as elevated RNA off-targeting activities [15, 16]. We tried to generate upgraded ABEs with higher DNA on-targeting and lower RNA off-targeting activity by deletion of R153 from ABE8e or ABE8s, demonstrating that RNA off-targets were remarkably decreased from HaplotypeCaller or MuTect2 calculations, and the number of ABE8s-del153 exhibited comparable number of RNA edits with mini ABEmax (Fig. 2e and Supplementary Fig. 6b). Notably, ABE8s-del153 and ABE8e-del153 showed comparable or slight lower levels of DNA A-to-G editing activities; however, the on-targeting activity of f ABE8e/8 s or ABE8e/8 s-del153 was much higher than ABEmax or mini ABEmax, and the editing window of ABE8e/8 s or ABE8e/8 s-del153 was also much wider than ABEmax or mini ABEmax (Fig. 2f; Supplementary Fig. 6c). In collection, we propose that deletion of R153 is a good strategy for reducing RNA off-targeting activities in upgraded ABEs.