QC of RNA-seq data.
On average, 50 million paired-end reads were sequenced per sample, and related QC metrices are shown in Supplementary Fig. S2. There is a total of 12 samples across all six time points, with two replicates per time point. It was a high-quality dataset in terms of read mapping and counting summaries (Fig. S2A and Fig. S2B). All samples clearly clustered by time points (Fig. S2C and Fig. S2D), as was expected and consistent with previous differential analysis12.
Isoform switch during T cell activation.
The isoform quantification was performed by using the computational algorithm Salmon20. By applying the isoform switch protocol described in the Methods Section, we identified eight genes with the most dominant isoform switching during T cell activation (Table 1 and Supplementary Fig. S3). We wanted to determine the functional relevance of the isoform switches in Table 1. Consistent with previously published data linking AS to cell cycle regulation25, seven of these eight genes either directly control cell cycle progression or are oncogenes. CDKN2C inhibits the activity of the cyclin D-CDK6 complex thus blocking the G1-to-S phase transition28. CENPM is a necessary component of the centromere, and conditional deletion halts cell division leading to cell death29,30. FBXO5 controls multiple cell cycle transitions as well as homologous DNA repair31–33. Methylation of piRNAs by HENMT1 is required for transposable element repression during germ cell division34. NET1 directly sequesters the phosphatase Cdc14 to allow cyclin-dependent kinase activity throughout the cell cycle35–37. SHLD1 is a component of the shieldin complex responsible for non-homologous end joining in the TP53 DNA damage repair response38,39. UNG is the major nuclear glycosylase responsible for removing mutagenic uracil from DNA during base excision repair and is necessary for class-switch recombination in B cells40,41. We selected CDKN2C, FBXO5, NT5E and NET1 for further exploration.
Table 1
Eight genes with isoform switch during T cell activation
Gene
|
Description
|
Location
|
Type
|
CDKN2C
|
cyclin dependent kinase inhibitor 2C
|
Nucleus
|
transcription regulator
|
CENPM
|
centromere protein M
|
Cytoplasm
|
other
|
FBXO5
|
F-box protein 5
|
Nucleus
|
enzyme
|
HENMT1
|
HEN methyltransferase 1
|
Cytoplasm
|
enzyme
|
NET1
|
neuroepithelial cell transforming 1
|
Nucleus
|
other
|
NT5E (CD73)
|
5'-nucleotidase ecto
|
Plasma Membrane
|
phosphatase
|
SHLD1
|
shieldin complex subunit 1
|
Nucleus
|
other
|
UNG
|
uracil DNA glycosylase
|
Nucleus
|
enzyme
|
Isoform switch in CDKN2C (Cyclin-Dependent Kinase Inhibitor 2C)/p18(INK4C).
CDKN2C inhibits T cell proliferation in response to TCR stimulation by binding to CDK6. This block is overcome through CD28 costimulation42,43. In resting T cells, the isoform CDKN2C-202 is dominant. Within two hours of activation through both the CD3 and CD28 pathways, levels of this isoform rapidly diminish. Then at 72 hours post-activation expression of the CDKN2C-203 isoform increases to predominate (Fig. 2A). Based on these data we hypothesized that the CDKN2C-202 isoform lacked the domains necessary to inhibit CDK6 thus leaving non-terminal effector T cells primed for proliferation in response to TCR stimulation.
CDKN2C encodes five ankyrin repeats44. The second and third form the inhibitory interface with CDK645. Alignment of the splice variants demonstrated that the CDKN2C-203 isoform has a single amino acid truncation of the fourth ankyrin domain and completely lacks the fifth ankyrin domain (Fig. 2B). Thus, both CDKN2C isoforms possess the second and third ankyrin domains, which are necessary for CDK6 binding. Another hypothesis we explored was that the CDKN2C-203 variant encodes an unstable CDKN2C isoform. Truncation mutants of CDKN2C lacking the fifth ankyrin domain appear unstable as this deletion construct produces little protein44,46. This CDKN2C isoform switch is likely important to CDK6 regulation and the generation of adaptive immunity while preventing lymphomas and inflammatory diseases42,43.
Isoform switch in FBXO5/FBX5 (F-box protein 5)/EMI1 (Early Mitotic Inhibitor 1).
FBXO5 is encoded at the minus strand, and has two expressed protein coding isoforms, i.e. FBXO5-201 and FBXO5-202 (Fig. 3A). FBXO5-202 is the dominant isoform at early time points, but FBXO5-201 becomes the dominant isoform at 72hr (Fig. 3A). The high expression level of FBXO5-201 at 72hr is evident from the sequence read coverage profile (Fig. 3B). Note FBXO5 also has a non-coding isoform FBXO5-203 that is barely expressed. After protein translation, FBXO5-201 is 46 amino acids longer than FBXO5-202 in the N-terminus. These 46 amino acids encode three key features: a putative signal peptide (Fig. 3C) and two potential Cdk phosphorylation sites. As a member of the F-box protein family, FBXO5 has several protein-protein interactions that could be affected by this isoform switch31–33. Despite a large body of work on FBXO5, the functional consequences of these three features are unknown.
FBXO5 is involved in the osteogenic differentiation of mesenchymal stem cells (MSCs)47. The expression of FBXO5 was upregulated after osteogenic induction in human periodontal ligament stem cells (hPDLSCs). FBXO5 knockdown attenuated migration, inhibited alkaline phosphatase (ALP) activity and mineralization, and decreased RUNX2, OSX, and OCN expression, while the overexpression of two transcript isoforms significantly accelerated migration, enhanced ALP activity and mineralization, and increased RUNX2, OSX, and OCN expression in hPDLSCs. It was concluded that both FBXO5-201 and FBXO5-202 promoted the migration and osteogenic differentiation potential of hPDLSCs, which identified a potential target for improving periodontal tissue regeneration. However, whether the two isoforms have different biological roles, especially during T cell activation, remains unclear.
Isoform switch in NT5E (5'-nucleotidase ecto)/CD73 (cluster of differentiation 73).
The protein encoded by this gene is a plasma membrane enzyme that catalyzes the conversion of extracellular AMP to adenosine. The encoded protein is used as a determinant of lymphocyte differentiation. Defects in this gene can lead to the calcification of joints and arteries. The two CCDS-validated transcripts of NT5E are NT5E-201 and NT5E-203, which differ with respect to the presence of exon 7 in NT5E-201 (Fig. 4). NT5E-201 encodes canonical CD73, denoted as CD73L, while the NT5E-203 transcript is predicted to encode a shorter protein CD73S. Human CD73S lacks amino acids 404–453, encoded by the missing exon 7. The dominant isoform is NT5E-203 at 2 and 4hr, but it is NT5E-201 at 0 and 72hr (Fig. 4). CD73S was expressed at low abundance in normal human tissues but was significantly up-regulated in cirrhosis and hepatocellular carcinoma (HCC)48. These two human isoforms exhibited functional differences, such that ectopic expression of canonical CD73L in human HepG2 cells was associated with decreased expression of the proliferation marker Ki67, whereas CD73S expression did not have an effect on Ki67 expression48. CD73S was glycosylated, catalytically inactive, unable to dimerize, and complexed intracellularly with the endoplasmic reticulum chaperone calnexin. Furthermore, CD73S negatively regulates CD73L activity and protein expression in a proteasome-dependent manner48. It remains unclear the roles of CD73L and CD73S in T cell activation, though new data suggest that CD8 + CD73 + T cells may be especially important mediators of immunosuppression in human head and neck cancer49.
Isoform switch in NET1 (Neuroepithelial cell transforming 1).
The gene NET1 is part of the family of Rho guanine nucleotide exchange factors. Members of this family activate Rho proteins by catalyzing the exchange of GDP for GTP. The protein encoded by this gene interacts with RhoA within the cell nucleus and may play a role in repairing DNA damage after ionizing radiation. Alternative splicing results in multiple transcript variants that encode different protein isoforms. Compared with NET1-202, the expression for NET1-201 is low at early time points but increases significantly and become the dominant isoform at 24 and 72hr. NET1-201 and NET1-202 display distinct exon usage in their 5’ ends (Fig. 5A). We performed Clustal Omega alignment of the predicted protein sequences of NET1-201 and − 202, and found that NET1-202 completely lacks the first nuclear localization sequences (NLS) in its N-terminus and that there is poor conservation of the second NLS (Fig. 5B). These NLSs are functionally important as oncogenic NET1 lacks the N-terminal 145 amino acids encoding the NLS, and deletion of the two NLS redistributes NET1 from the nucleus to the cytoplasm50,51. The dominance of the NLS-containing NET1-201 splice variant at 24- and 72-hours post-stimulation suggests that nuclear sequestration of Cdc14 by NET1 is important for TCR-driven T cell proliferation and maximally effective adaptive immune responses.
The regulation of the two isoforms of NET1 by transforming growth factor-β (TGF-β) in keratinocytes has been studied, and the results emphasize the importance of NET1-202 in the short- and long-term TGF-β-mediated regulation of epithelial-to-mesenchymal transition (EMT)52. It was found that short-term TGF-β treatment selectively induced NET1-202 (also termed as Net1A) but not NET1-201. Interestingly, long-term TGF-β treatment resulted in Net1A protein degradation by the proteasome. Silencing of Net1A resulted in disruption of E-cadherin- and zonula occludens-1 (ZO-1)-mediated junctions, as well as expression of the transcriptional repressor of E-cadherin, Slug and the mesenchymal markers N-cadherin, plasminogen activator inhibitor-1 (PAI-1) and fibronectin, indicating that late TGF-β-induced downregulation of Net1A is involved in EMT. In conclusion, this study provides new evidence for the differential regulation of the two isoforms of the RhoA-specific GEF NET1 by TGF-β. It points out differential regulatory effects of TGF-β on the NET1A isoform, depending on the duration of the signal52.