Bioinformatics analysis of proteins with dysregulated expression levels showed metabolic reprogramming in neonatal hearts
To determine the underlying correlations between neonatal cardiac regeneration and metabolic reprogramming in the perinatal stage, we profiled protein expression changes during the first 7 days at P1, P5, and P7 by general proteomics (Fig. 1A). In total, 5303 proteins were identified, among which 5104 proteins were quantified (Fig. 1B). When setting the fold change > 1.5, p < 0.05 as a differentially regulated protein, 388 proteins were upregulated and 319 proteins were downregulated in P5/P1, 484 proteins were upregulated and 426 proteins were downregulated in P7/P1, and 64 proteins were upregulated and 86 proteins were downregulated in P7/P5 (Fig. 1C), indicating dramatic changes in protein expression occurred in the neonatal heart.
To further understand the characteristics and functions of the altered proteins, we performed bioinformatics analyses to reveal the changes in protein expression in neonatal hearts. Gene set variation analysis (GSVA) for the 4721 proteins quantified at all 3 time points to recognized the alteration of different cellular processes, we selected C2 gene set in MSigDB as the reference data set and adopted "Poisson" method. With the development of neonatal heart, GSVA showed that glycolysis and gluconeogenesis, the p53 signaling pathway, the cell cycle and DNA replication were downregulated, while the PPAR signaling pathway, fatty acid metabolism, biosynthesis of unsaturated fatty acids and cardiac muscle contraction were upregulated (Figure S1A). These results indicated that fatty acid metabolism and cardiac muscle contraction were increased in neonatal hearts, whereas glycolysis and the cell cycle were inhibited, which might lead to the loss in cardiac regenerative capacity.
To explored the dynamic changes of protein level in neonatal heart, series test of cluster (STC) analysis was performed to recognize distinct protein clusters with similar expression trends, which might perform similar biological functions. In total, the 4721 proteins were divided into 8 clusters with similar coexpression trends (Cluster 1–8), accounting for 389, 309, 393, 326, 605, 1096, 807 and 796 proteins, respectively (Fig. 1D). Cluster 1 and 2 exhibited a decrease at P5, followed by an increase at P7, whereas clusters 3 and 4 displayed an increase at P5, subsequently followed by a decrease at P7. Cluster 5 was characterized by upregulated proteins, cluster 6 showed a decline at P5 but remained stable at P7. Cluster 7 displayed an upregulation at P5, which persisted at P7, and cluster 8 was primarily composed of downregulated proteins. Hence, we combined cluster 1 and cluster 2 to form C1, merged cluster 3 and cluster 4 to create C2, and assigned C3 to cluster 5, C4 to cluster 6, C5 to cluster 7, and C6 to cluster 8 for bioinformatics analyses. Through KEGG pathway analysis of the 6 clusters, we found that glycolysis was mainly identified in C4, while the Citrate cycle (TCA cycle), Oxidative phosphorylation, Fatty aicd metabolism, Fatty aicd degradation and PPAR signaling pathway were significantly enriched in C5. Additionally, Cell cycle, DNA replication, Mismatch repair and Spliceosome were prominently featured in C4 and C6 (Fig. 1E).
Furthermore, GO enrichment analyses were conductedfor these 6 clusters, with the P.adj < 0.05. In the biological process category, the outcomes revealed that glycolytic process was primarily associated with C4, tricarboxylic acid cycle, oxidative phosphorylation, lipid metabolic process, fatty acid metabolic process and cellular respiration were mainly identified in C5, signifying that glycolysis was suppressed and fatty acid oxidation was augmented in neonatal hearts. Moreover, DNA replication, DNA repair, chromatin remodeling, cell cycle, RNA splicing and ribosome biogenesis were enriched in C4 and C6, indicating that the cell cycle and cardiomyocyte proliferation were hindered in neonatal hearts (Figure S1B). In the cellular component category, chromosome (telomeric region), spliceosomal complex, nuclear matrix, cytosolic ribosome and ribonucleoprotein complex were discerned in C4 and C6, whereas C5 mainly include mitochondrial inner membrane, mitochondrial respiratory chain complex (I, III and IV), peroxisome and respiratory chain (Figure S1C). Additionally, in the molecular function category, C5 was primarily characterized by fatty-acyl-CoA binding, lipid binding and NAD binding, while RNA binding, actin binding and ribosome binding fell into C4 and C6 (Figure S1D). This analysis identified several proteins with the Log2 FC > Log2 1.2, glycolytic proteins mainly fell into C4 and C6, such as ADH5, GALM, PKM, TPI1, ALDOC, PGAM1 and PGK1, which showed decreased glycolysis in the neonatal heart (Fig. 1F). FAO-related proteins were enriched in C3 and C5, such as CPT2, CPT1β, ACSL1, ACADL and ACAT1, which were increased (Fig. 1G). In addition, the expression of proteins involved in the cell cycle, cyclin family and MCM family was reduced in the neonatal heart (Fig. 1H). Above all, these results demonstrated that metabolic remodeling occurred in the 7-d time window of the neonatal heart, accompanied by cell cycle arrest.
Global protein Kla levels were decreased with metabolic reprogramming during the perinatal stage
To establish the comprehensive profiling of the protein Kla in the neonatal heart, we selected 4 time points to detect the protein Kla abundance in perinatal hearts and embryonic Day 18.5 (E18.5), P1, P5, and P7 hearts (Fig. 2A-2B and Figure S2). We found that the Kla abundance of the P1 heart was similar to that of E18.5, which was consistent with the observation that a proper regenerative capacity is retained by P1 heart [16]. In addition, there was a sharp decline in Kla abundance in P5 and P7 hearts. This accompanying protein Kla alteration, metabolic reprogramming and cardiac regenerative ability loss in the postnatal heart indicates an underlying correlation between protein Kla and cardiac regeneration.
Thus, we selected P1, P5 and P7 hearts of C57BL6/J mice to perform Kla omics analysis. The Kla levels of Kla sites were detected by normalizing the Kla peptide abundance with the protein abundance based on the proteomics and Kla omics analyses. To validate the quality of the MS data, principal component analysis (PCA) showed that the 3 samples, P1, P5 and P7, were separated into three distinct quadrants, indicating significant differences in the protein Kla status of P1, P5 and P7 hearts (Figure S3A), and the peptide mass error of most peptides was less than 5 ppm, which means that the mass accuracy of the MS data was acceptable (Figure S3B). Finally, for most of the identified proteins, the number of Kla sites per protein was 1 to 3 (Figure S3C).
The Kla sites were identified and quantified through database searching, and further bioinformatics analysis was performed to reveal the marked alterations in normalized Kla levels in the metabolic remodeling period in mice. In total, 2297 lactylation sites from 980 proteins were identified, among which 1262 lactylation sites from 409 proteins were quantified (Fig. 2C). The postnatal standardization of protein lactylation levels exhibited a downward trend in murine cardiac tissue (Fig. 2D). And we generated a heatmap of lactylation omics, wherein a minority of protein sites displayed augmented lactylation level, yet the most proteins Kla level in heart decreased postnatally (Fig. 2E). Taken together, these results suggested that there was a global decrease in the protein Kla level accompanied by metabolic reprogramming during the perinatal stage.
Functional enrichment analysis of proteins with dysregulated Kla sites
To further understand the characteristics and function of the proteins with Kla sites, we first identified the Kla sites that were expressed at all 3 time points and obtained 782 differentially modified Kla sites in the postnatal heart. We applied several functional enrichment analyses to examine the dynamics of Kla changes in the neonatal heart. STC analysis for the differentially modified proteins grouped the 782 Kla sites into 10 coexpression trend clusters. Cluster 1, 2 and 3 were dominated by downregulated proteins, while cluster 4 and 5 mainly contained upregulated proteins, cluster 6 and 7 exhibited an increase at P5 followed by attenuation at P7, with the Kla level remaining greater than that of P1, cluster 8 and 9 demonstrated a parallel expression pattern to cluster 6 and 7, however, the Kla level at P7 was lower than that of P1. Cluster 10 was comprised of a mere three proteins Kla sites, and the sample size was deemed insufficient for bioinformatics analysis, consequently, cluster 10 was disregarded. Therefore, we amalgamated cluster 1, 2 and 3 to generate C1, combined cluster 4 and 5 to form C2, and designated cluster 6 and 7 as C3, cluster 8 and 9 as C4 for subsequent bioinformatics analyses. C1 and C4 contained 686 of all 782 Kla sites, suggesting that Kla level of most proteins were decreased after birth, which was consistent with Fig. 2 (Fig. 3A). Through KEGG enrichment analysis of these 4 clusters, glycolysis, protein translation (ribosome) and spliceosome were enriched in C1 and C4, which displayed a general decline in neonatal hearts. C2 showed a significant enrichment of cardiac muscle contraction and citrate cycle (TCA cycle). Furthermore, fatty acid degradation and PPAR signaling pathway were identified in C3 (Fig. 3B).
GO-based functional enrichment analysis was performed to elucidate proteins in these 4 clusters. When setting the P.adj < 0.05, the biological process category revealed significant enrichments of cytoplasmic translation, protein folding, actin cytoskeleton organization, glycolytic process, and canonical glycolysis in C1 and C4. In contrast, lipid metabolic process was primarily associated with C3 (Fig. 3C). Correspondingly, the cellular component category unveiled processes related to protein biosynthesis and cell proliferation, such as ribosome, cytoskeleton, cytosolic ribosome, and ribonucleoprotein complex, in C1 and C4 (Fig. 3D). Furthermore, in the molecular function category, double-stranded DNA binding, RNA binding, structural constituent of ribosome, and protein binding were identified in C1 and C4, while fatty-acyl-CoA binding and electron carrier activity were predominantly involved in C3 (Fig. 3E). Taken together, proteins Kla mainly participated in metabolic pathways, proteins biosynthesis and cell proliferation.
In addition, we performed GO-based enrichment analysis of C1-C4with P.adj < 0.05 to determine the other pathways containing lactylated proteins and discovered that protein Kla was implicated in other cellular processes apart from metabolism and cell cycle (Figure S4). We observed the participation of Kla in cellular processes pertaining to mitochondrial metabolism, cytoskeleton, and protein biosynthesis. Thus, these results suggested that nonhistone Kla might mainly modulate mitochondrial metabolism and protein biosynthesis.
With the underlying association of metabolic reprogramming and proteins Kla, we analyzed the alteration in metabolism-related protein Kla levels. C1 and C4 contained the Kla sites of glycolytic proteins PGAM2, PFKP, PKM, PGK1 and ALDOA. In addition to glycolytic proteins, proteins involved in DNA replication, such as Structural Maintenance of Chromosomes 3 (SMC3), Replication Protein A1 (RPA1) and Nucleophosmin 1 (NPM1), and ribosomal proteins family, RPL3, RPL6, RPS6 and RPS7, were also identified in C1 and C4. While C3 included critical regulators of fatty acid oxidation, FABP3, ECI1 and ACADVL.
Glycolysis and cell proliferation showed a positive correlation between proteomics and Kla omics, while FAO showed a negative correlation
To obtain a more comprehensive understanding of the implications of protein Kla in the neonatal 7-day time frame, we conducted a Pearson correlation coefficient analysis between proteomics and Kla omics to profile the correlation between the protein Kla level and the underlying cellular processes.
The Pearson correlation analysis demonstrated a clearly positive and negative correlation between protein abundance and Kla level of the modified sites (Fig. 4A). When we setting the absolute value of Pearson correlation coefficient༞0.8, we observed that the number of Kla sites exhibiting a positive correlation (172) was less than that with a negative correlation (208), but the number of proteins showing a positive correlation (109) outweighed those with negative correlation (81), indicated that the positively correlated modification sites were extensively distributed (Fig. 4B). Meanwhile, the Venn diagram displayed 109 positive correlated proteins and 81 negative correlated proteins, 10 of them were in the intersection, which contained both positive and negative correlated Kla sites (Fig. 4C).
The KEGG pathway analysis revealed that glycolysis/gluconeogenesis, ribosome, DNA replication and cell cycle were enriched in positive correlation group, while PPAR signaling pathway, TCA cycle, fatty acid degradation and oxidative phosphorylation were identified in negative correlation group (Fig. 4D). Besides, we conducted functional enrichment analysis of the positive correlation group based on GO, and found that the protein abundance of mRNA processing, ribosome biogenesis, DNA biosynthetic process and glycolytic process were positively correlated with their Kla level (Fig. 4E). And the GO analysis of the negative correlation group showed that the group comprised ATP metabolic process, cellular respiration, TCA cycle, fatty acid beta-oxidation and oxidative phosphorylation (Fig. 4F).
To further explore the potential role of Kla in cardiac metabolism and cell proliferation, we extracted proteins with differentially regulated Kla sites (Log2 FC > Log2 1.2) from glycolysis, FAO and the cell cycle. According to the expression pattern, we could classify glycolytic proteins into the following groups (Fig. 5A and 5C): class 1 included PFKP, ENO1, PKM and ALDOA, the protein level of class 1 proteins declined after birth, and the Kla level of class 1 also decreased. Class 2 identified DLD, ENO3 and PDHA1, which showed increased protein expression, while the Kla level of class 2 was variable; for example, ENO3 K275 decreased, but ENO3 K80 increased at P5 and then decreased at P7. Similarly, for FAO, there were 3 classes of proteins (Fig. 5B and 5D). In class 3, the protein abundance of ACADVL, ACADL, ACADM, ECI1 and ACSL1 was increased, while their Kla level was increased at P5 and then attenuated at P7, consistent with the enhanced FAO in the postnatal heart. According to our proteomic results, FAO-related protein expression was increased after birth; thus, we suspected that at P5, the upregulated Kla level might contribute to the increased protein expression and function. Class 4 contained HADH, HADHB, ACAA2 and ACADS, and the protein level was enhanced, but the Kla level was reduced in the 7-d time window after birth. In class 5 FAO proteins, the different Kla sites showed different expression trends in HADHA and ACAT1. The Kla levels at K569, K634, K519, K644, K415, K60 and K406 of HADHA were decreased in P5 and P7 hearts, whereas the K334, K284 and K390 of HADHA were similar to the class 3 proteins. There was a transition of the Kla level at P5. In addition to the Kla sites of ACAT1, K248 and K265 showing diverse statuses after birth, K248 showed an increase at P5 and then an attenuation at P7, while the Kla level of K265 was decreased after birth. Taken together, these results implied that the effect of Kla might be flexible and could have different functions at different Kla sites.
However, in regard to cell cycle-related proteins, we identified decreased protein expression in proteomics (Fig. 1H), but our Kla omics results showed that there were very few lactylated proteins involved in the cell cycle. Only a small minority of the modified proteins related to the cell cycle were identified, such as SMC3, RPA1, NPM1 and nuclear DNA helicase II (DHX9) (Fig. 5E), suggesting that protein lactylation might prefer to participate in cardiac metabolic activity compared with the cell cycle.
Moreover, we observed that the glycolytic proteins PFKP and ENO1 showed a positive correlation between the protein level and Kla level (Fig. 5F and 5G), and the DNA replication-related proteins SMC3 and NPM1 presented a positive correlation (Fig. 5H and 5I), while the FAO enzyme ACAT1 showed a negative correlation (Fig. 5J). The Kla level of ACADL K338 was positively correlated with the ACADL expression level, but ACADL K322 showed a negative correlation (Fig. 5K). Above all, these results suggested that in the neonatal heart, protein lactylation mainly participate in metabolic pathways, the correlation between protein and Kla level was positive in glycolysis and the cell cycle, and the correlation was negative in the PPAR signaling pathway and fatty acid β-oxidation.
Verification of proteins with dysregulated Kla sites
To verify the dynamic alteration of protein Kla levels in the 7-d time window after birth, several proteins with dysregulated Kla sites were selected, and the Kla omics results were validated. In the neonatal heart, there was increased fatty acid β-oxidation and decreased glycolysis, but the alteration of the Kla levels of these proteins varied (Fig. 6A). To validate the identification of lysine lactylation, we extracted the mass spectra of several Kla modified peptides of selected proteins (Figure S5-S8). However, due to the limitations in detecting Kla levels at specific sites, we detected the overall Kla abundance of selected proteins through coimmunoprecipitation and Western blot, and then normalized the Kla abundance with the corresponding protein expression level. Acyl-CoA dehydrogenase long chain (ACADL), Acyl-CoA dehydrogenase very long chain (ACADVL) and Acetyl-CoA acetyltransferase (ACAT1) are important enzymes that catalyze the mitochondrial β-oxidation pathway. The proteomics showed that the protein expression levels of ACADL, ACADVL and ACAT1 were increased at P5 and P7 (Fig. 5B). And we found that both the Kla abundance (Top band of Fig. 6B-C and Figure S9 A-B) and the normalized Kla level (Figure S10 A-B) of ACADL and ACAT1 were decreased in neonatal hearts. Besides, the Kla abundance of ACADVL increased at P5 and then declined at P7 (Top band of Fig. 6D and Figure S9C), aligning with the Kla omics data, however, when we normalized the Kla abundance by protein abundance, the Kla level of ACADVL showed a decrease at both P5 and P7 (Figure S10C), potentially due to a marked upregulation of the protein abundance at P5 and P7. PFKM and PKM are critical glycolytic enzymes, we found that in neonatal hearts, both the Kla abundance (Top band of Fig. 6E and Figure S9D) and the normalized Kla level of PFKM were decreased (Figure S10D), which was consistent with our omics data. Although the Kla abundance of PKM was decreased at P5 and P7 (Top band of Fig. 6F and Figure S9E), the Kla level of PKM after normalization was increased at P5 and decreased at P7 (Figure S10E), might result from the significant downregulation of the protein abundance at P5.
NPM1 is involved in various cellular processes, such as ribosome biogenesis, histone assembly, centrosome duplication, cell proliferation, and regulation of the tumor suppressor p53/TP53. We detected the Kla level of NPM1 after birth, which was similar with PKM, the Kla abundance and the protein expression level of NPM1 were both decreased at P7 (Fig. 6G and Figure S9F), while the Kla level after normalization showed upregulated at P5 and downregualted at P7 (Figure S10F). Moreover, we also detected the Kla level of 14-3-3z, which involved in many vital cellular processes such as metabolism, signal transduction and cell cycle regulation[23–26], we found that Kla abundance of 14-3-3z was decreased after birth (Top band of Figure S11), aligning with our Kla omics data. Above all, the verification of proteins with dysregulated Kla sites proved the reliability of our MS data. These proteins are involved in diverse cellular processes and thus elucidate the global alteration in Kla in postnatal hearts.