3.1 Gene Counting and FPKM Expression Data Analysis
Based on the downloaded datasets, we first conducted a preliminary analysis of gene counting and FPKM expression data. These datasets include gene counting and expression data for all samples, providing a foundation for subsequent differential expression analysis and functional annotation.
Overview of Datasets:
The file “GSE235751_gene_count_matrix1.txt.gz” contains gene counting data for all samples, suitable for differential expression analysis and other downstream analyses.
The file “GSE235751_genes_fpkm_expression.txt.gz” contains FPKM expression data for all samples, used for normalized gene expression analysis and gene set enrichment analysis.
Initial Data Analysis:
We performed statistical analysis on the gene counting data for all samples to evaluate the distribution of gene counts and ensure the uniformity and quality of the data. This analysis ensured the integrity and reliability of the gene counting data, laying a solid foundation for subsequent differential expression analysis.
FPKM expression data were normalized using the FPKM expression data for all samples. The results showed that most genes have consistent expression levels across different samples. This consistency indicates high data quality, and the normalization effectively eliminated technical biases, ensuring comparability between samples.
To better understand and present the data, we used Python to plot the distribution of gene counting and FPKM expression data. Figure 2 shows the gene count distribution for all samples, visually displaying the overall distribution and outliers using a box plot. Figure 3 shows the FPKM expression distribution for all samples, reflecting the distribution trend and consistency using a density plot.
These preliminary data analyses and visualizations provide a robust data foundation for subsequent differential expression gene screening and functional annotation, helping us to better understand the impact of AKI on cardiac gene expression.
3.2 Differential Expression Analysis of DEGs
This study used Python to conduct differential expression analysis on the dataset "GSE235751_gene_count_matrix1.txt.gz". Differential expression analysis between the UUO and control groups showed 5 differentially expressed genes (DEGs) at statistically significant levels. Among these, 2 genes were significantly upregulated and 3 genes were downregulated in the UUO group. Specifically: Upregulated Genes: Il6 and Tnf, associated with immune response and inflammation pathways. Downregulated Genes: Cs, Becn1, and Pex2, associated with mitochondrial oxidative bioenergetics, autophagy, and peroxisome pathways. Figure 4 presents the volcano plot of the DEGs, where labeled dots correspond to significantly upregulated or downregulated genes in the UUO group. The volcano plot illustrates the relationship between the P-value and log2(fold_change) for each gene, providing a clear visualization of the DEGs distribution. Although the number of significant DEGs was limited in our study, additional enrichment analysis demonstrated that these genes are involved in various crucial biological processes and signaling pathways. More genes and pathways that may be involved in the pathogenesis of AKI could be studied later.
To further understand the biological functions of these DEGs, we performed functional annotation and KEGG pathway analysis. Figure 5 shows the bubble plot of the functional annotation of DEGs. From this analysis, it was determined that these genes are significantly enriched in various biological processes, cellular components, and molecular functions, revealing a system-wide effect of acute kidney injury on cardiac gene expression.
These results indicate that AKI affects heart function through multiple biological pathways, including the cell cycle, immune response, mitochondrial function, and autophagy. By identifying these early molecular events, we can better understand the remote effects of AKI on the heart and provide new targets for future research and treatment.
3.3 PCA and t-SNE Analysis
To visualize the distinctions between samples, we carried out Principal Component Analysis (PCA) and t-distributed Stochastic Neighbor Embedding (t-SNE) on the dataset “GSE235751_gene_count_matrix1.txt.gz.” Principal Component 1 (PC1) and Principal Component 2 (PC2) are illustrated with PCA results, which demonstrate a clear separation of the UUO group from the control group based on the PC1 vs. PC2 dimensions. These results show striking differences in the gene expression profile of UUO compared with control kidneys. PCA extracts these differences effectively, visualizing the distribution of samples two-dimensionally.
Figure 6 shows the scatter plot of the PCA analysis, with each point representing a sample and different colors representing different experimental groups. In the PC1 and PC2 space, the UUO group is separate from the control group, indicating significant changes in gene expression due to acute kidney injury.
To validate the sample clustering, we repeated t-SNE analysis. t-Distributed Stochastic Neighbor Embedding, or t-SNE for short, is a non-linear dimensionality reduction technique well-suited for reducing high-dimensional data. The t-SNE analysis results showed the grouping of UUO and control groups in two-dimensional space, which verified their completely separate characteristics.
Figure 7 shows the scatter plot from the t-SNE analysis of sample-wise representations; each point corresponds to one sample, and different colors are used for distinct experimental groups. Consistent with the PCA results, two distinct clusters of the UUO group and control group were observed in the t-SNE 2D space.
These findings not only demonstrate that both PCA and t-SNE analyses reveal distinct differences between the UUO and control groups, but they also serve as crucial visualizations to elucidate how AKI impacts cardiac gene expression. From these analyses, it is clear that distinguishing gene expression differences among the experimental groups can help in understanding the molecular mechanisms of AKI.
3.4 Gene Set Enrichment Analysis (GSEA)
Using the dataset “GSE235751_genes_fpkm_expression.txt.gz,” we performed Gene Set Enrichment Analysis (GSEA) and identified several significantly enriched gene sets in the UUO group. The results indicated that the mitochondrial oxidative bioenergetics pathway and autophagy pathway were significantly downregulated, reflecting the inhibition of these critical metabolic and cellular cleaning processes following acute kidney injury.
Figure 8 shows the GSEA results for the mitochondrial oxidative bioenergetics pathway, with the enrichment plot clearly indicating significant downregulation in the UUO group. Figure 9 shows the GSEA results for the autophagy pathway, also indicating significant downregulation. These downregulated pathways may be associated with compromised cellular bioenergetics and self-cleaning capacities, further highlighting the systemic impact of acute kidney injury on the heart.
Moreover, pathways related to cell proliferation and inflammation were significantly upregulated in the UUO group. Figures 10 and 11 illustrate the GSEA results for cell proliferation and inflammation-related pathways, respectively. These results reflect the activation of cell proliferation and inflammatory responses induced by acute kidney injury, suggesting that these upregulated pathways are part of the heart’s repair and defense mechanisms.
Additionally, as a significantly upregulated transcript, Vimentin was highly expressed in the UUO group compared to the control group. Figure 12 shows the expression levels of Vimentin in both groups, with a bar chart and data points illustrating this difference. This indicates that Vimentin may play a crucial role in fibrosis or structural remodeling of the heart following acute kidney injury.
These GSEA results not only reveal the systemic impact of acute kidney injury on cardiac gene expression but also provide new insights into the molecular mechanisms of cardiorenal syndrome. These significantly altered gene sets and pathways could serve as potential targets for future research and therapeutic interventions.
3.5 Functional Annotation and Pathway Analysis
Then Python tools were used to perform functional annotation and KEGG pathway analysis of DEGs. Table 1 included upregulated genes involved in immune response (GO:0006955) and inflammatory response pathways (GO:0006954), such as Il6 and Tnf. On the other hand, Cs, Becn1, and Pex2 were underrepresented, more expressed genes that play important roles in mitochondrion (GO:0005739), autophagy (GO:0006914), and the peroxisome pathway (KEGG:04146), respectively, all showing a downtrend (see Table 2).
Table 1
Functional Annotation and KEGG Pathway Analysis of Upregulated Genes
Term | Overlap | Adjusted P-value | Combined Score |
immune response (GO:0006955) | 2 | 0.097610 | 2.383518 |
inflammatory response (GO:0006954) | 2 | 0.061670 | 1.123777 |
Table 2
༎Functional Annotation and KEGG Pathway Analysis of Downregulated Genes
Term | Overlap | Adjusted P-value | Combined Score |
mitochondrion (GO:0005739) | 1 | 0.069764 | 1.542303 |
autophagy (GO:0006914) | 1 | 0.073367 | 8.882557 |
peroxisome pathway (KEGG:04146) | 1 | 0.040781 | 2.154914 |
These results show that the systemic reactions caused by AKI activate immune and inflammatory responses in the heart while adjusting the processes of mitochondria, autophagy, and the peroxisome pathway.
This description ensures consistency with the previously identified number of DEGs and specific genes, maintaining the logical progression and scientific accuracy of the manuscript.