3.1 Gene Collection & Filtering
For this study, all of the genes responsible for BD and stroke are gathered from the NCBI online gene repository. After collecting from NCBI, the total no. of genes for BD are estimated at 743 and for stroke at 1540; the corresponding genes for Homo sapiens are 720 and 853 respectively. The values of the associated risk genes are shown in Table 1.
Table 1. The depiction of the gene collection of targeted disorders is derived from the NCBI repository. After processing and categorizing all of the related genes for Homo sapiens, there are 720 genes for BD and 853 genes for stroke. Table 1 displays the numerical values for the relevant genes.
Diseases
|
Total number of Gene
|
Total number of Homo Sapiens
|
Bipolar Disorder
|
743
|
720
|
Stroke
|
1540
|
853
|
3.2 Intersection of genes and discovery of shared genes
In this step, the link between the two diseases is established. Python programming language is utilized to identify genes associated with BD and stroke in this work. The BD & stroke intersection is used to find the common gene between two diseases. Genes that overlap between BD and Stroke have found 103 similar genes. To keep things simple, just the top 10 most important genes are included in the analysis. APP, ESR1, TP53, CTNNB1, NFKB1, MAPK1, TNFRSF1A, MAPK3, TNF and MYD88 were the top 10 genes. The top 10 genes account for the majority of the risk for developing those diseases. If we properly address these genes, we can mitigate these illnesses [32]. Figure 2 shows a Venn diagram illustrating the connection between the total number of genes and the proportion of shared genes. To begin this inquiry, genes were extracted from a reliable database. Following that, the dataset was subjected to the mining algorithm. Figure 2 demonstrates the findings of the verification inquiry.
3.3 PPI network among common genes
The PPI network is a mechanism for assigning physical interactions between proteins. The PPI network incorporates both active and passive connections between genes and hub proteins. Correlated genes, protein interactions, and common pathways are shown in this diagram. NetworkAnalyst is a powerful web-based intuitive visual analytics tool for analyzing and interpreting gene expression data at the system level [33]. By using the web-based NetworkAnalyzer, the STRING interactome repository is utilized to create SIF files for the network diagram. The STRING database (http://string-db.org) was designed to provide a comprehensive study and integration of active (physical) and passive (functional) protein-protein interactions [34]. The NetworkAnalyst tool is simple to use. The leading 10 concordant genes of APP, TP53, ESR1, CTNNB1, NFKB1, MAPK1, TNFRSF1A, MAPK3, TNF, and MYD88 were then explored using the PPI network. Ten frequently occurring genes for BD and stroke are provided in NetworkAnalyst. The result of NetworkAnalyst for hub genes is shown in Figure 3 below.
3.4 Topological properties
Betweenness centrality, cluster coefficient, topological coefficient and degree are all components of the PPI network. NetworkAnalyst is used to develop the PPI network, which is then used for topological analysis. The topological features of the PPI network in the Cytoscape tool are estimated using the Network Analyzer program in connection with the PPI structure. The Cytoscape Network Analyzer program uses the SIF document in the Cytoscape intuitive tool to determine topological properties. As seen in Table 3, the top 10 weighted genes have distinct topological properties. The different topological features of the PPI network are shown in Figure 4 (A, B, C, D). Figure 4(A) depicts a clustering measurement, which is a feature of the degree to which nodes in a network are clustered together on average. Betweenness centrality is a measuring of a vertex's influence on the information trickles for each pair of vertices, indicating that data travels mostly along the closest paths between them, as seen in Figure 4(B). Take a look at figure 4(C), Closeness centrality is a technique for detecting elements that is really successful at transmitting information over a network. The centrality of a node's closeness indicates its mean distance from all other endpoints. Nodes having a high intimacy score have the minimum distances to all other nodes. Figure 4(D) illustrates, the topological coefficient quantifies the magnitude to which a node links its peers to other nodes. Nodes with one or no neighbors are awarded a topological coefficient of 0.
Table 2. The Cytoscape program was used to determine the topological features of the top 10 significant genes in the PPI network.
Protein
Name
|
Degree
|
Betweenness
Centrality
|
Closeness
Centrality
|
Clustering
Coefficient
|
Topological
Coefficient
|
APP
|
1958
|
0.647375877
|
0.514682393
|
8.61E-05
|
0.001360901
|
ESR1
|
798
|
0.242729279
|
0.424589499
|
0.001415081
|
0.002733945
|
TP53
|
659
|
0.189862713
|
0.410766817
|
0.001701943
|
0.003276949
|
CTNNB1
|
330
|
0.102658892
|
0.454436971
|
0.004163213
|
0.004188482
|
NFKB1
|
240
|
0.055721604
|
0.382277788
|
0.006345886
|
0.007021912
|
MAPK1
|
187
|
0.042237979
|
0.3906556
|
0.011270197
|
0.008300151
|
TNFRSF1A
|
167
|
0.032575869
|
0.349841323
|
0.002164346
|
0.011958279
|
MAPK3
|
127
|
0.019560447
|
0.363533802
|
0.013748281
|
0.011678508
|
TNF
|
91
|
0.014450557
|
0.364577894
|
0.011233211
|
0.015429468
|
MYD88
|
87
|
0.012837737
|
0.33908046
|
0.006950013
|
0.029567975
|
3.5 Hub gene
This study identified 10 vital hub genes as being essential to the molecular pathway linking Bipolar Disorder (BD) and stroke: APP, ESR1, TP53, CTNNB1, NFKB1, MAPK1, TNFRSF1A, MAPK3, TNF, and MYD88 is shown in figure 5. The generated Protein-Protein Interaction (PPI) network showed notable topological features and high connectedness for these genes. They have a role in important biological processes that are pertinent to the pathophysiology of both conditions, including apoptosis, cell signaling, and inflammation. Notably, their significance in disease pathways is further highlighted by genes such as TP53 and MAPK1, which are renowned for their roles in cell cycle regulation and stress response. These hub genes may serve as indicators or therapeutic targets, and their discovery offers important new information about the molecular interactions between stroke and BD.
3.6 Enrichment Analysis: GO terms and Pathways
GO (Gene ontology) are classified into 3 categories. Biological process, molecular function, and cellular component are the three types of terms used in GO. Molecular function generally relates to the activity that is carried out by a gene at the molecular level, biological process describes the larger cellular or physiological role that is played by the gene, and cellular component corresponds to the part of the cell in which the gene product is responsible for carrying out its function. GO is a key concept of enrichment analysis that is generated by a web-based application like STRING. The findings of the GO investigation are shown in figure 6 and Table 3. A gene set enrichment approach is needed in pathway analysis. Fig 7 and Table 4 shows the pathway analysis that helps genome to pathways link.
Table 3: A detailed analysis was conducted on the association between common genes in GO keywords and GO pathways, along with the accompanying P-values. Significant enrichment of particular biological processes, cellular constituents, and molecular functions linked to the identified genes was found by this research.
Category
|
GO ID
|
Term
|
P-value
|
Genes
|
Biological process
|
GO:0045944
|
Positive Regulation Of Transcription By RNA Polymerase II
|
9.41E-10
|
APP;CTNNB1;ESR1 ;TP53;TNF;NFKB1 ;TNFRSF1A ;MAPK3
|
GO:0051972
|
Regulation Of Telomerase Activity
|
4.65E-09
|
CTNNB1;MAPK1 ;TP53;MAPK3
|
GO:0071260
|
Cellular Response To Mechanical Stimulus
|
6.07E-09
|
NFKB1;MYD88 ;TNFRSF1A ;MAPK3
|
GO:0045893
|
Positive Regulation Of DNA-templated Transcription
|
8.76E-09
|
AP;CTNNB1;ESR1 ;TP53;TNF;NFKB1 ;TNFRSF1A ;MAPK3
|
GO:0051091
|
Positive Regulation Of DNA-binding Transcription Factor Activity
|
6.48E-08
|
APP;CTNNB1;ESR1 ;TNF;MYD88
|
GO:0019221
|
Cytokine-Mediated Signaling Pathway
|
8.06E-08
|
TP53;TNF;MYD88 ;TNFRSF1A ;MAPK3
|
GO:0071675
|
Regulation Of Mononuclear Cell Migration
|
1.38E-07
|
MAPK1;TNF ;MAPK3
|
GO:0006357
|
Regulation Of Transcription By RNA Polymerase II
|
4.12E-07
|
APP;CTNNB1; ESR1;TP53;TNF; NFKB1;TNFRSF1A ;MAPK3
|
GO:0032212
|
Positive Regulation Of Telomere Maintenance Via Telomerase
|
4.43E-07
|
CTNNB1;MAPK1 ;MAPK3
|
GO:0051173
|
Positive Regulation Of Nitrogen Compound Metabolic Process
|
4.87E-07
|
APP TNF NFKB1
|
Category
|
GO ID
|
Term
|
P-value
|
Genes
|
Molecular Function
|
GO:0004707
|
MAP Kinase Activity
|
1.23E-05
|
MAPK1;MAPK3
|
GO:0019902
|
Phosphatase Binding
|
1.99E-05
|
CTNNB1;MAPK1 ;MAPK3
|
GO:0032813
|
Tumor Necrosis Factor Receptor Superfamily Binding
|
7.85E-05
|
TNF;MYD88
|
GO:0140296
|
General Transcription Initiation Factor Binding
|
9.71E-05
|
ESR1;TP53
|
GO:0030331
|
Nuclear Estrogen Receptor Binding
|
1.25E-04
|
CTNNB1;ESR1
|
GO:0001067
|
Transcription Regulatory Region Nucleic Acid Binding
|
1.57E-04
|
TP53;TNF;NFKB1
|
GO:0001222
|
Transcription Corepressor Binding
|
1.92E-04
|
CTNNB1;ESR1
|
GO:1990837
|
Sequence-Specific Double-Stranded DNA Binding
|
2.86E-04
|
ESR1;TP53;TNF ;NFKB1
|
GO:0140297
|
DNA-binding Transcription Factor Binding
|
3.09E-04
|
CTNNB1;TP53 ;MAPK3
|
GO:0001221
|
Transcription Coregulator Binding
|
0.001063621
|
CTNNB0;ESR0 ;TP53;NFKB1
|
Category
|
GO ID
|
Term
|
P-value
|
Genes
|
Cellular
Function
|
GO:0045121
|
Membrane Raft
|
6.81E-05
|
APP;TNF ;TNFRSF1A
|
GO:0000791
|
Euchromatin
|
2.10E-04
|
CTNNB1;ESR1
|
GO:0005788
|
Endoplasmic Reticulum Lumen
|
3.16E-04
|
APP;MAPK1 ;MAPK3
|
GO:0005769
|
Early Endosome
|
3.97E-04
|
APP;MAPK1 ;MAPK3
|
GO:0005901
|
Caveola
|
4.19E-04
|
MAPK1;MAPK3
|
GO:0034774
|
Secretory Granule Lumen
|
4.32E-04
|
APP;MAPK1 ;NFKB1
|
GO:0044853
|
Plasma Membrane Raft
|
7.31E-04
|
MAPK1;MAPK3
|
GO:0005925
|
Focal Adhesion
|
7.80E-04
|
CTNNB1;MAPK1 ;MAPK3
|
GO:0030055
|
Cell-Substrate Junction
|
8.27E-04
|
CTNNB1;MAPK1 ;MAPK3
|
GO:0005634
|
Nucleus
|
0.001782238
|
CTNNB1;MAPK1 ;ESR1;TP53;NFKB1 ;MYD88;MAPK3
|
Table 4: A thorough study was conducted to determine the significance of the association between P-values and frequent genes in the KEGG, WikiPathways, Reactome, and BioCarta databases. This study highlighted important biological processes implicated in the disease mechanisms by identifying key pathways that are considerably enriched with the shared genes.
Databases
|
Pathways
|
P-value
|
Genes
|
KEGG
|
Hepatitis C
|
1.89E-13
|
CTNNB1;MAPK1 ;TP53;TNF;NFKB1 ;TNFRSF1A;MAPK3
|
Lipid and atherosclerosis
|
1.76E-12
|
MAPK1;TP53;TNF ;NFKB1;MYD88 ;TNFRSF1A;MAPK3
|
Human cytomegalovirus infection
|
2.42E-12
|
CTNNB1;MAPK1 ;TP53;TNF;NFKB1 ;TNFRSF1A;MAPK3
|
Chagas disease
|
3.13E-12
|
MAPK1;TNF; NFKB1;MYD88 ;TNFRSF1A;MAPK3
|
Shigellosis
|
4.55E-12
|
MAPK1;TP53;TNF ;NFKB1;MYD88 ;TNFRSF1A;MAPK3
|
Salmonella infection
|
4.95E-12
|
CTNNB1;MAPK1 ;TNF;NFKB1; MYD88;TNFRSF1 ;MAPK3
|
Toxoplasmosis
|
5.55E-12
|
MAPK1;TNF; NFKB1;MYD88 ;TNFRSF1A;MAPK3
|
Sphingolipid signaling pathway
|
8.04E-12
|
MAPK1;TP53;TNF ;NFKB1;TNFRSF1 ;MAPK3
|
MAPK signaling pathway
|
1.60E-11
|
MAPK1;TP53;TNF ;NFKB1;MYD88 ;TNFRSF1A;MAPK3
|
Apoptosis
|
2.36E-11
|
MAPK1;TP53;TNF ;NFKB1;TNFRSF1A ;MAPK3
|
WikiPathways
|
IL 18 Signaling Pathway WP4754
|
1.00E-19
|
CTNNB1;MAPK1 ;TP53;TNF;NFKB1 ;MYD88;TNFRSF1A ;MAPK3
|
CKAP4 Signaling Pathway Map WP5322
|
1.39E-14
|
CTNNB1;MAPK1 ;ESR1;TP53 ;TNF ;NFKB1;TNFRSF1A
|
A Network Map Of Macrophage Stimulating Protein MSP Signaling WP5353
|
8.17E-13
|
CTNNB1;MAPK1 ;TP53;TNF;NFKB1 ;MAPK3
|
T Cell Activation SARS CoV 2 WP5098
|
1.26E-12
|
MAPK1;TP53;TNF ;NFKB1;MYD88; MAPK3
|
TNF Related Weak Inducer Of Apoptosis TWEAK Signaling Pathway WP2036
|
4.71E-12
|
CTNNB1;MAPK1 ;TNF;NFKB1; MAPK3
|
Alzheimer 39 S Disease And miRNA Effects WP2059
|
6.72E-12
|
APP;CTNNB1 ;MAPK1;TNF; NFKB1;TNFRSF1A ;MAPK3
|
Alzheimer 39 S Disease WP5124
|
6.72E-12
|
APP;CTNNB1 ;MAPK1;TNF ;NFKB1;TNFRSF1A ;MAPK3
|
Cardiac Hypertrophic Response WP2795
|
3.25E-11
|
MAPK1;TNF ;NFKB1 ; TNFRSF1A ;MAPK3
|
RAC1 PAK1 P38 MMP2 Pathway WP3303
|
9.72E-11
|
CTNNB1;MAPK1 ;TP53 ;NFKB1 ;MAPK3
|
Urotensin II Mediated Signaling Pathway WP5158
|
1.05E-10
|
CTNNB1;MAPK1 ;TNF;NFKB1; MAPK3
|
Reactome
|
MyD88 Cascade Initiated On Plasma Membrane R-HSA-975871
|
2.16E-12
|
APP MAPK1 TP53 NFKB1 MYD88 MAPK3
|
Signaling By Interleukins R-HSA-449147
|
2.82E-12
|
APP;MAPK1;TP53 ;TNF;NFKB1; MYD88;TNFRSF1A ;MAPK3
|
TRAF6 Mediated Induction Of NFkB And MAP Kinases Upon TLR7/8 Or 9 Activation R-HSA-975138
|
2.95E-12
|
APP;MAPK1;TP53 ;NFKB1;MYD88 ;MAPK3
|
MyD88 Dependent Cascade Initiated On Endosome R-HSA-975155
|
3.13E-12
|
APP;MAPK1;TP53 ;NFKB1;MYD88 ;MAPK3
|
Toll Like Receptor 7/8 (TLR7/8) Cascade R-HSA-168181
|
3.32E-12
|
APP;MAPK1;TP53 ;NFKB1;MYD88 ;MAPK3
|
Toll Like Receptor 9 (TLR9) Cascade R-HSA-168138
|
3.96E-12
|
APP;MAPK1;TP53 ;NFKB1 ;MYD88 ;MAPK3
|
MyD88:MAL(TIRAP) Cascade Initiated On Plasma Membrane R-HSA-166058
|
5.55E-12
|
APP;MAPK1;TP53 ;NFKB1;MYD88 ;MAPK3
|
Toll Like Receptor 4 (TLR4) Cascade R-HSA-166016
|
2.17E-11
|
APP;MAPK1;TP53 ;NFKB1;MYD88 ;MAPK3
|
Toll-like Receptor Cascades R-HSA-168898
|
5.26E-11
|
APP;MAPK1;TP53 ;NFKB1;MYD88 ;MAPK3
|
Cytokine Signaling In Immune System R-HSA-1280215
|
9.37E-11
|
APP;MAPK1;TP53 ;TNF;NFKB1 ;MYD88 ;TNFRSF1A;MAPK3
|
BioCarta
|
NF-kB Signaling Pathway Homo sapiens h nfkbPathway
|
1.88E-10
|
TNF;NFKB1 ;MYD88;TNFRSF1A
|
Ceramide Signaling Pathway Homo sapiens h ceramidePathway
|
1.28E-09
|
MAPK1;TNF; TNFRSF1A ;MAPK3
|
Keratinocyte Differentiation Homo sapiens h keratinocytePathway
|
9.12E-09
|
MAPK1;TNF; TNFRSF1A;MAPK3
|
Cadmium induces DNA synthesis and proliferation in macrophages Homo sapiens h cdMacPathway
|
5.02E-08
|
MAPK1;TNF ;MAPK3
|
Role of ERBB2 in Signal Transduction and Oncology Homo sapiens h her2Pathway
|
2.93E-07
|
MAPK1;ESR1 ;MAPK3
|
Trefoil Factors Initiate Mucosal Healing Homo sapiens h tffPathway
|
6.37E-07
|
CTNNB1;MAPK1 ;MAPK3
|
Mechanism of Gene Regulation by Peroxisome Proliferators via PPARa Homo sapiens h pparaPathway
|
1.96E-06
|
MAPK1;TNF; MAPK3
|
Stat3 Signaling Pathway Homo sapiens h stat3Pathway
|
6.29E-06
|
MAPK1;MAPK3
|
SODD/TNFR1 Signaling Pathway Homo sapiens h soddPathway
|
8.08E-06
|
TNF;TNFRSF1A
|
Overview of telomerase protein component gene hTert Transcriptional Regulation Homo sapiens h tertpathway
|
1.01E-05
|
ESR1;TP53
|
BioPlanet
|
Chagas disease
|
3.53E-12
|
MAPK1;TNF; NFKB1;MYD88; TNFRSF1A;MAPK3
|
Ceramide signaling pathway
|
1.79E-11
|
MAPK1;TNF; NFKB1;TNFRSF1A;MAPK3
|
Keratinocyte differentiation
|
3.92E-11
|
MAPK1;TNF; NFKB1;TNFRSF1A ;MAPK3
|
Alzheimer's disease
|
6.80E-11
|
APP;MAPK1;TP53 ;TNF;TNFRSF1A ;MAPK3
|
Cadmium-induced DNA biosynthesis and proliferation in macrophages
|
7.47E-11
|
MAPK1;TNF ;NFKB1;MAPK3
|
TGF-beta signaling pathway
|
1.18E-10
|
CTNNB1;MAPK1 ;TP53 ;TNF;NFKB1 ;MAPK3
|
Leishmaniasis
|
1.30E-10
|
MAPK1;TNF ;NFKB1;MYD88 ;MAPK3
|
Modulation of interferon signaling by chaperones
|
2.78E-10
|
TP53;TNF;NFKB1 ;TNFRSF1A
|
MicroRNAs in cardiomyocyte hypertrophy
|
3.05E-10
|
CTNNB1;MAPK1; TNF;NFKB1; MAPK3
|
NF-kappaB signaling pathway
|
3.33E-10
|
TNF;NFKB1;MYD88;TNFRSF1A
|
3.7 Gene Regulatory Network
The term GRN is used to express the association between genes in order to better comprehend them. We used web-based impactful NetworkAnalyst software to build the GRN. Generally, gene regulatory networks (GRNs) may be split into three categories: gene-miRNA connections, transcription factor-gene interactions, and TF-miRNA co-regulation network. Gene-miRNA interaction and TF-gene interaction are shown in Figures 8(A) and 8(B), respectively [35].
3.8 Co-expression Network and Physical Interaction Pathway
Typically, a co-expression network is an undirected graph; this network revealed a high association between two proteins. Co-expression is the initial stage in inference since it establishes the link between two transcripts that are expressed together. If two genes were identified to interact in PPI research, they are related. These data sets have assisted find out how similar ligands are to protein networks on a large scale. Lignin-based networks, which predict the potential of nearby proteins to engage connected compounds, might supplement genetically orientated gene networks that anticipate the significance of function or diseases. We examine in depth how much genetic redundancy, practical PPIs, co-expression, and genetic disease identification [32] may be made possible by such links between ligand-based proteins. Two or more proteins may be involved in a physical contact, resulting in binary interactions and more complicated proteins [33]. As an example of this, the actual interaction of ligands, which developed in protein families, is the primary mechanism for establishing protein-protein interactions [34]. In this study, GeneMANIA was put to use in order to build networks of physical contact and co-expression for ten genes responsible for the condition. The physical interaction and co-expression network is shown in figure 9.
3.9 Identification of drug signature
Drug signature analysis was carried out using the Enrichr software, which accesses the DSigDB database, in order to find potential drug compounds for Bipolar Disorder (BD) and stroke. The common DEGs found in both disease were utilised in this research. Numerous therapeutic options were identified that may be able to modify the expression of important hub genes that are shared by stroke and BD based on p-value and modified p-value criteria. The capacity of these substances to target the pathways linked to neuroprotection, inflammation, and cell cycle regulation—all of which are crucial to the pathophysiology of these conditions—led to their identification. The results point to a promising direction for future research and clinical application: these medications may be repurposed or further studied for their therapeutic potential in treating the comorbidities of stroke and BD. TABLE 5 displays the most potent medication molecules.
Table 5. Drug-suggested chemicals associated to Stroke, and Bipolar disorder have been identified through analysis of their shared DEGs. Potential therapeutic drugs that target these common DEGs have been identified by this approach, providing intriguing treatment options for a variety of diseases.
Name of drugs
|
P-value
|
Adjusted P-value
|
Genes
|
ACROLEIN CTD 00005313
|
1.973573
|
4.515536
|
APP;CTNNB1;MAPK1
ESR1;TP53;TNF;NFKB1 MAPK3
|
N-Acetyl-L-cysteine CTD 00005305
|
5.247344
|
6.002962
|
APP;CTNNB1;MAPK1;
ESR1;TP53;TNF;
NFKB1;
TNFRSF1A;MAPK3
|
dicumarol CTD 00005515
|
1.397297
|
7.992539
|
MAPK1;ESR1;TP53 TNF;NFKB1;MAPK3
|
1'-Acetoxychavicol acetate CTD 00002113
|
1.397297
|
7.9925399
|
MAPK1;TP53;TNF NFKB1;TNFRSF1A; MAPK3
|
AH 23848 CTD 00002080
|
2.605087
|
1.192088
|
CTNNB1;MAPK1 TNF;NFKB1; TNFRSF1A;MAPK3
|
EINECS 250-892-2 CTD 00001193
|
9.296261
|
3.509643
|
APP;CTNNB1;MAPK1;
ESR1;TP53;TNF;
NFKB1;
TNFRSF1A;MAPK3
|
LY 294002 CTD 00003061
|
1.073754
|
3.509643
|
APP;CTNNB1;MAPK1;
ESR1;TP53;TNF;
NFKB1;
TNFRSF1A;MAPK3
|
p'-DDE CTD 00005754
|
1.911957
|
4.860620
|
MAPK1;ESR1;TP53 TNF;NFKB1;MAPK3
|
flurbiprofen CTD 00005993
|
1.911957
|
4.860620
|
APP;MAPK1;ESR1 TP53;TNF;MAPK3
|
celecoxib CTD 00003448
|
2.173026
|
4.971884
|
CTNNB1;MAPK1 ESR1;TP53;TNF; NFKB1;MAPK3
|