Analysis of the selected three structural proteins. The best consensus sequences of CD2v (AJB28366), C-type like protein (AAC28412) and p72 (QID21249) (Supplementary material 1) after multiple sequence alignments and Blast-p analysis were used for screening epitopes using VaxiJen v2.0. VaxiJen v2.0 antigenicity scores showed all the three proteins as antigenic. p72 is the most antigenic with the score 0.5063 followed by C-type lectin protein with 0.4583 and CD2v with 0.4171.
T-cell epitopes predictions. Selection of very immunogenic CTL and HTL is very important for the vaccine to produce long lasting immunity. In particular, CTL response is more important for effective viral clearance. On the other hand, HTL epitopes play a crucial role in generating both humoral and cellular immune responses. The extracellular domains of CD2v, C-type lectin and p72 were submitted to IEDB server to predict (10mer) CTLs and (15mer) HTLs epitopes, applying the IEDB recommended 2020.04 (NetMHCpan EL 4.0) and IEDB recommended 2.22 (Consensus) prediction methods, respectively. Predicted CTL (Supplementary Tables 1–3) and HTL (Supplementary Tables 4–12) epitopes with strong binding affinities (based on low percentile rank) were screened further using allele criteria promiscuity, antigenicity and non-toxicity.to select the best epitopes. Using these criteria, 10 SLA-I (CD2v-1, C-type lectin-4, and p72-5) and 3 MHC-II (CD2v-1, C-type lectin-1, and p72-1) (Tables 1 and 2).
Protein
|
|
SLA I
|
|
|
Table 1
CTL epitopes predicted using NetMHCpan 1.2 revealing promiscuity. VaxiJen v2.0 was used for predicting antigenicity scores keeping a threshold of 0.4 and ToxiPred was used to predict toxicity.
CD2v
|
Peptides (positions)
|
Alleles
|
Antigenicity
|
Toxicity
|
|
TNKSFLNYYW (151–160)
|
SLA-2*0402, SLA-1*0501,
SLA-2*0401
|
0.5346
|
Non-toxic
|
C-type lectin
|
KYTGLIDKNY (6–15)
|
SLA-2*0102, SLA-1*0501
|
1.1132
|
Non-toxic
|
FSNNIDEKNY (81–90)
|
SLA-2*1002
|
0.4413
|
Non-toxic
|
KKYNYESGYW (116–125)
|
SLA-1*0501, SLA-3*0701, SLA-3*0301, SLA-3*0303, SLA-3*0401
|
1.0042
|
Non-toxic
|
KKVNYTGLLF (145–154)
|
SLA-1*0401, SLA-3*0601, SLA-3*0301, SLA-3*0303, SLA-3*0304, SLA-1*0801,
SLA-3*0701
|
1.2916
|
Non-toxic
|
p72
|
YGKPDPEPTL (40–49)
|
SLA-1*1101, SLA-3*0101, SLA-2*0701, SLA-2*0501, SLA-3*0501, SLA-3*0502,
SLA-3*0503, SLA-6*0101, SLA-6*0102, SLA-6*0103, SLA-6*0104, SLA-6*0105
|
1.4480
|
Non-toxic
|
|
KPYVPVGFEY (65–74)
|
SLA-1*0701, SLA-1*0702, SLA-2*0102, SLA-2*0101, SLA-1*0501, SLA-2*0302,
SLA-2*0401, SLA-2*0402.
|
1.3466
|
Non-toxic
|
|
SVSIPFGERF (347–356)
|
SLA-1*1201, SLA-1*0201, SLA-1*0202, SLA-2*0302, SLA-1*0601, SLA-2*0501, SLA-2*0101, SLA-1*0401, SLA-1*0501, SLA-2*1001.
|
0.8502
|
Non-toxic
|
|
SRRNIRFKPW (387–396)
|
SLA-3*0701, SLA-3*0301, SLA-3*0303, SLA-3*0304, SLA-3*0401, SLA-3*0601, SLA-2*0402.
|
1.9015
|
Non-toxic
|
|
SISDISPVTY (521–530)
|
SLA-2*1002, SLA-1*0401,
SLA-2*0302, SLA-1*1301,
SLA-1*0801, SLA-1*0601,
SLA-2*1001, SLA-2*0401,
SLA-1*0201, SLA-1*0202.
|
1.5700
|
Non-toxic
|
MHC II
|
Table 2
HTL epitopes predicted using consensus revealing promiscuity. VaxiJen v2.0 was used for predicting antigenicity scores keeping a threshold of 0.4 and ToxiPred was used to predict toxicity.
Protein
|
Peptides (position)
|
Alleles
|
Antigenicity
|
Toxicity
|
CD2v
|
LVYSRNRINYTINLL (96–110)
|
HLA-DRB3*02:02, HLA-DRB3*01:01, HLA-DRB1*15:01, HLA-DRB1*07:01, HLA-DRB1*03:01, HLA-DRB5*01:01, HLA-DRB4*01:01
|
0.8727
|
Non-toxic
|
|
|
HLA-DPA1*01:03/DPB1*08:01, HLA-DPA1*01:03/DPB1*05:01, HLA-DPA1*01:03/DPB1*01:01, HLA-DPA1*01:03/DPB1*02:02, HLA-DPA1*01:03/DPB1*06:01, HLA-DPA1*01:03/DPB1*03:01, HLA-DPA1*01:03/DPB1*04:02.
|
|
|
|
|
HLA-DQA1*01:01/DQB1*02:05, HLA-DQA1*01:01/DQB1*02:01, HLA-DQA1*01:01/DQB1*02:02, HLA-DQA1*01:01/DQB1*02:04.
|
|
|
C-type like lectin
|
TKKYNYESGYWVNYS (115–129)
|
HLA-DRB3*01:01, HLA-DRB1*15:01, HLA-DRB3*02:02, HLA-DRB1*07:01, HLA-DRB5*01:01, HLA-DRB1*03:01, HLA-DRB4*01:01
|
1.0513
|
Non-toxic
|
|
|
HLA-DPA1*01:03/DPB1*04:01, HLA-DPA1*02:01/DPB1*105:01, HLA-DPA1*03:02/DPB1*04:01, HLA-DPA1*04:01/DPB1*126:01,
|
|
|
|
|
HLA-DQA1*05:01/DQB1*05:06, HLA-DQA1*03:03/DQB1*06:28, HLA-DQA1*01:02/DQB1*03:14,
|
|
|
p72
|
HHAEISFQDRDTALP (502–516)
|
HLA-DRB4*01:01, HLA-DRB3*01:01, HLA-DRB1*03:01, HLA-DRB1*15:01, HLA-DRB3*02:02, HLA-DRB1*07:01, HLA-DRB5*01:01
|
2.0206
|
Non-toxic
|
|
|
HLA-DPA1*01:06/DPB1*76:01, HLA-DPA1*02:04/DPB1*41:01, HLA-DPA1*03:01/DPB1*109:01, HLA-DPA1*01:03/DPB1*03:01.
|
|
|
|
|
HLA-DQA1*01:01/DQB1*02:01, HLA-DQA1*01:04/DQB1*04:03, HLA-DQA1*01:02/DQB1*05:12.
|
|
|
B-cell epitopes predictions. Linear B cell epitopes were selected using ABCpred server whereas discontinuous B cell epitopes were predicted from the proteins using IEDB Discotope 2.0. The predicted linear B cell epitopes were ranked according to their scores obtained by trained recurrent neural network. Higher score of the peptide indicates the higher probability to be an epitope. Out of the 14mers generated (Supplementary Tables 13–16) from input sequences, 6 B-cells (CD2v-1, C-type lectin-4, and p72-2 as linear epitopes; p72-2 as discontinuous epitopes) were short listed based on their scores and VaxiJen antigenicity (Tables 3 and 4).
Table 3
B-cell receptors linear epitopes prediction with ABCpred server. Antigenicity and toxicity were predicted using VaxiJen and ToxiPred, respectively.
Protein
|
Peptides (position)
|
Score
|
Antigenicity
|
Toxicity
|
CD2v
|
SLITCEKTNGTNIR (125–138)
|
0.68
|
0.6030
|
Non-toxic
|
C-type lectin
|
NDTNLLNLTKKYNY (107–120)
|
0.87
|
1.1367
|
Non-toxic
|
p72
|
CSHTNPKFLSQHFP (268–281)
|
0.68
|
0.6627
|
Non-toxic
|
DITPITDATYLDIR (294–307)
|
0.58
|
1.7313
|
Non-toxic
|
IFN-γ epitopes prediction. IFNepitope server was employed to predict out the IFN-γ epitopes. Predicted epitopes with strong binding affinities (based on low percentile rank) were screened further using allele criteria promiscuity, antigenicity and non-toxicity to select the best epitopes (Supplementary Tables 17–19). Using these criteria, 3 IFN-γ (CD2v-1, C-type lectin-1, and p72-1) were selected (Table 5).
Table 4
B-cell receptors discontinuous epitopes with Discotope 2.0. Antigenicity and toxicity were predicted using VaxiJen and ToxiPred, respectively.
Protein
|
Peptides (position)
|
Antigenicity
|
Toxicity
|
p72
|
FPGLFVRQSRFIAGR (371–385)
|
0.4942
|
Non-toxic
|
|
SRRNIRFLPWFIPGV (387–401)
|
1.5164
|
Non-toxic
|
Table 5
IFN-γ epitopes predicted against 3D structure of the ASFV proteins CD2v, C-type lectin and p72. Antigenicity and toxicity of the epitopes were predicted using VaxiJen and ToxiPred servers, respectively.
Protein
|
Peptides (position)
|
Score
|
Antigenicity
|
Toxicity
|
CD2v
|
TVTLNSNINSETEGI (25–40)
|
12
|
0.2457
|
Non-Toxic
|
C-type lectin
|
ESGYWVNYSLANNQS (120–135)
|
1
|
0.7730
|
Non-Toxic
|
p72
|
SNIKNVNKSYGKPDP (30–45)
|
6
|
1.0862
|
Non-Toxic
|
Multi‑epitope subunit vaccine construction, structural modelling and validation. The vaccine construct was developed by joining epitopes with the highest prediction scores. A total of 22 epitopes comprising 10 CTL epitopes, 3 HTL, 6 B-cell epitopes and 3 IFN-γ epitopes were selected based on their prediction score and were joined together using their corresponding linkers. HTL epitopes, B‐cell epitopes, IFN-γ and CTL epitopes were joined together using GPGPG, KK and AAY linkers, respectively. To boost the vaccine construct immunogenicity, a 45 amino acid‐long sequence of Sus scrofa β‐defensin was added as an adjuvant at the N‐terminal of the construct with the aid of EAAAK linker. EAAAK linkers aid in limiting the interaction of the adjuvant with other protein regions with efficient separation and bolstering stability (25). At the C terminal end TAT protein which is cell penetrating was appended to the construct to help it in intracellular delivery (Fig. 2A). The final vaccine construct was 437 amino acid residues long and was visualized using PyMol (Fig. 2B).
Vaccine construct’s physicochemical parameters. ExPASy ProtParam tool revealed various physiochemical properties of the vaccine construct. The construct is composed of 437 amino acid residues with a molecular weight of 48991.45 g/mol, theoretical pI of 9.84, revealing that the vaccine construct is basic. The instability index (II) score of 34.91 classifies the vaccine construct as being a stable protein in nature since values above 40 are regarded as being unstable in nature. The GRAVY index of -0.708, validates the hydrophilic nature of the vaccine construct revealing that it can form interactions with surrounding water molecules. The aliphatic index 59.70 shows that the construct is thermostable in nature. The estimated half-life was 1 h (mammalian reticulocytes, in vitro), 30 min (yeast, in vivo) and greater 10 hours (E. coli, in vivo) (Supplementary material 2). The antigenicity of the vaccine construct was predicted using VaxiJen server and was found to be a good antigen with a score of 0.6782 with the default set threshold value of 0.4 for viruses. Furthermore, AlgPred server was employed to determine the allergenicity of the vaccine construct. using a hybrid approach (SVMc + IgE epitope + ARPs BLAST + MAST). The vaccine construct was observed to be non-allergenic. The 3D structure of the vaccine construct was modelled from its linear structure using trRosetta; then, the modelled structure was refined using ModRefiner. Thereafter, the 3D structure was validated using ERRAT score at the ERRAT server (https://servicesn.mbi.ucla.edu/ERRAT/) which evaluates the calculation of unbounded interactions in the vaccine construct structure (Fig. 2C). Ramachandran plot analysis was also generated through ERRAT server (Fig. 2D). This was proceeded by ProSA-web analysis to validate the structure further based on Z-score predicted (Fig. 2E).
Molecular docking of vaccine construct (ligand) with immune receptors (TLR-9) and SLA-1. To optimize the interaction affinity between the vaccine construct and TLR‐9 (PDB:3WPB)/SLA-1 (PDB: 3QQ4), molecular docking was carried out using Haddock 2.4 server. For vaccine construct-TLR-9 interaction, the server clustered 138 structures in 12 cluster(s), which represents 69% of the water-refined models generated. The top cluster is the most reliable according to HADDOCK. Its Z-score of -1.6, indicates how many standard deviations from the average this cluster is located in terms of score. The HADDOCK score was − 30.2 +/- 8.2. The best cluster was selected from the docked clusters depending on lowest HADDOCK score. Thereafter, HADDOCK Refinement Interface was employed to refine the chosen cluster wherein HADDOCK clustered 10 structures in 1 cluster(s), which represents 100% of the water-refined models HADDOCK generated. The top cluster had a Z-score of 0.0 and HADDOCK score of -171.8 +/- 4.1. Docking of the vaccine construct-SLA-I receptors was also carried out using HADDOCK 2.4. HADDOCK clustered 154 structures in 6 cluster(s), which represents 77% of the water-refined models HADDOCK generated. The top cluster had a Z-score of -1.7 and HADDOCK score of -84.0 +/- 5.9. When HADDOCK Refinement was carried out, HADDOCK clustered 10 structures in 1 cluster(s), which represents 100% of the water-refined models HADDOCK generated. The top cluster had a Z-score of 0.0 and HADDOCK score of -106.5 +/- 3.0. The best structure after refinement from each docked complex were chosen and their binding affinity was determined using PRODIGY web server (Table 6). The low binding energy signifies the formation of a strong complex. Docking interaction between vaccine construct and TLR-9 revealed 14 H-bonds and 5 salt bridges that are involved in the interaction. In a like manner, docking analysis between vaccine construct and SLA-1 uncovered 14 hydrogen bonds and 4 salt bridges formed during the interaction (Fig. 3A). Also, in the docked vaccine construct-SLA-1 complex, the vaccine construct interactions are mainly with the PBG pockets A, E and F. In pocket A, the vaccine construct K427 interacted with the SLA-1 E63 through H-bond and salt bridge; L163 of the vaccine construct interacted with the L163 through non-bonded interactions. In the E pocket, E416 of the vaccine construct interacted with the R156 through H-bond and salt bridge; also, R413 of the vaccine construct interacted with E152 and R114 through salt bridge and non-bonded interactions, respectively. In the F pocket, the vaccine construct interacted P405 interacted with T80 through H-bond and non-bonded interaction, while P406 interacted with K146 through H-bonding. Also in the F pocket, T398 of the vaccine construct bond to T73 through H-bond, R413 of the vaccine construct bond to T74 through H-bond and non-bonding interactions, with T147 through non-bonded interactions and R114 through non-bonded interaction (Fig. 3B).
Table 6
The binding affinity (ΔG) and dissociation constant (Kd) predicted values of the docked complexes vaccine construct-TLR-9 and vaccine construct-SLA-1 at 25.0℃. The prediction was carried out at PRODIGY server.
Vaccine construct-Receptor complex
|
Gibbs free energy (kcal mol− 1)
|
Kd (M)
|
Vaccine construct-TLR-9
|
-17.0
|
3.5E-13
|
Vaccine construct-SLA-1
|
-8.9
|
3.1E-07
|
Molecular dynamics simulation. To estimate the stability of the vaccine construct- immune receptor complexes, molecular dynamic simulation of the refined docked complexes (vaccine constrcuct-TLR-9 and vaccine construct-SLA-1) were carried out using GROMACS. The compactness of the two simulation systems around their protein axes was revealed by the radius gyration plots. Vaccine construct-TLR-9 and vaccine construct-SLA-1 complexes had the mean gyration values of 4.0 nm and 4.5 nm, respectively (Fig. 4A-B). This result indicates a relatively stable folded structure of the two complexes. For the density of the docked complexes, vaccine construct-TLR-9, the simulation system had 1007.78 kg/m3 with a total drift of 0.086 kg/m3 while that of the vaccine construct-SLA-1 was 1000.69 kg/m3 with a total drift of 0.073 kg/m3(Fig. 4C-D). The temperature and pressure of the simulation system during the production run was around 300 K and 1.3 bar, respectively for the two complexes showing a stable system and successful molecular dynamics simulation run. (Figs. 4E–H). The RMSD of vaccine construct-TLR-9 complex had some fluctuations during the simulation (Fig. 4I). Vaccine construct-SLA-1 had fluctuations 0–4 ns, thereafter, it became stable and the RMSD value remained around 1.1 nm, indicating that the conformation of this complex was stable (Fig. 4J). RMSF plots for the two complexes had high peaks indicating high degrees of flexibility in the vaccine construct (Fig. 4K-L).
Codon optimization of the vaccine construct and in silico cloning. Codon adaptation carried out using JCat was used to optimize the codon usage of the vaccine construct in E. coli (strain K12) and to obtain the gene sequence of the vaccine construct. Codon adaptation index was 0.92, GC content was 53.01% and the optimized codon sequences were 1311 nucleotides long. To obtain good protein expression, the GC content is expected to be in the range of 30%-70%; Therefore, the GC content of 53.01% is indicative of vaccine construct good expression in E. coli. To ensure translation efficiency, RNAfold web server (26) was employed to predict the secondary structure of the optimized codon sequences. The predicted secondary structure of the optimized sequence was found to be stable and it lacked hairpin loop or pseudoknot at its starting site (Fig. 5A). Certain codon features like restriction enzyme cleavage sites (BamHI and EcoRI) and prokaryotic ribosome binding sites in the E. coli plasmid pET-28(+) using the SnapGene software (https://www.snapgene.com/) were also taken into cognizance to ensure successful expression of the gene sequence. The whole clone was 6665 bp in size (Fig. 5B). For immune simulation, antibody response was observed with high levels of IgM and IgG antibodies which is common in virus vaccinations (Fig. 5C). Also, high levels of cytokine, T-helper cells and cytotoxic T-cells were observed (Fig. 5D-F). These observations point to an evidence a potent immune response that would be induced with the vaccine construct.