3.1 T - cell epitopes prediction
The initial screening of amino acid sequences of all 5 proteins for antigenicity, showed a score greater than the threshold value of 0.4 indicating probable antigens, these sequences were then submitted to NetCTL server to predict possible CTL epitopes, which resulted in 37 possible epitopes for S. protein, out of which 14 showed no toxicity and 8 positive immunogenicity score. Ultimately, the top 4 epitopes were selected for inclusion in the multi-epitope vaccine construct. For M. protein, 10 epitopes were predicted, 5 epitopes were non-toxin and one showed a positive immunogenicity score. For E. protein, 3 epitopes were predicted, two of which showed an antigenicity score higher than the threshold value and non-toxic, but neither showed a positive immunogenicity score, hence, not included in the vaccine construct.
For N. protein, 9 epitopes were predicted, 6 showed an antigenicity score higher than the threshold value, all 6 predicted epitopes were non-toxin, of which, five showed positive immunogenicity score, and only the top two were selected to be included in the construct. For the nonstructural polyproteins, on the other hand, 170 epitopes were predicted, of which 96 showed antigenicity score higher than the threshold, and the best 12 were selected based on toxigenicity and immunogenicity results, Table 1.
Table 1
CTL predicted epitopes of all five proteins with the antigenicity, toxicity and immunogenicity results
Protein
|
peptide
|
length
|
Score
|
toxicity
|
immunogenicity
|
S.
|
QLTPTWRVY
|
9 mer
|
1.2119
|
Non-toxic
|
0.31555
|
VLPFNDGVY
|
9 mer
|
0.4642
|
Non-toxic
|
0.1815
|
WTAGAAAYY
|
9 mer
|
0.6306
|
Non-toxic
|
0.15259
|
CNDPFLGVY
|
9 mer
|
0.4295
|
Non-toxic
|
0.15232
|
M.
|
AGDSGFAAY
|
9 mer
|
0.9095
|
Non-toxic
|
0.03981
|
N.
|
LSPRWYFYY
|
9 mer
|
1.2832
|
Non-toxic
|
0.35734
|
DLSPRWYFY
|
9 mer
|
1.7645
|
Non-toxic
|
0.25933
|
ORF1a
|
VSDIDITFL
|
9 mer
|
2.2906
|
Non-toxic
|
0.38916
|
TLRVEAFEY
|
9 mer
|
0.4509
|
Non-toxic
|
0.34997
|
HVGEIPVAY
|
9 mer
|
0.6413
|
Non-toxic
|
0.28861
|
STNVTIATY
|
9 mer
|
0.7143
|
Non-toxic
|
0.25822
|
LVSDIDITF
|
9 mer
|
1.7830
|
Non-toxic
|
0.2541
|
NGDVVAIDY
|
9 mer
|
0.6625
|
Non-toxic
|
0.25105
|
VVDYGARFY
|
9 mer
|
0.4908
|
Non-toxic
|
0.18539
|
GTDPYEDFQ
|
9 mer
|
0.5315
|
Non-toxic
|
0.17381
|
VTNNTFTLK
|
9 mer
|
0.7146
|
Non-toxic
|
0.16567
|
ETSWQTGDF
|
9 mer
|
1.3140
|
Non-toxic
|
0.13449
|
FMGRIRSVY
|
9 mer
|
0.5212
|
Non-toxic
|
0.1259
|
VVVNAANVY
|
9 mer
|
0.4078
|
Non-toxic
|
0.10048
|
The HTL epitopes prediction with MHC-II binding tool of IEDB and based on percentile rank less than 10, resulted in 17 epitopes for S. protein, 12 were non-allergenic, 10 were non-toxic and a single epitope showed a positive interferon-gamma induction result. For M. protein were the predicted HTL epitopes were 55, out of which 43 were non-allergenic antigenic non-toxic epitopes, and only 3 epitopes showed positive interferon-gamma induction results. None of the predicted HTL epitopes of N. protein showed interferon-gamma positive results, therefore none were in the vaccine construct. Similarly, all HTL epitopes predicted for E. protein failed to pass either the antigenicity, allergenicity, or interferon-gamma induction assessment. Out of 96 HTL predicted epitopes for polyproteins of ORF1a, only 6 epitopes passed the antigenicity, allergenicity, toxigenicity, and interferon-gamma induction assessment, results are shown in Table 2.
Table 2
HTL predicted epitopes of all five proteins, antigenicity score and IFN-gamma score
Protein
|
peptide
|
length
|
Antigenicity
|
IFN-gamma
|
S.
|
TRFASVYAWNRKRIS
|
15 mer
|
0.4963
|
0.7315567
|
FQTLLALHRSYLTPG
|
15 mer
|
0.5789
|
0.26071055
|
QPYRVVVLSFELLHA
|
15 mer
|
0.9109
|
0.60855322
|
M.
|
SRTLSYYKLGASQRV
|
15 mer
|
0.5731
|
0.09462399
|
LVGLMWLSYFIASFR
|
15 mer
|
0.5535
|
0.20134879
|
ORF1a
|
VSTQEFRYMNSQGLL
|
15 mer
|
0.4972
|
0.97632046
|
AALGVLMSNLGMPSY
|
15 mer
|
0.8521
|
0.11429986
|
TLNGLWLDDVVYCPR
|
15 mer
|
0.4558
|
0.04915351
|
AYESLRPDTRYVLMD
|
15 mer
|
0.5553
|
0.30777655
|
SAGIFGADPIHSLRV
|
15 mer
|
0.5839
|
0.24837266
|
MFTPLVPFWITIAYI
|
15 mer
|
0.6806
|
0.1415124
|
3.2 B-cell epitopes prediction
The B-cell epitopes are an important part of the multi-epitope vaccine because recognition of these epitopes by B lymphocytes elicit antibody production, which is a key process in adaptive immunity. For all five proteins, linear B-cell epitopes were predicted using Bepipred Linear Epitope Prediction 2.0 method, Emini surface accessibility prediction method, and Kolaskar & Tongaonkar antigenicity method, these methods were selected because they assess properties that are important for predicting potential epitopes, such as antigenicity, surface accessibility, and flexibility. The resultant plots were then inspected for overlapping regions showing epitopes by the three methods. The only protein to show such an overlapping region was N. protein with a sequence of 10 amino acids from 380–390. The results of all amino acid sequences are shown in Fig. 1.
3.3 Population coverage
The selected epitopes were then analyzed to determine the percentage of the world population coverage for MHC-I and MHC-II alleles. The MHC-I alleles assessed included; HLA-B*15:01, HLA-A*30:02, HLA-A*01:01, HLA-B*40:01, HLA-B*07:02, HLA-B*51:01, HLA-A*68:02, HLA-A*02:01, HLA-A*02:06, HLA-B*08:01, HLA-A*02:03, HLA-A*33:01, HLA-A*24:02, HLA-A*23:01, HLA-B*44:03, HLA-B*44:02, HLA-A*31:01, HLA-B*53:01, HLA-A*11:01, HLA-A*68:01, HLA-A*30:01, HLA-B*57:01, HLA-A*03:01, HLA-A*26:01, HLA-B*58:01, HLA-A*32:01, HLA-B*35:0. The world coverage for these alleles was 98.55%. For MHC-II, the alleles assessed included; HLA-DRB1*07:01, HLA-DRB1*15:01, HLA-DRB3*01:01. The coverage for these alleles was 49.02. The combine allele coverage for both MHC-I and MHC-II was found to be 99.26% which indicates a high population coverage for selected epitopes. Figure 2.
3.4 Multi-epitope vaccine construction
For the construction of the final vaccine construct, the most appropriate predicted epitopes were selected, this included one B-cell linear epitope from N. protein, 4 CTL and 3 HTL epitopes from S. protein, one CTL and 2 HTL epitopes from M. protein, 2 CTL epitopes from N. protein, 12 CTL and 6 HTL epitopes from ORF1a. These epitopes were joined together with two types of linkers, AAY for linear B-cell and CTL epitopes, and GPGPG for HTL epitopes, with cysteine residue at the N-terminal and EPEA tag at C-terminal, this yielded the following 468 amino acid peptide chain:
CQALPQRQKKQQAAYQLTPTWRVYAAYVLPFNDGVYAAYWTAGAAAYYAAYCNDPFLGVYAAYAGDSGFA
AYAAYLSPRWYFYYAAYSPDDQIGYYAAYVSDIDITFLAAYTLRVEAFEYAAYHVGEIPVAYAAYSTNVTIATY
AAYLVSDIDITFAANGDVVAIDYAAYVVDYGARFYAAYGTDPYEDFQAAYVTNNTFTLKAAYETSWQTGDFA
AYFMGRIRSVYAAYVVVNAANVYGPGPGTRFASVYAWNRKRISGPGPGFQTLLALHRSYLTPGGPGPGQPYR
VVVLSFELLHAGPGPGSRTLSYYKLGASQRVGPGPGLVGLMWLSYFIASFRGPGPGVSTQEFRYMNSQGLLG
PGPGAALGVLMSNLGMPSYGPGPGTLNGLWLDDVVYCPRGPGPGAYESLRPDTRYVLMDGPGPGSAGIFGA
DPIHSLRVGPGPMFTPLVPFWITIAYIGPGPGEPEA
3.5 Physiochemical properties of the vaccine construct
The results obtained from the ProtParam server, showed that the novel vaccine construct has a molecular weight of 50.417 Ka which is an optimum molecular weight for an antigenic protein. The theoretical PI for the construct was 5.41 indicating an acidic nature, with a total of 30 negatively charged residues and 25 positively charged residues. The estimated half-life is 1.2 hours in mammalian reticulocytes in vitro, > 20 hours in yeast in vivo, and > 10 hours in E. coli in vivo, indicating a good construct for future cloning. The instability index was computed to be 30.79 suggesting stable protein. The aliphatic index of 75.38, which indicates a thermostable protein. The grand average of hydropathicity (GRAVY) was 0.040, a positive value close to zero means a slightly hydrophobic molecule.
3.6 Vaccine modeling and structure analysis
Based on the amino acid sequence of the vaccine construct, the result of the PSIpred server revealed different secondary structures. This is considered a primary step towards predicting the three-dimensional structure of the protein. Figure 4
The 3D protein model was then predicted with two modeling approaches; threading model with IntFOLD server and Ab initio modeling with the trRosetta server, the resultant models were then analyzed with Ramachandran plot and ProSA-web z-score based on X-ray crystallography and NMR analysis, the best-predicted model showed 98% of the residues in the favorable region in Ramachandran plot, Fig. 4A, and z-score of – 6.01, determined by x-ray crystallography, Fig. 4B.
The statistics of non-bonded interactions between different atom types, and then the error function value was plotted against a position of a-9 residue sliding window, calculated by comparison with statistics from highly refined structures, carried out using ERRAT server, and the calculated error value obtained was 81.928, which falls well below 91% indicating a relatively average overall quality for the selected protein model, this is can be justified by the fact that the modeling process was carried out using ab initio modeling approach. Figure 5A.
3.7 Molecular docking and dynamics
The final vaccine construct was docked with Toll-like receptor 3 (PDB ID: 1ziw) using the FRODOCK server. The docked vaccine-receptor complex was then prepared for simulation using a protein-prep wizard and Pymol software using the default settings, the molecular dynamics simulation was then carried out using the Desmond tool and Superpose1.0 server (http://superpose.wishartlab.com) for calculating the root mean square deviation (RMSD) value of 3.78 which suggests a relatively poor binding pose at the site of the receptor and vaccine binding. Figure 5B.
3.8 Immune response simulation
Measuring the immune response is a pivotal step for vaccine designing, this contingent on a number of algorithms that make use of mathematical models to illustrate the fine details of the immunological process. In the present study, the C-ImmSim server was used to simulate immune response with the candidate vaccine construct. Simulation with this tool focuses on B-cell epitope binding, class I and II HLA epitope binding, and the binding of the T-cell receptor to HLA-peptide complexes, the tool then details the dynamics of immune cells populations and the molecules involved in the immune response [47].
The simulation results showed an increased and sustained level of B- memory and active cells, and a high level of IgM, which represents the primary response against the antigen, this suggests effective humoral response, Fig. 6. A & B. T helper cell population showed very promising results, as the levels of memory helper cells and active T helper cells remained high for the entire period of simulation, suggesting prolonged humoral and cell-mediated immune response, Fig. 6. C & D. The results of the T cytotoxic cell population steady level of the memory cells, while the active cell population showed an increased level throughout the stimulation period, Fig. 6. E & F. The result of different immunoglobulin isotopes showed high level in the first two weeks followed by a gradual decline, similar result was shown by interferon-gamma level, this can be viewed as a positive point, hence, the first two weeks are considered detrimental for the course and outcome of the disease. [48] Fig. 6. G & H.
3.9 In silico molecular cloning
The DNA sequence produced by Jcat showed a GC content of 56% and a codon adaptation index of 1.0, which indicate a stable DNA sequence and a high level of protein expression. Figure 7.