Prediction and selection of epitopes
In this study, 405 amino acid sequence of TFDP3 (Uniprot ID: Q5H9I0), epitopes were predicted for B cells, CD8+ T lymphocytes and CD4+ T lymphocytes. The identification of B lymphocyte epitopes will induce the release of antibodies, stimulating humoral immunity. The epitopes of CD8+ T lymphocytes and CD4+ T lymphocytes interact with MHC class I and class II molecules, respectively, and can trigger the activation of dendritic cells, production of IFN-γ and induction of the apoptosis process in malignant neoplastic cells (Jahangirian et al., 2022). In addition, the epitopes of these cell types must have antigenicity, non-allergenicity, non-toxicity and the ability to induce the production of interferon-gamma, specifically the latter in the case of CD4+ T lymphocytes.
A total of 40 linear epitopes (each 16 amino acids long) of antigenic B lymphocytes were identified, half of which showed probable antigenicity (Table S1). Of these epitopes, 14 were classified as probably non-allergenic and non-toxic (Table 1).
Table 1: Predicted B cell epitopes and their immunogenic properties.
Sequence
|
Position
|
Vaxijen score
|
Allergenicity
|
Toxicity
|
TSSGGSQYSGSRVETP
|
368-383
|
1.0805
|
Non-allergenic
|
Non-toxic
|
RVERQKRLERIKQKQS
|
201-216
|
1.6392
|
Non-allergenic
|
Non-toxic
|
CSISDDKSEYLFKFNS
|
269-284
|
0.7976
|
Non-allergenic
|
Non-toxic
|
SSPPWAGQHNRKGEKN
|
92-107
|
0.7683
|
Non-allergenic
|
Non-toxic
|
DVKNIKRRTYDALNVL
|
156-171
|
1.0012
|
Non-allergenic
|
Non-toxic
|
SGSCSAEDLKMARNLV
|
306-321
|
0.7467
|
Non-allergenic
|
Non-toxic
|
PKALEPYVTEMAQGTF
|
322-337
|
0.5497
|
Non-allergenic
|
Non-toxic
|
LMDENQTSRPVAVHTS
|
17-32
|
0.5233
|
Non-allergenic
|
Non-toxic
|
LSMKVWETVQRKGTT
|
114-129
|
0.7633
|
Non-allergenic
|
Non-toxic
|
EDLKMARNLVPKALEP
|
312-327
|
0.5484
|
Non-allergenic
|
Non-toxic
|
PQRPAASNIPVVGSPN
|
62-77
|
0.8037
|
Non-allergenic
|
Non-toxic
|
TNSAQNCQNLRVERQK
|
191-206
|
0.9420
|
Non-allergenic
|
Non-toxic
|
PNESAYDVKNIKRRTY
|
150-165
|
0.6101
|
Non-allergenic
|
Non-toxic
|
FRAASNHASPNESAYD
|
141-156
|
0.5059
|
Non-allergenic
|
Non-toxic
|
Regarding the epitopes of CD8+ T lymphocytes for MHC class I, 17 linear epitopes (length of 9 amino acids each) were identified that showed probable antigenicity in relation to the representative HLA-A and HLA-B (Human Leukocyte Antigens) alleles (Table S2). Of these epitopes, 8 were classified as probable non-allergenic and probable non-toxic (Table 2).
Table 2: Predicted epitopes of CD8+ T lymphocytes for MHC class I antigens their immunogenic properties.
Sequence
|
Position
|
MHC I Alleles
|
Vaxijen score
|
Allergenicity
|
Toxicity
|
KVWETVQRK
|
117-125
|
HLA-A*03:01
HLA-A*11:01
HLA-A*30:01
HLA-A*31:01
|
0.7471
|
Non-allergenic
|
Non-toxic
|
DALNVLMAM
|
166-174
|
HLA-A*26:01
HLA-B*35:01
|
0.8599
|
Non-allergenic
|
Non-toxic
|
KRLERIKQK
|
206-214
|
HLA-B*27:05
|
1.5955
|
Non-allergenic
|
Non-toxic
|
RIKQKQSEL
|
210-218
|
HLA-B*07:02
HLA-B*08:01
HLA-A*30:01
|
1.5326
|
Non-allergenic
|
Non-toxic
|
SISDDKSEY
|
270-278
|
HLA-A*01:01
HLA-A*26:01
HLA-B*15:01
HLA-B*35:01
|
0.6114
|
Non-allergenic
|
Non-toxic
|
ISDDKSEYL
|
271-279
|
HLA-A*01:01
|
0.9410
|
Non-allergenic
|
Non-toxic
|
AGSRSNGTW
|
344-352
|
HLA-B*58:01
|
0.5672
|
Non-allergenic
|
Non-toxic
|
ATSSGGSQY
|
367-375
|
HLA-A*01:01
HLA-A*11:01
HLA-A*26:01
HLA-B*15:01
|
1.3318
|
Non-allergenic
|
Non-toxic
|
HLA - Human Leukocyte Antigen, *- indicates that HLA typing and its corresponding location in chromosome.
When the epitopes of CD4+ T lymphocytes for MHC class II were evaluated, 6 linear epitopes (length of 15 amino acids) were identified that showed probable antigenicity in relation to the representative HLA-DR alleles (Table S3). Of these epitopes, four were classified as probably non-allergenic, non-toxic, and INF-γ inducing (Table 3).
Table 3: Predicted epitopes of CD4+ T lymphocytes for MHC class II antigens their immunogenic properties.
Sequence
|
Position
|
MHC II Alleles (HLA)
|
Vaxijen score
|
Allergenicity
|
Toxicity
|
INF- γ Inducers
|
ELKVLMDENQTSRPV
|
91-105
|
DRB1*03:01DRB1*03:05DRB1*04:01DRB1*04:02DRB1*14:02DRB3*01:01
|
0.6150
|
Non-allergenic
|
Non-toxic
|
Inducer
|
SPPWAGQHNRKGEKN
|
171-185
|
DRB1*01:03
DRB1*04:01
DRB1*04:02
DRB1*04:08
|
0.8473
|
Non-allergenic
|
Non-toxic
|
Inducer
|
VGELVAKFRAASNHA
|
212-226
|
DRB1*11:01
DRB1*11:04
DRB1*13:01
|
0.6633
|
Non-allergenic
|
Non-toxic
|
Inducer
|
VAKFRAASNHASPNE
|
216-230
|
DRB1*01:01 DRB1*01:02 DRB1*01:03 DRB1*04:01 DRB1*04:02 DRB1*04:03 DRB1*04:04 DRB1*04:05 DRB1*04:08 DRB1*07:01 DRB1*08:01 DRB1*08:03 DRB1*09:01 DRB1*10:01 DRB1*11:01 DRB1*13:02 DRB1*13:03 DRB1*14:02 DRB3*16:01 DRB3*02:02 DRB5*01:01
|
0.6149
|
Non-allergenic
|
Non-toxic
|
Inducer
|
HLA - Human Leukocyte Antigen.
Analysis of the population distribution of the predicted epitopes for MHC class I and class II
The high polymorphism of MHC class I and class II proteins play a key role in adaptive immunity (Wieczorek et al., 2017). HLA alleles have a different distribution in different countries and geographic regions of the world. Because of these factors, the analysis of the population coverage of epitopes is an important step for their validation as eligible to compose a multi-epitope vaccine. The epitopes evaluated were those that did not overlap in relation to the position of the amino acids to explore more distinct regions of the amino acid sequence of TFDP3. In relation to MHC class I, the following epitopes were evaluated: KVWETVQRK (117-125), ATSSGGSQY (367-375) and SISDDKSEY (270-278). In MHC class II, the epitopes were: ELKVLMDENQTSRPV (91-105), SPPWAGQHNRKGEKN (171-185) and VAKFRAASNHASPNE (216-230). The combination of epitopes using IEDB tool showed a population coverage of 93.55% worldwide, 95.88% in Europe, 94.93% in the United States, 85.48% in China, 88.94% in Japan, and 79.29% in South America (Supplementary Figure 1; Figure 1). In Brazil, population coverage is 84.45% (Figure 01). Considering the population coverage in relation to ethnicity in Brazil, it was 67.87% for Amerindians (indigenous people of the American continent), 95.06% for Caucasoids, 93.85% for mixed race and 79.84% for mulatto (Supplementary Figure 2).
Construction of the multi-epitope vaccine
The multi-epitope vaccine was designed using epitopes that did not overlap, had broad population coverage, and immunogenic characteristics. The selected epitopes of B lymphocytes were: RVERQKRLERIKQKQS (201–216), TNSAQNCQNLRVERQK (191–206), SGSCSAEDLKMARNLV (306–321). On the other hand, the MHC epitopes of class I and class II were those evaluated in population coverage: MHC class I: KVWETVQRK (117-125), ATSSGGSQY (367-375) and SISDDKSEY (270-278); MHC class II: ELKVLMDENQTSRPV (91-105), SPPWAGQHNRKGEKN (171-185) and VAKFRAASNHASPNE (216-230). These epitopes have been joined by ligands widely used in the construction of peptide vaccines that can make the protein stable (Nezafat et al., 2014), increase immunogenicity (Avvagari et al., 2020), and aid in vaccine purification (Herrera, 2020). The ligands used were EAAAK in the N-terminal portion binding to the adjuvant, KK among the epitopes of B lymphocytes, GPGPG among MHC class II epitopes (CD4+ T lymphocyte), AAY among MHC class I epitopes (CD8+ T lymphocyte) and a Histidine tail (6xHist) in the terminal C portion. The adjuvant used was Mycobacterium tuberculosis Rv0652 ribosomal protein L7/L12 (MAKLSTDELLDAFKEMTLLELSDFVKKFEETFEVTAAAPVAVAAAGAAPAGAAVEAAEEQSEFDVILEAAGDKKIGVIKVVREIVSGLGLKEAKDLVDGAPKPLLEKVAKEAADEAKAKLEAAGATVTVK) with the aim of increasing immunogenicity (Lee et al., 2014; Mahdevar et al., 2022) (Figure 2). After joining the epitopes, a vaccine with a sequence of 291 amino acids was obtained.
Homology and physicochemical properties of the vaccine
The homology of the multi-epitope vaccine in relation to human proteins showed identity of 35.86% with TFPD3, 49.32% with E2F protein and 40.62% with transcription factor Dp-1 and its isoforms (Table 4). The alignment of the proteins that presented identity with the multi-epitope vaccine presents a percentage ranging from 26% to 13% of coverage. Being 26% corresponds to the target of the study, that is, it has a low risk of developing autoimmunity and affecting normal tissues.
Table 4: Homology of the vaccine in relation to human proteins.
Description
|
Max Score
|
Total Score
|
Query Cover
|
E value
|
Per. ident
|
Acc. Len
|
Transcription factor of Dp family member 3 (NP_057605.3)
|
67.8
|
107
|
26%
|
2E-11
|
35.86%
|
405
|
Proteína E2F-like (AAF37562.2)
|
65.5
|
104
|
22%
|
8E-11
|
49.32%
|
345
|
Transcription factor Dp-1 isoform X6
(XP_016876208.1)
|
40.0
|
40.0
|
13%
|
0.019
|
40.62%
|
337
|
Transcription factor Dp-1, isoform CRA_b (KAI2569993.1)
|
39.7
|
39.7
|
13%
|
0.016
|
40.62%
|
236
|
Transcription factor Dp-1(EAX09217.1)
|
39.7
|
39.7
|
13%
|
0.022
|
40.62%
|
312
|
Transcription factor Dp-1(NP_009042.1)
|
39.7
|
39.7
|
13%
|
0.024
|
40.62%
|
410
|
Transcription factor Dp-1 isoform X4
(XP_005268385.1)
|
39.7
|
39.7
|
13%
|
0.024
|
40.62%
|
406
|
Transcription factor Dp-1(AAN46090.1)
|
39.7
|
39.7
|
13%
|
0.026
|
40.62%
|
406
|
Transcription factor Dp-1 isoform X2 (XP_047286517.1)
|
39.3
|
39.3
|
13%
|
0.035
|
40.62%
|
472
|
Transcription factor Dp-1 isoform X1 (XP_016876206.1)
|
39.3
|
39.3
|
13%
|
0.035
|
40.62%
|
476
|
The multi-epitope vaccine is antigenic (Vaxijen v.2 = 0.7018), non-allergenic, and has a low risk of toxicity (only the KKTNSAQNCQ peptide sequence (position: 154) (SVM score = 0.04) showed low toxicity). Regarding the physicochemical properties, the multi-epitope vaccine has a molecular weight of 31,142.24 Da, an isoelectric point of 9.05 that can be used in purification through isoelectric precipitation. The solubility of the vaccine after overexpression in Escherichia coli was about 0.9609 (96.1%). In addition, it has an instability index of 31.94 that classifies the vaccine as stable, an aliphatic index of 71.24 indicating thermal stability, and a hydropathy index (GRAVY) of –0.619 that classifies it as hydrophilic. The estimated half-life was 1 hour in mammalian reticulocytes (in vivo), 30 minutes in yeast (in vivo) and greater than 10 hours in E. coli (in vivo).
Processing via proteosome, interaction with TAP transporters (MHC class I biosynthesis) and post-translational modifications of the vaccine
Following the multi-epitope vaccine, 85 cleavage sites were predicted at the binding site of C-terminal regions by the proteosome (Table S4). On the other hand, in the interaction with the TAP transporters, 96 peptides with high binding affinity were predicted when considering the sequence and properties of the amino acids of the multi-epitope vaccine (Table S5). When the post-translational modifications of the multi-epitope vaccine were evaluated, 7 O-glycosylation sites (position 153 with score of 0.67, position 156 with score of 0.5, position 205 with score for 0.62, position 206 with score for 0.64, position 215 with score of 0.90, position 242 with score of 0.51 and position 279 with score of 0.61) and 26 phosphorylation sites in which 15 of them can be targeted by more than one kinase (Table S6; Figure 3).
Secondary and tertiary structure, refinement and validation
The prediction of the secondary structure of the amino acid sequence of the multi-epitope vaccine by PSIPRED server indicated the presence of 7.56% of beta-leaf, 39.52% of regions without defined secondary structures, and 52.92% of alpha-helix (Figure 4). These results indicate a predominance of secondary alpha-helix structure.
On the other hand, the tertiary structure was determined using Robetta server with RoseTTAFOLD method, whose five models showed a confidence of 0.55 (Supplementary Figure 3). From then on, these models were evaluated for stereochemical quality by ERRAT, PROCHECK and ProSA-Web tools. According to the validation by ERRAT, model 4 presented the highest quality factor corresponding to 99.293%, showing that the model has a high resolution (Supplementary Figure 4; Figure 5A). To improve the quality of model 4, GalaxyRefine was used, which presented five models (Table S7), in which model 3, when evaluated by ERRAT presented a quality factor of 99.647. This value corresponds to a percentage increase in model quality of 0.354 compared to model 4 predicted by the Robetta server (Figure 5B). When model 3 was compared to model 4 before refinement, there was an increase in Clash score from 1.7 to 13.8, an increase in a favorable Rama value from 95.2 to 96.5, an increase in the MolProbity from 1.259 to 1.873, the RSMD went from 0 to 0.24, the GDT-HA went from 1 to 0.9923, and there was no bad rotamer (Table S7).
Then, model 3 obtained by refinement was evaluated by PROCHECK, VERIFY-3D and ProSA-web. The Ramachandran graph was obtained using PROCHECK, which showed 93.7% of the waste in favorable regions, 4.7% of the waste in additional permitted regions, and 0.8% of the waste in non-permitted regions (Figure 6 A). The three-dimensional structure was compatible with the amino acid sequence, since 80.76% of the residuals had a mean score of 3D-1D >=0.1 by Verify 3D server (Figure 6 B). In the graph obtained by ProSA, the z-score obtained from the structure was -6.98, which indicates that the structure is closer to the native structure of the protein (Figure 6, C-D). Figure 6 E shows the tertiary structure of model 3 obtained by refinement.
Identification of discontinuous B cell epitopes in the three-dimensional structure of multi-epitope vaccine
Identification of the regions of interaction between B lymphocyte epitopes and antibodies plays a key role for development of an immune response (Kringelum et al., 2012). In the three-dimensional structure of the multi-epitope vaccine, 6 linear epitopes were predicted, with size ranging from 41 to 4 residues and a score (PI) from 0.819 to 0.542 (Table 05). Regarding discontinuous epitopes, 6 epitopes were also predicted, with a size ranging from 62 to 6 residues and a score (PI) from 0.822 to 0.513 (Table 5). There was an overlap in part of residues of the linear epitopes with the residues of discontinuous epitopes, as in epitope 1 (204-209) and epitope 7 (207-233) (Figure 7), with high score values (PI), which indicates a presence of epitopes that make the multi-epitope vaccine more immunogenic.
Table 05: Linear epitopes of B lymphocytes of the three-dimensional structure of the multi-epitope vaccine.
Epitopes
|
Position
|
Peptides
|
Number of residues
|
Score
(PI)
|
1
|
204-249
|
QTSRPVGPGPGSPPWAGQHNRKGEKN
|
26
|
0.819
|
2
|
70-110
|
VILEAAGDKKIGVIKVVREIVSGLGLKEAKDLVDGAPKPLL
|
41
|
0.802
|
3
|
116-135
|
EAADEAKAKLEAAGATVTVK
|
20
|
0.708
|
4
|
266-291
|
ISDDKSEYAAYATSSGGSQYHHHHHH
|
26
|
0.688
|
5
|
164-183
|
NLRVERQKKKSGSCSAEDLK
|
20
|
0.63
|
6
|
1-4
|
EAAA
|
4
|
0.542
|
PI – protrusion index.
Table 06: Discontinuous epitopes of B lymphocytes of the three-dimensional structure of the multi-epitope vaccine.
Epitopes
|
Residues
|
Number of residues
|
Score
(PI)
|
7
|
A:R207, A:P208, A:V209, A:G210, A:P211, A:G212, A:P213, A:G214, A:S215, A:P216, A:P217, A:W218, A:A219, A:G220, A:Q221, A:H222, A:N223, A:R224, A:G226, A:E227, A:K228, A:N229, A:P233
|
23
|
0.822
|
8
|
A:V70, A:I71, A:L72, A:E73, A:A74, A:A75, A:G76, A:D77, A:K78, A:K79, A:I80, A:G81, A:V82, A:I83, A:K84, A:V85, A:V86, A:R87, A:E88, A:I89, A:V90, A:S91, A:G92, A:L93, A:G94, A:L95, A:K96, A:E97, A:A98, A:K99, A:D100, A:L101, A:V102, A:D103, A:G104, A:A105, A:P106, A:K107, A:P108, A:L109, A:L110, A:E111, A:A114, A:E116, A:A118, A:D119, A:E120, A:A121, A:K122, A:A123, A:K124, A:L125, A:E126, A:A127, A:A128, A:G129, A:A130, A:T131, A:V132, A:T133, A:V134, A:K135
|
62
|
0.762
|
9
|
A:N161, A:N164, A:L165, A:V167, A:E168, A:R169, A:Q170, A:K171, A:K172, A:K173, A:S174, A:G175, A:S176, A:I266, A:S267, A:D268, A:D269, A:K270, A:S271, A:E272, A:Y273, A:A274, A:A275, A:A277, A:T278, A:S280, A:G281, A:G282, A:S283, A:Q284, A:Y285, A:H286, A:H287, A:H288, A:H289, A:H291
|
36
|
0.688
|
10
|
A:C177, A:S178, A:A179, A:E180, A:D181, A:K183, A:A245, A:S246, A:N248, A:E249
|
10
|
0.618
|
11
|
A:K197, A:M200, A:D201, A:Q204, A:T205, A:S206, A:A240, A:N243, A:H244
|
9
|
0.613
|
12
|
A:E1, A:A2, A:A3, A:A4, A:K8, A:L9
|
6
|
0.513
|
PI – protrusion index
Molecular anchoring of multi-epitope vaccine with TLR-2, TLR-3, TLR-4, TLR-7 and TLR-9
Toll-like receptors (TLRs) are molecules that mediate the response in innate immunity in diverse disease conditions. There is evidence that these molecules are expressed in malignant neoplastic cells with a vital role in tumor progression (Kumar et al., 2022). The interaction of the multi-epitope vaccine with TLRs may enhance immune response. Therefore, the molecular docking interaction of the multi-epitope vaccine with TRL-2, TLR-3, TLR-4, TLR-7 and TLR-9 molecules was evaluated using ClusPro server. The binding affinity energy of TLRs best pose: TLR-2 (–691.4 Kcal/mol), TLR-3 (–806.9 Kcal/mol), TLR-4 (-705.6 Kcal/mol), TLR-7 (-1485.4 Kcal/mol) and TLR-9 (-1002.0 Kcal/mol). The PyMol software was used to visualize the complex generated by docking (Figure 8).
The identification of the molecular interactions between the vaccine and the TLRs was achieved by the PDBSum Generate server which identified a total of 23 hydrogen bonds, 8 saline bridges, and 190 non-contact interactions for TLR-2 in the C-chain, and 2 hydrogen bonds, 1 saline bridge, and 20 non-contact interactions for the TLR-2 D-chain (Figure 9A). For TLR-3 in the B chain, there are 13 hydrogen bonds, 12 saline bonds, and 217 non-contact interactions (Figure 9B). The presence of salt bridges and non-contact interactions in the interaction of the vaccine with TLRs are essential for the maintenance of protein structure and stability.
In addition, molecular interactions between the vaccine and TLR-4, TLR-7 and TLR-9 were identified. In TLR-4, 5 hydrogen bonds, 1 salt bridge and 52 non-contact interactions in the F chain were identified, and the presence of 5 hydrogen bonds and 68 non-contact interactions in the H-chain (Figure 9C). In TLR-7, 2 hydrogen bonds and 7 non-contact interactions in the I-chain, 1 hydrogen bond and 32 non-contact interactions in the L-chain, and 5 hydrogen bonds, 6 saline bridges and 177 non-contact interactions in the M-chain were identified (Figure 9D). In TLR-9, 10 hydrogen bonds, 1 saline bridge and 129 non-contact interactions in the N chain were identified (Figure 9E).
Evaluation of the stability in multi-epitope vaccine by molecular dynamics
Molecular dynamics is fundamental to assess the stability of molecules in a biological system. During the molecular dynamics simulations, structural variations of the multi-epitope vaccine, when interacted with Toll-like receptors (TLR2 and TLR3), were evaluated using the metrics of RMSD (root square of mean square deviation), RMSF (square root of quadratic fluctuation), Rg (radius of rotation) and SASA (surface area accessible to solvent).
The RSMD is the value used to calculate deviation in the protein backbone (Cα, C, and N) during the 100 ns. When the multi-epitope vaccine interacted with the TLRs, it showed an increase in RSMD at the beginning of the simulation, when an increase in temperature occurred, and then stabilized, indicating the equilibrium of the system. The multi-epitope vaccine in the interaction with TLR2 presented an RMSD with a mean value of approximately 0.86 nm (approximate standard deviation = 0.20 nm), while with TLR3 the approximate mean value was 0.46 nm (approximate standard deviation = 0.09 nm). After 20 ns of simulation, the multi-epitope vaccine tended to be stable in relation to both TLR2 (approximate mean value = 0.83 nm and approximate standard deviation = 0.17 nm) and TLR3 (approximate mean value = 0.47 nm and approximate standard deviation = 0.07 nm) (Figure 10 A-B). These results indicate that the multi-epitope vaccine has a higher stability after 20 ns, especially when it interacted with TLR3, as it has the lowest mean RSMD value.
With RSMF of alpha carbon (Cα) atoms it is possible to determine which amino acids of the multi-epitope vaccine are highly flexible over the simulation time. The higher RMSF values are usually associated with loops, and the lower ones with helices. In the graph of multi-epitope vaccine interaction with TLR2, residues 96 (0.7146 nm), 213 (0.6714 nm), 214 (0.6672 nm), 218 (0.6206 nm) and 287 (0.6206 nm) showed the highest RSMF values, which correspond in most of these residues to regions where loops predominate, which explains the flexibility of the molecule in these regions (Figure 10C). On the other hand, in relation to the multi-epitope vaccine with TLR3, the residues 279 (0.7442 nm), 281 (0.7042 nm), 283 (0.6814 nm), 284 (0.7506 nm) and 291 (0.6907 nm) located closer to the C-terminal portion presented the highest RSMF values and correspond as in TLR2 to amino acids located in the most flexible regions of the multi-epitope vaccine (Figure 10D). In both the interaction with TLR2 and TLR3, the multi-epitope vaccine presents flexible regions during simulation.
On the other hand, the Radius of Gyration (Rg) is related to the degree of folding of the multi-epitope vaccine throughout the simulation. The approximate mean value of the total turning radius of the multi-epitope vaccine in relation to the interaction with TLR2 was 2.67 nm (approximate standard deviation of 0.05 nm) and from 20000 ps of 2.66 nm (approximate standard deviation of 0.05 nm) (Figure 10E). Regarding the interaction with TLR3, the approximate mean value was 3.28 nm (approximate standard deviation 0.05 nm) and from 20000 ps onwards 3.27 nm (approximate standard deviation 0.04 nm) (Figure 10F). These data reveal that the multi-epitope vaccine, when interacting with TLR2, presents a higher degree of folding during the simulation, since its total Rg value was lower than the value of the interaction with TLR3.
The Solvent Accessible Surface Area (SASA) is a parameter that determines the surface of the multi-epitope vaccine exposed to the solvent during the simulation. SASA in the multi-epitope vaccine in relation to interaction with TLR2 presented an approximate mean value of 170.02 nm2 (approximate standard deviation 4.78 nm2) and with TLR3, 294.66 nm2 (approximate standard deviation 3.18 nm2) (Figure 10G-H). These results indicate that the multi-epitope vaccine, when it interacts with TLR3, has a more stable behavior in the environment with the solvent, since it presents an approximate mean SASA value higher than that found for TLR2.
The parameters evaluated in molecular dynamics showed that the interactions of the multi-epitope vaccine with TLR3 are more stable than with TLR2 during the simulation, except for the total radius of gyration, which was larger in relation to TLR3, indicating a lower packing. It is likely that the nature of interactions established between the multi-epitope vaccine and TLR3 and the more flexible regions of the multi-epitope vaccine have an influence on this packaging.
Codon adaptation and in silico cloning of the multi-epitope vaccine
Adaptation of codons is usually necessary when there is an interest in increasing the efficiency of the target gene. With the JAVA Codon Adaptation server, the amino acid sequence of the multi-epitope vaccine converted into nucleotide sequence was optimized in relation to E. coli (K12 strain) and obtained with an approximate value of 50.73% of CG content contained in the optimal range of 30-70%. In addition, the codon adaptation index (CAI) was 1.0, which corresponds to an ideal value for efficient protein expression. Both parameters reinforce that the designed vaccine has high transcriptional and translational efficiency. After the optimization of the multi-epitope vaccine, cloning was performed in the linearized Allele TA vector of 2707bp using the SnapGene software. The final length of the cloned vector was 3581bp, which corresponds to 2707bp of the vector plus the insert (amplified from the multi-epitope vaccine) of 873bp (Figure 11A). In order to visualize the distinction of the insert, vector and cloned vector when observing the 1% agarose gel of the electrophoresis simulation (Figure 11B-C) it was verified that the size of the cloned vector is lower than what was expected when in TBE buffer, but in low ionic strength (SB) sodium boric acid buffer it presents high resolution.
Simulation of the immune response after administration of the multi-epitope vaccine
In the simulation of the immune response triggered by the single administration and three doses of the multi-epitope vaccine, there was a lasting and more robust immune response in repeated exposure to the antigen. Considering that innate immunity is the primary step of the immune response involved in the presentation of the antigen and activation of T-cells. The multi-epitope vaccine activated and stimulated natural killer (NK) cells and macrophages in both vaccination schedules (Figure 12A, B, C and D). An increase in the level of cytokines was also observed in both vaccination schedules (Figure 12E and F). In the three administrations, there was an increase in cytokine interferon gamma (IFN-γ) with a tendency to decrease in the following exposures and a gradual increase in IL-2 (Figure 12F).
In addition, after both vaccination schedules, there was a decrease in activated macrophages, which indicates the probable action of anti-inflammatory cytokines such as IL-10 and TGF-β to control the immune response and prevent the development of an exacerbated inflammatory response (Figure 12C, D, E and F).
Regarding T-cell populations, the multi-epitope vaccine increased the Th1 subpopulation in both vaccine schedules (Figure 13A, B, C and D). Successive exposure to antigens kept the amount of Th1 cells elevated for longer. Th1 lymphocytes are induced by cytotoxic T lymphocytes that proliferate and increase cytotoxic capacity. This event can be observed by the increased activation of the cytotoxic T lymphocyte population and a decrease in resting cells with the immune response induced by the multi-epitope vaccine in both vaccine schedules (Figure 13E, F, G and H). However, in three administrations the T lymphocyte population remained activated longer compared to one administration (Figure 13H).
In the immune response mediated by the B lymphocyte population, there was an increase in memory cells in both vaccine schedules, with differentiation and production of IgM and IgG, and with a decrease in naïve B lymphocytes (Figure 14A, B, C and D). In addition, a gradual increase in the production of IgM+IgG, IgM, IgG1 and IgG1+IgG2 was identified with the administration of the injections at the pre-established intervals of the multi-epitope vaccine (Figure 14E and F). During the successive exposure of the multi-epitope vaccine, there was a more robust immune response. These results demonstrate that the multi-epitope vaccine with successive vaccine exposure has the potential to cause an effective and long-term immune response.