Expression, purification and immunization of E6 and E7 proteins
To obtain rabbit sera against respective immunogens for decoding IgG-epitomes of E6 and E7 proteins from HPV16/18/45, expression of all target proteins was carried out employing E. coli expression system. Since E6 and E7 proteins of HPV16 and 45 failed to be solely expressed in E. coli with many various prokaryotic expression plasmids, they were finally expressed in the form of fusion protein in E. coli (Fig. 1A and 1C), respectively, using the synthesized fusion genes encoding E6E7 proteins of HPV16 and 45, which were inserted into pET28a plasmid. The expression of HPV18-E6 and -E7 proteins was achieved using pBV221 and pRSET-A plasmids, respectively (Fig. 1B). All r-E6E7 fusion proteins of HPV16/45, r-E6 and -E7 proteins of HPV18 were purified with the PAGE-preparation method, and identified with mAb to His6-tag or mAb C1P5, respectively (Fig. 1A-C). After obtaining four r-proteins with the purity higher than 95%, they were used as immunogens to immunize rabbits, which were emulsified in complete Freund's and incomplete Freund's adjuvants. The collected anti-sera were used to determine the Ab titers by ELISA and the results showed that all r-proteins elicited high levels of antigen-specific Ab titers (Supplementary Fig. 1).
E6 and E7 epitomes of HPV16
HPV16 is the first to be found that it is associated with invasive cervical cancer [39, 40] and the most prevalent HPV type in the world. HPV E6 and E7 oncoproteins can regulate the cell cycle through reacting with p53 and pRB to contribute the progression of invasive cervical cancer [9-12]. However, only limited information were available about the humoral immune responses to E6 and E7 proteins, in particular linear BCEs, although for them there were several studies on BCE mapping with sera from cervical cancer patients with HPV16-positive, murine pAbs and mAbs [24-27]. Thus, it is still necessary to delineate all BCEs and type-specific BCEs in HPV16-E6 and E7.
To reveal non-conformational IgG-epitome of HPV16-E6 with 158 aa, in the first round of antigenic peptide mapping, 19 of 16mer-peptides (a1 to a19, a19 is 14mer) with an overlap of 8 aa covering the full-length sequence of E6 were expressed as r-protein fused with GST188 carrier in E. coli using the pXXGST-3 plasmid. For the second round of fine BCE identification, 75 octapeptides (a20 to a94) with an overlap of 7 aa corresponding all reactive 16/14mer-peptides shown in Western blotting were expressed in E. coli. In epitome mapping of HPV16-E7 protein, 12 of overlapping 16mer-peptides (b1 to b12, b12 is 10mer) and 56 overlapping 8mer-peptides (b13-b68) corresponding all reactive 16mer-peptides were employed.
As shown in Fig. 2A, Western blotting with the mixed rabbit sera to r-E6E7 revealed 10 antigenic 16/14mer-peptides of E6. Further, using 8 sets of octapeptides corresponding 10 antigenic 16/14mer-peptides, 8 fine BCE motifs (12PQER15, 35LECVYCK41, 47REVYDF52, 61YRDGNPY67, 90LYGTTLEQ97, 109RCINC113, 131RFHNIRG137 and 155ETQL158) were identified according to respective common sequences within continue reactive octapeptides (Fig. 2B and Supplementary Table 1), which were named as 16/E6-1 to 16/E6-8. The 16/E6-8 motif in reactive a89 located at the end of a19 were identified using 5 extra octapeptides (a90–a94), wherein the respective aa residue from N-terminal of a89 was sequentially substituted with alanine (A). Similarly, 7 antigenic 16/10mer-peptides of E7 were identified (Fig. 2C), 7 BCE motifs (7TLHEYML13, 12MLDLQ16, 21DLYC24, 30DSSEE34, 36DEIDGP41, 42AGQAEP47 and 95SQKP98) were precisely mapped employing 5 sets of octapeptides according to common sequences within reactive octapeptides (Fig. 2D and Supplementary Table 2), which were named as 16/E7-1 to 16/E7-7. The 16/E7-7 motif in reactive b63 located at the end of b12 were determined using 5 extra octapeptides (b64–b68), wherein the respective aa residue from N-terminal of b63 was sequentially substituted with A.
The position of 15 delineated BCEs on two predicted three-dimensional (3D) structures of HPV16-E6 and E7 proteins has been shown in Fig. 3A and 3D, wherein 3D structures of E6 and E7 are predicted according to the previously described method [41]. Of all mapped BCEs on these two proteins, 2 BCEs (16/E6-5 and 16/E6-6) on E6 and 1 BCE (16/E7-7) on E7 were located in predicted α-helical domains, respectively. The linear schematic diagrams of decoded E6 and E7-epitomes in the predicated 3D structures of E6 and E7 showed two obvious features: i) the distribution pattern of eight BCEs on E6 presenting disperse state; and ii) except for a BCE 16/E7-7 at the C-terminal of E7, the rest of 6 BCEs being situated in the first half of the protein, like those (7/8) of the HPV58-E7 epitome [30].
E6 and E7 epitomes of HPV18
HPV 18 is associated predominantly with adenocarcinomas and adenosquamous carcinomas compared with squamous cell carcinomas [42, 43] and is the second most common type in invasive cervical cancer worldwide and in China [5, 42]. It is essential to reveal all BCEs on HPV18-E6 and E7, in particular the type-specific BCEs, for developing specifically diagnostic reagent of HPV18. Although there have been studies on epitope mapping of HPV18-E6 and E7 using different methods including phage display library and/or peptide pin screening and sera from cervical cancer patients with HPV18-positive, rabbit pAbs and murine mAbs [27-29], the so-called BCEs revealed are longer antigenic peptides, and information about their fine and type-specific BCEs is still lacking.
For epitome mapping of HPV18-E6, 19 of overlapping 16mer-peptides (c1 to c19, c19 is 14mer) covering the full-length sequence of HPV18-E6 with 158 aa were used in the first round of BCE mapping. In the second round of fine BCE identification, 7 set of 57 octapeptides (c20 to c76) corresponding 8 mapped antigenic 16/14mer-peptides were used. As shown in Fig. 4A, 8 reactive peptides within c1-c19, and 6 fine BCEs (6DPTRR10, 76RELRHY81, 86YGDTLE91, 113NPAEKLRHL121, 133HYRGQ137 and 154RETQV158) were identified according to their common sequences within consecutively reactive octapeptides (Fig. 4B-4D and Supplementary Table 3), which were named as 18/E6-1 to 18/E6-6. Because of failing to find any reactive band within a set of overlapping octapeptides (c45-c53) corresponding reactive peptide c15, the nonapeptide BCE of 18/E6-4 was identified by using another group of 10mer-peptides (c54-c61). The 18/E6-6 motif in reactive c71 located at the end of Pc19 was identified using five extra octapeptides (c72-c76), wherein the respective aa residue from N-terminal of c71 was sequentially substituted with A.
In epitome mapping of HPV18-E7, 12 of overlapping 16mer-peptides (d1 to d12, d12 is 17mer) spanning the full-length sequence of HPV18-E7 with 105 aa were used in the first round of BCE mapping, and 6 sets of 63 octapeptides (d13 to d75) were used in the second round of fine BCE identification. Finally, 6 fine BCEs (10DIVL13, 20EIPVDLL26, 30QLSDSE35, 38NDEID42, 46HQHL49, and 101CASQQ105) were identified by Western blotting with rabbit pAbs to HPV18-E7 according to consecutively reactive bands (Fig. 4F) and their common sequence within respective reactive octapeptides (Supplementary Table 4), which were named as 18/E7-1 to 18/E7-6. Of them, the 18/E7-6 motif in reactive d71 located at the end of d12 was identified with four extra octapeptides (d72-d75), wherein the respective aa residue from N-terminal of d71 was sequentially substituted with A.
The position of 12 mapped BCEs on two predicted 3D structures of HPV18-E6 and E7 proteins has been shown in Fig. 3B and 3E. Of mapped BCEs, except for 2 BCEs (18/E6-1 and 18/E6-6), 4 BCEs on E6 and 1 BCE (18/E7-6) on E7 were shown to be located in respective α-helical domains. The linear schematic diagram of mapped BCEs on HPV18-E6 showed that most (5/6) of mapped BCEs are clustered at the C-terminal except for 18/E6-1 at the N-terminal (Fig. 3B), which is different with HPV16-E6 (Fig. 3A) but similar with HPV58-E6 [30]. Interestingly, like the epitomes of HPV16/58-E7 proteins, the majority (5/6) of mapped BCEs on HPV18-E7 are clustered at the N-terminal except for 18/E7-6 at the C-terminal (Fig. 3E).
E6 and E7 epitomes of HPV45
The HPV45 has been determined as the third most prevalent hr-HPV type involved in invasive cervical carcinoma, and found in 4-9% of cervical cancers worldwide [3, 5], which is closely related to HPV18 and belongs to HPV species 7 in the genus alpha-papillomavirus [44]. Unlike HPV16 and 18 that have been studied intensively, little is known about the biological properties of HPV45, but it has been reported that HPV45-E7 protein can be expressed in cervical cancer biopsies [45, 46], and is a transforming protein that binds to pRB and induce pRB-degradation and anchorage-independent cell cycle progression [47]. Furthermore, to our knowledge, the study on BCE mapping of HPV45-E6 and E7 proteins has not yet been carried out up to now.
For epitome mapping of HPV45-E6 and E7 proteins, 19 and 12 of overlapping 16mer-peptides (e1 to e19 for E6 and e19 is 14mer; f1 to f12 for E7, f11 and f12 are 17mer) representing full-length sequences of E6 with 158 aa and E7 with 106 aa were used to map antigenic peptides by rabbit pAbs against HPV45-E6E7 r-protein. After identification of 5 and 6 reactive peptides on E6 and E7 respectively (Fig. 5A and 5C), 6 sets of 29 octapeptides (e20-e48) for E6 and 47 octapeptides (f13 to f59) for E7 were used to reveal each fine BCE motif, respectively.
As shown in Fig. 5B, the consecutively reactive bands among e21-e25, e34-e37 and e43-e47 and a single reactive band of e28 in 3 sets of octapeptides were observed, and thus 4 fine BCEs (6DPKQ9, 9QRPYKLPD16, 80YYSNS84 and 113NPAE116) were identified according to each common sequence within reactive octapeptides (Supplementary Table 5), which were named as 45/E6-1 to 45/E6-4. The number of mapped BCEs in HPV45-E6 epitome is less than those (8 and 6) in HPV16 and 18-E6 epitomes, but it is identical with that in HPV58-E6 epitome [30]. For HPV45-E7 epitome mapping, 4 fine BCEs (18QNELD22, 39EEEN42, 47HAQL50 and 50LPARRAEP57) were identified according to respective common sequences within reactive octapeptides in f13-f27, f28-f43 and f44-f59 (Fig. 5D, Supplementary Table 6), which were designed as 45/E7-1 to 45/E7-4.
The position of delineated reactive 8 BCEs on two predicted 3D structures of HPV45-E6 and E7 proteins are shown in Fig. 3C and 3F. Of mapped 8 BCEs on E6 and E7, only one (45/E6-3) was located in a β-sheet domain (Fig. 3C). Their linear schematic diagrams of mapped BCEs in E6 and E7-epitiomes has been shown in the predicated 3D structures of E6 and E7. There is no distinct BCEs distribution property for E6 (Fig. 3C), and the epitome of HPV45-E7 showed the same feature of BCE distribution pattern with HPV16/18/58-E7, in which all four mapped BCEs were clustered at the N-terminal (Fig. 3F).
Specificity of mapped BCEs among hr-HPV homologous proteins
Since it has known that early E6 and E7 oncoproteins are consistently expressed in the HPV life cycle [8, 9] and they are associated with cervical cancer during hr-HPV infection [10-12], the goal of most E6/E7 epitope mapping studies mainly was to identify the antibody-reactive peptides using sera from cervical cancer patients with HPV16/18-positive and rabbit/murine pAbs [23, 27, 29]. Obviously, it is imperative to map all type-specific BCEs on E6 and E7 of each hr-HPV among homologous proteins for the development of highly specific and sensitive detection reagents.
Based on sequence alignment method, eight BCEs marked by an asterisk were defined as completely type-specific since the same and similar sequences with an aa mutation were not found in known and possible hr-HPVs, also including known lr-HPV types such as HPV6, HPV11, HPV40, HPV42, HPV43 and HPV44, which are 16/E7-1, 16/E7-2, 16/E7-7, 18/E6-5, 18/E7-2, 18/E7-6, 45/E7-1 and 45/E7-3 (Table 1). In addition, the other 9 BCEs marked by two asterisks could also be considered to be completely or highly type-specific, which are 16/E6-3, 16/E6-5, 16/E6-8, 18/E6-1, 18/E6-4, 18/E7-3, 18/E7-4, 45/E6-1, and 45/E6-4, as their similar peptides present only in another HPV type, although the cross-reactivity of each similar peptide with respective rabbit pAbs remains to be determined. In short, 8 completely and 9 possible completely type-specific BCEs would lay a foundation for the development of BCE-based diagnostic reagents and/or chips targeting HPV16, 18 and 45 in the future.
Delimitation of HR-BCE motifs within known HR-antigenic peptides
Based on the similarities of Ab responses to HPV18-E6 and E7 between rabbit and human [29], we investigate the cross-reactivity of mapped RR-BCEs with human sera to HPV16/18 by comparing sequences of mapped RR-BCEs with HR-peptides. Specifically, the RR-BCEs recognized by human serum can be determined according to whether they are present within corresponding HR-peptides identified by sera from patients with HPV16 and 18-positive. The results showed that 9 in 27 mapped RR-BCEs on E6 and E7 of HPV16/18 were determined to be recognized by human sera to HPV16/18 based on their presence in 8 known HR-peptides. Of them, 5 RR-BCEs including 16E6-5, 16E7-1, -2, -3, and -4 are present in 3 HR-peptides (E6:6, E7:1 and E7-DLYCYEQLNDSSEE of HPV16) [23, 27] as well as 4 RR-BCEs including 18E6-1, -2, -3 and 18E7-1 are present in 5 HR-peptides (pepE6/1, pepE6/2, pepE6/4, pepE7/1 and pepE701) [29]. Additionally, there was none RR-BCE of HPV16-E7 in the HR-E7:4 peptide [23], and two HR-BCEs (DEIDGVNH and LPARRA) mapped in the pepE7/1 and pepE701 peptides [29] did not exist in the decoded HPV18-E7 epitome, suggesting the differences in Ab responses to HPV16/18-E7 between both species.
Due to the similarities between 9 mapped RR-BCEs and eight known HR-peptides, we further delimitated HR-BCE motifs within HR-peptides based on each RR-BCE motif. As a result, three HR-BCE motifs named h-16/E6-1, h-16/E7-1 and h-16/E7-2 were delimitated as LYGTTLE, DLYC and DSSEE, of which the first HR-BCE motif in HR-peptide E6:6 YSKISEYRHYCYSLYGTTLE [23] is less a residue than RR-BCE 16/E6-5 motif, and the latter 2 HR-BCE motifs located at both ends of HR-peptide DLYCYEQLNDSSEE [27] are identical to RR-BCEs 16/E7-3 and 16/E7-4. Additionally, 2 HR-BCEs (TLHEYML and MLDLQ) in the middle of E7:1-HGDTPTLHEYMLDLQPETTD [23] are identical to BCEs 16E7-1 and 16/E7-2, but it could not be determined whether one or 2 HR-BCEs exist in HR-peptide E7:1 because the potential differences of antibody responses among rabbit and humans, and an overlap of 2 residues between these 2 RR-BCEs.
Taking the same method and based on four HR-peptides of HPV18-E6 and E7 proteins [29], 3 fine HR-BCEs named h-18/E6-1 to -3 were delimitated as DPTRR, RELRHY and RETQV motifs based on their common sequences in HR-MARFFDPTRRPYKLPDL (pepE6/1) and RR-BCE 18/E6-1, FYSRIRELRHYSDSVYGD (pepE6/2) and RR-BCE 18/E6-2, as well as NRARQEPLQRRRETQV (pepE6/4) and RR-BCE 18/E6-6, respectively. Another HR-BCE HQHL was shared in HR-IDGVNHQHLPARRAEPQR (pepE7/1) and RR-BCE 18/E7-4, although 1 HR-BCE (LPARRA) in this HR-peptide was mapped by human sera against HPV18 virus [29].