PEDV detection and phylogenetic analysis of the S gene
Of all the 362 feces and small intestine samples being tested, 160 samples were PEDV positive (44.20%) from 2015 to 2018, and the positive prevalence was 44.83%(26 positive samples and 58 test samples), 66.67%(46 positive samples and 69 test samples), 42.86%(27 positive samples and 63 test samples),35.47%(61 positive samples and 172 test samples) in 2015, 2016, 2017 and 2018, respectively. The positive prevalence of 2016 was the highest, and that of 2018 was the lowest. Sequence alignment showed that all these strains shared 92.9–100% homology for nucleotides and 91–100% identity for amino acids. Compared to the reference strain CV777, these strains shared 93.1–96.8% nucleotide homology and 91.5–96.8% amino acid identity. These strains shared 93.8–99% nucleotide homology and 92.5–98.9% amino acid identity with Chinese PEDV strains.
Sixty-two S genes of the tested strains and the representative strains that were published in GenBank were analyzed by a phylogenetic tree. As shown in Fig. 1, the phylogenetic analysis showed that these strains could be divided into two groups, namely, G1 and G2. G1 included the classical strains (CV777 and SM98) and some isolates from China, USA and Japan after 2010. Thus, G1 was further divided into three subgroups: G1-a, G1-b and G1-c. G1-a and G1-b were classical S-INDEL strains, and G1-c was a new S-INDEL strain. G2 was a non-S-INDEL strain and was also divided into two subgroups, G2-a and G2-b, which consisted of a number of severely virulent strains from all over the world since 2010. The strains in our study belonged to G1-b, G1-c, G2-a and G2-b. One strain, GDjm18-2, was categorized as subtype G1-b, which included classical vaccine strains CV777/attenuated and JS2008. One strain GDjm17-1 belonged to the G1-c cluster. The other strains that were identified in our study formed eight clusters. In these strains, 25 isolates from Guangdong, 3 isolates from Fujian and 1 isolate from Jiangxi formed three clusters and belonged to G2-b, having high similarity with GD-A and CH-GXNN-2012. The other 34 isolates formed five clusters and belonged to G2-a. In the 34 strains, JXyc15 has a close relationship with the C4 cluster (North American strains). The other strains had a closer identity with CH-ZMDZ-11, CH-HNAY-2015 and CH-HNCDE-2016L. As shown in Table 4, all the isolated strains from 2015 belonged to G2-a (100%). In 2016 and 2017, there were 46.15% and 43.75% isolated strains belonging to G2-a, respectively. Compared with G2-a, the rate slightly increased, and there were 53.84% and 50% isolated strains belonging to G2-b, respectively. However, in 2018, there were 72.22% isolated strains that belonged to G2-b, which was much higher than that of G2-a in 2017 (22.22%).
Table 4
The PEDV positive prevalence of different groups of tested strains in our study.
Group | 2015 | 2016 | 2017 | 2018 |
G1-b | 0 | 0 | 0 | 5.56% |
G1-c | 0 | 0 | 6.25% | 0 |
G2-a | 100% | 46.15% | 43.75% | 22.22% |
G2-b | 0 | 53.84% | 50% | 72.22% |
Amino acid sequence analysis of the neutralizing epitope in the S protein
To analyze the genetic characteristics of the South China PEDV strains, the deduced amino acids of the S protein isolated in our study were aligned and compared with that of the representative PEDV strains, including strains from G1-a (CV777 and DR13 virulent), G1-b (CV777-attenuated), G1-c (OH851 and CH-ZWZBa-01-2015), G2-a (CH-HNQX-3-14, CH-HNAY-2015, CH-ZMDZY-11) and G2-b (CH-GXNN-2012, CD-A). As shown in Fig. 2, compared to the CV777 strain, the G1-b strain GDjm18-2 had three amino acid (AA) substitutions in the COE domain, and 1 AA substitution in the epitope SS6. The G2-a strains had AA substitutions at thirty-five positions in the COE domain, at two positions in the epitope SS2 and at five positions in the epitope SS6. In these positions, many new AA substitutions have been found in the COE regions of the G2-a strains, including 502 (S/P), 507 (P/M), 510 (N/S), 516 (N/D), 522 (S/A), 527 (S/G), 533 (A/V), 535 (D/E), 547 (D/E), 559 (V/I or A), 562 (S/D), 567 (S/A), 568 (K/T or N), 570 (Q/H), 571 (D/N or Y), 575 (P/L), 580 (S/A), 588 (S/G), 594 (T/R or C), 608 (Y/H), 613 (S/I or G), 614 (G/V), 626 (K/E or S), and 637 (L/ F or S). 2C10 was conserved in all G2-a strains. In the G2-a strains, GDhz16 had four continuous AA mutations in the epitope SS6, which was different from that of the other strains and the reference strains. Compared to the CV777 strain, except for having the AA substitution at one position in three epitopes (SS2, SS6 and 2C10), the G2-b strains had AA substitutions at seventeen positions in the COE domain. In addition to the common AA mutations that were similar to the reference strains in G2-b, there were novel AA substitutions at eight positions in the COE regions, including 504 (V/L), 510 (N/D), 535 (D/H), 542 (S/H), 567 (S/Y), 614 (G/V), 626 (K/T), and 637 (L/V).
Mutated amino acid number analysis of different domains in the S protein
To further analyze the AA mutations in different domains of the S protein in the isolates, the different domains of the S protein were aligned with CV777, and the average number of AA mutations each year were computed. The S protein could be divided into an S1 protein and an S2 protein; the S1 protein contained four domains: SP (1–18), S1-NTD (19–233), COE and RBD (501–629). The S2 protein included five domains: SS6 (764–771), HR1 (978–1117), HR2 (1274–1313), TM (1324–1346) and 2C10 (1368–1374). The previous data indicated that 2C10 was conserved, so we did not analyze the 2C10 domain. As shown in Fig. 3, compared with the S2 protein sequence in these strains, S1 had more AA mutations. From 2015 to 2018, the mutated AA numbers of S1 maintained a high level, but that of S2 decreased. Furthermore, the mutated AA numbers of SP (1–18) and S1-NTD (19–233) slightly increased. However, the mutated AA numbers of the COE and RBD domain decreased. SS6, HR1, HR2 and TM in the S2 protein did not obviously change from 2015 to 2018.