Population genetic structure analysis verified PDW pigs belong to the ecotype of ECN pigs
We conducted variants detection in 335 pigs, including 320 individuals from the public database (Table S1) and 15 PDW pigs, notably detected 30.98 million SNPs, were used for subsequent analysis. To examine phylogenetic relationships of the 335 pigs, an individual-based NJ-tree analysis was conducted using the SNPs set (Fig. 1A). The 335 pigs were subdivided into 3 main branches, representing Chinese pigs, EUR pigs and OUT respectively. Individuals from different populations formed different groups, among which, PDW pigs were close to the Chinese pigs, while KLR and DLR were in similar Landrace (LR) pig strain. Herein, Chinese indigenous pigs were divided into four ecotypes, i.e., (i) ECN including EHL, JH, MS, and WNB pigs; (ii) SCN including BMX, LUC and WZS pigs; (iii) SWCN including BS, NJ, RC, GST, YNT, SCT and TT pigs; and (iv) NCN including LWH, HT, BAM and MIN pigs, which was consistent with previous findings [5].
Table 1
Genetic diversity of 27 pig populations and out group.
Breed
|
Abbrev1
|
Ecotype2
|
Ho3
|
He4
|
Pn5
|
r20.3(kb)6
|
F7
|
π8
|
Chinese Wild boar
|
CWB
|
CWB
|
0.31
|
0.35
|
0.58
|
1.76
|
0.25
|
1.96e-03
|
Pudong White
|
PDW
|
ECN
|
0.38
|
0.36
|
0.46
|
11.10
|
0.41
|
1.74e-03
|
Erhualian
|
EHL
|
ECN
|
0.36
|
0.34
|
0.51
|
2.51
|
0.26
|
1.70e-03
|
Meishan
|
MS
|
ECN
|
0.35
|
0.35
|
0.41
|
2.56
|
0.33
|
1.57e-03
|
Jinhua
|
JH
|
ECN
|
0.37
|
0.34
|
0.49
|
12.22
|
0.38
|
1.74e-03
|
Wannan Black
|
WNB
|
ECN
|
0.27
|
0.32
|
0.37
|
1.51
|
0.42
|
1.14e-03
|
Bamei
|
BAM
|
NCN
|
0.36
|
0.34
|
0.65
|
6.04
|
0.21
|
2.31e-03
|
Hetao
|
HT
|
NCN
|
0.39
|
0.35
|
0.69
|
4.61
|
0.10
|
2.48e-03
|
Laiwu Black
|
LWH
|
NCN
|
0.36
|
0.35
|
0.60
|
4.83
|
0.27
|
2.21e-03
|
Min
|
MIN
|
NCN
|
0.38
|
0.36
|
0.62
|
4.98
|
0.23
|
2.26e-03
|
Bamaxiang
|
BMX
|
SCN
|
0.38
|
0.35
|
0.64
|
3.22
|
0.21
|
2.26e-03
|
Luchuan
|
LUC
|
SCN
|
0.39
|
0.35
|
0.52
|
5.11
|
0.33
|
1.93e-03
|
Wuzhishan
|
WZS
|
SCN
|
0.37
|
0.34
|
0.71
|
2.46
|
0.14
|
2.49e-03
|
Neijiang
|
NJ
|
SWCN
|
0.37
|
0.35
|
0.57
|
4.57
|
0.29
|
2.09e-03
|
Rongchang
|
RC
|
SWCN
|
0.30
|
0.35
|
0.53
|
1.76
|
0.39
|
1.91e-03
|
Baoshan
|
BS
|
SWCN
|
0.39
|
0.34
|
0.69
|
2.71
|
0.10
|
2.41e-03
|
Sichuan Tibetan
|
SCT
|
SWCN
|
0.34
|
0.36
|
0.64
|
3.05
|
0.24
|
2.29e-03
|
Tibetan
|
TT
|
SWCN
|
0.33
|
0.34
|
0.75
|
2.43
|
0.14
|
2.50e-03
|
Gansu Tibetan
|
GST
|
SWCN
|
0.34
|
0.33
|
0.70
|
2.23
|
0.21
|
2.41e-03
|
Yunnan Tibetan
|
YNT
|
SWCN
|
0.37
|
0.36
|
0.69
|
2.51
|
0.12
|
2.41e-03
|
French Large White
|
LW
|
EUR
|
0.33
|
0.33
|
0.38
|
4.72
|
0.45
|
1.46e-03
|
English Large White
|
LW
|
EUR
|
0.24
|
0.31
|
0.13
|
1.94
|
0.60
|
4.89e-04
|
Danish Landrace
|
LR
|
EUR
|
0.32
|
0.32
|
0.39
|
3.36
|
0.48
|
1.45e-03
|
Korean Landrace
|
LR
|
EUR
|
0.30
|
0.34
|
0.35
|
8.84
|
0.53
|
1.24e-03
|
Duroc
|
DU
|
EUR
|
0.28
|
0.30
|
0.37
|
2.32
|
0.56
|
1.26e-03
|
Pietrain
|
PI
|
EUR
|
0.28
|
0.30
|
0.18
|
2.31
|
0.51
|
6.55e-04
|
European Wild boar
|
EWB
|
EWB
|
0.25
|
0.35
|
0.16
|
2.96
|
0.73
|
6.82e-04
|
Out group
|
OUT
|
OUT
|
0.06
|
0.35
|
0.23
|
2.38
|
0.92
|
9.49e-04
|
1 Abbreviations of breeds. 2 ECN, East China; SCN, South China, SWCN, Southwest China, NCN, North China. EUR, European |
3 Observed heterozygosity. 4 Expected heterozygosity. |
5 The proportion of polymorphic markers. 6 Linkage disequilibrium values of r2 = 0.3. 7 Inbreeding coefficient. |
8 Average nucleotide polymorphism in 10-kb windows. |
The OUT was eliminated and PCA analysis was performed where the result revealed that the first principal component (PC1) divided the remaining 27 groups into Chinese and European pig populations, the PC2 divided Chinese pigs into multiple groups, and PDW pigs were still close to the Chinese indigenous pigs (Fig. 1B). Then, an NJ-tree between populations was constructed based on the Fst (Fig. 1C). PDW and ECN pigs were on the same branch which indicated that genetic differentiation between PDW and ECN pigs was smaller than other pigs.
PDW was affected by the lineage of European pigs
ADMIXTURE analysis was performed using the remaining 27 groups after removing OUT to further observe the lineage composition of PDW pigs. The result demonstrated that when the assumed ancestor number K was 2–4, PDW pigs primarily comprised the lineage of ECN pigs but contained a certain proportion of EUR pigs’ lineage with the average proportions of 29.52%, 16.93%, and 13.5% respectively, indicating that PDW pigs belong to the ECN ecotype in genetic relationship but influenced by EUR pigs at the same time (Fig. 1D). However, when the K was more than 5, an independent lineage was formed in PDW pigs. Additionally, the NCN pigs contained an apparent lineage of EUR pigs, this was consistent with the results of previous studies [5, 28]. We conducted Treemix analysis with OUT as root and the result revealed that Chinese indigenous pigs were divided into ECN, SCN, and SWCN pig groups. However, NCN and PDW pigs were located between Chinese and EUR pigs due to the effect by lineage of EUR pigs (Fig. S1). When the migration event was assumed to be 8, an apparent gene flow (migration weights = 0.23) between PDW and LW pigs was observed (Fig. S2). In line with other findings, the results above suggest that PDW pigs belong to the ecotype of ECN pigs but are affected by EUR pigs.
To further observe the genetic diversity of PDW pigs in the context of global pig breeds, we estimated the expected heterozygosity (He), observed heterozygosity (Ho), SNPs polymorphism ratio (Pn), LD (r2 = 0.3) fragment length, inbreeding coefficient, and Run of homozygosity (ROH) of each pig group (Table 1, Table S2). As a result, PDW pigs had similar heterozygosity (He and Ho), lower SNPs polymorphism, longest LD (r2 = 0.3) fragment length and higher inbreeding coefficient compared to other Chinese indigenous pigs (Table 1, Fig. S3). We counted the number of ROHs of different fragment lengths in 28 pig groups. When the length was less than 500 kb, the number of ROHs in PDW pigs was similar to Chinese indigenous pigs. Nevertheless, when the ROH length was more than 500 kb, its number in PDW pigs was significantly higher than other China indigenous pigs and close to European domestic pigs (Table S2). Considering that the ROHs in PDW pigs were between China indigenous pigs and European domestic pigs, it further indicates the possibility of introgression from EUR pigs into PDW pigs.
The gene flow, selected signals and introgression regions at the genome of PDW pigs from LW
We estimated the f3 test of each ecotype and found Chinese indigenous pigs have low genetic similarity with EUR pigs (Table S3-S6). Among these, NCN groups have higher genetic similarity with EUR pigs compared with other ecotypes (Table S6). Besides, each ecotype had higher genetic similarity with Chinese Wild boar (CWB) compared with European domesticated pigs. The f3(PDW, X; OUT) was estimated to further evaluate the shared genetic drift between PDW and the other pig breeds, and the results revealed that PDW pigs shared the most genetic similarity with ECN pigs (Table S7). Noteworthy, the genetic similarity between PDW and European domesticated pigs was slightly higher than SCN pigs or CWB. The calculation results of D(LW, X, PDW, OUT) showed that PDW pigs shared the most alleles with ECN pigs (Table S8). Also, PDW shared the most alleles with LW pigs compared with other European domestic pigs which were even higher than SCN pigs and CWB.
Although PDW pig breed is a type of ECN pigs in genetic relationships, its coat color is completely different from the latter. We considered that PDW pigs have been strongly selected through human-mediated, making it possible to fix the genes causing the white coat phenotype in PDW pigs. To reduce the signal noise caused by different genetic backgrounds, Fst was performed between ECNB and PDW pigs where 18 genes were identified on the top windows of autosome including ECT2L, CCDC28A, ELMOD3, MMP19, PYM1, FANCA, ZNF276, VPS9D1, PRSS16, KIT, CCDC126, TAF3, EFCAB3, C3orf14, IDE, CDH18, RALGAPB and KMT2C (Fig. 2A). Among these, the KIT is a famous gene causing white coat color phenotype in LW pigs [48]. Total of 251 genes were identified from 623 windows with Fst value higher than Fsttop1%. The Gene Ontology (GO) terms and KEGG pathways analysis were performed online website Metascape (http://metascape.org), which showed these genes mainly participate in regulating import growing development process, such as hormone metabolic process and labyrinthine layer morphogenesis (Fig. 2B).
Above results suggested a significant gene flow between PDW and LW pigs so that we performed whole-genome rIBD analysis between PDW and LW (ELW and FLW) pigs with ECNB as control group (Fig. 3A). Several significant introgression regions from LW pigs into PDW pigs were identified on chromosome 1, 2, 5, 6, 8, 9, 10, 14 and 15. Among these regions, two on chromosome 8 (Region2) and 14 (Region1) overlapped with the whole-genome ABBA-BABA test (Fig. S4). We examined Region1 carefully and found that a chr14:90.3–91.5 Mb region (Fig. 3B) including PARG, NCOA4, MSMB, ZFAND4, MARCHF8, ALOX5, RASSF4, ZNF22, DEPP1 and TMEM72 genes were identified. These genes mainly involved in growth-related biological processes, bone development and disease-resistance processes. The haplotypes heat-map showed that PDW pigs shared similar haplotypes with KLW, ELW, DU and sectional PI pigs (Fig. 3C). However, KLR and DLR pigs did not carry the shared haplotypes. Interestingly, sectional Chinese indigenous pigs (WZS, BS, BAM, HT and LWH pigs) also carried a small amount of the shared haplotypes, further implying that EUR pigs have introgressed into Chinese indigenous pigs.
The Fst and XP-EHH of PDW and the two groups (LW and ECNB pigs) in the area showed that the genetic differentiation and the degree of selection between PDW and LW pigs were lower compared to those of ECNB pigs (Fig. 3D and 3E).
The causality of splicing mutation, DUP1 and DUP2 ofKITin relation to the coat color phenotype of Pudong pigs
We found the import white coat color related KIT gene in European white pigs in Region2 (Fig. 3A). Previous studies showed that the KIT gene harbors various alleles. Among them, LW and LR pigs carry the I alleles, such as I1 and I2 [48], while Chinese indigenous pigs carry the recessive wild-type i allele [49–51], implying that the KIT identified in PDW pigs might be from introgression of European white pigs. Further, we made a close examination of rIBD in the KIT locus and its surrounding area (8 Mb on both sides), and found that the frequency of shared IBD haplotypes between PDW and LW pigs on the chr8:40.85–41.60 Mb region was significantly higher compared to those surrounding regions (Fig. 4A). The haplotype heat-map constructed using the region revealed that the haplotypes of PDW pigs were completely different from Chinese indigenous pigs but highly similar to European domestic pigs, specifically LW and LR pigs (Fig. 4B). Also, we calculated the rIBD between PDW and the other two groups (LR and ECNB pigs), where the shared IBD at the KIT locus was not found (Fig. S5). PDW and ECNB pigs were highly differentiated in a 400-kb region on both sides of the KIT locus, which was consistent with the degree of differentiation between ECNB and LW pigs. However, PDW and LW pigs had extremely low differentiation (Fig. 4C top). Besides, PDW and LW pigs had lower nucleotide polymorphisms in the KIT locus compared to ECNB pigs (Fig. 4C bottom). The genetic relationship of the KIT gene showed that Chinese indigenous pigs and EUR pigs formed two distinct branches, where PDW pigs were clustered with FLW, ELW and LR pigs (Fig. 4D).
Previous studies confirmed four duplications (DUP1-DUP4) on the KIT locus, and of them, DUP1, DUP2 and the splicing G > A mutation on intron 17 of KIT are necessary for the manifestation of a solid white coat color in European domestic white pigs [23, 48]. We counted the genotype frequency of the KIT splicing mutation (chr8:41486012 bp) of pigs used in this study and found that Chinese indigenous pigs and Wild boars did not carry the mutation. Nonetheless, 15 PDW and 13 LR pigs and 93% of LW pigs carried the mutation. Also, other 51 PDW pigs were genotyped using primers designed by Wu, et al. [48] where we found that all 51 PDW pigs carried the mutation. We predicted the CNV within the 560-kb DUP1 region using the Hidden Markov Model (HMM), which showed PDW, LW and LR pigs had 3–6 copies of DUP1 whereas Chinese indigenous pigs, CWB, DU and EWB carried normal copies (Fig. 5A). PCR was performed using the primers designed on the breakpoint of each DUP (Fig. 5B) [48] in PDW, LW, JXH and YSH pigs. Each breakpoint sequence was identified in the amplified of PDW and LW pigs, while JXH and YSH pigs did not (Fig. 5C). The above results confirmed that PDW pigs carried DUP1, DUP2 and the splice mutation on intron 17 of KIT, which caused the white coat color phenotype in PDW pigs.