The presence of pCoV subgenomic mRNA in the skin
To test whether Dahu skin was infected by pCoV (in addition to its lungs and brains12), we searched for viral sequences among the transcriptome data. A total of 193 reads mapped to the reference pCoV genome (Figure S1). The distribution of these mapped reads was consistent with the corresponding locations of the subgenomic mRNAs22. We also observed an individual read that spanned precisely the splicing sites: this read was 150 bp in length and its 5’ 47 bp mapped to the 5’ region of the pCoV genome, while its 3’ 103 bp mapped to the 3’ region of the same genome. This read indicates that the CoV RNA has been processed in the host cell23. Consistently, we detected the N gene (which is used in diagnostic human SARS-CoV-2 testing) in all the tested Dahu organ using qRT-PCR12, including the skin. To confirm the identity of the viral RNA in Dahu skin, we compared consensus sequences from our mapped reads with CoV from other species (Figure 1; Table S1). We observed that the pCoV genome and genes from Dahu skin were almost identical (sequence identity = 99.2-100%) to the counterparts from another pangolin, Guangdong pCoV isolate MP78924, confirming the presence of pCoV RNA in the Dahu skin.
Dahu skin transcriptional responses to pCoV
To explore the host skin transcriptional responses, we identified 3,201 DEGs (1,810 up-regulated and 1,391 down-regulated) in Dahu skin (Table S2). Consistent with the results observed in human COVID-19 patients13, we found terms associated with immunity processes were significantly enriched and up-regulated, such as terms related to non-interferon cytokine production, T-cell, neutrophil, myeloid cell, and mast cell differentiation and activation (Figure 2D). Apoptotic signalling pathways were negatively regulated, especially apoptotic processes of immune cells, such as myeloid cells and leukocytes. Also, rRNA metabolic process, RNA splicing, mRNA transcription, translational processes, and protein folding were also up-regulated. Specifically, we found genes associated with viral transcription and viral gene expression were up-regulated. Other terms, including complements and cellular response to biotic stimulus were up-regulated, while cell cycle processes were down-regulated.
Strikingly, we found that the coronavirus disease-COVID-19 pathway was the most significant up-regulated pathway (Figure 2E; Table S3; Figure S2). Ribosomal proteins were up-regulated (Figure S3A; Figure 2E; Table S3) as CoV needs ribosome frameshifting to translate and replicate25,26. It has been shown that the nonstructural protein 1 of the SARS-CoV-2 is a major virulence factor that interferes with host RNA translation by binding to 40S ribosomal subunit27,28. Li summarised the functional relationships between host ribosomal proteins and viral infection29, and suggested most interactions are beneficial for viral protein translation and replication. We found ribosomal proteins that are crucial for viral infection were up-regulated in Dahu skin (Figure S3), such as RPL330, RPL1831–34, and RPL2435. Moreover, we found RPL9 and RPL22 which help virus particle assembly and viral gene expression, were up-regulated in Dahu-skin, but not human patient lungs (Figure S3B), probably a new strategy to promote its replication in pangolin36,37.
Other up-regulated pathways include pathways in cancer, TNF signalling pathway, complement and coagulation cascades, chemokine signalling pathway, immunity pathway, and viral interaction/infection related pathways (Figure 2E; Table S3). Furthermore, the DEGs were down-regulated in pathways associated with linoleic acid and lipid metabolism (Figure 2E; Table S3). Again, these host responses are consistent with the core pathways identified in the human SARS-CoV-2 infection13.
Comparative analysis of transcription in Dahu skin and human lungs
We further investigated the similarities and differences of host responses between Dahu skin and the human lungs by comparing DEG lists between our study and an external human lung study17. Our comparative analysis revealed 366 DEGs shared between species (i.e., ‘common DEGs’ below), 2,835 Dahu skin-specific DEGs, and 1,527 human lung-specific DEGs. As anticipated, the common DEGs were enriched in coronavirus diseases-COVID-19 pathway, followed by MAPK signalling pathway, apoptosis, C-type lectin receptor signalling pathway and Kaposi sarcoma-associated herpesvirus infection. These findings are consistent with the pCoV infection in Dahu skin (Figure 3A).
Interestingly, interferon-specific responses were exclusively up-regulated in human lungs compared to Dahu skin. We found interferon responses (e.g., interferon-gamma production) significantly enriched and up-regulated in human lung-specific DEGs (Figure 3C-D; Table S4), but not in Dahu skin-specific DEGs (Table S5). We also downloaded genes related to interferon signalling pathways in Reactome pathway database38,39 and found 131 genes were expressed in Dahu skin and/or human lungs (Figure 3E; Table S6). Among them, 102 genes were expressed in human lungs, while 88 genes were expressed in Dahu skin (Figure 3E). We found 42 of them (37 up-regulated and 5 down-regulated) were differentially expressed in human lungs, but none of them was differentially expressed in Dahu skin. It has been reported that pangolins have unique immune characteristics14,15: both IFNE and IFIH1 (a cytoplasmic RNA sensor that helps initiates the innate immune response to viral infection) genes have been pseudogenised. IFNE plays an important antiviral role in epithelial cells40–46. Therefore, IFNE-deficient pangolin skin may be susceptible to CoV infection. Therefore, it indicates pangolin might adopt alternative strategies to fight against CoV infection over evolutionary time.
In our comparative analysis, we found three enriched pathways in Dahu skin-specific DEGs (Figure 3B), in which malaria and Staphylococcus aureus infection pathways were up-regulated in Dahu skin-specific DEGs, while arachidonic acid (AA) metabolism pathways were down-regulated. These pathways may indicate unique pangolin host skin responses. Malaria pathway is commonly up-regulated after SARS-CoV-2 infection13, and anti-malarial drugs have shown effects on inhibiting SARS-CoV-2 replication47. In addition, we discovered that malaria pathway was also up-regulated in human lungs. As it is one of the three Dahu skin-specific pathways, it indicates malaria pathway was further up-regulated in Dahu skin. Consistent with other up-regulated inflammatory responses, we observed Staphylococcus aureus infection pathway was specifically up-regulated in Dahu skin.
In addition, the AA metabolism pathways were down-regulated in Dahu skin-specific DEGs (Figure 3B), indicating that these pathways were suppressed by pCoV infection. It is known that AA pathways have inhibitory effects on coronavirus replication, suggesting that lipid metabolism could be a druggable target of coronavirus-infected patients48. Therefore, pCoV might also suppress AA metabolism pathway in pangolin skin to benefits its replication.
Responses of endogenous retrovirus (ERV) gene expression in the pCoV skin infection
Then, we investigated the relationships between host ERV gene expression and pCoV infection. We identified 6,076 ERV genes by screening 3,162 known viral proteins from Swiss-Prot across the pangolin genome (Table S7). We found 466 genes expressed in infected and/or non-infected pangolin skin, in which 348 of them (81 up-regulated versus 267 down-regulated) were differentially expressed (Figure 4A; Figure S4; Table S8), suggesting that the exogenous pCoV might suppress the expression of ERV genes after infection to benefit its replication. We found most of the ERV DEGs were env (43%), pol (31%), and gag (16%) (Figure 4B; Table S8), while the compositions in the genome were 28%, 38%, and 20%, respectively (Figure S5A; Table S8). The most abundant group of ERV DEGs were most closely related to mouse intracisternal A-particle (IAP) viruses (Figure 4D; Table S8). Mouse and Hamster IAP viruses make up 19% of the ERV DEGs, while only 11% of the IAP sequences are found in the pangolin genome (Figure S5C; Table S7), which may suggest the importance of IAPs in CoV infection and their possible interaction with the virus.