3.1 Prokaryotic expression and characterization of wbkC
A wbkC gene fragment was amplified from the B. melitensis M5-90 genome, and the size of 780 bp was as expected (Fig. 1A). The recombinant plasmid pET-28a-wbkC was constructed, and pET-28a-wbkC was digested using BamH Ⅰ and Hind Ⅲ restriction endonucleases, and the size of the digested product was about 5369bp and 780bp, which was consistent with the size of the pET-28a plasmid (5369bp) and the size of the wbkC gene fragment (780bp) (Fig. 1B). Optimizing the recombinant protein expression conditions, the highest wbkC protein expression of BL21-pET-28a-wbkC was observed at 37°C with an IPTG concentration of 1 mmoL/L for 4 h of induction (Fig. 1C and Fig. 1D). Western blot verified the presence of a specific band at a molecular weight size of approximately 29 kDa, which was consistent with the size of the wbkC protein, indicating successful expression of the recombinant fusion protein His-wbkC (Fig. 1E).
3.2 Preparation of polyclonal antibodies to wbkC protein
Recombinant fusion proteins were purified using affinity chromatography, and after purification, a single clear band appeared at a molecular weight size of about 29 kDa, which indicated a better purification effect, and the purity of the purified proteins was more than 80% (Fig. 2A). Figure 2 Alignment of gene distribution with reference genome. The purified proteins were used in animal immunization assays, and polyclonal antibody potencies to the wbkC protein ranged from 1:6400 to 1:12800 as determined by indirect ELISA (Table 2). Western blot verified the presence of a specific band at a molecular weight size of about 29 kDa, which was consistent with the size of the wbkC protein, indicating the successful preparation of a polyclonal antibody to the wbkC protein.
Table 2
Determination of Polyclonal Antibody Titer of wbkC
Dilution multiple of Antibody | P/N |
1∶800 | 11.47 |
1∶1600 | 10.47 |
1∶3200 | 5.51 |
1∶6400 | 3.50 |
1∶12800 | 2.07 |
1∶25600 | 1.83 |
3.3 Preparation of Ad_wbkC recombinant adenovirus
A wbkC gene fragment was amplified from the B. melitensis M5-90 genome, and the size of 780 bp was as expected (Fig. 3A). The shuttle plasmid ADV4-wbkC was constructed, and ADV4-wbkC was digested using EcoR Ⅰ with Hind Ⅲ restriction endonuclease, and the size of the digested product was about 7500bp and 780bp, which was consistent with the size of the ADV4 plasmid (7500bp) and the size of the wbkC gene fragment (780bp) (Fig. 3B). Recombinant adenoviruses were prepared and titers were determined using 293A cells, and cytopathic effect (CPE) was observed in samples at dilutions of 1×10 − 2, 1×10 − 3, and 1×10 − 4 72 h after transfection with recombinant adenoviruses Ad_wbkC and Ad_GFP. The titers of both recombinant adenovirus Ad_wbkC and control recombinant adenovirus Ad_GFP were 1×1010 PFU/mL (Fig. 3C and Fig. 3D).
3.4 Recombinant adenovirus Ad_wbkC mediates overexpression of wbkC in RAW264.7 cells
The multiplicity of infection (MOI) and transfection time were optimized, respectively, and when the MOI was 100 : 1 and the transfection time was 72 h, large green fluorescence was observed under the fluorescence microscope, which indicated a higher expression of the wbkC protein (Fig. 4A and Fig. 4B). Recombinant adenovirus Ad_wbkC with control recombinant adenovirus Ad_GFP was transfected into RAW264.7 cells under optimized conditions, and large green fluorescence was observed (Fig. 4C). The qRT-PCR verified that the wbkC gene expression in the Ad_wbkC test group was highly significantly higher than that in the Ad_GFP control group, with a fold difference of about 1510-fold (Fig. 4D). Western blot verified that the Ad_wbkC test group showed a specific band at a size of about 29 kDa, which was consistent with the size of the wbkC protein, and no specific band was found in the Ad_GFP control group (Fig. 4E). It indicated that Ad_wbkC successfully mediated the overexpression of wbkC in RAW264.7 cells.
3.5 Sequencing data preprocessing results
RNA concentration and purity were detected using a NanoDrop 2000 spectrophotometer, and RNA quality was examined using RNA electrophoresis.The RNA samples from the Ad_GFP control group and the Ad_wbkC test group all showed bands at the 28S, 18S and 5S positions, with no obvious spurious bands or dragging bands, indicating that the quality of the RNAs was high (Fig. 5). Preprocessing the sequencing data, the valid Reads of the Ad_GFP control group were 9,193,911,914, in which 97.81% of the bases with quality value ≥ 20 and 94.28% of the bases with quality value ≥ 30; and the valid Reads of the Ad_wbkC test group were 706,555,706, in which 97.57% of the bases with quality value ≥ 20 and 94.28% of the bases with quality value ≥ 30, meeting the requirements for the subsequent data analysis (Table 3). 93.78%, which meets the requirements for subsequent data analysis (Table 3).
Table 3
Statistics Before and After Quality Control
Sample name | Raw data | Valid data | Q20(%) | Q30(%) | GC(%) |
Ad_GFP | 93665522 | 91939914 | 97.81 | 94.28 | 48.55 |
Ad_ wbkC | 72620224 | 70655706 | 97.57 | 93.78 | 49.25 |
3.6 Analysis of overall lncRNA expression levels
Based on the FPKM value of lncRNA, the distribution of lncRNA expression in the Ad_GFP control group and the Ad_wbkC test group was demonstrated by expression distribution plots and box plots. In the expression distribution plot, the horizontal coordinate indicated the lncRNA expression level, the vertical coordinate indicated the distribution density, and the curve distribution was basically consistent across samples (Fig. 6A). In the box plots, the horizontal coordinate indicates the sample source and the vertical coordinate indicates the lncRNA expression level. The box plots for each sample contained five statistics, from top to bottom: maximum, upper quartile, median, lower quartile & minimum, and the overall levels were generally consistent across samples (Fig. 6B). It showed that the lncRNA expression trend and expression level of Ad_GFP control group and Ad_wbkC test group were overall consistent and reproducible.
3.7 Visual analysis of differential lncRNAs
The differentially expressed lncRNAs were counted, and compared to the Ad_GFP control group, there were a total of 62 significantly differentially expressed lncRNAs in the Ad_wbkC test group, of which 32 were up-regulated and 30 were down-regulated in lncRNA expression. Based on the FPKM values of the lncRNAs of each sample, the differential expression of lncRNAs was visualized by MA plots and scatter plots. Each point represents a specific lncRNA, and in the MA plot, the horizontal coordinate A represents log2 (FPKMAd_GFP* FPKMAd_wbkC) and the vertical coordinate represents log2 (FPKMAd_GFP/ FPKMAd_wbkC). In the scatter plot, the horizontal coordinate represents log2 (FPKMAd_GFP + 1) and the vertical coordinate represents log2 (FPKMAd_wbkC + 1). Red dots represent the expression of up-regulated lncRNAs, blue dots represent the expression of down-regulated lncRNAs, and gray dots represent the expression of insignificant lncRNAs (Fig. 7A vs. Figure 7B).
3.8 Cluster analysis of differential lncRNAs
Cluster analysis was used to determine the expression patterns of different lncRNAs in each sample. lncRNAs with similar expression patterns may have similar biological functions or participate in the same cell signaling pathways. In the differential lncRNA clustering heatmap, each row represents a specific lncRNA and each column represents the sample source. Based on the FPKM values of the lncRNAs of each sample, the color from green to red indicates low to high expression; dark red indicates high expression of lncRNAs and dark green indicates low expression of lncRNAs (Fig. 8A). In the trend line plot of differential lncRNA expression patterns, the horizontal coordinate indicates the sample source and the vertical coordinate indicates the lncRNA expression level. Differential lncRNAs with the same expression pattern were grouped into a subset, with each gray line representing a specific lncRNA and the blue line representing the average expression level of the subset. Subset 1 included 31 differential lncRNAs, subset 2 included 27 differential lncRNAs, subset 3 included 3 differential lncRNAs, and subset 4 included 1 differential lncRNA (Fig. 8B).
3.9 Analysis of GO and KEGG functional enrichment of lncRNA target genes
GO and KEGG functional analysis was based on enrichment significance (P-value), enriching for target genes that may have the same function or participate in the same signaling pathway.
The TOP 10 GO functionally enriched target genes cis-regulated by significantly different lncRNAs are: positive regulation of cyclic nucleotide phosphodiesterase activity (GO:0051343), regulation of cytolysis in other organisms (GO:0051710), positive regulation of cytolysis in other organisms (GO:0051714), N-acetylmuralyl-L-alanine amidase activity ( GO:0008745), peptidoglycan receptor activity (GO:0016019), negative regulation of natural killer cell differentiation (GO:0032824), negative regulation of natural killer cell differentiation in the immune response (GO:0032827), growth in symbiotic interactions (GO:0044110), growth of symbionts in the host (GO:0044117), regulation of natural killer cell differentiation involved in the immune response (GO:0032826) (Fig. 9A).
The TOP 10 GO functionally enriched target genes with significant differences in lncRNA trans-regulation are DNA binding (GO:0003677), immune system process (GO:0002376), nucleobase-containing compound biosynthesis process (GO:0034654), RNA biosynthesis process (GO:0032774), macromolecule biosynthesis process (GO. 0009059), Heterocyclic biosynthesis process (GO:0018130), Aromatic compound biosynthesis process (GO:0019438), Cellular macromolecule biosynthesis process (GO:0034645), Organocyclic compound biosynthesis process (GO:1901362), Transcription, DNA template (GO:0006351) (Fig. 9B) .
The TOP 10 target gene KEGG signaling pathways enriched for cis-regulation of significantly different lncRNAs are MAPK signaling pathway-plant (ko04016), plant-pathogen interaction (ko04626), phototransduction-fly (ko04745), phototransduction (ko04744), olfactory transduction (ko04740), long duration enhancement ( ko04720), amphetamine addiction (ko05031), renin secretion (ko04924), glioma (ko05214), Fanconi anemia pathway (ko03460) (Fig. 9C).
The TOP 10 target genes enriched for KEGG signaling pathway regulated by significantly different lncRNAs in trans are: rheumatoid arthritis (ko05323), systemic lupus erythematosus (ko05322), tuberculosis (ko05152), NOD-like receptor signaling pathway (ko04621), phagolysosomes (ko04145) non-homologous end joining (ko03450), graft-versus-host disease (ko05332), alcoholism (ko05034) cytoplasmic DNA-sensing pathway (ko04623), allograft rejection (ko05330) (Fig. 9D).
3.10 Mining of lncRNAs associated with cellular autophagy signaling pathway
Based on the results of GO and KEGG enrichment analysis of differentially expressed lncRNA target genes, the cellular autophagy-associated lncRNAs with TOP 15 expression were screened for qRT-PCR validation, and the differences between lncRNA 4933430A20Rik (ENSMUSG00000110116) and lncRNA B930036N10Rik ( ENSMUSG00000091993) were consistent with the sequencing results, with approximately 1.5-fold and 2.5-fold differential expression folds, and highly significant differences between groups (Fig. 10A vs. Figure 10B).