Profiling of exosomal tsRNAs in seminal plasma
To identify tsRNA as a promising biomarker for NOA, we first isolated exosomes from seminal plasma and prepared tsRNA library for high throughput sequencing. Since tsRNAs are decorated by RNA modifications that interfere with tsRNA-seq library construction[25], total RNA extracted from exosomes are pretreated to get rid of some RNA modifications which interfere with tsRNA-seq library construction. Then the selected tsRNAs as biomarkers were further verified in a subsequent set of blood serum and seminal plasma (Fig. 1A) by RT-qPCR and bioinformatic analysis. The prepared seminal plasma exosomes were confirmed by transmission electron microscopy measurement (Fig 1B) and nanoparticle tracking analysis (Fig 1C).
tRF-5 is the most abundant tRNA-derived small RNA in exosomes (56%), whereas tRF-2, tRF-1, and tiRNA-3 only account for 2.9% (Fig. 1D). We also analyzed the origin of these tsRNAs and found that the origin for tsRNAs in seminal plasma was distributed in 22 tRNAs (Fig 1E) which was different from tsRNAs distribution in plasma such that almost all tRNA-5s derive from four tRNAs (Gly-, Lys-, Glu- and Val-tRNA)[11]. The distributions of 9 tsRNAs were further analyzed, and it was found that the distribution of each type of tsRNA exhibited significant bias. Peculiarly, a large majority of tRF-5c were enriched in five tRNAs (Glu-, Gly-, His-, Lys- and Pro-tRNA); while tRF-1s were mainly enriched in Ser-TGA tRNA (Fig. 1F). The length distributions of tsRNAs were then analyzed and it was found that tRF-1 were 15-21 nt, tRF-2 were 14 nt, tRF-3a were 17-18 nt, tRF-3b were 19-22 nt, tRF-5a were 14-16 nt, tRF-5b were 22-24 nt, tRF-5c were 28-32 nt, tiRNA-3 were 40 nt and tiRNA-5 were 33-35 nt (Fig 1G). These results suggested that tsRNAs were not the product of random explanations from tRNA, but may regulate biological function through an unknown mechanism.
Identification of differentially expressed tsRNAs between NOA and OA patients
To comprehensively profile exosomal tsRNAs among NOA patients, OA patients, and healthy people, we first used Principal Component Analysis (PCA) method which was an unsupervised analysis to reduce the dimensionality of large data sets, and a distinguishable tsRNAs expression profiling among three groups was found (Fig 2A), indicating that the function of exosomal tsRNAs maybe associated with spermatogenesis and it is also effective to select markers from the tsRNAs profile that can distinguish between NOA patients and OA patients. We then screened the significantly differentially expressed tsRNAs between NOA patients and OA patients and found that 38 tsRNAs up-regulated and 46 tsRNAs down-regulated (difference of more than 3-fold change, P < 0.05) (Fig 2B). Then we found that 100 tsRNAs were differentially expressed between NOA patients and healthy people (16 up-regulated, 84 down-regulated) (Fig 2C).
tRF-Val-AAC-010 and tRF-Pro-AGG-003 are the biomarkers of NOA and associated with spermatogenesis status
To identify the potential biomarker for NOA patients, we selected eight tsRNAs which showed significant differences in expression when compared with OA patients (difference of more than three-fold change; P < 0.05, examined by both negative binomial distribution and student's t-test). Moreover, these eight miRNAs were further validated individually in subsequent samples (20 NOA samples, 15 OA samples) by RT-qPCR, although only four of them (tRF-Pro-AGG-005, tRF-Val-AAC-010, tRF-Pro-AGG-003 and tRF-Gly-CCC-002) significantly differentially expressed between NOA patients and OA patients (Fig 3A), the expression values of these four tsRNAs resulted in good predictive accuracy (tRF-Pro-AGG-005 (AUC 0.98,sensitivity = 95%, specificity = 80%,P< 0.0001); tRF-Val-AAC-010 (AUC: 0.96,sensitivity = 95%, specificity = 80%,P<0.0001); tRF-Pro-AGG-003 (AUC: 0.96,sensitivity = 95%, specificity = 87%P< 0.0001); tRF-Gly-CCC-002 (AUC: 0.73,sensitivity = 85%, specificity = 60%,P=0.02)) (Fig. 2B). As a comparison, the ROC curve analysis of testicular volume and blood FSH levels which was the classic predictor for the NOA patients was also determined, it was found that tRF-Pro-AGG-005, tRF-Val-AAC-010, tRF-Pro-AGG-003 as NOA predictors had better predictive accuracy than the testicular volume and blood FSH levels (Fig. 2B). In this study, eight miRNAs were also validated in 10 healthy samples and 5 IO samples, we found that tRF-Val-AAC-010 and tRF-Pro-AGG-003 expression levels were significantly increased when compared with that from OA samples and healthy samples (Fig 3C) and tRF-Val-AAC-010 and tRF-Pro-AGG-003 expression levels in idiopathic oligospermia (IO) samples were in the middle of NOA and OA samples. These results indicate that tRF-Val-AAC-010 and tRF-Pro-AGG-003 are associated with spermatogenesis status.
We further explored whether the target genes of these tsRNAs are associated with spermatogenesis. We found 24 target genes of tRF-Val-AAC-010 and tRF-Pro-AGG-003 enriched at least four-fold higher than mRNA level in the testis compared to any other tissue (Supplementary Table 1). The 24 genes were employed for DAVID gene ontology (GO) and GAD disease enrichment analysis. GO term biological process categories showed that the target genes are involved in spermatogenesis, GAD_DISEASE analysis indicates the target gene-related diseases were azoospermia, oligospermia, and male infertility (Fig. 4A). These results further confirm that tRF-Val-AAC-010 and tRF-Pro-AGG-003 are involved in the regulation of spermatogenesis and can be used as markers to predict NOA patients. To better understand the functions and the corresponding genetic regulatory networks of tRF-Val-AAC-010 and tRF-Pro-AGG-003, the gene regulatory network was visualized by the software CytoScape[26] (Fig 4B). Moreover, consistent with the above results from seminal plasma, tsRNAs examined in the testis from NOA patients and OA patients showed that the tRF-Val-AAC-010 and tRF-Pro-AGG-003 levels in NOA patients were also higher than those in OA patients, indicating that the tRF-Val-AAC-010 and tRF-Pro-AGG-003 expression levels can reflect the individual testicular tissue pathological status and therefore could serve as a novel biomarker for NOA diagnosis.
Multifactor models for diagnosis of non-obstructive azoospermia
Although tRF-Val-AAC-010 and tRF-Pro-AGG-003 have a better predictive accuracy than the classic predictor of blood FSH level and testicular volume, none of them have a 100% sensitivity and 100% specificity (Fig 5A). We found that the combination of tRF-Val-AAC-010 and tRF-Pro-AGG-003 and the combination of tRF-Val-AAC-010, tRF-Pro-AGG-003 and blood FSH level can both completely discriminate NOA patients from OA patients with 100% sensitivity and 100% specificity (Fig 5B-C). As a comparison, the discriminability of the combination of testicular volume and blood FSH levels which was the classic predictor for NOA patients was also determined and it was found that the combination of testicular volume and blood FSH levels can not completely discriminate NOA patients from OA patients and healthy people (Fig 5D). A binomial regression equation obtained from the generalized linear model (GLM) was then presented by using the tRF-Val-AAC-010 and tRF-Pro-AGG-003 expression levels to calculate multiple-factor risk score for predicting NOA patients. The multiple-factor risk score was equal to -exp(321.9 - 166.1× tRF-Pro-AGG-003-234.5 × tRF-Val-AAC-010) / (1 + exp(321.9 - 166.1 × tRF-Pro-AGG-003-234.5 × tRF-Val-AAC-010)). If the multiple-factor risk score is less than 0.5, the patient can be diagnosed as non-obstructive azoospermia; if the result is greater than 0.5, the patient can be diagnosed as obstructive azoospermia.