Searching and Screening
We searched the above 11 databases and obtained 677 literatures. After browsing the titles and abstracts, we downloaded 21 full texts. Finally, four studies [25–28] were included. Three were case-control studies and the remaining one was a cohort study, all of which were published in English. The flow chart is shown in Fig. 1.
Characteristics Of The Included Studies
A total of 12912 participants were included in four studies [25–28]. Two of the four case-control studies [25, 27] observed male with abnormal sperm, such as dyspermia, azoospermia or oligospermia, and the other two involved female patients with primary infertility or minimal/mild endometriosis. The only cohort study [28] observed nulliparous female without a history of infertility. The exposures of two studies [25, 27] were coffee, and the other two were caffeine containing beverage (CCB), including caffeinated soft drinks, coffee, tea, cocoa or cola et al. For two of the studies [26, 28], exposure intensity were analyzed according to the daily caffeine intake (mg/d), and for the other two studies [25, 27] caffeine intake was calculated according to the corresponding conversion relationship of caffeine consumption according to each kind of CCB.
In three case-control studies [25–27], odds ratio (OR) were used to indicate the risk of infertility caused by coffee or caffeine intake. The cohort study [28] used hazard ratio (HR) to indicate the risk of caffeine intake. HR is a risk ratio that takes into account the time factors, that is, when the number of events is included in the dynamic cohort, it also takes into account the loss of follow-up or other exposure factors to cause the event to occur. In this review, OR and HR cannot be directly converted. The basic characteristics of the four studies included are shown in Table 1.
Table 1
Characteristics and results of included studies concerned caffeine intake with infertility
Study ID | Study Type | Participants | Exposure (Caffeine/coffee) | Comparison | Outcomes |
Gender | Age | Sample size | Disease | Relative Risk(95%CI) |
Buiatti 1984 | Case-Control Study | Men | 20-54y | 239 | ≥ 100 mg/d | 0 mg/d | Azoospermia/oligospermia | OR = 0.91 (0.46–1.82) |
Grodstein 1993 | Case-Control Study | Women | Unclear | 4883 | E1 100–167 mg/d; E2 168–233 mg/d; E3༞233 mg/d | ༜100 mg/d | Primary infertility | OR1 = 1.00(0.79–1.25); OR2 = 1.16(0.94–1.42); OR3 = 1.36(1.16–1.60) a, b |
Prazzini 1993 | Case-Control Study | Men | Median 31-33y | 216 | E1 200–300 mg/d; E2 ≥ 400 mg/d | 0-100 mg/d | Infertile men with dyspermia | OR1 = 1.7(0.8–3.7); OR2 = 5.4 (2.4–12.6) c |
Liv2018 | Cohort Study | Women | 20-29y | 7574 | Coffee: E1 ≤ 200 ml/d E2 200–400 ml/d E3 ≥ 500 ml/d | 0 ml/d | Infertile women | HR1 = 0.86(0.70–1.06); HR2 = 0.88(0.73–1.06); HR3 = 0.89(0.72–1.10); |
| | | | | Tea: E1 ≤ 200 ml/d E2 200–400 ml/d E3 ≥ 500 ml/d | 0 ml/d | | HR1 = 1.10(0.84–1.44); HR2 = 1.10(0.84–1.46); HR3 = 1.15(0.87–1.53); |
| | | | | Caffeined: E1 ≤ 1-168 mg/d E2 169–333 mg/d E3 334–579 mg/d E4 ≥ 580 mg/d | 0 mg/d | | HR1 = 0.93(0.58–1.49); HR2 = 0.91(0.57–1.47); HR3 = 0.97(0.60–1.55); HR4 = 0.93(0.58–1.50)e |
Note: E: Exposure; OR: Odds ratio; HR: Hazard Ratio; CI: Confidence interval. |
a This study classified the participants into 5 groups according to their primary disease and calculated the estimate effect of them respectively. Details of the information were reported in the main text of this review. |
b OR were adjusted for center, age, lifetime number of sexual partners, current and former cigarette smoking, and alcohol intake. |
c Age-Adjust OR. |
d Total caffeine calculated from consumption of both coffee and tea. |
e Educational level (≤ 9, 10–11 or ≥ 12 years of schooling), Smoking (yes or no), Marital status (married/cohabiting or single), Weekly alcohol intake and Year of birth-Adjust HR. |
Methodological Quality Of Included Studies
The full score of NOS is 9 points. Only one case-control study [25] used blinded investigation, and the other three studies [26–28] used unblinded investigations or case records as a way to determine exposures or outcomes. Only one case-control study [25] used community controls, and the rest were hospital controls. One case-control study [25] did not describe non-response rate and did not adjust for confounding factors. Three case-control studies gained 6, 6 and 7 points respectively. The cohort study achieved9points.The methodological quality of the case-control studies included in this review is general, but the cohort study is good. The methodological quality of included studies is shown in Fig. 2.
Estimate relationship between caffeine intake and infertility
In Buiatti's study [25], males with azoospermia or oligospermia (i.e., sperms/ml < 20, 000, 000) were used as the case group, and normal routine examinations and sperm counts > 20, 000, 000/ml were used as the control group in the same period. The study showed that drinking coffee seems to have no correlation with sperm abnormalities (RR 0.91, 95%CI 0.46–1.82). Although it was mentioned that the coffee intake was divided into different doses for further analysis, relevant data was not provided and the corresponding relative risk were not calculated based on them.
Prazzini's study [27] had three groups, in which infertile men with dyspermia were used as the case group, and normospermic infertile men and fertile men with unknown semen quality as the control groups. Our study included only the case group and the fertile control group. With the number of cups of coffee per day, the risk of dyspermia increased. Compared to less than 100 mg/d caffeine intake those who had 200–300 mg/d caffeine intake may have an OR of 1.7 (95%CI 0.8–3.7), and those who had more than 400 mg/d caffeine intake may have even higher risk of infertility (OR 3.4, 95%CI 2.4–12.6).
In Grodstein's study [26], primary infertility was used as the case group and those with a history of delivery in the past two years as the control group. It reported the number of infertile patients with different doses of caffeine intake in different etiology. After adjusting for the relevant confounding factors, the risk of infertility caused by tubal disease (RR 1.5, 95%CI 1.1-2.0) increased significantly when caffeine consumption more than 7 g/m (233 mg/d), and the risk of cervical factors (OR 1.3, 95%CI 0.7–2.4, for 5.1-7 g/m and OR 1.4, 95%CI 0.9–2.3, for > 7 g/m) or endometriosis (OR 1.9, 95%CI 1.2–2.9, for 5.1-7 g/m and OR 1.6, 95%CI 1.1–2.4, for > 7 g/m) increased significantly when caffeine intake greater than 5 g/m (167 mg/d).We added the number of case together regardless the cause of infertility and re-analyzed the odds ratio. The results showed with 101–167 mg/d caffeine intake may not increase the risk of infertility compared to less 100 mg/d caffeine intake (OR 1.00, 95%CI 0.79–1.25), however, higher dose of caffeine intake may increase the risk (OR 1.16, 95%CI 0.94–1.42, for 168–233 mg/d; OR 1.36, 95%CI 1.16–1.60, for > 233 mg/d).
The only cohort study included [28] was a retrospective cohort study. It was divided into exposure and non-exposure group according to whether the participants drink coffee or tea. For different doses of total caffeine consumption (from coffee and tea), the risk of infertility for consumers was similar to that of never consumers. Only 100 mg/d of caffeine consumed did not affect the risk of primary infertility in the consumers (HR 1.00, 95% CI 0.98–1.02).
Details of the results from each single study was also shown in Table 1.
Considering the clinical heterogeneity among each study, we chose to present the single study’s results with bubble plot first. The X-axis represents the effect value (OR), the Y-axis represents the methodological quality score (according to NOS), and the bubble size reflects the size of the sample size. Figure 3(A) showed the risk of caffeine intake (more than 100 mg/d) for infertility from 3 studies, all of them found no difference of incidence rate of infertility between low dose and no caffeine intake (OR varied from 0.77 to 1). Figure 3(B) showed the risk of caffeine intake (more than 200 mg/d) for infertility also from 3 studies. Both case-control studies found that the incidence of infertility was higher in the medium dose than no caffeine intake, but the cohort study did not show a different incidence of infertility between the two groups (OR varied from 0.67 to 1.64). Figure 3(C) showed the risk of caffeine intake (more than 400 mg/d) for infertility from 2 studies. The case-control study (OR 4.99) found that the incidence of infertility was higher in the high dose than no caffeine intake, but the cohort study did not show a different incidence of infertility between the two groups (OR 0.74).
Meta-analysis was also conducted (Fig. 4). Subgroup was classified according to the type of the study (case-control and cohort study). The results also showed caffeine intake may not increase the risk of infertility (lose dose: OR 0.95, 95%CI 0.78–1.16, M-H Fixed, I2 = 0%, P = 0.64, 4627 participants, 3 studies; medium dose: OR 1.14, 95%CI 0.69–1.86, M-H Random, I2 = 72%, P = 0.0006, 5353 participants, 3 studies; high dose: OR 1.86, 95%CI 0.28–12.22, M-H Random,I2 = 94%, P < 0.0001, 2435 participants, 2 studies). However, due to the obvious statistical heterogeneity when pooling the data for high dose of caffeine intake assessment (I2 = 94%,), results of this meta-analysis was only used for GRADE evaluation.
Overall Quality Of Evidence By Grade
We used the GRADE method to evaluate the certainty of the evidence. For the three different doses of formed evidence bodies, no matter from which aspects, including large effect, dose-response gradient and plausible confounding, cannot be upgraded. The overall quality of the three evidence is all low (Table 2).
Table 2
GRADE Summary of Findings Table
Caffeine intake | Large effect | Plausible confounding | Dose-response gradient | Anticipated absolute effects* (95% CI) | Relative effect (95%CI) | No of Participants (studies) | Certainty of the evidence (GRADE) |
Risk with non caffeine | Risk with caffeine |
Low dose (≤ 100 mg/d) | No | No | No | 197 per 1,000 | 189 per 1,000 (161 to 221) | OR 0.95 (0.78–1.16) | 4627 (2case-control studies + 1 cohort study) | ⊕⊕○○ Low |
Medium dose (≥ 200 mg/d) | No | No | No | 194 per 1,000 | 215 per 1,000 (143 to 309) | OR 1.14 (0.69–1.86) | 5353 (2case-control studies + 1 cohort study) | ⊕⊕○○ Low |
High dose (≥ 400 mg/d) | No | No | No | 186 per 1,000 | 298 per 1,000 (60 to 736) | OR 1.86 (0.28–12.22) | 2435 (1case-control study + 1 cohort study) | ⊕⊕○○ Low |
*The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI). CI: Confidence interval; OR: Odds ratio |
GRADE Working Group grades of evidence High certainty: We are very confident that the true effect lies close to that of the estimate of the effect Moderate certainty: We are moderately confident in the effect estimate: The true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different Low certainty: Our confidence in the effect estimate is limited: The true effect may be substantially different from the estimate of the effect Very low certainty: We have very little confidence in the effect estimate: The true effect is likely to be substantially different from the estimate of effect |