2.1 National Health and Nutrition Examination Survey (NHANES)
Data analyzed was collected from the 2013–2014 NHANES survey cycle (available from: https://wwwn.cdc.gov/Nchs/Nhanes/2013-2014/TST_H.htm). NHANES is a nationwide survey conducted annually for the purpose of collecting health and diet information from a representative, non-institutionalized U.S. population. NHANES is unique in that it combines interviews, physical examinations, and laboratory evaluations to obtain a large amount of quantitative and qualitative data. Information on NHANES survey methods are described in detail elsewhere(27). Briefly, the survey examines about 5,000 persons each year from various counties across the U.S., which are divided into a total of 30 primary sampling units (PSUs), of which 15 are visited annually. All participants provided a written informed consent in agreement with the Public Health Service Act prior to any data collection. Household questionnaires, telephone interviews, and examinations conducted by healthcare professionals and trained personnel were utilized to collect data.
2.2 Study Participants and Exclusion Criteria
The 2013-2014 NHANES cycle collected data on 10,175 individuals. In our analysis, we excluded a total 7,217 women and all children under the age of 18, leaving 2,958 men. We excluded boys under 18 since low testosterone is a rare outcome in this age group and wouldn’t provide a sufficient sample size for robust analyses (n <10). Those individuals presenting with low testosterone so early is likely attributable to genetic conditions or unusually high exposure to medications or toxicants, levels exceeding everyday exposure amounts(28, 29). From these remaining individuals, analysis was restricted to men with valid serum testosterone concentrations, as well as complete information on demographic, anthropometric, questionnaire, and laboratory variables including BMI, alcohol use, diabetes status, creatinine and albumin concentration, ethnicity, smoking status, and sex hormone binding globulin concentrations, resulting in a final analysis sample size of 372.
2.3 Assessment of Serum Testosterone
Following an overnight fast, serum samples were first taken between 8:30 a.m. and 11:30 a.m. and then testosterone concentrations were determined using a competitive electrochemiluminescence immunoassay on the 2010 Elecsys autoanalyzer (Roche Diagnostics, Indianapolis, IN, USA) with the lowest detection limit of the assay being 0.02 ng/mL. All sex steroid hormones from the present NHANES cycle were assayed at Boston Children’s Hospital (Boston, MA, USA) by laboratory technicians blinded to participant characteristics. The details for the NHANES laboratory methodology for testosterone determination are available from: https://wwwn.cdc.gov/Nchs/Nhanes/2013-2014/TST_H.htm.
2.4 Quantification of Caffeine Metabolites
Caffeine and 14 of its metabolites were quantified in urine by use of high-performance liquid chromatography-electrospray ionization-tandem quadrupole mass spectrometry (HPLC-ESI-MS/MS), and with stable isotope labeled internal standards. With the exception of paraxanthine, which is readily obtained from plasma and minimally excreted in the urine, these metabolites are readily obtained from urine and serve as good proxies for caffeine exposure(30). 50-µL aliquots of urine were diluted with 450 µL of water. Then, 100 µL of the diluted urine was combined with 120 µL of a 0.2 N NaOH solution containing stable isotope labeled internal standards. The mixture was incubated for 30 min at room temperature. Samples were then acidified with 30 µL of 2.0 N HCl and 250 µL of a 1:9 methanol/water solution containing 0.1% formic acid. Quantitation by HPLC-ESI-MS/MS was based on peak area ratios interpolated against an 11-point calibration curve derived from calibrators in synthetic urine. A further detailed description on laboratory procedures can be found elsewhere(31).
2.5 Defining Demographic Variables
Methods for questionnaire data collection are described in the NHANES procedures guide(32). Covariates related to low testosterone, as well as potential confounders were included and based on results from literature searches. Participants were classified according to highest level of education attainment, insurance coverage status, smoking status, alcohol use, diabetes status, and cholesterol status. Levels of education were based on responses by participants during the home interview. Insurance coverage status and smoking status were recorded as a yes or no response from the home interview. Alcohol use was divided into three categories of “non-drinker”, “moderate drinker”, and “heavy drinker.” Non-drinkers were defined by individuals stating they drank less than 1 alcoholic beverage a week. Moderate drinkers reported drinking between 2-8 drinks a week. Heavy drinkers were defined as drinking over 10 alcoholic drinks a week. Diabetes status was defined as a fasting serum glucose greater than 126 mg/dL, having answered yes to taking diabetic medications, or having been diagnosed by a physician with diabetes. Cholesterol status was defined by whether or not a person was told he/she has high cholesterol by a physician, if the serum total cholesterol was greater than 240 mg/dL, and/or if that person is currently taking hypercholesterolemia medications.
2.6 Statistical Analyses
Continuous variables were compared using one-way ANOVA, while categorical variables were compared using the Chi-squared test. Multivariable, ordinary least squares regression models were used to measure the association between caffeine and its urinary metabolites, and serum testosterone concentrations. Additionally, theobromine and theophylline are biologically active metabolites with known involvement in pathways that may be related to testosterone production and maintenance. Therefore, we included theobromine and theophylline in our logistic regression models, to model the odds of low testosterone based on quartile of metabolite concentration. The lowest quartile was used as the reference in each case. The complex survey design assigns a weight to each individual as a function of their probability of being randomly selected and this was considered when building our regression models. We controlled for potential confounders including ethnicity, age, BMI, education, insurance coverage status, smoking, drinking, cholesterol status, diabetes, creatinine levels, and sex hormone binding globulin levels.
All statistical analyses were performed using SAS 9.4 and SUDAAN software packages accounting for the complex survey design of NHANES(33). A p-value < 0.05 was used as the criterion for significance.