Data source
The present study used publicly available, cross-sectional data from the Health Information National Trends Survey (HINTS) [49]. HINTS is a nationally representative survey collected by the National Cancer Institute. This study used HINTS 5 Cycle 1, 2, 3, and 4 in 2017-2020. HINTS 5 is a random digit-dialed telephone survey or a mailed questionnaire survey in non-institutionalized civilians aged 18 and older in the U.S. Geographic addresses were stratified by two areas with high concentration of minority population or low concentration of minority population in HINTS 5 Cycle 2, 3, and 4. HINTS 5 Cycle 1 included one more stratification in geographic address, counties of Central Appalachia. Our study followed the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guidelines [50]. The total number of survey respondents in this study were 16,092. 3,285 from Cycle 1, 3,504 from Cycle 2, 5,438 from Cycle 3, and 3,865 from Cycle 4. Each response rate was 32.4 %, 32.4 %, 30.3 %, 36.7 %, respectively [49]. Data received full sample weights for the sample to be nationally representative. 50 replicate weights were applied to calculate standard errors as suggested by HINTS analysis recommendations. The full-sample weight accounted for household-level base weight, non-response, person-level initial weight and other biases [51].
Cancer survivor status
Consistent with the National Cancer Institute definition of cancer survivor [52], cancer survivor status in this study was identified by the question; “Have you ever been diagnosed as having cancer?” Those who affirmatively responded ‘yes’ were defined as cancer survivor. Using the question, “At what age, were you first told you had a cancer?” HINTS calculated time since cancer diagnosis and provided it in 4 levels: less than 1 year, 2-5 years, 6-10 years, more than 11 years. Participants reported their cancer types and were classified as having breast, cervical, prostate, colon, lung, skin cancer, melanoma, multiple cancers, and other cancers. Other cancers included bladder, bone, endometrial, head and neck, leukemia/blood, liver, lymphoma (Hodgkin’s and non-Hodgkin’s), oral, ovarian, pancreatic, pharyngeal, rectal, renal, and stomach cancer.
Outcome variables
Alcohol consumption
To investigate the number of average drinks per week, we used data derived from two survey questions about average number of drinks per day and the number of days of having at least one drink per week during the past 30 days. We categorized alcohol consumption per week into light (0-3), moderate (4-6), heavy (≥7) drinks, as done previously [42, 53]. While there has been no consensus on the alcohol consumption guidelines for cancer survivors, we primarily referred to the National Health Interview Survey (NHIS)- categories [53]. Then, we combined moderate and heavy drinking as risky alcohol use because American Cancer Society (ACS) and International Agency for Research on Cancer (IARC) both consider these high risk groups [42, 54, 55].
Tobacco use
To investigate the current smoking status, the following questionnaire was used. “How often do you now smoke?” Respondents who answered, “Every day”, “Some days” were considered as current smokers. To investigate the current e-cigarette smoking status, the similar questionnaire was used. “Do you now use e-cigarette every day, some days, or not at all?” If respondents answered, “Every day”, or “Some days”, they were considered as current smokers or e-cigarette smokers, as done previously [42].
Physical activity
To investigate the weekly minutes of moderate exercise, data from two survey questionnaires about the number of days of moderate exercise (such as brisk walking, bicycling at a regular pace, and swimming at a regular pace) per week and minutes of moderate exercise per day were used. We categorized the level of physical activity into two groups; physically active (more than 150 minutes of weekly moderate exercise) and physically inactive (0-150 minutes of weekly moderate exercise) based on the U.S. Physical Activity Guidelines [56, 57].
Covariates
The conceptual framework of social determinants of health from the Healthy People 2030 [58] was applied for the choices of sociodemographic predictors in this study: Age (18 to 34, 35 to 49, 50 to 64, 65 to 74, 75 or older), birth gender (male, female), race/ethnicity (non-Hispanic White, Non-Hispanic Black/African American, Hispanic, non-Hispanic Asian, Other), household income (< $20,000, $20,000 to <$35,000, $35,000 to < $50,000, $50,000 to < $75,000, $75,000£), educational attainment (less than high school, high school graduate, some college, college graduate or more), marital status (married or living with a romantic partner as a married vs. not married including divorced, widowed, separated, single/never been married), employment status (employed vs. unemployed including homemaker, student, retired, disabled), health insurance type (insured by employment, private insurance, Medicaid, Medicare, Tricare, Veterans Affairs, Indian Health Services), rurality of residence (metropolitan, micropolitan, small town, rural). Rurality was determined by Urban Rural Commuting Area (RUCA) that categorizes census tracts based on population density, urbanization, commuting patterns developed by United Status Department of Agriculture [59]. Clinical predictors included medical conditions (diabetes, high blood pressure, heart disease, lung disease, arthritis, depression) and psychological distress (little interest, hopelessness, nervousness, worrying).
Statistical analysis
In descriptive analysis, we conducted a Chi-Square (categorical data) and a t-test (continuous data) to demonstrate demographic and clinical characteristics of cancer survivors and the prevalence of unhealthy behaviors (alcohol consumption, tobacco use, e-cigarette use, physical inactivity). Categorical data was presented with frequency (n) and weighted percentage (%) and continuous data was presented with mean and standard deviation (Table 1 and 2). Survey weighted multivariate logistic regression was performed to estimate the odds of alcohol consumption (light, moderate/heavy drinking), tobacco use (current, former, never), e-cigarette use (current, former, never), and physical activity (0-150, 150< weekly minutes) for selected sociodemographic factors (e.g., age, birth gender, race/ethnicity, educational attainment, household income, marital status, health insurance, rurality of residence), clinical covariates (e.g., medical condition and psychological distress) (Table 3). We also performed weighted survey logistic regression for each health behavior to observe relationships between multiple unhealthy behaviors after adjusting associated factors found in Table 3 (Table 4). The statistical significance was determined at p-value < 0.05 and alpha= 0.05. SAS 9.4 (SAS studio 3.8, Cary, NC, USA) was used for the analysis.