Study population
The data used in this study comprised of 323, 373 participants of the UK Biobank. Briefly, the UKB is a large-scale prospective study with over 500, 000 participants aged 37 to 73 years, enrolled from 2006 to 2010 and followed up to 2016. These participants attended one of 22 assessment centers in England, Wales, and Scotland, where they completed baseline questionnaires, underwent various anthropometric measurements, and reported medical conditions. The North West Multicenter Research Ethical Committee approved the study, and all participants signed written informed consent. A detailed description of the study design of the UKB was detailed elsewhere [29, 30]. We applied for the related data according to the rules of the UKB data sharing policy under the approved 64689. All sleep behaviors were self-reported, and details of assessment were available online.
We excluded 72,457 participants whose health lifespan had terminated before, according to in-patient hospital admissions data (UKB data category 2000). Then, we excluded 28,816 participants whose health lifespan had terminated prior, according to self-reported diagnoses obtained via verbal interview (UKB data category 100074) as a compliment. Additionally, 77861 participants who had missing data on sleep-related variables were also excluded. Finally, the study population comprises 323, 373 participants of the UK Biobank (Figure S1).
Definition of health lifespan
Based on the incidence of chronic diseases, a study based on the UKB database reported a cluster of the top-eight morbidities strongly associated with ageing, which we used to define health lifespan [31]. These top-eight morbidities included congestive heart failure (CHF), myocardial infarction (MI), chronic obstructive pulmonary disease (COPD), stroke, dementia, diabetes, cancer, and death. A participant diagnosed with any of these conditions first during the study period was considered to have terminated health lifespan. For each selected condition, except for cancer and death, we compiled a list of hospital data codes (ICD-10) and self-reported data codes (UKB data coding 6) that define these conditions in our study. We used National cancer registries linkage to UKB (UKB data category 100092) to define cancer and National death registries linkage to UKB (UKB data category 100093) to define death event. However, the National cancer registry linkage to UKB was updated only to 14th December 2016, earlier than the other two databases (in-patient hospital admissions data: 31st March 2017; National death registries linkage to UKB: 14th February 2018). To ensure consistency between the three database update dates used to build a healthy lifespan, we had to set 14th December 2016 as the end of follow-up.
Statistical analysis
Personal follow-up time was calculated from the baseline assessment date until the date health lifespan was terminated or end of followed up. We applied descriptive statistics (mean and percentages) and multivariate adjusted Cox proportional hazards regression models to examine the association between sleep behaviors and risk of health lifespan termination. Model 1 was adjusted for age, sex, ethnicity, Townsend index, and education; Model 2 was further adjusted for Townsend index, BMI, smoking status, alcohol consumption, physical exercise, diet, family history of diseases, and taking sleep-related drugs. The proportional hazards assumptions were tested using Schoenfeld residual method [32].
Furthermore, we collapsed each of the sleep behavior factors into two categories (High vs. Low risk). For instance, 'usually napping' was considered as high risk, whiles 'never/rarely' and 'sometimes napping' as low risk('Rarely'); 'Usually insomnia/sleeplessness' vs. 'Rarely insomnia'('sometimes' and 'Never/rarely' insomnia); 'Usually/Excessive daytime sleepiness' ('often' and 'always' daytime sleepiness) vs. 'Rarely daytime sleepiness' ('never/rarely' and 'sometimes' daytime sleepiness ); and, 'Difficult getting-up' ('not at all easy' and 'not very easy' getting up/waking out of bed) vs. 'Easy getting-up' ('fairly easy' and 'very easy' as low risk). These binary factors were analyzed in a multivariate-adjusted model to examine the hazard risk associated with health lifespan termination (Table 3). Using these binary categories, we performed stratification analysis according to age, gender, BMI, smoking status, physical activity, and healthy diet (Figure 1). In addition, we calculated the PAR % for the high risk sleep behaviors using the epi2by2 function of the "epiR" package in R language (Table 3). All statistical analyses were performed using R, version 4.1.0, and statistical significance was defined as P-values <0.01.