Data collection and setting
Data were collected on a routine basis from patients attending rehabilitation in the municipality of Aalborg from March 2018 to April 2019. Patients were referred by their general practitioner or the hospital to the rehabilitation programme after an acute event necessitating a hospital stay related to their CVD, COPD or diabetes. Sociodemographic characteristics included age, gender (female or male), cohabitation (binary), education (defined in three levels according to the International Standard Classification of Education (ISCED): low < 11 years of schooling, medium 11–16 years of schooling, high > 16 years of schooling) and socioeconomic status (employed, unemployed or other benefits, or retired). All attending patients were asked to complete a questionnaire developed by the healthcare centre (the Aalborg questionnaire, available on request) and the ICECAP-A questionnaire at baseline and 12 weeks follow-up after the completion of the rehabilitation programme. It was the patient’s choice as to whether they wished to complete the questionnaire on each occasion.
Municipal rehabilitation
In Denmark, the 98 municipalities offer rehabilitation programmes to chronically ill patients with, for example, CVD, COPD, and/or diabetes. The programmes are situated at the healthcare centre in Aalborg and at times in ‘satellite’ centres in varied locations across the municipality. The programmes provide exercise and education to groups of varying size. The exercise sessions take place one to two times a week and are of low to moderate intensity. The education component covers knowledge of the disease; dietary advice; the importance of physical activity, smoking cessation and medicine consumption; and goals and motivation. The programmes usually commence within a few weeks after discharge from the hospital and continue for 8–12 weeks [12].
Measuring rehabilitation outcomes
The municipality of Aalborg, Denmark, decided in 2018 to develop a self-completion questionnaire to evaluate their rehabilitation programme. The full questionnaire consists of 33 questions, including background information (gender, employment status, education level and cohabitation). Additional questions concerning training level and satisfaction with the program were asked at follow-up. The healthcare centre uses six of the questions to evaluate the rehabilitation programmes: (1) ‘general health’, (2) ‘improvement of quality of life’, (3) ‘feeling fit to do the things I want to’, (4) ‘better at handling everyday life after programme’, (5) ‘know how to sustain health in the future’ and (6) ‘able to be more physically active after programme’. Questions 1 and 3 were the only questions asked at both baseline and follow-up; the rest were only asked at follow-up. Questions 1–5 have four or five possible response categories (where higher scores indicate greater levels of general health, for example). Question 6 has a binary response option (yes or no).
Construct validity
Construct validity is the degree to which an instrument (such as a questionnaire) measures what it is hypothesised to be measuring. It can be assessed by considering the degree to which expected relationships between a measure and other factors are confirmed [13, 14]. Best-practice guidance on psychometric analyses highlights the importance of a priori statement of hypotheses on the anticipated relationship between the constructs explored [15]. Drawing on Sen’s theoretical framework for the establishment of capabilities, capability can be limited by reduced socioeconomic status and improved by good circumstances [3]. For the assessment of construct validity, a priori hypotheses were developed based on existing evidence about the ICECAP measures in other contexts [16, 17]. Table 1 indicates the expected direction between the five attributes of ICECAP-A, and indicators of socioeconomic status, general health and freedom in terms of ‘feeling fit to do the things I want to’ included in the Aalborg questionnaire.
Table 1
Hypothesised positive relationships between ICECAP-A attributes and the Aalborg questionnaire
ICECAP-A
|
Stability
|
Attachment
|
Autonomy
|
Achievement
|
Enjoyment
|
Total score
|
General health
|
+
|
|
+
|
+
|
+
|
+
|
+
|
+
|
+
|
+
|
+
|
+
|
Employment
|
+
|
+
|
+
|
+
|
+
|
+
|
Education level
|
+
|
|
+
|
+
|
|
+
|
Cohabitation
|
+
|
+
|
|
+
|
+
|
+
|
The interpretation of Table 1 is as follows. The stability attribute is initially expressed as being able to feel settled and secure, and relates to the absence of significant changes in life and stress. It is therefore hypothesised that significant negative life changes were likely to be associated with reduced capability (such as changes in general health). The validity study by Al-Janabi et al. found that, among other factors, employment, education and relationship status were associated with stability in a positive direction [17]. Therefore, this study expected an association between stability and employment, education and cohabitation in a positive direction, despite the different definitions of relationship status and education level. The attachment attribute is stated in terms of being able to have love, friendship and support, and relates to the ability to interact with others and have good relationships. Al-Janabi et al. found an positive association between attachment, employment and relationship status [17]. This study therefore anticipated finding an association between attachment, employment and cohabitation in a positive direction. The autonomy attribute is defined as being able to be independent and relates to looking after oneself and making one’s own decisions. Previously, positive associations between autonomy and employment and education have been found [17]. It was therefore anticipated that higher capability level for autonomy would be associated with higher level of employment and education in this study. The achievement attribute is defined as being able to achieve and progress, and reflects individuals’ abilities to move forward and achieve their goals. Previously, positive associations between achievement and employment, education and relationship status have been found [17]. It was therefore anticipated that capability for achievement would be associated with employment, education and cohabitation in a positive direction in this study. The enjoyment attribute is defined as being able to have enjoyment and pleasure in life. It reflects opportunities for the small pleasures in life, as well as things that are perceived to be enjoyable or exciting. As such, an association with employment and cohabitation was anticipated in a positive direction [17].
The ICECAP-A measure was developed to measure the effectiveness of health and social care interventions. The degree of variation in health and healthcare usage is reflected in individuals’ capabilities, and therefore is essential and of interest, because poor health and disabilities affect one's capabilities [4, 17]. Previous studies concerning ICECAP-A have found that impairments to physical health reduce the capability for stability, autonomy, achievement and enjoyment [17, 18]. Therefore, this study anticipated an association between general health and stability, autonomy, achievement and enjoyment. Here, it was anticipated that the question focusing on general health would be interpreted by participants as a question about physical health only, given the reasons that they were accessing the service, and thus would not be associated with attachment. ‘Feeling fit to do the things I want to’ was hypothesised to be associated with all five attributes of the ICECAP-A, and high levels of capability were anticipated to relate to a high level of this question of freedom. This hypothesis is based on the findings by Al-Janabi et al. where a similar question was asked, ‘I can do the things in life I want to do’, and an association was found with all attributes [17].
Statistical analysis
Based on these hypotheses (Table 1), associations between selected variables and the ICECAP-A attributes at baseline were analysed using chi-squared tests for categorical variables and Spearman rank correlation for ordinal variables. A correlation was considered strong if the coefficient was higher than 0.5, moderate if the coefficient was between 0.3 and 0.5, and weak if the coefficient was below 0.3 [19].
Responsiveness
The ability of outcome measures to detect meaningful change, is central to their usefulness in health and social care interventions. Two core ideas in the assessment of evaluative instruments are sensitivity to change and responsiveness. Sensitivity to change refers to the ability of instruments to measure change statistically. Responsiveness addresses the detection of the clinically relevant change [13, 20].
To assess responsiveness, some criterion is needed to ascertain where patients have changed over time. The two main methods for assessing responsiveness are the distribution- and anchor-based approaches. The distribution-based method uses the effect size of the difference between groups to measure variability, standard response means, standard error of measurement and responsive statistics. The anchor-based method is sample-independent and examines the relationship with an anchor, such as a QoL measure, to explain the meaning of a particular degree of change [21]. The anchors can either be cross-sectional or longitudinal. An anchor-based analysis aims to assess whether scores on the target measure change in an anticipated way, as indicated by changes in the scores on the anchor [22]. Distribution methods alone do not provide information about the clinical relevance of the observed change. Therefore, this study assessed responsiveness, using anchor-based methods to investigate the association between change over time in the ICECAP-A scores and change over time in the anchors. An exploratory analysis of the correlation between the change scores of longitudinal outcome measures was used to support the choice of anchors for this study.
Using Cohen's rule, correlations were considered strong when the coefficients were > 0.50, moderate when ≥ 0.30, and weak when < 0.30. Therefore, 0.30 was used as a correlation threshold to define an at least moderate association between an anchor and outcome measure change score [23]. General health and ‘feeling fit to do the things I want to’ were the only two questions for which there were longitudinal data, but they were only used if they reached a threshold of baseline correlation of 0.3 (at least moderate correlation). For appropriate anchors, patients were divided into three groups depending on the changes in scores in general health and ‘feeling fit to do the things I want to’: (1) those who had worsened between baseline and follow-up scores, (2) those who had improved between baseline and follow-up scores, and (3) those with no change in scores between baseline and follow-up.
When assessing the responsiveness of a weighted measures such as ICECAP-A [8], consideration needs to be given independently to both the descriptive system [4] and the value weighting of the descriptive system. It is essential that the descriptive system can detect a change in a construct for the weighted measure to reflect meaningful change. If the analysis only uses the weighted tariffs scores, a misleading conclusion could be made, that is, a conclusion whereby the measure is thought not to be responsive, when, in fact, the descriptive system of the measure shows change, but the value weightings suggest that these changes are not highly valued [24]. The weighted tariffs scores are also reflective of the UK population and not those of the Danish public. Therefore, for each anchor, two analyses are presented: (1) an analysis of the ‘un-weighted’ descriptive system of the ICECAP-A and (2) an analysis of the ‘weighted tariff scores’. For the un-weighted and weighted analysis, change was calculated in groups that improved and worsened. Un-weighted scores were calculated by summing ICECAP-A item response levels, with four indicating full capability on an item and one indicating no capability on an item. The weighted tariff scores were calculated using the UK general population tariff from Flynn et al. [25]. Findings were explored across different age groups (< 65 versus ≥ 65 years of age).
Responsiveness of the ICECAP-A scores was assessed using the Cohen’s effect size (ES) and standardised response mean (SRM). Additionally, a paired t-test was applied to test the null hypothesis, that no change in the response means between baseline and follow-up had occurred. These indices were calculated separately for patients who reported improved, worsened or no change in the anchors [13, 23]. The effect size was calculated by dividing the mean difference between baseline and follow-up scores by the standard deviation (SD) of baseline scores; SRM was calculated by dividing the mean score change (follow-up minus baseline) by the standard deviation of the change [22]. For all indices, a value of < 0.2 was considered small, 0.2–0.5 moderate and > 0.5 large responsiveness [23]. The range of the un-weighted score was 16 (5–20), and for the tariff score was 1 (0–1) with higher scores on both representing higher capability. Age differences in responsiveness were investigated by subgroup analysis using a group < 65 years of age and a group ≥ 65 years of age.
To assess the responsiveness of the individual ICECAP-A items, a response profile (frequency of participants answering each level for each item, at baseline and follow-up) was completed for the two anchors. Change in response profiles between baseline and follow-up was analysed for each item to indicate which items were the ‘drivers’ of change in the overall measure.
Statistical analysis
The investigation of construct validity was based on all baseline data. The responsiveness analysis was based on complete cases in terms of questionnaire data because of high rates of missing data (78%); hence, imputation was not considered. The type of missing was anticipated to be missing completely at random because in all cases the entire questionnaire was missing. The reason for the amount of missing is that there was voluntary completion of the questionnaire, both at baseline and follow-up. Therefore, complete case analysis was performed for the responsiveness analysis. All analyses were carried out in Stata version 15 with a significance level set at 1% and 5%.
The study was carried out in accordance with the General Data Protection Regulation (2015-509-00007). In accordance with the Danish National Committee on Health Research Ethics, this research satisfies the criteria of being ‘questionnaire and register-based research excluding human biological material’, and thus was not required to undergo a formal ethics procedure [26].