Sample and design
Table 2
Sample and design of identified studies
No.
|
Author and year
|
Sample size
|
Gender
|
Grade
|
Department
|
Methodology
|
Type of study
|
Recruitment strategy
|
1
|
Arora et al. 2011
|
18
|
Unspecified
|
FY1/FY2 18
|
Surgery 18
|
Quantitative (randomised-controlled design)
|
Pilot
|
Random
|
2
|
Berridge et al. 2007
|
50
|
Males 33, females 17
|
FY1 50
|
Medicine 30, surgery 18, unspecified 2
|
Mixed-methods
|
Longitudinal
|
Convenient
|
3
|
Bu et al. 2019
|
20
|
Males 10, females 10
|
FY1 10, FY2 10
|
Unspecified 20
|
Mixed-methods
|
Pilot
|
Convenient
|
4
|
Eisen et al. 2014
|
44
|
Unspecified
|
ST1 44
|
Paediatrics 44
|
Quantitative
|
Cross-sectional
|
Random
|
5
|
Pal et al. 2017
|
6
|
Unspecified
|
Unspecified 6
|
Paediatrics 6
|
Quantitative
|
Pilot
|
Unspecified
|
6
|
Webb et al. 2015
|
42
|
Unspecified
|
CMT1 21, CMT2 21
|
Medicine 42
|
Quantitative
|
Pilot
|
Convenient
|
7
|
Wells et al. 2019
|
150
|
Unspecified
|
FY1 150
|
Unspecified 150
|
Quantitative
|
Longitudinal
|
Convenient
|
Table 2 presents details about the sample and design of the included studies. Sample size was reported by all studies (N = 7) and ranged widely from six to 150. Most studies (N = 5) did not explicitly justify their sample size. Two studies (N = 2) provided a flowchart justifying the number of included participants,(36) with one stating that participant numbers had to be capped due to limited capacity to provide the intervention.(39) Only two studies (N = 2) reported the gender of participants,(39) with more male than female participants reported in one.(34)
Grades of participants ranged from FY1 to CT2; three studies (N = 3) did not explicitly state the grade of their participants in their paper,(35, 37, 40) but authors from two studies confirmed that they were newly-qualified doctors within the first five years of graduation.(35, 40) Most of these studies (N = 4) were performed exclusively on Foundation doctors; two of these studies were conducted on participants during FY1 induction, meaning these participants had not started working as FY doctors yet.(34, 38) Most studies (N = 5) reported the specialty where the doctors were working: these include paediatrics (N = 2), medicine (N =1),(35–37) mixture of medicine and surgery (N = 1).(34)
Although all included studies presented quantitative data, two (N = 2) used mixed methods, hence reporting both quantitative and qualitative data.(34, 39) Two quantitative studies (N = 2) also gathered qualitative data using open-ended questions within the questionnaire, but did not report the analysis of this qualitative data.(35, 36)
Four studies (N = 4) were pilot studies, while two (N = 2) were longitudinal studies and one (N = 1) was a cross-sectional study. Three of the four pilot studies were small-scale studies, with the potential effectiveness, feasibility and acceptability reported. Additionally, we did not find any evidence of the four pilot studies being implemented as full studies at the time this systematic review was conducted.
All studies reported their recruitment strategy. However, we found the recruitment strategy of one study (N = 1) to be unclear because the recruitment strategy was not stated.(37) Four studies (N = 4) used convenient sampling. The remaining two studies (N = 2) stated they used random sampling, but did not randomly select potential participants.(35, 40) Arora et al. (2011) was the only study to use a prospective randomised-controlled design where participants were randomised to intervention or control.(40)
As studies were volunteer-based, it is likely that the recruited participants were highly self-aware, willing to engage with these wellbeing interventions, and already had good motivation. Hence, the authors are unable to account for selection and non-response bias. The widespread convenience sampling is also unlikely to be representative of the larger junior doctor cohort.
Interventions performed
Table 3
Wellbeing interventions performed by included studies
No.
|
Author and year
|
Duration of data collection
|
Stages for data collection
|
Screening tool
|
Wellbeing intervention
|
Non-wellbeing intervention
|
Duration of intervention
|
Wellbeing outcomes measured
|
Non-wellbeing outcomes measured
|
1
|
Arora et al. 2011
|
5 days
|
2 stages: pre, post
|
Imperial Stress Assessment Tool (HR, salivary cortisol, State Trait Anxiety Inventory (6 item Likert scale))
|
Mental practice
|
None specified
|
2.5 hours (0.5h x 5 sessions)
|
Stress, anxiety
|
Mental imagery
|
2
|
Berridge et al. 2007
|
1 month
|
3 stages: pre, intermediate, post
|
Unspecified (17 item questionnaire used)
|
None specified
|
Preparation for Practice Course
|
2 weeks
|
Anxiety
|
Confidence, preparedness
|
3
|
Bu et al. 2019
|
3 months
|
2 stages: pre, post
|
Unspecified (1 item questionnaire used)
|
Mindfulness course delivered by Breathworks
|
None specified
|
12 hours (2h x 6 weeks)
|
Stress, overall wellbeing
|
None specified
|
4
|
Eisen et al. 2014
|
1 year
|
2 stages: pre, post
|
Unspecified (questionnaire used, details not given)
|
Mentorship programme with senior trainees
|
None specified
|
1 year (unstructured mentoring)
|
Stress management, work-life balance
|
Demand for course, perceived value of programme, self-confidence, transferable skill acquisition
|
5
|
Pal et al. 2017
|
Unspecified
|
2 stages: pre, post
|
GHQ12
|
Mindfulness course; email support between sessions
|
None specified
|
5 hours (2.5h x 2 sessions)
|
None specified
|
None specified
|
6
|
Webb et al. 2015
|
1 year
|
2 stages: pre, post
|
Unspecified (questionnaire used, unknown number of items)
|
Mentorship programme with second-year trainees
|
None specified
|
1-2.5h (0.5h x 2-5 sessions)
|
Confidence, work-life balance
|
Transferable skill acquisition, time management
|
7
|
Wells et al. 2019
|
8 months
|
3 stages: pre, 2 months post, 6 months post
|
Unspecified (7 item questionnaire used, details given)
|
None specified
|
Undergoing assistantship prior to starting FY2
|
1 month
|
Anxiety
|
Confidence, preparedness, perceived value of programme
|
Table 3 outlines the wellbeing interventions performed by the included studies. The interventions described varied significantly from one another and were characterised as either mindfulness courses (N = 3), clinical preparation courses (N = 2) or mentorship programmes (N = 2). The clinical preparation courses were performed on newly-qualified doctors as an induction programme before clinical work commenced; these two papers also collected data in three stages.(34, 38) Both studies using mentorship programmes collected data over an entire year, representing the longest follow-up times amongst the included studies in this systematic review.(35, 36) Given that the follow-up time in these interventions was relatively short, the duration for which the outcome lasts may be overestimated due to impact bias. Finally, the two mindfulness courses involved multiple sessions of mindfulness training.(37, 39) One study specified this intervention to be conducted with a partner organisation Breathworks,(39) while the other did not specify any details about their mindfulness intervention.(37)
The total duration of interventions in this systematic review lasted between one hour and twelve hours. Three studies (N = 3) did not clearly state the total number of hours over which the interventions were run.(34, 35, 38) Instead, only the total duration for data collection was indicated.
Duration of data collection typically corresponded to duration of the intervention. Data collection duration lasted between five days and one year; one study (N = 1) did not specify duration of data collection.(37) Most studies (N = 5) collected data at two stages: before and after the intervention, while two studies (N = 2) collected data at three intervals: before the intervention, after the intervention, and in the interim (34) or six months after the intervention commenced.(38) Just one study (N = 1) reported details of non-respondents and participant flow through the study.(36)
Only two studies (N = 2) used validated tools to screen for or measure wellbeing outcomes: the General Health Questionnaire 12 (GHQ12) (37) and the Imperial Stress Assessment Tool (which includes objective measures of stress) were used respectively.(40) The remaining studies (N = 5) produced their own questionnaires; it is unclear whether or not the questionnaires were validated. All questionnaires involved rating on a Likert-scale, although questionnaire details were not provided in one study.(35)
Results from interventions
Table 4
Results and statistical analysis of included studies
No.
|
Author and year
|
Results
|
Significance of results
|
1
|
Arora et al. 2011
|
Decreased subjective anxiety (mean STAI 8.40 in intervention vs 11.31 in control). Decreased objective stress (mean HR 77 vs 88 bpm, max HR 102 vs 119 bpm and cortisol 2.25 vs 3.85 nmol/L).
|
Mann-Whitney U test performed. Decreased anxiety and stress statistically significant (at least p < 0.05).
|
2
|
Berridge et al. 2007
|
Decreased anxiety, including physical symptoms of anxiety (mean from 3.04 to 4.00 out of 5.00 on an inverted scale). Improved confidence and preparedness.
|
Mann-Whitney U test performed. Improved confidence was statistically significant. Decreased anxiety was not statistically significant.
|
3
|
Bu et al. 2019
|
Decreased stress (mean from 6.5 to 5.0 out of 10.0). Doctors reported being more mindful and having improved overall wellbeing.
|
Mann-Whitney U test performed. Decreased stress statistically significant (p = 0.04).
|
4
|
Eisen et al. 2014
|
Improved stress management and work-life balance (78% of participants).
|
Not reported
|
5
|
Pal et al. 2017
|
No difference in GHQ12 over the course. All participants enjoyed the course.
|
Unspecified statistical test. Results not statistically significant (p = 043).
|
6
|
Webb et al. 2015
|
Improvement in stress management and work-life balance (not reported).
|
Not reported
|
7
|
Wells et al. 2019
|
Improved anxiety relief during first placement (mean from 3.9 to 4.1 out of 5.0). However, this improvement had disappeared by the last stage of data collection (mean 3.3).
|
Repeated measures analysis of variance (ANOVA) performed. Anxiety relief statistically significant at first placement (at least p < 0.03), but not second placement.
|
Table 4 summaries the results from the included studies, and states the statistical analytical tools used by each. Majority of interventions measured anxiety (N = 3) or stress (N = 3) as their intended outcome, while one study (N = 1) did not specify their intended outcome.(37) Only one study (N = 1) explicitly measured both anxiety and stress as primary intended outcomes.(40) The remaining studies (N = 6) reported stress or anxiety as secondary outcomes either alongside the primary aims of the study, or incidentally while collecting data.
Four studies (N = 4) performed statistical analysis. Three studies performed Mann-Whitney U test,(34, 39, 40) while one performed repeated measures analysis of variance,(38) and three were unspecified or unreported.(35–37) Pal et al. (2017) did not find statistically significant results for GHQ12, which screens for non-psychotic psychiatric conditions including anxiety; Berridge et al. (2007) found that anxiety relief was not statistically significant; and Wells et al. (2019) found that anxiety relief was no longer statistically significant by the third stage of data collection.
All studies but one (N = 6) reported that wellbeing interventions improved stress management and anxiety. Mindfulness courses (N = 2), clinical preparation courses (N = 2) and mentorship programmes (N = 2) identified an improvement in stress or anxiety. Pal et al. was the only study (N = 1) which failed to identify an improvement in their measured outcome.(37) All studies except one explicitly reported the quantitative change produced by their intervention.(36) The studies which performed qualitative analysis (N = 2) reported findings which were convergent with quantitative results.(34, 39)
Clinical preparation courses such as assistantship programmes following graduation, and mindfulness courses such as mental practice were reported to improve anxiety in participants.(34, 38, 40) All studies noted improvements to subjective self-reported anxiety scores. However, Wells et al. noted that the anxiety relief provided by their programme was only valid at the first rotation and was subsequently not valid when data was collected in the next rotation (38); the other two studies did not measure outcomes beyond one month. Additionally, Pal et al. failed to state the wellbeing outcome being measured, but noted no statistically significant difference in GHQ12 score (used to screen for non-psychotic morbidity) following a mindfulness intervention.
Three studies indicated that mindfulness courses, mental practice and mentorship quantitatively improve stress levels.(35, 39, 40) One mentorship programme failed to provide any quantitative data but wrote that their intervention improved stress management and work-life balance.(36) The majority of included papers (N = 4) were pilot studies, of which three (N = 3) demonstrated a positive result.