Feasibility and acceptability
Across the three UK medical schools, 264 students participated (18.2% of all eligible, Fig. 2). Recruitment was greater for students in cohort two (n = 183/833, 22%) than those in cohort one (n = 81/621, 13%). Uptake was slightly greater at school B (n = 136/610, 22%), followed by A (n = 112/696, 16%) and C (n = 16/148, 11%). Uptake was similar at schools A (n = 44//336, 13%) and B (n = 37/285, 13%) in cohort one. However, uptake was greater at school B (n = 99/325, 31%) than A (n = 68/360, 19%) and C (n = 16/148, 11%) in cohort two. Participant characteristics are shown in Table 1 and no significant differences between the intervention and control group were observed.
Table 1 Participant characteristics at baseline
|
Intervention group
|
Control group
|
P value
|
Age (Years): n (%)
|
|
20-22
|
4 (2.9)
|
1 (0.8)
|
|
23-24
|
73 (53.3)
|
79 (62.2)
|
|
25-26
|
39 (28.5)
|
29 (22.8)
|
|
27-28
|
11 (8.0)
|
10 (7.9)
|
|
>29
|
10 (7.3)
|
8 (6.3)
|
0.49
|
Total n
|
137
|
127
|
|
Gender: n (%)
|
|
Female
|
64 (46.7)
|
58 (45.7)
|
0.87
|
Male
|
73 (53.3)
|
69 (54.3)
|
Total n
|
137
|
127
|
|
Self-reported clinical reasoning skills at baseline: mean (SD)
|
83.1 (9.6)
|
83.5 (8.8)
|
0.75
|
Total n
|
122
|
118
|
|
Knowledge at baseline: mean (SD)
|
9.2 (1.8)
|
9.3 (1.6)
|
0.22
|
Total n
|
125
|
126
|
|
Notes: Baseline n varied for demographics, self-reported clinical reasoning skills and knowledge as some students didn’t complete all information. Results are presented as number and percentage of students or mean and standard deviation for each group. Comparisons were made using t-tests for means and Chi-squared tests for percentages. P Value less than 0.05 was considered significant.
There was no significant difference detected in retention between the intervention and control groups one week after baseline, 72% and 68% respectively (χ2 (1) = 0.65, p = 0.42), or after one month, 57% and 55% respectively (χ2 (1) = 0.34, p = 0.56, Fig. 2). There was no significant difference found in the proportion of students at each school who stayed in the study one week after baseline. However, there was significantly poorer retention at school A after one month (n = 47/112, 42%) than at school B (n = 83/136, 61%) and C (n = 10/16, 63%), χ2 (2) = 9.58, p = 0.008. Those in cohort one were significantly less likely to stay in the study one-week post baseline (n = 45/81, 56%) than those in cohort two (n = 140/183, 77%), χ2 (1) = 11.75, p = 0.001. This was also observed one-month post baseline (n = 29/81, 36% and n = 111/183, 61% respectively), χ2 (1) = 13.92, p = 0.000.
Most students (> 80%) agreed that eCREST helped them learn clinical reasoning skills and that they would use it again without incentives (Table 2). There were no significant differences detected between the schools. However, those in cohort two were significantly more likely than cohort one to agree that: eCREST helped to improve their clinical reasoning skills (87.7% vs 64.0%), χ2 (2) = 7.5, n = 98, p= .024); eCREST enhanced their overall learning (93.2% vs 64.0%), χ2 (2) = 13.7, n = 98, p = .001) and that they would use eCREST again without an incentive (97.3% vs 52.0%), χ2 (2) = 31.8, n = 98, p=.000).
Table 2 Intervention group medical student responses to the acceptability survey
|
Strongly agree/ Agree
|
Neither agree or disagree
|
Strongly disagree/ Disagree
|
Statement
|
n (%)
|
n (%)
|
n (%)
|
It was easy to navigate through eCREST
|
96/98 (98)
|
1/98 (1)
|
1/98 (1)
|
The level of difficulty of the material was appropriate
|
95/98 (97)
|
3/98 (3)
|
0/98 (0)
|
eCREST should be used to supplement traditional teaching
|
88/98 (90)
|
9/98 (9)
|
1/98 (1)
|
eCREST helped me to learn clinical reasoning skills to apply to clinical work
|
80/98 (82)
|
15/98 (15)
|
3/98 (3)
|
Overall, using eCREST enhanced my learning
|
84/98 (86)
|
13/98 (13)
|
1/98 (1)
|
I would use eCREST in the future without an incentive
|
84/98 (86)
|
10/98 (10)
|
4/98 (4)
|
Notes: results are taken from across all 3 schools. 98 students in the intervention group completed the acceptability survey at Time 1.
Clinical reasoning outcomes
Validity
The internal consistency of the self-reported clinical reasoning measure was adequate (Cronbach’s α = 0.66). Correlations between self-reported and observed clinical reasoning outcome measures, and knowledge and clinical outcomes are shown in Table 3. There was a mostly positive but non-significant correlation between the self-reported clinical reasoning measure and the observed clinical reasoning measure. The self-reported clinical reasoning measure had a weak but significant positive correlation with knowledge for aggregated data (rs=0.13, p = 0.037, n = 240). The observed clinical reasoning measure was positively but not significantly correlated with knowledge. The self-reported clinical reasoning measure at baseline and the observed clinical reasoning measure were positively but not significantly correlated with identification of the most serious diagnosis.
Table 3 Mean scores and correlations between measures of clinical reasoning, knowledge and diagnostic choice
Reasoning measure
|
Group[a]
|
N
|
Mean Scores (SD)
|
Correlation coefficients
|
Self-reported clinical reasoning at baseline[b]
|
Knowledge at baseline[c]
|
Diagnostic choice[d]
|
Self-reported clinical reasoning skills
|
|
|
|
rs[e]
|
p
|
rs
|
p
|
rs
|
p
|
Baseline
|
I
|
122
|
83.1 (9.6)
|
|
|
0.17
|
0.057
|
-0.01
|
0.927
|
|
C
|
118
|
83.5 (8.8)
|
|
|
0.10
|
0.270
|
0.09
|
0.463
|
|
All
|
240
|
83.3 (9.2)
|
|
|
0.14
|
0.037*
|
0.04
|
0.663
|
Time 1[f]
|
I
|
99
|
84.1 (10.3)
|
|
|
|
|
|
|
|
C
|
86
|
82.4 (9.0)
|
|
|
|
|
|
|
|
All
|
185
|
83.3 (9.8)
|
|
|
|
|
|
|
Time 2[g]
|
I
|
75
|
84.4 (9.8)
|
|
|
|
|
|
|
|
C
|
65
|
82.0 (9.4)
|
|
|
|
|
|
|
|
All
|
140
|
83.3 (9.7)
|
|
|
|
|
|
|
Observed clinical reasoning skills[h]
|
|
|
|
|
|
|
|
Essential information identified[i]
|
I
|
78
|
61.6% (17.6)
|
-0.01
|
0.957
|
0.09
|
0.423
|
0.09
|
0.435
|
|
C
|
70
|
53.3% (15.8)
|
0.03
|
0.798
|
0.22
|
0.074
|
0.13
|
0.303
|
|
All
|
148
|
57.7% (17.2)
|
0.01
|
0.865
|
0.15
|
0.78
|
0.12
|
0.144
|
Relevance of history taking[j]
|
I
|
78
|
81.4% (10.5)
|
0.06
|
0.578
|
0.06
|
0.623
|
0.12
|
0.304
|
|
C
|
70
|
84.6% (10.6)
|
0.02
|
0.893
|
0.11
|
0.356
|
0.08
|
0.518
|
|
All
|
148
|
82.9% (10.6)
|
0.03
|
0.704
|
0.07
|
0.415
|
0.10
|
0.251
|
Flexibility in diagnoses[k]
|
I
|
78
|
3.2 (1.0)
|
0.01
|
0.919
|
0.17
|
0.137
|
0.01
|
0.933
|
|
C
|
70
|
3.0 (1.0)
|
0.09
|
0.439
|
0.04
|
0.772
|
0.23
|
0.054
|
|
All
|
148
|
3.1 (1.0)
|
0.05
|
0.542
|
0.11
|
0.186
|
0.12
|
0.137
|
[a] Intervention (I) and Control group (C).
[b] Measured using the Flexibility in Thinking scale (Bordage et al. 1990).
[c] Measured by 12 multiple choice respiratory medicine questions.
[d] Measured by whether the most important diagnosis was selected for a patient case.
[e] rs denotes Spearman’s rank correlation coefficient.
[f] Time 1= one week after registration.
[g] Time 2= one month registration.
[h] Measured by performance on a patient case delivered by eCREST to all students.
[i] Percentage of essential information from gathered from patient case out of possible essential information available.
[j] Percentage of relevant information from gathered from patient case out of all information student gathered.
[k] Number of times changed diagnosis.
* indicates p ≤ 0.05.
Effect sizes
The intervention group had non-significantly higher self-reported clinical reasoning skills than the control group at Time 1 (84.1 vs 82.4, p = 0.26) and Time 2 (84.4 vs 82.0, p = 0.15). There was no significant effect of group allocation, (F (1) = 0.00, p = 0.97, n = 136) time, (F (2) = 0.01, p = 0.99, n = 136) or interaction between group allocation and time, F (2) = 0.48, p = 0.62, n = 136.
Table 4 shows logistic regression analyses comparing observed clinical reasoning skills between the intervention and control groups. The intervention group identified significantly more essential information than the control group (62% vs 53%). The control group sought more relevant information than the intervention group (85% vs 81%) but this difference was not significant. Students in both groups changed their diagnoses at least twice. The intervention group changed their diagnoses more often than controls, but the difference was not statistically significant.
Table 4 Logistic regression analyses comparing clinical reasoning skills between intervention and control groups
Observed clinical reasoning skills[a]
|
Trial group
|
n
|
Mean (SD)
|
Odds Ratio (95% CI)
|
P value
|
Essential information identified[b]
|
Intervention
|
78
|
61.6% (17.6)
|
|
|
|
Control
|
70
|
53.3% (15.8)
|
|
|
|
|
|
|
1.40 (1.12, 1.75)
|
0.003*
|
Relevance of history taking[c]
|
Intervention
|
78
|
81.4% (10.5)
|
|
|
|
Control
|
70
|
84.6% (10.6)
|
|
|
|
|
|
|
0.79 (0.62, 1.01)
|
0.064[d]
|
Flexibility in diagnoses[e]
|
Intervention
|
78
|
3.2 (1.0)
|
|
|
|
Control
|
70
|
3.0 (1.0)
|
|
|
|
|
|
|
|
|
|
|
|
2 (base)
|
|
|
|
|
|
3
|
1.48 (0.68, 3.24)
|
0.323
|
|
|
|
4
|
1.63 (0.68, 3.92)
|
0.270
|
|
|
|
5
|
2.46 (0.55, 11.00)
|
0.239
|
|
|
|
6
|
1.77 (0.07, 20.76)
|
0.887[f]
|
[a] Measured by students’ performance on a patient case delivered by eCREST.
[b] Percentage of essential information from gathered from patient case out of possible essential information available.
[c] Percentage of relevant information from gathered from patient case out of all information student gathered.
[e] Number of times changed diagnosis.
[f] x2 (4) =2.24, p =0.692.
* indicates p ≤ 0.05.