Benfari et al.,2019 study of mortality in patients with four levels of functional tricuspid regurgitation6
Figure 2 shows the poor fit of regular parametric models to the follow-up from Benfari et al. for trivial FTR, suggesting the need for an alternative approach.
In Fig. 3a, bathtub and Chen models have been fitted separately to all four FTR groups from both Benfari et al. For the Benfari et al. trivial group (best prognosis), it is clear that the Chen and bathtub models provide a superior visual fit. This is supported by the goodness of fit statistics which show a significant improvement over the regular parametric models (Table 2).
This is maintained across all four levels of FTR where the bathtub and Chen models provided almost identical visual fit (Fig. 3) and at least one of these models provided the best AIC/BIC score. Either bathtub or Chen models provided the best AIC and BIC scores (Table 2) and neither model strayed from the 95% CIs of the KM plots. In contrast standard parametric models provided poorer AIC and BIC scores, poorer visual fit and almost all models strayed outside the KM 95% CIs (Supplementary section 2). Chen models were very similar to bathtub over 10 years observation time but following further extrapolation Chen models predicted more favourable longer-term survival (Supplement: section 3).
Essayagh et al., 2021 study of mortality in patients with four levels of FTR under medical management 9
KM plot 95% CIs were wider than in the larger Benfari et al. study. For all four levels of FTR bathtub and Chen models provided similar and the best models according to visual fit (Fig. 3b (right)). Either bathtub or Chen models provided the best AIC and BIC scores (Table 2) and both models did not stray appreciably outside the KM 95% CIs. Standard models presented poorer visual fit. (Supplement: section 4)
Table 2
Information criteria (AIC BIC) scores for parametric models; Tricuspid Regurgitation, microfracture and atrial occlusion studies
Model
|
AIC
|
BIC
|
|
Model
|
AIC
|
BIC
|
|
Model
|
AIC
|
BIC
|
|
Model
|
AIC
|
BIC
|
Benfari et al., Trivial functional regurgitation
|
|
Essayagh et al., Trivial functional regurgitation
|
|
Tribouilloy et al., functional regurgitation
|
|
Bae et al., microfracture of the knee
|
bathtub
|
11391.6
|
11410.72
|
|
bathtub
|
2273.44
|
2290.66
|
|
ggamma
|
1182.412
|
1196.128
|
|
ggamma
|
227.83
|
236.53
|
Chen
|
11457.57
|
11470.31
|
|
Chen
|
2281.45
|
2292.93
|
|
bathtub
|
1184.632
|
1198.349
|
|
Gompertz
|
227.83
|
233.63
|
ggamma
|
11489.82
|
11508.94
|
|
Weibull
|
2281.51
|
2292.99
|
|
lognormal
|
1185.011
|
1194.156
|
|
Weibull
|
228.84
|
234.64
|
Weibull
|
11550.12
|
11562.86
|
|
ggamma
|
2281.76
|
2298.98
|
|
loglogistic
|
1194.666
|
1203.81
|
|
Chen
|
228.30
|
234.09
|
loglogistic
|
11643.51
|
11656.26
|
|
exponential
|
2282.99
|
2288.73
|
|
Weibull
|
1196.236
|
1205.381
|
|
Rayleigh 2
|
229.11
|
234.90
|
Gompertz
|
11733.51
|
11746.25
|
|
loglogistic
|
2283.69
|
2295.17
|
|
Gompertz
|
1198.722
|
1207.866
|
|
Rayleigh 1
|
229.52
|
232.41
|
lognormal
|
11745.43
|
11758.18
|
|
gompertz
|
2284.83
|
2296.31
|
|
exponential
|
1208.135
|
1212.707
|
|
loglogistic
|
232.44
|
238.24
|
exponential
|
11769.46
|
11775.83
|
|
Rayleigh 2
|
2284.85
|
2296.33
|
|
Chen
|
1208.419
|
1217.564
|
|
lognormal
|
237.58
|
243.38
|
Rayleigh 2
|
11741.83
|
11754.58
|
|
lognormal
|
2294.47
|
2305.96
|
|
Rayleigh 1
|
1501.44
|
1506.012
|
|
exponential
|
272.94
|
275.84
|
Rayleigh 1
|
14945.58
|
14951.95
|
|
Rayleigh 1
|
2580.30
|
2586.04
|
|
Rayleigh 2
|
NA
|
NA
|
|
bathtub
|
NA
|
NA
|
Benfari.et al Mild functional regurgitation
|
|
Essayagh. et al Mild functional regurgitation
|
|
TRISCEND II control arm
|
|
|
|
|
bathtub
|
11903.75
|
11922.76
|
|
Chen
|
2790.99
|
2802.04
|
|
lognormal
|
294.6
|
301.4
|
|
|
|
|
Chen
|
11981.74
|
11994.42
|
|
Ggamma
|
2859.81
|
2876.39
|
|
ggamma
|
294.7
|
304.9
|
|
|
|
|
lognormal
|
12203.39
|
12216.07
|
|
Weibull
|
2874.22
|
2885.27
|
|
bathtub
|
294.9
|
305.1
|
|
|
|
|
ggamma
|
12015.36
|
12034.37
|
|
loglogistic
|
2892.36
|
2903.41
|
|
loglogistic
|
296.5
|
303.3
|
|
|
|
|
Weibull
|
12062.17
|
12074.84
|
|
bathtub
|
2893.42
|
2910.00
|
|
Chen
|
296.7
|
303.5
|
|
|
|
|
loglogistic
|
12158.08
|
12170.76
|
|
exponential
|
2906.42
|
2911.95
|
|
Weibull
|
296.9
|
303.7
|
|
|
|
|
Gompertz
|
12382.84
|
12395.51
|
|
gompertz
|
2908.39
|
2919.44
|
|
Gompertz
|
297.0
|
303.8
|
|
|
|
|
exponential
|
12505.97
|
12512.31
|
|
Rayleigh 2
|
2908.39
|
2919.45
|
|
exponential
|
309.4
|
312.8
|
|
|
|
|
Rayleigh 2
|
12416.85
|
12429.52
|
|
lognormal
|
3016.19
|
3027.24
|
|
Rayleigh 1
|
385.8
|
389.2
|
|
|
|
|
Rayleigh 1
|
16861.08
|
16867.41
|
|
Rayleigh 1
|
3528.87
|
3534.40
|
|
Rayleigh 2
|
|
|
|
|
|
|
Benfari.et al. Moderate functional regurgitation
|
|
Essayagh et al., Moderate functional regurgitation
|
|
Whitlock et al., left atrial appendage No occlusion
|
|
|
|
|
Chen
|
7019.15
|
7030.59
|
|
bathtub
|
1587.68
|
1601.61
|
|
Chen
|
1847
|
1859
|
|
|
|
|
bathtub
|
7019.35
|
7036.51
|
|
ggamma
|
1592.95
|
1606.88
|
|
bathtub
|
1851
|
1869
|
|
|
|
|
ggamma
|
7046.38
|
7063.54
|
|
Weibull
|
1593.39
|
1602.68
|
|
lognormal
|
1901
|
1912
|
|
|
|
|
Weibull
|
7072.59
|
7084.03
|
|
Chen
|
1594.52
|
1603.80
|
|
Weibull
|
1905
|
1917
|
|
|
|
|
loglogistic
|
7165.33
|
7176.77
|
|
loglogistic
|
1600.24
|
1609.53
|
|
loglogistic
|
1905
|
1917
|
|
|
|
|
lognormal
|
7214.13
|
7225.57
|
|
Gompertz
|
1602.95
|
1612.24
|
|
ggamma
|
1907
|
1924
|
|
|
|
|
Gompertz
|
7255.78
|
7267.22
|
|
Rayleigh 2
|
1603.59
|
1612.87
|
|
Gompertz
|
2117
|
2129
|
|
|
|
|
Rayleigh 2
|
7272.49
|
7283.93
|
|
exponential
|
1604.87
|
1609.52
|
|
Rayleigh 2
|
2159
|
2171
|
|
|
|
|
exponential
|
7322.90
|
7328.62
|
|
lognormal
|
1610.09
|
1619.37
|
|
exponential
|
2199
|
2205
|
|
|
|
|
Rayleigh 1
|
10139.16
|
10144.88
|
|
Rayleigh 1
|
1990.27
|
1994.92
|
|
Rayleigh 1
|
3007
|
3013
|
|
|
|
|
Benfari.et al) Severe functional regurgitation
|
|
Essayagh.et al. Severe functional regurgitation
|
|
Whitlock et al., left atrial appendage occlusion
|
|
|
|
|
bathtub
|
2442.81
|
2456.65
|
|
bathtub
|
457.1133
|
466.2821
|
|
Chen
|
1353.035
|
1364.584
|
|
|
|
|
ggamma
|
2466.09
|
2479.93
|
|
loglogistic
|
457.9641
|
464.0766
|
|
ggamma
|
1378.3
|
1341.1
|
|
|
|
|
lognormal
|
2469.57
|
2478.80
|
|
lognormal
|
454.7669
|
460.8794
|
|
bathtub
|
1381.676
|
1398.999
|
|
|
|
|
Weibull
|
2470.93
|
2480.16
|
|
Ggamma
|
456.7364
|
465.9051
|
|
lognormal
|
1400.537
|
1412.086
|
|
|
|
|
loglogistic
|
2485.78
|
2495.01
|
|
Weibull
|
459.2854
|
465.3979
|
|
loglogistic
|
1406.278
|
1417.827
|
|
|
|
|
Chen
|
2487.90
|
2497.12
|
|
gompertz
|
459.9671
|
466.0796
|
|
Weibull
|
1406.585
|
1418.134
|
|
|
|
|
Gompertz
|
2515.70
|
2524.92
|
|
Rayleigh 2
|
461.6414
|
467.7539
|
|
Gompertz
|
1542.111
|
1553.659
|
|
|
|
|
exponential
|
2565.68
|
2570.30
|
|
exponential
|
466.8290
|
469.8853
|
|
exponential
|
1651.162
|
1656.937
|
|
|
|
|
Rayleigh 1
|
3710.03
|
3714.64
|
|
Chen
|
467.3864
|
473.4988
|
|
Rayleigh 1
|
2266.306
|
2272.08
|
|
|
|
|
Rayleigh 2
|
NA
|
NA
|
|
Rayleigh 1
|
648.6065
|
651.6627
|
|
Rayleigh 2
|
NA
|
NA
|
|
|
|
|
NA = Not applicable model did not converge
|
Tribouilloy et al 2024 of mortality in a French cohort with isolated functional tricuspid regurgitation10
Chen and bathtub models generated very similar good visual fit (Fig. 4b (middle)). Generalised gamma and bathtub provided the best AIC BIC scores, while Chen provided poor AIC BIC scores (Table 2). With extrapolation to 25 years the generalised gamma model predicted clinically implausible 50% survival and other standard parametric models similarly predicted substantial proportion of patients surviving beyond 25 years (Supplementary: section 3) generating overoptimistic survival curves for an elderly population with serious comorbidities. Only bathtub and Chen models generated plausible long-term extrapolations and both generated U shaped hazard plots (Supplement: section 5).
Bae et al., 2013; Korean Cohort with degenerative osteoarthritic knee treated by microfracture11
After ten years of follow-up, around 50% of microfracture-treated knees required TKA in this cohort. Over the observation period all parametric models except lognormal loglogistic and exponential generated reasonably good visual fit with similar AIC BIC scores; Chen, Gompertz and Rayleigh models generated well-fitting models (Fig. 4a (left), Supplement: section 7). On extrapolation of models to 25 years there were considerable differences between models with only Gompertz and Chen models predicting 100% failure within 20 years. The hazard for failure of other models barely changed over the observation period and beyond (Supplement: section 7) despite ageing of the population and inevitable wear and tear of the microfracture-treated knee. The more clinically plausible scenario would seem to be an increasing hazard that gets steeper with increasing age of patient, progressive wear and tear, and the exacerbation of osteoarthritis through time. This scenario is best satisfied by the Rayleigh, Gompertz and Chen models.
Occlusion versus no-occlusion of left atrial appendage (RCT; Whitlock et al.,2021)
A bathtub model for the no-occlusion arm (Fig. 4c (right)) generated superior AIC/BIC scores relative to standard parametric models and better visual fit; all standard models for the no-occlusion arm generated poor visual fit (Supplement: section 8). The Chen model AIC BIC scores were superior to those for 3-parameter bathtub despite its poor visual fit, and the hazard plot differed from the bathtub (3-parameter) hazard. The Chen model appears unsuited to substantial and rapid initial accumulation of events as found in both arms of this RCT. For the intervention arm (occlusion) the 3-parameter bathtub model again generates a good visual fit and Chen 2-parameter model a poor fit (Fig. 4c) while several standard models (e.g. Weibull) generated good visual fit for the intervention arm (Supplement: section 8).
Total Hip Replacement (THR) (Registry studies)
Revision after THR failure varies according to many characteristics of both THR device and patient age, gender, and other demographics of patient populations. The National Joint Registry Annual report 20 (2023) 13 itemises many KM plots for cumulative failure of THR. We analysed failure for THR recipients with osteoarthritis, for recipients of cemented metal on polythene and of ceramic on polythene devices, and for re-revision after first THR revision. In each of these four examples bathtub and Chen models provided the best visual fit and almost identical models (Fig. 5) accompanied by the lowest AIC/BIC scores (Table 3). Standard models provided poorer visual fit (Supplement: section 9) and poorer AIC/BIC scores.
Table 3
Goodness of fit statistics for models fitted to THR data
Model
|
AIC
|
BIC
|
|
Model
|
AIC
|
BIC
|
THR Osteoarthritis
|
|
THR cemented ceramic on polythene
|
Chen
|
300.2661
|
308.4213
|
|
Chen
|
296.4028
|
305.1967
|
bathtub
|
301.9992
|
314.232
|
|
bathtub
|
296.7181
|
309.9089
|
exponential
|
302.1448
|
306.2224
|
|
Gompertz
|
298.4029
|
307.1967
|
Gompertz
|
303.7814
|
311.9367
|
|
exponential
|
298.5097
|
302.9066
|
Rayleigh 2
|
303.8298
|
311.9851
|
|
Rayleigh 2
|
298.8838
|
307.6777
|
Weibull
|
303.8698
|
312.0251
|
|
Weibull
|
300.4538
|
309.2477
|
loglogistic
|
304.1212
|
312.2765
|
|
loglogistic
|
300.6615
|
309.4553
|
ggamma
|
305.6258
|
317.8587
|
|
ggamma
|
302.2484
|
315.4392
|
lognormal
|
307.6922
|
315.8475
|
|
lognormal
|
303.9896
|
312.7835
|
Rayleigh 1
|
328.2803
|
332.358
|
|
Rayleigh 1
|
314.5399
|
318.9368
|
THR re-revision
|
|
THR cemented metal on polythene
|
Chen
|
346.0576
|
352.6442
|
|
Chen
|
316.8134
|
325.2426
|
bathtub
|
348.9028
|
358.7827
|
|
bathtub
|
318.0951
|
330.7389
|
Weibull
|
349.3799
|
355.9665
|
|
exponential
|
319.2385
|
323.4531
|
loglogistic
|
349.5192
|
356.1058
|
|
Gompertz
|
319.798
|
328.2272
|
lognormal
|
349.6722
|
356.2588
|
|
Rayleigh 2
|
319.9957
|
328.4249
|
ggamma
|
351.3068
|
361.1867
|
|
Weibull
|
321.2365
|
329.6658
|
Gompertz
|
360.9285
|
367.5151
|
|
loglogistic
|
321.5145
|
329.9437
|
exponential
|
368.2898
|
371.5831
|
|
ggamma
|
322.9653
|
335.6091
|
Rayleigh 1
|
471.5511
|
474.8444
|
|
lognormal
|
325.9746
|
334.4038
|
Rayleigh 2
|
NA
|
NA
|
|
Rayleigh 1
|
339.8937
|
344.1083
|
NA = Not applicable model did not converge
|
Triscend II RCT: tricuspid valve replacement with EVOQUE system for patients with tricuspid regurgitation.8
The US FDA presented KM plots for mortality in the TRISCEND II trial (see Supplementary: section 6). For the medical treatment arm bathtub and Chen models generated almost identical good visual fit for the cumulative incidence of death. Weibull and generalised gamma models delivered marginally lower IC values than bathtub and Chen models but on extrapolation produced less plausible mortality curves for such an aged population with extensive comorbidities. The bathtub g parameter is almost zero so that bathtub and Rayleigh 2 models predict virtually identical curves for survival and for hazard. All models fitted to the intervention arm generated clearly implausible survival curves. Longer follow up with more patients is required for modelling.
Are bathtub and Chen model predictions supported by longer term follow up Registry findings?
We compared extrapolated bathtub and Chen model predictions for Registry cumulative failure rates with that reported for later registry analyses. For this we used the National Joint registry 9th annual report (NJR9) for England and Wales 12 with the 20th annual report13 (NJR20). The NJR9 and NJR20 KM plots for frequently used devices show increasing revision rates over time (Fig. 6). Performance of cemented MOP and cemented COP devices was reasonably consistent across these time spans, but in the case of uncemented MOP revision performance appears to have improved with time.
Using NJR9, 12) Chen and bathtub models of 125,285 cemented MOP recipients suggests predict between 6% and 7% cumulative revision rate at 18 years, consistent with 6.6% reported in NJR20, when about 2.94 times as many recipients (N = 368,641) were available for analysis (Fig. 6a top).
Chen and bathtub extrapolations for 59,983 uncemented MOP recipients predict 6% and 12% cumulative revision respectively. In this case, the NJR20 (N = 205,001) revision rate is 8.5%, suggesting an average of these two models outperforms a single extrapolation (Fig. 6b middle).
Chen and bathtub models of 13,871 cemented COP recipients from NJR9 predict about 5% and 6% cumulative revision respectively corresponding very closely to the 18-year follow up reported in NJR20 (Fig. 6c bottom).
Similar results were found in analyses conducted using the Swedish Arthroplasty Registry (Supplement: section 10).