This study is dedicated to enhancing the accuracy of early diagnosis for D2T RA, for which two machine learning models have been developed. Model 1 is based on clinical and serological data, while Model 2 incorporates clinical, serological, and US data, with the US examination covering the knee, wrist, and hand joints, as well as tendons. Utilizing logistic regression, random forest, and SVM algorithms, we constructed and evaluated these models. The results indicate that the Random Forest model, which includes US data, outperforms the model using only clinical and serological data in terms of AUC, accuracy, specificity, and F1 score, facilitating early diagnosis and timely adjustment of treatment plans.
D2T RA manifests in clinical practice as a complex and heterogeneous condition, often emerging after the exhaustion of all available treatment options [7], highlighting the scarcity of biomarkers for predicting treatment responses and the trial-and-error approach inherent in therapeutic strategies [18]. Our study indicates that pulmonary and cardiovascular diseases are the two primary comorbidities affecting RA treatment. MTX, a pivotal drug in RA therapy, is contraindicated in these comorbidities, potentially leading to initial treatment failure and diminished efficacy of b/tsDMARDs. The presence of multiple comorbidities may be correlated with the progression to D2T RA. Additionally, D2T RA patients had an average of 1.4 ± 0.5 prior b/tsDMARDs, a number higher than that of D2T RA patients. Notably, the use of medications with the number of used modes of action had the greatest weight in the model, significantly increasing the risk of D2T RA. The existence of comorbidities profoundly alters RA's pharmacological treatment strategies, potentially compelling patients to investigate a broader array of b/tsDMARDs and drugs with distinct mechanisms of action in pursuit of optimal therapeutic outcomes [7, 19].
Our research not only elucidates medication use and comorbidities but also highlights the longer disease duration and delayed diagnosis in D2T RA. These findings align with Tan et al.'s study [18]. Additionally, our study identifies osteoporosis and bone erosion as independent risk factors for D2T-RA, associated with disease complexity, steroid use, and persistent inflammation. Aize [20] and Gauri's [21] studies corroborate osteoporosis as an independent predictor of D2T-RA through logistic regression analysis. Bone erosion, a key RA manifestation, is closely linked to disease severity and functional decline, potentially contributing to the development of D2T RA.
Females, high activity serological scores, and some clinical indexes are pivotal in RA. Our systematic analysis of disease-related variables underscores early risk factor identification to preempt treatment post-RA flares. Rigours disease control and targeted treatment are essential for achieving remission and maintaining a state of low disease activity in RA [18]. Nagy G et al. have highlighted [22] that in RA management, the detection of inflammation is critical for guiding both pharmacological and non-pharmacological interventions, thereby minimizing the unnecessary cycles of DMARD therapy. Despite the fact that DAS28, RF, and anti-CCP are helpful for diagnosis and prognosis, they present issues such as complex calculations or insufficient specificity [23].
Musculoskeletal US is a diagnostic tool with high reliability and sensitivity for RA diagnosis and prognostic assessment. [24–26]. Limited target joints are crucial in evaluating the disease activity of RA, and we have designed an US scoring protocol that covers large, medium, and small joints, which are commonly affected in RA, including the hand, wrist, and knee joints. Additionally, some studies suggest that tenosynovitis provides independent predictive data [10, 27], which recommends including tendons as candidate variables. Therefore, we use a multi-joint plus hand tendon semi-quantitative US evaluation scheme.
The current multi-joint US studies have only looked at persistent synovitis and the remission period of synovitis [28–31], but most people with D2T-RA have moderate-to-high disease activity. This is why combined scoring is needed in addition to single scoring to better separate moderate-to-high activity synovitis. Studies indicate that when PD>1, the involvement of the MCP1,3,4, and PIP3 joints is more significant in identifying moderate-to-high activity RA. Compared to GS, the addition of meaningful variables is greater, indicating that GS has lower sensitivity in predicting synovitis activity, while PD assessment is more reliable, consistent with studies. [11, 32]. This underscores the importance of combining GS and PD scores for a more comprehensive inflammatory assessment [33].
In this study, we observed that D2T-RA and non-D2T-RA share similar US characteristics, making it difficult to distinguish between the conventionally involved sites such as the MCP 2, 3, and PIP 2, 3. But MCP 1 may be able to tell you if someone is going to get refractory RA because early damage to it is strongly linked to a loss of hand grip strength [34]. Additionally, UC joint synovitis is the only feature in the wrist joint that helps to differentiate D2T-RA, possibly because the UC joint is less susceptible to involvement in the early stages and does not bear as much functional load as the RC and MC joints [35]. Also, the involvement of the MED and LAT in the knee joint is a strong predictor for D2T-RA. This means that patients may have more disease activity and disability, which shows how important it is to find the disease early [36].
Additionally, involvement of compartment II and the DF4 region may be independent indicators of D2T-RA. Previous studies have mainly focused on early RA and found that dorsal tenosynovitis of the wrist is more common in compartments IV and VI, while the palmar side mainly affects the flexor tendons of the second and third fingers [37]. However, D2T-RA presents a special pattern of involvement in hand joints and tendon sheaths, which may be related to the heavier inflammatory burden associated with high disease activity in patients. Although there is no evidence, this study suggests that in patients with high disease activity and poor treatment response during the active phase, monitoring of the MCP 1, UC joint, medial and lateral regions of the knee joint, as well as the compartments II and the DF4 region, may be considered as early indicators of D2T RA.
After modeling the single-factor variables, we observed the following: Model 1, using the Random Forest algorithm, achieved an AUC of 0.81 and an F1 score of 0.67, indicating the model has a certain diagnostic capability. Model 2 raised its AUC to 0.83 and its F1 score to 0.69, demonstrating that in terms of variable selection, Model 2 is more accurate than Model 1. Although the role of US features in variable selection is limited, they show potential value in identifying patients with a poor response to conventional treatments. Currently, there is a lack of direct model studies on the role of US in D2T RA patients; however, existing evidence suggests that US may play a significant role in distinguishing truly refractory RA[3]. Some studies argue [38] that it does not add extra value in defining the diagnosis of RA-D2T, possibly due to singular US grading, the subjectivity of US without consistency, and only examining synovitis of both hands, all of which have been addressed and corrected in this article. The overall performance of logistic regression and SVM models is relatively lower, which may be constrained by their ability to handle nonlinear data, and the outcomes did not meet expectations. The diagnosis of refractory RA is complicated, suggesting a need for improved models or new biomarkers.
This study compared the consistency between binary scoring and the SQ grading system in joint US assessments. The results indicated that the SH score for the dorsal extensor tendon sheath of the wrist was moderate, while the scores for other regions ranged from satisfactory to excellent. The PD images demonstrated the highest consistency in both binary and SQ grading, which is consistent with the findings of the Maria study [6]. Additionally, the MCP joints showed a high level of consistency between binary and SQ scoring in SH, PD, and SH + PD images (κ values ranged from 0.79 to 0.97). In contrast, the scoring consistency for tenosynovitis was lower. This might be attributed to the fact that the MCP joint, as a standardized joint, is more commonly trained among physicians [11, 17], while tenosynovitis is less common in RA, leading to relatively less experience in exploration among physicians.
The study has limitations: 1) Future research needs stratified analysis for diverse RA patients, considering lifestyle, economy, and demographics. 2) US assessments should cover more joints to improve model accuracy. 3) The model's accuracy might be limited by a small, single-center sample.