Major findings
In this study, we simultaneously constructed two ANFIS models, an imbalanced model and a balanced model, to predict the warfarin maintenance dose, based on a retrospective multicenter database of 15,108 patients after HVR from 35 centers. The major findings were as follows: (I) the imbalanced ANFIS model, trained on 12,086 cases, accurately predicted the warfarin maintenance dose for Chinese patients undergoing HVR, with an ideal prediction percentage of 74.39%–78.16%, an MAE of 0.37 mg/day, and an MSE of 0.39 mg/day; (II) the balanced ANFIS model, which used equal random stratified sampling and was trained on 2,820 cases, also predicted the warfarin maintenance dose accurately (ideal prediction percentage: 73.46%–75.31%; MAE: 0.42 mg/day; MSE: 0.43 mg/day); (III) compared with the imbalanced model, the balanced model had significantly higher prediction accuracy in the low-dose warfarin group (internal validation: 14.46% vs. 3.01%; P < 0.001) and the high-dose warfarin group (34.71% vs. 23.14%; P = 0.047); (IV) the external validation results were in line with the internal validation results, strengthening the conclusion that the ANFIS model improves prediction performance.
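For clarity, the evaluation metrics quoted above can be sketched as follows. The 20% relative-error tolerance used here to define an "ideal" prediction is an assumption for illustration only, not a threshold stated in this study:

```python
import numpy as np

def dose_metrics(actual, predicted, tol=0.20):
    """Evaluation metrics for a dose-prediction model.

    `tol` is the relative tolerance defining an "ideal" prediction;
    the 20% value is an illustrative assumption, not the study's
    stated cut-off.
    """
    actual = np.asarray(actual, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    err = np.abs(predicted - actual)
    ideal_pct = np.mean(err <= tol * actual) * 100  # % of cases within tolerance
    mae = np.mean(err)                              # mean absolute error (mg/day)
    mse = np.mean(err ** 2)                         # mean squared error
    return ideal_pct, mae, mse
```

With daily doses as inputs, the MAE is directly comparable to the 0.37–0.42 mg/day figures reported above.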
Summary of models
Table S4 summarizes the current warfarin prediction models. In 2004, Gage et al. created the first warfarin dosage prediction model, based on 369 patients [19]. Using an MLR model, this study explored eight variables (age, sex, body surface area [BSA], race, amiodarone use, simvastatin use, INR, and CYP2C9) and explained 39% of the variance in the warfarin maintenance dose; of note, the CYP2C9*2 and CYP2C9*3 alleles carried a dominant weight in this model. Since then, six further studies have sought higher predictive accuracy in Caucasian populations [3, 20-22, 2, 23]. Although these studies achieved considerable predictive ability (R2: 47%–73%) by incorporating pharmacogenomic information (e.g., CYP2C9, VKORC1, GGCX), they had two main limitations: small sample sizes (< 350 patients), which limited how well they represented the population, and a lack of external validation, which limited the extrapolation of the models to large patient populations in real-world practice. In 2008, Gage et al. developed another pharmacogenetic algorithm based on 1,015 patients and nine predictors (age, BSA, smoking, race, amiodarone use, current thrombosis, CYP2C9, VKORC1, and target INR) [24]; this model explained 53%–54% of the variability in the warfarin dose in the derivation and validation cohorts. Furthermore, a nonprofit website (www.WarfarinDosing.org) was developed to facilitate the use of this pharmacogenetic and clinical equation. The following year, the International Warfarin Pharmacogenetics Consortium (IWPC) created a novel pharmacogenetic algorithm based on 4,043 patients from 21 research groups in nine countries and eight factors (age, weight, height, race, amiodarone status, enzyme inducers, CYP2C9, and VKORC1) [25].
This model explained 43%–47% of the variability in the derivation and validation populations and provided accurate dose estimates, as evidenced by a low MAE (8.3 mg/week). In addition, differences in model performance across the low-dose (≤ 21 mg/week), medium-dose (21–49 mg/week), and high-dose (≥ 49 mg/week) groups were evaluated. Although the Gage and IWPC models addressed the above limitations, it may not be appropriate to extrapolate these results directly to a Chinese population, given the variation in warfarin sensitivity across ethnic groups (weight, dietary habits, drug interactions, genotype, adherence, etc.). These inherent issues have fueled the development of warfarin prediction models for the Chinese population. However, Chinese medical insurance currently covers genetic testing for warfarin dosage prediction only for patients at high risk of bleeding or with labile INR values, which is a barrier to its utilization. Given this limitation, the existing models for Chinese populations have combined clinical and pharmacogenomic variables in small samples, which limits their generalizability [26-32]. Therefore, developing an optimal warfarin dose prediction model based on explicable clinical variables is a challenging task.
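As a concrete illustration of the IWPC dose strata mentioned above, a minimal classifier might look like the following; since the quoted ranges overlap at their edges, the handling of the boundary values 21 and 49 here is an assumption:

```python
def dose_group(weekly_dose_mg: float) -> str:
    """Classify a weekly warfarin dose (mg/week) into the IWPC
    low (<= 21), medium (21-49), and high (>= 49) groups.
    Boundary assignment (21 -> low, 49 -> high) is an assumption,
    as the published ranges overlap at the cut-offs."""
    if weekly_dose_mg <= 21:
        return "low"
    if weekly_dose_mg < 49:
        return "medium"
    return "high"
```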
The MLR method presents certain irreconcilable issues, such as its poor handling of non-linear relationships between variables; thus, MLR is unlikely to be an optimal method for predicting the warfarin dose [33]. Recently, several artificial intelligence modeling technologies, including support vector machines and the general regression neural network, have been used for warfarin dosage prediction [34, 35]; however, these models showed a relatively low predictive ability, with ideal prediction percentages below 50%. Our study team has made numerous attempts at warfarin model development and achieved 63% predictive accuracy with BPGA and ANFIS models [10, 36, 12]. In this study, we further included 15,108 patients who underwent HVR at 35 centers and preprocessed the training set with the equal random stratified sampling method to balance it. Compared with the IWPC model, both the imbalanced and the balanced ANFIS models performed better in terms of ideal prediction percentage (73.46%–74.39% for ANFIS vs. 45.5% for IWPC) and MAE (2.59–2.95 mg/week for ANFIS vs. 8.5 mg/week for IWPC) in the external validation cohorts. Hence, the ANFIS method based on big data is a feasible and optimal modeling technology for improving the prediction of the warfarin maintenance dose.
Reasons for improved prediction property in low- and high-dose groups
Patients receiving low or high warfarin doses are more vulnerable to thromboembolic and bleeding events because their INR is more difficult to control. To date, no study has been specifically designed to address this concern. Our previous studies found extremely low prediction accuracy in the low-dose group (0.0% with BPNN [11] and 9.1% with ANFIS [12]) and the high-dose group (0.0% with BPGA [10]). Considering the distribution of patients across dose groups in the training set, the proportion in the medium-dose group was far higher than in the low- and high-dose groups (low-dose: 10.41%; medium-dose: 81.81%; high-dose: 7.78%). This explains why our previous models performed well in the medium-dose group but poorly in the low- and high-dose groups: the model learns too little from the smaller-scale categories, resulting in an unsatisfactory prediction effect for them. This is known as the class-imbalanced learning (CIL) problem [37]. To address it, we used the equal random stratified sampling method, which balances the number of patients in each group through random sampling [38]. The model trained on the balanced training set showed increased prediction accuracy compared with the imbalanced model (low-dose: 14.46%–24.34% vs. 3.01%–3.62%; high-dose: 29.58%–34.71% vs. 21.12%–23.14%).
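The balancing step can be sketched as follows. This is a minimal illustration of equal random stratified sampling under the assumption that each dose stratum is downsampled, without replacement, to the size of the smallest stratum; it is not the study's exact implementation:

```python
import numpy as np

def equal_random_stratified_sample(strata_masks, seed=None):
    """Return indices of a balanced sample: an equal number of cases
    drawn at random, without replacement, from each stratum.

    strata_masks: dict mapping stratum name -> boolean array marking
    the cases in that stratum. A sketch of the balancing idea, not
    the study's exact procedure.
    """
    rng = np.random.default_rng(seed)
    # Downsample every stratum to the size of the smallest one
    n = min(int(mask.sum()) for mask in strata_masks.values())
    return np.concatenate([
        rng.choice(np.flatnonzero(mask), size=n, replace=False)
        for mask in strata_masks.values()
    ])
```

For example, with daily doses binned into low, medium, and high strata, the returned index set contains the same number of patients from each group, so the training set no longer over-represents the medium-dose group.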
Clinical relevance
When genetic information is lacking in the clinical setting, this ANFIS method can provide highly accurate warfarin dose estimates on the basis of clinical variables alone (age, disease, weight, tricuspid valve disease, albumin level, creatinine level, usage of the first dose, and dosage of the first dose). This could aid physicians and pharmacists in identifying patients likely to be suited to low or high warfarin doses, allowing earlier and more aggressive intervention to control the INR.
Strengths and limitations
The main strengths of this study were as follows: first, it used a large sample of 15,108 Chinese patients from 35 centers who received warfarin after HVR to develop and validate the models; second, we applied the equal random stratified sampling method to address the CIL problem that caused the low predictive ability in the low- and high-dose groups; and third, we validated the models in both internal and external validation cohorts. However, this study also had some limitations. First, it was a retrospective study and may carry a certain selection bias. Second, some possible determinants of the warfarin dose, such as dietary information and patient genotypes (CYP2C9 and VKORC1), were not available in our study, which may limit model performance. Third, clinical adverse events related to warfarin use were not examined. Given these limitations, further prospective studies using machine learning techniques and incorporating more potential predictors are needed to further improve the models' performance.