Predicting the risk of chronic kidney disease using Machine Learning Algorithms

doi:10.21203/rs.3.rs-3862496/v1

This work is licensed under a CC BY 4.0 License

Version 1

Background and Objective: Chronic kidney disease (CKD) is among the most severe diseases in the modern world adversely affecting human life. Various risk factors, such as age, sex, diabetes, and hypertension, predispose to the occurrence of CKD. The aim of this study was to determine the predictors of CKD using machine learning algorithms.

Materials and Methods: The present study was conducted on data from the Ravansar Non-Communicable Disease (RaNCD) cohort. At the end of 5 years of follow-up, 10,065 participants were available; 81 (0.8%) were excluded during preprocessing, and 9984 (98.92%) were finally included in the study. Different machine learning algorithms were used to analyze the data, and the performance of each model was evaluated by calculating accuracy, sensitivity, specificity, and area under the curve (AUC). The final model was used to identify the most important predictors of CKD.

Results: The Generalized Linear Model (GLM) was selected as the final model, with the highest sensitivity (100%), an accuracy of 97%, and an AUC of 0.99. According to this model, the variables with the highest weights were SC = 1.0, Rt = 0.687, Wc = 0.591, age = 0.401, SGPT = 0.334, TG = 0.334, MCH = 0.327, MCV = 0.327, BFM = 0.306, and HDLC = 0.276. SC, AIP, gender, and SGPT were the most important positive predictors of CKD, whereas sodium, SGOT, and DBP were the most important negative predictors.

Conclusion: Based on our results, the GLM model delivered the most proficient performance in predicting CKD by correctly identifying all patients. In this model, serum creatinine level obtained the highest weight and, therefore, was the most important predictor of CKD.

Health sciences/Nephrology/Kidney diseases

Health sciences/Risk factors

Health sciences/Medical research

Health sciences/Medical research/Epidemiology

Chronic kidney disease (CKD)

Machine Learning

Generalized Linear Model (GLM)

Artificial intelligence

Kidneys are vital organs whose dysfunction will transform the blood into a toxic fluid with a volume of 9–10 liters carrying urea and creatinine within just 2–3 days. This condition is known as chronic kidney disease (CKD)¹, referring to the irreversible and progressive renal failure caused by prolonged (months or years) kidney injury. Reports show that around 324 million people worldwide suffer from CKD², rendering this condition a global public health priority ³. The factors associated with CKD are diverse, including advancing age, hypertension, diabetes, obesity, and primary renal abnormalities, accelerating kidneys’ loss of function ⁴.

It is particularly important to diagnose CKD early to reduce mortality. Late diagnosis of CKD often culminates in renal failure, requiring hemodialysis or kidney transplantation ¹. The glomerular filtration rate (GFR), which is widely used to diagnose CKD, provides a screening parameter for this condition and correlates with gender, age, and serum creatinine level ⁵,⁶. Although CKD affects people worldwide, its prevalence is higher in developing countries. In South Asian countries such as Pakistan, India, Bhutan, Bangladesh, and Nepal, the prevalence of CKD reaches as high as 1 in 10 people ⁷.

Machine learning (ML) technologies play a prominent role in the diagnosis of diseases in various fields of medicine ⁸,⁹. With the rapid growth of medical technologies, ML algorithms are increasingly used in health sciences and informatics to identify the main predictors and risk factors of various diseases ¹⁰. This technology can help achieve highly accurate and cost-effective diagnostic strategies. So far, ML algorithms have been developed to diagnose cardiac diseases ¹¹,¹², diabetes and related retinopathy ¹³,¹⁴, acute kidney injury ¹⁵, and cancer ¹⁶. For CKD, however, ML tools have mostly been used for risk stratification ¹⁷–¹⁹. The aim of the present study was to use the data obtained from a 5-year prospective cohort to identify the risk factors of CKD using ML techniques. We evaluated the performance of various models to find the most efficient one for predicting the risk of CKD.

The current cross-sectional correlational study was conducted on data obtained from the Ravansar Non-Communicable Disease (RaNCD) cohort ²⁰. The objective of the study was to determine the predictors of CKD using ML models. The RaNCD cohort started in 2014 with the enrollment of 10,182 participants, as part of a large prospective epidemiologic study in Iran (PERSIAN) with the participation of about 10,000 people from Iranian urban and rural populations. The cohort is expected to continue for 15 years, with the target parameters of the populations under follow-up measured at 5-year intervals.

A total of 10,065 participants were evaluated at the end of 5 years of follow-up. After preprocessing, 81 (0.8%) individuals were excluded, and the remaining 9984 (98.92%) were included in the study. A total of 42 variables, including 16 categorical and 24 numerical parameters, were measured in this study. The characteristics of the attributes included in the study are shown in Table 1.

Table 1. Attribute descriptions

Abb.	Description	Abb.	Description
Sc	Serum creatinine (mg/dL)	Edu	Education
Rt	Residence Type	S	Smoking (yes,no)
Wc	Waist circumference (cm)	Alcohol	Alcohol
MCV	Mean Corpuscular Volume (fL)	LDL	low-density lipoprotein (mmol/L)
TG	Triglyceride (mmol/L)	SGOT	Aspartate aminotransferase (AST)
Age	Age (Year)	RBC	Red blood cell count (count)
MCH	Mean corpuscular hemoglobin (pg)	BUN	Blood urea nitrogen (mmol/L)
HDLC	High-Density Lipoprotein Cholesterol (mg/dL)	HEI	Healthy Eating Index score
BFM	Body Fat Mass (Kg)	DBP	diastolic blood pressure (mmHg)
Ms	Marital Status	SBP	Systolic blood pressure (mmHg)
SGPT	Alanine aminotransferase (ALT)	PR	Pulse Rate (per minute)
T2DM	Type II Diabetes mellitus (yes, no)	HRF	Has Renal Failure (yes, no)
Sd	Sleep Duration	RBC	Red blood cell (count)
Sod	Sodium (mEq/L)	HCT	Hematocrit (%)
MET	Physical activity	MCV	Mean corpuscular volume (fL)
Eth.	Ethnicity	MCH	Mean corpuscular hemoglobin (pg)
CHOL	Total cholesterol (mmol/L)	FBS	Fasting blood sugar (mmol/L)
BMI	Body mass Index (kg/m²)	HDL	High-density lipoprotein (mmol/L)
AIP	Alkaline phosphatase (IU/L)	SESq	Socioeconomic status (quintile)
WBC	White Blood Count (count)	HTN	Hypertension (mmHg)

Data Preprocessing

In this step, variables with more than 50% missing data and variables with stability above 90% were excluded. After pairwise analysis, variables with more than 90% correlation were omitted. The data were also checked for outliers and duplicated values, and the final dataset was then subjected to analysis. After the initial processing, the data were divided into two groups: a training set (70%) and a test set (30%). The data in both groups were also balanced using the SMOTE oversampling technique to increase comparability and improve the learning algorithms.
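
The three screening rules described above can be sketched in plain Python. This is only an illustration under stated assumptions, not the RapidMiner workflow used in the study: "stability" is taken to mean the share of the most frequent non-missing value, the first column of a highly correlated pair is the one kept, and the function names and toy values are hypothetical. The train/test split and SMOTE balancing are not shown.

```python
from collections import Counter
from math import sqrt

def missing_fraction(values):
    """Share of entries that are missing (None)."""
    return sum(v is None for v in values) / len(values)

def stability(values):
    """Share of the most frequent value among non-missing entries (assumed definition)."""
    present = [v for v in values if v is not None]
    if not present:
        return 1.0
    return Counter(present).most_common(1)[0][1] / len(present)

def pearson(x, y):
    """Pearson correlation over rows where both values are present."""
    pairs = [(a, b) for a, b in zip(x, y) if a is not None and b is not None]
    n = len(pairs)
    if n < 2:
        return 0.0
    mx = sum(a for a, _ in pairs) / n
    my = sum(b for _, b in pairs) / n
    sxy = sum((a - mx) * (b - my) for a, b in pairs)
    sxx = sum((a - mx) ** 2 for a, _ in pairs)
    syy = sum((b - my) ** 2 for _, b in pairs)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / sqrt(sxx * syy)

def screen_columns(table, max_missing=0.5, max_stability=0.9, max_corr=0.9):
    """Return the column names that survive the three filters, in input order."""
    kept = [name for name, col in table.items()
            if missing_fraction(col) <= max_missing
            and stability(col) <= max_stability]
    selected = []
    for name in kept:
        # Drop a column if it is too correlated with one already selected
        # (keeping the first of the pair is an assumption of this sketch).
        if all(abs(pearson(table[name], table[other])) <= max_corr
               for other in selected):
            selected.append(name)
    return selected

# Toy data, made up for illustration (not RaNCD values).
toy = {
    "sc": [0.8, 0.9, 1.1, 1.4, 0.7, 1.0],
    "sc_copy": [1.6, 1.8, 2.2, 2.8, 1.4, 2.0],          # perfectly correlated with "sc"
    "mostly_missing": [None, None, None, None, 1.0, 2.0],  # 67% missing
    "constant": [1, 1, 1, 1, 1, 1],                      # stability of 100%
    "age": [40, 62, 45, 38, 55, 50],
}
print(screen_columns(toy))  # ['sc', 'age']
```

On the toy table, only `sc` and `age` pass all three thresholds; the other columns are dropped for missingness, stability, or correlation, respectively.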

Model Evaluation

In this study, logistic regression (LR), generalized linear model (GLM), deep learning (DL), decision tree (DT), random forest (RF), artificial neural network (ANN), naive Bayes (NB), gradient boosting trees (GBT), and support vector machine (SVM) models were used. After fitting the models on the training data, the models' accuracy, sensitivity, specificity, and area under the ROC curve (AUC) were determined and compared with each other. Accordingly, the GLM was chosen as the final model. The GLM is a supervised machine learning classification method with many applications in medicine, including estimating the probability of the occurrence or existence of a disease. It is also used to classify categorical dependent variables (for example, whether a patient suffers from CKD (1) or not (0)). In this study, the ensemble Bayesian Boosting (BB) algorithm was used to reinforce the performance of the proposed models (Fig 1).
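
To make the idea of a GLM for a binary outcome concrete, the sketch below fits a binomial GLM with a logit link (equivalent to logistic regression) by plain gradient descent. It is an illustrative assumption, not the RapidMiner GLM operator used in the study, whose solver and regularization settings are not reported here; the feature names, toy values, learning rate, and epoch count are all hypothetical.

```python
from math import exp

def sigmoid(z):
    """Numerically stable logistic function (inverse of the logit link)."""
    if z >= 0:
        return 1.0 / (1.0 + exp(-z))
    ez = exp(z)
    return ez / (1.0 + ez)

def fit_binomial_glm(X, y, lr=0.1, epochs=2000):
    """Fit weights and intercept by batch gradient descent on the log-loss."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            # Residual between the predicted probability and the 0/1 label.
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * gj / n for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b

def predict_proba(w, b, x):
    """Predicted probability of the positive class (CKD = 1)."""
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)

# Toy standardized features (hypothetical): [creatinine_z, age_z]
X = [[-1.2, -0.5], [-0.8, 0.1], [-0.3, -1.0], [0.2, 0.4], [0.9, 0.8], [1.5, 1.1]]
y = [0, 0, 0, 1, 1, 1]

w, b = fit_binomial_glm(X, y)
labels = [int(predict_proba(w, b, x) >= 0.5) for x in X]
print(labels)
```

The fitted coefficients play the role of the variable weights reported for the final model: a positive coefficient raises the predicted probability of CKD as the variable increases, and a negative one lowers it.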

Ethics approval and consent to participate

The study was approved by the ethics committee of Kermanshah University of Medical Sciences (KUMS.REC.1394.318). All methods were carried out in accordance with relevant guidelines and regulations. All participants provided oral and written informed consent. This study was conducted in accordance with the Declaration of Helsinki.

Data Analysis

The data were initially summarized using descriptive statistics (mean, standard deviation, median, maximum, and minimum). Categorical variables were described using frequency and percentage. Descriptive and exploratory analyses were conducted in STATA 11 software (StataCorp LLC, College Station, Texas, USA), and RapidMiner software ⁹,¹⁰ was used for data mining ²¹. RapidMiner Studio is a comprehensive Java-based machine learning software that offers various features and functionalities for data analysis, prediction, classification, and more. The software contains statistical and machine learning toolboxes consisting of several AI predictive algorithms, which are used for supervised and unsupervised learning purposes. In this research, the Windows 10 version of this software was employed on an Intel processor (Intel(R) Core(TM) i7-5500U CPU @ 2.40 GHz, 2 cores and 4 logical processors).

A total of 9984 (98.92%) participants were included in this study after 81 (0.8%) were excluded during preprocessing. Men and women comprised 5252 (52.6%) and 4732 (47.4%) of the participants, respectively. Overall, 1096 (11%) of all participants were diagnosed with CKD. The mean (SD) age of the participants was 47.3 ± 8.2 years, and the mean values of systolic and diastolic blood pressure were 108.2 ± 17.0 and 69.9 ± 9.9 mm Hg, respectively. Other serological characteristics of the participants are summarized in Table 2.

Table 2. Descriptive statistics of the serologic profile of subjects

Variables	Mean	Standard deviation	Minimum value	Maximum value
Sd	7.09	1.23	0.00	13.00
DBP	69.85	9.93	0.00	125.00
SBP	108.24	17.03	45.00	235.00
WBC	6.44	1.60	1.40	19.80
RBC	4.92	0.57	2.92	8.19
HCT	39.49	4.15	18.00	62.70
MCV	80.65	7.02	47.50	110.00
MCH	28.96	3.05	10.70	41.10
FBS	97.06	30.17	53.30	571.00
BUN	13.58	4.23	3.30	100.90
SC	0.99	0.23	0.50	9.60
TG	137.77	84.00	18.20	1720.90
CHOL	185.56	37.93	73.10	468.30
SGOT	21.40	9.09	5.00	289.10
SGPT	24.85	14.80	3.10	256.00
ALP	197.74	63.03	28.10	1837.10
GGT	24.67	19.92	1.00	438.00
LDL	111.66	31.39	22.08	370.56
Sodium	4875.86	2039.63	560.60	17106.74
BFM	25.14	9.58	2.60	78.50
BMI	27.52	4.64	12.50	52.80
PBF	33.91	9.50	5.50	56.60
HEI	51.63	7.34	22.00	90.00
AIP	0.98	0.65	-1.74	3.84

The comparison of the ML models in two scenarios (i.e., primary and BB-boosted algorithms) showed that the BB ensemble method increased the sensitivity of the DL and SVM models, as well as the specificity of the DT and NB models. This enhancement also improved the accuracy of the DT, GBT, NB, and SVM models and reduced the error rate of the DT, GBT, NB, and SVM models. In the final assessment, GLM achieved the highest sensitivity (100%), with a specificity of 96.6%, and was selected as the final model (Table 3).

Table 3. Performance metrics of the eighteen machine learning models (nine algorithms, each in primary and BB-boosted form)

Measure	Sensitivity (%)		Specificity (%)		Accuracy (%)		Classification error (%)		AUC	
Model	Primary	BB	Primary	BB	Primary	BB	Primary	BB	Primary	BB
DL	93.0	96.0	98.3	98.0	97.7	97.6	2.3	2.4	0.99	0.98
DT	97.6	95.7	98.4	99.4	98.3	99.0	1.7	1.0	0.99	1.00
GBT	99.4	98.5	99.4	99.6	99.4	99.5	0.6	0.5	1.0	1.00
GLM	100	100	96.6	96.6	97.0	97.0	3	3	0.99	0.99
LR	99.7	99.7	98.2	98.2	98.4	98.4	1.6	1.6	0.99	0.99
NB	82.4	82.7	68.0	86.2	69.6	85.8	30.4	14.6	0.82	0.93
RF	89.0	89.0	91.6	91.6	91.3	91.3	8.7	8.7	0.98	0.91
SVM	30.0	83.0	100	95.0	89.0	93.6	11	6.4	0.72	0.93
ANN	90.9	90.9	94.5	94.5	94.1	94.0	6	6.0	0.97	0.97
BB: Bayesian Boosting; DL: deep learning; DT: decision tree; GBT: gradient boosting trees; GLM: generalized linear model; LR: logistic regression; NB: naive Bayes; RF: random forest; SVM: support vector machine; ANN: artificial neural network; AUC: area under the ROC (receiver operating characteristic) curve.

Among the evaluated models, the GLM delivered the highest sensitivity (100%), with a specificity of 96.6%. The accuracy of this model was 97% in both the BB and primary processes. The model evaluation revealed that out of 2995 input cases in the test group, this model correctly categorized 329 patients and 2575 healthy individuals, yielding a sensitivity of 100% and a specificity of 96.59% (Table 4).

Table 4. Confusion matrix of the GLM algorithm

	True CKD-	True CKD+
Pred CKD-	2575	0
Pred CKD+	91	329
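
As a check on how the reported metrics follow from the confusion matrix, the short sketch below recomputes them from the four cell counts in Table 4. The function name is hypothetical; the counts are taken directly from the table.

```python
def confusion_metrics(tp, fn, fp, tn):
    """Standard binary-classification metrics from confusion-matrix counts."""
    total = tp + fn + fp + tn
    return {
        "n": total,
        "sensitivity": tp / (tp + fn),   # share of true CKD+ cases detected
        "specificity": tn / (tn + fp),   # share of true CKD- cases cleared
        "accuracy": (tp + tn) / total,
        "error": (fp + fn) / total,
    }

# Counts from Table 4: 329 true positives, 0 false negatives,
# 91 false positives, 2575 true negatives.
m = confusion_metrics(tp=329, fn=0, fp=91, tn=2575)
for name in ("sensitivity", "specificity", "accuracy", "error"):
    print(f"{name}: {100 * m[name]:.2f}%")
# sensitivity: 100.00%
# specificity: 96.59%
# accuracy: 96.96%
# error: 3.04%
```

The four cells sum to 2995 test cases, and the recomputed values round to the 100%, 96.6%, 97.0%, and 3% reported for the GLM in Table 3.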

Furthermore, ROC curve analysis was performed to assess the ability of three models to identify CKD patients. In this analysis, models with greater AUC values are expected to perform better in identifying patients with CKD. The greatest AUC value belonged to the GLM model (AUC=100), indicating the higher ability and efficiency of this model in recognizing CKD patients (Fig 2).
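
The AUC used in this comparison can be read as the probability that a randomly chosen CKD patient receives a higher predicted score than a randomly chosen non-CKD participant. A minimal sketch of that pairwise definition follows; the function name and the toy labels and scores are hypothetical, and the quadratic loop is meant for illustration rather than for a cohort-sized dataset.

```python
def roc_auc(labels, scores):
    """AUC as the share of (positive, negative) pairs ranked correctly; ties count half."""
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))

# Toy example (made up): 3 CKD+ and 4 CKD- cases with predicted probabilities.
labels = [1, 1, 1, 0, 0, 0, 0]
scores = [0.95, 0.80, 0.40, 0.60, 0.30, 0.20, 0.10]
auc = roc_auc(labels, scores)
print(round(auc, 4))  # 0.9167 (11 of 12 pairs ranked correctly)
```

An AUC of 1.0 therefore means that every patient was scored above every healthy individual, while 0.5 corresponds to random ranking.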

According to the GLM model, the weights of the influential variables in predicting CKD were as follows: SC = 1.0, Rt = 0.687, Wc = 0.591, age = 0.401, SGPT = 0.334, TG = 0.334, MCV = 0.327, MCH = 0.327, BFM = 0.306, and HDLC = 0.276.

The most important variables with the highest weights were serum creatinine levels, place of residence, waist circumference, and age (Fig 3).

The final analysis based on the GLM model to identify the most important predictors of CKD showed that SC, AIP, gender, and SGPT were the most important positive predictors, while sodium, SGOT, and DBP were the most prominent negative predictors of CKD (Figure 4).

The present study relied on data retrieved from a prospective cohort study and aimed to explore the most important predictors of CKD using ML models. In this study, the following models were analyzed: LR, GLM, DL, DT, RF, ANN, NB, GBT, and SVM. In recent years, various studies have used different ML models to predict the risk of CKD ²²,²³. Yadav et al. (2021) explored a dataset containing 26 CKD-related parameters and combined the ANN classifier with four feature-based algorithms (Extra Tree, Pearson correlation, Lasso model, and chi-square) to identify CKD predictors ²⁴. Emon et al. (2021) used Weka software to analyze the performance of 8 machine learning classifiers in predicting CKD: Logistic Regression (LR), Naive Bayes (NB), Multilayer Perceptron (MLP), Stochastic Gradient Descent (SGD), Adaptive Boosting (AdaBoost), Bagging, Decision Tree (DT), and Random Forest (RF) ²⁵.

In this study, the ensemble BB algorithm was used to enhance the performance of the proposed models. The comparison of the results of the primary and BB-enhanced models showed that BB enhancement increased the performance of the models in terms of sensitivity, specificity, accuracy, and error rate. Srivastava et al. (2022) proposed an algorithm to predict CKD using diagnostic medical data available in the UCI repository, combining an array of physiological parameters and ML techniques. In that study, the researchers employed the Ranking Weighted Ensemble algorithm to boost the performance of the proposed models and reported that this algorithm could be used to develop an electronic diagnostic system for determining the severity of CKD with an accuracy, sensitivity, specificity, and F1 score of 98.75%, 100%, 96.55%, and 99.03%, respectively ²⁶. Moreover, Wang et al. (2020) first estimated serum creatinine levels using a regression model with eight predictors. They then combined the predicted creatinine level with 23 main characteristics to predict the risk of CKD in patients. They further boosted their findings using an ensemble technique including three models (RF, XGBoost (a boosting tree), and ResNet (a neural network-based model)), among which the XGBoost model performed best, with an AUC value of only 0.76 ²⁷.

According to our findings, the final model developed in this study could reliably distinguish CKD patients from healthy individuals, with 100% sensitivity and 96.6% specificity. In another study, Qin et al. (2019) employed an ML approach to diagnose CKD and, after handling missing data, used six ML algorithms (logistic regression, random forest, support vector machine, k-nearest neighbor, naive Bayes classifier, and feed-forward neural network). In that study, the random forest model obtained the best performance, with a diagnostic accuracy of 99.75% ²⁸. Dritsas et al. (2022) compared models including SVM, LR, SGD, ANN, and k-NN for predicting the risk of CKD and designated the Rotation Forest (RotF) model, with an AUC of 100% and an accuracy and F-measure of 99.2%, as the best model ²². Priyanka et al. (2019) also used the naive Bayes, KNN, SVM, decision tree, and ANN algorithms to predict the risk of CKD, among which the best performance belonged to the naive Bayes model, with an accuracy of 94.6% ²⁹.

In the present study, the final model with the best performance was the GLM, according to which serum creatinine level, place of residence, waist circumference, and age attained the greatest weights in the diagnosis of CKD. Among these variables, the greatest weight belonged to serum creatinine level. By comparison, Chiu et al. (2021) identified BUN and UA as the first and second most important predictors in the risk stratification of CKD ³⁰. Shih et al. (2020) observed that the C4.5 model performed better than other models in predicting CKD, suggesting the urine protein-to-creatinine ratio (UPCR), proteinuria, age, RBC, GLU, triglyceride level, total cholesterol, and gender as the most important predictors of CKD, while variables such as HDL, LDL, and ALB seemed to be less important according to this model ³¹.

The results of the model developed in the present study suggested that SC, AIP, and gender were the strongest predictors of CKD in our participants. In the study by Chiu et al. (2021), SBP, SGPT, SGOT, and LDL-C were identified as the most important risk factors associated with the incidence of CKD ³⁰. Also, Jarad et al. ³² reported that reduced albumin levels were strongly associated with impaired renal function, in line with the report of Lang et al., who noted that urinary levels of albumin and creatinine were strongly associated with impaired renal function ³³.

According to the final model proposed in the present study, the most important negative predictors of CKD were serum sodium level, SGOT, and DBP. Samsuria et al. (2019) investigated the relationship between renal dysfunction and the serum levels of sodium and potassium and observed a significant relationship between potassium and urea levels ³⁴.

Some noteworthy strengths of the present study include the small amount of missing data (0.8%) in some attributes, the absence of outliers in the data (indicating the high quality of the dataset), and the use of the state-of-the-art Bayesian Boosting technique to improve the performance of the learning algorithms. A limitation of this study was the lack of information in the cohort data on some other variables, such as urine specific gravity, albumin, bacteria, urine protein, and lower extremity edema.

Strengths and limitations of study

The results of this big data study come from the PERSIAN cohort study, in which there is a minimum amount of censoring and high accuracy in recording the variables. Of the 10,065 eligible cohort members, 9984 (98.92%) remained in the analysis. Procedures for data access, information on collaborations, publications, and other details can be found at http://persiancohort.com. Similar to all cohort studies, this study is limited by selection bias. Individuals who are willing to participate in long-term research may be more concerned about their health than others and may adopt lifestyles that they believe address these concerns.

In this study, we used the ensemble BB algorithm to strengthen the ML models, which successfully increased their performance. The final model (i.e., GLM) delivered 100% sensitivity and 96.6% specificity for identifying CKD patients, indicating the high efficiency of the selected model. The highest weights in the final model belonged to serum creatinine level, place of residence, waist circumference, and age. The most important predictors of CKD in patients were identified to be SC, AIP, and gender.

Author Contribution

Y.V. and H.S. designed and prepared the manuscript. M.Kh. participated in cleaning and data entry. M.M. participated in the study design. H.S. conducted the statistical analysis. All authors contributed to writing the first draft of the article.

Swain, D., et al. A Robust Chronic Kidney Disease Classifier Using Machine Learning. Electronics 12, 212 (2023).
Bhaskar, N., Suchetha, M. & Philip, N.Y. Time series classification-based correlational neural network with bidirectional LSTM for automated detection of kidney disease. IEEE Sensors Journal 21, 4811-4818 (2020).
Yan, M.-T., Chao, C.-T. & Lin, S.-H. Chronic kidney disease: Strategies to retard progression. International journal of molecular sciences 22, 10084 (2021).
Gansevoort, R.T., et al. Chronic kidney disease and cardiovascular risk: epidemiology, mechanisms, and prevention. The Lancet 382, 339-352 (2013).
Sobrinho, A., et al. Computer-aided diagnosis of chronic kidney disease in developing countries: A comparative analysis of machine learning techniques. IEEE Access 8, 25407-25419 (2020).
Ma, Y.-C., et al. Comparison of 99mTc-DTPA renal dynamic imaging with modified MDRD equation for glomerular filtration rate estimation in Chinese patients in different stages of chronic kidney disease. Nephrology Dialysis Transplantation 22, 417-423 (2007).
Ali, S.I., et al. Ensemble feature ranking for cost-based non-overlapping groups: A case study of chronic kidney disease diagnosis in developing countries. IEEE Access 8, 215623-215648 (2020).
Ebiaredoh-Mienye, S.A., Esenogho, E. & Swart, T.G. Integrating enhanced sparse autoencoder-based artificial neural network technique and softmax regression for medical diagnosis. Electronics 9, 1963 (2020).
Jasińska, V.B. Prediction of Chronic Kidney Disease-A Machine Learning perspective.
Iftikhar, H., et al. A Comparative Analysis of Machine Learning Models: A Case Study in Predicting Chronic Kidney Disease. Sustainability 15, 2754 (2023).
Alickovic, E. & Subasi, A. Medical decision support system for diagnosis of heart arrhythmia using DWT and random forests classifier. Journal of medical systems 40, 108 (2016).
Masetic, Z. & Subasi, A. Congestive heart failure detection using random forest classifier. Computer methods and programs in biomedicine 130, 54-64 (2016).
Zou, Q., et al. Predicting diabetes mellitus with machine learning techniques. Frontiers in genetics 9, 515 (2018).
Gao, Z., et al. Diagnosis of diabetic retinopathy using deep neural networks. IEEE Access 7, 3360-3370 (2018).
Park, N., et al. Predicting acute kidney injury in cancer patients using heterogeneous and irregular data. PloS one 13, e0199839 (2018).
Patrício, M., et al. Using Resistin, glucose, age and BMI to predict the presence of breast cancer. BMC cancer 18, 1-8 (2018).
Jeong, B., et al. Comparison between statistical models and machine learning methods on classification for highly imbalanced multiclass kidney data. Diagnostics 10, 415 (2020).
Xiao, J., et al. Comparison and development of machine learning tools in the prediction of chronic kidney disease progression. Journal of translational medicine 17, 1-13 (2019).
Segal, Z., et al. Machine learning algorithm for early detection of end-stage renal disease. BMC nephrology 21, 1-10 (2020).
Pasdar, Y., et al. Cohort profile: Ravansar Non-Communicable Disease cohort study: the first cohort study in a Kurdish population. International journal of epidemiology 48, 682-683f (2019).
Mierswa, I. & Klinkenberg, R. RapidMiner Studio (9.2) [Data science, machine learning, predictive analytics]. Retrieved from rapidminer.com (2018).
Dritsas, E. & Trigka, M. Machine learning techniques for chronic kidney disease risk prediction. Big Data and Cognitive Computing 6, 98 (2022).
Debal, D.A. & Sitote, T.M. Chronic kidney disease prediction using machine learning techniques. Journal of Big Data 9, 1-19 (2022).
Yadav, D.C. & Pal, S. Performance based Evaluation of Algorithms on Chronic Kidney Disease using Hybrid Ensemble Model in Machine Learning. Biomedical and Pharmacology Journal 14, 1633-1645 (2021).
Emon, M.U., Islam, R., Keya, M.S. & Zannat, R. Performance analysis of chronic kidney disease through machine learning approaches. in 2021 6th International Conference on Inventive Computation Technologies (ICICT) 713-719 (IEEE, 2021).
Srivastava, S., Yadav, R.K., Narayan, V. & Mall, P.K. An Ensemble Learning Approach For Chronic Kidney Disease Classification. Journal of Pharmaceutical Negative Results, 2401-2409 (2022).
Wang, W., Chakraborty, G. & Chakraborty, B. Predicting the risk of chronic kidney disease (ckd) using machine learning algorithm. Applied Sciences 11, 202 (2020).
Qin, J., et al. A machine learning methodology for diagnosing chronic kidney disease. IEEE Access 8, 20991-21002 (2019).
Priyanka K, S.B. Chronic kidney disease prediction based on naive Bayes technique. 1653–1659. (2019).
Chiu, Y.-L., Jhou, M.-J., Lee, T.-S., Lu, C.-J. & Chen, M.-S. Health data-driven machine learning algorithms applied to risk indicators assessment for chronic kidney disease. Risk Management and Healthcare Policy, 4401-4412 (2021).
Shih, C.-C., Lu, C.-J., Chen, G.-D. & Chang, C.-C. Risk prediction for early chronic kidney disease: results from an adult health examination program of 19,270 individuals. International Journal of Environmental Research and Public Health 17, 4973 (2020).
Jarad, G., Knutsen, R.H., Mecham, R.P. & Miner, J.H. Albumin contributes to kidney disease progression in Alport syndrome. American Journal of Physiology-Renal Physiology 311, F120-F130 (2016).
Lang, J., et al. Association of serum albumin levels with kidney function decline and incident chronic kidney disease in elders. Nephrology Dialysis Transplantation 33, 986-992 (2018).
Samsuria, I.K. The Relationship between sodium, potassium, and hypothyroidism in Chronic Kidney Disease (CKD) patients. (2019).

No competing interests reported.
