The definition of AKI depends on SCr and UO, which are imperfect markers of AKI19-21, The evaluation of urinary output (UO) is challenging without the use of a urinary catheter and requires monitoring of the patient's fluid volume and blood circulation, as well as considering the administration of diuretics. Additionally, it is a time-consuming process to assess UO on an hourly basis19-23. Therefore, extensive research has been dedicated to the exploration and advancement of novel biomarkers.
In this study, we demonstrated that advanced machine learning techniques such as GBM (Gradient Boosting Machine) and Random Forest modeling can enhance the amount of information extracted from analyzing a database. Moreover, these techniques allow us to develop and validate predictive models that outperform traditional logistic regression methods. In our study, we demonstrated that certain clinical factors are more likely to be associated with VR-AKI compared to VU-AKI. By utilizing advanced machine learning techniques, we were able to identify several important clinical factors that are linked to VR-AKI.
Our findings revealed several relevant parameters. Firstly, the evaluation of blood urea nitrogen (BUN) and creatinine levels provides a rigorous and comprehensive approach to assess the oliguric type of acute kidney injury (AKI). Elevated BUN levels indicate impaired kidney function, while increased creatinine levels reflect diminished glomerular filtration rate24,25. Monitoring BUN and creatinine allows for early detection and evaluation of oliguric AKI, facilitating prompt intervention. This combined assessment of BUN and creatinine yields clinically significant information, enabling accurate diagnosis and therapeutic decision-making in oliguric AKI patients, with potential implications for improving patient outcomes. Moreover, our study recognized the significance of blood glucose levels and platelet counts in assessing VR in AKI patients. Elevated blood glucose (glucose_max) that commonly occurs in diabetes and metabolic disorder patients demonstrated a potential influence on VR via various mechanisms, such as the development of hyperosmolar tubular necrosis6. Additionally, the minimum platelet count (platelets_min) was identified as a significant parameter, as it reflects the coagulation function and bleeding risk potential. Considering that bleeding can result in reduced fluid volume and subsequent impairment of renal perfusion, platelet count becomes an important factor in VR evaluation.
Secondly, our analysis of the random forest and GBM modeling indicated that prolonged hospital stay and extended ICU stay are crucial variables for evaluating AKI patients with an oliguric phenotype. Longer hospital stays may indicate a more complex condition, delayed recovery, and potential complications. Similarly, lengthier ICU stays suggest increased severity, organ dysfunction, and the need for intensive management. These factors reflect the disease burden, patient instability, and the challenge of achieving optimal control, making them valuable predictors in AKI evaluation26.
Thirdly, the average respiratory rate (resp_rate_mean) was identified as a potential indicator of VR due to its link with respiratory function and lung health, which can affect renal perfusion and filtration capabilities. For instance, if an individual exhibits a high respiratory rate and low SpO2, along with an elevated heart rate, it may suggest inadequate volume status or poor oxygenation. On the other hand, a combination of normal respiratory rate, SpO2, and heart rate may indicate more favorable volume responsiveness. It should be noted that these findings provide valuable insights into the potential associations between selected parameters and VR in AKI patients. However, a comprehensive clinical evaluation and diagnostic investigations are required to determine the individualized causes and management strategies for AKI patients27-29.
We compared the feature importance rankings obtained by random forest and gradient boosting algorithms. Both methods are based on tree-based models, which can capture the interaction effects among features. However, they differ in how they build and combine the trees. Gradient boosting is an additive model, which builds each tree on the residuals of the previous tree and assigns different weights to each tree30-32. This means that gradient boosting can adjust the contribution of each feature or feature combination to the target variable more flexibly, and thus emphasize the importance of individual factors or factor combinations. Random forest is an averaging model, which builds each tree independently and averages the predictions of all trees33-35. This means that random forest tends to balance the contribution of each feature or feature combination to the target variable, and thus emphasize the importance of different factors combined. These differences may have implications for interpreting the results and selecting the most relevant features for prediction.
limitation
One limitation of our study is the lack of data regarding the specific reasons for administering large-volume resuscitation. Since our study was not a pre-planned clinical trial, the indications for such fluid administration could not be predetermined. However, despite the absence of a prospective design, our study illuminates potential directions for future experimental designs through the analysis of large-scale data. Moreover, the parameters examined in this study, such as creatinine, blood urea nitrogen (BUN), blood glucose, bicarbonate, respiratory rate, SpO2, Sbp, and Heart rate, are commonly measured clinical indicators with strong operational feasibility. By highlighting their association with oliguric AKI, our findings contribute valuable insights to guide clinical practice and stimulate further research in this field.
The consistent findings of both models, highlighting the significance of creatinine, blood urea nitrogen (BUN), blood glucose, bicarbonate, and age, suggest that a combination of these indicators can provide a clearer understanding of VU occurrence. By considering these features together, it becomes possible to identify potential warning signs more effectively. The variations in how the models build and combine trees have implications for interpreting the results and selecting the most relevant features for prediction.