- Letter to the Editor
- Open access
- Published:
A prediction model for moderate to severe acute kidney injury in people with heart failure
Military Medical Research volume 11, Article number: 57 (2024)
Dear Editor,
Heart failure (HF) is a common multi-faceted and life-threatening syndrome, of which up to 23% occur acute kidney injury (AKI) [1]. HF-related AKI is largely overlooked or delayed in identification [2]. Approximately 85% of AKI cases that occurred during cardiac hospitalization in China were either ignored or identified too late [3]. Currently, there are no specific guidelines for the management of HF-related AKI. Hence, it is essential to identify patients at the risk of developing AKI and intervene promptly, to reduce social and economic burden.
Machine learning (ML)-based prediction models can predict patients at risk or likely to benefit from treatment, due to their superior capability in integrating data with multidimensional interactions. A variety of prediction models have been successfully used in HF patients, however, there remains a lack of widely accepted models for predicting HF-related AKI. Previous models have several critical limitations, including relatively low discrimination, miscellaneous patient selection, small sample sizes, and insufficient predictors [4, 5]. Hence, this study aimed to develop a prediction model of AKI in the HF population using ML algorithms.
This study developed and validated a prediction model of moderate to severe AKI and AKI requiring dialysis, based on readily available variables from HF patients (Additional file 1: Table S1), using 4 ML algorithms with optimal hyperparameters (Fig. 1a; Additional file 1: Table S2). A total of 120,479 participants from the China Renal Data System (CDRS) database were included as the development cohort and 4327 patients from the local hospital as the external validation cohort. Additional file 1: Fig. S1 outlines the patient selection process for the development and validation cohorts. Additional file 1: Table S3 summarizes the baseline characteristics of the participants. In the development cohort [age 68.0 (58.0, 78.0) years; 71,895 men], 5975 (5.0%) patients developed moderate to severe AKI, and 2956 (2.5%) required dialysis. In the external validation cohort [age 72.0 (63.0, 79.0); 2722 men], patients were older, had worse baseline renal function, more severe HF conditions, and were less likely to take HF-associated medications, had a higher incidence of moderate to severe AKI (7.1%) and a lower risk of AKI requiring dialysis (1.1%), compared to the development cohort. Additional file 1: Table S4 demonstrate the comparison of baseline characteristics among patients in the three cohorts. Potential continuous variables with high multicollinearity and categorized variables with near-zero variance were removed (Additional file 1: Fig. S2).
Development and validation of heart failure (HF)-related acute kidney injury (AKI) prediction model. a Study design. The area under the receiver operating characteristic curves (AUCs) for moderate to severe AKI (b, d) and AKI requiring dialysis (c, e) in the internal validation and external validation cohorts. f SHapley Additive exPlanation (SHAP) summary plot of the XGBoost model. The plot depicts the dot estimation on the model output of the XGBoost model. Each dot represents an individual patient from the dataset. Red represents the higher SHAP value of specific features; blue represents the lower SHAP value of specific features. The higher the SHAP values, the greater the risk of developing AKI development. LR logistic regression, RF random forest, SVM supported vector machine, XGBoost eXtreme gradient boosting, proBNP pro-brain natriuretic peptide, eGFR estimated glomerular filtration rate, LDH lactate dehydrogenase
The discrimination performance metrics of prediction models in the three cohorts are presented in Additional file 1: Table S5, Fig. S3, and Fig. 1b–e. Compared with the other three prediction models using logistic regression, supported vector machine and random forest, the eXtreme gradient boosting (XGBoost) model had the best predictive performance in discrimination with the highest area under the receiver operating characteristic curve (AUC) of 0.868 and 0.973; accuracy of 87.7% and 93.6%; sensitivity of 61.3% and 92.1%; specificity of 89.0% and 93.7%; positive predictive value (PPV) of 22.5% and 26.8%; negative predictive value (NPV) of 97.8% and 99.8%; for moderate to severe AKI (Fig. 1b) and AKI requiring dialysis (Fig. 1c) in the internal validation cohort; respectively. In the external validation cohort, the XGBoost model also attained the highest AUC of 0.956 and 0.958; accuracy of 95.7% and 99.1%; sensitivity of 43.8% and 30.4%; specificity of 99.6% and 99.8%; PPV of 90.0% and 60.9%; NPV of 95.9% and 99.3%; for moderate to severe AKI (Fig. 1d) and AKI requiring dialysis (Fig. 1e); respectively. These results demonstrated the trade-off of high NPV against low PPV, in order to minimize underreporting and heighten clinicians’ vigilance among high-risk patient groups, considering the life-threatening complications of HF-related AKI.
In addition, the XGBoost model was well-calibrated, as suggested by large calibration slopes (0.898, 1.044), small calibration intercept (0.026, −0.075), and small Brier’s score (0.039, 0.016) for moderate to severe AKI and AKI requiring dialysis in the internal validation cohort, respectively. Similar good calibrations with the calibration slopes of 1.740 and 1.039, the intercept of −0.096 and −0.138, and Brier’s scores of 0.038 and 0.012 were observed in the external validation cohort (Additional file 1: Table S6). It indicated that the model had preserved calibration.
To allow for the interpretation of our model’s predictions, we used SHapley Additive exPlanation values to assess feature importance and identified a feature’s relative contribution to uncover key features. Figure 1f shows the top 20 predictors in the XGBoost model, not only including renal health-related variables, but also several specific medication variables for the treatment of the primary causes of HF, HF symptoms, and HF-related complications, which might not be comparable to the general population.
In conclusion, this study developed, internally and externally validated a novel ML risk model for predicting HF-related AKI, based on a large dataset of Chinese admitted HF patients and uses of easily available variables. It is expected that the model could be integrated into the routine clinic workflow for risk stratification of the HF population and selecting individuals at high risk of AKI, facilitating early kidney-specific care, timely diagnosis, and treatment of HF-related AKI.
Availability of data and materials
The data that support the findings of this study are available from the authors upon reasonable request.
Abbreviations
- AKI:
-
Acute kidney injury
- AUC:
-
Area under the receiver operating characteristic curve
- CRDS:
-
China Renal Data System
- HF:
-
Heart failure
- ML:
-
Machine learning
- NPV:
-
Negative predictive value
- PPV:
-
Positive predictive value
- XGBoost:
-
EXtreme gradient boosting
References
Damman K, Valente MAE, Voors AA, O’Connor CM, van Veldhuisen DJ, Hillege HL. Renal impairment, worsening renal function, and outcome in patients with heart failure: an updated meta-analysis. Eur Heart J. 2014;35(7):455–69.
Ronco C, Bellomo R, Kellum JA. Acute kidney injury. Lancet. 2019;394(10212):1949–64.
Yang L, Xing G, Wang L, Wu Y, Li S, Xu G, et al. Acute kidney injury in China: a cross-sectional survey. Lancet. 2015;386(10002):1465–71.
Hong C, Sun Z, Hao Y, Dong Z, Gu Z, Huang Z. Identifying patients with heart failure who are susceptible to de novo acute kidney injury: machine learning approach. JMIR Med Inform. 2022;10(10):e37484.
Liu WT, Liu XQ, Jiang TT, Wang MY, Huang Y, Huang YL, et al. Using a machine learning model to predict the development of acute kidney injury in patients with heart failure. Front Cardiovasc Med. 2022;9:911987.
Acknowledgements
None.
Funding
This work was supported by the National Key Research and Development Program of China (2021YFC2500200, 2021YFC2500204), the Key Technologies Research and Development Program of Guangdong Province (2023B1111030004), and the Guizhou Science and Technology Department (QKHPTRC2018-5636-2, QKHCG2023-ZD010).
Author information
Authors and Affiliations
Contributions
YZ and FFH initiated the project and the collaboration. YQY, JJD, and SN extracted the study cohort, cleaned up the data, and performed all experiments. FFH and SN proposed and created the CRDS. Other authors provided clinical data for CRDS. YQY and JJD wrote the paper, with revision advice provided by YZ and FFH. All authors have read and approved the manuscript.
Corresponding authors
Ethics declarations
Ethics approval and consent to participate
This study was approved by the Medical Ethics Committees of Nanfang Hospital, Southern Medical University (NFEC-2019-213), and Guizhou Provincial People’s Hospital ([2019]29).
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Supplementary Information
Additional file 1:
Materials and methods. Table S1 List of 91 potential predictor variables used in the training models. Table S2 Final hyperparameters adopted in the four ML models. Table S3 Characteristics of the cohort participants in the prediction model of AKI outcomes. Table S4 Characteristics of derivation, internal validation, and external validation cohorts according to AKI status. Table S5 Discrimination performance of moderate to severe AKI, and AKI requiring dialysis risk prediction models for patients with HF in the derivation and validation cohorts. Table S6 Calibration performance of various prediction models in the internal and external validation cohorts. Fig. S1 Overview of study design. Fig. S2 Multicollinearity results for continuous variables in the derivation cohorts. Fig. S3 The AUCs for moderate to severe AKI and AKI requiring dialysis in the derivation cohorts.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Yang, YQ., Da, JJ., Nie, S. et al. A prediction model for moderate to severe acute kidney injury in people with heart failure. Military Med Res 11, 57 (2024). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s40779-024-00558-z
Received:
Accepted:
Published:
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s40779-024-00558-z