In this retrospective cohort study, we compared LR and ML techniques to select potential risk factors for AKI. Firstly, we concluded that different enrolled numbers of features impact the final selected feature number. The more inputs, the more outputs with all FS methods. Secondly, as for the performance, all selected features by various FS methods demonstrated excellent AUCs. Meanwhile, the embedded method demonstrated the highest accuracy compared with the LR method, while the filter method showed the lowest accuracy. Thirdly, regarding the importance ranking of features, our results confirmed some features that previous studies have reported and found some novel clinical parameters.
Tremendous research has shown that ML outperformed LR for the prediction of AKI [2, 4, 23, 24]. However, a recent meta-analysis reported that ML algorithms were comparable to regression models in developed models [25]. Particularly ML techniques have become prevalent in medical fields in recent years, where it can reduce dimensionality and help us understand the causes of disease.[26]The performance of different FS methods in predicting CSA-AKI remains unknown. Researchers used traditional logistic regression or Cox regression to explore the potential risk factors for AKI[1, 6, 16, 27–29]. Previous studies mainly used these features to build models and predict AKI[1, 16, 27]. Recently, some studies found other features might influence the progression of AKI [17–19, 28, 30, 31], yet there is a lack of sufficient multicenter evidence to support their findings. Hence, we included as many features as possible in Group I, and 25 more were not commonly recognized before compared with Group II. We used LR and ML approaches to select the relevant features in the two groups. We identified that the more you input, the more likely its output with all FS methods. In addition, the number of features selected by ML methods was less than LR methods, which might promote establishing an easier and more effective predictive model. Meanwhile, the number of selected features by the embedded method is smaller than the filter and wrapper method, which may be attributed to the filter method mainly used to screen data and reduce the size. In contrast, the wrapper and the embedded method are based on the filter method and are better at handling and processing data[30].
Our study showed the AUCs between FS and LR methods were significant statistically. Still, all performances demonstrated excellent, which implies that FS and LR methods can be performed comparably well in selecting predictors of CSA-AKI. Furthermore, no matter dealing with a lower or higher dimensional dataset, the embedded method performed better than other methods, which might be attributed to the feature subset search process of the embedded method incorporated into the classifier training process. Moreover, the LR method performed better in a lower dimension than in a higher dimension dataset. [30] This might be due to the LR method being more proficient in dealing with liner relationship problems. Notably, the AUCs of all FS methods in our study were beyond 0.9. Perhaps this is because we included serum creatinine (SCr) in our analysis, which is an important indicator of the definition of KDIGO-AKI. However, Koyner Carey et al. found that the algorithm for predicting severe AKI did not change significantly after excluding the SCr variable. Future studies should investigate the complex connection between baseline SCr and postoperative AKI. Additionally, we included many intraoperative variables that might improve the performance of our models[2].
In terms of the selection accuracy of the ML methods compared with the LR method, our analysis indicated that the embedded method achieved the highest accuracy despite the number of the features inputted. In contrast, the filter method achieved less accuracy and was influenced by the number of features inputted. It may be due to the wrapper, and embedded methods have a built-in algorithm, in which the feature selection process and algorithm training are performed simultaneously. Therefore, their results are more accurate and reliable[30]. Nevertheless, the number of selected features in the embedded method was related to the threshold value, yet the problem defining the threshold used to discriminate a truly informative feature from the other noninformative ones[31].
According to the importance matrix plot, the most influential factor was NT-proBNP in all FS methods. It is still unclear about the relationship between NT-proBNP and AKI. One possible mechanism is that NT-proBNP acts as a mediator between cardiovascular and renal dysfunction in that its production is stimulated by elevated venous pressure, which contributes to renal impairment[32]. Another potential hypothesis is that elevated NT-proBNP will cause systematic vasodilation and antagonize the renin-angiotensin-aldosterone system, which might directly prompt renal hypotension[33]. Eventually, it’s a vicious circle between cause and effect. In addition, the laboratory values such as PT, BUN, total and direct bilirubin, and features to evaluate left ventricular function were found to have a higher potential impact on the progression of CSA-AKI. Consistent with previous studies, our study also demonstrated decreased LVEF as a risk factor of AKI[34, 35]. one classical theory explains it might relate to renal hypotension, associated ischemia, and insufficient oxygen supply[36]. As for prolonged PT, an increasing number of evidence has demonstrated that AKI might correlate with an interaction between inflammation cascade activation and deranged coagulation pathways, leading to endothelial cell dysfunction, microvascular damage, and extensive microthrombi[37]. Urea nitrogen is a protein terminal metabolite and is excreted through the kidneys, though its level can be influenced by various factors, it can still be used as a biomarker to some extent. Further prospective studies are warranted to explore the relationship and underlying mechanism. In addition, preoperative hemodynamic variables such as heart rate and systolic blood pressure were ranked in the top 10 in our cohort, suggesting better managing these features before surgery might benefit the patient. Abundant evidence denotes that intraoperative variable, including surgery and the CPB procedure, are closely associated with postoperative AKI [3, 4]. In one single-center cohort, patients at low risk of AKI were reclassified as high risk after including intraoperative variables[38]. We also confirmed that CSA-AKI was associated with the aortic clamp time, CPB, and surgery time. In addition, we found that the preoperative use of β-blockers might help mitigate AKI. However, it is still controversial whether pharmacological interventions are beneficial for high-risk patients [39, 40]. Additionally, the top 10 features included were almost the same between two groups in different methods. However, the importance ranking of these features were different which may attribute to the interaction between features and thus, when more features are enrolled, their importance percentages will change. Furthermore, some features such as ALT, and the volume of intraoperative blood salvage, which were statistically important, were only detected by ML methods. Additionally, Lee, Hofer et al. found that a hybrid of FS and LR could perform comparably with deep neural network. This implies that FS and LR could be combined in future research, as FS could reduce the number of parameters, decrease the learning time, and avoid the problems of dimensionality; meanwhile LR could output explainable variables with low computational cost[41],which need more investigation when faced with booming data and high dimensional statistics..
Our study also has several limitations. First, this is a retrospective analysis with single-center data and a relatively small number of cases. The performance of machine learning algorithms might be different for a larger dataset with a different distribution of patient characteristics in different institutions. Second, the most important variables are not clinically modifiable, and whether our results could benefit high-risk patients is unknown. Nevertheless, further prospective trials are imperative to evaluate whether the adjustment of modifiable predictors could yield beneficial results[2]. But we have confirmed the importance of the intraoperative variables. Third, we did not determine the time of the latest measurement before surgery, which might impact our results. Fourth, we only took the drug usage into our analysis, the effects of drug changes before and after surgery are still unknown. Fifth, another potential limitation is we did not use urine volume as a criterion to diagnose AKI, and different diagnostic criteria might lead to different results. Sixth, we did not include some biomarkers, such as cystatin C, tissue inhibitor of metalloproteinases 2 (TIMP-2), and insulin-like growth factor-binding protein 7 (IGFBP7)[42], which have been reported to have high specificity and sensitivity, irrespective of potentially interfering conditions in our analysis. Further investigation could explore whether these biomarkers could help better detect postoperative AKI.