Predicting 30-days mortality for MIMIC-III patients with sepsis-3: a machine learning approach using XGboost
- PDF / 2,250,422 Bytes
- 14 Pages / 595.276 x 790.866 pts Page_size
- 74 Downloads / 174 Views
Journal of Translational Medicine Open Access
RESEARCH
Predicting 30‑days mortality for MIMIC‑III patients with sepsis‑3: a machine learning approach using XGboost Nianzong Hou1†, Mingzhe Li2, Lu He3, Bing Xie1, Lin Wang4, Rumin Zhang4, Yong Yu4, Xiaodong Sun5, Zhengsheng Pan6 and Kai Wang4*†
Abstract Background: Sepsis is a significant cause of mortality in-hospital, especially in ICU patients. Early prediction of sepsis is essential, as prompt and appropriate treatment can improve survival outcomes. Machine learning methods are flexible prediction algorithms with potential advantages over conventional regression and scoring system. The aims of this study were to develop a machine learning approach using XGboost to predict the 30-days mortality for MIMIC-III Patients with sepsis-3 and to determine whether such model performs better than traditional prediction models. Methods: Using the MIMIC-III v1.4, we identified patients with sepsis-3. The data was split into two groups based on death or survival within 30 days and variables, selected based on clinical significance and availability by stepwise analysis, were displayed and compared between groups. Three predictive models including conventional logistic regression model, SAPS-II score prediction model and XGBoost algorithm model were constructed by R software. Then, the performances of the three models were tested and compared by AUCs of the receiver operating characteristic curves and decision curve analysis. At last, nomogram and clinical impact curve were used to validate the model. Results: A total of 4559 sepsis-3 patients are included in the study, in which, 889 patients were death and 3670 survival within 30 days, respectively. According to the results of AUCs (0.819 [95% CI 0.800–0.838], 0.797 [95% CI 0.781– 0.813] and 0.857 [95% CI 0.839–0.876]) and decision curve analysis for the three models, the XGboost model performs best. The risk nomogram and clinical impact curve verify that the XGboost model possesses significant predictive value. Conclusions: Using machine learning technique by XGboost, more significant prediction model can be built. This XGboost model may prove clinically useful and assist clinicians in tailoring precise management and therapy for the patients with sepsis-3. Keywords: MIMIC-III, Sepsis-3, Machine learning, Xgboost, Logistic regression, SAPS-II score
*Correspondence: [email protected] † Nianzong Hou and Kai Wang contributed equally to this work 4 Department of Critical Care Medicine, Zibo Central Hospital, Shandong First Medical University , Zibo 255036, Shandong, China Full list of author information is available at the end of the article
Background Sepsis is a common and economically significant disease which has become an important public health issue globally and led to over 5.3 million people dies annually with an approximately overall mortality of 30%, particularly in the intensive care unit (ICU) [1–3]. Sepsis is defined as a syndrome of physiologic, pathologic, and biochemical abnormalities induced by infectio
Data Loading...