Machine learning techniques for mortality prediction in critical traumatic patients: anatomic and physiologic variables
- PDF / 886,148 Bytes
- 12 Pages / 595.276 x 790.866 pts Page_size
- 45 Downloads / 166 Views
(2020) 20:262
RESEARCH ARTICLE
Open Access
Machine learning techniques for mortality prediction in critical traumatic patients: anatomic and physiologic variables from the RETRAUCI study Luis Serviá1, Neus Montserrat1, Mariona Badia1, Juan Antonio Llompart-Pou2, Jesús Abelardo Barea-Mendoza3, Mario Chico-Fernández3, Marcelino Sánchez-Casado4, José Manuel Jiménez5, Dolores María Mayor6 and Javier Trujillano1*
Abstract Background: Interest in models for calculating the risk of death in traumatic patients admitted to ICUs remains high. These models use variables derived from the deviation of physiological parameters and/or the severity of anatomical lesions with respect to the affected body areas. Our objective is to create different predictive models of the mortality of critically traumatic patients using machine learning techniques. Methods: We used 9625 records from the RETRAUCI database (National Trauma Registry of 52 Spanish ICUs in the period of 2015–2019). Hospital mortality was 12.6%. Data on demographic variables, affected anatomical areas and physiological repercussions were used. The Weka Platform was used, along with a ten-fold cross-validation for the construction of nine supervised algorithms: logistic regression binary (LR), neural network (NN), sequential minimal optimization (SMO), classification rules (JRip), classification trees (CT), Bayesian networks (BN), adaptive boosting (ADABOOST), bootstrap aggregating (BAGGING) and random forest (RFOREST). The performance of the models was evaluated by accuracy, specificity, precision, recall, F-measure, and AUC. Results: In all algorithms, the most important factors are those associated with traumatic brain injury (TBI) and organic failures. The LR finds thorax and limb injuries as independent protective factors of mortality. The CT generates 24 decision rules and uses those related to TBI as the first variables (range 2.0–81.6%). The JRip detects the eight rules with the highest risk of mortality (65.0–94.1%). The NN model uses a hidden layer of ten nodes, which requires 200 weights for its interpretation. The BN find the relationships between the different factors that identify different patient profiles. Models with the ensemble methodology (ADABOOST, BAGGING and RandomForest) do not have greater performance. All models obtain high values in accuracy, specificity, and AUC, but obtain lower values in recall. The greatest precision is achieved by the SMO model, and the BN obtains the best recall, F-measure, and AUC. (Continued on next page)
* Correspondence: [email protected] 1 Servei de Medicina Intensiva, Hospital Universitari Arnau de Vilanova, Universitat de Lleida, IRBLleida, Avda Rovira Roure 80, 25198 Lleida, Spain Full list of author information is available at the end of the article © The Author(s). 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the o
Data Loading...