Survival prediction models since liver transplantation - comparisons between Cox models and machine learning techniques

doi:10.21203/rs.3.rs-22670/v3

Research article

https://doi.org/10.21203/rs.3.rs-22670/v3

This work is licensed under a CC BY 4.0 License

Journal Publication

published 16 Nov, 2020

Read the published version in BMC Medical Research Methodology →

You are reading this latest preprint version

Background: Predicting survival of recipients after liver transplantation is regarded as one of the most important challenges in contemporary medicine. Hence, improving on current prediction models is of great interest.

There is currently strong debate in the medical field about whether machine learning (ML) has greater potential than traditional regression models when dealing with complex data. Criticism of ML centres on the use of unsuitable performance measures and on a lack of interpretability, which matters to clinicians.

Methods: In this paper, ML techniques such as random forests and neural networks are applied to a large dataset of 62,294 patients from the United States to predict survival from the time of transplantation. The 97 predictors were selected on clinical and statistical grounds from more than 600 candidates. Identifying potential risk factors is also of particular interest. Three Cox models (with all variables, with backward selection, and with LASSO) are compared with three machine learning techniques: a random survival forest and two partial logistic artificial neural networks (PLANNs). For the PLANNs, novel extensions to their original specification are tested. Emphasis is placed on the advantages and pitfalls of each method and on the interpretability of the ML techniques.
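As an illustration of the data layout a PLANN typically works with, the sketch below expands one record per patient into one record per patient per time interval at risk, with the interval index as an extra input and a binary target marking the interval in which the event occurred. This is a minimal sketch and not the authors' implementation (their R code is in Additional file 4). The function name `to_person_period`, the yearly cut points, the handling of censored patients in their last interval, and the toy data are all assumptions made for illustration.

```python
from bisect import bisect_left


def to_person_period(times, events, covariates, cut_points):
    """Expand one record per patient into one record per patient per interval at risk.

    times:      follow-up times, in the same unit as cut_points
    events:     1 if graft failure or death was observed, 0 if censored
    covariates: one list of feature values per patient
    cut_points: increasing right endpoints of the intervals, e.g. [1, 2, ..., 10]

    Returns (rows, targets): each row is [interval_index, *features] and each
    target is 1 only in the interval where the event occurred.
    """
    rows, targets = [], []
    for t, d, x in zip(times, events, covariates):
        if t > cut_points[-1]:
            # Follow-up beyond the last interval: treat as censored at the horizon.
            last, d = len(cut_points) - 1, 0
        else:
            # Index of the interval (c[k-1], c[k]] that contains t.
            last = bisect_left(cut_points, t)
        for k in range(last + 1):
            rows.append([k + 1, *x])
            # Assumption: a censored patient contributes a 0 in the last interval too.
            targets.append(1 if (d == 1 and k == last) else 0)
    return rows, targets


# Hypothetical toy data: three patients, one covariate, yearly intervals up to 3 years.
rows, targets = to_person_period(
    times=[2.5, 1.0, 5.0],
    events=[1, 0, 1],
    covariates=[[0.1], [0.2], [0.3]],
    cut_points=[1, 2, 3],
)
print(len(rows))  # 7 person-period records
print(targets)    # [0, 0, 1, 0, 0, 0, 0]
```

A network with a logistic output trained on such rows estimates a discrete hazard per interval, and survival probabilities follow as the running product of one minus the hazard.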

Results: Well-established predictive measures from the survival field are employed (C-index, Brier score and Integrated Brier Score), and the strongest prognostic factors are identified for each model. The clinical endpoint is overall graft survival, defined as the time between transplantation and the date of graft failure or death. The random survival forest shows slightly better predictive performance than the Cox models based on the C-index. The neural networks show better performance than both the Cox models and the random survival forest based on the Integrated Brier Score at 10 years.
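To make two of these measures concrete, the sketch below computes Harrell's C-index for right-censored data and integrates a Brier score curve over time with the trapezoidal rule. The abstract does not state which estimators were used, so both functions are illustrative assumptions rather than the authors' method. The Brier scores passed to `integrated_brier_score` are assumed to be already computed (in practice with a correction for censoring), and all numbers are hypothetical toy values.

```python
def harrell_c_index(times, events, risk_scores):
    """Harrell's C: share of comparable pairs in which the subject who fails
    earlier has the higher predicted risk (ties in risk count as 0.5)."""
    concordant, comparable = 0.0, 0
    n = len(times)
    for i in range(n):
        if events[i] != 1:
            continue  # a pair is comparable only if the earlier time is an observed event
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


def integrated_brier_score(eval_times, brier_scores):
    """Trapezoidal integral of the Brier score curve, divided by the time span."""
    area = 0.0
    for k in range(1, len(eval_times)):
        width = eval_times[k] - eval_times[k - 1]
        area += width * (brier_scores[k] + brier_scores[k - 1]) / 2.0
    return area / (eval_times[-1] - eval_times[0])


# Hypothetical toy data: four patients, the third one censored at time 3.
c = harrell_c_index(
    times=[1, 2, 3, 4],
    events=[1, 1, 0, 1],
    risk_scores=[0.9, 0.7, 0.8, 0.1],
)
print(c)  # 0.8 (4 of 5 comparable pairs are concordant)

# Hypothetical Brier scores at 0, 5 and 10 years.
ibs = integrated_brier_score([0, 5, 10], [0.0, 0.1, 0.2])
print(round(ibs, 3))  # 0.1
```

A higher C-index means better discrimination, whereas a lower Integrated Brier Score means more accurate predicted survival probabilities, so the two measures can rank models differently.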

Conclusion: This work shows that machine learning techniques can be a useful tool for both prediction and interpretation in the survival context. Of the ML techniques examined here, the PLANN with 1 hidden layer predicts survival probabilities most accurately and is as well calibrated as the Cox model with all variables.

Health Economics & Outcomes Research

Random Survival Forests

Neural Networks

Predictive Performance

Risk Factors

Post-transplantation

Survival analysis

Additionalfile1.pdf
Additional file 1: Supplementary material. Additional file 1 includes Garson’s algorithm for 2 hidden layers, a table with the relative importance of the time intervals for the neural networks with 1 and 2 hidden layers, detailed criteria for variable pre-selection, a plot of the survival and censoring distributions, and 4 tables with individual patient characteristics.
Additionalfile2.pdf
Additional file 2: Model training. Additional file 2 provides information about the package used to implement RSFs and NNs, as well as technical details on the choice of tuning parameters and the cross-validation procedure for each method. A figure illustrates the cross-validation procedure for RSF in 3D space. References are provided for further reading.
Additionalfile3.docx
Additional file 3: Calibration plots at 5 and 10 years since LT. Additional file 3 contains calibration plots at 5 and 10 years for a) a Cox model with all prognostic factors, b) a Random Survival Forest with all prognostic factors, c) a Partial Logistic Artificial Neural Network with 1 hidden layer with all prognostic factors and d) a Partial Logistic Artificial Neural Network with 2 hidden layers with all prognostic factors.
Additionalfile4.docx
Additional file 4: R code for implementing the analyses. Additional file 4 provides the R code developed for the analyses of this project.

Submission checks completed at journal
25 Oct, 2020
Editorial decision: Accept
23 Oct, 2020
