Assessing the Calibration in Toxicological in Vitro Models with Conformal Prediction

doi:10.21203/rs.3.rs-220364/v1

Research article

This work is licensed under a CC BY 4.0 License

Machine learning methods are widely used in drug discovery and toxicity prediction. Although they generally perform well in cross-validation studies, their predictive power often drops when the query samples have drifted away from the descriptor space of the training data. The assumption underlying the application of machine learning algorithms, that training and test data stem from the same distribution, is therefore not always fulfilled. In this work, conformal prediction is used to assess the calibration of the models. Deviations from the expected error rate may indicate that training and test data originate from different distributions. Using the Tox21 datasets as an example, which are composed of the chronologically released Tox21Train, Tox21Test and Tox21Score subsets, we observed that internally valid models could be trained using cross-validation on Tox21Train, but predictions on the external Tox21Score data resulted in higher error rates than expected. To improve the predictions on the external sets, we introduced a strategy that exchanges the calibration set for more recent data, such as Tox21Test, and found it to be successful. We conclude that conformal prediction can be used to diagnose data drifts and other issues relating to model calibration. The proposed improvement strategy, exchanging only the calibration data, is convenient because it does not require retraining the underlying model.
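
The idea of checking calibration and then exchanging only the calibration set can be illustrated with a minimal sketch. This is not the authors' code and does not use the Tox21 data: it assumes synthetic one-dimensional data, a fixed stand-in model (`model_proba`), and a class-conditional (Mondrian) inductive conformal predictor with nonconformity score 1 minus the predicted class probability. All function names, the drift size and the significance level are hypothetical choices made for illustration.

```python
import numpy as np

def conformal_p_values(cal_scores, test_scores):
    """p-value = (number of calibration scores >= test score + 1) / (n_cal + 1)."""
    cal_sorted = np.sort(cal_scores)
    n_ge = len(cal_sorted) - np.searchsorted(cal_sorted, test_scores, side="left")
    return (n_ge + 1) / (len(cal_sorted) + 1)

def mondrian_p_values(prob_cal, y_cal, prob_test):
    """Class-conditional p-values, one column per class."""
    n_classes = prob_cal.shape[1]
    p = np.empty((prob_test.shape[0], n_classes))
    for c in range(n_classes):
        # Nonconformity score: 1 - predicted probability of class c.
        cal_scores = 1.0 - prob_cal[y_cal == c, c]
        test_scores = 1.0 - prob_test[:, c]
        p[:, c] = conformal_p_values(cal_scores, test_scores)
    return p

def error_rate(p_values, y_true, significance):
    """Fraction of samples whose true label is left out of the prediction set."""
    p_true = p_values[np.arange(len(y_true)), y_true]
    return float(np.mean(p_true <= significance))

def sample(n, shift, rng):
    """Synthetic binary data; `shift` moves both classes to mimic a data drift."""
    y = rng.integers(0, 2, size=n)
    x = rng.normal(loc=np.where(y == 1, 1.0, -1.0) + shift, scale=1.0)
    return x, y

def model_proba(x):
    """Stand-in for an already trained model; it is never retrained here."""
    p1 = 1.0 / (1.0 + np.exp(-2.0 * x))
    return np.column_stack([1.0 - p1, p1])

def run_demo(seed=0, n=5000, significance=0.2, shift=-1.0):
    rng = np.random.default_rng(seed)
    x_old, y_old = sample(n, 0.0, rng)          # calibration data, training-time distribution
    x_recent, y_recent = sample(n, shift, rng)  # more recent calibration data (drifted)
    x_ext, y_ext = sample(n, shift, rng)        # external data (drifted)
    prob_ext = model_proba(x_ext)
    p_old = mondrian_p_values(model_proba(x_old), y_old, prob_ext)
    p_new = mondrian_p_values(model_proba(x_recent), y_recent, prob_ext)
    return (error_rate(p_old, y_ext, significance),
            error_rate(p_new, y_ext, significance))

err_old, err_new = run_demo()
print(f"expected error rate:            0.200")
print(f"observed, original calibration: {err_old:.3f}")
print(f"observed, recent calibration:   {err_new:.3f}")
```

In this toy setting, an observed error rate well above the chosen significance level signals that the calibration and external data are not exchangeable. Recomputing the p-values with a calibration set drawn from the drifted distribution brings the error rate back towards the expected level, while the model itself is left untouched.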

General Biochemistry

Chemical Engineering

toxicity prediction

conformal prediction

data drifts

applicability domain

calibration plots

Tox21 datasets

Editorial decision: Major revision
06 Mar, 2021
Review #2 received at journal
03 Mar, 2021
Review #1 received at journal
01 Mar, 2021
Reviewer #2 agreed at journal
21 Feb, 2021
Reviewers invited by journal
13 Feb, 2021
Reviews received at journal
13 Feb, 2021
Reviewer #1 agreed at journal
13 Feb, 2021
Submission checks completed at journal
09 Feb, 2021
Editor invited by journal
09 Feb, 2021
Editor assigned by journal
08 Feb, 2021
First submitted to journal
07 Feb, 2021

Status:

Version 1
