Research on model transparency remains relatively limited within the growing field of multimodal machine learning, particularly for text-tabular datasets. To address this gap, we present a novel multimodal masking framework that extends SHapley Additive exPlanations (SHAP) to text-tabular datasets. The framework, which we make publicly available, enables the generation of SHAP explanations for any text-tabular dataset with any combination method. By masking features according to their modality, it ensures that features are treated consistently across unimodal and multimodal settings. Moreover, by deferring the formation of the model input until after the masking call, the framework remains agnostic to how the input is formatted, avoiding the issues that arise when data are pre-formed into text and the existing text masker is applied. In an extensive study, we examine how combination strategies and language models affect SHAP explanations. Notably, the choice of combination method considerably influences which features are identified as most important. Our findings further reveal that methods converting all input to text tend to assign greater relative importance to text features than to tabular features.
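To make the core mechanism concrete, the following is a minimal, hypothetical sketch of modality-aware masking with deferred input formation. The function names (`mask_features`, `form_input`), the background-value replacement for tabular features, the `[MASK]` token, and the serialization format are all illustrative assumptions, not the paper's actual API.

```python
# Hypothetical sketch: one binary mask spans both modalities, and the
# model input is only assembled AFTER masking, so the same masker works
# with any combination method (e.g. all-text serialization).

def mask_features(mask, tabular, text_tokens, background):
    """Apply a single binary mask across tabular and text features.

    mask        : list of 0/1 flags, length len(tabular) + len(text_tokens)
    tabular     : dict mapping column name -> value
    text_tokens : list of word tokens
    background  : dict mapping column name -> replacement for masked columns
    """
    cols = list(tabular)
    n_tab = len(cols)
    # Tabular features: masked-out entries fall back to background values,
    # mirroring how tabular SHAP maskers behave in the unimodal setting.
    masked_tab = {
        c: (tabular[c] if mask[i] else background[c])
        for i, c in enumerate(cols)
    }
    # Text features: masked-out tokens are replaced with a mask token,
    # mirroring the unimodal text masker.
    masked_text = [
        tok if mask[n_tab + j] else "[MASK]"
        for j, tok in enumerate(text_tokens)
    ]
    return masked_tab, masked_text

def form_input(masked_tab, masked_text):
    # Input formation is deferred until after masking; swapping this
    # function changes the combination method without touching the masker.
    serialized = ", ".join(f"{k}: {v}" for k, v in masked_tab.items())
    return serialized + " | " + " ".join(masked_text)

tab = {"age": 42, "income": 50000}
bg = {"age": 0, "income": 0}
toks = ["great", "service"]
# Keep age and "service"; mask income and "great".
mt, mx = mask_features([1, 0, 0, 1], tab, toks, bg)
print(form_input(mt, mx))  # → age: 42, income: 0 | [MASK] service
```

In practice such a masker would be wrapped in a SHAP explainer (e.g. `shap.Explainer` with a custom masker); the sketch only illustrates why treating each modality natively avoids the problems of pre-forming the whole example into text.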