Development of Novel Deep Multimodal Representation Learning-based Model for the Differentiation of Liver Tumors on B-Mode Ultrasound Images

doi:10.21203/rs.3.rs-143117/v1


Research Article


This work is licensed under a CC BY 4.0 License

Journal publication: published 14 Dec, 2021, in Journal of Gastroenterology and Hepatology

Version 1


Abstract

Recently, multimodal representation learning for images and other information such as numbers or language has gained much attention due to the possibility of combining latent features using a single distribution. The aim of the current study was to analyze the diagnostic performance of deep multimodal representation model-based integration of tumor image, patient background, and blood biomarkers for the differentiation of liver tumors observed using B-mode ultrasonography (US). First, we applied supervised learning with a convolutional neural network (CNN) to 972 liver nodules in the training and development sets (479 benign and 493 malignant nodules) to develop a predictive model using segmented B-mode tumor images. We also applied a deep multimodal representation model to integrate information about patient background or blood biomarkers with the B-mode images. We then investigated the performance of the models in an independent test set of 108 liver nodules, including 53 benign and 55 malignant tumors. Using only the segmented B-mode images, the diagnostic accuracy and area under the curve (AUC) were 68.52% and 0.721, respectively. As information about patient background (such as age or sex) and blood biomarkers was integrated, the diagnostic performance increased in a stepwise manner. The diagnostic accuracy and AUC of the multimodal deep learning (DL) model (which integrated B-mode tumor image, patient age, sex, aspartate aminotransferase [AST], alanine aminotransferase [ALT], platelet count, and albumin data) reached 96.30% and 0.994, respectively. Integration of patient background and blood biomarkers with US images using multimodal representation learning outperformed the CNN model that used US images alone. We expect that the deep multimodal representation model could be a feasible and acceptable tool that can effectively support the definitive diagnosis of liver tumors using B-mode US in daily clinical practice.

Subject areas: Cancer Biology; Gastroenterology & Hepatology

Keywords: B-mode; convolutional neural network; deep multimodal representation learning; MAGELLAN BLOCKS; liver tumor

Introduction

Ultrasonography (US) is widely used for hepatocellular carcinoma (HCC) surveillance to screen high-risk populations because of its cost-effectiveness and non-invasiveness. However, a definitive diagnosis of liver tumors observed using B-mode sonography can be difficult because of the low specificity of this modality.¹ Currently, B-mode sonography is usually used in combination with contrast-enhanced imaging modalities such as computed tomography (CT) or magnetic resonance imaging (MRI) to obtain a definitive diagnosis. However, since B-mode US provides structural information that may reflect the histological characteristics of the tumor,² a precise and objective recognition of B-mode images has the potential to become a powerful tool for the qualitative diagnosis of liver tumors.

Machine learning (ML) is a multidisciplinary field combining computer science and mathematics that focuses on implementing computer algorithms capable of maximizing predictive accuracy from static or dynamic data sources using analytic or probabilistic models.³ Deep learning (DL) architectures have become a major topic in the ML field and have been successfully used for image classification.⁴ The ImageNet Large Scale Visual Recognition Challenge is an annual computer vision competition; in the competition held in 2017, DL technology with a deep convolutional neural network (CNN) achieved a misclassification rate of less than 5%, suggesting that CNNs can classify images more precisely than humans on this benchmark.⁵ Recently, multimodal representation learning for images and other information such as numbers or language has gained much attention due to the possibility of combining latent features using a single distribution.⁶ In addition to information on B-mode images of liver tumors, patient background and data on biomarkers of liver inflammation (aspartate aminotransferase [AST] and alanine aminotransferase [ALT]) or fibrosis (platelet count)⁷ are commonly collected in daily clinical practice. In addition, serum albumin levels, which have been shown to be decreased in cancer patients,⁸ are widely available. These biomarkers alter the pretest probability for the diagnosis of liver tumors using B-mode US, and thus are useful for the definitive diagnosis of liver tumors detected by US.

Although multimodal representation learning-based integration of B-mode images, patient background, and blood biomarkers is a promising means of making a definitive diagnosis, the clinical utility of multimodal representation learning with a DL model for the classification of liver tumors has not yet been elucidated. The current study was designed to assess the diagnostic significance of adding patient background or blood data to B-mode US images, and to analyze the diagnostic performance of a deep multimodal representation model for the differentiation of liver tumors observed using B-mode US.

Results

Patient and tumor characteristics

The details of the liver tumors included in the current study are shown in Table 1. The majority of benign tumors were hemangiomas, and the majority of malignant tumors were HCCs.

Table 1

Tumor characteristics (n = 1080). The columns from ce CT to Pathology give the diagnostic basis for each type of nodule.
Type of nodule	Number of patients	*ce CT	†ce MRI	‡ce US	§ne CT	¶ne MRI	‖ne US and clinical course	Pathology
Benign tumor
Hemangioma	405	187	108	0	0	33	77	0
Angiomyolipoma	7	7	0	0	0	0	0	0
Complicated cyst	4	2	2	0	0	0	0	0
Ciliated hepatic foregut cyst	1	0	1	0	0	0	0	0
Focal nodular hyperplasia	19	2	2	0	0	15	0	0
Focal spared lesion	1	0	0	0	0	1	0	0
Focal fat deposition	2	0	1	0	0	0	1	0
Organized abscess	1	0	1	0	0	0	0	0
Others	92	40	24	0	0	0	28	0
Malignant tumor
Hepatocellular carcinoma	440	281	153	3	0	2	0	1
Intrahepatic cholangiocarcinoma	4	4	0	0	0	0	0	0
Metastatic tumor	104	74	18	0	0	0	0	12
*ce CT, contrast-enhanced computed tomography (CT); †ce MRI, contrast-enhanced magnetic resonance imaging (MRI); ‡ce US, contrast-enhanced ultrasonography (US); §ne CT, non-enhanced CT; ¶ne MRI, non-enhanced MRI; ‖ne US, non-enhanced US.

Patient characteristics are shown in Table 2. The proportion of male patients was significantly higher among patients with malignant nodules than among those with benign nodules. Compared with patients with benign nodules, patients with malignant liver tumors were significantly older and had significantly higher serum levels of AST, ALT, gamma-glutamyl transpeptidase, and alkaline phosphatase, whereas their white blood cell count, hemoglobin level, platelet count, and serum albumin level were lower.

Table 2

Patient characteristics (n = 1080)
Variables	Benign	Malignant	P value
Sex, n (%)			< 0.001
Female	286 (53.8)	191 (34.9)
Male	246 (46.2)	357 (65.1)
Age (years)*	58.0 (47.0 – 68.0)	73.0 (66.0 – 80.0)	< 0.001
White blood cell count (× 10³/μL)*	5.4 (4.5 – 6.4)	5.0 (4.0 – 6.4)	0.01
Hemoglobin level (g/dL)*	13.7 (13.1 – 14.8)	12.1 (10.575 – 13.7)	< 0.001
Platelet count (× 10⁴/μL)*	23.25 (19.8 – 27.3)	12.5 (9.275 – 18.6)	< 0.001
AST level (U/L)*	20.0 (17.0 – 25.0)	35.0 (25.0 – 51.0)	< 0.001
ALT level (U/L)*	16.0 (12.0 – 23.0)	25.0 (16.0 – 42.0)	< 0.001
Albumin level (g/dL)*	4.2 (4.1 – 4.4)	3.6 (3.2 – 4.0)	< 0.001
Gamma-glutamyl transpeptidase level*†	24.0 (17.0 – 39.0)	50.0 (32.0 – 140.5)	< 0.001
Alkaline phosphatase level*‡	203.0 (171.0 – 256.0)	330.0 (247.5 – 452.0)	< 0.001
*Data are expressed as the median (interquartile range). †Missing in three cases. ‡Missing in 28 cases.
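The "median (interquartile range)" summaries in Table 2 can be reproduced from raw values along the following lines. This is a minimal sketch: the input values are hypothetical, and the choice of linear interpolation between order statistics (the `inclusive` method of Python's `statistics.quantiles`) is an assumption, since the quantile definition used in the study is not stated.

```python
import statistics


def median_iqr(values):
    """Return (median, Q1, Q3) for a list of numbers.

    Quartiles use linear interpolation between order statistics
    (method="inclusive"); this rule is an assumption, not taken from the study.
    """
    q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return q2, q1, q3


# Hypothetical platelet counts (x 10^4/uL), for illustration only.
example_values = [9.1, 12.5, 18.6, 23.3, 27.3]
median, q1, q3 = median_iqr(example_values)
print(f"{median:.1f} ({q1:.1f} - {q3:.1f})")
```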

Predictive accuracy of CNN models for discriminating malignant and benign nodules

Table 3 shows the diagnostic accuracy, sensitivity, and specificity of each DL model in the test set. Using only the segmented B-mode images (DL model 1), the diagnostic accuracy, sensitivity, and specificity were 68.52%, 67.27%, and 69.81%, respectively.

The diagnostic performance increased in a stepwise manner with the integration of patient background information (such as age or sex) and blood biomarkers. The diagnostic accuracy of DL model 5 (the model integrating the data on B-mode tumor image, patient age, sex, AST, ALT, platelet count, and albumin) reached 96.30%. The sensitivity and specificity of DL model 5 were 100.0% and 92.45%, respectively.

Table 3

Diagnostic accuracy of each deep learning (DL) model in the test set
DL model	Accuracy, % (n/N)	Sensitivity, % (n/N)	Specificity, % (n/N)
DL model 1 (Model using B-mode image only)	68.52 (74/108)	67.27 (37/55)	69.81 (37/53)
DL model 2 (Model 1 + patient age, sex)	71.30 (77/108)	78.18 (43/55)	64.15 (34/53)
DL model 3 (Model 2 + AST, ALT)	87.04 (94/108)	89.09 (49/55)	84.91 (45/53)
DL model 4 (Model 3 + platelet count)	91.67 (99/108)	94.55 (52/55)	88.68 (47/53)
DL model 5 (Model 4 + albumin)	96.30 (104/108)	100.00 (55/55)	92.45 (49/53)
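The percentages in Table 3 follow directly from the reported n/N counts. The sketch below recomputes them; the counts are taken from the table, with false negatives and false positives derived as 55 minus the true positives and 53 minus the true negatives. The function and variable names are illustrative only.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Return (accuracy, sensitivity, specificity) as fractions."""
    sensitivity = tp / (tp + fn)   # malignant nodules correctly identified
    specificity = tn / (tn + fp)   # benign nodules correctly identified
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return accuracy, sensitivity, specificity


# (true positives, false negatives, true negatives, false positives),
# derived from the n/N values reported in Table 3 (55 malignant, 53 benign).
table3_counts = {
    "DL model 1": (37, 18, 37, 16),
    "DL model 2": (43, 12, 34, 19),
    "DL model 3": (49, 6, 45, 8),
    "DL model 4": (52, 3, 47, 6),
    "DL model 5": (55, 0, 49, 4),
}

for name, counts in table3_counts.items():
    acc, sens, spec = diagnostic_metrics(*counts)
    print(f"{name}: accuracy {acc:.2%}, sensitivity {sens:.2%}, specificity {spec:.2%}")
```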

ROC curve analysis of DL models

The ROC curves for the prediction of malignant tumors were plotted for each DL model (Figure 2). The AUCs for the prediction of malignant tumors for DL models 1, 2, 3, 4, and 5 were 0.721, 0.803, 0.955, 0.982, and 0.994, respectively. The predictive AUC values of DL models 3 to 5 were significantly higher than those of DL model 1 (Supplementary Table 2).
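To relate these AUCs to the grading scale given in the Statistical analysis section (0.5-0.6, fail; 0.6-0.7, poor; 0.7-0.8, fair; 0.8-0.9, good; 0.9-1, excellent), a small helper such as the following could be used. It is only a sketch: treating each lower bound as inclusive and labeling values below 0.5 as outside the scale are assumptions, because the scale does not specify either case.

```python
def grade_auc(auc):
    """Map an AUC to the grading scale used in this study.

    Lower bounds are treated as inclusive (an assumption); values below 0.5
    are not covered by the scale and are reported as such.
    """
    if not 0.0 <= auc <= 1.0:
        raise ValueError("AUC must lie between 0 and 1")
    if auc < 0.5:
        return "not covered by the scale"
    if auc < 0.6:
        return "fail"
    if auc < 0.7:
        return "poor"
    if auc < 0.8:
        return "fair"
    if auc < 0.9:
        return "good"
    return "excellent"


# AUCs reported for DL models 1 to 5.
reported_aucs = {1: 0.721, 2: 0.803, 3: 0.955, 4: 0.982, 5: 0.994}
for model_no, auc in reported_aucs.items():
    print(f"DL model {model_no}: AUC {auc:.3f} ({grade_auc(auc)})")
```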

Variability in lesion segmentation

To assess the intra-observer and inter-observer reliability of the manual segmentation, we selected 20 cases and re-performed the segmentation. We analyzed the correlation of lesion size, measured as the number of pixels in the segmented images. Supplementary Figure 1a shows the intra-observer correlation of image size (number of pixels) between the original segmentation (A) and the re-performed segmentation (B), both performed by M.S., and Supplementary Figure 1b shows the inter-observer correlation between the two observers (M.S. and Y.S.). Adequate agreement on lesion size was found in both the intra-observer (ICC 0.990; 95% confidence interval [CI], 0.976 to 0.996) and inter-observer (ICC 0.959; 95% CI, 0.895 to 0.984) assessments.
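For readers unfamiliar with the ICC, the following sketch computes one common form, the two-way random-effects, absolute-agreement, single-measurement coefficient often written ICC(2,1), from a table of pixel counts. Which ICC form was used in this study is not stated, so this choice is an assumption, and the pixel counts below are hypothetical.

```python
def icc_2_1(ratings):
    """ICC(2,1) for a list of rows, one row per lesion, one column per rating.

    Uses the two-way ANOVA mean squares (rows = lesions, columns = raters).
    The choice of this ICC form is an assumption for illustration.
    """
    n = len(ratings)        # number of lesions
    k = len(ratings[0])     # number of ratings per lesion
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Hypothetical pixel counts: (original segmentation, repeated segmentation).
pixel_counts = [
    [10250, 10410],
    [4820, 4755],
    [22100, 21870],
    [7640, 7905],
    [15330, 15120],
]
print(f"ICC(2,1) = {icc_2_1(pixel_counts):.3f}")
```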

Discussion

DL has gained increasing attention as an artificial intelligence strategy.⁴ Image recognition technology has also improved dramatically, and its use in the medical field is increasing rapidly.⁹⁻¹³ As B-mode US itself provides structural information, objective recognition of B-mode images using an ML approach has the potential to become a powerful tool for the qualitative diagnosis of liver tumors. In some fields, computer technology performs better than humans because of its ability to manage large amounts of information and to repeat the same routines exactly time after time.¹⁴ A previous study by Brehar et al. investigated the differentiation of HCC from cirrhotic parenchyma using B-mode US, and reported a higher performance of the DL approach than that of classical ML classifiers such as gradient boosting, support vector machines, or random forest-based classifiers.¹⁵ This result is potentially applicable to the definitive diagnosis of liver tumors using B-mode US. In the present study, the CNN image processing network (DL model 1) showed fair performance (AUC of 0.721) for the differentiation of liver tumors.

Recently, multimodal representation learning for images and other information has gained much attention because of the possibility of combining latent features using a single distribution.⁶ A number of previous studies have reported that multimodal representation outperformed unimodal representation-based approaches in various applications.⁶,¹⁶,¹⁷ In multimedia applications, multimodal learning is becoming increasingly necessary and important because different modalities typically carry different information.¹⁷

In the present study, we applied deep multimodal representation learning to the definitive diagnosis of liver tumors on B-mode images. Stepwise integration of information on patient background and blood biomarkers improved the predictive performance of the original image processing model (DL model 1). The AUC of the proposed multimodal network using information on patient age, sex, AST, ALT, platelet count, and albumin in addition to the B-mode image (DL model 5) reached 0.994, and the network significantly outperformed the original model. To the best of our knowledge, this study is the first to investigate the clinical utility of deep multimodal representation model-based integration for the differentiation of liver tumors observed using B-mode US.

Lately, DL with multimodal representation has been applied to various clinical fields.¹⁸⁻²⁰ In the HCC field, a study from China built a multi-modal and multi-task DL model to predict the prognosis of patients with HCC after transarterial chemoembolization (TACE).²¹ Using evidence-based clinical scores such as the “American Joint Committee on Cancer stage” and “Response Evaluation Criteria in Solid Tumors (RECIST)” in addition to HCC images from dual-phase contrast-enhanced CT, the AUCs for predicting the 3-year, 5-year, and 10-year survival rates were reported to be 0.85, 0.910, and 0.89, respectively.

In the current study, because information about tumor markers such as AFP or DCP was lacking in a considerable number of cases with benign tumors, it was not possible to investigate the performance of a multimodal representation model using tumor markers. Large volumes of data are expected to be stored on cloud storage platforms in the future. We expect that the performance of the DL model will be further improved by using a larger volume of training data, including tumor markers. In addition, DL-based frameworks could be used to develop more complicated models or systems to aid clinical decision-making in the future.

Our study had several limitations. First, we used B-mode images obtained using limited types of US devices (Aplio 300 or Aplio 500 instruments) at a single institution for the construction of the CNN model. Further multicenter studies are needed to fully understand the clinical utility of the CNN model for B-mode image recognition. Second, histological proof of a liver tumor was lacking in the majority of cases. However, the histological diagnosis of HCC is currently rarely required, as non-invasive methods are preferred. HCC can be diagnosed with the use of triphasic CT, contrast-enhanced MRI (gadolinium, Primovist), or contrast-enhanced US (Sonazoid).²² These non-invasive modalities are widely available and have largely replaced biopsy for HCC diagnosis.²³

In conclusion, with the integration of patient background information and blood biomarkers in addition to US images, multimodal representation learning outperformed the CNN model that used US images alone. We expect that the deep multimodal representation model could be a feasible and acceptable tool that can effectively support the definitive diagnosis of liver tumors using B-mode US in daily clinical practice.

Materials and Methods

Patients

We enrolled patients who visited our institution between April 2016 and November 2018 and in whom liver tumors were detected on US examination. Patients for whom information on age, sex, AST, ALT, platelet count, and albumin was available were selected. Simple cysts were not included in the study. In addition, we excluded the tumor images that included measurement lines only, and those without information on the benign or malignant nature of the tumors. Finally, we extracted the data for a total of 1080 patients with US-detected liver tumors in whom a clinical diagnosis was made (548 malignant nodules and 532 benign nodules).

Contrast-enhanced CT, MRI, and US were used to obtain a definitive diagnosis of liver nodules using validated imaging criteria.²⁴⁻²⁹ We also used tumor markers (e.g., alpha-fetoprotein [AFP] or des-gamma-carboxy prothrombin [DCP] for HCC, and carcinoembryonic antigen or carbohydrate antigen 19-9 for metastatic liver tumors) as diagnostic aids. When a definitive diagnosis could not be made using these modalities, a US-guided tumor biopsy was performed. Patients clinically diagnosed with benign tumors with no evidence of clinical progression and without any treatment were also included.³⁰,³¹

To evaluate the predictive ability of the DL models, we randomly split the 1080 lesions into three groups, as follows: (i) the training set (864 lesions, 80%), which was used to build the model; (ii) the development set (108 lesions, 10%), which was used for tuning the model parameters; and (iii) the test set (108 lesions, 10%), which was used to evaluate the performance of each classifier. We then assessed the predictive accuracy of the developed model.
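The random split into training, development, and test sets can be sketched as follows. The seed, the function name, and the use of integer lesion identifiers are assumptions for illustration; the study does not describe how the random assignment was implemented.

```python
import random


def split_lesions(lesion_ids, seed=0):
    """Randomly split lesions into training (80%), development, and test sets.

    The remaining 20% is divided equally between development and test.
    The seed is arbitrary and only makes the example reproducible.
    """
    ids = list(lesion_ids)
    random.Random(seed).shuffle(ids)
    n_train = len(ids) * 8 // 10          # integer arithmetic avoids float rounding
    n_dev = (len(ids) - n_train) // 2
    train = ids[:n_train]
    dev = ids[n_train:n_train + n_dev]
    test = ids[n_train + n_dev:]
    return train, dev, test


train_ids, dev_ids, test_ids = split_lesions(range(1080))
print(len(train_ids), len(dev_ids), len(test_ids))
```

Because the split is purely random, the benign-to-malignant ratio in each subset is not guaranteed to match that of the whole cohort; a stratified split would be one alternative design.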

The current study was performed in accordance with the ethical guidelines of the Declaration of Helsinki. This research project was approved by the ethics committee of our university hospital (approval number, 11941). Informed consent was obtained in the opt-out format, on the institution’s website. Patients who opted out of participating in our study were excluded. The study design was also included in a comprehensive protocol for retrospective studies, and was approved by the ethics committee of our institution (approval number, 2058).

Study Outline

The current study consisted of two stages: a training stage and a validation stage. In the training stage, we developed a DL model with 972 samples (864 samples for the training set and 108 samples for the development set). The MobileNet version 2 (MobileNet v2) CNN architecture was used for model development.³² In the validation stage, we assessed the diagnostic accuracy of the developed model using the test set, which consisted of 108 samples that were completely independent of the training samples.

Image processing

All ultrasound examinations were performed using a Toshiba Aplio 300 or Aplio 500 instrument (Canon Medical Systems Co., Tokyo, Japan) fitted with 3.5-5 MHz transducers. We used still images of tumors captured and stored in routine clinical practice. These images were manually annotated for this study by an expert hepatologist (M.S.) and an expert sonographer (Y.S.); the original B-mode US images were annotated with rectangular bounding boxes that covered the whole tumor nodule while keeping the area outside the tumor lesion as small as possible (Figure 1).
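As an illustration of what a rectangular bounding-box annotation yields, the sketch below crops a region from a grayscale image stored as a list of pixel rows and reports its size in pixels (the quantity used for the segmentation reliability analysis). The image, coordinates, and function names are hypothetical; the annotation tool actually used is not described.

```python
def crop_bounding_box(image, top, left, height, width):
    """Return the sub-image inside a rectangular bounding box.

    `image` is a list of rows of grayscale pixel values. Raises ValueError
    if the box does not lie entirely inside the image.
    """
    n_rows = len(image)
    n_cols = len(image[0]) if image else 0
    if top < 0 or left < 0 or height <= 0 or width <= 0:
        raise ValueError("box coordinates must be non-negative and sizes positive")
    if top + height > n_rows or left + width > n_cols:
        raise ValueError("bounding box extends beyond the image")
    return [row[left:left + width] for row in image[top:top + height]]


def pixel_count(image):
    """Number of pixels in an image given as a list of rows."""
    return sum(len(row) for row in image)


# Hypothetical 4 x 5 image whose pixel value encodes its position (row * 10 + column).
toy_image = [[r * 10 + c for c in range(5)] for r in range(4)]
tumor_patch = crop_bounding_box(toy_image, top=1, left=2, height=2, width=3)
print(tumor_patch, pixel_count(tumor_patch))
```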

Development of the algorithm (training stage)

First, we applied supervised learning with a CNN to a total of 972 B-mode liver nodules in the training and development sets (479 benign and 493 malignant nodules) to develop the image-only model (model 1) using the MobileNet v2 architecture. In addition to the CNN image processing network, we applied multimodal representation DL to integrate the information on patient background or blood biomarkers with the B-mode images. In a stepwise manner, we integrated the information on patient background, namely age and sex (model 2), liver inflammation (AST and ALT) (model 3), liver fibrosis (platelet count) (model 4), and albumin (model 5) (Supplementary Table 1).
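The stepwise design of models 1 to 5 can be summarized as a mapping from model number to the non-image inputs it receives. The sketch below also shows one simple way such inputs could be combined with an image representation, namely concatenating them into a single feature vector. This concatenation is an assumption made for illustration only: how the platform used in this study fuses the modalities internally is not described here, and the embedding values, the patient record, and the coding of sex as 0/1 are hypothetical.

```python
# Non-image inputs added at each step, following the text of this section.
MODEL_FEATURES = {
    1: [],
    2: ["age", "sex"],
    3: ["age", "sex", "ast", "alt"],
    4: ["age", "sex", "ast", "alt", "platelet"],
    5: ["age", "sex", "ast", "alt", "platelet", "albumin"],
}


def build_input(image_embedding, patient, model_no):
    """Concatenate an image embedding with the tabular inputs of one model.

    Feature-level concatenation is an illustrative assumption, not the
    documented fusion method of the study.
    """
    tabular = [float(patient[name]) for name in MODEL_FEATURES[model_no]]
    return list(image_embedding) + tabular


# Hypothetical patient record (sex coded as 1 = male, 0 = female) and
# hypothetical 3-dimensional image embedding.
patient = {"age": 73, "sex": 1, "ast": 35, "alt": 25, "platelet": 12.5, "albumin": 3.6}
embedding = [0.12, -0.40, 0.88]

for model_no in sorted(MODEL_FEATURES):
    print(model_no, build_input(embedding, patient, model_no))
```

In practice, tabular values on very different scales (for example, age and albumin) would usually be standardized before being passed to a network.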

Validation of the algorithm (validation stage)

In the validation stage, we examined the accuracy of the trained models using the 108 nodules in the test set (55 malignant and 53 benign), for which segmented images from the original B-mode US images, patient background, and blood biomarkers were available. Each nodule was evaluated using the DL models developed in the training stage, and each trained model output the probability of malignancy.
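Turning the predicted probabilities of malignancy into confusion-matrix counts can be sketched as below. The decision threshold of 0.5 is an assumption, as the cut-off used in the study is not stated, and the labels and probabilities are hypothetical.

```python
def confusion_counts(labels, probabilities, threshold=0.5):
    """Return (tp, fn, tn, fp) for binary labels (1 = malignant, 0 = benign).

    A nodule is called malignant when its predicted probability is at least
    `threshold`; the default of 0.5 is an assumption for illustration.
    """
    tp = fn = tn = fp = 0
    for label, prob in zip(labels, probabilities):
        predicted = 1 if prob >= threshold else 0
        if label == 1 and predicted == 1:
            tp += 1
        elif label == 1:
            fn += 1
        elif predicted == 0:
            tn += 1
        else:
            fp += 1
    return tp, fn, tn, fp


# Hypothetical ground truth and model outputs for five nodules.
true_labels = [1, 1, 0, 0, 1]
predicted_probs = [0.90, 0.30, 0.20, 0.70, 0.50]
print(confusion_counts(true_labels, predicted_probs))
```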

Statistical analysis

Continuous variables were expressed as medians and interquartile ranges, while categorical variables were expressed as frequencies (%). Categorical data and continuous data were analyzed using the chi-square test and the Mann-Whitney U test, respectively.
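As a worked example of the chi-square test, the sketch below applies it to the sex distribution reported in Table 2 (286 female and 246 male patients with benign nodules; 191 female and 357 male patients with malignant nodules). It computes the Pearson statistic without a continuity correction; whether a correction was applied in the original analysis is not stated, so this is an assumption.

```python
import math


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square test for the 2 x 2 table [[a, b], [c, d]].

    No continuity correction is applied (an assumption). With one degree of
    freedom, the p-value equals erfc(sqrt(chi2 / 2)).
    """
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Rows: female, male; columns: benign, malignant (counts from Table 2).
chi2, p_value = chi_square_2x2(286, 191, 246, 357)
print(f"chi-square = {chi2:.2f}, p = {p_value:.2e}")
```

The resulting p-value is far below 0.001, in line with the value reported for sex in Table 2.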

In the validation stage, we investigated the performance of each model in an independent test set by calculating its accuracy, sensitivity, and specificity using a confusion matrix generated by the CNN and the multimodal representation models. We also used receiver operating characteristic (ROC) curve analysis to assess the predictive accuracy. The area under the curve (AUC) was evaluated as the ability to discriminate malignant nodules; comparison of the AUC values was carried out using the DeLong test.³³ We used the following grading scale for the interpretation of the AUC results: AUC 0.5-0.6, fail; AUC 0.6-0.7, poor performance; AUC 0.7-0.8, fair performance; AUC 0.8-0.9, good performance; AUC 0.9-1, excellent performance.³⁴,³⁵ The intraclass correlation coefficient (ICC) was used to calculate the intra- or inter-observer variance of continuous variables. Statistical analyses were performed using the R 3.4.3 software (https://cran.r-project.org/).
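The AUC has a simple interpretation that can be computed directly: it is the probability that a randomly chosen malignant nodule receives a higher predicted score than a randomly chosen benign nodule, with ties counted as one half. The sketch below implements this pairwise definition on hypothetical data; it is not the R routine used in the study and does not include the DeLong comparison.

```python
def auc_score(labels, scores):
    """AUC as the fraction of (malignant, benign) pairs ranked correctly.

    labels: 1 = malignant, 0 = benign. Ties contribute 0.5.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("both classes must be present")
    wins = 0.0
    for p in positives:
        for q in negatives:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical labels and predicted probabilities of malignancy.
example_labels = [0, 0, 1, 1]
example_scores = [0.10, 0.40, 0.35, 0.80]
print(auc_score(example_labels, example_scores))
```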

Abbreviations

AFP, alpha-fetoprotein; ALT, alanine aminotransferase; AST, aspartate aminotransferase; AUC, area under the curve; CI, confidence interval; CNN, convolutional neural network; CT, computed tomography; DCP, des-gamma-carboxy prothrombin; DL, deep learning; HCC, hepatocellular carcinoma; ICC, intraclass correlation coefficient; ML, machine learning; MRI, magnetic resonance imaging; ROC, receiver operating characteristic; US, ultrasonography

Acknowledgement

We thank Groovenauts, Inc. for providing the machine learning API and cloud platform “MAGELLAN BLOCKS”.

This work was supported by the Health, Labour, and Welfare Policy Research Grants from the Ministry of Health, Labour, and Welfare of Japan (Policy Research for Hepatitis Measures [H30-Kansei-Shitei-003]).

This document has been edited by a professional English language editor, who is a native speaker of English.

Conflict of interest disclosure

The authors have no conflicts of interest to disclose.

References

1. Clark, T., Maximin, S., Meier, J., Pokharel, S. & Bhargava, P. Hepatocellular Carcinoma: Review of Epidemiology, Screening, Imaging Diagnosis, Response Assessment, and Treatment. Curr Probl Diagn Radiol 44, 479-486, doi:10.1067/j.cpradiol.2015.04.004 (2015).
2. Kim, K. A. et al. Small hepatocellular carcinoma: ultrasonographic findings and histopathologic correlation. Clinical imaging 27, 340-345 (2003).
3. Wang, S. & Summers, R. M. Machine learning and radiology. Medical image analysis 16, 933-951, doi:10.1016/j.media.2012.02.005 (2012).
4. Krizhevsky, A., Sutskever, I. & Hinton, G. E. in Advances in neural information processing systems. 1097-1105.
5. Park, E. et al. ILSVRC-2017. URL http://www.image-net.org/challenges/LSVRC/2017 (2017).
6. Pandey, G. & Dukkipati, A. in 2017 International Joint Conference on Neural Networks (IJCNN). 308-315 (IEEE).
7. Poynard, T. & Bedossa, P. Age and platelet count: a simple index for predicting the presence of histological lesions in patients with antibodies to hepatitis C virus. METAVIR and CLINIVIR Cooperative Study Groups. Journal of viral hepatitis 4, 199-208, doi:10.1046/j.1365-2893.1997.00141.x (1997).
8. Costa, G. Cachexia, the metabolic component of neoplastic diseases. Cancer Research 37, 2327-2335 (1977).
9. Aoki, T. et al. Automatic detection of erosions and ulcerations in wireless capsule endoscopy images based on a deep convolutional neural network. Gastrointestinal Endoscopy (2018).
10. Gulshan, V. et al. Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. JAMA 316, 2402-2410, doi:10.1001/jama.2016.17216 (2016).
11. Mori, Y. et al. Real-Time Use of Artificial Intelligence in Identification of Diminutive Polyps During Colonoscopy: A Prospective Study. Annals of internal medicine 169, 357-366, doi:10.7326/m18-0249 (2018).
12. Ransohoff, J. D. et al. Detecting Chemotherapeutic Skin Adverse Reactions in Social Health Networks Using Deep Learning. JAMA oncology 4, 581-583, doi:10.1001/jamaoncol.2017.5688 (2018).
13. Yasaka, K., Akai, H., Abe, O. & Kiryu, S. Deep learning with convolutional neural network for differentiation of liver masses at dynamic contrast-enhanced CT: a preliminary study. Radiology 286, 887-896 (2017).
14. Jaakkola, H., Henno, J., Mäkelä, J. & Thalheim, B. in Information and Communication Technology, Electronics and Microelectronics (MIPRO), 2017 40th International Convention on. 635-643 (IEEE).
15. Brehar, R. et al. Comparison of Deep-Learning and Conventional Machine-Learning Methods for the Automatic Recognition of the Hepatocellular Carcinoma Areas from Ultrasound Images. Sensors (Basel, Switzerland) 20, doi:10.3390/s20113085 (2020).
16. Srivastava, N. & Salakhutdinov, R. R. Multimodal Learning with Deep Boltzmann Machines. Advances in Neural Information Processing Systems 25, 2222-2230 (2012).
17. Wang, C., Yang, H. & Meinel, C. A deep semantic framework for multimodal representation learning. Multimedia Tools and Applications 75, 9255-9276 (2016).
18. Cheerla, A. & Gevaert, O. Deep learning with multimodal representation for pancancer prognosis prediction. Bioinformatics (Oxford, England) 35, i446-i454 (2019).
19. Silva, L. A. V. & Rohr, K. in 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI). 568-571 (IEEE).
20. Bagheri, A. et al. Multimodal Learning for Cardiovascular Risk Prediction using EHR Data. arXiv preprint arXiv:2008.11979 (2020).
21. Liu, Q. P., Xu, X., Zhu, F. P., Zhang, Y. D. & Liu, X. S. Prediction of prognostic risk factors in hepatocellular carcinoma with transarterial chemoembolization using multi-modal multi-task deep learning. EClinicalMedicine 23, 100379, doi:10.1016/j.eclinm.2020.100379 (2020).
22. Kudo, M. et al. Role of gadolinium-ethoxybenzyl-diethylenetriamine pentaacetic acid-enhanced magnetic resonance imaging in the management of hepatocellular carcinoma: consensus at the Symposium of the 48th Annual Meeting of the Liver Cancer Study Group of Japan. Oncology 84, 21-27 (2013).
23. Hennedige, T. & Venkatesh, S. K. Imaging of hepatocellular carcinoma: diagnosis, staging and treatment monitoring. Cancer imaging: the official publication of the International Cancer Imaging Society 12, 530 (2012).
24. Parikh, T. et al. Focal liver lesion detection and characterization with diffusion-weighted MR imaging: comparison with standard breath-hold T2-weighted imaging. Radiology 246, 812-822 (2008).
25. Silva, A. C. et al. MR imaging of hypervascular liver masses: a review of current techniques. Radiographics 29, 385-402 (2009).
26. Bruix, J. & Sherman, M. Management of hepatocellular carcinoma. Hepatology (Baltimore, Md.) 42, 1208-1236 (2005).
27. Goshima, S. et al. Hepatic hemangioma and metastasis: Differentiation with gadoxetate disodium–Enhanced 3-T MRI. American Journal of Roentgenology 195, 941-946 (2010).
28. Tamada, T. et al. Peripheral low intensity sign in hepatic hemangioma: Diagnostic pitfall in hepatobiliary phase of Gd-EOB-DTPA-enhanced MRI of the liver. Journal of Magnetic Resonance Imaging 35, 852-858 (2012).
29. Omata, M. et al. Asia–Pacific clinical practice guidelines on the management of hepatocellular carcinoma: a 2017 update. Hepatology international 11, 317-370 (2017).
30. Aldrighetti, L., Cetta, F. & Ferla, G. Benign Tumors of the Liver. (Springer, 2015).
31. Bioulac-Sage, P., Laumonier, H., Laurent, C., Blanc, J. F. & Balabaud, C. in Seminars in liver disease. 302-314 (© Thieme Medical Publishers).
32. Sandler, M., Howard, A., Zhu, M., Zhmoginov, A. & Chen, L.-C. in Proceedings of the IEEE conference on computer vision and pattern recognition. 4510-4520.
33. DeLong, E. R., DeLong, D. M. & Clarke-Pearson, D. L. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 44, 837-845 (1988).
34. Landis, J. R. & Koch, G. G. The measurement of observer agreement for categorical data. Biometrics, 159-174 (1977).
35. Jaeschke, R. et al. Users' guides to the medical literature: III. How to use an article about a diagnostic test B. What are the results and will they help me in caring for my patients? JAMA 271, 703-707 (1994).

Supplementary Files

MultimodalSupplementaryTable20210108.docx
SupplementaryFigure1a.tif
Supplementary Figure 1a. Plot of intraclass correlation coefficient (ICC) of intra-observer correlation of the size of images (number of pixels) between original segmentation (A) and re-performed segmentation (B) (ICC 0.990; 95% confidence interval [CI], 0.976 to 0.996).
SupplementaryFigure1b.tif
Supplementary Figure 1b. Plot of ICC of inter-observer correlation between two observers (ICC 0.959; 95% CI, 0.895 to 0.984).
