CEM and MR Radiomics-based Biomarkers to Predict Immunohistochemistry Breast Cancer Subtypes: A comparative study

doi:10.21203/rs.3.rs-2232518/v1

Download PDF

Research Article

CEM and MR Radiomics-based Biomarkers to Predict Immunohistochemistry Breast Cancer Subtypes: A comparative study

https://doi.org/10.21203/rs.3.rs-2232518/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Purpose Accurately predicting the clinical breast cancer subtypes could be extremely helpful for radiologists, pathologists, surgeons, and clinicians and inform future treatment prediction algorithms. Therefore, we evaluate and compare the accuracy of radiomic features extracted from contrast enhanced mammography (CEM) and magnetic resonance imaging (MRI) scans to make predictions to subtypes of breast cancer.

Methods This HIPAA-compliant prospective single institution study was approved by the local institutional review board with written informed consent. Women with breast tumors 2 cm or larger underwent dynamic contrast-enhanced MRI and/or CEM for surgical staging. Semi-manual regions of interest were drawn by radiologist using Cancer Imaging Phenomics Toolkit (CaPTk). Radiomic features were obtained using PyRadiomics and MR- and CEM- based classification models were built on a low-dimensional representation of the features obtained via kernel principal component analysis. We subscribed to an ensemble tree-based learning approach called extremely randomized trees (ERT) to predict cancer subtypes captured via immunohistochemistry markers.

Results For MRI analysis, 124 women with newly diagnosed breast cancer were included in the analysis comprising 49 HR+HER2-, 37 HR+HER2+, 11 HR-HER2+, and 27 HR-HER2- cases. For CEM analysis, models were built using data from 170 female patients including 74 HR+HER2-, 41 HR+HER2+, 14 HR-HER2+, and 43 HR-HER2-. CEM based model resulted in accuracies of 55%, 72%, 88%, and 71% respectively for HR+HER2-, HR+HER2+, HR-HER2+, and HR-HER2- whereas MRI based model alone led to accuracies of 54%, 62%, 89%, and 76% respectively for HR+HER2-, HR+HER2+, HR-HER2+, and HR-HER2-.

Conclusions Radiomic features extracted from CEM and MR were strong predictors of breast cancer subtypes with CEM-based radiomic features performing slightly better, though not statistically significantly better (p = 0.82), than its MRI counterpart.

Breast cancer

Machine learning

Radiomics

Breast cancer subtype

Contrast-enhanced mammography

Breast MR imaging

Breast cancer is the most common female cancer globally, accounting for more than 12% of all new annual cancer cases [1]. It is a heterogeneous disease harboring varying biology, treatment responses, and immunohistochemistry (IHC) markers. Among these, the hormone receptor subtype of breast cancer has particularly received significant attention in clinical practice since the early 2000s and is routinely adopted into clinical guidelines for breast cancer management. The National Comprehensive Cancer Network guideline classifies breast cancer subtypes into the following four categories according to the status of hormone receptor or HR and human epidermal growth factor receptor 2 (HER2): HR + HER2-, HR + HER2+, HR-HER2+, and HR-HER2- (referred to as triple negative breast, TNBC) [1]. Treatment and surveillance plans are entirely dependent on these immunohistochemistry markers.

Breast MRI primarily detects breast cancer by highlighting areas of tumor neoangiogenesis, which demonstrates the underlying tumor vascular physiology. Breast MRI has been the gold standard imaging tool to characterize and locally stage newly diagnosed breast cancer [2–15]. An emerging imaging modality, contrast enhanced mammography (CEM), has also recently attracted broad clinical research interest with the potential to be a low-cost alternative to MR imaging. As a low-cost tool, CEM could become accessible to patients in developing countries where access to MRI tends to be limited. However, unlike MRI, CEM only provides a two-dimensional view of the breast. Therefore, establishing the efficacy of CEM biomarkers compared to MRI and its predictive capability will be crucial for its widespread adoption.

Breast cancer subtype is a significant independent prognostic factor [16]. Accurately predicting the tumor subtype of breast cancer by using MRI and CEM images could be extremely helpful for radiologists, pathologists, surgeons, and clinicians. Prediction of tumor subtype would allow personalized treatment such that aggressive surgical procedures could be avoided. In addition, it would allow treatment costs and side effects reduction, and avoidance of patient inconvenience [17].

This study aims to examine new prognostic, radiomics-based biomarkers (used in combination with current clinical markers) to accurately predict the tumor subtype of breast cancer using MRI and CEM images and compare their predictive performance. Given the importance of breast cancer subtypes as prognostic factors in women with operable breast cancer, we aim to analyze imaging biomarkers embedded within the standard of care imaging studies performed for staging to predict immunohistochemistry-based subtypes. The successful development of such a predictive model will potentially help radiologists, pathologists, surgeons, and clinicians understand features driving breast cancer phenotypes and, more importantly, prognoses using CEM and MRI.

Study population

Data used in this study was acquired from patients enrolled in a large clinical trial at our institution assessing circulating breast tumor DNA. This is a retrospective analysis of prospectively collected patients with T2 or larger tumors who intended to undergo NST. Our patient cohort included women with invasive breast cancer of 2 centimeters or greater at clinical examination or imaging and who were undergoing neoadjuvant systemic therapy (NST) and underwent MR or CEM imaging were between 01/2015 to 01/2021, where CEM and MRI are standard clinical care at our site. Each patient was offered both MRI and CEM, however, some patients opted for one study and declined the other based on claustrophobia, allergy, or reimbursements. The images are collected prior to NST. Pregnant patients and those with ferromagnetic prosthesis were excluded from the study. Patients signed a single consent form for blood and tissue samples and imaging. The Health Insurance Portability and Accountability Act-compliant protocol and the informed consent process were approved by the local site Institutional Review Board.

Patient data were obtained from an IRB approved institutional database containing breast cancer patients. Analysis was conducted on de-identified data. At the time of our study, 124 breast cancer patients had pre-treatment MR imaging, and 170 patients had CEM imaging available for review. Pre-treatment imaging was performed before the start of NST. The breast MRI cohort comprised 49 HR + HER2-, 37 HR + HER2+, 11 HR-HER2+, and 27 TNBC cases. The CEM cohort included 74 HR + HER2-, 41 HR + HER2+, 14 HR-HER2+, and 43 HR-HER2-.

Clinical tumor size in centimeters, as well as tumor location with clock position, was recorded. The institution's breast specialized pathologist performed the histopathologic analysis of surgical specimens. HR positivity (estrogen receptor (ER) positivity or progesterone (PR) positivity or both) and HER2 expression were determined from pre-treatment core biopsy by immunohistochemistry according to ASCO/CAP guidelines and Allred score.

Breast MR Imaging Technique

MRI was performed using a 3.0-T imaging system (GE, Discovery 750W) in the prone position with a dedicated 16-channel breast coil (Sentinelle, In vivo Corp.). Each study included a non–fat-saturated T1-weighted sequence (FSE) in the axial orientation (TR/TE = 700/8.3 ms; matrix = 256 x 192) and a fat-suppressed T2- weighted FSE ASPIR sequence in the sagittal orientation (TR/TE = 4800/79.5 ms; matrix = 256 x 224).

A dynamic contrast-enhanced image set was also acquired, with the first series being an unenhanced fat-saturated gradient-recalled echo T1-weighted sequence (VIBRANT) followed by three dynamic contrast-enhanced fat-saturated T1-weighted gradient-recalled echo series (VIBRANT) performed after IV administration of Gadobutrol (Gadavist, Bayer) at 30 sec, 3 min and 6 min with the use of a weight-based dosing protocol. The dynamic contrast images were acquired in the Sagittal orientation (TR/TE = 5.2/2.3 ms; matrix = 256 x 256) in 2.6 mm slices. Automatic post-processing included the generation of subtraction images between pre, and post-contrast images were produced after each phase. Late gadolinium fat-suppressed T1-weighted fast-spoiled-gradient-echo (FSPGR) sequences were also acquired for both right and left sides separately in the axial orientation (TR/TE = 115/3.15 ms; matrix = 256 x192).

For each examination, T1 weighted dynamic contrast material enhanced MRI images were analyzed for the study. Early phase dynamics subtraction images obtained at 2 mins were used for analysis. Images from the MRI were reviewed, and the lesion was located independently by a radiologist with consensus review by a second radiologist with at least five years of experience (NP, BKP). The tumor location was confirmed on the associated radiology report. MRI imaging assessment included measurement of tumor in the longest diameter. The clinical size was also recorded prior to treatment. The location of the primary tumor on the MRI examination was annotated using CaPTk [18]. Representative MRI images along with regions of interest (ROI) for each of the subtypes are shown in Fig. 1. Here, ROIs for MRI images are in 3D and we only show a representative image here for clarity.

Breast Dual Energy (DE) CEM Technique

All of our DE-CEM examinations were performed on a single commercial DE-CEM model (Hologic). Contraindications to performing CEM at our institution include renal insufficiency (glomerular filtration rate < 30 mL/min/1.73 m²), prior iodinated contrast allergy, and pregnancy. Details regarding the technique have been previously published [19; 20]. For each CEM examination, the recombined subtracted images were analyzed for the study. Images from the CEM were reviewed, and the lesion was located independently by a radiologist with consensus review by a second radiologist (NP, BKP). The location of the primary tumor on the CEM examination was annotated using Horos. Representative images of the CEM and annotations for each of the IHC subtype are shown in Fig. 2.

Radiomic Analysis and feature extraction of CEM and MRI

Before extracting radiomic features from MRI and CEM images, the intensity profile of each of the images was normalized to a scale of 100 and resampled to a pixel spacing of 3, 3, 3 in each X, Y, and Z dimension to standardize all the images. We also discretize the MRI images with a bin width of 5 pixels so that the effect of noisy pixels may be reduced. Radiomic features were extracted using the PyRadiomics v3.0.1 package in Python [21] for MRI and CEM images based on the ROIs delineated by two radiologists with 3 and 14 years of experience through a consensus review. The extracted features were a combination of first-order statistics (19 features), shape-based features that included 16 3D (only extracted for MRI) and ten 2D features, and 75 higher-order statistical features. These higher-order statistical features included 24 gray level co-occurrence matrix (glcm), 16 gray level run length matrix (glrlm), 16 gray level size zone matrix (glszm), five neighboring gray tone difference matrix (ngtdm), and 14 gray level dependence matrix (gldm). A set of 120 radiomic features for each of the MRI and CEM images were extracted from the corresponding raw images. A graphical illustration of these features is presented in Fig. 3. In addition to extracting features from the raw images, we also extract features after processing the images through certain filters. Specifically, we employ two filters, namely, Laplacian of Gaussian (LoG) and wavelet with either a high-pass filter or a low-pass filter in X, Y, and Z axis (when applicable). In total, we extracted 960 features for MRI images and 688 features for CEM images.

Dimensionality reduction and data imbalance

We employed nonlinear principal component analysis (PCA), known as kernel PCA to reduce the dimensionality of radiomic features using the sigmoid kernel [22]. After performing kernel PCA, we retained only the top 50 principal components that explained more than 95% of the variability in the original radiomics feature space.

Classification model

We compare four different machine learning algorithms, including support vector machine (SVM) [23], random forests [24], adaptive boosting (AdaBoost) [25], and extremely randomized trees (ERT) [26]. Table 1 shows the performance comparison of these four algorithms on the original dataset for MRI-based phenotypes for prediction to one of the subtypes. Based on their performances over all metrics, we finally use the ensemble tree-based supervised classification methodology ERT. To avoid overfitting and obtain a consistent estimate of the performance of ERT for classifying IHC subtypes in the absence of an unseen test set, we performed 10-fold cross-validation. Thus, the ratio of training set and testing set is 90:10. It is also important to note that the IHC subtype prediction considered in this work is a four-class problem. However, due to limited sample size, we performed four binary classifications, namely HR + HER2 + vs. others, HR-HER2- vs. others, HR-HER2 + vs. others, and HR + HER2- vs. others. Similar strategies have been adopted in the literature, for instance, [27; 28].

Table 1

Performance comparison of SVM, random forests, AdaBoost, and ERT for HR + HER2- prediction.
	Accuracy	Positive Predictive Value	Recall	Specificity	AUC
SVM	0.54 [0.48, 0.6]*	0.15 [0.0, 0.5]	0.042 [0.0, 0.111]	0.88 [0.8, 0.933]	0.45 [0.36, 0.493]
Random Forests	0.58 [0.542, 0.64]	0.38 [0.0, 0.667]	0.12 [0.0, 0.2]	0.89 [0.8, 0.933]	0.52 [0.473, 0.617]
AdaBoost	0.53 [0.4, 0.64]	0.39 [0.222, 0.571]	0.31 [0.2, 0.444]	0.68 [0.533, 0.8]	0.52 [0.38, 0.6]
ERT	0.54 [0.36, 0.6]	0.42 [0.2, 0.5]	0.37 [0.2, 0.556]	0.63 [0.4, 0.667]	0.55 [0.413, 0.689]
*Range for each of the metric is provided within the square brackets.

Patient Characteristics

Table 2 shows the distribution and characteristics of all the patients analyzed, after exclusion of patients with incomplete annotation. Based on two sample t-test and a chi-square test for categorical variables, no significant difference is observed between the patient population in CEM and MRI datasets for each IHC subtype.

Predictive Ability of MRI-based Phenotypes for IHC subtype prediction

Using the top 50 principal components obtained from kernel PCA, we performed the classification of the IHC subtypes of tumors. The performance metrics reported here were accuracy, positive predictive value, recall, and specificity defined as follows:

\({\text{Accuracy}}=\frac{{{\text{True Positive}}+{\text{True Negative}}}}{{{\text{True Positive}}+{\text{True Negative}}+{\text{False Positive}}+{\text{False Negative}}}}\)

\({\text{Positive Predictive Value}}=\frac{{{\text{True Positive}}}}{{{\text{True Positive}}+{\text{False Positive}}}}\)

\({\text{Recall}}=\frac{{{\text{True Positive}}}}{{{\text{True Positive}}+{\text{False Negative}}}}\)

\({\text{Specificity}}=\frac{{{\text{True Negative}}}}{{{\text{True Negative}}+{\text{False Positive}}}}\)

While accuracy remains the primary indicator for the performance of the classification, we included recall and specificity to establish the performance of the proposed approach considering data imbalance. Table 3 shows the results obtained via the MRI-based phenotypes obtained after performing 10-fold cross-validation. The ROC curves corresponding to the HR-HER2- and HR + HER2 + are shown in Fig. 4.

Predictive Ability of CEM-based Phenotypes for IHC subtype prediction

Data for CEM is available for both the low energy (LE) and DES. High energy images are not used for clinical analysis as low energy images are equivalent to standard full-field mammogram using both low energy and high energy [29]. However, no specific difference was noted in the predictive performance between the LE and DES images. Therefore, to assess the performance of CEM images, we considered the LE CEM images with mediolateral oblique (MLO) view. The same set of features were extracted as the MR images with the exception of 3D shape features as CEM images are two-dimensional. Predictive results are presented in Tables 4, respectively. The ROC curve for HR-HER2- and HR + HER2 + cases are shown in Fig. 5.

Table 2

Characteristic and distribution of patients considered in this study.
	HR + HER2-		HR + HER2+		HR-HER2+		TNBC
	CEM	MRI	CEM	MRI	CEM	MRI	CEM	MRI
Number of patients	74 (43.0%)	49 (39.5%)	41 (23.8%)	37 (29.8%)	14	11 (8.9%)	43 (25.0%)	27
Number of patients	74 (43.0%)	49 (39.5%)	41 (23.8%)	37 (29.8%)	(8.10%)	11 (8.9%)	43 (25.0%)	-21.80%
ER+	46	31	25 (60.9%)	23	11	9 (81.2%)	30	21 (77.8%)
ER+	(62.20%)	(63.30%)	25 (60.9%)	(62.2%)	(78.60%)	9 (81.2%)	(69.80%)	21 (77.8%)
PR+	41	28	24 (58.5%)	23	9	9	29	18
PR+	(55.40%)	(57.10%)	24 (58.5%)	(62.20%)	(64.30%)	(81.20%)	(67.40%)	(66.70%)
Age
Median	54	51	52	53	54	53	58	64
p-value^a	0.734		0.975		0.995		0.233
Range	35–77	34–83	30–80	32–76	38–66	40–66	36–76	37–76
BMI
Median	27.7	27.35	25.4	25.4	24.79	26.7	27.9	27.7
p-value^a	0.746		0.857		0.965		0.721
Range	18.4–44.0	18.4–41.1	18.9–39.2	17.7–38.5	18.9–45.6	19.2–39.0	19.7–43.9	19.7–43.9
Size
Median	4.1	3.7	3.1	2.55	4.95	4.95	2.8	3.7
p-value^a	0.403		0.338		0.978		0.391
Range	0.7–14.0	1.7–12.2	0.6–12.5	0.9-7.0	1.7–8.5	4-10.5	1.1–10.5	1.1–9.6
Postmenopausal status	63.93%	59.20%	56.40%	52.78%	50%	54.50%	71.10%	80%
p-value^b	0.69		0.31		0.49		0.19
Dense Breast	57.63%	55%	65.50%	68.57%	66.67%	63.64%	65.80%	64%
p-value^b	0.29		0.46		1		1
Race
White	49	43	31	27 (72.9%)	11	8	29 (67.4%)	22 (81.5%)
White	(66.20%)	(87.80%)	(75.60%)	27 (72.9%)	(78.60%)	(72.70%)	29 (67.4%)	22 (81.5%)
Hispanic	8	5	4	2	0	1	4	3 (11.10%)
Hispanic	(10.80%)	(10.20%)	(9.80%)	(5.40%)	0	(9.10%)	(9.30%)	3 (11.10%)
Asian/ Pacific Islander	3	0	1	3	1	1	3	0
Asian/ Pacific Islander	(4.50%)	0	(2.40%)	(8.10%)	(7.10%)	(9.10%)	(6.98%)	0
Native American	0	0	1	0	0	0	1	1
Native American	0	0	(2.40%)	0	0	0	(2.33%)	(3.70%)
African American	1	1	0	2	0	0	0	0
African American	(1.4%)	(2.00%)	0	(5.40%)	0	0	0	0
Other/NA	13	0	4 (9.80%)	3	2	1	6	1
Other/NA	(17.60%)	0	4 (9.80%)	(8.10%)	(14.30%)	(9.10%)	(13.96%)	(3.70%)
^a two-sample t-test
^b Chi-square test for categorical variables based on Postmenopausal status and Dense Breast

Table 3

Performance of ERT on the original dataset for MRI-based phenotypes for IHC subtype prediction.
	Accuracy	Positive Predictive Value	Recall	Specificity	AUC
HR-HER2+	0.89 [0.84, 0.92]*	0.08 [0.0, 0.333]	0.13 [0.0, 0.5]	0.84 [0.818, 0.955]	0.42 [0.341, 0.609]
HR + HER2+	0.62 [0.5, 0.72]	0.35 [0.222, 0.5]	0.31 [0.143, 0.375]	0.74 [0.301, 0.675]	0.56 [0.353, 0.643]
HR + HER2-	0.54 [0.36, 0.6]	0.42 [0.2, 0.5]	0.37 [0.2, 0.556]	0.63 [0.4, 0.667]	0.55 [0.413, 0.689]
HR-HER2-	0.76 [0.667, 0.846]	0.25 [0.0, 1.0]	0.2 [0.0, 0.667]	0.89 [0.7, 1.0]	0.59 [0.421, 0.65]
*Range for each of the metric is provided within the square brackets.

Table 4

Performance of ERT on the original dataset for CEM imaging-based phenotypes for IHC subtype prediction.
	Accuracy	Positive Predictive Value	Recall	Specificity	AUC
HR-HER2+	0.88 [0.871, 0.933]*	0.08 [0.0, 0.25]	0.13 [0.0, 0.333]	0.98 [0.821, 1.0]	0.55 [0.339, 0.714]
HR + HER2+	0.72 [0.677, 0.742]	0.43 [0.167, 0.5]	0.15 [0.125, 0.25]	0.91 [0.818, 1.0]	0.59 [0.449, 0.614]
HR + HER2-	0.55 [0.533, 0.6]	0.41 [0.375, 0.5]	0.34 [0.25, 0.5]	0.74 [0.556, 0.778]	0.53 [0.421, 0.57]
HR-HER2-	0.71 [0.677, 0.767]	0.35 [0.0, 0.5]	0.22 [0.0, 0.286]	0.91 [0.826, 1.0]	0.61 [0.523, 0.671]
*Range for each of the metric is provided within the square brackets.

Both MRI- and CEM-based models are able to distinguish the breast cancers with different IHC-based subtypes using a radiomics-based machine learning approach. CEM-based model performs numerically better than the MRI-based model (71.5% vs. 70.2%), although this is not statistically significant (p-value = 0.82). The similar performance between CEM- and MR-based models is somewhat surprising given that CEM only provides a two-dimensional view of the tumor, whereas MRI provides a complete three-dimensional view. We also notice that the overall predictive performance of the HR-HER2- in terms of accuracy, PPV, recall, sensitivity, and AUC is superior to other classes both across CEM and MRI test cases. These results provide evidence that CEM imaging could be as informative as MRI from a machine learning perspective. The unexpected performance of CEM-based model could be potentially attributed to its higher resolution (nearly ten times) in comparison to MR images. High-resolution images preserve the details of the tumor, especially the geometric (or shape-based) features that consistently remain the prominent features in all the classifications, and resolves the presence of microcalcifications.. Alternatively, this could be explained by the larger size of the CEM cohort, which could lead to the slightly better model training and prediction.

While the problem of IHC subtype classification has been recently studied in the literature [27; 28], radiomics-based predictive models are still emerging. Because radiomic features can be automatically extracted from segmented images, they allow fast, quantitative, and reproducible features. This is varied from the current state of BI-RADS classification, which requires trained experts, and has been shown to demonstrate both inter-and intra- reader expert variability [30; 31]. Similar studies have emerged in recent years that focus on the classification of tumor subtypes using radiomics, clinical features, BI-RADS, or a combination. For instance, Wu et al. [32] employed BI-RADS features to classify four different IHC subtypes: Luminal A, Luminal B, HER2, and basal-like breast cancer achieving an accuracy of 74.1% on a cohort of 363 patients. Leithner et al. [28] employed radiomic signatures extracted from CEM images to develop a predictive model using 91 patients from one institution and validated on another institution consisting of 52 patients with an accuracy of 79.4% for Luminal A vs. Luminal B and 77.1% for Luminal B vs. TNBC. However, the authors did not report the recall and specificity of their performance. More recently, Son et al. [27] performed the prediction of IHC subtypes using radiomics signatures of synthetic mammography constructed from the digital breast tomosynthesis (DBT) for a cohort of 365 patients with an accuracy of 81.7%, 76.1%, and 56.3% for TNBC, HER2, and luminal A and B, respectively in an one class vs. others framework using the craniocaudal (CC) view. There was no improvement in the performance when the features from the CC and MLO views were combined. Similarly, our work also demonstrated no significant difference between performance of the model when using CC versus MLO views.

We also studied the importance of radiomic features (without performing kernel PCA) using a game-theoretic approach known as Shapley values [33]. In agreement with existing studies, we note that several shape-based features emerged as prominent features in both the MRI- and CEM-based models. Some of the features that were consistently prominent across all the predictive models include: shape sphericity, axis length, shape flatness, and shape surface area. Specifically, we noted that for HR-HER2- patients, the tumors were consistently round and spherical in shape, whereas for HR + HER2- patients, the tumors were irregular. These observations are aligned with the findings reported in the literature and observed in clinical practice. For instance, Son et al. [27] reported that triple-negative tumors tend to be round or oval. In addition to the shape-based features, we also noted several intensities and correlation-based features to be significant in model prediction, particularly, first-order features such as correlation and entropy extracted from gray level co-occurrence matrix and gray level dependence matrix.

From the present study, as well as other recent reports in the literature, it is evident that radiomic features are effective in distinguishing IHC subtypes. Limitations include classic large \(p\) small \(n\) problem in machine learning, [34] caused by limited number of patients (\(n =\) 170 for CEM and \(n=\)124 for MRI) studied in relationship to the high-dimensionality of features in the dataset (\(p\)= 960 for MRI features). This also limits the development of multi-class predictive models [35]. Class imbalance leads to poor precision and recall performance of the predictive models, and while synthetic resampling strategies could help augment the existing datasets, they seldom improve the predictive performance. The issue of class imbalance could partially be addressed by analyzing larger cohorts. To avoid a problem of data harmonization from inter-scanner and/or inter-radiologist variability, we use data collected from a single scanner annotated by the same set of radiologists, which leads to limited generalizability. The high model complexity and black-box nature of machine learning models employed limits their interpretability. The authors' ongoing work is focused on making these machine learning models more interpretable so that the inference generated from these models may not only lead to more understanding of the biology but also to informing practitioners in the decision-making process. Interpretability and model fairness also allow for monitoring against potential biases associated with the underrepresentation of racial minorities in most datasets. In general, these limitations are being addressed via multi-institutional collaboration for the generation of much larger and diverse datasets for generation, and comparison of different models.

For the breast MRI image analyses in this study, we utilized only dynamic contrast-enhanced images and did not incorporate the associated T2 weighted imaging in our analysis [36; 37]. The expectation would be that these sequences would provide additional surrogates for biological data of the tumors and will be included in future studies. The patients included in the study had biopsy-proven invasive breast cancer prior to undergoing MRI and CEM contrast-enhanced imaging, where by post biopsy change may confound our results. The heterogeneous enhancement that can occur in a post biopsy bed can alter the appearance of the native tumor. However, this is standard clinical care and beneficial to build models as imaging in true clinical practice is available.

MRI- and CEM-based machine learning models demonstrate a comparable performance based on radiomic features to classify breast cancer according to known IHC subtypes. Via an ensemble machine learning algorithm known as extremely randomized trees, we show that MRI and CEM based radiomic features can predict IHC subtypes with 70.2% and 71.5% accuracy, respectively. Using feature importance, we note that shape-based features such as shape sphericity, elongation, and shape surface area are consistently the most prominent features across all the predictive models developed in this study. Our current and future works focus on validating our machine learning model on a large cohort of patients, generating interpretability for black-box machine learning methods. Doing so, will help improve clinician uptake of these models in clinical practice. Quantitative prognostic and predictive models to predict complete pathological response to neoadjuvant chemotherapy will help strengthen precision medicine and lead to improved patient survival, without the otherwise additional unnecessary treatment side effects from therapies that would have been futile.

DE-CEM: Dual Energy Contrast Enhanced Mammography

MRI: Magnetic Resonance Imaging

HIPAA: Health Insurance Portability and Accountability Act

CaPTk: Cancer Imaging Phenomics Toolkit

ERT: Extremely Randomized Trees

IHC: Immunohistochemistry

HR: Hormone Receptor

HER2: Human Epidermal Growth Factor Receptor 2

TNBC: Triple Negative Breast Cancer

NST: Neoadjuvant Systemic Therapy

FSE: Fast Spin Echo

TR/TE: Repetition Time and Echo Time

FSPGR: Fast Spoiled Gradient Echo

ASPIR: Adiabatic Spectral Inversion Recovery

DNA: Deoxyribonucleic Acid

VIBRANT: T1-weighted Volume Imaging for Breast Assessment

ASCO/CAP: American Society of Clinical Oncology/College of American Pathologists

ROI: Region of Interest

LoG: Laplacian of Gaussian

PCA: Principal Component Analysis

ROC: Receiver Operating Characteristic

LE: Low- Energy

DES: Dual-Energy Subtraction

MLO: Mediolateral Oblique

AUC: Area Under Curve

PPV: Positive Predictive Value

BI-RADS: Breast Imaging-Reporting and Data System

DBT: Digital Breast Tomosynthesis

RCB: Residual Cancer Burden

pCR: Pathological Complete Response

Acknowledgments

The authors are grateful for the kind support provided by the Gerstner Family and the Brandt Young Scholarship from the Centers of Individualized Medicine and ASU-Mayo Clinic Summer residency program. The data employed in this study was collected at the Mayo Clinic Arizona.

Funding. This study has received funding from Gerstner Family and the Brandt Young Scholarship from the Centers of Individualized Medicine and ASU-Mayo Clinic Summer residency program.

Competing interests. The authors of this manuscript declare no relationships with any companies, whose products or services may be related to the subject matter of the article.

Authors’ contributions. Ashif Iquebal, Siqiong Zhou, Nicholaus Pfeiffer, Sara Ranjbar, Imon Banerjee, and Bhavika K. Patel analyzed and interpreted patient data. Ashif Iquebal, Siqiong Zhou, Nicholaus Pfeiffer, and Bhavika K. Patel developed the statistical models. Ashif Iquebal, Sara Ranjbar, Imon Banerjee, Kristin Swanson, Felipe Batalini, Karen S. Anderson, Muhammad Murtaza, and Barbara A. Pockaj prepared and revised the manuscript.

Data availability. The data in the study was obtained by Bhavika K. Patel at Mayo Clinic of Arizona, under the IRB approved institutional database containing breast cancer patients. The codes developed in this study will be available from the authors upon request.

Ethics approval. Institutional Review Board approval was obtained.

Consent to participate. Only if the study is on human subjects: Written informed consent was obtained from all subjects (patients) in this study.

Consent to publish. The authors affirm that human research participants provided informed consent for publication of the images in Fig. 1, Fig. 2, and Fig.3.

Howlader, N., Noone, A., Krapcho, M., et al. (2021). SEER cancer statistics review, 1975-2018. National Cancer Institute, 2008.
Hylton, N. M., Blume, J. D., Bernreuter, W. K., et al. (2012). Locally advanced breast cancer: MR imaging for prediction of response to neoadjuvant chemotherapy—results from ACRIN 6657/I-SPY TRIAL. Radiology, 263(3), 663-672.
Partridge, S. C., Gibbs, J. E., Lu, Y., et al. (2005). MRI measurements of breast tumor volume predict response to neoadjuvant chemotherapy and recurrence-free survival. American Journal of Roentgenology, 184(6), 1774-1781.
Croshaw, R., Shapiro-Wright, H., Svensson, E., Erb, K., & Julian, T. (2011). Accuracy of clinical examination, digital mammogram, ultrasound, and MRI in determining postneoadjuvant pathologic tumor response in operable breast cancer patients. Annals of surgical oncology, 18(11), 3160-3163.
Londero, V., Bazzocchi, M., Del Frate, C., et al. (2004). Locally advanced breast cancer: comparison of mammography, sonography and MR imaging in evaluation of residual disease in women receiving neoadjuvant chemotherapy. European radiology, 14(8), 1371-1379.
Ko, E. S., Han, B.-K., Kim, R. B., et al. (2013). Analysis of factors that influence the accuracy of magnetic resonance imaging for predicting response after neoadjuvant chemotherapy in locally advanced breast cancer. Annals of surgical oncology, 20(8), 2562-2568.
Shin, H., Kim, H., Ahn, J., et al. (2011). Comparison of mammography, sonography, MRI and clinical examination in patients with locally advanced or inflammatory breast cancer who underwent neoadjuvant chemotherapy. The British journal of radiology, 84(1003), 612-620.
Akazawa, K., Tamaki, Y., Taguchi, T., et al. (2006). Preoperative evaluation of residual tumor extent by three‐dimensional magnetic resonance imaging in breast cancer patients treated with neoadjuvant chemotherapy. The breast journal, 12(2), 130-137.
Yeh, E., Slanetz, P., Kopans, D. B., et al. (2005). Prospective comparison of mammography, sonography, and MRI in patients undergoing neoadjuvant chemotherapy for palpable breast cancer. American Journal of Roentgenology, 184(3), 868-877.
Montemurro, F., Martincich, L., De Rosa, G., et al. (2005). Dynamic contrast-enhanced MRI and sonography in patients receiving primary chemotherapy for breast cancer. European radiology, 15(6), 1224-1233.
Weatherall, P., Evans, G. F., Metzger, G. J., Saborrian, M. H., & Leitch, A. M. (2001). MRI vs. histologic measurement of breast cancer following chemotherapy: comparison with x‐ray mammography and palpation. Journal of Magnetic Resonance Imaging, 13(6), 868-875.
Rosen, E. L., Blackwell, K. L., Baker, J. A., et al. (2003). Accuracy of MRI in the detection of residual breast cancer after neoadjuvant chemotherapy. American Journal of Roentgenology, 181(5), 1275-1282.
Lobbes, M., Prevos, R., Smidt, M., et al. (2013). The role of magnetic resonance imaging in assessing residual disease and pathologic complete response in breast cancer patients receiving neoadjuvant chemotherapy: a systematic review. Insights into imaging, 4(2), 163-175.
Lobbes, M. (2012). Treatment response evaluation by MRI in breast cancer patients receiving neoadjuvant chemotherapy: there is more than just pathologic complete response prediction. Breast cancer research and treatment, 136(1), 313-314.
Marinovich, M. L., Houssami, N., Macaskill, P., et al. (2013). Meta-analysis of magnetic resonance imaging in detecting residual breast cancer after neoadjuvant therapy. Journal of the National Cancer Institute, 105(5), 321-333.
Hwang, K. T., Kim, J., Jung, J., Chang, J. H., Chai, Y. J., Oh, S. W., ... & Hwang, K. R. (2019). Impact of Breast Cancer Subtypes on Prognosis of Women with Operable Invasive Breast Cancer: A Population-based Study Using SEER DatabaseBreast Cancer Subtype and Prognosis. Clinical Cancer Research, 25(6), 1970-1979.
Heil, J., Kuerer, H. M., Pfob, A., Rauch, G., Sinn, H. P., Golatta, M., ... & Peeters, M. V. (2020). Eliminating the breast cancer surgery paradigm after neoadjuvant systemic therapy: current evidence and future challenges. Annals of Oncology, 31(1), 61-71.
Yushkevich, P. A., Gao, Y., & Gerig, G. (2016). ITK-SNAP: An interactive tool for semi-automatic segmentation of multi-modality biomedical images. Paper presented at the 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).
Sensakovic, W. F., Carnahan, M. B., Czaplicki, C. D., et al. (2021). Contrast-enhanced Mammography: How Does It Work? RadioGraphics, 41(3), 829-839.
Patel, B. K., Lobbes, M., & Lewin, J. (2018). Contrast enhanced spectral mammography: a review. Paper presented at the Seminars in Ultrasound, CT and MRI.
Van Griethuysen, J. J., Fedorov, A., Parmar, C., et al. (2017). Computational radiomics system to decode the radiographic phenotype. Cancer research, 77(21), e104-e107.
Schölkopf, B., Smola, A., & Müller, K.-R. (1997). Kernel principal component analysis. Paper presented at the International conference on artificial neural networks.
Boser, B. E., Guyon, I. M., & Vapnik, V. N. (1992, July). A training algorithm for optimal margin classifiers. In Proceedings of the fifth annual workshop on Computational learning theory (pp. 144-152).
Breiman, L. (2001). Random forests. Machine learning, 45(1), 5-32.
Freund, Y., & Schapire, R. E. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. Journal of computer and system sciences, 55(1), 119-139.
Geurts, P., Ernst, D., & Wehenkel, L. (2006). Extremely randomized trees. Machine learning, 63(1), 3-42.
Son, J., Lee, S. E., Kim, E.-K., & Kim, S. (2020). Prediction of breast cancer molecular subtypes using radiomics signatures of synthetic mammography from digital breast tomosynthesis. Scientific reports, 10(1), 1-11.
Leithner, D., Horvat, J. V., Marino, M. A., et al. (2019). Radiomic signatures with contrast-enhanced magnetic resonance imaging for the assessment of breast cancer receptor status and molecular subtypes: initial results. Breast Cancer Research, 21(1), 1-11.
Francescone, M. A., Jochelson, M. S., Dershaw, D. D., Sung, J. S., Hughes, M. C., Zheng, J., ... & Morris, E. A. (2014). Low energy mammogram obtained in contrast-enhanced digital mammography (CEDM) is comparable to routine full-field digital mammography (FFDM). European journal of radiology, 83(8), 1350-1355.
Soille, P. (2013). Morphological image analysis: principles and applications: Springer Science & Business Media.
Baker, J. A., Kornguth, P. J., & Floyd Jr, C. E. (1996). Breast imaging reporting and data system standardized mammography lexicon: Observer variability in lesion description. AJR. American journal of roentgenology, 166(4), 773-778.
Wu, M., Zhong, X., Peng, Q., et al. (2019). Prediction of molecular subtypes of breast cancer using BI-RADS features based on a “white box” machine learning approach in a multi-modal imaging setting. European journal of radiology, 114, 175-184.
Lundberg, S. M., & Lee, S.-I. (2017). A unified approach to interpreting model predictions. Paper presented at the Proceedings of the 31st International Conference on Neural Information Processing Systems.
Huynh, P.-H., Nguyen, V. H., & Do, T.-N. (2020). Improvements in the large p, small n classification issue. SN Computer Science, 1(4), 1-19.
Marino, M. A., Leithner, D., Sung, J., et al. (2020). Radiomics for tumor characterization in breast cancer patients: a feasibility study comparing contrast-enhanced mammography and magnetic resonance imaging. Diagnostics, 10(7), 492.
Parikh, J., Selmi, M., Charles-Edwards, G., et al. (2014). Changes in primary breast cancer heterogeneity may augment midtreatment MR imaging assessment of response to neoadjuvant chemotherapy. Radiology, 272(1), 100-112.
Chamming’s, F., Ueno, Y., Ferré, R., et al. (2018). Features from computerized texture analysis of breast cancers at pretreatment MR imaging are associated with response to neoadjuvant chemotherapy. Radiology, 286(2), 412-420.

Download PDF

Version 1

posted

You are reading this latest preprint version

CEM and MR Radiomics-based Biomarkers to Predict Immunohistochemistry Breast Cancer Subtypes: A comparative study

Status:

Version 1

Abstract

Background

Methods

Study population

Breast MR Imaging Technique

Breast Dual Energy (DE) CEM Technique

Radiomic Analysis and feature extraction of CEM and MRI

Dimensionality reduction and data imbalance

Classification model

Results

Patient Characteristics

Predictive Ability of MRI-based Phenotypes for IHC subtype prediction

Predictive Ability of CEM-based Phenotypes for IHC subtype prediction

Discussion

Conclusion

Abbreviations

Declarations

References

Status:

Version 1