AI-based Strategies in Breast Mass ≤ 2cm classification with Mammography and Tomosynthesis: A Retrospective Evaluation

doi:10.21203/rs.3.rs-3427561/v1

Download PDF

Research Article

AI-based Strategies in Breast Mass ≤ 2cm classification with Mammography and Tomosynthesis: A Retrospective Evaluation

https://doi.org/10.21203/rs.3.rs-3427561/v1

This work is licensed under a CC BY 4.0 License

You are reading this latest preprint version

Purpose To evaluate the efficiency of digital mammography (DM) and combined digital breast tomosynthesis (DBT) on AI-based strategies for breast mass ≤ 2cm classification.

Methods DM and DBT images in 483 patients including 512 breast masses were acquired from November 2018 to November 2019. The radiomics and deep learning methods were employed to extract the breast mass features in images and finally for benign and malignant classification. The DM and combined DBT (DM+DBT) images were fed into radiomics and deep learning models to construct corresponding models, respectively. The area under the receiver operating characteristic curve (AUC) was estimated models performance. A comprehensive comparison of the subgroups AUCs of the best optimal model was calculated on age, tumor size, and breast density category.

Results In the testing dataset, the AUC of DM combined DBT by radiomics and deep learning models were 0.869 and 0.908, respectively. Compared with the DM model, the combined DBT models based on radiomics and deep learning both showed statistically significant higher AUCs (0.869 vs. 0.810, P<0.001, by radiomics; 0.908 vs. 0.867, P<0.001, by deep learning). The deep learning models present superior than the radiomics models in the experiments with only DM (P<0.001) and DM+DBT (P<0.003). The advantage of the deep learning model is especially prominent in patients with small masses less than 1cm, 20 to 40 years old, and dense breast.

Conclusions Deep learning model based on DM+DBT has a best diagnostic efficiency. AI-based stragies will play a major role in detecting early breast cancer in screening.

breast mass

mammography

digital breast tomosynthesis

radiomics

deep learning

1st Key Point: The survival was higher for women with breast tumors less than 2cm.

2nd Key Point: The deep learning model based on DM+DBT images significantly has a best diagnostic efficiency .

3rd Key Point:AI-based DBT will play a major role in detecting early breast cancer in screening.

In digital mammography (DM), dense fibroglandular overlap reduces the sensitivity of cancer detection, and the conspicuity of lesion features. The 20%-30% of breast cancers were not detected in screening^[1]. Digital breast tomosynthesis (DBT) is a special-angle tomographic technique obtined multiple mutil-angle images to reconstruct three-dimensional data^{[2, 3]}. For breast cancer detection, DBT has a higher sensitivity than mammograms. DBT significantly improved assessment for mass, distortions, and asymmetrically dense than DM^[4.5]. The mass lesions, especially small masses, can be obscured by fibroglandular tissue and lead to missed diagnosis in dense breast. Though DBT can effectively find some small masses, it is difficult to classify benign and malignant by visual assessment due to morphological features of small masses are not easy to observe. Additionly, DBT has significantly longer reading times than DM^{[6, 7]}, approximately doubled^[8] and the cognitive and perceptual errors also present^[9]. Although DBT may still exhibit occasional inaccuracies, it nevertheless maintaions a supior performance compared to DM,

Many researchers also explored to promote the diagnostic accuracy and efficiency of radiologists by the technique of computer-aided detection benefiting from machine learning^[10–12]. In DBT, higher dimensional and volumetric information were captured more sufficient and high-level features. Samala et al. attempted to classify masses in DBT with compressed deep convolutional neural network (DCNN) model^[13]. Osteras et al^[14].reported that DBT enabled the detection of more cancers in all density and age groups compared with DM, especially cancers classified as spiculated masses and architectural distortions. Fan et al^[15] found that the deep learning-based mass detection performed better for larger masses than smaller masses. Currently, the above literatures reports focus on mass type lesions. However, whether the 3D deep learning methods are superior to the methods for detecting mass with less than 2cm remains unknown. The survival of women with tumors less than 2cm was higher than women with tumors greater than 2cm^[26].

This study aims that was to develop superior model for masses presenting as less than 2cm for distinguishing malignant and benign mass using DM combined DBT as possible are diagnosed at or less than 2cm (early T1 stage). And the combination strategies of DM and DBT were developed, and to compared the efficiency in patients with different characteristics.

Patients

The imaging and clinical data were retrospectively collected from the department of breast imaging of Tianjin Medical University Cancer Institute and Hospital from November 2018 to November 2019 with Institutional Review Board (IRB) approval. Individual consent for this retrospective analysis was waived. The inclusion criterias included (a) preoperative DM and DBT images were collected and (b) available pathological reports by surgery. The exclusion criterias included (a) patients received preoperative neoadjuvant chemotherapy treatment and (b) patients received local resection of lesions before DBT examination. A total of 512 DM and DBT images in 483 consecutive patients were included, with unilateral breast masses in 454 patients, and bilateral breast masses in 29 patients. The data were randomly divided into the training set, the validation set and the test set in a 3:1:1 ratio, resulting in 308, 102 and 102 lesions respectively in these three sets without patient overlap. A total of 49 masses in DM and DBT images of 49 consecutive patients were included as external validation dataset from the department of breast imaging of Qingdao University Affiliated Hospital from March 2022 to December 2022. The breast density was classified according to the Breast Imaging Reporting and Data System (BI-RADS) ACR categories, ranging from A to D. The breasts with up to 25% and 26–50% gland parenchyma were defined as ACR A and B. The breasts with 51–75% and more than 75% gland parenchyma were classified as ACR C and D. Malignant and benign tumors were determined by biopsies using histological analysis and follow-up with 24 months .

Digital mammography (DM) and Digital mammography tomosynthesis(DBT)technique

On the internal and external dataset, craniocaudal (CC) and mediolateral oblique (MLO) view on DM and DBT images were acquired by a Selenia Dimensions TM unit (Hologic, American) using a total tomographic angular range of 15◦ and 15 projection views (PVs). The DBT images were reconstructed at 1-mm slice spacing.

ROI extraction

The tumor region of interest (ROI) in DM and DBT was manually segmented by two radiologists with 5 and 10 years experience in breast mammography diagnosis, and the segmentation tool used for annotation was Deepwise Multi-modal Research Platform V1.0, which provided required tools and functions for DM and DBT image reading. Disagreements of ROI were settled by discussion to reach consensus.

Development of Radiomics and Deep Learning Models

The overall pipeline was showed in Fig. 1 and specifically explained as follows.

-Radiomics Model

In the radiomics model, six calculation methods provided by the open source image toolbox Pyradiomics were used to extract radiomic features of lesion regions in DM and DBT images. The types of radiomics featues include first-order statistics, shape (2D), gray-level co-occurrence matrix (GLCM), gray level run length matrix (GLRLM), gray level size zone matrix (GLSZM) and gray level dependence matrix (GLDM). For each lesion, we extracted a total of 825 radiomic features. To avoid model overfitting, for the extracted features, Joint Hypothesis Test was used for feature selection. When the linear correlation coefficient between any two features is greater than a certain threshold, the one with less impact on the benign/malignant classification will be removed. Based on the selected radiomics features, a logistic regression algorithm was employed to construct a classification model. In the radiomics classification experiment, L1 regularization was introduced to further mitigate overfitting.

-Deep Learning Model

The DL model and training procedure were implemented using the PyTorch DL framework, and the base model architecture is a common used model of ResNet-34. In the model construction, all DM and DBT images were grouped by individual lesions and classification is in lesion level. Each lesion contains multiple images from the mediolateral oblique (MLO) and craniocaudal (CC) views. In the DM model, all images of each lesion was fed into the network to extract deep features and then the last layer features of different images were merged by concatenation. Then, the classifier was then employed to output the estimated probabilities of benign or malignant. Similarly, in the combined model using DM and DBT, the added DBT images were also fed into the same network and then all the features were merged and finally by classifier to generate the probabilities.

The binary cross-entropy loss was employed for model training. All the layers of the ResNet model was optimized using a initial learning rate of 0.001 with a weight decay rate of 0.00001. The optimizer adopted was stochastic gradient descent (SGD) with the momentum of 0.9. The batch size was set as 64 and the training epoch as 100. We implemented the networks using the open-source PyTorch (https://pytorch.org/) framework.

Evaluation and Statistical Analysis

To evaluate the diagnosis performance of different models, we applied 5-fold cross-validation. In this setting, the whole dataset was randomly divided into five sets with an equal number of lesions And in each fold, four sets were used for training and the rest one set for testing. The criterion for model performance estimation consists of the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity and specificity on the test dataset. The significance of the difference between AUCs was estimated by using DeLong test and a statistically significant difference was indentified if a P value less than 0.05. A comprehensive comparison of the subgroups AUCs of the four models was calculated on patients based on age, tumor size, and breast density category. To verify the generalization performance of the models, we tested all the radiomics and deep learning models on the external validation dataset from another hospital .

Study Population

On internal dataset, a total of 512 masses in DM and DBT images of 483 consecutive patients were included. The DBT data set included 1024 views from 512 masses images. The mean age was 50.26 years old and the median age was 49 years old (range, 19–81). Among samples, 273 (53.3%) were benign by biopsy and follow-up with 24 months and 239 (46.7%) were confirmed to be malignant tumors by biopsy. For mass sizes, 108 (21.1%) were in size less than 1cm and 404 (78.9%) were in size between 1cm and 2cm. The ACR A, B, C, and D of breast density represented 15 (2.9%), 57(11.1%), 387(65.6%), and 53 (10.4%), respectively. Table 1 shows the characteristics of the samples included in this study.

On external dataset, a total of 49 masses in DM and DBT images of 49 consecutive patients were included. The DBT data set included 98 views from 49 masses images. The mean age was 48.61 years old and the median age was 49 years (range, 31–70). Among samples, 17 (34.7%) were benign and 32 (65.3%) were confirmed to be malignant tumors by biopsy. For mass sizes, 4 (8.2%) were in size less than 1cm and 45 (91.8%) were in size between 1cm and 2cm. The ACR A, B, C, and D of breast density represented 0 (0.0%), 11 (22.4%), 31 (63.3%), and 7 (14.3%), respectively. Table 1shows the characteristics of the samples included in this study.

Table 1

Distribution of clinicopathological characteristics
Characteristics	Internal Dataset (n = 512)	External Dataset (n = 49)
Age
- ≤30	22(4.3%)	0(0.0%)
−31ཞ40	87(17.0%)	11(22.4%)
−41ཞ50	162(31.6%)	16(32.7%)
−51ཞ60	129(25.2%)	17(34.7%)
−61ཞ70	97(18.9%)	5(10.2%)
- >70	15(2.9%)	0(0.0%)
Breast density
- ACR A	15(2.9%)	0(0.0%)
- ACR B	57(11.1%)	11(22.4%)
- ACR C	387(65.6%)	31(63.3%)
- ACR D	53(10.4%)	7(14.3%)
Tumor size
- ≤1cm	108(21.1%)	4(8.2%)
−1ཞ2cm	404(78.9%)	45(91.8%)
Histological types
- benign	273(53.3%)	17(34.7%)
- DCIS	16(3.1%)	1(2.0%)
- IDC	223(43.6%)	31(63.3%)

Radiomics Model on Internal Dataset

The AUC of DM model based on radiomic features is 0.810. When combiningradiomic features of DBT and DM, the AUC is promoted to 0.869 with an considerable improvement (P = 0.000). The results of accuracy, sensitivity and specificity are all with similar trends between DM and DM combined DBT (Table 2 and Table 3). The combined radiomics model achieves an accuracy of 78.2%,a sensitivity of 79.1% and a specificity of 77.4%. Among the radiomic features in DM model, there are 20 features after feature selection and the two most discriminative ones are wavelet-LLH_glrlm_RunEntropy, original_glszm_ZoneEntropy, and in combined model, 25 features are selected and the two most discriminative ones are log-sigma-4-0-mm-3D_firstorder_Skewness,log-sigma-1-0-mm-3D_glszm_SizeZoneNonUniformity. The features of RunEntropy and ZoneEntropy both exhibit texture features of tumor. The Skewness and SizeZoneNonUniformity imply asymmetry and heterogeneity of the tumor region.

Deep Learning Model on Internal Dataset

The AUC of deep learning model using DM is 0.867 and shows a appreciable improvement to 0.908 (P = 0.001) when combining DM and DBT images. The combined DL model obtains an accuracy of 83.3%, a sensitivity of 86.2% and a specificity of 80.7%.

In a word, the performances of DM + DBT are all better than DM in radiomics (0.869 vs. 0.810, P = 0.000) and deep learning (0.908 vs. 0.867, P = 0.001) models. The results suggest DBT has the added value for breast lesion diagnosis. From the perspective of model types, deep learning shows more powerful capability than radiomics in breast lesion characterization with higher AUC in both DM (0.867 vs. 0.810, P = 0.001) and DM + DBT (0.908 vs. 0.869, P = 0.003) models (Table 2 and Table 3, Fig. 2A).

The detail results of 5-fold are showed in Table 4. These results demonstrate that the models are general in each fold with independent training and test data.

Results on External Dataset

The test AUCs and comparison results of different models are list in Table 2 and Table 3. These results show similar trends as in the internal cross validation results, demonstrating the superior performance of DM + DBT compared with only DM both in radiomics and deep learning models. Regrettably, due to the limited amount of data, the comparison results between models of DM and DM + DBT on the external validation set do not demonstrate statistical significance.

Table 2. Classification Performance with DM and DM+DBT using Radiomics and Deep Learning Models
	Models	AUC	accuracy	sensitivity	specificity
Internal dataset	radiomics
	-DM（M1）	0.810	75.3%	74.9%	75.6%
	-DM+DBT（M2）	0.869	78.2%	79.1%	77.4%
	deep learning
	-DM（M3）	0.867	77.6%	81.6%	74.1%
	-DM+DBT（M4）	0.908	83.3%	86.2%	80.7%
External dataset	radiomics
	-DM（M1）	0.740	71.4%	72.7%	68.8%
	-DM+DBT（M2）	0.807	73.5%	75.8%	68.8%
	deep learning
	-DM（M3）	0.815	73.5%	75.8%	68.8%
	-DM+DBT（M4）	0.854	79.6%	81.8%	75.0%
DM, digital mammography; DBT, digital tomography mammography; AUC, area under the ROC curve; M1,radiomics model based DM; M2, radiomics model based DM+DBT; M3, deep learning model based DM; M4, deep learning model base DM+DBT

Table 3

P Value of Comparison between Different Models in AUC
	P (Internal dataset)	P (External dataset)
M1 vs. M2	0.000	0.215
M1 vs. M3	0.001	0.184
M1 vs. M4	0.000	0.056
M2 vs. M3	0.909	0.924
M2 vs. M4	0.003	0.223
M3 vs. M4	0.001	0.257
M1,radiomics model based DM; M2, radiomics model based DM + DBT; M3, deep learning model based DM; M4, deep learning model base DM + DBT

Table 4

AUC results of each fold on respective test set
Models	Fold 1	Fold 2	Fold 3	Fold 4	Fold 5
radiomics
-DM	0.801	0.794	0.823	0.817	0.785
-DM + DBT	0.855	0.848	0.882	0.875	0.850
deep learning
-DM	0.851	0.850	0.877	0.862	0.879
-DM + DBT	0.893	0.885	0.915	0.911	0.892

Subgroups analysis of Radiomics and Deep Learning Models based on age, breast density, and tumor size on Internal Dataset

Table 5 illustrates the AUC of radiomics and deep learning models using DM and DM + DBT based on different subgroups of the size, age, and breast density. The performances of both models improve with increasing size, age and decreasing breast density. The performances of the deep learning models based on DM + DBT on patients with different characteristics are shown in B, C, D of Fig. 2, respectively.

The AUCs of radiomics model using DM + DBT and deep learning model using DM are 1.000 and clearly achieved the most superior performance on the fat breast (breast density A). The AUC of radiomics model using DM is 0.595 and shows the worst performance on the dense breast (breast density D), but obviously improves to 0.886 in deep learning model using DM + DBT.

The deep learning model based on DM + DBT achieved better AUCs for almost all the age ranges compared to other models. For the women with older than 60 years, AUCs are 0.922, showing the same performance in the radiomics model using DM + DBT, deep learning model using DM and DM + DBT. For younger women form 20 years to 40 years, it was expected to have relatively lower AUCs, with values of 0.766, 0.777, 0.736, and 0.843 for Models 1 ~ 4, respectively, compared to the overall AUCs of 0.810, 0.869, 0.867, and 0.908. In the age subgroup from 40 years to 60 years, there are similar AUCs of with 0.793, 0.869, 0.872, and 0.901 as the overall AUCs in models 1 ~ 4.

The deep learning model based on DM + DBT shows better diagnostic efficiency than other models for all the sizes of masses (Figs. 3 and 4). It also performed better for larger masses compared to smaller masses (Fig. 2D). For the mass smaller than 1cm in size, the deep learning model using DM + DBT has a better performance with the AUC od 0.866, while the radiomics model using DM has a lower AUC of 0.742.

Table 5

The AUC of DM and DM + DBT using Radiomics and Deep Learning Models according to the size, age, and breast density
		radiomics		deep learning
	Group (total/pos/neg)^a	DM	DM + DBT	DM	DM + DBT
Size
	< 1cm (108/23/85)	0.742 [0.607,0.877]^b	0.791 [0.674,0.908]	0.819 [0.718,0.921]	0.866 [0.769,0.963]
	1 ~ 2cm (404/216/188)	0.803 [0.760,0.846]	0.865 [0.830,0.899]	0.864 [0.829,0.898]	0.906 [0.877,0.935]
Age
	20 < age ≤ 40 (109/18/91)	0.766 [0.642,0.890]	0.777 [0.641,0.914]	0.736 [0.609,0.864]	0.843 [0.716,0.971]
	40 < age ≤ 60 (291/141/150)	0.793 [0.739,0.847]	0.869 [0.828,0.910]	0.872 [0.832,0.912]	0.910 [0.877,0.943]
	age > 60 (112/80/32)	0.861 [0.791,0.931]	0.922 [0.872,0.972]	0.922 [0.873,0.971]	0.922 [0.874,0.970]
breast density
	ACR A (15/11/4)	0.954 [0.847,1.]	1.000 [1. 1.]	1.000 [1. 1.]	0.977 [0.914,1.]
	ACR B (57/36/21)	0.912 [0.843,0.981]	0.928 [0.866,0.990]	0.941 [0.888,0.995]	0.972 [0.934,1.]
	ACR C (387/180/207)	0.800 [0.754,0.846]	0.861 [0.824,0.898]	0.861 [0.824,0.897]	0.890 [0.857,0.923]
	ACR D (53/12/41)	0.595 [0.378,0.812]	0.733 [0.544,0.922]	0.682 [0.507,0.858]	0.886 [0.767,1.]
^a total,pos,neg denote number of total, positive and negative masses in each group ^bAUC values are showed as mean and 95% confidence interval (CI)

In this study, the deep learning model based DM + DBT images showed the best discriminative performance among the four models evaluated. It achieved an AUC of 0.908, accuracy of 83.3%, sensitivity of 86.2%, and specificity of 80.7% on internal dataset. When the external data was used for verification, the deep learning model based on DM and DBT still outperformed the other models, with an AUC of 0.854, accuracy of 79.6%, sensitivity of 81.8%, and specificity of 75.0%. Moreover, on internal dataset, the performance of deep learning model was assessed in different subgroups of patients with varying tumor sizes, age ranges, and breast densities.The advantage of the deep learning model based DM + DBT images is especially prominent in patients with small masses less than 1cm, 20 to 40 years old, and dense breast.

In recent years, it has been a emerging trendency that radiomics become a promising method in disease diagnosis because it extracts quantitative imaging features for pathophysiology characteristics description^[17–20]. In our study, the radiomics model based DM + DBT exhibited more power than the radiomics model based DM, with an AUC of 0.869 for the radiomics model based DM + DBT and 0.810 for the radiomics model based DM in the AUC of the ROC curves. Another study also demonstrated that showed that the AUC of the test ROC curves for classification of mass for the 2D alone, 3D alone, and combined 3D and 2D were 0.85, 0.86, and 0.91, respectively^[21]. And the most frequently features included two morphological features, six global and five local SGLD texture features for the 2D detection approach and two morphological features, four gray features, two RLS texture and one SGLD texture feature for the 3D detection approach. In our study, among the 825 radiomic features, the gray-level characteristics were higher value, which indicates the more complicated texture and heterogeneity of the tumoral region. This fingding was consistent with previous reports by Kontos’s group^{[22, 23]}, who also found that texture features were closely correlated to breast cancer in the DBT images. Niu et al.^[24] evaluated the tumoral and peritumoral regions in the breast DBT image in differentiating malignant from benign lesions using handcrafted and deep features and developed a radiomics nomogram by integrating the radiomics signature and important clinical factors for facilitating early diagnosis of breast cancer. Fusion DM and DBT images is a promising approach to small mass detection. A larger data set should be collected to improve the training efficiency of the fusion model.

Recently, medical software devices based on deep learning artificial techniques have been developed to automatically detect and classify benign and malignant lesions on DM and DBT mammograms. In a previous study, the researchers investigated to predict malignancy of masses in DBT images by transfer learning of deep learning^[25]. Rabili et al. attempted to detect malignant masses and calcifications through the common Faster RCNN model^[16]. However, these studies were performed based on the 2D analysis of a deep neural network. In this study, we showed that the 3D deep learning method is superior to the 2D methods in size ≤ 2cm mass for benign and malignant classification. Compared with the model based on radiomics, deep learning–based model based on DM + DBT have a better performance with an AUC of 0.908. Our results were comparable to the performance of three different DCNN networks by Ricciardi et al .^[26] Reported, one developed ad-hoc (DBT-DCNN) and the other two, Alex-Net and VGG 19, with the AUC values ranging from 0.70 to 0.93, and accuracies ranging from 69–93%. Fan et al.^[15] showed that the Mask RCNN has better lesion-based detection performance while the Faster RCNN achieved better breast-based mass detection in DBT images. The common deep learning model of ResNet-34 was used in our datasets. Deep learning model might be less influenced by lesion-specific features than feature-based radiomics methods, resulting in a better chance of recognizing a in size ≤ 2cm mass and a significantly better breast DM + DBT-based detection performance.

The systematic analyses of the deep learning model based on DM + DBT showed that the mass AUCs performance varied on different subgroups of characteristics such as age, mass size, and breast density. Interestingly, the radiomics model based on DM + DBT achieved better performance than the deep learning model based on DM + DBT on fatty breast. One possible reason for this is that the lesion-specific features of mass are more prominent on the fatty breast due to the absence of tissue overlap. Fan et al.^[20] also found that 3D-Mask RCCN had significantly better mass detection performance than the 2D methods for patients with 40–59 years old, benign tumors, irregular tumors and dense breast, which were comparable to our results, with obviously improving AUCs from 0.736 to 0.843 on the subgroup of 20 ~ 40 years old, from 0.682 to 0.886 on the dense breast subgroup. This improvement can be attributed to the fact that DBT reduces the tissue overlap and increases the lesion conspicuity, particularly in dense breasts and younger women.

Our study had limitations. First, the radiomics and deep learning models were evaluated on a relatively small patient cohort, which may limit the generalization of the current model to a larger patient group. Moreover, due to the limited amount of external data, no subgroup analysis was performed. Second, the ROIs on each slice were manually segmented, which increased the workload. Also, we used image patches for detection to save computer memory, and thus future studies that focus on the entire image should be conducted. In addition, the performance of models may depend strongly on factors that affect image quality such as the DBT reconstruction methods and parameters, and image acquisition methods such as the x-ray techniques, number of projection views, and tomographic angle. Further investigations will be needed to evaluate the effects of these factors on the performance of different models using different approaches.

In summary, we proposed the deep learning model based DM + DBT images has a best performance. A comparison of the DM and DM + DBT models under different subgroups based on age ranges, lesion sizes, and breast densities was conducted.The advantage of the deep learning model is especially prominent in patients with small masses less than 1cm, 20 to 40 years old, and dense breast. It is expected that AI will play a major role in the evaluation of DBT in ≤ 2cm mass, particularly detecting early breast cancer in the screening setting.

MRI:Magnetic resonance imaging

BPE:Background parenchymal enhancement

NGS:Next generation sequencing

TR:Time of repeatation

TE:Time of echo

DWI:Diffusion weighted imaging

VIBRANT:Volume imaging for breast assessment

BI-RADS:Breast imaging reporting and data system

IHC:Immunohistochemical

ER:Estrogen receptor

PR:Progesterone receptor

HER2:Human epidermal growth factor receptor-2

ASCO-CAP:American Society of Clinical Oncology-College of American Pathologists

PCR:Polymerase chain reaction

GPM:Global proteome machine

IGV:Integrative genomics viewer

ROC:Receiver operating characteristic

AUC:Area under the curve

MAI:Mitotic activity index

Author contributions Conceptualization:HL,ZS;Data curation:CH,YS,ZY;Formal analysis: ZS,YH; Investigation: FZ; Methodology: HL, FZ; Project administration: HL, ZS; Resources: HL, WM; Supervision: FZ; Validation: QZ,JC; Roles/Writing-original draft: ZS, FG; Writing-review and editing: ZS, HL.

Funding This work was suppoted by grants form Chinese National Key Research and Development Project [Grant No. 2018YFC1315600, Grant No. 2021YFC2500400 and Grant No.2021YFC2500402]; Construction Project of Cancer Precision Diagnosis and Drug Treatment Technology [Grant No. ZLJZZDYYWZL10]; Tianjin Key Medical Discipline(Specialty) Construction Project [Grant No.TJYXZDXK-009A];National Natural Science Foundation of China [Grant No.81801781,and Grant No.82072004].

Data avaliability Enquiries and data availability should be directed to the authors.

Conflict of interest No potential conflict of interest relevant to this article was reported.

Mellado Rodríguez M, Osa Labrador AM (2013) Breast cancer screening: current status [in Spanish]. Radiología 55:305–314
Vedantham S, Karellas A, Vijayaraghavan GR, Kopans DB (2015) Digital Breast Tomosynthesis: State of the Art. Radiology 277:663–684
Sechopoulos I(2013). A review of breast tomosynthesis Part I. The image acquisition process. Med Phys 40:014301
Roth RG, Maidment AD, Weinstein SP, Roth SO, Conant EF (2014) Digital breast tomosynthesis: lessons learned from early clinical implementation.RadioGraphics 34:E89–E102
Park JM, Franken EA, Garg M, Fajardo LL, Niklason LT (2007) Breast tomosynthesis: present considerations and future applications.RadioGraphics 27(suppl 1):S231–S240
Dang PA, Freer PE, Humphrey KL, Halpern EF, Rafferty EA (2014) Addition of tomosynthesis to conventional digital mammography: effect on image interpretation time of screening examinations.Radiology 270:49-56
Skaane P, Bandos AI, Gullien R et al (2013) Comparison of digital mammography alone and digital mammography plus tomosynthesis in a population-based screening program.Radiology 267:47–56
Tagliafico AS, Calabrese M, Bignotti B et al (2017) Accuracy and reading time for six strategies using digital breast tomosynthesis in women with mammographically negative dense breasts. Eur Radiol 27:5179–5184
Korhonen KE, Weinstein SP, McDonald ES, Conant EF (2016) Strategies to Increase Cancer Detection: Review of True-Positive and False-Negative Results at Digital Breast Tomosynthesis Screening. RadioGraphics 36:1954–1965
Mohamed AA, Berg WA, Peng H, Luo Y, Jankowitz RC, Wu S (2018) A deep learning method for classifying mammographic breast density categories. Med Phys 45:314–321
Chougrad H, Zouaki H, Alheyane O (2018) Deep convolutional neural networks for breast cancer screening. Comput Methods Programs Biomed 157:19–30
Becker AS, Marcon M, Ghafoor S, Wurnig MC, Frauenfelder T, Boss A (2017) Deep learning in mammography:diagnostic accuracy of a multipurpose image analysis software in the detection of breast cancer.Invest Radiol 52:434–440
Samala RK, Chan HP, Hadjiiski LM, Helvie MA, Richter C, Cha K(2018)Evolutionary pruning of transfer learned deep convolutional neural network for breast cancer diagnosis in digital breast tomosynthesis. Phys. Med.Biol 63:095005
Osteras BH, Martinsen AC T, Gullien R, Skaane P (2019) Digital mammography versus breast tomosynthesis: impact of breast density on diagnostic performance in population-based screening. Radiology 293: 60–68
Fan M, Zheng H, Zheng S et al (2020) Mass Detection and Segmentation in Digital Breast Tomosynthesis Using 3D-Mask Region-Based Convolutional Neural Network: A Comparative. Front Mol Biosci 7:599333
Ribli D, Horvath A, Unger Z, Pollner P, Csabai I (2018) Detecting and classifying lesions in mammograms with deep learning. Sci. Rep 8:4165
Aerts HJ, Velazquez ER, Leijenaar RT, et al (2014) Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat Commun 5:4006
Zhou M, Scott J, Chaudhury B et al (2018) Radiomics in brain tumor: image assessment, quantitative feature descriptors, and machine-learning approaches. AJNR Am J Neuroradiol 39:208-216
Parmar C, Grossmann P, Bussink J, Lambin P, Aerts HJ (2015) Machine learning methods for quantitative radiomic biomarkers. Sci Rep 5:13087
Lambin P, Leijenaar RT, Deist TM et al (2017) Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol 14:749-762
Chan HP, Wei J, Zhang Y et al (2008) Computer-aided detection of masses in digital tomosynthesis mammography: comparison of three approaches. Med Phys. 35:4087-4095
Kontos D, Ikejimba LC, Bakic PR, Troxel AB, Conant EF, Maidment AD (2011) Analysis of parenchymal texture with digital breast tomosynthesis: comparison with digital mammography and implications for cancer risk assessment. Radiology 261:80-91
Kontos D, Bakic PR, Carton AK, Troxel AB, Conant EF, Maidment AD (2009) Parenchymal texture analysis in digital breast tomosynthesis for breast cancer risk estimation: a preliminary study. Acad Radiol 16:283-298
Niu S, Tao Y, Yan C, Dong Y, Luo Y, Jiang X (2022) Digital breast tomosynthesis-based peritumoral radiomics approaches in the differentiation of benign and malignant breast lesions.Diagn Interv Radiol 28:217-225
Samala RK, Heang-Ping C, Hadjiiski L, Helvie MA, Richter CD, Cha KH (2019). Breast cancer diagnosis in digital breast tomosynthesis: effffects of training sample size on multi-stage transfer learning using deep neural nets. IEEE Trans. Med. Imaging 38, 686–696
Ricciardi R, Mettivier G, Staffa M et al (2021) A deep learning classifier for digital breast tomosynthesis. Phys Med 83:184-193

Download PDF

Reviewers agreed at journal
17 Dec, 2023
Editor invited by journal
04 Nov, 2023
Reviewers invited by journal
23 Oct, 2023
Editor assigned by journal
14 Oct, 2023
First submitted to journal
13 Oct, 2023

You are reading this latest preprint version

AI-based Strategies in Breast Mass ≤ 2cm classification with Mammography and Tomosynthesis: A Retrospective Evaluation

Status:

Version 1

Abstract

Figures

Key Points

Introduction

Materials and Methods

Patients

Digital mammography (DM) and Digital mammography tomosynthesis(DBT)technique

ROI extraction

Development of Radiomics and Deep Learning Models

-Radiomics Model

-Deep Learning Model

Evaluation and Statistical Analysis

Results

Study Population

Radiomics Model on Internal Dataset

Deep Learning Model on Internal Dataset

Results on External Dataset

Discussion

Abbreviations

Declarations

References

Status:

Version 1