The potential of adding mammography to HHUS and ABUS to reduce unnecessary biopsies in  BI-RADS ultrasound category 4a: a multicenter hospital-based study in China

doi:10.21203/rs.3.rs-2090494/v1

Download PDF

Research Article

The potential of adding mammography to HHUS and ABUS to reduce unnecessary biopsies in BI-RADS ultrasound category 4a: a multicenter hospital-based study in China

https://doi.org/10.21203/rs.3.rs-2090494/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Purpose

This study compares image features influencing false-positive lesions in category 4a between handheld ultrasound (HHUS) and automated breast ultrasound (ABUS) and explores the role of second-look mammography (MAM) adjunct to US of 4a masses.

Methods

Women aged 30 to 69 underwent HHUS and ABUS from 2016 through 2017 at five high-level hospitals in China with those aged 40 or older also accepting MAM. Logistic regression analysis assessed image variables correlated with false-positive lesions in US category 4a. Unnecessary biopsies, invasive cancer (IC) yields, and diagnostic performance among different biopsy thresholds were compared.

Results

1946 women (44.9±9.8 years) were eligible for analysis. 188 (9.66%) were categorized as category 4a in HHUS and 117 (6.01%) of ABUS. Orientation, architectural distortion, and duct change were independent factors associated with the false-positive lesions in 4a of HHUS, whereas premenopausal, size, calcification, and architectural distortion were significant features of ABUS (all P＜0.05). For HHUS, both unnecessary biopsy rate and IC yields were significantly reduced when changing biopsy thresholds by adding MAM for US 4a patients (scenario #1:BI-RADS 3, 4, and 5; scenario #2: BI-RADS 4 and 5) compared with the current scenario (all P＜0.05). However, scenario #1 reduced false-positive biopsies without affecting IC yields when compared to the current scenario for ABUS (P＜0.001; P=0.125).

Conclusions

The higher unnecessary biopsy rate of category 4a by ABUS was similar to HHUS. However, the second-look MAM adjunct to ABUS has the potential to safely reduce false-positive biopsies.

Breast Neoplasms

Ultrasonography

Automated Breast Ultrasound

Mammography

Diagnosis

Mammography (MAM) is widely used as the standard modality for detecting and screening early breast cancer. However, the diagnostic accuracy of MAM is limited in women with dense breasts [1–3]. Another barrier for MAM to apply and expand sustainability is the lack of equipment, especially in low resources areas [4].

Conventional handheld ultrasound (HHUS) offers a low-cost and portable way of breast cancer detection without the limitations of breast density [5], thereby increasingly being used in clinical breast examination. However, operator dependence has long been a concern for HHUS and causes interobserver variability. Automated breast ultrasound (ABUS) is a newly designed tool with the potential to overcome the criticism of HHUS by separating image acquisition from interpretation to increase reproducibility [6]. Multiplanar reconstructions also provide an advantage for evaluating breast lesions which might help improve the diagnosis accuracy [6].

To provide standardized ultrasound (US) findings reporting systems and aid quality assurance and risk assessment, the Breast Imaging Reporting and Data System (BI-RADS) is generalized worldwide [7]. The latest nationwide survey in mainland China reported the average utilization rate of BI-RADS was up to 87.02% among 5,460 departments providing ultrasound diagnoses [8]. However, the application of category 4 subdivisions in the new fifth BI-RADS lexicon offers a challenge to managing of BI-RADS 4a. The malignant rate of BI-RADS 4a is meager (2–10%) whereas immediate biopsy referral is recommended, while category 3 refers to probably benign masses (༜2%) with short-term follow-up imaging recommended [7]. In case to avoid missed diagnoses, observers tend to upgrade breast masses into 4a when difficult to determine category 3 or 4a, but this may result in unnecessary biopsies.

The benign biopsy rate on breast US of 4a patients is a considerable percentage (more than 50%) [9–10]. Unnecessary biopsies can result in negative consequences for normal women, including the risk of complications, psychological anxiety, and additional financial costs [11–13]. Thereby, how to avoid excessive biopsies of HHUS category 4a by supplementing other techniques remains for further exploratory.

The diagnostic performance between ABUS and HHUS has been proven comparable based on the fifth BI-RADS edition [14]. However, to our knowledge, there is not yet established evidence to identify whether the accuracy of category 4a on ABUS is higher than HHUS. Furthermore, given the advantages in diagnosing calcification lesions [15], MAM provides a potential complementary option to improve diagnostic performance when combined with US (ABUS or HHUS). Few studies have evaluated whether adding MAM to lesions assessed as US category 4a improves diagnostic accuracy and reduces unnecessary biopsies rate.

The purpose of this study was twofold: (1) identify and compare the clinical and image features influencing the false-positive lesions in category 4a between ABUS and HHUS. (2) assess the diagnostic performance of second-look MAM adjunct to HHUS and ABUS to reduce false-positive diagnoses.

Study population

The research design has been published in detail elsewhere [16]. Briefly, this multi-center cross-sectional study was conducted in five high-level hospitals located in China (including Beijing, Tianjin, Shanghai, Hangzhou, and Guangzhou) from February 2016, through March 2017. Women aged 30 to 69 years were invited to attend both HHUS and ABUS while those aged 40 years and above also underwent MAM. Patients with the most severe category on three modalities, including BI-RADS 4 and 5, were considered positive findings and required a biopsy, whereas those with BI-RADS 1, 2, or 3 were categorized as negative findings. The study was registered in the Chinese Clinical Trial Registry (ChiCTR1800017908) and granted institutional ethics approval by all study centers.

Image Acquisition And Interpretation

The participants underwent ABUS using Invenia ABUS (GE Healthcare, WI, USA) performed by well-trained technicians and interpreted by radiologists with 3–6 months of experience. Three planes (including lateral, anteroposterior, and medial) are collected on each breast. The image in three views could be transmitted to the workstation and reconstructed in the breast and displayed in 3D volumes. The HHUS images were acquired by one of the following devices, including GE LOGIQ9 (GE Healthcare, WI, USA), iU22 Ultrasound System (Philips Medical System, WA, USA), S2000 (Siemens Medical Solutions, CA, USA), and the Aixplorer system (Supersonic Imagine, Aix en Provence, France), which performed by qualified radiologists in five hospitals. All MAM examinations were performed by one of three techniques including GE Sengraphe DS (GE Healthcare, WI, USA), Hologic Selenia (Hologic, MA, USA), and Fujifilm FDR MS-2500 (Fujifilm Crop, Tokyo, Japan). During the study, different experienced radiologists reviewed and interpreted images from three modalities and were blinded to each other. However, they were provided with information on participants’ clinical examinations.

Statistical analysis

We analyzed and compared the detection rate of normal/benign and malignant lesions classified as US categories 3, 4a, ab, and 4c using the Chi-squared test for trend. In clinical practice, observers always have difficulty in better characterizing category 3 and 4a lesions even for highly qualified experts. Therefore, to evaluate the clinical and image features influencing the false-positive lesions in category 4a, we selected those who were categorized as 3 and 4a and underwent biopsy and evaluated them as benign breast lesions as an analysis set. With category 3 as the reference group, multivariable logistic regression analysis was used to estimate odds ratios (ORs) and confidence intervals (CIs). Unnecessary biopsy rate, invasive cancer (IC) detection rate, malignant rate of biopsy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and area under curve (AUC) were calculated to evaluate the diagnostic performance among different biopsy thresholds, which were compared using the McNemar tests or the Chi-squared test. The statistical analysis was performed with SAS, version 9.4 (SAS Institute, Cary, North Carolina). A P-value < 0.05 was considered statistical significance.

Distribution of benign and malignant lesions in category 3 and 4a

Among 1973 eligible women who received HHUS and ABUS between 2016 and 2017, 27 women were excluded for missing breast density in those who underwent MAM (Fig. 1). Of 1946 participants (mean age 44.9 ± 9.8 years) for analysis, 188 (9.66%) were categorized as category 4a in HHUS while 117 (6.01%) of ABUS. For HHUS, the proportion of normal or benign lesions showed a decreasing trend among 4a (67.55%), 4b (26.39%), and 4c (18.99%) (P for trend༜0.001). ABUS showed the same trend as HHUS among 4a, 4b, and 4c (65.81% vs 23.94% vs 8.57%; P for trend༜0.001). Meanwhile, there were 72.84% of unnecessary biopsies occurred in 81 participants who have assessed the BI-RADS 4a category with both ABUS and HHUS (Table 1).

Table 1

Distribution of benign and malignant lesions according to BI-RADS-US category between ABUS and HHUS
BI-RADS US category	Total (N,%)*	Normal/benign (n,%)	DCIS (n,%)	IC (n,%)
HHUS
3	536(27.54)	518(96.64)	9(1.68)	9(1.68)
4a	188(9.66)	127(67.55)	10(5.32)	51(27.13)
4b	72(3.70)	19(26.39)	11(15.28)	42(58.33)
4c	79(4.05)	15(18.99)	6(7.59)	58(73.42)
P for trend	-	< 0.001	-	< 0.001
ABUS
3	546(28.06)	520(95.24)	5(0.91)	21(3.85)
4a	117(6.01)	77(65.81)	12(10.26)	28(23.93)
4b	71(3.65)	17(23.94)	10(14.09)	44(61.97)
4c	105(5.40)	9(8.57)	9(8.57)	87(82.86)
P for trend	-	< 0.001	-	< 0.001
HHUS & ABUS
3	436(22.40)	424(97.25)	4(0.92)	8(1.83)
4a	81(4.16)	59(72.84)	6(7.41)	16(19.75)
4b	21(1.08)	5(23.81)	3(14.29)	13(61.90)
4c	32(1.64)	2(6.25)	3(9.37)	27(84.38)
P for trend	-	< 0.001	-	< 0.001
* The denominator of the percentage of BI-RADS categories 3 and 4 is 1946
Abbreviations: ABUS: Automated Breast Ultrasound; BI-RADS: Breast Imaging Reporting and Data System; DCIS: Ductal Carcinoma in Situ; HHUS: Handheld Ultrasound; IC: Invasive Cancer

Clinical And Imaging Factors Associated With False-positive Lesions In Category 4a

Among 371 benign lesions assessed as categories 3 and 4a on HHUS, 127 were assessed as false-positive cases in 4a. Meanwhile, the false-positive cases were 77 in ABUS 4a among 357 benign lesions in 3 and 4a. Tables 2 and 3 display the ORs of clinical and imaging factors for false-positive cases assigned by HHUS and ABUS when using category 3 as the reference group. In the logistic regression analysis, nonparallel masses (OR, 5.30; 95%CI, 1.98 to 14.16; P = 0.001), architectural distortion (2.86; 1.33 to 6.15; P = 0.007), and duct change (8.92; 3.49 to 22.77; P༜0.001) were independent factors linked with the false-positive lesions in the BI-RADS-US 4A in HHUS, while postmenopausal (0.39; 0.20 to 0.77; P = 0.006), larger lesions (2.09; 1.07 to 4.10; P = 0.032), calcification (2.29; 1.15 to 4.57; P = 0.018), and architectural distortion (4.11; 1.15 to 11.21; P = 0.006) were significant features of ABUS.

Table 2

Differential regression analysis of clinical and imaging features of false-positive lesions in BI-RADS 4a among HHUS
Variables		BI-RADS 4a (Benign, n = 127 )	BI-RADS 3 (Benign, n = 244 )	OR (95%CI)	aOR (95%CI)**
Age(y)
	30–39	48	102	1.00
	40–69	79	142	1.18(0.76,1.84)	-
Menopausal status
	Premenopausal	25	44	1.00
	Postmenopausal	102	200	0.90(0.52,1.55)	-
Breast density*
	Less dense	11	21	1.00
	More dense	68	121	1.17(0.76,1.80)	-
Palpability of the mass
	Palpable	62	77	1.00	1.00
	Non palpable	65	147	0.69(0.45,1.07)	0.84(0.48,1.49)
Size(cm)*
	≤ 2	79	184	1.00	1.00
	> 2	43	60	1.57(0.98,2.51)	1.61(0.87,2.97)
Shape*
	Oval and Round	56	174	1.00	1.00
	Irregular	66	70	2.69(1.72,4.20)	1.69(0.95,3.03)
Orientation*
	Parallel	102	236	1.00	1.00
	Nonparallel	20	8	5.51(2.35,12.92)	5.30(1.98,14.16)
Margin*
	Regular	74	204	1.00	1.00
	Irregular	48	40	3.10(1.89,5.07)	1.68(0.88,3.20)
Posterior feature*
	None	87	182	1.00
	Enhancement and/or Shadowing	35	62	1.12(0.69,1.81)	-
Calcification*
	None	97	213	1.00	1.00
	Present	25	31	1.68(0.95,3.00)	1.82(0.91,3.61)
Distorted structure
	None	102	227	1.00	1.00
	Architectural distortion	25	17	3.27(1.69,6.33)	2.86(1.33,6.15)
Duct change
	None	102	236	1.00	1.00
	Dilation or with filling	25	8	7.23(3.16,16.57)	8.92(3.49,22.77)
Vascularity
	Absent	70	171	1.00	1.00
	Internal and/or Vessels vascularity	57	73	1.91(1.22,2.97)	1.24(0.71,2.16)
* Missing values in data
** OR was adjusted by the following variables: palpability of the mass, size, shape, orientation, margin, calcification, distorted structure, duct change, and vascularity
Abbreviations: BI-RADS: Breast Imaging Reporting and Data System; HHUS: Handheld Ultrasound

Table 3

Differential regression analysis of clinical and imaging features of false-positive lesions in BI-RADS 4a among ABUS
Variables		BI-RADS 4a (Benign, n = 77 )	BI-RADS 3 (Benign, n = 280 )	OR (95%CI)	aOR (95%CI)**
Age(y)
	30–39	26	119	1.00
	40–69	51	161	1.45(0.86,2.46)	-
Menopausal status
	Premenopausal	24	48	1.00	1.00
	Postmenopausal	53	232	0.46(0.26,0.81)	0.39(0.20,0.77)
Breast density*
	Less dense	10	25	1.00	1.00
	More dense	45	136	1.21(0.73,2.00)	1.19(0.66,2.15)
Palpability of the mass
	Palpable	38	122	1.00
	Non palpable	39	158	0.79(0.48,1.31)	-
Size(cm)*
	≤ 2	48	215	1.00	1.00
	> 2	23	56	1.70(0.96,3.01)	2.09(1.07,4.10)
Shape*
	Oval and Round	35	201	1.00	1.00
	Irregular	36	70	2.63(1.56,4.44)	2.15(0.99,4.68)
Orientation*
	Parallel	57	242	1.00	1.00
	Nonparallel	14	29	1.92(0.96,3.85)	1.39(0.57,3.39)
Margin*
	Regular	30	177	1.00	1.00
	Irregular	41	94	2.25(1.35,3.76)	0.97(0.45,2.09)
Posterior feature
	None	44	183	1.00
	Enhancement and/or Shadowing	33	97	1.42(0.85,2.37)	-
Calcification
	None	52	243	1.00	1.00
	Present	25	37	3.16(1.75,5.69)	2.29(1.15,4.57)
Distorted structure
	None	62	270	1.00	1.00
	Architectural distortion	15	10	6.53(2.80,15.22)	4.11(1.51,11.21)
Duct change
	None	62	257	1.00	1.00
	Dilation or with filling	15	23	2.70(1.33,5.48)	2.24(0.94,5.34)
Retraction phenomenon
	None	71	280	-	-
	Present	6	0	-	-
* Missing values in data
** OR was adjusted by the following variables: menopausal status, breast density, size, shape, orientation, margin, calcification, distorted structure, and duct change
Abbreviations: ABUS: Automated Breast Ultrasound; BI-RADS: Breast Imaging Reporting and Data System

Diagnostic Performance Of Adding Mam To Hhus And Abus

We evaluated the effect of changing biopsy thresholds for women with US category 4a lesions who underwent MAM (HHUS, 138 women; ABUS, 94 women). Three scenarios about different biopsy thresholds are shown in Table 4, including all women with BI-RADS-US (HHUS or ABUS) category 4a undergoing biopsy (current scenario), women with BI-RADS-US category 4a and BI-RADS-MAM category 3,4, and 5 undergoing biopsy (scenario #1), and women with BI-RADS-US category 4a and BI-RADS-MAM category 4 and 5 undergoing biopsy (scenario #2).

Table 4

Diagnostic performance of different scenarios when add MAM to HHUS and ABUS
Biopsy thresholds¶	HHUS + MAM					ABUS + MAM
Biopsy thresholds¶	Sensitivity (%, 95%CI)	Specificity (%, 95%CI)	PPV (%, 95%CI)	NPV (%, 95%CI)	AUC	Sensitivity (%, 95%CI)	Specificity (%, 95%CI)	PPV (%, 95%CI)	NPV (%, 95%CI)	AUC
Current scenario	77.22 (66.14, 85.60)	80.31 (76.98, 83.27)	32.45 (25.92, 39.71)	96.64 (94.64, 97.64)	0.80 (0.75, 0.85)	60.61 (47.80, 72.18)	87.10 (84.08, 89.63)	34.19 (25.83, 43.60)	95.24 (93.01, 96.81)	0.77 (0.70, 0.84)
Scenario #1	59.49 (47.84, 70.21)	91.01 (88.47, 93.05)	44.76 (35.15, 54.76)	94.83 (92.70, 96.38)	0.78 (0.72, 0.84)	50.00 (37.56, 62.44)	94.30 (92.05, 95.96)	49.25 (36.95, 61.64)	94.46 (92.23, 96.10)	0.74 (0.68, 0.81)
Scenario #2	51.90 (40.44, 63.17)	93.95 (91.75, 95.61)	51.25 (39.89, 62.48)	94.10 (91.92, 95.74)	0.76 (0.70, 0.82)	40.91 (29.18, 53.70)	95.64 (93.59, 97.08)	50.94 (37.00, 64.75)	93.61 (91.29, 95.36)	0.70 (0.63, 0.76)
*P value₁	< 0.0001	< 0.0001	0.0361	0.1314	0.2382	0.0156	< 0.0001	0.0444	0.5545	0.2774
**P value₂	< 0.0001	< 0.0001	0.0037	0.0408	0.0945	0.0002	< 0.0001	0.0384	0.2293	0.0181
¶ The diagnostic performance of different scenarios was compared among women with BI-RADS-US (HHUS or ABUS) category 3 and 4a lesions.
Current scenario: all women with BI-RADS-US (HHUS or ABUS) category 4a underwent biopsy;
Scenario #1: women with BI-RADS-US category 4a and BI-RADS-MAM category 3,4, and 5 underwent biopsy;
Scenario #2: women with BI-RADS-US category 4a and BI-RADS-MAM category 4 and 5 underwent biopsy
* Compare Scenario #1 with Current scenario
** Compare Scenario #2 with Current scenario
Abbreviations: ABUS: Automated Breast Ultrasound; HHUS: Handheld Ultrasound; MAM:Mammography; PPV:Positive Predictive Value; NPV:Negative Predictive Value; AUC:Area Under Curve

The diagnostic performance of different scenarios was compared among women with BI-RADS-US category 3 and 4a lesions (Table 4). The AUCs of the combination of HHUS and MAM (both scenarios #1 and #2) were similar to that of the current scenario (P = 0.24; P = 0.09). Although sensitivity was significantly lower in both new scenario groups than in the current scenario group, specificity and PPV improved for HHUS and ABUS (all P༜0.05). Meanwhile, only scenario #1 which adds MAM to ABUS obtained a similar AUC compared with the current scenario (P = 0.28).

For HHUS, the unnecessary biopsy rate was significantly reduced to 39.86% (55/138) and 28.26% (39/138) for scenario #1 and scenario #2 compared with the current scenario, respectively (all P༜0.001), and the malignancy rate of biopsy increased to 45.54% (46/101) and 51.25% (41/80), respectively (P = 0.102; P = 0.008). However, both new scenarios had significantly lower IC detection rates than the current scenario (Table 5). Similar patterns were recorded for ABUS apart from scenario #1 significantly reduced the false positive biopsies (P༜0.001) without decreasing IC yield (P = 0.13) (Table 5).

Table 5

Effect of increasing biopsy thresholds on unnecessary biopsies and cancer yields when add MAM to HHUS and ABUS
Biopsy thresholds¶		HHUS (N = 138)			ABUS (N = 94)
Biopsy thresholds¶		Unnecessary biopsy rate (n,%)	IC detection rate (n,%)	Malignancy rate of biopsy (n,%)	Unnecessary biopsy rate (n,%)	IC detection rate (n,%)	Malignancy rate of biopsy (n,%)
Total
	Current scenario	84(60.87)	46(33.33)	54(39.13)	55(58.51)	28(29.78)	39(41.49)
	Scenario #1	55(39.86)*	38(27.54)†	46(45.54)	33(35.11)*	24(25.53)	33(50.00)
	Scenario #2	39(28.26)*	34(24.64)†	41(51.25)	26(27.66)*	20(21.28)†	27(50.94)
Stratified by breast density
Less dense
	Current scenario	11(52.38)	8(38.10)	10(47.62)	10(55.56)	4(22.22)	8(44.44)
	Scenario #1	8(38.10)	8(38.10)	10(55.56)	4(22.22)*	4(22.22)	6(60.00)
	Scenario #2	5(23.81)*	7(33.33)	9(64.29)	3(16.67)*	3(16.67)	5(62.50)
More dense
	Current scenario	73(72.39)	38(32.48)	44(37.61)	45(59.21)	24(31.58)	31(40.79)
	Scenario #1	47(40.17)*	30(25.64)†	36(43.37)	29(38.16)*	20(26.32)	27(48.21)
	Scenario #2	34(29.06)*	27(23.08)†	32(48.48)	23(30.26)*	17(22.37)†	22(48.89)
Stratified by age
40–49 years
	Current scenario	52(73.24)	17(23.94)	19(26.76)	34(68.00)	14(28.00)	16(32.00)
	Scenario #1	34(47.89)*	13(18.31)	15(30.61)	22(44.00)*	12(24.00)	14(38.89)
	Scenario #2	29(40.85)*	11(15.49)†	13(30.95)	19(38.00)*	10(20.00)	12(38.71)
50–69 years
	Current scenario	32(47.76)	29(43.28)	35(52.24)	21(47.72)	14(31.81)	23(52.27)
	Scenario #1	21(31.34)*	25(37.31)	31(59.62)	11(25.00)*	12(27.27)	19(63.33)
	Scenario #2	10(14.93)*	23(34.33)†	28(73.68)§	7(15.91)*	10(22.73)	15(68.18)
Stratified by palpability of the mass
Palpable
	Current scenario	36(50.70)	32(45.07)	35(49.30)	25(50.00)	20(40.00)	25(50.00)
	Scenario #1	27(38.03)*	30(42.25)	33(55.00)	17(34.00)*	18(36.00)	23(57.50)
	Scenario #2	20(28.17)*	27(38.03)	30(60.00)	14(28.00)*	16(32.00)	20(58.82)
Non-Palpable
	Current scenario	48(71.64)	14(20.90)	19(28.36)	30(68.18)	8(18.18)	14(31.82)
	Scenario #1	28(41.79)*	8(11.94)†	13(31.71)	16(36.36)*	6(13.64)	10(38.46)
	Scenario #2	19(28.36)*	7(10.45)†	11(36.67)	12(27.27)*	4(9.09)	7(36.84)
¶ Current scenario: all women with BI-RADS-US (HHUS or ABUS) category 4a underwent biopsy; Scenario #1: women with BI-RADS-US category 4a and BI-RADS-MAM category 3,4, and 5 underwent biopsy; Scenario #2: women with BI-RADS-US category 4a and BI-RADS-MAM category 4 and 5 underwent biopsy
* P < 0.05 for the unnecessary biopsy rate of three new scenarios vs current scenario with McNemar’s χ² test.
† P < 0.05 for the IC detection rate of three new scenarios vs current scenario with McNemar’s χ² test.
§ P < 0.05 for the malignancy rate of biopsy of three new scenarios vs current scenario with Chi-square test.
Abbreviations: ABUS: Automated Breast Ultrasound; BI-RADS: Breast Imaging Reporting and Data System; IC: Invasive Cancer; HHUS: Handheld Ultrasound; MAM: Mammography

We also compared the unnecessary biopsy rates, IC yields, and malignant rate of biopsy between two new scenarios and the current scenario for HHUS and ABUS by age, breast density, and palpability of the mass (Table 5). In all subgroups, a lower unnecessary biopsy rate was always significantly noted for two new scenarios in both HHUS and ABUS. The IC yields of two new biopsy thresholds were not inferior to the current scenario for HHUS in women with less dense breasts (P = 1.00) and those with palpable masses (P = 0.06). For ABUS, we did not observe a significant difference in diagnostic performance in all subgroups between the two new scenarios and the current scenario, except for the IC yields of scenario #2 of women with dense breasts (P = 0.02).

The potentially large number of unnecessary biopsies resulting from the current recommendation for BI-RADS-US category 4a creates an additional burden for women and impacts clinical resources. Our findings showed that the false-positive rate of category 4a in ABUS was almost 65.81% which was similar to HHUS (67.55%). Meanwhile, clinical and sonographic factors influencing the 4a false-positive lesions were observed differently between HHUS and ABUS which might be associated with radiologists’ experiences and equipment difference. Supplementing MAM in women with BI-RADS-US (HHUS or ABUS) category 4a lesions can improve specificity and PPV. To note, the potential added value of the second-look MAM adjunct to HHUS 4a was identified to reduce unnecessary biopsy procedures, but might with the risk of missing invasive cancer. However, the new strategy combining ABUS category 4a and MAM 3, 4, and 5 as a new biopsy threshold would have the potential to safely reduce false-positive biopsies.

Current criticisms of HHUS include concern about the false positive results and associated unnecessary biopsies [17]. The range of the malignancy rate for BI-RADS-US 4 lesions is wide (2 ~ 95%) [7]. In particular, considerable overlapped image features between benign and malignant lesions in category 4a result in difficulty to distinguish malignancy. The primary reason is lacking objective criteria for the subclassification of category 4 lesions which are largely based on the experience of the sonographers [18]. Our results also reflected the benign biopsy rate of 4a was higher even when performed by highly qualified experts from high-level hospitals which was following the conclusions of previous studies [9–10].

The potential of ABUS in the diagnostic setting of breast cancer has currently become the research focus because of its benefits [19]. Some unique features through multiplanar reconstructions can provide additional information for differentiating benign and malignant masses [20]. For example, the retraction phenomenon, as the specific feature observed in ABUS coronal view, has been suggested to be a predictable characteristic of breast cancer [21]. Our previous studies have suggested that specificity and PPV were significantly higher in ABUS, compared with that of HHUS [22–23]. However, this study showed the unnecessary biopsy rate of ABUS among 4a masses is similar to that of HHUS. This might be explained by the lower ability of radiologists who review the ABUS images to evaluate category 4a even with standardized training before. Additionally, there is not yet a well-established specific criterion in determining lesion characteristics with ABUS images worldwide, primarily based on BI-RADS-US descriptors. Of note, the non-essential biopsy rate was 72.84% in 81 patients assessed with both ABUS and HHUS. Thereby, the technological inherent limitations of US equipment may also be another important reason.

In routine clinical practice, the interpretation criteria of category 4a are that a mass with benign ultrasound appearance but exhibiting any suspicious sign [24]. Of all normal or benign lesions, duct change, nonparallel masses, and architectural distortion increased the level of suspicion for these masses and prefer BI-RADS 3 to 4a for HHUS in our study. Surrounding background tissue change results in the poor demarcation between masses and normal tissue, which may partly be explained by these features impede evaluate accurately the breast lesion [25–27]. Meanwhile, Calcification was observed associated with false-positive cases in lesions of category 4a examined using ABUS. Due to the influence of probe frequency, tissue background echo, and operator technology, ultrasound is not ideal to detect microcalcification in lesions even though it is the key imaging feature for the diagnosis of breast cancer [28]. Notably, we also found that menopausal status and larger masses tended to higher probability of false positives. ABUS separates image acquisition (performed by the technicians) from interpretation. Therefore, sonographers will pay more attention to the clinical characteristics of patients compared with HHUS, such as menopausal status. In addition, a previous study has also revealed that women with larger lesions tend to raise the level of suspicion [29]. Above all, benign possibilities should be taken into account when these features are found which suggests that examiners need to integrate other important image features when interpreting ultrasound images by receiving specific training about BI-RADS descriptors. More importantly, supplemental other screening tools might be effective strategies help to triage populations with lower risk by delaying biopsy interventions and avoiding making unnecessary recommendations.

No other trials of integrated MAM in ultrasound have reported results. Lacking sufficient evidence to reduce breast cancer mortality could be a barrier to implementing the widespread ultrasound as the stand-alone screening modality. Currently, supplemental ultrasound to MAM has become a mainstay of diagnostic breast imaging for women with mammographically dense breasts. Some low- and middle-income countries (LMICs) are exploring HHUS application as a primary screening method for breast cancer because of the advantages of being cheap, having higher access, and being noninvasive [30–32]. A meta-analysis study demonstrated that studies focusing on HHUS applications in LMICs have risen nearly by 60%, which reveals the increasing adoption of HHUS equipment worldwide [33]. However, given the lower specificity and higher false-positive rate of HHUS, it is important to explore ultrasound-based diagnostic strategies in combination with other techniques.

US (HHUS or ABUS) category 4a combined with MAM positive results as the biopsy threshold can significantly improve diagnostic performance and reduce false-positive biopsies when compared to the current scenario, but probably with the risk of missing invasive cancer. The most likely explanation is that more than 70% of participants age 40 and older and almost 50% of them were premenopausal who undergo MAM are found to have dense breasts in our study, which may be associated with the lower sensitivity of MAM [16]. Of note, our findings also revealed that the new biopsy threshold did not affect the invasive cancer yield which is comparative with the current scenario for women with less dense breasts. Furthermore, this study was conducted in hospitals and the conclusions came from the symptomatic population who has a higher risk for breast cancer than the asymptomatic population. In view of these issues, whether an immediate biopsy strategy is needed for this group still depends on clinicians’ perceptions of acceptable risks based on an individual patient basis to balance the pros and cons.

Notably, we found that the added value of the second-look MAM adjunct to HHUS 4a could acquire higher cancer yields when breast masses were palpable which might be related to the probability of malignancy being fairly high in palpable lesions. Pliability is likely to be viewed with more suspicion by these masses, providing information to aid diagnosis for radiologists [29]. A previous study showed the combination of MAM and HHUS could potentially increase the negative predictive value among women with palpable breast abnormality [34].

Most importantly, this study provides a more practical perspective that when the biopsy threshold identified BI-RADS 3 and above for MAM combined with BI-RADS 4a for ABUS has benefited over the current biopsy strategy for reducing false-positive biopsies without affecting the detection performance. Findings from a prospective study indicated that ABUS has a higher ability to detect architectural distortions, one of the risk factors of subsequent breast cancer in mammographic findings [35], on the coronal plane than HHUS [36]. Additionally, ABUS can supplement mammography to detect more non-calcified carcinomas compare with HHUS in women with dense breasts [35]. This might explain the higher diagnostic performance of the biopsy threshold (Scenario #2) for ABUS than that of HHUS. Furthermore, we also acknowledged that the difference between HHUS and ABUS might result from the limited sample size in category 4a.

The reasons for false-positive findings need to be identified through external quality assessment in clinical practice. Some cases without abnormal pathological findings might have image changes that mimic the appearance of precancerous lesions, resulting in misclassification as positive results. This group then needs to be given priority attention, because the image feature abnormalities are more likely to be risk markers of breast cancer [37]. A previous retrospective study performed by Hofvind et al. showed a higher interval breast cancer rate appeared after a false-positive result in a MAM-based screening program [38]. The biological susceptibility maybe contributes to the increased risk for breast cancer [39]. Thereby, risk-based stratification management strategies play a vital role for women with false-positive results. However, because of our cross-sectional study design, future works should be conducted to explore the safe screening intervals for false-positive recalls.

The main strength of this study is the first to evaluate the added value of the second-look MAM adjunct to US (HHUS or ABUS) category 4a. It possibly contributes to understanding that MAM might be a useful additional tool for US in breast cancer diagnostics to better distinguish which patients require a histopathologic confirmation of suspicious lesions on imaging.

This study had several limitations. First, the experience of the radiologists among five research centers could affect the ability of image acquisition and interpretation. However, the variability among radiologists might be avoided to some extent by the standardized training before the research. Another limitation is the absence of follow-up information may affect the accurate evaluation of long-term effectiveness results in patients with false-positive biopsies of US 4a among different biopsy thresholds. In addition, the study participants were recruited from hospital outpatient with a higher risk of breast cancer, which does not reflect the new biopsy thresholds applications for the general population. To address this issue, we now have conducted ongoing real-world research to explore the screening effectiveness for HHUS, ABUS, and MAM in average-risk populations.

The higher unnecessary biopsied rate of category 4a by ABUS was very similar to HHUS whereas image factors influencing the false positive 4a lesions showed the difference, which might be attributed to the experiences of radiologists and the equipment difference. Therefore, these image features should be the focus of integrated training to improve the diagnostic accuracy of US category 4a. The second-look MAM adjunct to US had the potential to improve diagnostic performance and reduce overdiagnosis but might miss invasive cancer. However, the added value was observed for women with less dense breasts and those with palpable breast masses among HHUS group. Notably, BI-RADS 3 and above for MAM combined with BI-RADS 4a for ABUS benefited from the current biopsy strategy and safely reduced false-positive biopsies. Future work is still needed to explore the appropriate follow-up interval for false-positive patients in specific populations.

Author’s contributions

All authors have made significant contributions to this study. Designing the study: FZ, WR,YQ, and ZX. Analyzing and interpreting the data: WR, XLZ, XWZ, HY, and SH. Drafting the manuscript: WR, XLZ, and XWZ. Revising the manuscript: All authors.

Funding

This study was funded by GE Healthcare (No. CH-EPI-027) and Chinese Academy of Medical Sciences Innovation Fund for Medical Sciences (CIFMS 2017-I2M-B&R-03).

Data availability

The dataset analyzed during the current study is available from the corresponding author on reasonable request.

Conflict of interest All authors declare that they have no conflict of interest.

Ethical approval Institutional Review Board approval was obtained.

Informed consent Informed consent was obtained from all individual participants.

Monticciolo DL, Newell MS, Moy L, Niell B, Monsees B, Sickles EA (2018) Breast Cancer Screening in Women at Higher-Than-Average Risk: Recommendations From the ACR. J Am Coll Radiol 15(3 Pt A):408–414. https://doi.org/10.1016/j.jacr.2017.11.034
Tohno E, Umemoto T, Sasaki K, Morishima I, Ueno E (2013) Effect of adding screening ultrasonography to screening mammography on patient recall and cancer detection rates: a retrospective study in Japan. Eur J Radiol 82(8):1227–1230. https://doi.org/10.1016/j.ejrad.2013.02.007
Ohuchi N, Suzuki A, Sobue T, Kawai M, Yamamoto S, Zheng YF, Shiono YN, Saito H, Kuriyama S, Tohno E, Endo T, Fukao A, Tsuji I, Yamaguchi T, Ohashi Y, Fukuda M, Ishida T, J-START investigator groups (2016) Sensitivity and specificity of mammography and adjunctive ultrasonography to screen for breast cancer in the Japan Strategic Anti-cancer Randomized Trial (J-START): a randomised controlled trial. Lancet 387(10016):341–348. https://doi.org/10.1016/S0140-6736(15)00774-6
Sarma EA (2015) Barriers to screening mammography. Health Psychol Rev 9(1):42–62. https://doi.org/10.1080/17437199.2013.766831
Wang J, Zheng S, Ding L, Liang X, Wang Y, Greuter MJW, de Bock GH, Lu W (2020) Is Ultrasound an Accurate Alternative for Mammography in Breast Cancer Screening in an Asian Population? A Meta-Analysis. Diagnostics (Basel) 10(11):985. https://doi.org/10.3390/diagnostics10110985
Zanotel M, Bednarova I, Londero V, Linda A, Lorenzon M, Girometti R, Zuiani C (2018) Automated breast ultrasound: basic principles and emerging clinical applications. Radiol Med 123(1):1–12. https://doi.org/10.1007/s11547-017-0805-z
American College of Radiology (2013) Breast Imaging Reporting and Data System, the 5th version. https://www.acr.org/Clinical-Resources/Reporting-and-Data-Systems/Bi-Rads. Accessed 26 June 2022
Gao L, Li J, Gu Y, Ma L, Xu W, Tao X, Wang R, Zhang R, Zhang Y, Wang H, Jiang Y (2022) Breast ultrasound in Chinese hospitals: A cross-sectional study of the current status and influencing factors of BI-RADS utilization and diagnostic accuracy. Lancet Reg Health West Pac 29:100576. https://doi.org/10.1016/j.lanwpc.2022.100576
Xie Y, Zhu Y, Chai W, Zong S, Xu S, Zhan W, Zhang X (2022) Downgrade BI-RADS 4A Patients Using Nomogram Based on Breast Magnetic Resonance Imaging, Ultrasound, and Mammography. Front Oncol 12:807402. https://doi.org/10.3389/fonc.2022.807402
Choi JS, Han BK, Ko EY, Ko ES, Shin JH, Kim GR (2016) Additional diagnostic value of shear-wave elastography and color Doppler US for evaluation of breast non-mass lesions detected at B-mode US. Eur Radiol 26(10):3542–3549. https://doi.org/10.1007/s00330-015-4201-6
Brewer NT, Salz T, Lillie SE (2007) Systematic review: the longterm effects of false-positive mammograms. Ann Intern Med 146(7):502–510. https://doi.org/10.7326/0003-4819-146-7-200704030-00006
Zagouri F, Sergentanis TN, Gounaris A, Koulocheri D, Nonni A, Domeyer P, Fotiadis C, Bramis J, Zografos GC (2008) Pain in different methods of breast biopsy: emphasis on vacuum-assisted breast biopsy. Breast 17(1):71–75. https://doi.org/10.1016/j.breast.2007.07.039
Yazici B, Sever AR, Mills P, Fish D, Jones SE, Jones PA (2006) Scar formation after stereotactic vacuum-assisted core biopsy of benign breast lesions. Clin Radiol 61(7):619–624. https://doi.org/10.1016/j.crad.2006.03.008
Choi EJ, Choi H, Park EH, Song JS, Youk JH (2018) Evaluation of an automated breast volume scanner according to the fifth edition of BI-RADS for breast ultrasound compared with hand-held ultrasound. Eur J Radiol 99:138–145. https://doi.org/10.1016/j.ejrad.2018.01.002
Tohno E, Umemoto T, Sasaki K, Morishima I, Ueno E (2013) Effect of adding screening ultrasonography to screening mammography on patient recall and cancer detection rates: a retrospective study in Japan. Eur J Radiol 82(8):1227–1230. https://doi.org/10.1016/j.ejrad.2013.02.007
Zhang X, Lin X, Tan Y, Zhu Y, Wang H, Feng R, Tang G, Zhou X, Li A, Qiao Y (2018) A multicenter hospital-based diagnosis study of automated breast ultrasound system in detecting breast cancer among Chinese women. Chin J Cancer Res 30(2):231–239. https://doi.org/10.21147/j.issn.1000-9604.2018.02.06
Berg WA, Bandos AI, Mendelson EB, Lehrer D, Jong RA, Pisano ED (2015) Ultrasound as the Primary Screening Test for Breast Cancer: Analysis From ACRIN 6666. J Natl Cancer Inst 108(4):djv367. https://doi.org/10.1093/jnci/djv367
Elverici E, Barça AN, Aktaş H, Özsoy A, Zengin B, Çavuşoğlu M, Araz L (2015) Nonpalpable BI-RADS 4 breast lesions: sonographic findings and pathology correlation. Diagn Interv Radiol 21(3):189–194. https://doi.org/10.5152/dir.2014.14103
Chen L, Chen Y, Diao XH, Fang L, Pang Y, Cheng AQ, Li WP, Wang Y (2013) Comparative study of automated breast 3-D ultrasound and handheld B-mode ultrasound for differentiation of benign and malignant breast masses. Ultrasound Med Biol 39(10):1735–1742. https://doi.org/10.1016/j.ultrasmedbio.2013.04.003
Lin X, Wang J, Han F, Fu J, Li A (2012) Analysis of eighty-one cases with breast lesions using automated breast volume scanner and comparison with handheld ultrasound. Eur J Radiol 81(5):873–878. https://doi.org/10.1016/j.ejrad.2011.02.038
Schiaffino S, Gristina L, Tosto S, Massone E, De Giorgis S, Garlaschi A, Tagliafico A, Calabrese M (2021) The value of coronal view as a stand-alone assessment in women undergoing automated breast ultrasound. Radiol Med 126(2):206–213. https://doi.org/10.1007/s11547-020-01250-7
Lin X, Jia M, Zhou X, Bao L, Chen Y, Liu P, Feng R, Zhang X, Zhu L, Wang H, Zhu Y, Tang G, Feng W, Li A, Qiao Y (2021) The diagnostic performance of automated versus handheld breast ultrasound and mammography in symptomatic outpatient women: a multicenter, cross-sectional study in China. Eur Radiol 31(2):947–957. https://doi.org/10.1007/s00330-020-07197-7
Jia M, Lin X, Zhou X, Yan H, Chen Y, Liu P, Bao L, Li A, Basu P, Qiao Y, Sankaranarayanan R (2020) Diagnostic performance of automated breast ultrasound and handheld ultrasound in women with dense breasts. Breast Cancer Res Treat 181(3):589–597. https://doi.org/10.1007/s10549-020-05625-2
Choi EJ, Lee EH, Kim YM, Chang YW, Lee JH, Park YM, Kim KW, Kim YJ, Jun JK, Hong S, on the behalf of the Alliance for Breast Cancer Screening in Korea (ABCS-K) (2019) Interobserver agreement in breast ultrasound categorization in the Mammography and Ultrasonography Study for Breast Cancer Screening Effectiveness (MUST-BE) trial: results of a preliminary study. Ultrasonography 38(2):172–180. https://doi.org/10.14366/usg.18012
Song SE, Yie A, Seo BK, Lee SH, Cho KR, Woo OH, Lee KY, Kim YS (2012) A prospective study about abnormal ductal dilatations without associated masses on breast US: what is the significance for us? Acad Radiol 19(3):296–302. https://doi.org/10.1016/j.acra.2011.10.021
Raza S, Goldkamp AL, Chikarmane SA, Birdwell RL (2010) US of breast masses categorized as BI-RADS 3, 4, and 5: pictorial review of factors influencing clinical management. Radiographics 30(5):1199–1213. https://doi.org/10.1148/rg.305095144
Hooley RJ, Scoutt LM, Philpotts LE (2013) Breast ultrasonography: state of the art. Radiology 268(3):642–659. https://doi.org/10.1148/radiol.13121606
Ouyang YL, Zhou ZH, Wu WW, Tian J, Xu F, Wu SC, Tsui PH (2019) A review of ultrasound detection methods for breast microcalcification. Math Biosci Eng 16(4):1761–1785. https://doi.org/10.3934/mbe.2019085
Patterson SK, Neal CH, Jeffries DO, Joe A, Klein K, Bailey J, Pinsky R, Paramagul C, Watcharotone K (2014) Outcomes of solid palpable masses assessed as BI-RADS 3 or 4A: a retrospective review. Breast Cancer Res Treat 147(2):311–316. https://doi.org/10.1007/s10549-014-3109-1
Ma L, Lian ZQ, Zhao YX, Di JL, Song B, Ren WH, Miao HZ, Wu JL, Wang Q (2021) Breast ultrasound optimization process analysis based on breast cancer screening for 1 501 753 rural women in China. Zhonghua Zhong Liu Za Zhi 43(4):497–503. https://doi.org/10.3760/cma.j.cn112152-20190828-00549
Dickerson LK, Rositch AF, Lucas S, Harvey SC (2017) Pilot Educational Intervention and Feasibility Assessment of Breast Ultrasound in Rural South Africa. J Glob Oncol 3(5):502–508. https://doi.org/10.1200/JGO.2016.008086
Sood R, Rositch AF, Shakoor D, Ambinder E, Pool KL, Pollack E, Mollura DJ, Mullen LA, Harvey SC (2019) Ultrasound for Breast Cancer Detection Globally: A Systematic Review and Meta-Analysis. J Glob Oncol 5:1–17. https://doi.org/10.1200/JGO.19.00127
Stewart KA, Navarro SM, Kambala S, Tan G, Poondla R, Lederman S, Barbour K, Lavy C (2020) Trends in Ultrasound Use in Low and Middle Income Countries: A Systematic Review. Int J MCH AIDS 9(1):103–120. https://doi.org/10.21106/ijma.294
Beyer T, Moonka R (2003) Normal mammography and ultrasonography in the setting of palpable breast cancer. Am J Surg 185(5):416–419. https://doi.org/10.1016/s0002-9610(03)00042-4
Posso M, Alcántara R, Vázquez I, Comerma L, Baré M, Louro J, Quintana MJ, Román M, Marcos-Gragera R, Vernet-Tomas M, Saladie F, Vidal C, Bargalló X, Peñalva L, Sala M, Castells X, BELE study group (2022) Mammographic features of benign breast lesions and risk of subsequent breast cancer in women attending breast cancer screening. Eur Radiol 32(1):621–629. https://doi.org/10.1007/s00330-021-08118-y
Vourtsis A, Kachulis A (2018) The performance of 3D ABUS versus HHUS in the visualisation and BI-RADS characterisation of breast lesions in a large cohort of 1,886 women. Eur Radiol 28(2):592–601. https://doi.org/10.1007/s00330-017-5011-9
Román M, Hofvind S, von Euler-Chelpin M, Castells X (2019) Long-term risk of screen-detected and interval breast cancer after false-positive results at mammography screening: joint analysis of three national cohorts. Br J Cancer 120(2):269–275. https://doi.org/10.1038/s41416-018-0358-5
Hofvind S, Sagstad S, Sebuødegård S, Chen Y, Roman M, Lee CI (2018) Interval Breast Cancer Rates and Histopathologic Tumor Characteristics after False-Positive Findings at Mammography in a Population-based Screening Program. Radiology 287(1):58–67. https://doi.org/10.1148/radiol.2017162159
Castells X, Torá-Rocamora I, Posso M, Román M, Vernet-Tomas M, Rodríguez-Arana A, Domingo L, Vidal C, Baré M, Ferrer J, Quintana MJ, Sánchez M, Natal C, Espinàs JA, Saladié F, Sala M, BELE Study Group (2016) Risk of Breast Cancer in Women with False-Positive Results according to Mammographic Features. Radiology 280(2):379–386. https://doi.org/10.1148/radiol.2016151174

Download PDF

Version 1

posted

You are reading this latest preprint version

The potential of adding mammography to HHUS and ABUS to reduce unnecessary biopsies in BI-RADS ultrasound category 4a: a multicenter hospital-based study in China

Status:

Version 1

Abstract

Figures

Introduction

Materials And Methods

Study population

Image Acquisition And Interpretation

Statistical analysis

Results

Distribution of benign and malignant lesions in category 3 and 4a

Clinical And Imaging Factors Associated With False-positive Lesions In Category 4a

Diagnostic Performance Of Adding Mam To Hhus And Abus

Discussion

Conclusions

Declarations

References

Status:

Version 1