Multi-Variate and Multi-dimensional CFAR Detection of Breast Cancer

doi:10.21203/rs.3.rs-2110232/v1

Download PDF

Research Article

Multi-Variate and Multi-dimensional CFAR Detection of Breast Cancer

https://doi.org/10.21203/rs.3.rs-2110232/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Breast cancer is the most common type of cancer in females. In many cases, the mortality rate can be drastically lowered if the disease is detected early. Due to its safety and lack of risk to the patient, microwave breast imaging is considered a potential replacement for mammography. This paper presents a breast cancer detection approach based on the Multi-Variate and Multi-Dimensional Constant False Alarm Rate (MVMD-CFAR) method. This method has several advantages over mammography using x-rays, including increased patient comfort and lower costs. On an open-source experimental database derived from the University of Manitoba Microwave Mammography Dataset UM-BMID, the performance of the (2D-CFAR) method is evaluated by examining the available data set for breast microwave sensing. We segregate infected and healthy samples and assessed the probability density function PDF for pictures of normal and malignant tissue. The third dimension of the algorithm is the image's color data, which comprises three variables (three colors). Initial testing show that the MVMD-CFAR detector is highly effective, with a detection probability of 97.4% and a false alarm probability of 10%. However, a few challenges must be overcome before this imaging technique can reach its full potential and be implemented in clinical settings.

Breast Cancer Detection

Probability Density Function

MVMD-CFAR

Probability of False Alarm

Detection Probability

As stated by the American Cancer Society, breast cancer is the most common malignancy diagnosed in women. About 287,850 new instances of breast cancer are expected to be identified in the United States in 2022, with 43,250 individuals estimated to die from the disease [1]. Medical science's current cancer treatment is only effective if it begins within the early stages of the disease. As a result, the focus is on cancer identification early on. For cancer detection, X-ray mammography, magnetic resonance imaging (MRI), ultrasound, microwave imaging, and other current medical diagnosis techniques are used [2]. However, these procedures have several disadvantages, such as a high rate of missed detection and ionizing radiation in mammography. An alternative to conventional methods for detecting breast tumors, microwave imaging is non-ionizing, non-invasive, non-expensive, and non-ionizing [3].

Microwave imaging has already been presented as a potential tool for the earlier detection of breast cancer. They propose a method for statistical microwave imaging [4]. A series of generalized likelihood ratio tests are performed on microwave scattering data to locate solid scatterers' locations and presence, such as breast cancer tumors. The GLRT assumes Gaussian backscatter data with a specified covariance matrix. They show how to form a GLRT for various 2-dimensional phantoms by estimating the covariance matrix offline, a numerical phantom in 3-dimensional with a natural one-half elliptic form, and a 3-dimensional experimental phantom.

The presence of modelling noise due to significant amounts of scattering, which severely degrades the efficiency of estimation and detection methods, is one of the most challenging aspects of diagnosing possible breast tumors. As a result, they present a simple parametric inverse 3D model that allows them to detect the presence of a tumor and estimate its size and position. It is believed that using parametric models in the equation could increase accuracy. Parametric models, in general, can amplify modelling noise [5]. It is believed that a well-defined parametric model can be valuable if a clinical examination is based on fewer factors that can be made by building a three-dimensional finite element model of electromagnetic wave propagation through breast tissue. The model chronology is concluded as follows. First, the model is based on the size and location of the tumor, and it is presumed that the tumor's permittivity can be modelled using a Gaussian probability density function PDF. Secondly, the probability density and likelihood function can be calculated for measured data (power received by antennas). Finally, the likelihood of the unknown parameters is maximized. In [6], the authors have used a cell-averaging CFAR detector that automatically detects secretions in fundus imaging comparable to the methodology used in this study. The CA-CFAR detector estimates the noise level surrounding a pixel under test (PUT) by computing the mean magnitude of pixels in a "background" window. In radar imaging, the CFAR detector is widely employed to detect bright objects when the background clutter changes dramatically from scene to scene. Similarly, the CFAR detector overcomes the challenge of detecting exudate lesions in RGB and multispectral fundus pictures when the background clutter frequently varies in luminance and texture.

A group of researchers at the University of Manitoba have provided an open-access testing dataset for microwave breast sensing https://github.com/TysonReimer/itDAS, which is derived from numerical models from [7]. UM-BMID provides information for 3D-printed breast illusions from 1257 MRI-based scans, setting the foundation for large-scale (BMI) method evaluations. This paper Utilizes the data (UM-BMID) and isolates the infected and healthy samples, then estimates the normal and cancerous tissue image's probability density function. It is well known that the PDF of normal tissue is not similar everywhere. Different PDFs should be considered for the tissue near the chest, the tissue near the nipple, and the parts in between. A 2D-CFAR can be used in this case. In 2D-CFAR, the regular tissue PDF is estimated near the suspicious region; therefore, the model is usually more accurate.

The remainder of this work is structured as follows: The model and database are presented in Section 2. Statistical detection methods are stated in Section 3. The CFAR procedure is shown in Section 4. Section 5 explains the simulation results, and Section 6 concludes the paper.

UM-BMID phantom scans were performed using a preclinical radar-based microwave breast imaging system. A vector network analyzer (VNA) provides the frequency of the stepped continuous-wave signal on 1001 points across the UWB range of frequency 1 ~ 8 GHz, where the antenna model used to conduct the survey is a double-ridged horn antenna in the imaging system [8]. The system can scan monostatically with one antenna or bistatically with two antennas. During a scan, antennas are affixed on a platform and rotate 72 times within a circular path and S-parameters are measured at all 72 locations. During bistatic operation, the separation between the two antennas is sixty degrees. The antenna's trajectory can be modified depending on the breast's size. The antenna radius of all UM-BMID scans varied between 20 and 23 centimeters, and the trajectory could be changed based on breast size [8]. For the purpose of reconstructing images of each phantom scan, the beamformers Delay-Multiply-and-Sum (DMAS) ,Delay-and-Sum (DAS) ,iterative Delay-Multiply-and-Sum (itDMAS) ,and Iterative Delay-and-Sum (itDAS) were utilized. Every beamformer generates a total of 12 reconstructions that displays recognizable tumor responses and only these reconstructions are taken into account for comparison. This paper has relied on this data and applied statistical methods to get the results.

The most well-known detection methods are maximum likelihood, Bayesian, Minimax and CFAR. All four methods are widely used to make the best decision whenever the PDF of noise and signal plus noise are known. It is known that any wrong decision introduces some additional cost for the decision-maker. Therefore, all decision methods aim to minimize this cost in different aspects. When the cost of each decision and the prior probability of each event is known, then the final decision rule is as follows [9]:

$${L}\left({Y}\right)\triangleq \frac{{{p}}_{{y}}\left[{y}\right|{{H}}_{1}] }{{{p}}_{{y}}\left[{y}\right|{{H}}_{0}]} \begin{array}{c}\begin{array}{c}{{H}}_{1}\\ >\end{array}\\ \begin{array}{c}<\\ {{H}}_{0}\end{array}\end{array} \frac{{{P}}_{0}\left({{C}}_{10}-{{C}}_{00}\right)}{{{P}}_{1}\left({{C}}_{01}-{{C}}_{11}\right)} \triangleq {\eta }$$

Here $y$ is the observed value(s), and $\text{L}\left(\text{y}\right)$is named the likelihood function. Also, ${p}_{y}\left[\text{y}\right|{H}_{0}]$and ${p}_{y}\left[\text{y}\right|{H}_{1}]$are the conditional probability density functions of $y$ given ${H}_{0}$ and ${H}_{1}$, respectively. We indicate that using ${\eta }$ is a precomputable threshold determined from priori probabilities and costs on the right side. We have a set of the expenses of the kind ${C}_{ij}$ that corresponds to the cost of deciding $H$ and a priori probability ${P}_{1.2}$. This sums up our knowledge of the applicable hypothesis before the availability of any observable data.

where:

Table 1

different costs and their definition
Symbol	The expense of examinations and treatment in case of
${C}_{11}$	Suspicion of a tumor, and the result is positive.
${C}_{10}$	Suspicion of a tumor, and the result is negative.
${C}_{01}$	If the presence of the tumor is not suspected and the result is positive.
${C}_{00}$	If the presence of the tumor is not suspected and the result is negative.

Also, ${P}_{1}$ is the prior possibility of a cancerous case, and ${P}_{0}$ is the prior probability of an ordinary (non-cancerous) case.

Researchers have discovered that one in every eight women in the world, or approximately 13%, may acquire breast cancer at some time in their lives [10]. This percentage is very high and demonstrates how dangerous the situation is when a woman develops breast cancer. As a result, it appears that ${P}_{1}$ should be 12.5% and ${P}_{0}$ should be around 87.5%. Let us consider the age of onset of breast cancer between 15 and 65 years old and the duration of treatment to be about ten years. The probability of encountering cancerous tissue (${P}_{1}$) in each observation is around 2.5%, and thus the ${P}_{0}$ is equal to 97.5%.

Previous studies showed that the average breast cancer health care screening costs [2], the average cost of subsequent testing and mammography screenings was $353, denoting ${C}_{01}$ . In a retrospective study of treatment costs in a group of patients, Blumen and colleagues show how relatively high the cost of treatment was for patients with more spread cancer when the Diagnosis was not made early. According to their findings, the average authorized price for a patient with stage 4 disease is $182,655, or ${C}_{10}$ , compared to $71,909 for patients with stage 1 disease diagnosed quickly here, or ${C}_{11}$ . We will assume that ${C}_{00}$ is zero because there is no treatment cost if no cancer is detected [11].

In detection theory, our goal is to minimize the mathematical expectation of cost; we have four types of expenses (${C}_{11}$, ${C}_{10}$, ${C}_{01}$, ${C}_{00}$) and two priori probabilities (${P}_{0}$, ${P}_{1}$)

Based on these, optimal decisions should be taken and there are three different ways to do this which are summarized below.

Maximum Likelihood Detection (MLD) Algorithm: If we do not have information on costs and possibilities, use MLD and compare the threshold value with 1.
Bayesian Algorithm: This method needs to know the costs and priori probabilities to use this formula to cause the lowest cost, according to the abovementioned data.

$${\eta }\triangleq \frac{{{P}}_{0}\left({{C}}_{10}-{{C}}_{00}\right)}{{{P}}_{1}\left({{C}}_{01}-{{C}}_{11}\right)}= \frac{0.875\left(353-0\right)}{0.125\left(182.655-71.909\right)}=0.0223$$

It is observed that the threshold will also change when the probabilities are changed.

Minimax Algorithm: This method is used when the cost of treatment is known, Nevertheless, due to unavailability of priori probability, the worst probability for ${P}_{0}$should be chosen.

The Constant False Alarm Rate (CFAR) is a fundamental detection technique applied to the received signal. The CFAR detector block implements two-dimensional image data. When the value of an image cell exceeds a threshold, the target is declared present. On the other hand, in real-world contexts, the power of noise and clutter is a non-stationary random process that changes with time. A fixed threshold is applied to actual data will result in many false alarms. Furthermore, the desired probability of a false alert will not be achieved. CFAR is meant to keep the chance of a false warning from background noise or clutter at a constant level. To define a detection threshold that adapts to the power level of noise or clutter, the CFAR estimates the background power level from the surrounding samples.

The picture of the breast tissue has three colors (Red, Green and Blue). It can be assumed that the color is a Vector of three elements. It has a joint probability density function under ${H}_{0}$ and ${H}_{1}$ hypotheses. Therefore, the likelihood function can be generated using the ratio of the PDF function under two hypotheses.

$${L}\left({Y}\right)=\frac{{f}\left({Y}|{{H}}_{1}\right)}{{f}\left({Y}|{{H}}_{0}\right)}$$

where $Y ={\left[{y}_{R} {y}_{G} {y}_{B}\right]}^{T}$ ranges from 0 to 250. All conventional detectors compare this likelihood function with a fixed threshold.

$${L}\left({Y}\right)\begin{array}{c}>\\ <\end{array}{\eta }$$

However, we should consider two additional topics.

First, it is well known that the PDF of the normal tissue is not similar everywhere. It is wise to consider different PDFs for the tissue near the chest, the one near the nipple, and the parts in between. In this case, a two-dimensional CFAR can be used. In two-dimensional CFAR, the normal tissue's PDF is estimated near the suspicious region. So, the model is generally more accurate so that the actual detection probability would increase. The concept of two-dimensional CFAR is shown in Fig. 1.

Here we want to work on the cell under test (CUT). To do so, we first disregard the cell near the CUT (named the guard cells) because they may be contaminated by the cancerous tissue like the CUT. Then the training samples next to the guard cells are considered to estimate the $f\left(Y\right|{H}_{0})$. It is recognized that the conventional CFAR methods are modified versions of the Neyman-Pearson test. In such a test, the property of $f\left(Y\right|{H}_{0})$ is considered only. Hence, the mainstream of the CFAR detector is, to some extent, different from what is needed in our problem. Here the abnormality means the different PDF for the CUT and the Training cells. However, we know that without a predefined model, PDF cannot be estimated of the CUT under a single observation. The PDF estimation using one sample is inaccurate even using a predefined model. Alternatively, know that if there is a cancerous tissue, it is usually not limited to one sample. However, it is a region. Therefore, we should develop a different version of 2D-CFAR, which is narrated as:

There are some samples in the region under test (RUT), and there are other samples in the training cells; and we want to decide whether the PDF of the RUT is the same as that of the training cells. Again, we encounter two other problems, and the first one is that the test should be non-parametric. The reason is that it is neither straightforward nor acceptable to suppose a predefined model for PDF of all tissue regions. In this case, only the non-parametric tests can be applied to such a problem. Here arises the second problem, wherefore the majority of the non-parametric test (e.g. Kolmogorov-Smirnov, Kendall $\tau$, Spearman $\rho$ tests) are constructed assuming one-dimension data. However, in our problem, the data (the colourful figure) has three dimensions. In this case, we need a multi-variate non-parametric test. Baringhaus and Franz [12] discussed a good choice for such a situation.

Assume that we have two sample sets $X$ and $Y$. All samples of these sets are n-dimensional. It is known that all samples in $X$ have the same distribution $\left(F\right)$ and all samples in $Y$ also have the same distribution $\left(G\right)$. Both $F$ and $G$ are unknown, and we want to decide whether $F$ equals $G$ or not. Defining $\left|\left|1-{Z}_{2}\right|\right|$ as the Euclidean distance between points ${Z}_{1}$ and ${Z}_{2}$ in the n-dimensional space, then Baringhaus and Franz in [12] have proven that:

$${E}\left|\left|{{X}}_{{i}}-{{Y}}_{{j}}\right|\right|-\frac{1}{2}{E}\left|\left|{{X}}_{{i}}-{{X}}_{{j}}\right|\right|-\frac{1}{2}\left|\left|{{Y}}_{{i}}-{{Y}}_{{j}}\right|\right|\ge 0$$

Here ${X}_{i}$ and ${X}_{j}$ are two randomly selected samples from $X$ set, and ${Y}_{i}$ and ${Y}_{j}$ are two randomly selected samples from $Y$ set. The equality holds if and only if $F = G$.

If there were infinite samples in the ordinary and suspicious region, it was possible to calculate the expectations in the above equation. However, as the samples are limited, we should calculate the statistical average and hold it as an approximation to the expectation.

Assume that there are $N$ samples ($X$ samples) in the training region and $M$ samples ($Y$ samples) in the RUT region. Then the above expectation is approximated as:

$${z}=\frac{1}{{M}{N}}\sum _{{i}=1}^{{N}}\sum _{{j}=1}^{{M}}\left|\left|{{X}}_{{i}}-{{Y}}_{{j}}\right|\right|-\frac{1}{2{{N}}^{2}}\sum _{{i}=1}^{{N}}\sum _{{j}=1}^{{N}}\left|\left|{{X}}_{{i}}-{{X}}_{{j}}\right|\right|-\frac{1}{2{{M}}^{2}}\sum _{{i}=1}^{{N}}\sum _{{j}=1}^{{N}}\left|\left|{{Y}}_{{i}}-{{Y}}_{{j}}\right|\right|$$

As the number of samples is finite, $z$ is a random variable. The expectation of $z$ under ${H}_{0}$ (same distribution) is zero, while it is a positive value under ${H}_{1}$.

Generally, it is not so simple to calculate the PDF of $z$. However, regarding the central limit theory, it is deduced that $z$ has approximately Gaussian distribution. Therefore, under this approximation, we have:

$\left\{\begin{array}{cc}{z}\tilde{N}(0.{{\sigma }}_{0}^{2})& {{H}}_{0}\\ {z}\tilde{N}({\mu }.{{\sigma }}_{1}^{2})& {{H}}_{1}\end{array}\right.$

(7)

It is known that $\mu >0$. It is well known the Ne for this problem has the following form:

$${z}\begin{array}{c}>\\ <\end{array}{\eta }$$

Again if the ${\sigma }_{0}$ is known, the threshold value can be set based on the acceptable $Pfa$ as:

$${\eta }={{\sigma }}_{0}{{G}}^{-1}\left(1-{P}{f}{a}\right)$$

Here ${G}^{-1}$ is the inverse CDF of the Gaussian distribution. However, the actual value of ${\sigma }_{0}$ is not known and the parameter can be estimated as:

$\widehat{{{\sigma }}_{0}^{2}}=\frac{1}{{{N}}^{2}}\sum _{{i}=1}^{{N}}\sum _{{j}=1}^{{N}}\left|\left|{{X}}_{{i}}-{{X}}_{{j}}\right|\right|$

(10)

Therefore it seems that the following statistic is independent of the ${\sigma }_{0}$value:

$${w}=\frac{\frac{1}{{M}{N}}\sum _{{i}=1}^{{N}}\sum _{{j}=1}^{{M}}\left|\left|{{X}}_{{i}}-{{Y}}_{{j}}\right|\right|-\frac{1}{2{{N}}^{2}}\sum _{{i}=1}^{{N}}\sum _{{j}=1}^{{N}}\left|\left|{{X}}_{{i}}-{{X}}_{{j}}\right|\right|-\frac{1}{2{{M}}^{2}}\sum _{{i}=1}^{{N}}\sum _{{j}=1}^{{N}}\left|\left|{{Y}}_{{i}}-{{Y}}_{{j}}\right|\right|}{\frac{1}{{{N}}^{2}}\sum _{{i}=1}^{{N}}\sum _{{j}=1}^{{N}}\left|\left|{{X}}_{{i}}-{{X}}_{{j}}\right|\right|}$$

Therefore, our proposed detector has the following form:

$${w}\begin{array}{c}>\\ <\end{array}{{\eta }}_{1}$$

It is straightforward to show that any change in the mean or standard deviation of the ${X}_{iz}$ cannot change the statistics of the w; consequently, the abovementioned detector has a constant false alarm rate.

Simulation Results

The ${P}_{D}$ is the probability that the target will be observed, given that the target is genuinely present. The ${P}_{fa}$ is the probability that the target will be detected by measurement when the target is not present.

The detector must determine whether a tumor is present at each pixel in a picture. Hypothesis ${H}_{0}$ describes the situation where the measurement y corresponds to a section of the image with no tumor. ${H}_{1}$ indicates the presence of a tumor. Estimates of the probability density functions can be used to calculate detection and false alarm probability as a function of the decision threshold. When a noise peak exceeds the threshold, it appears as if there is a signal when there is only noise.

The proposed algorithm is as follows:

The region under test (RUT) is defined as a rectangular region ${m}_{0}\times {n}_{0}$. ${X}_{i}{\prime }s$ is the samples’ name in this region.

The guard area is defined as an ${m}_{1}\times {n}_{1}$ rectangle surrounding the RUT.

Training samples are selected as a ${m}_{2}\times {n}_{2}$ rectangle surrounding the guard area. Samples are named as ${Y}_{i}{\prime }s$.

This simulation scenario is as follows: Begin with extracting images from the data set (UM-BMID), isolating tumor-containing and tumor-free images, and estimating the PDF for normal and cancer tissue images, where 104 images have been extracted and simulated, 52 of which were diagnosed with a tumor and 52 were normal tissue, where ${P}_{D}$ and ${P}_{fa}$ determined using the same number of samples.

Preliminary results indicate good performance; it is observed that for the detection threshold $T$, the lower the ${P}_{fa}$, the higher the detection threshold will be obtained, and vice versa. If we take the example of a ${P}_{fa}$ is 10%, we will get a value of 2 from the threshold $T$, and note that when the probability of a false alarm is reduced, a higher threshold $T$ is obtained, as shown in Fig. 2.

Calculating the ${P}_{D}$ is frequently the most important factor in determining the performance of our work. For example, the probability of a false alarm is 10%. Four values of 𝑅𝑈𝑇 were chosen to monitor the performance of produced probability. During all four values, the training region is set to a constant value which equals to TR = 10*10. The number of pixels of the region under test set to RUT = 1*1, 2*2, 3*3 and 4*4, that end up with the result of the CFAR algorithm produced a probability of detection of 97.4%, 97.2%, 96.5% and 95.9% respectively. To sum up, it is concluded that when the array of damaged pixels decreases, a higher detection probability will be achieved, because the accuracy of the pixels is greater, as shown in Fig. 3.

In the following case of our results, the probability of a false alarm is 10%. Four values of training region TR were chosen to monitor the performance of produced probability. During all four values, the RUT is set to a constant value which equals to 1*1. the number of pixels of the training region set to TR = 10*10, 11*11, 12*12 and 13*13, that end up with the result of the CFAR algorithm produced a probability of detection of 97.9%, 96.9%, 96.4% and 95.9% respectively. To sum up, it is concluded that when the array of healthy pixels increases, this will decrease the probability of detection due to the large array of healthy pixels, as shown in Fig. 4.

In the following case, we set the region under test $RUT=2\text{*}2$, and the training region $TR=10\text{*}10$, and we mix the Pixel of damage (PoD) and the Pixel of health (PoH) in the region under test, we conclude that when the greater the number of damaged pixels in the region under test gives better performance and achieves the highest probability of detection (97.2%) for the probability of false alarm (10%), as shown in Fig. 5.

Table 2 compares the results of the proposed algorithm (last row) and the various methods previously described in terms of detection probability. In [13], an artificial neural network (ANN) is constructed to determine whether a patient has breast cancer. Saritas performs well in terms of sensitivity or probability of detection but poorly in terms of specificity or probability of false alarm. In [14], a hybrid model of computational intelligence that is built on unsupervised learning methodologies, such as value complex neural networks (CVNN) and self-organizing maps (SOM), has been developed for accurate breast cancer detection. Shirazi produces relatively high levels of sensitivity and specificity, i.e. a high chance of detection probability even when the probability of a false alarm is low. Finally, in [15], which focuses primarily on the transformational learning process for breast cancer detection and where modified VGG (MVGG) has been proposed, Khamparia achieves high sensitivity and, therefore, a good detection probability. However, the sensitivity is somewhat diminished in the case of a low false-alarm probability. Through Table 2, it can be seen that our research methodology has achieved a completely successful performance compared to other proposed diagnostic methods by obtaining a very high detection probability even when the probability of false alarm is at its lowest.

Table 2 Results of the suggested work compared to other's work on detection probability
	${P}_{fa}$	0.1	0.3	0.5	0.7	0.9
Artificial Neural Networks (ANN)[13]	${P}_{D}$	75%	92%	96%	98%	100%
(SOM-CVNN)[14]	${P}_{D}$	90%	92%	95%	97%	99%
Hybrid Transfer Learning (Modified VGG 16)[15]	${P}_{D}$	82%	93%	96%	98.5%	99.5%
Proposed method (MVMD-CFAR)	${P}_{D}$	97.4%	98.3%	98.8%	99%	99.4%

This paper presents a method for detecting breast cancer. The proposed approach utilizes a constant false alarm rate detector strategy based on multi-dimensional and multi-variate for improving detection probability. We have three working regions where the region under test (RUT) is defined as a rectangular region ${m}_{0}\times {n}_{0}$ representing the damaged region. The protective region has been defined as an ${m}_{1}\times {n}_{1}$ rectangle surrounding RUT, which represents the protective cells as they may be contaminated with cancerous tissue such as RUT. As for the third region, which represents healthy tissues and fibers, the training samples were selected as a rectangle ${m}_{2}\times {n}_{2}$surrounding the guard area. It is acknowledged that if there is a cancerous problem, it is usually not limited to one sample. Nevertheless, it is a region. Thus, we developed a different version of 2D-CFAR where the 2D-CFAR algorithm's performance is evaluated using an open-source experimental database derived from the (UM-BMID). We isolated infected and healthy samples and estimated the probability density function PDF of the ordinary and cancerous tissue images by analyzing the Breast Microwave Sensing. The third dimension in our algorithm is the image's color data, which in turn has three variates (three colors). The results prove that it is superior in obtaining a high ${P}_{D}$ by making some changes, such as changing the size of the damaged and healthy area by varying the number of pixels and mixing between the pixels that carry the information of damaged tissues and healthy tissues.

It is suggested to do an additional test on a more comprehensive data set to acquire statistically significant results. In addition, it would be good in the future to apply a different, advanced MVMD-CFAR algorithm to the same data set and compare its performance.

Ethical Approval

As we have only used simulated data, the ethical reproval is irrelevant.

Competing interests

To the extent of our knowledge by the time there is no competing interest regarding the current state of our work. May be in future when we have more progress, we should reconsider it again.

Authors' contributions

The main idea of multivariate CFAR detection is developed by Yaser Norouzi. Azhar Albaaj has completed this issue and he has also performed all the simulations which are presented in the paper. Gholamreza Moradi has revised the paper and help the first and the second author to improve the quality of the paper

Funding

No funding is used to complete this research

Availability of data and materials

The database used for simulations are available upon request via email to the corresponding author: [email protected]

Reimer, T., M. Solis-Nepote, and S. Pistorius, The application of an iterative structure to the delay-and-sum and the delay-multiply-and-sum beamformers in breast microwave imaging. Diagnostics, 2020. 10(6): p. 411. https://doi.org/10.3390/diagnostics10060411
Heller, S.L. and L. Moy, Breast Cancer Screening and Health Care Costs. JAMA internal medicine, 2020. 180(11): p. 1552-1553. https://doi.org/10.1001/jamainternmed.2020.2374
Yun, X., et al., Compact antenna for radar-based breast cancer detection. IEEE Transactions on Antennas, 2005. 53(8): p. 2374-2380. https://doi.org/10.1109/TAP.2005.852308
Davis, S.K., et al., Ultrawideband microwave breast cancer detection: a detection-theoretic approach using the generalized likelihood ratio test. IEEE transactions on biomedical engineering, 2005. 52(7): p. 1237-1250. https://doi.org/10.1109/TBME.2005.847528
Jeremic, A. and E. Khosrowshahli. Bayesian Estimation of Tumours in Breasts Using Microwave Imaging. in Except From the Proceedings of the 2012 COMSOL Conference in Boston. 2012.
Khanna, M. and E. Kapoor. Detection of exudates in fundus imagery using a constant false-alarm rate (CFAR) detector. in Radar Sensor Technology XVIII. 2014. SPIE. https://doi.org/10.1117/12.2068216
Burfeindt, M.J., et al., MRI-derived 3-D-printed breast phantom for microwave breast imaging validation. IEEE antennas wireless propagation letters 2012. 11: p. 1610-1613. https://doi.org /10.1109/LAWP.2012.2236293
Reimer, T., J. Krenkevich, and S. Pistorius. An open-access experimental dataset for breast microwave imaging. in 2020 14th European Conference on Antennas and Propagation (EuCAP). 2020. IEEE. https://doi.org/10.23919/EuCAP48036.2020.9135659
Willsky, A.S., G.W. Wornell, and J.H. Shapiro, Stochastic processes, detection and estimation. J Course notes for MIT, 2003. 6: p. 109.
Abbas, S., et al., BCD-WERT: a novel approach for breast cancer detection using whale optimization based efficient features and extremely randomized tree algorithm. J PeerJ Computer Science, 2021. 7: p. e390. https://doi.org/10.7717/peerj-cs.390
Blumen, H., K. Fitch, and V. Polkus, Comparison of treatment costs for breast cancer, by tumor stage and type of service. American health & drug benefits, 2016. 9(1): p. 23.
Baringhaus, L.a.C.J.S.S.F., Rigid motion invariant two-sample tests. Statistica Sinica, 2010. Vol. 20, No. 4 p. pp. 1333-1361.
Saritas, I., Prediction of breast cancer using artificial neural networks. Journal of Medical Systems, 2012. 36(5): p. 2901-2907. https://doi.org/10.1007/s10916-011-9768-0
Zadeh Shirazi, A., S.J. Seyyed Mahdavi Chabok, and Z. Mohammadi, A novel and reliable computational intelligence system for breast cancer detection. Medical biological engineering computing. 2018. 56(5): p. 721-732. https://doi.org/10.1007/s11517-017-1721-z
Khamparia, A., et al., Diagnosis of breast cancer based on modern mammography using hybrid transfer learning. Multidimensional systems signal processing. 2021. 32(2): p. 747-765. https://doi.org/10.1007/s11045-020-00756-7

No competing interests reported.

Download PDF

Version 1

posted

You are reading this latest preprint version

Multi-Variate and Multi-dimensional CFAR Detection of Breast Cancer

Status:

Version 1

Abstract

Figures

1 Introduction

2 The Model And Database

3 Statistical Detection Methods

4 Cfar Procedure

5 Conclusion

Declarations

References

Additional Declarations

Status:

Version 1

Symbol	The expense of examinations and treatment in case of
\({C}_{11}\)	Suspicion of a tumor, and the result is positive.
\({C}_{10}\)	Suspicion of a tumor, and the result is negative.
\({C}_{01}\)	If the presence of the tumor is not suspected and the result is positive.
\({C}_{00}\)	If the presence of the tumor is not suspected and the result is negative.

	\({P}_{fa}\)	0.1	0.3	0.5	0.7	0.9
Artificial Neural Networks (ANN)[13]	\({P}_{D}\)	75%	92%	96%	98%	100%
(SOM-CVNN)[14]	\({P}_{D}\)	90%	92%	95%	97%	99%
Hybrid Transfer Learning (Modified VGG 16)[15]	\({P}_{D}\)	82%	93%	96%	98.5%	99.5%
Proposed method (MVMD-CFAR)	\({P}_{D}\)	97.4%	98.3%	98.8%	99%	99.4%