Automated detection of oil spills in images: combining a novel feature extraction technique based on the q- Exponential distribution with machine learning models

doi:10.21203/rs.3.rs-2263261/v1

Download PDF

Research Article

Automated detection of oil spills in images: combining a novel feature extraction technique based on the q- Exponential distribution with machine learning models

https://doi.org/10.21203/rs.3.rs-2263261/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Oil spills are harmful, with negative environmental, social, and economic consequences. Generally, a risk-based framework involves preventing, detecting, and mitigating these undesirable events. Regarding detection, rapid oil spill identification is essential for mitigation, which fosters the use of automated procedures. Usually, automated oil spill detection involves radar images, computer vision, and machine learning techniques for classification. In this work, we propose a novel feature extraction method based on the q-Exponential probability distribution, named q-EFE. Such a model is suitable to account for atypical extreme pixel values, as it can have the power-law behavior. The q-EFE is combined with machine learning (ML) models, comprising a computer vision methodology to automatically classify images as “with oil spill” or “without oil spill”. We used a public dataset with 1112 Synthetic Aperture Radar (SAR) images to validate our methodology. Considering the proposed q-Exponential-based feature extraction, the SVM and XGB models outperformed deep learning models, including a ResNet50 one, and LBP and GLCM techniques for the biggest dataset size. The obtained results suggest that the proposed q-EFE can extract complex features from SAR images. Combined with ML models, it can perform image classification with satisfactory balanced accuracy.

q-Exponential distribution

Feature extraction

Machine learning

Computer vision

Oil spills

Risk analysis

Oil spills are environmental disasters provoked by human activities. They are described as the release of a liquid petroleum hydrocarbon into the environment, especially marine areas (Briggs and Briggs 2018), originated in refineries, oil tankers that have an accident or “clean” their tanks in the ocean, and operative discharge from ships (Huz, Lastra, and López 2018; Brekke and Solberg 2005). Indeed, considering the pollution by liquid petroleum, the illicit outflow of ballast and tank-cleaning oily residues from oil tankers and ships are the main causes contributing to the contamination of seas and oceans (Huz et al. 2018; Ansell et al. 2001). Oil spills have major economic, ecological, and social impacts (Negreiros et al. 2022), which are costly to companies due to the waste of spilled oil and fines imposed by the government for pollution (Singh et al. 2020; Krata and Jachowski 2020; Krestenitis et al. 2019b; Beyer et al. 2016). Indeed, as an ecological concern, the disasters lead to consequences affecting much of the natural marine environment (Krata and Jachowski 2020), (Krestenitis et al. 2019a) and even human health (Webler and Lord 2010; D’Andrea and Reddy 2018). Thus, early detection and immediate warning of the oil spill become crucial to attenuate the environmental consequences, control oil dispersion, and ensure human lives are not in danger (Ribeiro et al. 2020).

In the last decades, oil spills accident increased considerably, such as in well-known disasters of Amoco Cadiz (France, 1978), Exxon Valdez (Alaska – USA, 1989), “GulfWar” (Kuwait, 1991), Aegean Sea in (Spain, 1992), Erica (France, 1999), the Prestige (Spain and France, 2002), and British Petroleum platform Deepwater Horizon (Mexico 2010; Topouzelis 2008). In 2019, a huge oil spill on the Brazilian coast (Araújo et al. 2020) emerged, impacting the environment, tourism business, and fishermen. In total, 1009 affected locations were identified across 130 municipalities in all nine northeastern states and two southeastern states only five months after this tragic event (Ribeiro et al. 2020).

Safety barriers are necessary to prevent such accidents; they may be related to preventative measures (i.e., avoid the accident) or consequence reduction measures (i.e., after the accident happen). This work involves techniques useful to the second group once the intention is to rapidly detect the oil spill to mitigate its disastrous consequences since a considerable time gap between the oil spill incident and the cleaning procedure generally accentuates the negative impacts (Bubbico et al. 2020).

In this context, using images to detect oil spills is mainly separated into two approaches: manual/visual and automated (Røed and Bjerga 2017). In manual detection, most of the process is made by humans, in which contextual information such as oil rigs’ and pipelines’ location, wind speed, and direction are important (Brekke and Solberg 2005). However, this approach is time-consuming and labor-intensive due to the large number of images to be analyzed in a short period for effective oil spill monitoring (Shu et al. 2010). In addition, manual detection is highly dependent on the knowledge and experience of operators, which are subjective. According to Jiao et al (2018), as manual detection cannot rapidly detect oil spills, enterprises’ operating costs remain high, and their detection methods hardly prevent oil pollution.

In turn, automated oil spill detection automatically identifies patterns in the images to classify them. Nevertheless, a well-known problem in detecting oil spills is its resemblance to a natural ocean phenomenon called “look-alikes” (e.g., currents, eddies), which, as the oil, appear like dark spots in images (Krestenitis et al. 2019b; Topouzelis 2008). Additionally, oil spills are often especially dark due to the remote sensing radars mounted at a distance from the target area (e.g., in satellites and aircraft). According to Kerf et al. (2020), during the nighttime, the odds of detecting an oil spill lowers significantly since not every part of the water is illuminated. Even so, an efficient automatic oil spill detection system is generally faster, cheaper, and more reliable than a manual system (Shu et al. 2010).

Automatic image feature extraction techniques involving computer vision (CV) approaches have become common because of their efficiency and practical applications (Huang et al. 2021; Liu et al. 2021; Ghahremani et al. 2021; Xiao et al. 2021). Distinct methods may be applied to extract features and detect oil spills from images efficiently. In this context, features are small newsworthy, descriptive, or informative patches in images (Mahony et al. 2020). For example, Local Binary Pattern (LBP) (Ojala et al. 2002), Gray Level Co-occurrence Matrix (GLCM) (Haralick et al. 1973), and Local Tetra Patterns (LTrP) (Murala et al. 2012) are well-known feature extraction methods already applied in different areas to excerpt important small features to be inputted into classification models. Likewise, methods based on DL, such as convolutional neural networks (CNNs) (Li et al. 2021), are increasingly useful and, according to Mahony et al. (2020), usually improve prediction results using big data and abundant computing resources. Therefore, depending on the application, traditional CV techniques with global features using the context of the image as a whole (Murphy et al. 2006) are an important solution compared to DL-based methods (Mahony et al. 2020; Zhang et al. 2021; Nikan et al. 2021), such as identifying the objects in an image (Marcus 2018).

The number of publications englobing oil spills has increased over the last 20–30 years (Vasconcelos et al. 2020), with a similar trend for works proposing methods to automate the detection of oil spills. Most use CV techniques combined with machine learning (ML) (Amato et al. 2022) models or apply DL (Ahmed et al. 2022) for feature extraction and oil spill detection. For instance, Xu et al. (2020) proposed a model that uses only ML methods. The authors used CV techniques in the preprocessing, capturing morphological features of targets and support vector machines (SVM) to classify the wave information about oil spills detected by a local adaptive threshold and displayed on an electronic chart based on a geographic information system (GIS). Mera et al. (2017) use a CV system step of feature extraction, comprising the computation of 52 types of features (geometrical, textural, and physical). They applied five feature selection (FS) methods to improve the feature set. Indeed, the FS methods are preprocessing techniques for discarding features with minor impact, resulting in a reduced set of relevant features (Guyon et al. 2006). The selected features feed an SVM model, which indicates the presence or absence of oil spills in images. Also, Singha et al. (2013) performed the CV feature extraction by combining traditional and polarimetric features for object-based oil spills and look-alike discrimination. They applied a multilayer perceptron (MLP) model for image segmentation and feature classification.

Regarding DL, Chen et al. (2017) compared the results of Stacked Auto-Encoders (SAE) and Deep Belief Networks (DBN) with results achieved by SVM and MLP. Gallego et al. (2018) used a deep neural autoencoder to segment oil spills from Side-Looking Airborne Radar (SLAR) imagery. Jiao et al. (2018) proposed a CNN followed by two post-processing steps (filtering and detection box) to improve accuracy. Besides, they proposed a methodology utilizing unmanned aerial vehicle (UAV) images to inspect the areas of interest. Cantorna et al. (2019) applied clustering, logistic regression, and CNN models to detect oil spills in images. Krestenits et al. (Krestenitis et al. 2019a) combined a deep CNN and Synthetic Aperture Radar (SAR) imagery to perform a multi-classification, including oil spills, look-alikes, land areas, ships, and sea surfaces. Kerf et al. (2020) proposed a framework based on UAV, thermal infrared (IR) camera, and CNN. Zeng and Wang (2020) proposed a CNN, named the Oil Spill Convolutional Network (OSCNet), to detect oil spills in SAR imagery.

In this work, we propose a new feature extraction method, based on the q-Exponential distribution, capable of obtaining complex information from SAR images and detecting the presence of oil spills. It is a distribution-based feature extractor, which means that this method uses a probabilistic distribution model to extract features from images. The q-Exponential is a probabilistic model that stems from the Tsallis non-extensive entropy (Tsallis 1988). It is already used, for example, in reliability engineering (Negreiros et al. 2020; Sales Filho et al. 2016), finance (Ludescher and Bunde 2014), and urban agglomeration (Malacarne et al. 2002). An important characteristic of this distribution is its ability to model rare events due to its power-law behavior (Picoli et al. 2003). As the oil spills are generally small portions of the images, they can be seen as “rare events”. Although few studies correlate the Tsallis Non-Extensive Statistical Mechanics models with images (Ferraro et al. 2019), there is no study in which q-Exponential probability distribution has been used to extract features from images. We named this approach as q-Exponential feature extraction (q-EFE). The proposed q-EFE is coupled to a machine learning (ML) model to perform the classification task. Some ML models were tested (Support Vector Machine, Multilayer Perceptron, Extreme Gradient Boosting, Logistic Regression, and Random Forest) and compared. We also applied the well-known ResNet50 model, deemed successful for image recognition, three other DL-based models relying on CNN architectures, and two classical CV techniques, namely LBP and GLCM.

The remainder of this work is organized as follows: Section 2 presents the q-Exponential distribution and its related functions. Section 3 describes q-EFE, the novel feature extractor proposed in this article, whose output feeds the ML model. Section 4 presents the DL architectures used in this work for comparison purposes. Section 5 describes the oil spill-related SAR image dataset. Section 6 provides the results involving q-EFE, LBP, GLCM, each of them coupled to an ML model presented above and deep models (ResNet 50, and three CNN architectures). Finally, Section 7 brings the conclusion, limitations, and ongoing and future research.

The q-Exponential distribution has the following probability density function (PDF):

$\text{f}\left(\text{t}\right)=\frac{2-\text{q}}{{\eta }}{\text{e}\text{x}\text{p}}_{\text{q}}\left(-\frac{\text{t}}{{\eta }}\right)=\frac{2-\text{q}}{{\eta }}{\left[1-\frac{\left(1-\text{q}\right)\text{t}}{{\eta }}\right]}^{\frac{1}{1-\text{q}}},$

(1)

where, $\text{t}>0$, ${\eta }>0$ is the scale parameter, and $\text{q}<2$ determines the PDF shape and is the entropic index. When $\text{q}<1$, Eq. 1 has limited support with an upper bound that depends on ${\eta }$ and $\text{q}$, see Eq. 2:

The corresponding cumulative distribution function (CDF) is:

$t\in \left\{\begin{array}{c}\left(0,\infty \right), 1<q<2\\ \left(0,\frac{\eta }{1-q}\right), q<1\end{array}\right.$	(2)
$F\left(t\right)=1-{\left[{exp}_{q}\left(-\frac{t}{\eta }\right)\right]}^{2-q}=1-{\left[1-\frac{\left(1-q\right)t}{\eta }\right]}^{\frac{2-q}{1-q}}, t\ge 0$.	(3)

The CDF is a function that computes the probability of a random variable $T$ being less than or equal to a specific value $t$. Considering that the oil spills in images are often dark spots and that, in grayscale, dark pixels are close to zero, we expect pixels related to oil spills will have low CDFs, and pixels without oil spills will have greater CDFs. Based on the maximum likelihood method, which has some important properties (Schneider 2018) (such as asymptotic unbiasedness, strong consistency, and efficiency), for a given sample $t= \left({t}_{1}, {t}_{2}, \cdots , {t}_{n}\right)$ of pixel values, the q-Exponential likelihood function is given by

$$L\left(t|q,\eta \right)={\prod }_{i=1}^{n}\frac{2-q}{\eta }{\text{e}\text{x}\text{p}}_{q}\left(-\frac{{t}_{i}}{\eta }\right)={\prod }_{i=1}^{n}\frac{2-q}{\eta }{\left[1-\frac{\left(1-q\right){t}_{i}}{\eta }\right]}^{\frac{1}{1-q}}$$

The corresponding log-likelihood function is (Negreiros et al. 2020):

$$l\left(t|q, \eta \right)=n{ln}\left(\frac{2-q}{\eta }\right)+\frac{1}{1-q}{\sum }_{i=1}^{n}ln\left[1-\frac{\left(1-q\right){t}_{i}}{\eta }\right]$$

The q-Exponential has the characteristic of modeling very well rare events as it presents a heavy-tailed density function (power-law behavior) when the shape parameter is between 1 and 2. In this work, we consider the pixels of each $n \times n$ grayscale submatrix of a given image as q-Exponentially distributed. As the oil spill images are dark and the presence of look-alikes may mislead image classification, we expect that the power law behavior of the q-Exponential model would adequately capture the specific behavior of the oil spill-related pixels. Therefore, such a characteristic would aid the ML classifiers in performing their task.

The resolution of the maximum log-likelihood problem when $1 < q < 2$ is straightforward. However, we may still observe 𝑛 × 𝑛 submatrices of pixels related to 𝑞 < 1. In this situation, we may have a monotone log-likelihood function (Pianto and Cribari-Neto 2011), which forbids obtaining practical optimal parameter values. To handle both situations ($q < 1$ and $1 < q < 2$), we here use the methodology proposed by Negreiros et al. (2020) to estimate the parameters of the q-Exponential distribution. Specifically, they use a correction based on Firth’s method (Firth 1993) for the log-likelihood function when the monotone behavior is identified, enabling parameter estimation in those cases.

In this work, we propose an image feature extraction method based on the q-Exponential distribution (q-EFE) as a part of a CV methodology to assist in the automated detection of oil spills in oceans and seas. With possible customization, a traditional CV system (CVS) have the following steps: (i) input, (ii) pre-processing (optional), (iii) feature extraction, (iv) post-processing (optional), and (v) classification/segmentation (Kılıç et al. 2007). In this work, we use the q-EFE in the feature extraction step.

In our proposed CVS with q-EFE, data augmentation (DA) is a preprocessing step, and dimensionality reduction is a post-processing stage; they are both optional. DA is a technique to inflate the original training set with label-preserving transformations (e.g., flip, zoom, shear, and rotation) to increase the amount and diversity of data (Krizhevsky et al. 2012; Maior et al. 2021). Additionally, we consider principal component analysis (PCA) for dimensionality reduction. PCA captures the most relevant dataset information, which may improve data handling and interpretability for ML classifiers (Bro and Smilde 2014).

Our proposed q-EFE approach is explained in the following steps and is illustrated in Fig. 1:

First, the image is loaded in a chosen size $\left(p\times q\right)$. Then, it is converted to a grayscale representation.
Then, the zero-valued pixels of the image are replaced by one once the feature extraction process uses the q-Exponential log-likelihood maximization to estimate the distribution parameters, and a model restriction is related to the positivity of values (Eq. 2). We chose one to replace zero because it is the closest integer to the latter in grayscale (0, 1, 2, …, 255). Indeed, it is rather not possible to see the difference between these two tons of grey with an unaided eye.
Then, the choice of using DA or not is considered. If the dataset is too small or very imbalanced, the DA may improve the results.
The feature extraction (Fig. 2) starts on the resized grayscale with no null pixels in the image. In this step, we take $n \times n$ image patches (i.e., ${n}^{2}$ pixels), maximize the q-Exponential log-likelihood function using a numeric maximization method (such as Nelder-Mead (Nelder and Mead 1965)), as in Negreiros et al. (2020)) and compute the q-Exponential function Ø_i (e.g., PDF, CDF, entropy) of interest. We repeat this process for the entire image considering a stride size $\left(\varDelta \right)$. The output of this process is a feature vector with a size that depends on the image dimension, the patch size, and the stride.

The feature vector often presents a high dimension. For example, for $p=q=64$, $\varDelta =1$ and $n = 4$, the output feature vector has dimension 3721. In a general way, to compute the size of the feature vector $\left(M\right)$, the following formulas are given

If $\varDelta =n$ and $p=q$ (without overlapping),

$$M={\left(p/\varDelta \right)}^{2}$$

If $\varDelta <n$ and $p=q$ (with overlapping),

$$M=\left(\left(p-\left(n-1\right)\right)/\varDelta \right)$$

Hence, for dimensionality reduction, the feature vector undergoes PCA for obtaining the first $k$ principal components. After this, the reduced feature vector is ready to feed the machine learning classifier.

The ML models used in this work along with q-EFE are: MLP (Ramchoun et al. 2020), Random Forest (RF) (Breiman 2020), SVM (Campbell and Ying 2011), Logistic Regression (LR) (Pregibon 1981), and Extreme Gradient Boosting (XGB) (Chen and Guestrin 2016). All these ML methods were trained with grid search to set the best parameters; further information is presented in Section 6.

Once properly trained and validated, the generated model may be applied to a test set. In this case, the general steps of the proposed CV methodology to automate oil spill detection in images (Fig. 1 and Fig. 2) are: (a) resize and convert a tested input image, (b) extract features via q-EFE, (c) perform dimensionality reduction, and (d) process ML models to automatically classify the image as “with oil spill” or “without oil spill”.

For comparison purposes, we performed comparisons with two classical CV techniques. The first is the Grey Level Co-occurrence Matrix (GLCM) and six Haralick features (i.e., contrast, correlation, energy, homogeneity, dissimilarity, and ASM) (Haralick et al. 1973). The GLCM computes the frequency of pairs of pixels’ values (e.g., (0, 0), (0, 1), (0, 2), etc.) in a specified spatial relationship (e.g. horizontal). The GLCM properties of an image are expressed as a matrix with the same number of columns and rows as the grayscale range (e.g., 0 to 255), and then statistical measures can be extracted from this GLCM matrix (Sastry et al. 2012; Öztürk and Akdemir 2018).

The second one is the traditional Local Binary Pattern (LBP), originally proposed by Ojala et al. (1996), and four LBP variants (Liu et al. 2012). These methods have been applied to texture classification and analysis, image description, face recognition, signal processing (Liu et al. 2013; Suruliandi et al. 2012; Houam et al. 2014; Chatlani and Soraghan 2010; Guo et al. 2010)

The classical LBP is denoted by

$${LBP}_{p,r}=\sum _{n=0}^{p-1}s\left({x}_{r,n}-{x}_{\text{0,0}}\right){2}^{n} , s\left(x\right)=\left\{\begin{array}{c}1, x\ge 0\\ 0, x<0\end{array}\right.$$

where $x$ denotes the grayscale intensity, ${x}_{\text{0,0}}$ is the gray value of the central pixel and ${x}_{r,n}$ is the value of its neighbor $n$, $r$ is the radius of the considered circle, and $p$ is the number of pixels in the circle. The LBP variants consider central intensity, the neighboring intensity, and the radial difference in the grayscale image. They were proposed to overcome some drawbacks and limitations of the classical LBP, such as sensibility to image rotation, small spatial support, loss of local textural information, and high sensitivity to noise.

We compute four features for each of the four LBP images (i.e., contrast, correlation, energy, and homogeneity) based on (Wang et al. 2021). Also, we extracted four additional features from the LBP image (i.e., thresh out, entropy, local variance, and standard deviation) as proposed by (Davoudi et al. 2020). Table 1 provides their detailed description. The outcome of this process is a feature vector composed of twenty-four features.

Additionaly, we also applied a successful deep learning model for feature extraction and classification, the ResNet50 (He et al. 2016), which is pretrained with imageNet. It has been used in many knowledge areas, such as health (Sharma et al. 2022; Elpeltagy and Sallam 2021; Çinar et al 2020), agriculture (Mukti and Biswas 2019), cognitive computing (Chu et al. 2020), satellite image classification (Shabbir et al. 2021), and failure detection (Yang et al. 2020). Besides, we also considered three DL models based on CNN (Längkvist et al. 2014), which has also been used in many areas, such as COVID-19 diagnosis (Maior et al. 2021), oil spill detection (Zeng and Wang 2020), and image segmentation (Chen et al. 2018). CNN architectures applied in this paper are presented in Fig. 3. Here, CNN_1 is the simplest, with three convolutional layers, three max-pooling layers, and three last dense layers. The max-pooling layers aggregate activations of spatial locations to produce a fixed-size vector in several CNNs. Each convolutional layer has 64 neurons, two dense layers have 128 neurons, and the last dense layer has two neurons (related to classes “with oil spill” and “without oil spill”). Also, CNN_2 has five convolutional layers, three max-pooling layers, and four last dense layers. There are 64 neurons in each convolutional layer. Three dense layers have 512 neurons, and the last one has two neurons. And, CNN_3 is more complex than the first and second ones. It has eight convolutional layers, five max-pooling layers, and four last dense layers. Each convolutional layer has 64 neurons. Three dense layers have 512 neurons, and the last one has two neuron.

The dataset considered in this work was made available by Krestenitis et al. (2019a) and Krestenitis et al. (2019b). It has the geographic coordinates and timestamps information about the pollution event provided by the European Maritime Safety Agency (EMSA) through the Clean Sea Net service. Besides, EMSA confirmed the presence of dark spots depicted in the SAR images as oil spills. The oil pollution records cover a period from 28 September 2015 up to 31 October 2017 while the SAR images were acquired from the Sentinel-1 European Satellite missions (Krestenitis et al. 2019a).

The dataset used is composed of 1112 images, where 873 images present oil spills and 239 do not. Each image has 1250 x 650 pixels. Figure 4 presents an example of an extracted SAR image accompanied by its respective ground truth mask in which the cyan color identifies the oil spill and other parts of interest, such as look-alikes (red), land (green), and sea surface (black) are presented. However, once this work deals with detecting images containing oil spills, only two classes were considered: images with oil spills and images without oil spills.

In this paper, we used the balanced accuracy $\left(BAc\right)$ to evaluate the classification results. The $BAc$ is a classification performance metric devised to account for imbalanced classes (Tharwat 2020). It is defined in Eq. 7:

$BAc=\frac{\left[\frac{TP}{TP+FN}+\frac{TN}{TN+FP}\right]}{2}$,

(7)

in which $TP$ represents the true positives, $TN$ is the true negatives, $FP$ is the false positives, and $FN$ is the false negatives. It is the average between sensitivity and specificity.

Table 2 presents the parameters used in the grid search for all the ML methods to get the best hyperparameters.

Table 1

Description of textural properties
Feature	Description	Formulation
Contrast	Intensity contrast between a pixel and its neighbor over the whole image.	$\sum _{i,j}{\left\|i-j\right\|}^{2}p\left(i,j\right)$
Correlation	Correlation between a pixel and its neighbor over the whole image.	$\sum _{i,j}\frac{\left(i-\mu i\right)\left(j-\mu j\right)p\left(i,j\right)}{{\sigma }_{i}{\sigma }_{j}}$
Energy	Sum of squared elements in the GLCM.	$\sum _{i,j}{p\left(i,j\right)}^{2}$
Homogeneity	The closeness between the distribution of elements in GLCM and its diagonal.	$\sum _{i,j}\frac{p\left(i,j\right)}{1+\left\|i-j\right\|}$
Thresh Out	The relative proportion of the oil spill objects and non-oil spill objects.	The threshold value for the Sobel edge detection method
Entropy	A statistical measure of randomness.	$-\sum _{p}q{{log}}_{2}q$
Local variance	The average local standard deviation of 3 × 3 neighborhood around each pixel in the image.	$stdfilt$ function in MATLAB divided by the number of pixels in the image
Standard Deviation	The standard deviation of all values.	$\sqrt{\frac{\sum _{i,j}{\left(p-\stackrel{-}{p}\right)}^{2}}{n}}$

We ran the q-EFE and LBP experiments in two steps: i) first, the feature extraction procedure in a local GPU GeForce RTX 2080 Ti (11GB) with 32 GB RAM, in Python computational language; ii) then, the classification step in a free Tesla K80 GPU provided by Google Colaboratory, also using Python. Also, in the second step, we used PCA to reduce the feature vectors to 20 components. We assessed other numbers of components (10, 40, 50, 100), but the results were inferior in all situations.

On the other hand, we trained the CNN models with TensorFlow and Keras libraries in Python, also running in a free Tesla K80 GPU provided by Google Colaboratory. The training and testing of the CNN models comprised 256x256 images and 100 epochs for training with an early stop if the validation loss (i.e., binary cross-entropy) did not improve in ten consecutive epochs. In this situation, the training stops, and the best weights are stored. We considered a learning rate of 0.001 and the “adam” optimizer. Other parameters were tested, such as different learning rates (0.1, 0.01, and decreasing rates) and optimizers (“sgd”, “rmsprop”, “adadelta”), but the results were inferior. Also, the batch size was equal to 32. And, the GLCM technique were run by the same Google Colaboratory. To run the GLCM experiments we set a distance of 2 and an angle of 0, other values were tested but the results were inferior.

We used the Python library ‘HParams’ to optimize the CNN hyperparameters to support the proposition of the specific CNN architectures (low, medium, and high complexity). Table 3 shows the two applied configurations for the proposed q-EFE approach (Fig. 1 and Fig. 2). Such configurations were chosen to test different image sizes and strides (with and without overlap). We used the CDF values to compose the feature vectors. As arguments of the CDF, the minimum, median, or maximum (i.e., order statistics) pixel of a given image patch would result in outputs always near 0, 0.5, or 1, respectively. Thus, as the mean is a central tendency summary statistic that considers magnitude, we used the mean of pixels’ values to represent the patch; it is the chosen argument for the CDF. Therefore, we expect to obtain varied CDF values, that is, feature values, throughout the entire image. No works used the CDF to extract features from images using the q-Exponential distribution. However, some papers also used probability distributions to extract features from images. For instance, Rodrigues et al. (2019) proposed the q-sigmoid function derived from non-extensive Tsallis statistics and used it for feature extraction in tasks of regions enhancement in ultrasound images. On the other hand, Marques et al. (2012) applied the ${\mathcal{g}}_{A}^{0}$ distribution to perform segmentation in SAR imagery to characterize image regions.

To account for performance variability, we repeated the experiment related to each model 10 times. Table 4 presents $BAc$ mean and standard deviation provided by the q-EFE approach, using configurations 1 and 2, and four data set sizes to train the models (200, 400, 1002, 1572). For the smallest training sets (i.e., sizes 200 and 400), we randomly chose an equal number of images from both classes from the original training data set of size 1002, with 786 images ‘with oil spill’ and 216 ‘without oil spill’. For the largest (i.e., 1572), we used DA to complete the training dataset, where 786 images have oil spills and 786 have not; thus, only the minority class (‘without oil spill’) was augmented. This DA process was performed in Python, using the Keras library (Ketkar 2017), performing rotation, zoom, flip, width shift range, height shift range, and shear range. All the training processes took 10% of the training data for validation. The mentioned training datasets do not include the test set, consisting of 110 images (88 labeled as ‘with oil spills’ and 22 as ‘without oil spills’). All the models analyzed considered the same test set. Besides, as we have an imbalanced test set, $BAc$ is used to evaluate the results.

Table 2

Parameters values for the grid search.
Method	Parameters
MLP	Hidden_layer_sizes: [50, 100, 500, 1000] Activation: [identity, logistic, tanh, relu] Solver: [lbgs, sgd, adam] Learning_rate: [(constant), (invscaling), (adaptive)]
RF	N_estimators: [10, 50, 100, 200, 500] Max_features: [auto, sqrt, log2, 5, 10, 30] Max_depth: [2, 8, 16, 32, 64, 128] Min_samples_split: [1, 2, 4, 8, 16, 24] Min_samples_leaf: [1, 2, 5, 10, 15, 30]
SVM	C: [0.1, 1, 100, 1000] Gamma: [auto, 1, 0.01, 0.0001] Kernel: [linear, rbf, sigmoid]
LR	Penalty: [l1, l2] C: [100, 10, 1, 0.1, 0.01, 0.001]
XGB	N_estimators: [100, 500, 1000] Learning_rate: [0.1, 0.05, 0.01] Max_depth: [2, 8, 16, 64, 128] Colsample_bytree: [0.3, 0.8, 1] Gamma: [0, 1, 5]

Firstly, we will comment on the results of q-EFE combined with the different ML models. Considering Table 4 and configuration 1, for 200 training images, RF presented the highest mean $BAc$ (69.79%), closely followed by SVM (69.34%). SVM presented the best performances for 400 and 1572 training images (70.22% and 74.68%, respectively). XGB achieved the best result for the imbalanced original dataset (1002 images) and was the second-best for the 1572-sized data set. LR, in turn, achieved the worst mean $BAc$ for 1572 images (62.63%). For 1002 images, the MLP had a very poor mean $BAc$ (50.00%), which means that the model was incapable of distinguishing the images as it considered all the test examples to pertain to the same class ‘with oil spill’. SVM had the lowest standard deviations for most situations, which means that this model is rather not sensitive to minor input variations. On the other hand, considering configuration 2, again, SVM was the method presenting the best performances, except for the imbalanced dataset (1002 training images). In this case, the XGB was again the best method. Additionally, configuration 1 was superior to configuration 2 since the best model, considering the latter achieved a mean $BAc$ only close to 70%, against a mean $BAc$ near 75% of the best model with the former.

Table 5 presents the results of LBP, GLCM, ResNet50, and CNNs. All these techniques were trained and tested with the same datasets used in the previous experiments involving the proposed q-EFE with ML models. The performances show that LBP provided the worst results in two situations (200 and 1002 images), and ResNet50 reached the worst results in the other two cases (400 and 1572 images). On the other hand, the methods with GLCM provided better results than LBP and DL (ResNet50 and CNNs) for all dataset sizes. XGB using 200 images provided the best GLCM result (74.06%), approaching SVM with q-EFE, configuration 1, and 1572 images (74.68%). Also, in Table 5, we do not report standard deviations related to SVM, LR, and XGB because we use the same input (feature vectors) and the same seed (random state) in the grid search step; the results for these methods do not change. Differently from MLP and RF, which are initialized randomly, and, even for the same input, the results can vary. For the results corresponding to the q-EFE approach, there is a variation for SVM, LR, and XGB because we perform a PCA before each round, and the input data may change. As q-EFE using configuration 1 and GLCM top-ranked, Fig. 5 presents their mean $BAc$. In general, the GLCM presented a higher mean $BAc$ for 200 and 400 images, and the q-EFE gave a higher mean $BAc$ for 1002 and 1572 images. However, the SVM with q-EFE configuration 1 for 1572 images (i.e., with DA) outperformed the other methods and data set sizes with the highest mean $BAc$, followed by XGB with q-EFE.

The generalization ability assessment is essential, as, in practice, new images may have unseen patterns in the training stage. In this sense, we evaluated the generalization of SVM and XGB using q-EFE, configuration 1, trained over the 1572-sized data set. In the grayscale conversion step (Fig. 1), we used a continuous grayscale instead of a discrete one to convert the images from the test set. We obtained a $BAc$ of 78.97% and 73.86% using SVM and XGB, respectively. Still, for the continuous grayscale and considering the regular accuracy, we obtained 82.72% using SVM and 66.36% with XGB. Thus, SVM showed superior performance in the generalization task than XGB.

This paper proposed a novel feature extraction method based on the q-Exponential distribution (q-EFE) combined with ML models, composing a CV methodology for automatically detecting oil spills in images. Five ML methods (MLP, RF, SVM, LR, and XGB) were applied, and two configurations (Table 3) were considered to validate the q-EFE novel approach. Such proposed image feature extractor slides the entire image, computing CDF values for each patch of 16 values. This technique proved to be interesting to extract complex features from oil spill SAR imagery. The SVM and XGB achieved the best results using q-EFE as a feature extractor.

Table 3

The two different configurations to extract features from images.
	Description	Feature vector size
Configuration 1	$q=p=64$ $\varDelta =1$ $n=4$	3721
Configuration 2	$q=p=256$ $\varDelta =4$ $n=4$	4096

Table 4

Test results.$BAc$ mean and standard deviation for $BAc$ obtained in 10 rounds, considering the four training set sizes and configurations 1 and 2 for the ML models combined to q-EFE. The best mean and standard deviation for each training data set size is highlighted in bold, and the worst is underlined.
Configuration 1
Dataset size
Model	200	400	1002	1572
MLP	0.6333 $\pm$ 0.0755	0.5657 $\pm$ 0.0355	0.5000 $\pm$ 0.0000	0.6679 $\pm$ 0.0284
RF	0.6979 $\pm$ 0.0173	0.6834 $\pm$ 0.0273	0.5197 $\pm$ 0.0237	0.6544 $\pm$ 0.0379
SVM	0.6934 $\pm$ 0.0037	0.7022 $\pm$ 0.0094	0.5043 $\pm$ 0.0137	0.7468 $\pm$ 0.0065
LR	0.6240 $\pm$ 0.0256	0.4804 $\pm$ 0.0260	0.5278 $\pm$ 0.0149	0.6263 $\pm$ 0.0125
XGB	0.6800 $\pm$ 0.0340	0.6724 $\pm$ 0.0204	0.7348 $\pm$ 0.0292	0.7399 $\pm$ 0.0123
Configuration 2
Dataset size
Model	200	400	1002	1572
MLP	0.6146 $\pm$ 0.0601	0.5980 $\pm$ 0.0583	0.5000 $\pm$ 0.0000	0.6298 $\pm$ 0.0230
RF	0.6635 $\pm$ 0.0260	0.6893 $\pm$ 0.0265	0.4984 $\pm$ 0.0208	0.5649 $\pm$ 0.0986
SVM	0.6652 $\pm$ 0.0506	0.7018 $\pm$ 0.0101	0.5178 $\pm$ 0.0566	0.7069 $\pm$ 0.0150
LR	0.6010 $\pm$ 0.0265	0.5506 $\pm$ 0.0358	0.5319 $\pm$ 0.0000	0.6360 $\pm$ 0.0122
XGB	0.6457 $\pm$ 0.0239	0.6684 $\pm$ 0.0423	0.6418 $\pm$ 0.0505	0.6821 $\pm$ 0.0205

Table 5

Test results. $BAc$ mean and standard deviation obtained in 10 replications of MLP and RF with LBP and GLCM, CNNs, and ResNet50; $BAc$ reached by SVM, LR, and XGB with LBP and GLCM. The best values for each training set size are in bold, and the worst are underlined.
	Dataset size
Model	200	400	1002	1572
MLP_LBP	0.5104 $\pm$ 0.0331	0.5015 $\pm$ 0.0050	0.5000 $\pm$ 0.0000	0.5747 $\pm$ 0.0673
RF_LBP	0.5000 $\pm$ 0.0000	0.5013 $\pm$ 0.0121	0.4994 $\pm$ 0.0018	0.5727 $\pm$ 0.0513
SVM_LBP	0.5000	0.5000	0.5000	0.6349
LR_LBP	0.5697	0.5422	0.5582	0.6509
XGB_LBP	0.5147	0.5697	0.5684	0.6496
MLP_GLCM	0.5125 $\pm$ 0.0397	0.5256 $\pm$ 0.0539	0.5056 $\pm$ 0.0165	0.5448 $\pm$ 0.0545
RF_GLCM	0.7215 $\pm$ 0.0136	0.7279 $\pm$ 0.0219	0.6360 $\pm$ 0.0185	0.6436 $\pm$ 0.0036
SVM_GLCM	0.5409	0.7031	0.5000	0.5000
LR_GLCM	0.6984	0.7258	0.5102	0.7321
XGB_GLCM	0.7406	0.6996	0.6234	0.6451
CNN_1	0.6873 $\pm$ 0.0219	0.6359 $\pm$ 0.0253	0.6325 $\pm$ 0.0300	0.6299 $\pm$ 0.0254
CNN_2	0.6725 $\pm$ 0.0621	0.6859 $\pm$ 0.0380	0.6557 $\pm$ 0.0483	0.5715 $\pm$ 0.0472
CNN_3	0.6559 $\pm$ 0.0573	0.5927 $\pm$ 0.0647	0.5473 $\pm$ 0.0679	0.5169 $\pm$ 0.0257
ResNet50	0.5159 $\pm$ 0.0266	0.4953 $\pm$ 0.0343	0.5110 $\pm$ 0.0186	0.4960 $\pm$ 0.0224

For comparison purposes, we used the same experimental setting and data set to apply two classical CV techniques (LBP and GLCM) combined with the above five ML models, a pre-trained ResNet50 with the ImageNet weights, and three CNN architectures with different complexities. Considering configuration 1, the proposed q-EFE combined with ML models, especially SVM and XGB, provided comparable or superior results to the other models.

The proposed CV methodology can be part of a comprehensive risk-based framework for oil spill prevention and mitigation. Specifically, the trained q-EFE + SVM model could be used online to support oil spill rapid detection, enabling the implementation of mitigating and recovery actions in due time to avoid oil spread on seas. There is a lack of available data sets related to oil spills, possibly because of the delicate nature of this data type, as it may relate to national security and environmental safety. Nevertheless, we are working on partnerships to access other data sets, test the proposed CV system based on q-EFE, and improve its performance.

We also intend to develop a multiclass study based on q-EFE to differentiate between look-alikes and oil spills, investigating their respective patterns as in (Moura et al. 2022; Huang et al. 2022; Conceição et al. 2021). Also, as a topic of ongoing research, we will evaluate its application in other contexts, e.g., health (radiological imagery to support diagnosis) and structural reliability (crack detection).

Acknowledgment

The authors thank the National Agency for Research (CNPq), the Foundation of Support for Science and Technology of Pernambuco (FACEPE), and ‘Coordenação de Aperfeiçoamento de Pessoal de Nível Superior’ (CAPES), Brazil - Finance Code 001 for the financial support through research grants.

Authors contribution

Ana Cláudia Souza Vidal de Negreiros: Conceptualization, investigation, validation, writing manuscript, review and editing. Isis Didier Lins: Conceptualization, review and editing. Caio Souto Maior: Conceptualization, review and editing. Márcio das Chagas Moura: Review and editing.

Ahmed AA, Masrur MHA, Sanjoy KS, Ahmed Oli, Sutradhar A (2022) Optimization Algorithms as Training Approach with Hybrid Deep Learning Methods to Develop an Ultraviolet Index Forecasting Model. Stochastic Environmental Research and Risk Assessment 36 (10): 3011–39. https://doi.org/10.1007/s00477-022-02177-3.
Amato F, Guignard F, Walch A, Mohajeri N, Scartezzini JL, Kanevski M (2022) Spatio-Temporal Estimation of Wind Speed and Wind Power Using Extreme Learning Machines: Predictions, Uncertainty and Technical Potential. Stochastic Environmental Research and Risk Assessment 36 (8): 2049–69. https://doi.org/10.1007/s00477-022-02219-w.
Ansell DB, Dicks C, Guenette T, Moller R, Santner, W (2001) A Review of the Problems Posed By Spills of Heavy Fuel Oils. International Oil Spill Conference Proceedings 2001 (March). https://doi.org/10.7901/2169-3358-2001-1-591
Araújo ME, Ramalho CWN, Melo PW (2020) Artisanal Fishers, Consumers and the Environment: Immediate Consequences of the Oil Spill in Pernambuco, Northeast Brazil. Cadernos De Saude Publica 36 (1): e00230319. https://doi.org/10.1590/0102-311X00230319
Beyer, JHC, Trannum TB, Hodson PV, Collier TK (2016) Environmental Effects of the Deepwater Horizon Oil Spill: A Review. Marine Pollution Bulletin 110 (1): 28–51. https://doi.org/10.1016/j.marpolbul.2016.06.027.
Breiman L (2020) Random Forests. SpringerLink Accessed 8 May 2021. https://link.springer.com/article/10.1023/A:1010933404324
Brekke C, Solberg AHS (2005) Oil Spill Detection by Satellite Remote Sensing. Remote Sensing of Environment 95 (1): 1–13. https://doi.org/10.1016/j.rse.2004.11.015
Briggs IL, Chidinma BB (2018) Petroleum Industry Activities and Human Health’. In The Political Ecology of Oil and Gas Activities in the Nigerian Aquatic Ecosystem. Academic Press. https://doi.org/10.1016/B978-0-12-809399-3.00010-0
Bro R, Age KS (2014) Principal Component Analysis. Analytical Methods 6 (9): 2812–31. https://doi.org/10.1039/C3AY41907J
Bubbico R, Lee S, Moscati D, Paltrinieri N (2020). Dynamic Assessment of Safety Barriers Preventing Escalation in Offshore Oil&Gas. Safety Science 319–30. https://doi.org/10.1016/j.ssci.2019.09.011
Campbell C, Ying Y (2011) Learning with Support Vector Machines. Synthesis Lectures on Artificial Intelligence and Machine Learning 5 (1): 1–95. https://doi.org/10.2200/S00324ED1V01Y201102AIM010
Cantorna D, Dafonte C, Iglesias A, Arcay B (2019) Oil Spill Segmentation in SAR Images Using Convolutional Neural Networks. A Comparative Analysis with Clustering and Logistic Regression Algorithms. Applied Soft Computing 105716. https://doi.org/10.1016/j.asoc.2019.105716
Chatlani, N, Soraghan JJ (2010) Local Binary Patterns for 1-D Signal Processing. European Signal Processing Conference
Chen G, Li Y, Sun G, Zhang Y (2017). Polarimetric SAR Oil Spill Detection Based on Deep Networks. IEEE International Conference on Imaging Systems and Techniques (IST. https://doi.org/10.1109/IST.2017.8261559
Chen, L, Zhu Y, Papandreou G, Schroff F, Adam H. (2018). Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation. In Computer Vision – ECCV Lecture Notes in Computer Science. https://doi.org/10.1007/978-3-030-01234-2_49
Chen T, Guestrin C (2016) XGBoost: A Scalable Tree Boosting System. International Conference on Knowledge Discovery and Data Mining. https://doi.org/10.1145/2939672.2939785
Chu Y, Yue X, Yu L, Sergei M, Wang Z. 2020 Automatic Image Captioning Based on ResNet50 and LSTM with Soft Attention. Wireless Communications and Mobile Computing. https://doi.org/10.1155/2020/8909458
Conceição MRA, Mendonça LFF, Carlos Lentini CAD, Lima ATC, Lopes JM, Vasconcelos RN, Gouveia MB, Porsani MJ. (2021) SAR Oil Spill Detection System through Random Forest Classifiers. Remote Sensing. https://doi.org/10.3390/rs13112044
D’Andrea MA, Reddy GK (2018) The Development of Long-Term Adverse Health Effects in Oil Spill Cleanup Workers of the Deepwater Horizon Offshore Drilling Rig Disaster. Frontiers in Public Health 6. https://doi.org/10.3389/fpubh.2018.00117
Davoudi R, Miller GR, Kutz JN (2021) Structural Load Estimation Using Machine Vision and Surface Crack Patterns for Shear-Critical RC Beams and Slabs. Journal of Computing in Civil Engineering. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000766
Kerf T, Gladines J, Sels S, Vanlanduit S (2020) Oil Spill Detection Using Machine Learning and Infrared Images. Remote Sensing 4090. https://doi.org/10.3390/rs12244090
Çinar A, Yildirnm M, Eroglu Y (2020) Classification of Pneumonia Cell Images Using Improved ResNet50 Model. International Information and Engineering Technology Association.
https://doi.org/10.18280/ts.380117
Elpeltagy M, Sallam H (2021) Automatic Prediction of COVID− 19 from Chest Images Using Modified ResNet50. Multimedia Tools and Applications 26451–63. https://doi.org/10.1007/s11042-021-10783-6
Ferraro F, Koutalonis I, Vallianatos F, Agosta F (2019) Application of Non-Extensive Statistical Physics on the Particle Size Distribution in Natural Carbonate Fault Rocks. Tectonophysics 228219. https://doi.org/10.1016/j.tecto.2019.228219
Firth D (1993) Bias Reduction of Maximum Likelihood Estimates. Biometrika. https://doi.org/10.2307/2336755
Gallego AJ, Gil P, Pertusa A, Fisher R (2018) Segmentation of Oil Spills on Side-Looking Airborne Radar Imagery with Autoencoders. Sensors 18 797. https://doi.org/10.3390/s18030797
Ghahremani, M, Liu Y, Tiddeman B (2021). FFD: Fast Feature Detector. IEEE Transactions on Image Processing 30: 1153–68. https://doi.org/10.1109/TIP.2020.3042057
Guo Z, Zhang L, Zhang D (2010) A Completed Modeling of Local Binary Pattern Operator for Texture Classification. IEEE Transactions on Image Processing : A Publication of the IEEE Signal Processing Society 1657–63. https://doi.org/10.1109/TIP.2010.2044957
Guyon I, Gunn S, Nikravesh M, Zadeh LA (2006) Feature Extraction: Foundations and Applications. New York: Springer
Haralick R, Shanmugam K, Dinstein I (1973) Textural Features for Image Classification. IEEE Trans Syst Man Cybern SMC-3 610–21
Ramchoun H, Idrissi MAJ, Ettaouil M (2021) Multilayer Perceptron: Architecture Optimization and Training. International Journal of Interactive Multimedia and Artificial Intelligence. https://doi.org/10.9781/ijimai.2016.415
He K, Zhang X, Ren S, Sun J (2016) Deep Residual Learning for Image Recognition. Computer Vision and Pattern Recognition 770–78. https://doi.org/10.1109/CVPR.2016.90
Houam L, Hafiane A, Boukrouche A, Lespessailles R, Jennane R (2014) One Dimensional Local Binary Pattern for Bone Texture Characterization. Pattern Analysis and Applications 179–93. https://doi.org/10.1007/s10044-012-0288-4
Huang MQ, Ninić J, Zhang QB (2021) BIM, Machine Learning and Computer Vision Techniques in Underground Construction: Current Status and Future Perspectives. Tunnelling and Underground Space Technology 103677. https://doi.org/10.1016/j.tust.2020.103677
Huang X, Zhang B, Perrie W, Lu Y, Wang C. 2022. A Novel Deep Learning Method for Marine Oil Spill Detection from Satellite Synthetic Aperture Radar Imagery. Marine Pollution Bulletin 179. https://doi.org/10.1016/j.marpolbul.2022.113666.
Huz R, Lastra M, López J (2018) Other Environmental Health Issues: Oil Spill. https://doi.org/10.1016/B978-0-12-409548-9.11156-X
Jiao Z, Jia C, Cai Y (2018) A New Approach to Oil Spill Detection That Combines Deep Learning with Unmanned Aerial Vehicles. Computers & Industrial Engineering. https://doi.org/10.1016/j.cie.2018.11.008
Ketkar N (2017) Introduction to Keras. In Deep Learning with Python: A Hands-on Introduction, edited by Nikhil Ketkar, 97–111. https://doi.org/10.1007/978-1-4842-2766-4_7
Kılıç K, Boyac IH, Köksel H, Küsmenoğlu I (2007) A Classification System for Beans Using Computer Vision System and Artificial Neural Networks. Journal of Food Engineering 897–904. https://doi.org/10.1016/j.jfoodeng.2005.11.030
Krata P, Jachowski (2020) Towards a Modification of a Regulatory Framework Aiming at Bunker Oil Spill Prevention from Ships – A Design Aspect of Bunker Tanks Vents Location Guided by CFD Simulations’. Reliability Engineering & System Safety 107370. https://doi.org/10.1016/j.ress.2020.107370
Krestenitis M, Orfanidis G, Ioannidis K, Avgerinakis K, Vrochidis S, Kompatsiaris I. (2019a) Early Identification of Oil Spills in Satellite Images Using Deep CNNs. https://doi.org/10.1007/978-3-030-05710-7_35
Krestenits et al. (2019b) Oil Spill Identification from Satellite Images Using Deep Neural Networks. Remote Sensing 1762. https://doi.org/10.3390/rs11151762
Krizhevsky A, Sutskever I, Hinton G (2012) ImageNet Classification with Deep Convolutional Neural Networks. Neural Information Processing Systems. https://doi.org/10.1145/3065386
Längkvist M, Karlsson L, Loutfi A (2014) A Review of Unsupervised Feature Learning and Deep Learning for Time-Series Modeling. Pattern Recognition Letters 11–24. https://doi.org/10.1016/j.patrec.2014.01.008
Li T, Chan Y, Lun DPK (2021) Improved Multiple-Image-Based Reflection Removal Algorithm Using Deep Neural Networks’. IEEE Transactions on Image Processing. https://doi.org/10.1109/TIP.2020.3031184
Liu D, Zhang D, Song Y, Huang H, Cai W (2021) Panoptic Feature Fusion Net: A Novel Instance Segmentation Paradigm for Biomedical and Biological Images’. IEEE Transactions on Image Processing 30:2045–59. https://doi.org/10.1109/TIP.2021.3050668
Liu F, Tang Z, Tang J (2013) WLBP: Weber Local Binary Pattern for Local Image Description. Neurocomputing, Image Feature Detection and Description, 120 325–35. https://doi.org/10.1016/j.neucom.2012.06.061
Liu L, Zhao L, Long Y, Kuang G, Fieguth P (2012) Extended Local Binary Patterns for Texture Classification. Image and Vision Computing 30 (2): 86–99. https://doi.org/10.1016/j.imavis.2012.01.001
Ludescher J, Bunde A (2014) Universal Behavior of the Interoccurrence Times between Losses in Financial Markets: Independence of the Time Resolution. Physical Review. E, Statistical, Nonlinear, and Soft Matter Physics 90: 062809. https://doi.org/10.1103/PhysRevE.90.062809
Mahony N, Campbell S, Carvalho A, Harapanahalli S, Velasco-Hernandez G, Krpalkova L, Riordan D, Walsh J (2020) Deep Learning vs. Traditional Computer Vision. https://doi.org/10.1007/978-3-030-17795-9
Maior CBS., Santana JMM, Lins ID, Moura MJC. (2021) Convolutional Neural Network Model Based on Radiological Images to Support COVID-19 Diagnosis: Evaluating Database Biases. PLOS ONE 16 (3):e0247839. https://doi.org/10.1371/journal.pone.0247839
Malacarne L, Mendes R, Lenzi E (2002) Q-Exponential Distribution in Urban Agglomeration. Physical Review. E, Statistical, Nonlinear, and Soft Matter Physics 65:017106. https://doi.org/10.1103/PhysRevE.65.017106
Marcus G (2018) Deep Learning: A Critical Appraisal. Computer Science. https://doi.org/10.48550/arXiv.1801.00631
Marques RCP, Medeiros FN, Nobre J (2012) SAR Image Segmentation Based on Level Set Approach and GA0 Model. IEEE Transactions on Pattern Analysis and Machine Intelligence 34:046–2057. https://doi.org/10.1109/TPAMI.2011.274
Mera D, Bolon-Canedo V, Cotos JM, Alonso-Betanzos A (2017) On the Use of Feature Selection to Improve the Detection of Sea Oil Spills in SAR Images. Computers & Geosciences 100:166–78. https://doi.org/10.1016/j.cageo.2016.12.013
Moura NVA, Carvalho OLF, Gomes RAT, Guimarães RF, Carvalho Júnior AO (2022) Deep-Water Oil-Spill Monitoring and Recurrence Analysis in the Brazilian Territory Using Sentinel-1 Time Series and Deep Learning. International Journal of Applied Earth Observation and Geoinformation 107. https://doi.org/10.1016/j.jag.2022.102695.
Mukti IZ, Biswas D (2019) Transfer Learning Based Plant Diseases Detection Using ResNet50. International Conference on Electrical Information and Communication Technology (EICT) 1–6. https://doi.org/10.1109/EICT48899.2019.9068805
Murala S, Maheshwari RP, Raman B (2012) Local Tetra Patterns: A New Feature Descriptor for Content-Based Image Retrieval. IEEE Transactions on Image Processing : A Publication of the IEEE Signal Processing Society 21:2874–86. https://doi.org/10.1109/TIP.2012.2188809
Murphy K, Torralba A, Eaton D, Freeman W (2006) Object Detection and Localization Using Local and Global Features. Toward Category-Level Object Recognition. https://doi.org/10.1007/11957959_20.
Negreiros ACSV, Lins ID, Maior CBS, Moura MJC (2022) Oil Spills Characteristics, Detection, and Recovery Methods: A Systematic Risk-Based View. Journal of Loss Prevention in the Process Industries.https://doi.org/10.1016/j.jlp.2022.104912.
Negreiros ACSV, Lins ID, Moura MJC, Droguett EL (2020) Reliability Data Analysis of Systems in the Wear-out Phase Using a (Corrected) q-Exponential Likelihood. Reliability Engineering & System Safety 197:106787. https://doi.org/10.1016/j.ress.2019.106787
Nelder JA, Mead R (1965) A Simplex Method for Function Minimization. The Computer Journal 7 (4) 308–13. https://doi.org/10.1093/comjnl/7.4.308.
Nikan S, Osch KV, Bartling M, Allen DG, S. Rohani A, Connors B, Agrawal SK, Ladak HM (2021) PWD-3DNet: A Deep Learning-Based Fully-Automated Segmentation of Multiple Structures on Temporal Bone CT Scans. IEEE Transactions on Image Processing 30:739–53. https://doi.org/10.1109/TIP.2020.3038363
Ojala T, Pietikainen M, Maenpaa T (2002) Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence 24 (7):971–87. https://doi.org/10.1109/TPAMI.2002.1017623
Ojala T, Pietikäinen M, Harwood D (1996) A Comparative Study of Texture Measures with Classification Based on Featured Distributions. Pattern Recognition 29 (1):51–59
Öztürk S, Akdemir B (2018) Application of Feature Extraction and Classification Methods for Histopathological Image Using GLCM, LBP, LBGLCM, GLRLM and SFTA. Procedia Computer Science International Conference on Computational Intelligence and Data Science, 132: 40–46. https://doi.org/10.1016/j.procs.2018.05.057
Pianto DM, Cribari-Neto F (2011) Dealing with Monotone Likelihood in a Model for Speckled Data. Computational Statistics & Data Analysis 55 (3):1394–1409. https://doi.org/10.1016/j.csda.2010.09.029
Picoli S, Mendes RS, Malacarne LC (2003) Q-Exponential, Weibull, and q-Weibull Distributions: An Empirical Analysis. Physica A: Statistical Mechanics and Its Applications 324 (3):678–88. https://doi.org/10.1016/S0378-4371(03)00071-2
Pregibon D (1981) Logistic Regression Diagnostics. The Annals of Statistics 9 (4):705–24. https://doi.org/10.1214/aos/1176345513
Ribeiro LC, Souza K, Domingues E, Magalhaes A (2020) Blue Water Turns Black: Economic Impact of Oil Spill on Tourism and Fishing in Brazilian Northeast. Current Issues in Tourism 24. https://doi.org/10.1080/13683500.2020.1760222
Røed W, Bjerga T (2017) Holistic Understanding and Clarification of Environmental Safety Barriers in the Oil and Gas Industry. https://doi.org/10.1201/9781315210469-164
Sales Filho R, Droguett E, Lins I, Moura M, Amiri M, Azevedo R (2016) Stress-Strength Reliability Analysis with Extreme Values Based on q -Exponential Distribution: Stress-Strength Reliability and q -Exponential Distribution. Quality and Reliability Engineering International. https://doi.org/10.1002/qre.2020
Sastry SS, Kumari TV, Rao CN,Mallika K, Lakshminarayana S, Tiong HS (2012) Transition Temperatures of Thermotropic Liquid Crystals from the Local Binary Gray Level Cooccurrence Matrix. Advances in Condensed Matter Physics 2012: e527065. https://doi.org/10.1155/2012/527065
Schneider KA (2018) Large and Finite Sample Properties of a Maximum-Likelihood Estimator for Multiplicity of Infection. PLOS ONE 13 (4):e0194148. https://doi.org/10.1371/journal.pone.0194148
Sergio R, Wachs-Lopes PG, Santos RM, Coltri E, Giraldi GA (2019) A Q-Extension of Sigmoid Functions and the Application for Enhancement of Ultrasound Images. Entropy 21 (4):430. https://doi.org/10.3390/e21040430
Shabbir A, Ali N, Ahmed J, Zafar B, Rasheed A, Sajid M, Ahmed A, Dar SH (2021) Satellite and Scene Image Classification Based on Transfer Learning and Fine Tuning of ResNet50. Mathematical Problems in Engineering. https://doi.org/10.1155/2021/5843816
Sharma AK, Nandal A, Dhaka A, Koundal D, Bogatinoska DC, Alyami H (2022) Enhanced Watershed Segmentation Algorithm-Based Modified ResNet50 Model for Brain Tumor Detection. BioMed Research International. https://doi.org/10.1155/2022/7348344
Shu Y, Li J, Yousif H, Gomes G (2010) Dark-Spot Detection from SAR Intensity Imagery with Spatial Density Thresholding for Oil-Spill Monitoring. Remote Sensing of Environment 114 (9):2026–35. https://doi.org/10.1016/j.rse.2010.04.009
Singh H, Bhardwaj N, Arya SK, Khatri M (2020) Environmental Impacts of Oil Spills and Their Remediation by Magnetic Nanomaterials. Environmental Nanotechnology, Monitoring & Management 14:100305. https://doi.org/10.1016/j.enmm.2020.100305
Singha STB, Trieschmann O (2013) Satellite Oil Spill Detection Using Artificial Neural Networks. Selected Topics in Applied Earth Observations and Remote Sensing. https://doi.org/10.1109/JSTARS.2013.2251864
Suruliandi A, Meena K, Rose R (2012) Local Binary Pattern and Its Derivatives for Face Recognition. Computer Vision. https://doi.org/10.1049/iet-cvi.2011.0228.
Tharwat A (2020) Classification Assessment Methods. Applied Computing and Informatics 17 (1):168–92. https://doi.org/10.1016/j.aci.2018.08.003
Topouzelis K (2008) Oil Spill Detection by SAR Images: Dark Formation Detection, Feature Extraction and Classification Algorithms. Sensors. https://doi.org/10.3390/s8106642
Tsallis C (1988) Possible Generalization of Boltzmann-Gibbs Statistics. Journal of Statistical Physics 52 (1):479–87. https://doi.org/10.1007/BF01016429
Vasconcelos RN, Lima ATC, Lentini CAD, Miranda GV, Mendonça LF, Silva MA, Cambuí ECB, Lopes JM, Porsani MJ (2020) Oil Spill Detection and Mapping: A 50-Year Bibliometric Analysis. Remote Sensing 12 (21):3647. https://doi.org/10.3390/rs12213647
Wang G, Zhang P, Ren G, Xi K (2021) Texture Feature Extraction Method Fused with LBP and GLCM.
Webler T, Lord F (2010) Planning for the Human Dimensions of Oil Spills and Spill Response. Environmental Management 45:723–38. https://doi.org/10.1007/s00267-010-9447-9
Xiao X, Chen Y, Gong Y, Zhou Y (2021) Low-Rank Preserving t-Linear Projection for Robust Image Feature Extraction. IEEE Transactions on Image Processing 30:108–20. https://doi.org/10.1109/TIP.2020.3031813
Xu J, Wang H, Cui C, Zhao B, Li B (2020) Oil Spill Monitoring of Shipborne Radar Image Features Using SVM and Local Adaptive Threshold. Algorithms 13 (3): 69. https://doi.org/10.3390/a13030069
Yang P, Dong C, Zhao X, Chen X (2020) The Surface Damage Identifications of Wind Turbine Blades Based on ResNet50 Algorithm. Chinese Control Conference (CCC). https://doi.org/10.23919/CCC50068.2020.9189408
Zeng K, Wang Y (2020) A Deep Convolutional Neural Network for Oil Spill Detection from Spaceborne SAR Images. Remote Sensing 12 (6):1015. https://doi.org/10.3390/rs12061015
Zhang C, Liu A, Liu X, Xu Y, Yu H, Ma Y, Li T (2021) Interpreting and Improving Adversarial Robustness of Deep Neural Networks With Neuron Sensitivity. IEEE Transactions on Image Processing 30:1291–1304. https://doi.org/10.1109/TIP.2020.3042083

No competing interests reported.

Download PDF

Version 1

posted

You are reading this latest preprint version

Configuration 1
Dataset size
Model	200	400	1002	1572
MLP	0.6333 \(\pm\) 0.0755	0.5657 \(\pm\) 0.0355	0.5000 \(\pm\) 0.0000	0.6679 \(\pm\) 0.0284
RF	0.6979 \(\pm\) 0.0173	0.6834 \(\pm\) 0.0273	0.5197 \(\pm\) 0.0237	0.6544 \(\pm\) 0.0379
SVM	0.6934 \(\pm\) 0.0037	0.7022 \(\pm\) 0.0094	0.5043 \(\pm\) 0.0137	0.7468 \(\pm\) 0.0065
LR	0.6240 \(\pm\) 0.0256	0.4804 \(\pm\) 0.0260	0.5278 \(\pm\) 0.0149	0.6263 \(\pm\) 0.0125
XGB	0.6800 \(\pm\) 0.0340	0.6724 \(\pm\) 0.0204	0.7348 \(\pm\) 0.0292	0.7399 \(\pm\) 0.0123
Configuration 2
Dataset size
Model	200	400	1002	1572
MLP	0.6146 \(\pm\) 0.0601	0.5980 \(\pm\) 0.0583	0.5000 \(\pm\) 0.0000	0.6298 \(\pm\) 0.0230
RF	0.6635 \(\pm\) 0.0260	0.6893 \(\pm\) 0.0265	0.4984 \(\pm\) 0.0208	0.5649 \(\pm\) 0.0986
SVM	0.6652 \(\pm\) 0.0506	0.7018 \(\pm\) 0.0101	0.5178 \(\pm\) 0.0566	0.7069 \(\pm\) 0.0150
LR	0.6010 \(\pm\) 0.0265	0.5506 \(\pm\) 0.0358	0.5319 \(\pm\) 0.0000	0.6360 \(\pm\) 0.0122
XGB	0.6457 \(\pm\) 0.0239	0.6684 \(\pm\) 0.0423	0.6418 \(\pm\) 0.0505	0.6821 \(\pm\) 0.0205

Automated detection of oil spills in images: combining a novel feature extraction technique based on the q- Exponential distribution with machine learning models

Status:

Version 1

Abstract

Figures

1 Introduction

2 The Q-exponential Distribution

3 The Proposed Q-exponential Feature Extraction Method

4 Classical Cv Models And Dl Architectures

5 Oil Spill Sar Image Dataset

6 Experiments Results

7 Conclusion

Declarations

References

Additional Declarations

Status:

Version 1

Feature	Description	Formulation
Contrast	Intensity contrast between a pixel and its neighbor over the whole image.	\(\sum _{i,j}{\left\|i-j\right\|}^{2}p\left(i,j\right)\)
Correlation	Correlation between a pixel and its neighbor over the whole image.	\(\sum _{i,j}\frac{\left(i-\mu i\right)\left(j-\mu j\right)p\left(i,j\right)}{{\sigma }_{i}{\sigma }_{j}}\)
Energy	Sum of squared elements in the GLCM.	\(\sum _{i,j}{p\left(i,j\right)}^{2}\)
Homogeneity	The closeness between the distribution of elements in GLCM and its diagonal.	\(\sum _{i,j}\frac{p\left(i,j\right)}{1+\left\|i-j\right\|}\)
Thresh Out	The relative proportion of the oil spill objects and non-oil spill objects.	The threshold value for the Sobel edge detection method
Entropy	A statistical measure of randomness.	\(-\sum _{p}q{{log}}_{2}q\)
Local variance	The average local standard deviation of 3 × 3 neighborhood around each pixel in the image.	\(stdfilt\) function in MATLAB divided by the number of pixels in the image
Standard Deviation	The standard deviation of all values.	\(\sqrt{\frac{\sum _{i,j}{\left(p-\stackrel{-}{p}\right)}^{2}}{n}}\)

	Description	Feature vector size
Configuration 1	\(q=p=64\) \(\varDelta =1\) \(n=4\)	3721
Configuration 2	\(q=p=256\) \(\varDelta =4\) \(n=4\)	4096