Prediction and detection of localised corrosion attack of stainless steel in biogas production: Machine Learning Classification Approach

doi:10.21203/rs.3.rs-3322058/v1

Download PDF

Research Article

Prediction and detection of localised corrosion attack of stainless steel in biogas production: Machine Learning Classification Approach

https://doi.org/10.21203/rs.3.rs-3322058/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Biogas contributes to environmental protection by reducing greenhouse gas emissions and promoting the recycling of organic waste. Its utilization plays a crucial role in addressing the challenges of climate change and sustainability. However, the deterioration of process plants involved in biogas production due to corrosion has a critical impact on the safety and durability of their operations. In order to maintain safety of structures in service life with respect to corrosion, it is essential to develop effective corrosion engineering control methods. The electrochemical techniques have become a useful tool to evaluate corrosion resistance. However, these techniques may require microscopic analysis of the material surface and the analysis may be influenced by subjective factors. To solve this drawback, this work proposes the use of SVM models to predict corrosion status of the material used in biogas production with no need to perform microscopic analysis after the electrochemical test. The obtained results of sensitivity and specificity equal to 0.94 and 0.97, respectively, revealed the utility of the proposed stochastic models to assure the corrosion state of the equipment involved in biogas production. SVM-based models become an effective alternative to evaluate material durability accurately.

localised corrosion

biogas

stainless steel

machine learning

The results show that the proposed model is a useful tool to predict the behaviour of stainless steel against corrosion according to the environmental conditions to which the material is exposed in biogas production.

The uncontrolled emissions of greenhouse gas (GHG) generated by agricultural activities have influence on technical, economic and social factors that can affect the sustainability of these activities (Nzila et al. 2012). With the aim to achieve the goals of Paris Agreement and to mitigate the impact of climate change, EU has proposed a reduction in GHG emissions by 40% in 2030 compared to 1990 (EU 2018). In addition, the Sustainable Development Goals (SDGs) are set by the United Nations to reduce the worldwide waste generation by 2030 (Khan and Kabir 2020). One option to reduce the global warming and transform the pollutant waste into a valuable resource is through the production of biogas. This process involves the conversion of biodegradable material, in the absence of oxygen, to biogas (ca. 55%vol CH₄, 45%vol CO₂) as a result of different microbial processes (O’Shea et al. 2020). Biodigesters are considered as friendly technology environmentally and suitable for a decentralized application, easy to construct and operate. Most sustainability studies related to biogas and other renewable energy systems focus on limited aspects such as feedstock sustainability or energy efficiency (Kan et al. 2018). However, it naturally follows that any evaluation of the durability of the systems used in the production process would be an important point to analyse. Like most industrial assets, some elements of a biogas plant can be affected by corrosion since the oxidation and fermentation processes in digesters, whether concrete or steel, provide a perfect environment for corrosion. In order to maintain safety of structures in service life with respect to corrosion in biogas production, monitoring and design are important measures (Alzbutas et al. 2014; Li and Xiao 2021).

The deterioration of metals and alloys as the result of their interactions with the surrounding environment is called corrosion. Corrosion is a worldwide crucial problem that strongly affects natural and industrial environments. This degradation may cause weakness in the material due to the reduction of its area, changes in the crystalline structure and reduce its strength leading to toxic substance leakage, equipment structure failure or even explosion accidents (Thodi et al. 2009; Foorginezhad et al. 2021). Although biogas technology is quite mature, the problems related to corrosion during its production is a topic that has not been well documented. In biogas production, the biodigesters are exposed to direct contact with a wide range of aggressive products that may cause damage on their durability (Li et al. 2019). Therefore, in order to prevent failures in the system, it is necessary to protect the biodigesters from corrosion attack, reducing the maintenance tasks at the same time.

There is no doubt that concrete is one of the most widely materials used for tanks involved in biogas production. However, the large-scale construction of facilities made of concrete requires the use of very heavy machines. Furthermore, even when special coating is used, the prevention of corrosion in these concrete tanks is really complicated (Estoup and Cabrillac 1997). In this context, stainless steel has become a suitable option for the manufacture of biodigesters. Stainless steel has excellent properties that make it a suitable option in a wide variety of applications. It is characterized by its durability without maintenance requirements. In addition, it can be recycled at the end of its service life. The advantageous properties compared to standard carbon mild steel are (Singh 2020): higher corrosion resistance and general durability in a wide variety of media, good mechanical strength at high temperature, higher ductility and ease implantation, higher hardness and a more attractive long-lasting appearance, among others. The combination of these features has led to the extensive use of stainless steel in many applications: agriculture, transport, building, chemical and petrochemical engineering (Oberndorfer et al. 2004; Lo et al. 2009). The different grades of commercially available stainless steel are mainly ferritic, austenitic, martensitic, duplex and heat resisting austenitic stainless steel (Cadoni et al. 2012). The choice of a certain grade of stainless steel is based on different factors such as functional requirements and cost effectiveness. Therefore, the appropriate selection of the chemical composition of the stainless steel used in biodigesters is considered as a critical aspect for the durability of the system.

As shown previously, stainless steel offers excellent corrosion resistance in many different environments. This property is explained based on the formation of a passive film on the surface of stainless steel. This layer is formed spontaneously when chromium, presented in its composition, reacts with oxygen in the air. This film acts as a barrier protecting the material from the corrosive environment. In order to assure the formation of the passive layer, the stainless steel must contain a minimum amount of chromium of 10.5%. However, with this chromium content, the passive layer can be damaged when the material is exposed to aggressive environments. Under these conditions, the passive layer may be broken leading to corrosion damage. Corrosion can be presented in different forms: it can be concentrated in small areas leading to the formation of pit or crack (localised corrosion) or it can be extended over the entire material surface (uniform corrosion) (Mori and Bauernfeind 2004). Although uniform corrosion leads to the greater proportion of metal deterioration, at least, it is predictable, so that allowance can be made for it in the design of a structure (Shreir 2013). On contrary, localised corrosion is difficult to detect and this type of attack is less predictable, particularly for pitting in which the location of pit on the material surface and its distribution and size will depend on different factors such as environmental conditions and the structure of the metal. For these reasons, as one of the most universal corrosion forms, pitting corrosion has posed a great threat to the industrial production and economy (Shekari et al. 2017; Subramanian 2018).

With the rapid development of biogas production, the problems related to corrosion have become increasingly important, making the corrosion management a crucial factor to ensure process safety. For the industrial sector, the global cost of corrosion was estimated to be 3.4% of the global GDP (Hays 2010). It is estimated that savings of between 15 and 35% of the cost of corrosion could be realized by using available corrosion control practices (Bhaskaran et al. 2005).. The prediction of corrosion behaviour, as the preliminary stage of equipment corrosion management is a crucial step to ensure the proper corrosion control and the safety protection of the material (Vrignat et al. 2022). However, the highly nonlinear nature of corrosion in addition to the lack of theoretical basis for some corrosion phenomena in certain environments have promoted the development of models based on artificial intelligence techniques applied to the study corrosion phenomenon (Chou et al. 2017; Jia et al. 2022; Yang et al. 2023).

During last years, artificial intelligence techniques have been applied to model complex real problems based on experimental data (Soares et al. 2016; Chang et al. 2020; Leong et al. 2020; Melo et al. 2020; Sacks et al. 2020; Diao et al. 2021; Le et al. 2021; Li et al. 2021; Bao et al. 2022; Sun et al. 2022; Zuben and Viana 2022; Forkan et al. 2022). Scientists have been making great efforts to get effective tools to model corrosion behaviour. Among the different artificial intelligence approaches that can be found in the literature, one of the best known is Support Vector Machines (SVMs). These techniques have been proposed by many authors for structural reliability analysis (Okabe and Otsuka 2021; Pepper et al. 2022). However, research about corrosion modelling using SVMs is few. Wen et al. (Wen et al. 2009) proposed a SVM model to predict corrosion rate of carbon steel in different sea water conditions. Hatamy et al. (Hatami et al. 2016) applied SVM technique to predict CO₂ corrosion rate in oil and gas industries considering different parameters such as CO₂ partial pressure, temperature, pH and flow velocity. Chou et al. (Chou et al. 2017) applied SVM to predict pitting corrosion risk of steel reinforced concrete and marine corrosion rate of carbon steel. Lv et al. (Lv et al. 2020) used SVM to calculate different cross-section digitalization parameters to predict the sectional corrosion rate of steel and Seghier et al. (El Amine Ben Seghier et al. 2020) proposed SVM model to model pitting corrosion attack in oil and gas pipelines. Despite this technique has been used by some authors to achieve a deep knowledge about the corrosion behaviour of different materials, scarce literature has been found for those environments related to biogas production.

In this study, according to the experience of the authors and the good results obtained with SVM models developed in previous studies (Jiménez-Come et al. 2014, 2015, 2019), the use of SVM was proposed to develop a stochastic model capable of predicting the behaviour against localized corrosion of different types of stainless steels involved in biogas production. Compared with the manual classification of corrosion resistance of the different materials carried out from electrochemical tests and microscopic analysis, the classification speed resulting from SVM technology is faster and it can reduce the impact of subjectivity on the results. The proposed models become a useful tool in the design of biodigester, since they predict corrosion behaviour of stainless steel accurately depending on the environmental conditions to which the material will be exposed in biogas production. The developed models represent a great advance in this field since they not only allow economic savings related to maintenance tasks, but also improve the durability of the equipment involved in biogas production, avoiding problems of pollution, leaks, and product loss.

As it has been introduced previously, with the development of computer technology, machine learning technology has been adopted for classification and recognition problems. In this case, SVM technique was proposed to model localised corrosion behaviour of different grades of stainless steel used for equipment manufacturing involved in biogas production. The model tried to predict the corrosion status of the material based on electrochemical tests with no need to perform microscopic analysis. The compositions of different grades of stainless steel considered in this study are collected in the following table:

Table 1

Stainless steel grade composition for experimental tests.
Stainless Steel grade	Cr (%)	Mo(%)	N(%)	Mn(%)	Ti(%)	Nb(%)
EN 1.4404	16.71	2.018	0.038	1.291	0.016	0.016
EN 1.4462	22.892	3.279	0.1619	1.397	0.034	0.011
EN 1.4482	19.94	0.223	0.1374	4.016	0.025	0.006
EN 1.4003	11.45	0.01	0.0115	0.547	0.015	0.006
EN 1.4571	16.801	2.066	0.0134	1.578	0.301	0.008
EN 1.4509	18.492	0.051	0.0235	0.43	0.148	0.393
EN 1.4521	18.555	1.999	0.024	0.507	0.127	0.416
EN 1.4318	17.744	0.089	0.144	1.258	0.002	0.007

The main objective of this study was to predict the corrosion behaviour of the material according to the environmental conditions to which it may be exposed in biogas production in order to minimize the impact of corrosion on this material and to prevent equipment durability problems. For this reason, the experiments were carried out under different conditions by using artificial solutions (AS) simulating biogas environments. The compositions of the artificial solutions tested in this study are collected in the following table:

Table 2

Artificial solution conditions.
Reagent	g/L	Artificial Solution 1	Artificial Solution 2
Sodium Sulphate, Na₂SO₃	12.67	X	X
Ammonium Chloride, NH₄Cl	69.81	X	X
Ammonium carbonate, (NH4)₂CO₃	15.13	X	X
Sodium Acetate Trihydrate, NaCH₃COOH 3H₂O	17.06	X	X
Hydrogen chloride, HCl, from FeCl₂ desulfuration	7.31	--	X
pH range		8.2–8.5	6.6–7.2

The experimental data set was obtained by means of electrochemical tests in order to evaluate the susceptibility of different grades of stainless steel to suffer localised attack in biogas environments. Two types of localised corrosion were analysed since they are the most common attacks in these environments: pitting and crevice corrosion.

The electrochemical tests were based on ASTM standards G3-94 “Conventions Applicable to Electrochemical Measurements in corrosion testing” and G5-94 “Making potentiostatic and Potentiodynamic Anodic Polarization Measurements” (International and Materials 2004). In this application, a variable potential was applied to the sample under study, located in the cell shown in Fig. 1, and the current density of the system was measured for each potential value. The potential was represented versus the current density to obtain the polarization curve for the sample, providing information about the behaviour of the material under the tested conditions. According to the polarization curve, a parameter called “breakdown potential” can be defined as an indicator of the susceptibility to the initiation of localized corrosion. This parameter is defined by the potential where an abrupt increase in the anodic current density is observed. Many authors have set that this condition can be evaluated as the potential at which the current density exceeds 100 µA/cm² (Deng et al. 2008). However, there are materials with high corrosion resistance and for these cases, the transpassive region can be reached and the breakdown of the passive layer is not caused by localised corrosion. Therefore, it is necessary to analyse the material surface microscopically after electrochemical test in order to determine if there is pit or crevice on the surface of the material, or on contrary, the material is resistant to localised corrosion. In this experimental work, according to the microscopic analysis, when any evidence of localised corrosion was observed on the surface, the sample was defined as a corrosion pattern. With the aim to ensure reproducibility of the experimental results, each condition tested was repeated three times. A total of 520 groups of experimental data set was recorded. Table 3 shows the detailed information of the experimental conditions.

Table 3

Details of experimental conditions analysed for electrochemical tests.
Experimental conditions	Parameters
Material	Stainless steel
Grades	8 different grades of stainless steel
Solutions	Two types: AS1 / AS2
Temperature	35°C (mesophilic) / 50°C (thermophilic)
Number of tests per condition	3
Electrochemical test	Pitting / crevice corrosion tests
Surface finish	No polishing / #600# grain polishing

According to the type of localised corrosion studied, pitting or crevice corrosion, the electrochemical cell was prepared with different configurations. When pitting corrosion test was carried out, the crevice formation was avoided by using the inner and outer o-rings, as it is shown in Fig. 1. For pitting corrosion test, a paper filter was located between the o-rings to carry the flow of water into the cell. For crevice corrosion test, the system contained an inner o-ring in contact with the sample to be analysed promoting the formation of crevice.

Based on the electrochemical test, each sample was defined according to the following features: chemical composition of the alloy, properties of the artificial solution that simulated biogas environment (type of artificial solution and temperature), material surface finish (according to the type of polishing used for the sample: no polishing or #600# grain polishing) and the type of localised corrosion analysed (pitting or crevice corrosion). All these features, in addition to the breakdown potential evaluated from the polarization tests, were considered as inputs for the proposed SVM model whereas the output was defined according to the corrosion status of the sample analysed under microscope (1 for corrosion sample when corrosion attack was observed on the material surface and 0, otherwise). Up to 520 groups of experimental data set was recorded.

The proposed SVM model tried to predict the corrosion status of the material under study for new experimental conditions that had not been previously evaluated. In this case, after subjecting the sample to electrochemical test, the surface status of the sample may be predicted with the model without the need to resort to microscopic analysis. Compared with manual classification, the classification speed by using the proposed model is faster, and it can reduce the impact of subjectivity on the results since many times the detection of pit or crevice on the surface of the material, after the electrochemical test, is a complex task and may depend on the experience and human behaviour.

Support Vector Machine theory were developed by Vapnik (Cortes and Vapnik 1995). This technique has become more popular nowadays due to many attractive features of which one of them is the promising empirical performance. SVMs are based on the structural risk minimization principle that has been demonstrated to be superior to traditional Empirical Risk Minimization principle (ERM) used by artificial neural networks (Vapnik 2000). This fact leads to a great ability shown by SVMs to generalise, since this technique tries to minimize an upper bound of the expected risk instead of focusing on minimizing the error evaluated on training data, that is the specific feature of ERM. SVM can be applied to solve classification and regression problems.

For classification problems, the goal is to find out a function capable of separating the different classes presented to the model. According to Fig. 2.a), there may be different linear hyperplanes that can separate the data from the different classes. However, there is only one classifier that maximizes the margin, that is, the distance between the hyperplane and the nearest patterns of each class. This frontier is called the optimal separating hyperplane. This hyperplane can be used to classify new data.

For binary classification problems, the system can be defined according to the following equation:

$$D=\left\{\left({x}_{1},{y}_{1}\right), \dots ,\left({x}_{l},{y}_{l}\right)\right\} for i=1,\dots , l$$

$$x \in {\mathbb{R}}^{l}, y\in \{-\text{1,1}\}$$

where the hyperplane is defined by:

$$⟨w,x⟩+b=0$$

Soft margin SVM classification model.

In this sense, the set of patterns can be optimally separated when there is a solution that defines the frontier between the different classes without error and providing the maximum distance between the hyperplane and the closest pattern of each class, see Fig. 2.b). For this case, Vapnik defines the optimization problem to be solved as:

$$\text{m}\text{i}\text{n}|⟨w,{x}_{i}⟩+b|=1$$

with the following constraints:

$${y}_{i}[⟨w·{x}_{i}⟩+b]\ge 1 for i=1,\dots ,l$$

The optimization problem is solved considering the Karush–Kuhn–Tucker (KKT) conditions and using Lagrange multipliers (Kuhn and Tucker 1951). Those patterns whose Lagrange multipliers are non-zero are considered as support vectors and they are the points that contain the information to define the optimal hyperplane to solve the classification problem. When the problem is linearly separable, all the support vectors are located on the margin and in this case, the number of support vectors is reduced. However, when the training data is not linearly separable, see Fig. 2.C, additional variables related to misclassification, ξ_i, are introduced in the optimization problem (Vapnik 1999). For non-linearly separable case, the constraints are modified to be defined as follows:

$${y}_{i}[⟨w,{x}_{i}⟩+b]\ge 1-{\xi }_{i} for i=1,\dots ,l$$

Where ξ_i are the slack variables with non-negative values. For this case, the optimal separating hyperplane is determined by:

$${ϕ \left(w,\xi \right)= \frac{1}{2}}^{}{\left|\left|w\right|\right|}^{2}+C{\sum }_{i}{\xi }_{i}$$

Where C is related to the regularization parameter introduced to define a balance between maximizing the margin and minimizing the classification error (Schölkopf and Smola 2002). In addition, when a linear boundary is inappropriate for the problem under study, the input vector can be mapped into a high dimensional feature space by using kernel functions where the SVMs can define an optimal separating hyperplane (Muller et al. 2001). Among acceptable kernel functions to get it, the most frequently used are polynomials (SVM-POL) and radial basis function (SVM-RBF).

Apart from selecting the best kernel function, one of the most critical steps in this classification problems is how to define the regularization parameter, C. In this study, cross validation technique is applied to determine the optimal value of this parameters and those involved in kernel functions: polynomial degree for polynomial function and gamma parameter for radial basis function kernel. A total of 20 repetitions, considering 5-cross validation, were developed to assure a good generalization capability of the proposed SVM model. Finally, the normalization of the original data set can be required for certain Kernel functions due to their restricted domain. For this reason, all the original variables were normalized in the range [-1,1].

In this way, SVM model was proposed to determine the corrosion status of the sample without the need to carry out the microscopic analysis for new conditions that have not been tested before (see Fig. 3). This study can be considered as a classification problem: according to the sample features and the environmental conditions, the proposed SVM model will have the capacity of determining the corrosion resistance of the material under different environmental conditions involved in biogas production.

For the classification problem, the performance of the proposed model can be evaluated according to precision and accuracy indices defined by the following equations:

$$Precision=TPR= \frac{TP}{TP+FP}$$

$$Accuracy= \frac{TP+TN}{P+N}$$

where TP (True Positive) is defined as the number of corrosion patterns that have been classify correctly, TN (True Negative) is the number of no-corrosion patterns that have been classified correctly whereas FP (False Positive) is the number of those no-corrosion patterns that have been misclassified as corrosion patterns and FN (False Negative) is the number of corrosion patterns that have been misclassified as no-corrosion patterns. P and N correspond to the number of corrosion and no-corrosion patterns from the original data, respectively.

In addition, for the classification problem, sensitivity and specificity can be defined by the following equations:

$$sensitivity=TPR= \frac{TP}{TP+FN}$$

$$specificity=1-FPR= \frac{TN}{TN+FP}$$

With the aim to determine the optimal configuration for the proposed SVM model, multiple comparison analysis was applied. In this study, Friedman test was considered as it is usually referred as one of the most important non-parametric test for this type of analysis (Friedman 1940). When the null hypothesis for Friedman’s test is rejected, that is, there is significant difference among the compared models, a post-hoc test is required to carry out pairwise comparisons. In this study, Fisher’s Least Significant Difference (LSD) was applied as a tool to identify the models that were statistically different in order to the determine the optimal configuration (Welling 2005). The applied procedure is represented in Fig. 4 for SVM-RBF model, as an example. In the case of SVM-POL, the procedure is similar by replacing the value of γ by the polynomial degree value.

Once the optimal structure was defined for each configuration: SVM-POL and SVM-RBF, the results were compared with those obtained by using traditional techniques such as the K-nearest neighbours (KNN) (Cover and Hart 1967) and classification tree (Jamain and Hand 2008).

The purpose of this study was to develop SVM models with the ability to determine corrosion status of different grades of stainless steel exposed to biogas environments. The experimental data set was obtained from electrochemical tests in addition to microscopic analysis simulating the environmental conditions involved in biogas production.

The obtained results from the different configurations proposed for SVM models are shown below. In this case, the influence of two different kernel functions was analysed: polynomial (SVM-POL) and radial basis functions (SVM-RBF). Furthermore, the influence of the regularization parameter, C, was considered for both functions with the aim to determine the optimal configuration of the proposed SVM model in this application.

Figure 5 shows the results in terms of precision and accuracy obtained for SVM-POL models. According to the figure, it can be pointed out that these models presented similar behaviour for both indices. In this case, the models that resulted to be the optimal ones for localised corrosion modelling of stainless steel in biogas environments were SVM models considering linear kernel function. For these models, the regularization parameter had no great influence on classification performance. The highest values related to precision and accuracy indices provided by SVM-POL model were 0.94 and 0.93, respectively. These values reflected the capacity of the proposed SVM-POL model to predict the corrosion status of stainless steel after electrochemical tests in biogas environments accurately.

In order to identify the optimal configurations for these models, a statistical procedure for multiple group comparison, considering precision and accuracy values, was applied according to the procedure shown in Fig. 4. Firstly, the optimal values of C for each degree of the polynomial function considered in SVM-POL models were identified. These models are represented in Fig. 6 with red colour. Secondly, the optimal degrees of the polynomial function for each C value were identified. These models are represented in Fig. 6 with blue colour. Comparing the results obtained from these steps, those models that resulted to be identified as optimal models for both steps are represented with grey colour. Finally, these models are subjected to a multiple comparison test with the aim to define the optimal global equivalent configuration for SVM-POL models presented in this study. As a result of the application of this statistical procedure, the optimal configurations of SVM-POL model are represented in Fig. 6 with black colour. These configurations marked with black colour resulted to be equivalent to the model that provided the highest values of precision and accuracy, 0.94 and 0.93, respectively, marked with a cross in each graph. According to Fig. 6, the optimal configurations for SVM-POL model resulted to be the linear kernel function with C = 2⁰, 2¹, 2².

Similarly, the influence of the parameters involved in SVM-RBF model was analysed. For this model, the results in terms of precision and accuracy are represented in Fig. 7 where the influence of C and γ values can be analysed. According to Fig. 7, the highest precision value reached by this model was 1. In this case, it can be observed that higher values of γ provided better results for the classification problem whereas the regularization parameter defined by C did not exhibit the same behaviour. For this case, as this parameter increased, a decrease in the classification performance was observed. This behaviour was different when the performance of the model in terms of accuracy was analysed. For accuracy results, the model provided the best classification performance when γ took intermediate values from the considered range whereas the highest values of C provided the maximum value of accuracy, equal to 0.95.

With the aim to determine the optimal equivalent configurations that provided the best results for SVM-RBF model, the procedure represented in Fig. 4 was applied. The results obtained from the application of multiple comparison tests are collected in Fig. 8.

According to the results collected in Fig. 8, the equivalent configurations that provided the optimal precision value (precision = 1 in Fig. 8.a), resulted to be SVM-RBF model with the following configurations, see Fig. 8.a): (γ = 2⁴ – C = 2⁰), (γ = 2⁵ – C = 2⁰, 2¹, 2², 2⁵), (γ = 2⁶ – C = 2⁰, 2¹, 2²,2³, 2⁴, 2⁷), (γ = 2⁷ – C = 2⁰, 2¹, 2²,2³, 2⁴, 2⁵, 2⁶) and (γ = 2⁸ – C = 2⁰, 2¹, 2²,2³, 2⁴, 2⁵, 2⁶, 2⁷, 2⁸). Related to accuracy results, see Fig. 8.b), the equivalent configurations providing the optimal behaviour (accuracy = 0.952), resulted to be SVM-RBF model with the following pairs of values: (γ = 2² – C = 2³, 2⁴, 2⁵, 2⁶, 2⁷, 2⁸), (γ = 2³ – C = 2⁴, 2⁶, 2⁷), (γ = 2⁴ – C = 2⁵, 2⁶) and (γ = 2⁵ – C = 2⁷).

Related to the results provided by SVM-RBF model, it can be noted that there were different configurations providing the highest precision value. However, these configurations may be different from those configurations identified as the optimal ones when accuracy results were analysed. This behaviour can be explained according to Eq. (7). Based on this equation, for those cases where the models provided a value of precision equal to 1, the false positive term results to be null. This means that the model presented high capacity to determine those patterns that will not suffer localised corrosion. However, this model may not present similar capacity to detect all the patterns that will suffer this attack accurately since no information about false negative patterns is included in precision term. For the optimal configurations of SVM-RBF model in accuracy term, the maximum value reached was 0.952, much lower than the result obtained for precision.

With the aim to determine the optimal configurations for the proposed SVM models: SVM-POL and SVM-RBF and looking for a balance between the capacity to detect the patterns that will suffer corrosion and those that will not suffer corrosion in biogas environments, ROC space is applied.

The ROC space is created by representing the true positive rate (TPR), defined by Eq. (9), versus the false positive rate (FPR), defined by Eq. (10), for each configuration. As it was introduced previously, these measures can be computed from the confusion matrix for each classification model. TPR corresponds to sensitivity whereas FPR is equivalent to 1-specificity. This graphic represents a useful tool to compare the classification performance of different models since in these classification problems, the goal is to identify those models that provide acceptable discriminability between the existing classes: corrosion and no corrosion patterns. This graphic is a two-dimensional graph that provides a tradeoffs between benefits (TP) and costs (FP). Each model can be represented by a single point in ROC space where the point (0,1) represents the perfect classification. In this way, one model represented in ROC space is better than another if it is to the northwest of the graphic (Fawcett 2006).

In the following figure, the optimal configurations identified for SVM models are represented, in addition to those developed model considering traditional classification techniques, such as classification tree (CT) and k-nearest neighbour (kNN) considering three different values for k: 1, 3 and 5.

According to Fig. 9, it can be pointed out that CT provided better results than KNN models. However, CT presented lower efficiency than SVM models. Therefore, the proposed SVM models become an efficient alternative to traditional techniques for this application. Specially, there were some configurations for SVM models that provided excellent specificity results (100%). These models are represented on the Y-axis (False Positive = 0). However, these configurations were not considered as the optimal ones since they did not present high capacity to detect all the corrosion patterns correctly since the number of FN patterns provided by these configurations was high. This is the reason why they are represented far from the upper left corner. For the application considered in this study, with the aim to find a balance between the right classification of patterns that will suffer corrosion and those that will not suffer this attack, the best configuration for SVM model can be defined as the model located nearest the upper left corner. In this case, the optimal configuration for SVM model resulted to be SVM-RBF (C = 2⁶ and γ = 2³) with sensitivity and specificity values of 94.0% and 96.6%, respectively. These values reflected the high capacity of the proposed model to predict the corrosion status of different grades of stainless steel in biogas environments without the need to perform microscopic analysis of the material surface, avoiding subjectivity in the results.

In this study, SVM models were proposed in order to determine the corrosion status of different grades of stainless steel used in biogas production. The classification performance of this technique was compared with traditional techniques such as classification tree and k-nearest neighbour. Two different functions were analysed as kernel functions for SVM models: polynomic and RBF kernel functions. In this case, the maximum values of precision and accuracy resulted to be 100% and 95%, respectively. However, in order to evaluate the prediction performance of the proposed models, the limitations of diagnostic based on these terms required the use of specificity and sensitivity terms. These indices are more meaningful than accuracy and precision and can be used to obtain the ROC space. According to ROC space, it can be concluded that SVM models outperformed those models based on traditional techniques. The optimal configuration for SVM model was SVM-RBF (C = 2⁶ and γ = 2³) that provided values of sensitivity and specificity equal to 0.94 and 0.966, respectively. These results demonstrated the utility of the proposed SVM model to predict corrosion status of stainless steel when localised corrosion (pitting and crevice corrosion) is analysed in biogas environments. The model can be presented as a complementary tool to electrochemical tests since it predicts the corrosion status of different grades of stainless steel, according to the conditions involved in biogas production, with no need to analyse the surface microscopically.

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this work.

Acknowledgement

The authors gratefully acknowledge the financial support provided by the Projects funded by the University of Cadiz (52004195). It has been developed with the support of the European project “Innovative and competitive solutions using stainless steel and adhesive bonding in biogas. (BiogaSS)” developed partly in ACERINOX EUROPA S.A.U.

Alzbutas R, Iešmantas T, Povilaitis M, Vitkute J (2014) Risk and uncertainty analysis of gas pipeline failure and gas combustion consequence. Stoch Environ Res Risk Assess 28:1431–1446. https://doi.org/10.1007/s00477-013-0845-4
Bao L, Li K, Zheng J et al (2022) Surface characteristics and stress corrosion behavior of AA 7075-T6 aluminum alloys after different shot peening processes. Surf Coat Technol 440:128481. https://doi.org/10.1016/j.surfcoat.2022.128481
Bhaskaran R, Palaniswamy N, Rengaswamy NS, Jayachandran M (2005) Global Cost of Corrosion—A Historical Review. Corros Mater 13:621–628
Cadoni E, Fenu L, Forni D (2012) Strain rate behaviour in tension of austenitic stainless steel used for reinforcing bars. Constr Build Mater 35:399–407
Chang MJ, Lin GF, Lee FZ et al (2020) Outflow sediment concentration forecasting by integrating machine learning approaches and time series analysis in reservoir desilting operation. Stoch Environ Res Risk Assess 34:849–866. https://doi.org/10.1007/s00477-020-01802-3
Chou JS, Ngo NT, Chong WK (2017) The use of artificial intelligence combiners for modeling steel pitting risk and corrosion rate. Eng Appl Artif Intell 65:471–483. https://doi.org/10.1016/j.engappai.2016.09.008
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20:273–297
Cover T, Hart P (1967) Nearest neighbor pattern classification. Inf Theory. IEEE Trans 13:21–27
Deng B, Jiang Y, Gong J et al (2008) Critical pitting and repassivation temperatures for duplex stainless steel in chloride solutions. Electrochim Acta 53:5220–5225
Diao Y, Yan L, Gao K (2021) Improvement of the machine learning-based corrosion rate prediction model through the optimization of input features. Mater Des 198:109326. https://doi.org/10.1016/j.matdes.2020.109326
El Amine Ben Seghier M, Keshtegar B, Tee KF et al (2020) Prediction of maximum pitting corrosion depth in oil and gas pipelines. Eng Fail Anal 112:104505. https://doi.org/10.1016/j.engfailanal.2020.104505
Estoup JM, Cabrillac R (1997) Corrosion of biological origin observed on concrete digestors. Constr Build Mater 11:225–232. https://doi.org/10.1016/S0950-0618(97)00041-X
EU (2018) Directive (EU) 2018/2001 of the European Parliament and of the Council of 11 December 2018 on the promotion of the use of energy from renewable sources (recast). Off J Eur Union 2018:82–209
Fawcett T (2006) An introduction to ROC analysis. Pattern Recognit Lett 27:861–874
Foorginezhad S, Mohseni-Dargah M, Firoozirad K et al (2021) Recent Advances in Sensing and Assessment of Corrosion in Sewage Pipelines. Process Saf Environ Prot 147:192–213. https://doi.org/10.1016/j.psep.2020.09.009
Forkan ARM, Kang Y, Bin, Jayaraman PP et al (2022) CorrDetector: A framework for structural corrosion detection from drone images using ensemble deep learning. Expert Syst Appl 193:116461. https://doi.org/10.1016/j.eswa.2021.116461
Friedman M (1940) A comparison of alternative tests of significance for the problem of m rankings. Ann Math Stat 86–92
Hatami S, Ghaderi-Ardakani A, Niknejad-Khomami M et al (2016) On the prediction of CO2 corrosion in petroleum industry. J Supercrit Fluids 117:108–112. https://doi.org/10.1016/j.supflu.2016.05.047
Hays GF (2010) World Corrosion Organization. Corrodia NACE Int, pp 2010–2011
International A, Materials, AS for T& (2004) Annual book of ASTM Standards. American Society for Testing & Materials
Jamain A, Hand DJ (2008) Mining supervised classification performance studies: A meta-analytic investigation. J Classif 25:87–112
Jia H, Qiao G, Han P (2022) Machine learning algorithms in the environmental corrosion evaluation of reinforced concrete structures - A review. Cem Concr Compos 133:104725. https://doi.org/10.1016/j.cemconcomp.2022.104725
Jiménez-Come MJ, de la Luz Martín M, Matres V (2019) A support vector machine-based ensemble algorithm for pitting corrosion modeling of EN 1.4404 stainless steel in sodium chloride solutions. Mater Corros 70. https://doi.org/10.1002/maco.201810367
Jiménez-Come MJ, Turias IJ, Ruiz-Aguilar JJ (2015) Pitting corrosion behaviour modelling of stainless steel with support vector machines. Mater Corros 66:915–924. https://doi.org/10.1002/maco.201407788
Jiménez-Come MJ, Turias IJ, Trujillo FJ (2014) An automatic pitting corrosion detection approach for 316L stainless steel. Mater Des 56. https://doi.org/10.1016/j.matdes.2013.11.045
Kan X, Zhou D, Yang W et al (2018) An investigation on utilization of biogas and syngas produced from biomass waste in premixed spark ignition engine. Appl Energy 212:210–222. https://doi.org/10.1016/j.apenergy.2017.12.037
Khan I, Kabir Z (2020) Waste-to-energy generation technologies and the developing economies: A multi-criteria analysis for sustainability assessment. Renew Energy 150:320–333. https://doi.org/10.1016/j.renene.2019.12.132
Kuhn HW, Tucker AW (1951) Nonlinear programming. In: Proceedings of the second Berkeley symposium on mathematical statistics and probability. California
Le AV, Veerajagadheswar P, Kyaw PT et al (2021) Towards optimal hydro-blasting in reconfigurable climbing system for corroded ship hull cleaning and maintenance. Expert Syst Appl 170:114519. https://doi.org/10.1016/j.eswa.2020.114519
Leong WC, Kelani RO, Ahmad Z (2020) Prediction of air pollution index (API) using support vector machine (SVM). J Environ Chem Eng 8:103208. https://doi.org/10.1016/j.jece.2019.103208
Li C, Xiao K (2021) Chloride threshold, modelling of corrosion rate and pore structure of concrete with metakaolin addition. Constr Build Mater 305:124666. https://doi.org/10.1016/j.conbuildmat.2021.124666
Li T, Wu J, Frankel GS (2021) Localized corrosion: Passive film breakdown vs. Pit growth stability, Part VI: Pit dissolution kinetics of different alloys and a model for pitting and repassivation potentials. Corros Sci 182:109277. https://doi.org/10.1016/j.corsci.2021.109277
Li Y, Alaimo CP, Kim M et al (2019) Composition and Toxicity of Biogas Produced from Different Feedstocks in California. Environ Sci Technol. https://doi.org/10.1021/acs.est.9b03003
Lo KH, Shek CH, Lai JKL (2009) Recent developments in stainless steels. Mater Sci Eng R Reports 65:39–104. https://doi.org/10.1016/j.mser.2009.03.001
Lv Y, Wang J, Wang JJ, liang et al (2020) Steel corrosion prediction based on support vector machines. Chaos Solitons and Fractals 136. https://doi.org/10.1016/j.chaos.2020.109807
Melo C, Dann M, Hugo RJ, Janeta A (2020) Extreme value modeling of localized internal corrosion in unpiggable pipelines. Int J Press Vessel Pip 182:104055. https://doi.org/10.1016/j.ijpvp.2020.104055
Mori G, Bauernfeind D (2004) Pitting and crevice corrosion of superaustenitic stainless steels. Mater Corros 55:164–173. https://doi.org/10.1002/maco.200303746
Muller K-R, Mika S, Ratsch G et al (2001) An introduction to kernel-based learning algorithms. Neural Networks. IEEE Trans 12:181–201
Nzila C, Dewulf J, Spanjers H et al (2012) Multi criteria sustainability assessment of biogas production in Kenya. Appl Energy 93:496–506. https://doi.org/10.1016/j.apenergy.2011.12.020
O’Shea R, Lin R, Wall DM et al (2020) Using biogas to reduce natural gas consumption and greenhouse gas emissions at a large distillery. Appl Energy 279:115812. https://doi.org/10.1016/j.apenergy.2020.115812
Oberndorfer M, Thayer K, Kästenbauer M (2004) Application limits of stainless steels in the petroleum industry. Mater Corros 55:174–180. https://doi.org/10.1002/maco.200303781
Okabe T, Otsuka Y (2021) Proposal of a Validation Method of Failure Mode Analyses based on the Stress-Strength Model with a Support Vector Machine. Reliab Eng Syst Saf 205:107247. https://doi.org/10.1016/j.ress.2020.107247
Pepper N, Crespo L, Montomoli F (2022) Adaptive learning for reliability analysis using Support Vector Machines. Reliab Eng Syst Saf 226:108635. https://doi.org/10.1016/j.ress.2022.108635
Sacks R, Girolami M, Brilakis I (2020) Building Information Modelling, Artificial Intelligence and Construction Tech. Dev Built Environ 4:100011. https://doi.org/10.1016/j.dibe.2020.100011
Schölkopf B, Smola AJ (2002) Learning with kernels: support vector machines, regularization, optimization and beyond. the MIT Press
Shekari E, Khan F, Ahmed S (2017) Economic risk analysis of pitting corrosion in process facilities. Int J Press Vessel Pip 157:51–62. https://doi.org/10.1016/j.ijpvp.2017.08.005
Shreir LL (2013) Localised Corrosion. Corros Third Ed 1. https://doi.org/10.1016/B978-0-08-052351-4.50014-5. 1:151-1:212
Singh R (2020) Welding, corrosion-resistant alloys—Stainless steel. Appl Weld Eng 251–271. https://doi.org/10.1016/b978-0-12-821348-3.00019-7
Soares N, Gaspar AR, Santos P, Costa JJ (2016) Experimental evaluation of the heat transfer through small PCM-based thermal energy storage units for building applications. Energy Build 116:18–34. https://doi.org/10.1016/j.enbuild.2016.01.003
Subramanian C (2018) Localized pitting corrosion of API 5L grade A pipe used in industrial fire water piping applications. Eng Fail Anal 92:405–417. https://doi.org/10.1016/j.engfailanal.2018.06.008
Sun Y, Wang X, Ren N et al (2022) Improved Machine Learning Models by Data Processing for Predicting Life-Cycle Environmental Impacts of Chemicals. Environ Sci Technol. https://doi.org/10.1021/acs.est.2c04945
Thodi P, Khan F, Haddara M (2009) The selection of corrosion prior distributions for risk based integrity modeling. Stoch Environ Res Risk Assess 23:793–809. https://doi.org/10.1007/s00477-008-0259-x
Vapnik VN (2000) The nature of statistical learning theory. Springer-Verlag, New York Incorporated
Vapnik VN (1999) An overview of statistical learning theory. Neural Networks. IEEE Trans 10:988–999
Vrignat P, Kratz F, Avila M (2022) Sustainable manufacturing, maintenance policies, prognostics and health management: A literature review. Reliab Eng Syst Saf 218. https://doi.org/10.1016/j.ress.2021.108140
Welling M (2005) Fisher linear discriminant analysis. Dep Comput Sci Univ Toronto 1:123–168. https://doi.org/10.1109/TNN.2010.2090047
Wen YF, Cai CZ, Liu XH et al (2009) Corrosion rate prediction of 3C steel under different seawater environment by using support vector regression. Corros Sci 51:349–355. https://doi.org/10.1016/j.corsci.2008.10.038
Yang J, Suo G, Chen L et al (2023) Prediction method of key corrosion state parameters in refining process based on multi-source data. Energy 263:125594. https://doi.org/10.1016/j.energy.2022.125594
Zuben A, Von, Viana FAC (2022) Generative adversarial networks for extrapolation of corrosion in automobile images. Expert Syst Appl 213:118849. https://doi.org/10.1016/j.eswa.2022.118849

No competing interests reported.

Download PDF

Version 1

posted

You are reading this latest preprint version

Prediction and detection of localised corrosion attack of stainless steel in biogas production: Machine Learning Classification Approach

Status:

Version 1

Abstract

Figures

Synopsis

1. INTRODUCTION

2. EXPERIMENTAL PROCEDURE

3. METHODOLOGY

4. RESULTS

5. CONCLUSION

Declarations

References

Additional Declarations

Status:

Version 1