The dataset consisted of a 1134 × 6 matrix, of which five columns (1134 × 5) were input parameters and one column of 1134 values was the output. The input parameters were the distances in the X and Y directions, temperature, age and relative humidity (RH), while the output was the HCP value. Five-fold cross-validation was carried out to prevent over-fitting. Principal Component Analysis (PCA) was also carried out, with the criterion for component reduction set at 95 % explained variance. The explained variance per component was 43.0 %, 38.2 %, 11.7 %, 6.6 % and 0.5 % for distance in X, distance in Y, age, temperature and RH, respectively. Hence only the first four components, which together explain 99.5 % of the variance, were retained for training. The original normalized dataset is presented in Fig. 6.
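For illustration, the preprocessing described above can be reproduced with a short script. The following is a minimal sketch using scikit-learn rather than the original modelling environment; the data loading is a placeholder, not the actual dataset.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.model_selection import KFold
from sklearn.preprocessing import MinMaxScaler

# Placeholder data: X stands in for the 1134 x 5 input matrix (distance X,
# distance Y, temperature, age, RH); y for the 1134 HCP values.
rng = np.random.default_rng(0)
X = rng.random((1134, 5))
y = rng.random(1134)

# Normalize the inputs, as in the normalized dataset of Fig. 6.
X_scaled = MinMaxScaler().fit_transform(X)

# Retain the smallest number of components explaining >= 95 % of the
# variance; with the percentages reported above this keeps four components.
pca = PCA(n_components=0.95)
X_reduced = pca.fit_transform(X_scaled)
print(pca.explained_variance_ratio_)

# The five-fold cross-validation split used to guard against over-fitting.
cv = KFold(n_splits=5, shuffle=True, random_state=0)
```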
Results of Boosted Trees
Boosted Trees is a sophisticated ensemble learning method and a powerful machine learning tool that performs well on both regression and classification tasks. It combines many decision trees into a single predictive model: the trees are trained sequentially, with each new tree fitted to correct the errors of its predecessors. In gradient boosting, each new tree minimizes a predetermined loss function by reducing the ensemble's residual error, and the training instances are recalibrated according to the gradient of the loss function with respect to the ensemble predictions. To avoid over-fitting, Boosted Trees restrict tree depth, apply a shrinkage (learning-rate) parameter that modifies each tree's contribution, and sub-sample the data during training. Boosted Trees are known for their outstanding predictive ability and robustness to noisy data, and they can identify complex relationships between features and target variables across varied datasets. Fig. 7 shows the results of Boosted Trees. Fig. 7(a) shows the response plot, in which the blue dots represent the true values and the yellow dots the predicted values. The response plot is non-linear, indicating that the predicted response is not consistently influenced by the input variables throughout their entire range. It thus gives a visual evaluation of the predictive performance of the model, enabling recognition of patterns, trends, outliers, and areas that could use improvement. Similar results were obtained for Bagged Trees and the optimizable ensembles.
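Continuing from the preprocessing sketch above, a least-squares boosted tree ensemble analogous to the Boosted Trees model can be sketched with scikit-learn; the hyperparameter values here are illustrative defaults, not the tuned values reported later.

```python
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.model_selection import cross_val_score

boosted = GradientBoostingRegressor(
    loss="squared_error",  # least-squares loss minimized by each new tree
    n_estimators=100,      # number of sequentially trained trees
    learning_rate=0.1,     # shrinkage applied to each tree's contribution
    max_depth=3,           # restricted depth guards against over-fitting
    subsample=0.8,         # row sub-sampling during training
    random_state=0,
)
scores = cross_val_score(boosted, X_reduced, y, cv=cv,
                         scoring="neg_root_mean_squared_error")
print(-scores.mean())  # mean cross-validated RMSE
```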
Fig. 7(b) shows the plot of predicted vs. actual response for Boosted Trees. The x-axis shows the observed values of the response variable, while the y-axis shows the values predicted by the model. Each point on the scatter plot represents an observation in the dataset, and its position reflects the relationship between its actual and predicted values. For a perfect model, all points would align along the diagonal line where predictions match observations. Systematic deviations from this line can reveal trends or biases in model performance and suggest improvements, while the spread and variability of points around the line indicate the model's accuracy and precision over different ranges of actual values. The predicted vs. actual response plot can therefore be used to assess the model's predictive strength and its ability to capture relationships in the data.
Fig. 7(c) presents the residual plot obtained for the boosted ensemble, showing the difference between observed and predicted values for each observation; residuals scattered randomly about zero indicate an unbiased fit.
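Plots of the kind shown in Fig. 7(b) and 7(c) can be generated as follows; this is a minimal matplotlib sketch assuming y_true and y_pred are NumPy arrays of observed and predicted responses from any fitted model.

```python
import matplotlib.pyplot as plt

def diagnostic_plots(y_true, y_pred):
    """Predicted-vs-actual and residual plots, as in Fig. 7(b) and 7(c)."""
    fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(10, 4))

    # Predicted vs. actual: a perfect model falls on the diagonal line.
    ax1.scatter(y_true, y_pred, s=10)
    lims = [min(y_true.min(), y_pred.min()), max(y_true.max(), y_pred.max())]
    ax1.plot(lims, lims, "k--")
    ax1.set_xlabel("Actual response")
    ax1.set_ylabel("Predicted response")

    # Residuals: random scatter about zero indicates an unbiased fit.
    ax2.scatter(y_pred, y_true - y_pred, s=10)
    ax2.axhline(0.0, color="k", linestyle="--")
    ax2.set_xlabel("Predicted response")
    ax2.set_ylabel("Residual")

    plt.tight_layout()
    plt.show()
```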
Bagged Trees
Bagged Trees, also known as Bootstrap Aggregating Trees, are a powerful ensemble learning technique used in machine learning for regression and classification. Multiple decision trees are built independently on varied bootstrap samples of the training data, and their outputs are then averaged (for regression) or voted on (for classification). This approach exploits tree diversity to improve the prediction accuracy and robustness of the model. Bagged Trees construct subsets of the data by sampling with replacement, allowing the individual trees to capture different aspects of the data distribution. By aggregating the predictions of these trees, Bagged Trees reduce variance, mitigating over-fitting and improving model stability. Because the trees are trained independently, bagging is easily parallelized and can efficiently handle large datasets on distributed computing systems. Bagged Trees can handle noise and outliers, although their simplicity may limit their ability to capture complicated correlations compared with boosting approaches. Nevertheless, Bagged Trees remain popular among machine learning practitioners because of their simplicity, versatility, and solid performance across domains and applications. Fig. 8 shows the results of Bagged Trees.
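A Bagged Trees model corresponding to this description can be sketched as follows, using scikit-learn's BaggingRegressor; the hyperparameter values are illustrative.

```python
from sklearn.ensemble import BaggingRegressor
from sklearn.tree import DecisionTreeRegressor

# Independent trees on bootstrap resamples, averaged for regression.
bagged = BaggingRegressor(
    estimator=DecisionTreeRegressor(),
    n_estimators=100,  # number of independently trained trees
    bootstrap=True,    # sample training rows with replacement
    n_jobs=-1,         # trees are independent, so training parallelizes
    random_state=0,
)
bagged.fit(X_reduced, y)  # X_reduced, y from the preprocessing sketch
```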
Optimizable Ensemble
Bayesian optimization and random search optimizers were next used with a maximum of 30 iterations, with expected improvement per second plus as the acquisition function. The hyperparameter search space for the ensemble method covered the Bag and LSBoost methods, with the number of learners ranging from 10 to 500, a learning rate of 0.001 to 1, a minimum leaf size between 1 and 567, and 1 to 4 predictors to sample.
For the Bayesian optimization, the optimized hyperparameters were the LSBoost ensemble method with a minimum leaf size of 94, 270 learners, a learning rate of 0.34858 and 4 predictors to sample. For this model, the RMSE, R2, MSE and MAE were 0.018097, 0.97, 0.00032752 and 0.013769, respectively. The prediction speed was approximately 4600 obs/sec and the training time was 116.66 sec.
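A search of this kind can be approximated in Python with scikit-optimize, as sketched below. This assumes min_samples_leaf plays the role of the minimum leaf size and max_features that of the number of predictors to sample; note that scikit-optimize uses its own acquisition functions rather than the expected improvement per second plus function named above.

```python
from skopt import BayesSearchCV
from skopt.space import Integer, Real

# Search space mirroring the ranges stated above: 10-500 learners,
# learning rate 0.001-1, minimum leaf size 1-567, 1-4 predictors.
search_space = {
    "n_estimators": Integer(10, 500),
    "learning_rate": Real(0.001, 1.0, prior="log-uniform"),
    "min_samples_leaf": Integer(1, 567),
    "max_features": Integer(1, 4),
}

opt = BayesSearchCV(
    GradientBoostingRegressor(random_state=0),
    search_space,
    n_iter=30,  # maximum of 30 optimizer iterations, as above
    cv=cv,      # the five-fold split defined earlier
    scoring="neg_root_mean_squared_error",
    random_state=0,
)
opt.fit(X_reduced, y)
print(opt.best_params_)
```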
For the random search optimizer (Fig. 10), the optimized hyperparameters included 58 learners and 2 predictors to sample. For this model, the RMSE, R2, MSE and MAE were 0.023896, 0.95, 0.00057534 and 0.018053, respectively.
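Random search over the same space can be sketched with scikit-learn directly; again, this mirrors rather than reproduces the original workflow.

```python
from scipy.stats import loguniform, randint
from sklearn.model_selection import RandomizedSearchCV

# Random search draws hyperparameters at random instead of modelling the
# objective, so each iteration is cheaper but less informed.
rand_search = RandomizedSearchCV(
    GradientBoostingRegressor(random_state=0),
    param_distributions={
        "n_estimators": randint(10, 501),
        "learning_rate": loguniform(0.001, 1.0),
        "min_samples_leaf": randint(1, 568),
        "max_features": randint(1, 5),
    },
    n_iter=30,
    cv=cv,
    scoring="neg_root_mean_squared_error",
    random_state=0,
)
rand_search.fit(X_reduced, y)
```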
Performance Evaluation
RMSE: This metric is the square root of the average squared difference between predicted and actual values. Lower values indicate better performance. LSBoost with Bayesian optimization has the lowest RMSE (0.018097), indicating the most accurate predictions.
R-squared: R-squared represents the proportion of variance in the dependent variable explained by the independent variables. Higher values are desirable, indicating better model fit. LSBoost with Bayesian optimization achieves the highest R-squared (0.97), suggesting it captures a significant portion of the variance in the data.
MSE: Similar to RMSE, MSE measures the average squared difference between predicted and actual values. Lower values indicate better performance. LSBoost with Bayesian optimization has the lowest MSE (0.00032752), indicating superior predictive accuracy.
MAE: MAE measures the average absolute difference between predicted and actual values. Again, lower values are better. LSBoost with Bayesian optimization achieves the lowest MAE (0.013769), indicating the smallest average absolute prediction error.
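These four metrics can be computed directly from observed and predicted values; the sketch below continues from the fitted optimizer above (in-sample purely for illustration, whereas the values reported here are cross-validated).

```python
import numpy as np
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score

y_pred = opt.predict(X_reduced)   # opt from the Bayesian search sketch
mse = mean_squared_error(y, y_pred)
rmse = np.sqrt(mse)               # RMSE is the square root of the MSE
mae = mean_absolute_error(y, y_pred)
r2 = r2_score(y, y_pred)
print(f"RMSE={rmse:.6f}  MSE={mse:.8f}  MAE={mae:.6f}  R2={r2:.2f}")
```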
Prediction Speed: This metric measures how quickly the model can make predictions. Higher values are desirable, indicating faster prediction times. Here, the Bagged Trees ensemble method has the highest prediction speed (12000 obs/sec), followed by Boosted Trees (8500 obs/sec).
Training Time: Training time measures how long it takes to train the model. Shorter times are preferable, particularly for large datasets or real-time applications. LSBoost with Bayesian optimization has the longest training time (116.66 sec), whereas Bagged Trees and Boosted Trees have relatively shorter training times (9.0191 sec and 9.7955 sec, respectively).
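Prediction speed in observations per second can be estimated by timing a batch of predictions; the measurement approach below is an assumption for illustration, as the reported figures come from the modelling environment itself.

```python
import time

model = bagged  # any fitted model from the sketches above
start = time.perf_counter()
model.predict(X_reduced)
elapsed = time.perf_counter() - start
print(f"{len(X_reduced) / elapsed:.0f} obs/sec")
```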
Minimum Leaf Size and Number of Learners: These are hyperparameters that can significantly impact model performance. LSBoost with Bayesian optimization has a larger minimum leaf size (94) and a higher number of learners (270) compared to the other methods.
PCA: This indicates the percentage of variance explained by each principal component. As described earlier, PCA was used here for dimensionality reduction, so it influences model performance indirectly, through the reduced four-component input space supplied to the learners, rather than being a property of any single ensemble method.
Number of Predictors to Sample: This hyperparameter determines the number of features randomly selected at each split. LSBoost with Bayesian optimization uses 4 predictors to sample, while the random search model uses 2.
Critical Evaluation of the Performance
LSBoost with Bayesian optimization consistently outperforms other methods across various metrics, indicating its effectiveness in this scenario. However, its training time is substantially longer compared to other methods. Depending on the application, this trade-off between accuracy and computational cost needs careful consideration. Bagged Trees and Boosted Trees also perform well, offering a good balance between prediction speed and accuracy. These methods might be preferable for applications where training time is a crucial factor and slight decreases in predictive accuracy are acceptable. Overall, the choice of ensemble method depends on the specific requirements of the application, considering factors such as prediction accuracy, training time, and computational resources available. LSBoost with Bayesian optimization stands out for its high accuracy but may be less suitable for time-sensitive applications due to its longer training time.
The longer training time of LSBoost with Bayesian optimization compared to other methods can be attributed to several factors:
Bayesian Optimization Overhead: Bayesian optimization involves building a probabilistic model of the objective function and iteratively selecting new hyperparameters based on the model's predictions. This iterative process incurs computational overhead, including the evaluation of the objective function and updating the probabilistic model.
Complexity of LSBoost: LSBoost is a sophisticated ensemble learning method that sequentially combines weak learners (typically decision trees) to minimize a least squares objective function. The optimization process in LSBoost is inherently more complex compared to simpler methods like Bagged Trees or Boosted Trees, requiring more computational resources and time for training.
Large Search Space: Bayesian optimization explores a broader search space for hyperparameters compared to other methods. While this thorough exploration can lead to better-performing models, it also requires more computational effort to evaluate a larger set of hyperparameters and select the optimal combination.
Number of Learners: LSBoost with Bayesian optimization in this scenario employs a high number of learners (270). Increasing the number of learners typically lengthens training due to the sequential nature of the ensemble method: each additional learner must be fitted to the residuals of the previous models, adding to the computational burden (see the sketch after this list).
Minimum Leaf Size: The choice of hyperparameters, such as the minimum leaf size (94 in this case), can also influence training time. A smaller minimum leaf size permits deeper, more complex trees that require more computational effort to build; the comparatively large value selected here constrains tree growth, so in this scenario the training time is dominated by the optimization overhead and the number of learners rather than by individual tree complexity.
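The sequential residual fitting that drives this cost can be made concrete with a minimal least-squares boosting loop; this is an illustrative from-scratch sketch of the principle, not the LSBoost implementation used in the study.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

def ls_boost(X, y, n_learners=270, learning_rate=0.34858, min_leaf=94):
    """Minimal least-squares boosting: each tree fits the current residuals."""
    pred = np.full(len(y), y.mean())       # initial constant prediction
    trees = []
    for _ in range(n_learners):            # strictly sequential, no parallelism
        residuals = y - pred               # negative gradient of squared loss
        tree = DecisionTreeRegressor(min_samples_leaf=min_leaf)
        tree.fit(X, residuals)
        pred += learning_rate * tree.predict(X)  # shrunken additive update
        trees.append(tree)
    return trees, pred
```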
While LSBoost with Bayesian optimization may have longer training times, it offers superior predictive performance across various metrics, as evidenced by lower RMSE, MSE, and MAE values and higher R-squared values. Therefore, the trade-off between training time and accuracy must be carefully considered based on the specific requirements of the application. In scenarios where prediction accuracy is paramount and computational resources are available, the longer training time of LSBoost with Bayesian optimization may be justified. However, for time-sensitive applications or when computational resources are limited, alternative methods with shorter training times, such as Bagged Trees or Boosted Trees, may be more suitable despite potentially sacrificing some predictive performance.