Abstract
This paper presents the application of an Artificial Intelligence-based method in analyzing the effects of environmental conditions, chloride contamination in concrete, and surface corrosion of rebars on the amplitude of Ground Penetrating Radar (GPR) signals. Six reinforced concrete slabs with different chloride contamination mixtures were fabricated and tested. GPR data were collected under various temperature and ambient humidity combinations. A total of 288 rebar picks were used for training, validation, and testing the proposed Artificial Neural Network (ANN) model. Multiple ANN model configurations with a variation in learning algorithms and the number of nodes in the hidden layer were explored to obtain the optimal model for the nondestructive data. It is shown that the “trainlm” learning algorithm produced the high accuracy prediction of the reflection amplitude of GPR signals. The sensitivity analysis was also conducted with the ANN model to investigate the effects of the input on the output parameters. Results from the sensitivity analysis revealed that the GPR reflection amplitudes were more sensitive to the changes of temperature parameter (TEM) and chloride contamination level (CCL), while they were less sensitive to the variation of ambient relative humidity (ARH) and rust condition on the rebar surface (CSR).
1. Introduction
Ground Penetrating Radar is a geophysical technique using radar pulses for mapping subsurface structure. GPR works by emitting a short pulse of electromagnetic radiation into media and recording the return reflected signal via the receiving antenna. When the electromagnetic waves encounter a buried object or a boundary between two media with different dielectric permittivity, the signals may be reflected or scattered back to the surface. By evaluating the return signals, the subsurface objects, changes in material properties, and voids and cracks can be detected [1]. Figure 1 shows different types of signals produced by a commercial GPR device for three kinds of material, namely, (i) rebar, (ii) an empty plastic conduit, and (iii) a plastic conduit filled with water [2].

The depth range of GPR investigation depends on factors such as the electrical conductivity of the media and the frequency of the transmitted wave. Materials with a high electrical conductivity would attenuate the wave energy quickly. As a result, the penetration depth would decrease in these materials, and vice versa. The rate of energy attenuation drops with lower frequency waves. Thus, the introduced waves with lower frequencies in the same media would penetrate much higher than that of the higher frequencies. The resolution, however, increases with a higher frequency wave. Thus, the selection of operating frequency is a trade-off between resolution and the penetrating depth.
The application of the GPR method has been extended over recent years in various fields, including archaeology, earth science, nondestructive testing, and environmental remediation. In the nondestructive testing area, GPR is used to detect and pinpoint rebar in concrete [3], to estimate concrete cover thickness and rebar diameters [4, 5], and to monitor the deterioration process in concrete structures [6]. In these applications, the reflection amplitudes were analyzed to identify the target objects using the specialized software. The GPR reflection amplitude, however, was affected by factors such as chloride contaminated concrete, rust on the surface of rebar, and the environmental conditions, as suggested in a recent study [7].
Traditionally, the method using the experimental nondestructive test is used to collect data for investigating the effects of chloride content in concrete, surface rust on rebar, and environmental conditions on GPR signals [7–13]. In recent years, an alternative method that can be employed for collecting GPR data is to apply the Artificial Intelligence-based technique. Available data were utilized for training, validating, and testing the proposed ANN model. The successful ANN model could learn from data to establish the nonlinear relationship between the inputs and outputs. Thus, the GPR reflection amplitude can be predicted from the certain input conditions without a need to perform the experimental tests.
Many researchers have successfully applied the ANN model in dealing with various engineering issues [14–22]. For example, Guneyisi et al. [20] employed ANN to predict the flexural overstrength factor of steel beams using 141 experimental records from previously published sources. Results from the study revealed that the ANN model can be used to estimate the capacity of the beam with a high level of accuracy. In another study, Hakim et al. [21] apply ANN to detect the damage of steel girders. It was reported that the ANN model can determine the severity of damage with an error of 6.8 percent.
Regarding the application of ANN for GPR-related problems, Dinh et al. [23] applied the ANN method for investigating the effects of concrete cover thickness on rebar reflection amplitude. An ANN model was developed using concrete cover thickness as the input. The output of the model was the GPR reflection amplitude of rebar. Rebar picks were used for training, testing, and validating the proposed ANN model. Results from the study indicated that the ANN model could be employed for evaluating the influence of concrete cover thickness on rebar GPR reflection amplitude with the coefficient of determination value for the entire data set was 0.91.
In recent research, Liu et al. [24] employed a neural network to pinpoint subsurface objects and evaluate the backscattering properties from GPR data. A single-layer artificial system, called Adaptive Linear Neural Network, was utilized in the study. The inputs were a series of primary functions derived from GPR signals, and the output was the reconstructed GPR data. Numerical GPR data were created and used to train, test, and validate the proposed ANN model. The inverted backscattering intensity with a relative error rate of less than three percent was found in the study. Another approach is using a neural network to process the images produced by GPR devices [25–27].
Many investigators have carried out research to address the GPR-related issues with various approaches [23–27]. Few studies were conducted to develop the relationship between one input with GPR reflection amplitude from rebar using the ANN model [23]. To the best of the authors’ knowledge, there are no available publications using the ANN method to establish the GPR signals from rebar with multiple inputs. The primary objective of this research is to fill the knowledge in the application of the ANN method for predicting GPR signals from different input variables. The ANN structure used in this study was constructed with MATLAB R2020a Runtime Environment.
2. Experimental Data
GPR data used in this study were collected on six reinforced concrete slab specimens fabricated at the Advanced Material Testing Laboratory at Marshall University using a GPR device. Four input parameters, namely, chloride contamination level, ambient temperature, surrounding relative humidity, and rust condition on the rebar surface, were examined during the data collection process. The reinforced concrete slabs were fabricated with an identical reinforcement configuration but different chloride contamination concrete levels. GPR data were collected in various temperature and ambient relative humidity combinations inside an environmental growth chamber. The GPR reflection amplitudes from rebar were manually picked from the raw data using commercial software. These steps were described in detail in the subsequent sections.
2.1. Specimen Fabrication
The overall dimensions of all specimens were 36.5 × 114.3 × 17.8 cm (width × length × thickness) with specific rebar locations shown in Figure 2. Six noncorroded steel rebars (nominal diameter of 1.58 cm) labeled in numerical order from one (#1) to six (#6) were placed into concrete slab specimens at varying depths. Besides, a precorroded steel rebar at position seven (#7) was placed at the approximate depth as the noncorroded steel rebar at the location six (#6). Specimens were named from SPC1 to SPC6 along with the increase in the percent of chloride content in the specimen (by the weight of concrete) from zero percent (i.e., control specimen, SPC1) to 0.10 percent (in SPC6) with an increment of 0.02 percent. Details of specimens and their chloride content can be found in Table 1.

2.2. GPR Data Acquisition
The testing method using the environmental chamber to study the effects of temperature and relative humidity has also been used by many researchers [28–33]. In this study, GPR data were collected inside an environmental growth chamber to account for the effects of environmental conditions on GPR reflection amplitude. The chamber was operated at three levels of relative humidity 55%, 70%, and 85%, respectively. The temperature corresponding to each relative humidity level varied from 5°C to 40°C with an increment of 5°C. Table 2 shows detailed information about each parameter related to the GPR data collection process.
The first GPR data collection procedure was conducted at a combination of a temperature of 5°C and relative humidity of 55%. After completing the first data collection process, the temperature in the chamber was increased from 5°C to 10°C, while the relative humidity was maintained at 55%. This ambient environmental condition remained constant until the next GPR data collection was conducted. The same process was applied for the temperature between 15°C and 40°C with an increment step of 5°C. When all GPR data for a different level of temperatures corresponding to relative humidity 55% were collected entirely, the relative humidity in the environmental chamber was changed to a new relative humidity level (70% and 85%, respectively), and the similar data collection procedure was used for the remaining tests.
2.3. GPR Reflection Amplitude
The raw GPR data were processed using RADAN7 commercial software [34]. Due to the limitation of the testing devices, the variation in the depth of rebar would not be included in the data collection. The reflection amplitudes from two rebars located at the same depth (rebar #6 and rebar #7) were manually picked, normalized, and converted from data units to decibel (dB) using equation (1). The normalization step was conducted to ensure that the picked GPR amplitude data for steel rebar are comparable [35].where Ampdb is data amplitude in decibels, Amprb is rebar amplitude in data units, and Ampdc is direct-coupling amplitude in data units.
The final data set consisted of 288 data points (rebar picks); each of them included five properties, namely, temperature (TEM), ambient relative humidity (ARH), chloride level (CLL), corrosion condition on the surface of rebar (CSR), and reflection amplitude from rebar (GRA). The range of the input and output parameters is shown in Table 3. It is worth mentioning that the original values data in the CRS column were not a number. Thus, the values in this column were converted into a readable format for the ANN model. Number 0 was represented for the noncorroded rebar, while number 1 was designated for the corroded rebar.
3. Artificial Neural Networks Methods
An intelligence-based technique, called ANN, was employed in this study to predict the GPR reflection amplitude from the input parameters listed in Table 2. Different types of learning algorithms were examined, and the number of neurons in the hidden layer was investigated to construct the optimal ANN model for the GPR data. Details of these steps are presented in the following sections.
3.1. Network Architecture
ANN is an adaptive data processing system that can learn from the information of the given inputs to generate outputs. An ANN system contains a set of simple neurons working independently and connecting to the others. A neuron collects input values from neurons on the previous layers, calculates an output, and transmits the results to all connected neurons on the next layer. Figure 3 illustrates the structure of a neuron. It consists of four main parts: (i) inputs or information that enters the neuron, (ii) weights and bias, (iii) transferring unit, and (iv) activation and outputs [36].

ANN learns to bridge between input and output through a learning process known as error backpropagation. The process operates by using the errors generated in the network output to adjust the weights in each layer. The error backpropagation includes two reversal processes; one is a feed-forward process, and the other is a backpropagation process. In the feed-forward process, the inputs are used to obtain the outputs with some network errors. The errors are then passed back to the input layers through the backpropagation process. The weights are adjusted during this process to reduce the network errors to an appropriate level.
3.2. Performance Criteria
Performances of the ANN model were assessed based on three factors: coefficient of determination (R2), Mean Squared Error (MSE), and Root Mean Squared Error (RMSE). The coefficient of determination measures the correlation between input and output parameters usingwhere yi is the ith actual output; is the mean of the actual outputs; is the ith predicted outputs; n is the total number of data samples. MSE is the average squared difference between predicted outputs and actual outputs. MSE can be computed using
Root Mean Squared Error is the square root of Mean Squared Error and can be calculated by
3.3. Choice of Networks
Four parameters, including temperature (TEM), ambient relative humidity (ARH), chloride level (CLL), and corrosion condition on the surface of rebar (CSR), were selected as the inputs for the ANN model, and the GPR reflection amplitude (GRA) was assigned as the output. The dataset was randomly divided into three subsets in which 70% (i.e., 200 data points) of the entire dataset was employed for training model, 15% (i.e., 44 data points) for validation. The remaining 15% (i.e., 44 data points) was utilized for testing the prediction accuracy of the ANN model.
Six popular learning algorithms were used to explore potential ANN models for estimating GPR reflection amplitude. Three assessment categories, namely, training performance (TRP), testing performance (TEP), and validation performance (VAP), were employed to evaluate the performance of the potential ANN models. For each model, ten trials were conducted, and the best performance results were recorded and presented in Table 4. The “trainlm” algorithm (Levenberg–Marquardt) was found to produce the best performance for training, testing, and validation. Thus, the “trainlm” algorithm was selected for the proposed ANN model. The selection was in line with the previous study [37].
Based on the recommendation of the previous study [37], the ANN model with one hidden layer was appropriate for dealing with most of the engineering problems. Thus, the ANN model with one hidden layer was selected for this study. In order to determine the number of neurons in the hidden layer, the ANN models were formed with the number of neurons in the hidden layer being changed from one neuron to 20 neurons with an increment of one neuron. Each model configuration conducted ten trials with identical datasets to obtain the average performance results. The performance of ANN models was then evaluated and plotted in Figure 4 based on the MSE values of the training, validation, and testing.

It can be seen clearly from Figure 4 that the ANN model with 16 neurons generated the best results. Thus, the ANN model with 16 neurons in the hidden layer was picked to be utilized in this study. Table 5 presents detailed information about the chosen ANN model.
4. Prediction of GPR Amplitude
As mentioned above, the GPR reflection amplitude data were predicted based on the four input parameters using the proposed ANN model. The performance of the model was evaluated through the three indications, namely, R2, MSE, and RMSE. Alternatively, the performance results were presented in terms of regression plots for training, validation, testing, and overall. In addition, the sensitivity analysis was conducted to evaluate the effects of the single input parameter on the output. Details were described in the subsequent sections.
4.1. Capacity of ANN Model in Predicting GPR Data
Figure 5 shows the performance of the proposed ANN model in the training stage. The performance results of the ANN model for all datasets are presented in Table 6. The ANN model performed well in predicting the GPR reflection signals with the coefficient of determination (R2) for overall being 0.9958. These numbers for training, validation, and testing were 0.9965, 0.9938, and 0.9941, respectively. Note that the value of R2 varies between 0 and 1, and the higher the value of R2, the better the model. Besides R2, MSE and RMSE coefficients were also used in this study to evaluate the performance of the model. MSE value for the overall was 0.122, showing the excellent prediction capacity of the ANN model.

(a)

(b)
Figure 6 illustrates the linear regression plots of the ANN model performance for training, validation, testing, and overall. In these figures, the X-axis (i.e., horizontal axis) represents the values from experimental data, and the Y-axis (i.e., vertical axis) presents the prediction values produced by the ANN model. The instances located on the diagonal line indicate a perfect prediction result from the model.

(a)

(b)

(c)

(d)
4.2. Errors Assessment
The error histogram for the ANN model in different datasets is presented in Figure 7. It measures the error between predicted values generated from the ANN model and the experimental data. In this graph, the entire error range was divided into 20 vertical bars (bins). The vertical axis (i.e., Y-axis) presents the number of instances from the dataset in a certain bar. For example, a 9th bin was corresponding to the error of 0.028 dB, and the height of about 48 represented the total number of instances in the training, validation, and testing dataset with that error. The zero-line is related to the zero error on the horizontal axis (i.e., X-axis).

As can be seen from Figure 7, the errors of the most test samples were between −0.156 dB and 0.167 dB. The negative error value means that the predicted GPR reflection amplitude was smaller than that of the experimental one. A limited number of instances in the training set experienced an error as large as 0.539 dB. The substantial error value might be due to the outliners from the original data. The outliners often generate during data collection processes because of some unexpected activities such as inadequate calibration in measuring devices, mistakes in the initial setup of data processing, or human errors from the measurement. The performance of the proposed ANN model would increase if these outliners were removed from the input data.
4.3. Study of Inputs and Output Relationship
A sensitivity analysis was performed to investigate the variation of the input parameters to the variation of the output. The sensitivity analysis involves two steps. First, changing one input parameter while keeping others at their nominal values. Second, returning that variable to its nominal value and then repeating the first step for each of the other inputs. In this study, the value of each input variable was sorted into five categories including Low (the smallest value of the input parameter), Mid Low (halfway between Low and Mid), Mid (halfway between Low and High), Mid High (halfway between Mid and High), and High (the biggest value of the input parameter), as listed in detail in Table 7.
The sensitivity analysis was conducted for each input variable by changing its value from Low to High while keeping the values of other input at the Mid. Results from the sensitivity analysis of all input variables were plotted in the form of a parallel coordinate graph, as presented in Figure 8. This graph has five vertical axes positioned from left to right along with the X-axis; each of the axes represents a certain level of the input parameter values, including Low, Mid Low, Mid, Mid High, and High. The value on the vertical axis is the GPR reflection amplitude in decibel.

As can be observed clearly from Figure 8, the GPR reflection amplitudes were sensitive to the change of temperature (TEM) and the chloride contamination level (CLL). Specifically, the GPR reflection amplitude would decrease along with the increase in the TEM variable, and vice versa. The same trend was applied for the variation of the CLL parameter. In other words, the GPR reflection amplitude is high when the chloride contamination level in concrete is low. By contrast, the ambient relative humidity variable (ARM) and the rust condition on the rebar surface (CRS) were found to produce a minimal effect on the GRP reflection amplitude. This means that the change in the value of the ARM and CRS parameters would yield a little influence on the output.
5. Conclusions
In this paper, the ANN method was employed to predict the GPR reflection amplitudes from the four inputs, namely, temperature (TEM), ambient relative humidity (ARH), chloride level (CLL), and corrosion condition on the surface of rebar (CSR). A total of 288 GPR data points were collected from a series of chloride contamination concrete slabs under various environmental profiles that were used to train, validate, and test the proposed ANN model. The performance results revealed that the GPR reflection amplitudes could be predicted from the inputs at a high level of accuracy with the coefficient of determination of 0.9958.
The effects of changing inputs on the variation of the output were also conducted in this study. Results from the study indicated that the GPR reflection amplitudes, on one hand, were found to be more sensitive to the temperature changes (TEM) and chloride contamination level (CCL) parameters. The variation of ambient relative humidity (ARH) and rust condition on rebar surface (CSR) variables, on the other hand, were found to be less sensitive to the output of the proposed ANN model.
Data Availability
The data used to support the findings of this study are included in the article.
Conflicts of Interest
The authors declare no conflicts of interest.
Authors’ Contributions
Z. W. and H. N. performed review, funding acquisition, and supervision; H. N. performed project administration and provided resources; Z. W., H. N., and T. N. conceptualized the study; T. N. were responsible for methodology and data curation and provided software and wrote the original draft.
Acknowledgments
This paper is based upon the work supported by the West Virginia Department of Transportation (WVDOT) through the research project entitled “Corrosion Research to Maintain and Sustain Infrastructure in West Virginia.”