Abstract
This paper describes the application of two artificial intelligence- (AI-) based methods to predict the 28-day compressive strength of fiber-reinforced high-strength self-compacting concrete (FRHSSCC) from its ingredients. A series of 131 data samples collected from various published literature sources were used for training, validation, and testing models. Various AI models were developed with different training algorithms and a number of nodes in the hidden layer to obtain the optimal model for the FRHSSCC data. It is shown that the performances of the artificial neural network (ANN) were better than that of the adaptive neurofuzzy inference system (ANFIS) model. Specifically, the overall coefficient of determination (R2) of the ANN and ANFIS models was 0.9742 and 0.9584, respectively. The sensitivity analysis was also conducted with the ANN model to investigate the effects of input parameters on the output. The results from the sensitivity analysis revealed that the compressive strength of FRHSSCC at 28 days was more sensitive with the changes of water by cement ratio (WCR) parameter and insensitive with varying amounts of fiber (VOF). Finally, it can be concluded that the application of artificial intelligence shows the great potential in the prediction of compressive strength of FRHSSCC.
1. Introduction
Concrete is among the most widely used materials in civil buildings and other structures around the world. Conventional concrete is a mixture of water, cement, and aggregates [1]. Due to some limitations such as low strength/weight ratio, the conventional concrete is often suitable for low-rise civil buildings and typical span structures. Efforts of expanding the applications of concrete in the construction industry lead to the arrival of high-performance concrete (HPC). Unlike the traditional concrete, HPC is designed to achieve many exceptional properties such as more workability in the fresh state and high strength and durability in the hardened state [2].
Among many types of concrete in the HPC family, high-strength self-compacting concrete (HSSCC) is broadly utilized in the construction projects [3, 4]. HSSCC is often made by adding mineral admixtures (fly ash and blast furnace slag) and chemical materials (superplasticizer) to the conventional concrete mixture. Along with the development in the field of concrete technology, a new class of concrete, called fiber-reinforced high-strength self-compacting concrete, is developed. FRHSSCC is the combination of HSSCC with various kinds of fiber. The new construction material benefits many superior properties of HSSCC such as highly flowable, nonsegregating, as well as the ductility improvement from added fibers [5, 6].
As a crucial hardened property of FRHSSCC, the compressive strength and other material properties can be evaluated using the experimental tests. The method, however, not only is time-consuming to conduct but also requires intensive testing facilities and skilled personnel [7–10]. To overcome those obstacles, an alternative method to estimate the compressive strength of FRHSSCC using AI can be applied. The AI-based method used published experimental mixes from previous studies to develop the nonlinear implicit relationship between various input parameters with the compressive strength of FRHSSCC.
Within the various AI-based techniques, artificial neural networks and adaptive neurofuzzy inference system are the most commonly used methods for evaluating the hardened properties of the concrete material. For example, Ince employed the ANN model to predict the fracture parameters of concrete [11]. The ANN method was also applied for the prediction of compressive strength for different types of concrete such as concrete containing construction and demolition waste [12], lightweight concrete [13], geopolymer concrete [14], environment-friendly concrete [15], high-strength concrete [16], no-slump concrete [17], and HPC with the specific admixtures [18].
Not only the ANN model but also the ANFIS model was utilized by many researchers. For instance, Khademi et al. estimated the 28-day compressive strength of conventional concrete using ANN and ANFIS models [19]. Nguyen and Dinh applied the ANFIS method to estimate the 28-day compressive strength of concrete [20]. The application of ANFIS for the prediction of dry-cast concrete strength was found in a study by Sobhani and Najimi [21] as well. Besides the applications for prediction of concrete properties, the AI-based models have been successfully applied in a variety of fields, including air quality forecasting [22], bridge desk rating [23], and wastewater treatment plant optimization [24].
In a recent study, Armaghani et al. [25] applied an ANN method to predict the shear strength of the concrete beam. The conclusion from the study revealed that the shear strength of a reinforced concrete beam with and without stirrups could be successfully estimated. In another study by Sarir et al. [26], different AI-based models, including GEP tree-based, neuroswarm, and whale optimization, were used to evaluate the bearing capacity of concrete-filled steel tube columns. More recently, a modified ANFIS model was used to predict the uniaxial compressive strength of the rock in a study by Jing et al. [27].
More and more experimental data are available along with advanced AI applications recently, which enable developing a new technique to evaluate the compressive strength of FRHSSCC. To the best of the authors’ knowledge, there has been no available research to apply the AI-based model for predicting the compressive strength of FRHSSCC. Accordingly, in this study, a series of 131 FRHSSCC experimental mix samples with the compressive strength up to 162 MPa (the percentage in the volume of the fiber in the mixes is up to four percent) were carefully selected. A trial-and-error procedure was utilized for this dataset to obtain the best AI model with the highest coefficient of determination (R2) and the lowest root mean squared error (RMSE). Additionally, the sensitivity analysis was performed for the selected model to evaluate the effects of input parameters on the output.
1.1. Data Preparation
The original experimental data were obtained from the previous study by Salari et al. [5]. Nine FRHSSCC properties, namely, fiber type (FBT), percentage in the volume of fiber (VOF), cement (CEM), water (WTR), mineral powder (MPR), water by cement ratio (WCR), fine aggregate (FIA), coarse Aggregate (COA), and the 28-day compressive strength (F28), were used. Because of conflicting information in the original data, the data need to be refined to attain reliable inputs. The following criteria were applied in the refinement procedures: (i) only mixtures with the amount of fiber in percentage were selected, (ii) mixtures with the amount of water not in kg/m3 unit were excluded, and (iii)mixtures with no information about compressive strength were not selected. In the final refinement stage, the chemical admixture column was removed from the database due to the inconsistent units between mixtures.
The refined database was a matrix with 131 rows and 9 columns. Each row in the database matrix contained nine important properties of FRHSSCC in which data from column 1 to column 8 were used as the inputs and compressive strength data in column 9 were the output. Seven types of fiber were utilized in the experimental dataset: (1) steel, (2) polyphenylene sulfide, (3) glass fiber-reinforced polymer, (4)wave steel fiber, (5) hook-ended steel fiber, (6) flat steel fiber, and (7) the combination of steel and one of the fibers. The ranges of the input and output parameters are listed in Table 1.
The classifications of the 28-day compressive strength of FRHSSCC in each specific interval are presented in Table 2. The distributions of the four considered input parameters in this study are shown in Figure 1.

(a)

(b)

(c)

(d)
2. Methods
2.1. Model Architecture
Two supervised learning AI-based models were employed in this study to predict the 28-day compressive strength of FRHSSCC. In the first approach, the artificial neural networks model was used. Unlike the conventional approaches, which chiefly rely on pure mathematical models, the Artificial Neural Networks model simulates the function of the biological nervous systems in the human brain to solve the problems. Basically, an ANN model contains a collection of simple processing units (or neurons) working independently and fully connecting to the others as a network. A neuron receives input values from neurons on the previous layers, computes an output, and sends the results to all connected neurons on the next layer.
Figure 2 depicts a schematic for a neuron. It consists of five main parts: (i) inputs or information that enters the neuron, (ii) weight or effect of an input set, (iii) sum function or a function that calculates net input that comes to a neuron, (iv) activation function or a function that processes the net input and determines the neuron output, and (v)outputs.

In the second approach, the ANFIS model was utilized. ANFIS is a combination of two machine learning methods (fuzzy logic and neural network). In an ANFIS model, the parameters of a fuzzy inference system (FIS) were tuned by using the neural network learning method. The basic ANFIS architecture with two input variables is shown in Figure 3. In this architecture, two fuzzy IF-THEN rules based on a first-order Sugeno model are presented. Rule 1: if x is A1 and y is B1, then Rule 2: if x is A2 and y is B2, then where x and y are the inputs; Ai and Bi are the fuzzy sets; fi are the outputs within the fuzzy region specified by the fuzzy rule; pi, qi, and ri are the design parameters that are determined during the training process.

2.2. Performance Criteria
The performances of the ANN and ANFIS models were evaluated based on four indicators: coefficient of determination (R2), mean squared error (MSE), root mean squared error (RMSE), and . The coefficient of determination measures the correlation between input and output variables using the following equation:where yi is the ith actual output, is the mean of the actual outputs, are the ith predicted outputs, and n is the total number of samples. MSE is the average squared difference between predicted outputs and actual outputs. MSE can be calculated using the following equation:
Root mean squared error is the square root of mean squared error and can be calculated by the following equation:
Another index, called , was also used in this study to evaluate the performance of the models. This index presents the number of samples with the difference between the predicted value and experimental value within ±20 percent [25, 28]. can be calculated by the following equation:where m20 is the number of samples with the ratio of experimental value over predicted value falling from 0.8 to 1.2 and M is the number of samples in the dataset.
2.3. Model Development
Eight FRHSSCC properties, fiber type, percentage in the volume of fiber, cement, water, and mineral powder, water by cement ratio, fine aggregate, and coarse aggregate, were selected as the input parameters of the ANN model, and the 28-day compressive strength of FRHSSCC (F28) was assigned as the output. The dataset was divided randomly into three subsets, in which 70% (i.e., 91 data samples) of the entire dataset was employed for training model, 15% (i.e., 20 data samples) for validation, and the remaining 15% (i.e., 20 data samples) was utilized for testing the prediction accuracy of the ANN model.
Neural Networks learn to map between input and output through a common learning algorithm called feed-forward backpropagation. It works by using the errors presented in the network output to adjust the weights in each layer. The error backpropagation algorithm consists of two different processes: one is a feed-forward process, and the other is a backpropagation process. In the feed-forward process, the inputs are used to obtain the outputs with some network errors. The errors are then passed backwards to the input layers through the backpropagation process, and the weights are adjusted during this process to minimize the network errors to an acceptable level.
In order to find the best learning algorithm that works for the available data, eight ANN models were developed and tested using different learning algorithms. The performances of the models were evaluated based on the values of four parameters, namely, training performance (TRN), testing performance (TES), validation performance (VAL), and the number of epochs (EPC). To obtain a reliable result, five trials were performed for each model. The best values of the performance results from those models are listed in Table 3. Apparently, the Levenberg–Marquardt (LM) algorithm (Trainlm) produces the best performance on training, testing, and validation with a low number of epochs. The results were in line with previous studies [29]. For this reason, the LM algorithm was selected for the proposed ANN model.
A sigmoid function was selected as an activation function in the proposed ANN model. The shape of the sigmoid function is depicted in Figure 4(a). Equation (5) expresses the mathematical form of the sigmoid function. This function transforms the values between 0 and 1.

(a)

(b)
According to a study by Nikbin et al. [29], a network with one hidden layer was adequate for solving most of the engineering problems. Thus, the ANN model with one hidden layer was chosen for this study. To determine the sufficient number of nodes in the hidden layer of the proposed ANN model, 20 ANN models were developed, with the number of nodes in the hidden layer varying from one node to 20 nodes. Each model was performed five times to obtain the average performance results. The performance of the ANN models was then evaluated based on the MSE values of the training, testing, and validation with the identical datasets. Figure 4(b) shows the average values of the performance results from these ANN models. It can be clearly observed that the ANN model with six neurons/nodes yielded the best results. Therefore, the selected ANN model consisted of one hidden layer with six nodes. Detailed information on the selected ANN model is listed in Table 4.
The ANFIS model was also used in this study. The numbers of input and output parameters of the ANFIS model were similar to those of the ANN model. The dataset utilized for the ANFIS model was an identical dataset that was applied for the ANN model. However, the dataset was randomly divided into two subsets in which the training data subset contains about 80% of the entire data, that is, 105 data samples, and the testing data subset containing 20% of the entire data, that is, 26 data samples. Different type of membership functions was examined, and a “gauss2mf” function was selected for constructing the ANFIS structure. In the training process, the back-propaganda optimization method was applied with a zero value of error tolerance. The number of epochs/iterations for both training and testing stages was designated as 100. The detailed structure of the ANFIS model is presented in Table 5.
3. Results and Discussion
There were two phases in this study. In the first phase, the performances of the ANN and ANFIS models were evaluated. All seven inputs were employed for the ANN and ANFIS models to predict the 28-day compressive strength of FRHSSCC. The model with better performance was selected for use in the next phase of this study. In the second phase, the sensitivity analysis was conducted using the selected model. The main objective of the sensitivity analysis was to identify the significance of the inputs that influence the outcomes of the model.
3.1. Performance of the ANN Model
As mentioned above, seven FRHSSCC properties were used as the inputs for the ANN model. The output was the 28-day compressive strength of FRHSSCC. Several trials were performed to determine an optimal number of the hidden layers as well as the number of neurons inside the ANN structure. One hidden layer with six neurons was found to be adequate for this ANN model. The test results from the proposed ANN model in different stages are presented in Table 6. The value of R2 presents the ability of a model to predict the outputs based on the inputs. Values closer to 1 indicate a strong ability of the model in prediction. Performances of the ANN model for the training and validation stage are presented in Figure 5.

(a)

(b)
The ANN model performed well in training, validation, and testing with values of R2 being 0.9755, 0.9851, and 0.9438, respectively. Overall R2 was 0.9742, indicating a good prediction ability of the proposed ANN model. It is worth noting that the validation and testing datasets were unseen in the ANN model. However, R2 produced by the ANN model in both validation and testing sets showed a minimal difference compared to R2 calculated for the training dataset. That means the proposed ANN model is able to generalize and can be employed as a tool to predict the compressive strength of FRHSSCC.
Besides R2, another indicator, RMSE, can also be employed to evaluate the performance of the ANN model. The RMSE values between predicted compressive strength of FRHSSCC and the experimental results are listed in Table 6. The smaller the RMSE value is, the stronger the linear relationship between variables is. As can be seen, the smallest RMSE value of 6.70 MPa was recorded for the validation dataset. The value of RMSE was found to be higher for the new/unseen dataset, which was 8.71 MPa for the testing dataset.
In order to have a clear overall view of the performance of the proposed ANN model, the predicted values of 28-day compressive strength and its experimental results for the entire specimens are plotted in Figure 6(a). In addition, the absolute prediction errors in terms of percentage for all specimens/mixtures in the dataset are depicted in Figure 6(b).

(a)

(b)
As can be seen in Figure 6(a), the proposed ANN model estimated the 28-day compressive strength of FRHSSCC at a high level of accuracy. The absolute mean value of prediction errors was only about 5.76 percent, as presented in Figure 6(b). The results showed that the output values could be achieved from the input parameters. Several abnormal absolute errors were found in specimen numbers 29, 54, 84, 89, and 126, as seen in Figure 6(b). The possible explanation for this might be due to the limited number of samples in the original dataset. As a result, the data were not sufficient to represent all aspects of the input variables. This issue may resolve when a dataset with a large number of samples was applied.
The linear regression plot was used in this study to present the results from the proposed ANN model. The plots for the performance of the proposed ANN model at different stages, namely, training, validation, testing, and overall, are shown, respectively, in Figures 7(a)–7(d). In these figures, the linear fitting line presents the relationship between the experimental results and the predicted values produced from the model. In addition, the “x = y” line shows a perfect correlation between inputs and outputs.

(a)

(b)

(c)

(d)
3.2. Performance of the ANFIS Model
As previously discussed, the inputs and output for this ANFIS model were identified as used for the ANN model. The same dataset was employed for the ANFIS model; however, the dataset in this case was only divided into training and testing sets. Figure 8 shows the accumulated root mean squared error of the ANFIS model for both training and testing datasets. The errors reduced significantly in the first 50 epochs, and the minimum errors were obtained at the epoch of 100. The detailed performance results of the ANFIS model are listed in Table 7.

(a)

(b)
As can be seen in Table 7, the proposed ANFIS model performed well for the training dataset with values of R2 and RMSE being 0.9624 and 9.39, respectively. However, with the unseen dataset in the testing process, the value of R2 was slightly smaller, which was 0.9518. It is worth noting that the overall performance of this ANFIS model was reasonable, with a value of R2 being 0.9584. However, compared with the overall performance of the ANN model in the previous section, the ANFIS model showed a more deficient prediction capacity.
Figure 9(a) shows the plots of experimental compressive strength and the predicted values produced by the proposed ANFIS model. The absolute prediction errors in terms of percentage for the entire dataset can be seen in Figure 9(b). Although the average mean value of the prediction errors was at an acceptable level, at around seven percent, this model showed a low accuracy in prediction for unseen data. The majority of errors were found in the testing phase. Some predicted values experienced an error as large as 52 percent, as seen in Figure 9(b).

(a)

(b)
Figure 10(a) presents the linear regression plot of the results produced by the proposed ANFIS model for the training dataset. The linear regression plot for the overall performance can be seen in Figure 10(b).

(a)

(b)
The ANFIS model was also applied to examine the effects of the percentage of fiber and other input parameters on the 28-day compressive strength of FRHSSCC. The surface plots of the relationship between the VOF parameter and the four selected inputs are presented in Figure 11.

(a)

(b)

(c)

(d)
3.3. Model Selection
Results of the performance of the ANN and ANFIS models for the unseen dataset from the previous sections indicated that the ANN model had a higher prediction ability compared to that of the ANFIS model. Thus, within the context of this research, it was concluded that the performance of the ANN model in terms of R2 for both testing data and overall was better than that of the ANFIS model. This conclusion is in line with a result of the previous study [18]. For that reason, the ANN model was selected to perform the sensitivity analysis in the subsequent sections.
3.4. Sensitivity Analysis
The sensitivity analysis was conducted for the selected ANN model to evaluate the influence of the inputs on the 28-day compressive strength of FRHSSCC. In order to perform the sensitivity analysis, each input parameter was divided into five groups, namely, Lowest (LO), Mid-Low (ML), Middle (MD), Mid-High (MH), and Highest (HI). MD is the mean value of the LO and HI. ML and MH represent halfway from LO to MD and from MD to HI, respectively. Detailed values of these input parameters are listed in Table 8.
The sensitivity analysis was performed for each input parameter by changing its value from LO to HI while keeping the other inputs constant at the average values. The results of the sensitivity analysis for different input parameters are presented in Figure 12. It can be seen clearly that the 28-day FRHSSCC compressive strength was most sensitive to the water/cement ratio (WCR) and the amount of water content (WTR). This is reasonable since WCR and WTR are two main factors that govern the compressive strength of concrete. It is interesting to note that the amount of fiber had a minimal effect on the compressive strength of FRHSSCC. In other words, an increase in the amount of fiber from the minimum value found in this experimental dataset would have a limited effect on the 28-day compressive strength of FRHSSCC.

4. Conclusions
In this research, the ANN and ANFIS models were developed to predict the 28-day compressive strength of fiber-reinforced high-strength self-compacting concrete from 131 experimental mixtures. A series of ANN models were developed with different training algorithms and the number of neurons in the hidden layer to acquire the optimal model. The ANN model, with the Levenberg–Marquardt training algorithm and six nodes in the hidden layer, was found to work best with the experimental FRHSSCC data. In addition, both ANN and ANFIS models could predict the 28-day compressive strength at a high level of accuracy. In terms of statistical parameter, R2, it is observed that the ANN model performed better than the ANFIS model.
The RMSE values of the ANN model for training, testing, and overall were 7.33 MPa, 8.71 MPa, and 7.47 MPa, respectively. The absolute mean error between the actual FRHSSCC compressive strength and the predicted values was approximately six percent. With respect to the ANFIS model, the values of RMSE were 9.39 MPa for the training phase and 9.46 MPa for overall performance. These models can be employed as a reliable tool for selecting the optimized mix design for FRHSSCC. It is worth noting that the small number of test samples in the experimental input data is a limitation of this study. The performance of the proposed models would improve with an increase in the amount of input data. Finally, the sensitivity analysis results with the ANN model revealed that the 28-day compressive strength of FRHSSCC was more sensitive with the changes of water by cement ratio (WCR) parameter and less sensitive with varying amounts of fiber (VOF).
Data Availability
The data used to support the findings of this study are included in the article.
Conflicts of Interest
The authors declare no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Acknowledgments
This research was funded by the National University of Civil Engineering (NUCE), Hanoi, Vietnam (Grant no. 29-2019/KHXD-TĐ).