Abstract

Bridge engineering is an important component of the transportation system, and early warnings of construction safety risks are crucial for bridge engineering construction safety. To solve the challenges faced by early warnings risk and the low early warning accuracy in bridge construction safety, this study proposed a new early-warning model for bridge construction safety risk. The proposed model integrates a rough set (RS), the sparrow search algorithm (SSA), and the least squares support vector machine (LSSVM). In particular, the initial early warning factors of bridge construction safety risk from five factors (men, machines, methods, materials, and environment) were selected, and the RS was used to reduce the attributes of 20 initial early warning factors to obtain the optimized early warning factor set. This overcame the problem of multiple early warning factors and reduced the complexity of the subsequent prediction model. Then, the LSSVM with the strongest nonlinear modelling ability was selected to build the bridge construction early-warning model and adopted the SSA to optimize the LSSVM parameter combination, improving the early warning accuracy. The Longlingshan Project in Wuhan and the Shihe Bridge Project in Xinyang, China, were then selected as case studies for empirical research. Results demonstrated a significant improvement in the performance of the early-warning model following the removal of redundancy or interference factors via the RS. Compared with the standard LSSVM, Back Propagation Neural Network and other traditional early-warning models, the proposed model exhibited higher computational efficiency and a better early warning performance. The research presented in this article has important theoretical and practical significance for the improvement of the early warning management of bridge construction safety risks.

1. Introduction

Developing countries such as China are currently implementing large-scale bridge projects [1]. However, these projects require large project investment, complex technology, and extensive contents and are typically established in poor construction environments. This results inextremely high construction safety, with an enhanced probability of serious accidents. In recent years, bridge construction safety accidents (e.g., the impact of the cofferdam, the collapse of floating cranes, and flooding) have been a common occurrence [2]. The early warning of construction safety risk generally involves the monitoring, evaluation, and prediction of factors corresponding to construction safety risk, with the aims of predicting future risks, determining the potential time range of risks, and measuring the strength of risks and their damage degree. Such a system aids decision-makers in taking appropriate risk control measures [3]. Therefore, the systematic identification, estimation, and early warning control of bridge construction safety risks can effectively reduce construction safety risks and achieve the management goals of bridges while complying with safety regulations.

Numerous scholars have performed in-depth studies on the early warning of risk in many fields. To effectively reduce the financial risks of non-life-insurance enterprises, Yan et al. [4] constructed a financial risk early-warning model. Zhang [5] developed a safety risk early-warning model of the food industry chain, revealing that introducing early warning theory into the field of food safety risk can effectively improve the level of risk management. In order to reduce the loss of ship collision, Cheng et al. [6] introduced risk early warning theory into ship collision risk management. Based on climatology, disaster science, and environmental science, Zhang et al. [7] determined that the risk early warning theory should be introduced into the early management of the drought disaster risk of crops such as corn. Sattlee et al. [8] integrated risk early warning theory into the reliability analysis of geological disasters to effectively reduce the losses caused by landslides and falling rocks. Great achievements have been made in the theory of early warning risk, but the research results on early warning of bridge construction safety risk are rarely reported. The early warning of risk denotes the prediction or classification of the target value of research objects. Scholars have adopted various research methods to build risk early-warning models across different fields of research. Ding [9] comprehensively combined the analytic hierarchy process (AHP) and the fuzzy comprehensive evaluation method to construct the risk early-warning model of financial enterprises. However, the fuzzy comprehensive evaluation is a linear weighted evaluation method and cannot effectively reflect the elevated influence of all evaluation indicators. Nonlinear characteristics are also unable to meet the requirements of practical evaluations. The AHP is associated with several disadvantages, such as strong subjectivity and the sensitivity to extreme expert opinions. Based on the strong self-learning ability and nonlinear processing of artificial neural networks (ANNs), Yan et al. [10] constructed an early-warning model of human resource management risk based on the back propagation neural network (BPNN). However, the application of the BPNN in early warning risk research results in several shortcomings, including overfitting, slow convergence, and easy to fall into a local minimum. Wang et al. [11] employed the grey model (GM) to construct an early warning safety risk model of a railway service system. Although the GM is simple to operate, it requires that variables satisfy the multivariate normal distribution, which is difficult to meet in practical applications. Chen and Zhang [12] used the logistic regression model to construct a logistics-based early warning risk management system for the default risk of cultural creative crowdfunding projects. However, the calculation of the logistic regression model is approximate and thus has several shortcomings, such as complex calculations and low prediction accuracy.

The least square support vector machine (LSSVM) inherits the structural risk concept and kernel mapping concept of the standard support vector machine (SVM). Starting from the loss function of machine learning, two norms are used in the objective function of the optimization problem in the LSSVM, and equality constraint is used instead of inequality constraint in SVM standard algorithm, which makes the solution of optimization problem of LSSVM method become a set of linear equations obtained by Kuhn–Tucker condition. The LSSVM trains the SVM by solving the transformed linear equations, which greatly improves the training efficiency of the SVM [13]. In recent years, the LSSVM has been widely used in data prediction, data classification, and other research fields. Zhao et al. [13] used the LSSVM to effectively diagnose aircraft engine faults. Ahmadi et al. [14] successfully simulated the vaporization enthalpies of pure hydrocarbons and petroleum fractions via the LSSVM. Statistical results determined the LSSVM-predicted average relative deviation and of the vaporization enthalpies as 0.51% and 0.9998, respectively, indicating the high prediction accuracy of the LSSVM. To improve the prediction accuracy of the cotton fabric K/S value, Yu et al. [15] combined particle swarm optimization (PSO) with the LSSVM to construct a new prediction model. In order to overcome the low early warning accuracy, in the current study, the LSSVM is employed to build an early-warning model of bridge construction safety risk.

Despite its strong robustness and generalization ability, the LSSVM cannot simplify the dimensions of information space. For high dimensions or large training samples, the LSSVM often faces problems, such as dimension disaster, and an increase in time consumption due to the limited memory capacity or complex network structure. The rough set (RS) does not require any prior knowledge and removes redundant data without affecting the classification accuracy. It is widely used in the fields of attribute sets [16] and key indicator screening. Introducing the RS into the LSSVM can effectively determine key attributes and reduce the adverse effects of redundancy and multicollinearity among various input variables on prediction accuracy. Therefore, when developing the proposed early-warning model based on the LSSVM, the RS is introduced to solve the problem of multiple early warning factors.

As a new machine learning method, the prediction accuracy and computational performance of the LSSVM depend on the reasonable selection of regularization and kernel width parameters. In the development of a landslide displacement prediction model for rainfall, Zhu et al. [17] employed the genetic algorithm (GA) to determine the LSSVM optimal parameters. However, the GA is associated with several bottlenecks, including complex coding, a slow calculation speed, and easy premature convergence [18]. To improve the accuracy of the LSSVM in predicting concrete strength, Xue [19] optimized the calculation parameters via PSO. However, the PSO is also prone to premature convergence (particularly when dealing with complex multimodal search problems) and has a poor local optimization ability [14].

The sparrow search algorithm (SSA) is a new swarm intelligence optimization approach that was inspired by the foraging and antipredation behaviours of sparrows [20]. When predicting the deboning strain of fibre reinforced polymer reinforced concrete, Li et al. [21] employed the SSA to optimize the initial weight and threshold of the BPNN. Empirical results revealed that the SSA-optimized BPNN surpassed the traditional version in terms of prediction accuracy and robustness. Liu and Rodriguez [22] used the SSA to accurately solve the problem of sustainable energy optimization in residential engineering, a complex and multiobjective nonlinear optimization problem. Wumaier et al. [23] adopted the SSA to optimize the SVM parameter combination, and based on wind turbine fault diagnosis data, the SSA-SVM was clearly superior to GA-SVM and PSO-SVM in terms of computing performance. Therefore, the SSA in the proposed method was adopted to determine the LSSVM optimal parameter for efficient computing performance.

Based on the above analysis and review of the relevant literature, a hybrid early-warning model of bridge construction safety risk was developed based on the RS, SSA, and LSSVM. The contributions and innovations of this article are as follows. (1) At present, researches on the early warning of risk mainly focuses on the financial risk, the food safety, the disaster risk, or other fields. In this study, early warning for the construction safety risk of bridge projects was studied, and a detailed case study was made. This provided new insights for early warning and management of bridge construction project. (2) From five aspects (men, machines, materials, methods, and environment), the index system of early warning for construction safety risk of bridge projects was constructed completely, which provided reference and foundation for similar researches. (3) In this study, a novel early-warning model based on the RS, SSA, and LSSVM was constructed, which not only effectively solved the problem of multiple risk early warning factors but also overcame the problem of low precision of traditional early warning methods.

The remaining sections of this article are arranged as follows. In Section 2, the early warning indicator system of bridge construction safety risk and the early-warning model are constructed. In Section 3, the Longlingshan Bridge Project in Wuhan and the Shihe Bridge Project in Xinyang, China, are used to perform the empirical research. Section 4 analyses the computational performance of the early-warning model proposed in this study, whereas Section 5 summarizes the main conclusions and limitations of this study.

2. Materials and Methods

2.1. Establishment of an Early-Warning Indicator System for Bridge Construction Safety Risk
2.1.1. Preliminary Selection of Early-Warning Indicators for Bridge Construction Safety Risks

Risk early warning mainly includes three parts: identification, division, and prediction of risk levels. The early-warning management system of construction safety risk in a bridge project plays a key role in scientifically creating an early warning indicator system for safety risk. Referring to the underlying reasons of risk loss during bridge construction, the early warning indicators of bridge construction safety risk was screened from five aspects (men, machines, materials, methods and environment; 4M1E) [24, 25], as described in the following.

(1) Men. This includes all participants in the bridge construction process. The construction and management of bridge engineering is generally performed by employees, and thus, their safety has a great influence on bridge construction safety. The illegal operation of workers is a common instigator of construction safety accidents [2]. In general, the more workers violating regulations, the greater the possibility of construction safety accidents and the more serious the consequences. Therefore, the more skilled workers, the lower the safety risks. The safety skills of project managers also play a key role in the daily risk management of construction safety. The stronger the safety skills of project managers, the more effective the daily risk management is. When safety accidents occur, project managers with a good emergency handling ability can effectively reduce secondary accidents and losses caused by safety accidents.

(2) Machines. This includes all machinery and equipment used in the construction project. The safety state of machinery in bridge construction projects is a key influencing factor of construction safety. Mechanical quality, mechanical installation, mechanical operation, and mechanical operation failure are generally the most important factors affecting the safety status of objects in bridge construction [3].

(3) Materials. This includes all materials employed in the bridge construction process. The quality and management problems of materials will consequently result in the quality and safety problems of the whole bridge construction. The most important building materials for bridge construction are concrete and steel, which are the main stress parts of the bridge structure, and they have obvious influence on the structural performance. When the structural performance of bridges is insufficient, safety accidents such as local collapse are easy to occur. Therefore, the qualification rate of these two building materials has a great influence on the construction safety and engineering quality. In addition, related management problems (e.g., stacking and random inspection of materials) may also result in bridge construction safety accidents [2, 24].

(4) Methods. This denotes the overall management process of bridge construction. Method indicators related to construction safety include all rules and regulations, technical requirements, organization and management, and construction methods. Combined with the construction technology and characteristics of bridge engineering, the most important indicators are the monitoring and calculation methods, as well as technical disclosure. Furthermore, a large number of new technologies and processes are often used in bridge construction; thus, the proportion of new construction schemes also has a significant impact on construction safety [25]. In general, the greater the proportion of new construction schemes and the less mature the technology, the greater the risk of construction safety.

(5) Environment. This denotes the natural environment and social environment during bridge construction. The characteristics of bridge construction sites and technology and economy lead to a complex construction environment, and there are many natural and social factors that affect bridge construction. Natural disasters, such as earthquakes, typhoons, and rainstorms, will seriously affect construction safety. In bridge construction, the wind has a great influence on bridge construction safety [3]. The social environment of bridge construction mainly refers to the sustainability and socioeconomic stability of engineering projects.

Table 1 provides details of the five early warning indicators. Here, the qualitative indicator refers to the data obtained by expert investigations, whereas the quantitative indicator is obtained by field investigations, consulting design data, or calculations based on national standards. The benefit indicator increases with the bridge construction safety risk level, whereas the opposite is true for the cost indicator.

2.1.2. Threshold Value of Early Warning Bridge Construction Safety Risk

The early-warning threshold of bridge construction safety risk is divided into a general indicator early-warning threshold and a subindicator early-warning threshold. In the actual construction process, rapidly obtaining quantitative data of risk indicators proves to be a difficult task, particularly as indicators related to early warnings still rely on the observation and subjective judgment of site managers. Therefore, a qualitative and quantitative combination, with a qualitative expert determination, method was adopted to judge the warning situation of early warning indicators. Table 2 reports the division of each indicator and the corresponding descriptions.

The warning degree division of each secondary indicator comprehensively considers the related specifications, such as the Load Code for the Building Structure Design (GB 50009-2012), the Specifications for Long-Span Highway Bridge Construction Surveys (JTG/T 365-02-2019), the Safety Technical Specifications for Highway Engineering Construction, (JTG F90-2015), the Specifications for Bridge Design Wind Speed Calculations (QX/T 438-2018), the Design Code of the Municipal Bridge (CJJ 11-2011), and the results of previous research of early warning systems [26].

2.2. Introduction to the RS

The RS, a data analysis theory proposed by Polish mathematician Pawlak, is able to deal with the uncertainty, incompleteness, and incompatibility between information and knowledge. The RS-based attribute reduction process involves deleting unimportant or irrelevant redundant attributes while maintaining the classification or decision-making ability of information systems unchanged and deducing the classification or decision-making of the problem to be solved [16, 17]. Therefore, theoretically speaking, applying the RS to the intensive prewarning indicators of bridge construction safety risks can reduce the unimportant indicators while maintaining the prewarning ability, thus improving the calculation accuracy of the subsequent risk prewarning model.

In the RS, information systems can be represented by 4 tuples [16]:where is the domain of discourse; is a finite set of non-empty attributes, , is a conditional attribute set, and is a decision attribute set; is the set of attribute values, and is the range of attribute . Function is used to assign corresponding attribute values to each object attribute in theory.

The support degree of conditional attribute C to decision attribute D is an important concept of rough set theory [16]:where is denoted as the positive domain of about , which describes all the element sets in that can be exactly classified into the class according to the knowledge of , and indicates the ratio of objects that can be classified into the decision class under conditional attribute and expresses the degree of support of the conditional attribute to the decision attribute.

Assume that there exists and in information system , if , then it is said that is a necessary attribute in and must be kept; otherwise, is an unnecessary attribute in and can be deleted [16].

In information system , and are independent and satisfy . is denoted as the reduction of , , and the set of essential attributes in forms the core of , . If is independent, it is considered as the minimum set required to maintain the classification ability of universe [16].

Although the mathematical theory of RS is very complex, the development of Rosetta, which is a computing software integrating rough set principle, facilitates data set classifications and minimum set searches.

2.3. Introduction to the SSA

The SSA is a novel swarm intelligence optimization algorithm [20]. Compared with classical algorithms, such as the PSO, it exhibits a superior computing performance due to the addition of discoverer and early warnings [21, 22].

At each SSA, iteration the location update of the discoverer is described as follows [20]:where represents the current iteration number, is a constant indicating the maximum iteration number, represents the position information of the -th sparrow in the -th dimension, is a random number, and represent the warning and safety values, respectively, is a random number that obeys a normal distribution, represents a matrix, and is the dimensions of the problem to be optimized.

During each iteration, the position update of the enrollee is described as follows [20]:where is the best current position occupied by the discoverer, is the worst position in the world at present, and is a matrix, in which each element is randomly assigned 1 or −1.

When aware of the danger, the sparrow population will exhibit antipredation behaviour, which is described as follows [20]:where is the current global optimal position, is the step control parameter, is a random number, is the fitness value of the current sparrows, and are the best and worst fitness values in the world, respectively, and is the smallest constant required to avoid a zero denominator.

Compared with other classical meta-heuristic optimization algorithms, the SSA algorithm possesses advantages, including good stability, strong global search ability, and minimal parameters. The bionics principle of the SSA algorithm simulates the foraging process of sparrows, which is similar to other classical meta-heuristic optimization algorithms and is based on “discoverer-follower” optimization. However, the SSA algorithm further simulates the detection and warning behaviour of sparrows during the foraging process. Not all sparrows are constantly approaching the current optimal solution in each iteration, yet some sparrows will experience detection and early warning. This mechanism enhances the global search ability and faster optimization speed of the SSA algorithm.

2.4. Introduction to the LSSVM

Assume that there is a training set , where , , is the input space dimension, and is the number of training samples. is projected into feature space using nonlinear mapping and constructed in following the structural risk minimization criterion. This process can be described as the following constrained optimization problem [13]:where is the compromise coefficient between the empirical risk and confidence interval, that is, the regularization parameter, is the uncorrelated random error, is the weight variable of the prediction function, is the nonlinear mapping function, and b is the deviation coefficient.

This constrained optimization problem can be transformed into the following form by the Lagrange method [13]:where is a Lagrange multiplier.

Using the Karush–Kuhn–Tucker condition, equation (7) can be transformed into equality form, allowing us to obtain the linear model [13] by solving and via the least square method.where is the kernel function.

The radial basis function (RBF) is selected as the kernel function, namely, , where is the kernel width parameter reflecting the distribution characteristics of the training samples. Because the RBF kernel function satisfies the Mercer condition, calculating the kernel function in the original input space can replace the inner product operation in the mapping feature space.

To ensure the generalization performance of the LSSVM, it is necessary to optimize the regularization and kernel width parameters to determine the optimal parameter combination. At present, the K-fold calculation method or meta-heuristic optimization algorithm are commonly used to determine the optimal calculation parameters.

2.5. Implementation of the Early-Warning Model

The core concepts of the proposed early-warning model are described as follows: (1) The RS is used as the preprocessing system of the LSSVM, which preprocesses the safety risk data of the bridge construction and eliminates the redundant attributes and conflict attributes, thus simplifying the early-warning model structure of the LSSVM, shortening the training and improving the classification performance. (2) The LSSVM model is optimized via the global optimization ability of the SSA, and the parameter combination with the highest calculation accuracy is obtained to construct the optimal early-warning model of bridge construction safety risk. Figure 1 presents the architecture diagram of the proposed early-warning model.

In particular, the key steps of the proposed early-warning model for bridge construction safety risk are described in the following.Step 1. Training and test sets are built, and the research object is determined. The original engineering data are collected based on the bridge construction safety risk early warning indicator system detailed in Section 2.1. Some samples are randomly selected from the original data as training sets and the remaining data are taken as the test sets.Step 2. Data preprocessing and attribute set. The original engineering data are normalized as the database of early warning research. On the premise of guaranteeing the original classification ability, the redundant or conflicting attributes in the decision table are eliminated via a rough set theory and the minimum conditional attribute set is obtained.Step 3. The early-warning model based on the LSSVM is trained and optimized. In particular, the early-warning model and kernel function parameters are initialized, the training sample set is input into the model, and the SSA algorithm is implemented to optimize the calculation parameters. Note that the iterative optimization calculation only occurs in the SSA, not in the LSSVM. The optimal calculation parameters and engineering data are input into the LSSVM to determine the early warning results.Step 4. Early warning identification. The test sets are input into the optimal LSSVM model for early warning identification to obtain the early warning results of bridge construction safety risks.

It should be emphasized that although the three algorithms (RS, SSA, and LSSVM) worked almost independently, these three algorithms belonged to the research framework of bridge construction safety risk early warning. The RS was to solve the problem of many factors affecting early warning and also to simplify the subsequent prediction model. LSSVM optimized by SSA was used to solve the problem of low prediction accuracy. Or it could be considered that both RS and SSA were used to optimize the LSSVM model, in which RS was used to optimize the input variables of LSSVM, and the SSA was used to optimize the optimal calculation parameters of LSSVM.

3. Case Study

3.1. Introduction of the Case Study

The Longlingshan Bridge Project in Wuhan and the Shihe Bridge Project in Xinyang, China were selected as the case studies.

The Longlingshan Bridge Project adopts four two-way lanes, with a standard width of 18.5 m and a total length of 1678.5 m. Its structural form is a restressed concrete continuous beam. The project is located in Wuhan, Hubei Province, China. There are almost no earthquakes, yet heavy rains often occur. Limestone karst caves are located around the project and are generally 0.3–8.1 m high. The main construction technology of this pile cap is summarized as follows: measure and support the position of the lofting steel sheet, construct the purlin (which also serves as the guide beam), insert the steel sheet pile to the design elevation, and excavate the foundation pit to pour in cushion concrete. Cranes are used as vertical transportation tools during the pier construction process, while scaffolding is set up as an operation platform. The concrete is commercial concrete, and piers lower (higher) than 11 m are cast by day pump once (twice). The basic construction steps of the concrete box girder are the erection of the supports, the one-time pouring, and overall tensioning. According to the site construction conditions, there are two forms of box girder support: (1) the floor-type bracket with full buckle and (2) the combined bracket (smaller steel pipe bracket for the door opening buckle bracket).

The Shihe Bridge Project in Xinyang is located in Xinyang City, Henan Province, China, with a total length of 203 m. It is a typical thrust concrete-filled steel tube arch bridge with a deck width of 37.5 m and six lanes in both directions. The construction procedures of the concrete-filled steel tubular arch bridge typically include pouring, reinforced, concrete, V-shaped piers and box girders, closing hollow steel tubular arch ribs, installing and tensioning tie bars, pouring concrete into arch ribs, installing and tensioning suspenders, hoisting bridge deck beams and longitudinal beams, laying prefabricated hollow slabs on the bridge deck, pouring bridge deck concrete, installing anticollision walls, paving bridge deck asphalt concrete, and so on. The construction of multispan, continuous, concrete-filled, steel, tubular arch bridges generally adopts the construction sequence of flowing water from the side span to the middle span. The subsequent construction process can be carried out once the entire bridge is completed. The advantage of this construction scheme is that the uniform loading between spans is realized to the maximum extent, thus effectively ensuring the arch rib line type and increasing the uniformity of the V-shaped buttress and arch.

3.2. Data Acquisition and Preprocessing
3.2.1. Data Acquisition and Reliability Analysis

The Longlingshan Bridge and Shihe Bridge Projects were constructed in 700 and 360 days, respectively. 1,060 groups of data from each day during the construction phase of these two projects were collected. Incomplete and questionable data sets were eliminated, and a total of 372 data sets were obtained; 300 (80.64%) of which were randomly selected as training sets, and the remaining 72 (19.36%) as test sets. Table 3 presents an example of the raw data.

Obtaining engineering data has always been one of the main difficulties in the engineering field. According to different engineering data sources or data acquisition methods, the 20 secondary indicators were roughly divided into the following three categories.(1)The index data of A51 (Disaster frequency) and A52 (Wind load grade) were obtained by consulting local yearbooks or design data.Taking Long Lingshan Bridge Project in Wuhan as an example, this study illustrated the data acquisition processes of A51 and A52. According to the statistics of Wuhan Yearbook (http://tjj.wuhan.gov.cn/tjfw/tjnj/) published by Wuhan Municipal Government, during the period from 2001 to 2020, there were 47 natural disasters in the urban area of Wuhan, so the score of A51 was 2.35, and 2.35 natural disasters occurred every year on average. The data of A52 was the wind forecast data released by Wuhan Meteorological Bureau (http://hubei.weather.com.cn/wuhan/index.shtml).(2)Data of A11 (Rate of operation violation), A12 (Rate of improper emergency handling), A13 (Acceptance rate of worker technology), A21 (Unqualified rate of mechanical quality), A22 (Failure rate of mechanical installation), A23 (Rate of Mechanical operation error), A24 (Rate of mechanical failure), A31 (Qualified rate of concrete), A32 (Qualified rate of steel), A43 (Rate of technical disclosure), and A44 (Proportion of new construction scheme) were obtained by field investigation or field investigation combined with standard calculation. Taking Long Lingshan Bridge Project in Wuhan as an example, this study illustrated the data acquisition process of A11 and A31. Project managers randomly checked the operation of workers in daily inspection. If there were 53 unqualified workers' operations in 125 random checks, the score of A11 should be 0.424 (). Project managers randomly checked the quality of concrete. If 2453 of the 2500 selected times were qualified, the score of A31 should be 98.12 (). The data of other indicators were also calculated according to the daily management data of the project.(3)Data of A14 (Safety skills of managers), A33 (Rationality of material stacking), A34 (Sampling inspection of materials), A41 (Rationality of monitoring method), A42 (Advancement of calculation method), A53 (Sustainability of engineering project), and A54 (Social and economic sustainability) were obtained via questionnaires. According to their rich engineering experience, the experts graded the qualitative indicators with reference to Table 2. The average score of 20 experts was taken as the final score of the input variable. To ensure the validity of the expert scoring data, the invited 20 experts had more than 15 years of work experience and were senior engineers. The reliability of the qualitative indicator data is evaluated by determined the Cronbachs’ α coefficient (Table 4).

The Cronbachs’α coefficients of all qualitative indicators exceed 0.7, proving the high reliability of the questionnaire survey results [27]. The acquisition methods of other indicator data are based on field statistics, standard calculations, and official statistics; thus, their reliability is well guaranteed.

3.2.2. Data Preprocessing and Attribute Intensive

To eliminate the influence of dimensional and data-level differences of early warning indicators on the performance of early-warning models, the data are preprocessed by the normalization methods in equations (9) and (10) for indicators with beneficial type or cost type, respectively:where and are the -th indicator value of the -th indicator and its normalized value, and and are the minimum and maximum values of the -th indicator value, respectively.

Linear correlations may be observed among the 20 secondary indicators. These redundant or interfered secondary indicators are likely to affect the prediction accuracy of the early-warning model. Therefore, this study adopted an attribute reduction algorithm based on the RS. More specifically, redundant irrelevant or unimportant attributes were deleted, and warning indicators that were sensitive to the water state of bridge construction safety risks were identified.

The preprocessed 372 data sets were imported into Rosetta, and the attribute reduction was performed via Johnson’s algorithm. Table 5 reports the attribute reduction results, namely, the most important indicators based on the RS analysis.

The eight secondary indicators in Table 5 were also identified as the most important indicators affecting the construction safety risks in previous research [24, 25]. This acts as a validation of our results.

As the RS is a data processing tool, the final RS-based screening of the indicator system may differ for different engineering data sets. Thus, the final early warning indicator system in Table 5 is only applicable to the two engineering cases selected in this study.

3.3. Early Warning of Bridge Construction Safety Risks
3.3.1. Optimization of LSSVM Parameters Based on the SSA

The SSA specifications are as follows [22, 23]: population size of 100, maximum iteration number of 200, safety threshold of 0.7, discoverers account equal to 30% of the population size, the maximum permissible error is 0.00001, and 10 sparrows aware of danger. Calculations were performed in Matlab 2016a (MathWorks) on a computer with the Intel i7 processor configured at 3.40 GHz and a memory of 31 GB. In the LSSVM model, the range of regularization parameters was limited to [0.01, 1000], and the range of the kernel function width coefficient was limited to [0.001,10000].

In the iterative optimization process of the SSA, the average value of four warning errors was selected as the fitness function of the SSA. The lower the average value of early warning errors, the lower the function value of fitness function and the better parameter combination.

In order to optimize the LSSVM model parameters for the early warning of the bridge construction safety risks, 170 sets of preprocessed training data were input into the calculation program in Matlab. Figure 2 depicts the adaptive function curve of the SSA. In the enlarged view of Figure 2, the filled rectangular points represented the points with the minimal fitness function.

The fitness function of the SSA is observed to decline rapidly from 44% to approximately 3% at the initial stage. Table 6 tracks the optimization calculation process of the SSA algorithm.

The SSA is observed to reach convergence in the 51th generation, converging to 3.657321%. Figure 2 and Table 6 reveal the ability of the SSA to effectively optimize the LSSVM model parameters in the 51th generation. Following the optimization, the optimal parameter combination is determined with kernel function and penalty factor parameters equal to 13.42 and 3.24, respectively.

3.3.2. Early Warning Calculations

The kernel function and penalty factor parameters (13.42 and 3.24, respectively) were input into the LSSVM model to construct the optimized early-warning model. Table 7 reports the results of the 72 preprocessed sets of test data imported into the optimized LSSVM model.

The parameters results reveal the high calculation accuracy of the proposed prediction model, with a very low misjudgement ratio for different warning degrees. The highest proportion of prediction errors occurred in No (I), which fully met the needs of the engineering practices. For the moderate (III) and severe (IV) cases, two important warning degrees, the proposed prediction model was completely correct. Thus, the proposed model has strong potential in engineering applications.

The proposed model is basedon data; thus, the statistical analysis of the prediction results is required, and the calculations were repeated 1,000 times. Table 8 reports the average of the 1,000 calculations.

The proposed prediction model exhibits a high calculation accuracy for the 1,000 repeated calculations, with extremely low misjudgement ratios for different alarm levels. The highest proportion of prediction errors equals just 1.97%, occurring for no (I). This low error rate fully meets the needs of engineering practices. The high calculation accuracies for the moderate (III) and severe (IV) levels also fully meet the needs of engineering practices. These results further demonstrate the high accuracy of the proposed model and its corresponding potential applicability in the field of engineering.

To further analyse the calculation accuracy of different prediction methods, this study calculated the average value of prediction errors. The calculation results showed that the average prediction error of the prediction model proposed in this study was only 1.025%, which was the lowest among all the results.

Computational time and stability are additional key indicators of the computational ability of prediction models. Table 9 reports the average calculation time of 1,000 calculations and the standard deviation of the prediction results.

The running time of our prediction model is observed to be just 3.241 s. Furthermore, the standard deviation of the prediction model results is very low, indicating the high stability of the model.

For this case study, the data structure characteristics of randomly selected training sets may exert a strong influence on the prediction performance of the model. We thus repeated the above calculation 10 times, with the training set randomly selected each time (Table 10).

The difference of the 10 repeated calculations is observed to be minimal, indicating the limited influence of the data structure characteristics from the randomly selected training sets of the case study on the model prediction performance. The maximum prediction error is 2.12%, fully meeting the needs of engineering practices.

4. Discussion

4.1. Comparative Analysis of the Attribute Intensive Calculation Results

Correlation analysis [28] and the mean impact value (MIV) [29] are commonly employed as index screening tools. For the former, the Pearson correlation coefficient () is typically used to investigate the linear correlation between different variables [28]:where and are the values of the first and second variables, respectively, and and are the average values of the first and second variables, respectively.

The absolute value of can effectively characterize the linear correlation between variables: 0.8 denotes a strong linear correlation between two variables, 0.7 denotes a moderate correlation, and 0.7 denotes almost no linear correlation. Moreover, 0 (0.8) represents a positive (negative) correlation.

The normalized engineering data in Table 3 was taken to calculate the Pearson correlation coefficient between an index and the warning degree on equation (11). This also denotes the correlation coefficient between input and output variables. Table 11 reports the results, where bold values indicate absolute values greater than 0.7.

Alarm degrees A12, A13, A23, A32, A33, A41, A44, and A54 are observed to correlate strongly with the output index. Therefore, the remaining 12 indicators were selected as the key risk early warning indicators and input into the SSA-LSSVM model for subsequent calculations. Table 12 reports the results of the calculations repeated 1,000 times.

Following the method of [29], the MIV method was employed to screen out the key risk indicator and input them into the SSA-LSSVM model for subsequent calculations (Table 12).

Compared with the correlation analysis and the MIV method, this study adopted RS to screen key risk indicators and obtained the best calculation accuracy. These results prove the advancement of the RS presented in this study.

4.2. Comparative Analysis of Different Early-Warning Models

To demonstrate the advancement of the early-warning model proposed in this study, a comparative analysis was performed based on two factors: the advancement of the optimization algorithm and the advancement of the nonlinear modelling method. The proposed model without the RS, the LSSVM, the SSA-BPNN, the PSO-LSSVM, and the PSO-BPNN were systematically ran. Tables 9 and 10 report the results, respectively. Based on the parameter setting method of [10], the sigmoid activation function was adopted for the BPNN, with a minimum convergence error of 10−5. For the PSO algorithm, following the parameter setting method of references [15, 30], the maximum iteration number was set as 200, the weight factor varied linearly from 0.4 to 0.9, and the acceleration constants C1 and C2 were both equal to 2.

4.2.1. Advancement of the Selected Optimization Algorithm

Figure 2 reveals that the fitness function of the SSA converged to approximately 3.5% in the 60–80 generation during the LSSVM parameter search, whereas the PSO fitness function converged in 110–120 generations. Thus, the SSA converged faster than the PSO, which is consistent with results from the previous literature [21, 22].

Table 9 demonstrates the running time of the proposed prediction model to be 1.401 s lower than that of the PSO-LSSVM. More specifically, for the typical nonlinear optimization problem of optimizing the LSSVM parameters, the SSA exhibited a better global fast retrieval ability than the PSO. Comparing the average running times of the PSO-BPNN and SSA-BPNN, the SSA algorithm revealed to save 0.896 s. The SSA exhibited an improved global fast retrieval ability in searching for the optimal initial threshold and initializing the BPNN.

These results prove the superior ability of the SSA algorithm for global fast retrievals.

4.2.2. Advancement of the Selected Nonlinear Modelling Method

Table 9 demonstrates the running time of the proposed prediction model to be 1.625 s, 1.401 s, and 2.511 s lower than that of SSA-BPNN, PSO-LSSVM, and PSO-BPNN, respectively. This highlights the global fast retrieval ability of the SSA and the simple structure of the LSSVM.

The running time of the proposed prediction model is clearly superior to that without the RS. This can be attributed to the removed redundant attributes of the original data from the attribute reduction of the RS. The reduced minimum conditional attribute set was consequently more representative and reduced the complexity of the LSSVM model. Note that the LSSVM exhibited the shortest running time because it does not involve iterative optimization calculations.

The LSSVM exhibited an improved calculation accuracy and stability compared with the BPNN (Tables 9 and 10). This is linked to the distinct calculation principles of the two methods. The LSSVM class prediction method determines the segmentation hyperplane with less support vectors, whereas the neural network-based prediction method follows the law of large numbers in nonlinear modelling. More specifically, the more training data samples, the higher the accuracy of the prediction results, with sample sets often required to be more than ten times the number of input variables. Therefore, under sufficient historical data and large research samples, the neural network-based prediction method has an advantage over LSSVM prediction methods in terms of nonlinear modelling ability and computational performance. However, for insufficient historical data and small research samples, the nonlinear modelling ability and computational performance of the LSSVM prediction methods are significantly superior to those based on artificial neural networks.

5. Conclusions

In the current study, an early-warning model of bridge engineering construction safety risk based on the RS, SSA, and LSSVM was constructed. The RS was used to effectively solve the problem of multiple risk early warning factors, whereas the LSSVM model optimized via the SSA overcame the low precision of traditional early warning methods. The construction safety risks of two typical bridge projects in China were predicted. Results demonstrated the warning error of the proposed warning model for bridge construction safety risk to be low for different warning levels. In addition, the number of misjudged samples was less than or equal to one. Compared with the LSSVM, SSA-BP, PSO-LSSVM, PSO-BP, and other models, the proposed model exhibits a fast calculation speed, high early warning accuracy, and strong robustness.

In addition, by employing 4M1E, the early warning index system of bridge construction safety risk was constructed, including 20 secondary indexes. Based on the engineering data of the research object, the index system was screened by the RS, removing the redundant or unrealistic attributes of the original data and effectively improving the early warning accuracy of the LSSVM model. For the restressed concrete continuous beam project and concrete-filled steel tubular arch bridge selected in this study, the operation violation rate, management safety skills, unqualified rate of mechanical quality, mechanical failure rate, concrete qualified rate, steel qualified rate, promotion of new construction schemes, and the wind load grade were the most important risk warning indicators.

Despite the substantial advancement made by this study in the research field, our proposed model faces several limitations. For example, just two case studies (the Longlingshan Bridge Project in Wuhan and the Shihe Bridge Project in Xinyang) were selected. However, bridges can take the form of numerous structural types and construction techniques. Further research will apply the research results of this study to additional bridges with different structural types or construction techniques. Moreover, the expert experience method, which is typically subjective, was adopted to determine the warning limit of early warnings for bridge construction safety risk. Although the conclusion of the case study was consistent with the actual situation, the determination of the early warning threshold requires further quantification. Lastly, the classification accuracy and performance of the LSSVM depend on the calibration of its parameters. Future work will focus on exploring effective multiobjective optimization techniques to optimize the LSSVM parameter combination and improve its prediction performance.

Abbreviations

AHP:Analytic hierarchy process
ANNs:Artificial neural networks
BPNN:Back propagation neural network
GA:Genetic algorithm
GM:Grey model
LSSVM:Least squares support vector machine
MIV:Mean impact value
PSO:Particle swarm optimization
RS:Rough set
SSA:Sparrow search algorithm
SVM:Support vector machine
4M1E:Men, machines, materials, methods and environment.

Data Availability

The case analysis data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This study was supported by the Science and Technology Project of Wuhan Urban and Rural Construction Bureau, China (201943).