Abstract
Except the two types of state, complete failure and perfect functioning, some intermediate states also existed between those two states. This characteristic enables multistate system (MSS) whose components are often of more than two states. Moreover, because of dependence between components, common cause failure (CCF) enhances the failure risk of components during the operating period of MSS. A protection mechanism needs to be adopted because of the importance of certain components. For a MSS with CCF and protection mechanism, its reliability can be evaluated by a proposed approach. Certain components not only can cause failure at one state, but also can cause failure to other components because of the CCF. Under the impact of one type of CCF, those components can constitute a specific group that can overlap with certain components. Universal generating function (UGF) is adopted to incorporate the CCF and protection mechanism of a component. Consequently, reliability indices of a MSS can be evaluated by the proposed approach based on UGF. Examples and comparisons illustrate and validate this method. This method can also help engineers prioritize elements that require protection mechanism under various resource limitations.
1. Introduction
With the development of industrial technology, product quality is being considered as an important aspect for customers. In addition, the traditional reliability theory cannot be applied in the modernized industrial system and/or component because of its limitation to two states. For example, certain systems (such as those in the manufacturing, production, and oil and gas industry, and energy systems whose total performance can remain at different stages depending on the operating conditions) are referred to as multistate system (MSS) [1]. The analysis of MSS entails the development of new representation, modeling, and quantification technologies. A few researchers had focused on the above-mentioned aspects [2–5]. A new approach based on logical differential calculus is proposed for importance analysis and application of MSS, where MSS states change depending on simultaneous changing of some components’ states [6, 7]. Other technologies including these for assessment and optimization can be applied to MSSs in industries [8–11].
Multiple components fail owing to one cause, and this type of failure is named common cause failure (CCF). In real-life industry fields, it often occurred. Unprotected electronic equipment can experience outage in extreme weather conditions such as lightning events. And multiple failures of components often originate from voltage surge because of inappropriate switching in power system. CCFs can be divided into two types according to the origin of the cause: internal CCF (e.g., originating from propagated failures than from one component or multiple failures within system) and external CCF (e.g., abrupt variations in the environment such as those in the temperature and humidity or fire caused by lightning) [12, 13]. From the perspective of maintenance, an implicit two-stage approach was used to analyze the reliability of a nonrepairable MSS incorporating the CCF [14]. Using recursive and exact methods, the authors developed the reliability evaluation procedure for a nonrepairable system, wherein an individual component’s failure could cause CCFs of other components [15]. However, in many real-life scenarios, repair is generally required to improve the reliability and/or availability of a system. An instantaneous availability model aimed at repairable MSSs with public bus performance sharing was suggested [16]. With regard to other aspects, such as the predictive inference of system reliability, a nonparametric approach was developed for a system with CCF [17].
CCF reduces the reliability of a technology system because of its interdependent failures (i.e., failures depend on other components). Interdependent failures are also called “interactive failure” and can be categorized into two types based on the consequences of failure of a component in the system: immediate failure dependence and gradual degradation dependence [18]. Life cycle cost and reliability were analyzed considering interdependent degradation [19, 20]. The reliability analysis of MSS with statistics dependent components was studied from the perspective of statistics dependence that exists in the physical and complicated systems [21]. Owing to the inflection of element’s performance distribution from those of other elements or subsystems, the dependency of elements is considered to evaluate MSS reliability based on universal generating function (UGF) technology [22]. Propagated failures are taken into account when a recursive approach was proposed for reliability analysis of MSS with CCF [15, 23]. System reliability can be improved using redundancies, fault detection, and isolation mechanisms. Nevertheless, once the above methods fail, certain components’ failures cannot be detected. This phenomenon is named as imperfect fault cover (IFC). Stochastic multivalue models are put forward to achieve the availability of MSS with the incorporation of IFC [24]. An irrelevancy coverage model is also put forward to cover the irrelevant components, as well as the faulty components [25]. For assessing the risk of large cascading blackout, based on random chemistry, a new approach with high computational efficient is reported [26]. A dynamical stochastic is developed to explore the spread of flowing failure in the communication grid [27]. An MSS and component with CCF were modeled based on the dynamic Bayesian network to perform reliability assessment [28, 29]. A reliability analysis method based on the evidence network was investigated to address the epistemic uncertainty of complex MSS with CCF [30]. Although many other researchers have focused on dependent failure during reliability assessment, inadequate consideration has been paid to the protection of important elements. Risk analysis was applied to reduce the failure risk of industrial systems in related fields [31–33]. Meanwhile, performance optimization and assessment of certain wireless networks and power line engineering were also investigated [34, 35]. Although a rich body of research has been conducted for addressing effects of CCF in reliability analysis of diverse types of systems, protecting mechanism on certain important elements during the reliability assessment of an MSS with CCF is seldom focused on. One contribution is to build protecting mechanisms, which are implemented and extended for improvement of system reliability based on earlier studies. The other contribution is to develop a recursive procedure, which is based on the straightforward and conveniently programmable UGF technology. In this paper, the procedure aims to evaluate the reliability of an MSS with CCF under the protection measures implemented on certain important elements. Here, the protection measures can be considered as a special mechanism. For example, certain components need to be protected and isolated from fire caused by other components. A firewall would be set up to protect the important components.
The remainder of this paper is organized as follows. Section 2 presents the MSS model. The UGF method for estimating the system performance distribution and reliability indices is presented in Section 3. In addition, a specific method to evaluate an MSS with CCF under protection measures is developed in this section. Section 4 presents several examples to illustrate the validity of the approach. Section 5 concludes this paper.
2. Modeling of MSS
One of the most important characteristics of an MSS is the multiple states of its components, which are generally connected in series-parallel. The performance level of component can vary from zero (complete failure) to (perfect function). We denote each performance level by , and all the levels can form the vector . Furthermore, the performance levels of the overall system would adopt values from the following set, which is obtained as the Cartesian product of all :
The performance level of component at a specified moment will be also assumed to be a discrete random variable . In addition, would adopt the value at a certain probability such that . All the probabilities can constitute a set
Here, . That is, component could be in one, and only one, state at any time because all the states establish a group that is composed by mutually exclusive events.
Here, one assumption is that certain external reasons, such as lightning, which can cause failure or degradation of the system, will not be considered. Based on this situation, the state of an MSS can be ascertained explicitly from the state of its components.
The states of an MSS are denoted as ; i.e., the MSS would have states. Similarly, state 0 denotes complete failure, state denotes the perfect function, and the others denote the intermediate states. If denotes the performance level of an MSS at system state , all the MSS performance levels can constitute the following set :
Suppose that is a discrete random variable to denote certain performance level of MSS and can adopt values from . Combining (1) and the above modeling of components and MSS, the system function of the MSS can be written as
Essentially, this formula is to map the performance level space of components to the one of system. Likewise, the probability of system at certain performance level can be denoted by
Several reliability indices can be derived from the above model to quantify the reliability of an MSS. In general, CCFs can be divided into two groups based on the origin of their causes (external and internal causes). CCFs belonging to the latter group are frequently referred to as propagated failures because these can propagate to other components of the same system. For the MSS considered here, component failure because of certain type of internal cause would result in the failure of other components. These failed components may be mutually dependent or even overlay on one or more components. Consequently, the performance level of the MSS would be weakened by an induced decrease in reliability. If the performance level reduces to a value below a boundary value , which is generally named as the system demand, the reliability of the MSS cannot satisfy the demand and is considered unacceptable. In this scenario, the reliability of an MSS with CCF can be determined from the probability that the system satisfies . From (5), the system reliability function can be determined aswhere .
Another important index can be defined as the conditional expected performance . This indicates the expected performance level of an MSS under the scenario where the MSS is in an acceptable state. This index can be obtained using the system reliability function :
To calculate the two indices, the distribution of the performance level of an MSS should be first obtained according to (5). The UGF technique has been demonstrated to be highly efficient for the MSS’s reliability evaluation of different kinds of system structure. Particularly, for an MSS, whose components are connected by series or parallel, it is highly effective to implement the UGF technique by the recursive approach.
3. UGF and Reliability Evaluation
3.1. UGF Technique
The high-dimensional combinatorial problem is generally encountered during the process of calculating MSS reliability because of its multistate components. UGF technique can address this problem efficiently because UGF, also called a u-function or universal z-transform [36], is based on the generalization of the ordinary moment-generating function and z-transformation. For a multistate component at a state with a probability mass function (PMF) , its UGF can be defined by
The UGF of a series-parallel MSS can be determined by performing certain algebraic operations on the components’ UGF. To represent the UGF of an MSS with the structure function (4), the composition operator needs to be defined as follows by combining the UGF of each components:
Here, the polynomial represents the UGF of the MSS. The items on the right-hand side of the above formula are the feasible mutually exclusive combinations of the UGF of individual independent components. The function needs to be determined according to the task characteristics.
The direct derivation of is still challenging for different types of task. As presented in [36], exhibiting the in a recursive method is effective from the perspectives of simplified calculation and derivation clarity. Particularly, for an MSS with complex configuration, the overall system can be represented as the composition of several subsystems consisting of certain components. This characteristic can be expressed as follows:
Here, the functions , and represent different operations for different configurations. The configuration of an MSS or its subsystems can generally be considered as the connection style among the subsystems or components. For any series-parallel components in subsystem, the composition operator can be employed recursively to obtain the UGF of the series-parallel components.
For continuous energy or material transmission system, such as electric power, oil, and water, its performance level can be defined as productivity. For paralleled components, the total productivity of the system is the sum of productivities of each individual component. The total productivity of two paralleled components ( and ) is equal to the sum of the individual productivities. The function calculates the sum of the corresponding performance. The performance of the two components in this occasion can be expressed as
Similarly, for series components, the component that is of the minimum productivity would become the limit. Function calculates the minimum of all the parameters. Consequently, the UGF of two components ( and ) connected in series should adopt the following formula:
3.2. Expression of Component’s UGF Including CCF
The CCF triggered by component can correspond to state with the performance level . Actually, the performance value can be equal to the component performance value of local failure with zero. If component is in state , those components involved by this CCF would also be in the failure state with performance . If a component cannot arouse CCF, the PMF of the corresponding state could be zeroed . The UGF of component containing the CCF can be expressed by
The conditional PMF of component that cannot fail owing to the CCF can be denoted bywhere is the probability of causing the CCF.
Let there be components that can cause the CCF simultaneously and independently. For the component vector , , the probability that the CCF originated from one component can be denoted by . Because these CCFs can occur independently, the result of combination is .
For one combination , the CCF comes from component if
The group of components corresponding to the can be achieved after calculating the above formula from to . The probability of the CCF corresponding to each is
The group of components causing CCF under the specific is
Assuming that a component fails owing to the CCF, the conditional PMF of its performance level can be denoted by . This shows that the component can only be in the failure state with performance level .
When the CCF corresponding to the combination occurs, all the components in would be in CCF mode, and their UGFs would be substituted by . Conversely, the UGFs of components not belonging to must be denoted by (14).
3.3. Incorporating Protection into Component’s UGF
Based on the properties of CCF, other failed components caused by component can be expressed by . For each , the group of components affected by the CCF can be obtained by
For certain important components, protection measures can be adopted to safeguard the components from CCF. When the CCF corresponding to occurs, any component belonging to the group fails when its protection mechanism fails and continues functioning with its typical performance level when its protection mechanism functions well. The protection for component can fail with probability . Based on the whole probability theorem, protection can be incorporated in UGF as follows:
Because of the protection, the UGF of a component in should be replaced with (19). Here, the performance level is generally equal to zero. That is, .
Then, the conditional PMF of system performance would be attained as . Combining (16), system UGF can be computed by
Using the expression of system’s UGF, the relevant indices of reliability can be evaluated by (6) and (7).
In addition to the above analysis, the next steps should be adopted to realize the evaluation.(i)S1. Integrate the CCF into component’s UGF using (13).(ii)S2. Achieve the conditional UGF of component according to (14).(iii)S3. Find out all the combinations of components evoked by the CCF.(iv)S4. For each combination , fix the corresponding components, which can arouse the CCF based on (15).(v)S5. Compute the probability of every combination by (16).(vi)S6. Establish the set of components triggering CCF by (17), and substitute the UGF of all the components belonging to with .(vii)S7. Determine the group of components influenced by the CCF given a combination according to (18). Furthermore, substitute the UGF of all the components belonging to the group 6 with (19).(viii)S8. Represent the conditional UGF based on the physical structure of MSS.(ix)S9. Obtain the UGF of system by (20).
After is achieved, the reliability indices can be evaluated using (6) and (7).
4. Illustrative Examples
4.1. Analytic Example
A type of series-parallel MSS shown in Figure 1 can be illustrated to validate the above method for the reliability evaluation of MSSs with CCF and protection. Flowing materials such as water, oil, or steam can be transported from A to B by the multistate components. The structure of system in this example is presented as follows. The MSS is configured by two subsystems: Sub1 and Sub2. Each subsystem is connected in series. Within each subsystem, these components are connected in parallel. Because of the CCF, the components C12 and C21 constitute the common group CG1. The component C13 and the second subsystem form CG2. For example, a feeding water system consisting of pumps in a power plant can be divided into two subsystems. C21 and C22 are vital and need to be protected from the fire originating from C12 and C13.

Performance level of MSS in this example will depend on components’ states and system configuration. The state number of each component and probabilities of these components at each state are listed in Table 1. For each component, the events of component in working, local failure, and failure to cause CCF constitute the total spaces. Thus, the sum of the probabilities in those states is one. However, the probability of protection failure is not dependent on the component and should be handled independently.
Based on the above analysis, the UGF of each component can be represented as
Their conditional UGF can be obtained by (14) as follows:
The components causing CCF are C12 and C13, such that . Therefore, the components affected by these are , . The parameters corresponding to different combinations are listed in Table 2. These values of will be used to calculate the UGF of system according to(20).
Because of the protection measures, according to (19), the UGF of C21 and C22 can be expressed as follows:
According to the physical structure of the example, its conditional UGF can be obtained by
For , there is no element in the sets and . Therefore, in (24), no element’s UGF needs to be substituted. There are 14 items in the UGF expression in the calculation result. The performance values and corresponding probabilities are displayed in Figure 2(a).

(a)

(b)

(c)

(d)
For , and . Therefore, and in (24) should be substituted with and , respectively. Similarly, the 9 items in the result are displayed in Figure 2(b).
For , and . Therefore, , and in (24) should be substituted by , and , respectively. Similarly, there are 10 items in the UGF expression in the result as shown in Figure 2(c).
For , and . Therefore, in (24), and need to be substituted by . and should be substituted by and , respectively. In the result, the UGF expression in the result has 4 items, as shown in Figure 2(d).
Finally, UGF of the whole system can be achieved by (20) like this:
The UGF expression of the whole system has 14 items and is shown in cyan color in Figure 3.

The reliability function of the entire system with protection is depicted in cyan color in Figure 4. From point A, we obtain .

Similarly, the curve of the conditional expected performance of this system with protection is plotted in cyan color in Figure 5. Figure 5(a) is the entire graph of conditional expected performance, and Figure 5(b) shows details at . From Point A, we obtain .

(a)

(b)
Although there are only five components in the system, the proposed approach is also suitable for system with a large number of components. This is because the entire procedure can be programmed conveniently using an ordinary programming language. For example, the total time required for the calculation using MATLAB in the computing environment applied by the authors is 1.9536 s. The present main configuration of the computer is as follows: main frequency = 2.4 GHz and memory = 4 GB.
4.2. Comparison with the Cases without Protection
A comparison with the case without protection on the MSS can be performed to validate the effectiveness of the protection mechanism. The difference between the system with and without protection lies in the incorporation of the S6 and S7 in subsection C of Section 3. If an MSS has no protection mechanism, the elements in the set should be substituted by while calculating . Based on the example in Section A, the procedure for calculating is as follows:
For , is empty. Therefore, the result does not vary with the result in the above example.
For , . Thus, the UGF of C12 and C21, and in (24) should be substituted by . Its UGF expression in the result has 5 items.
For , . Therefore , , and in (24) should be substituted by . The result includes one item in the UGF as follows:
For , and , and therefore . In (24), , , and need to be substituted by . The result for this case is as follows:
Lastly, the UGF of whole system without protection will be obtained according to (20). There are also 14 items in its UGF, as shown in magenta in Figure 3.
In this case, its reliability function is plotted in magenta in Figure 4. Point B displays a lower value than that of Point A in Figure 4. Similarly, the curve of the conditional expected performance of the system at different performance levels is shown in magenta in Figure 5. A comparison of these curves reveals that these values are extremely close. We obtain from Point B.
From the comparison results, it can be found that the reliability results of system with protection have higher values than the ones of the system without protection. This validates that the protection mechanism is effective and justified. Similarly, the curve of conditional expected performance of system without protection is higher than the one of the system with protection because it is inverse proportional to the reliability function. This also provides a proof to the effectiveness of protection.
4.3. Numerical Example
Here is another numerical example (with more number of components and CCFs) to prove the efficiency of the suggested approach. The MSS topology structure in this example is shown in Figure 6. This system consists of three subsystems: Sub1, Sub2, and Sub3. Sub1 and Sub2 are connected in series, and their combination is parallel to Sub3. There are three common groups: CG1, CG2, and CG3. The components in each common group are marked using dotted lines. In CG1, component C11 causes CCF, and C21 is affected by the CCF. In addition, C21 is also protected with a certain probability. In CG2, component C12 triggers the CCF, and C23 would fail and would also be protected. In CG3, component C31 triggers the CCF and causes the failure of C32 and C33. In CG3, component C33 would be protected with a certain probability.

Performance level of MSS will also vary with the states variation of components. All the states and probabilities of these components at each state in the system are recorded in Table 3.
The UGF of this system in different cases can be procured conveniently based on the proposed method. A comparison of these 26 items in the UGFs is depicted in Figure 7.

The reliability function of the system with and without protection can be compared. The results are displayed in Figure 8. Evidence shows that system with protection has a higher reliability level than that without protection. Point A has a higher value than that for Point B . It also validates the effectiveness of protection against system safety.

In addition, there is no significant difference in conditional expected performance between the systems with and without protection. The whole graphs are shown in Figure 9(a). As shown in the figure, the values at Point A and Point B are similar. Further details are shown in the local graphs in Figure 9(b).

(a)

(b)
The above analysis reveals that the recommended method is also suitable for a more complicated system with large-scale components. Furthermore, less time-consumption is another advantage over other similar methods. The time-consumption for this example under a configuration similar to that in Subsection A of Section 4 is 2.1484 s. The time cost is generally reasonable for engineers. Based on the time-consumption results from the above two examples, this procedure implementing protection on important component is also acceptable in realistic engineering system.
In addition, the data or information of system coming from the computing result is crucial for reliability engineers. When making decision on which components need to be protected, those reliability indices provide scientific evidence. In particular, when there are many components in a large system, engineers can choose the most valuable components among the system and implement the protection on them based on the reliability evaluation indices.
5. Conclusion
To evaluate the reliability of series-parallel MSSs with CCF and protection, a reliability evaluation approach is recommended in this paper. Components that can evoke the CCF may trigger failures of different subgroups of system components. These subgroups can also overlay with certain components. Because a few components are important for the system, the protection mechanisms are implemented on these ones. The factors of CCF and protection can be melted in the UGF based on the UGF technology. Because it is repeated for each combination, this approach can be programmed suitably by repetition. Some indices such as reliability and the expected conditional performance level can be calculated. To prove the validity of protection mechanism, a comparison with components without protection can be performed. This approach would also help in deciding which component is the most valuable and therefore, needs to be protected by reliability engineers. According to the assumptions in this approach, the restriction of system structure lies in the series-parallel MSS now. Probabilistic and fuzzy UGF would be considered to extend this approach in complex MSS such as bridge and G(k/n) system in the future.
Data Availability
The data used to support the findings of this study are included within the article.
Conflicts of Interest
The authors declare that there are no conflicts of interest regarding the publication of this paper.
Acknowledgments
This work was supported financially in part by a grant from Fundamental Research Funds for the Central Universities (Nos. 2020MS120 and 2018MS076).