Abstract
This paper has investigated the estimation problem for the competing risks model where the data are adaptive progressively type-II censored and follow the Rayleigh distribution. Maximum likelihood and Bayesian methods are used to estimate the unknown parameters. To generate the interval estimates of the model parameters, approximate confidence intervals and two bootstrap confidence intervals are also considered based on the classical setup. Bayes estimates are obtained using independent gamma priors based on various loss functions including squared error, LINEX, general entropy, weighted SE, and precautionary loss functions. Moreover, Bayes credible intervals and the highest posterior density intervals of the model parameters are obtained. A numerical investigation has been carried out to assess the performance of the classical and Bayes estimates as well as the associated confidence and credible intervals. Finally, one simulated and two real data sets, one for breaking strengths of wire connections and the other for times to failure of small electrical appliances, have been analyzed for illustrative purposes. The results showed that the Bayes estimates using the LINEX loss function provide more reasonable estimates than the classical and Bayes estimates using squared error, general entropy, and precautionary loss functions.
1. Introduction
The investigator may not be able to receive complete information on failure times for all experimental units via life-testing and reliability investigations, and thus, censoring is inevitable. In clinical practice, for example, patients may be removed from the analysis before the experiment is completed, or the analysis may have to be discontinued at a specific time threshold. Units in industrial investigations may fail by accident before a predetermined time point. In many cases, however, the discarding of units before failure is planned ahead of time to preserve the duration and expense of testing. In the past decades, in order to remove units during a lifetime experiment, various progressive censoring (PC) approaches have been presented in the statistical literature (see Balakrishnan [1] and Balakrishnan and Aggarwala [2]). In the course of time, several modifications on PC have been undertaken by many authors (see Kundu and Joarder [3]; Childs et al. [4]). To improve the efficiency of statistical investigation as well as for reducing overall test time, adaptive progressive type-II censoring (APCS-TII) was introduced by Ng et al. [5]. In APCS, the effective sample size is predetermined in this scheme, while the testing duration is permitted to pass the threshold time . The following is a description of an APCS-TII: suppose identical units are placed in a life-testing experiment, and the effective sample size is designated as the starting value of the experiment with a predetermined progressive censoring scheme . If the failure happens before the threshold (i.e., ), the test ends at the time . On the other hand, once the testing time exceeds the time but the number of observed failures has not reached , by modifying the number of items progressively withdrawn from the test upon failure, the researcher can terminate it as soon as possible. Thus, we should leave as numerous surviving units on the experiment as possible. Suppose is the number of failures before , that is, , , where and . When the experiment passed the predetermined time , we put and . This formulation directs us to end the test as soon as possible if the failure time is larger than for . The threshold of plays a significant role in determining the values of and also as a compromise between a shorter testing time and a greater opportunity to get extreme failures. When tends to infinity, the APCS-TII reduces to a conventional progressive type-II censoring scheme. Moreover, if , this variant reduces to the usual type II censoring scheme. Several investigations based on the APCS-TII have been conducted; readers can refer to the works of Hemmati and Khorram [6], Amein [7], Nassar and Abo-Kasem [8], Ateya and Mohammed [9], and Nassar et al. [10], among many others.
Researchers have recently shown a significant interest in studies on competing risk models, which include data from survival analysis, lifetime studies, reliability studies, and other areas. The independence of causes of failure appears to be restricted in a competing risks model; however, when the reason for failure is dependent, identifiability issues of the underlying model may occur. This must be highlighted that without knowledge of covariates, it is impossible to evaluate the assumption of the failure time distributions of competing causes using just observed data (see Kalbfleisch and Prentice [11], Crowder [12]). Several authors have studied competing risks model under different scenarios. In this regard, readers may refer to the works of Kundu et al. [13]. Competing risks model under APCS-TII data is studied by Ashour and Nassar [14] and Hemmati and Khorram [6]. Recently, Dutta and Kayal [15] studied competing risk model based on improved APCS-TII sample under independent exponential distributions.
We consider that the lifetimes follow the Rayleigh distribution (RD) in this investigation. This distribution is chosen because it is a logical extension of the exponential distribution and hence plays a key role in acoustics, life-testing of electro-vacuum systems, reliability analysis, communication engineering, and so on. Rayleigh [16] wa is the first to utilize this distribution in the literature, using it in the context of acoustics. It can be employed to model numerous paths of densely scattered signals while reaching a receiver. It is also used in reliability engineering for the lifetime modelling of an object, electromagnetic wave propagation, and physiological sensing systems, see Hossain et al. [17] for more details. The failure rate of this distribution is a linear function of time, which is a major characteristic. The Rayleigh distribution’s dependability function degrades at a considerably higher rate than the exponential distribution’s reliability function. Many researchers have written on situations in which data reveal increasing hazard rates. Besides, it has some relationship with other distributions such as Weibull, generalized extreme value, and Chi-squared distributions, and thus, its utility in different fields is significant. The RD with one parameter has been considerably investigated by several researchers (see Dey and Das [18] and Liao and Gui [19]) concerning the estimation, prediction, and several other inferences.
In this paper, our objective is to analyse competing risk data under APCS-TII where the lifetimes of the various risk factors are distributed independently and follow RD with a known cause of failure. Here, it is further assumed that Cox [20] latent failure time model is used for studying competing risks data. The competing causes of failure are considered to be distributed independently in the latent failure time modelling. According to the aforementioned censoring scheme, the maximum likelihood estimates (MLEs), approximate confidence intervals (ACIs), and two bootstrap confidence intervals of the unknown parameters are investigated in the context of the classical phenomenon. The Bayes estimates are obtained under different loss functions, namely, squared error (SE), LINEX, general entropy (GE), weighted SE (WSE), and precautionary (Pr) loss functions based on the assumption of independent gamma priors of the unknown parameters. Furthermore, for comparison purposes, Bayes credible intervals (BCIs) and highest posterior density (HPD) intervals are obtained via the MCMC sampling approach. Simulation studies are conducted to evaluate the performance of different point and interval estimates. Two real data sets are investigated for illustration purposes. To the best of our knowledge, no studies have been done on the aforementioned estimation methods for the Rayleigh distribution using an APCS-TII with competing risks data.
The remaining part of the paper proceeds as follows: In Section 2, we present the competing risk model and the corresponding likelihood function based on APCS-TII data. In Section 3, the MLEs and asymptotic variance-covariance matrix are obtained. The Bayes estimators based on various loss functions of the unknown parameters are studied in Section 4. We also provide the ACIs, two parametric bootstrap methods for obtaining CIs, the BCIs, and HPD intervals in Section 5. In Section 6, a simulation study is implemented to compare the efficiency of the different estimators and the different confidence intervals. In Section 7, we investigate simulated and real data sets to show the importance of the proposed methods and how these methods work in practice, and finally, we conclude the paper in Section 8.
2. Description and Notation of the Model
Without loss of generality, suppose that we have identical items in a lifetime experiment. Moreover, assume that there are only two causes of failure, and then, and refer to the latent failure time of the unit under the cause of failure. We also assume that are independent identically distributed random variables. In addition, it is assumed that and are independent and follow the RD with the following probability density function:
The corresponding cumulative distribution function is given by
We can write the observations under adaptive progressively type-II censored competing risks data as follows:where and , which indicates the cause of failure of item . Letthen and refer to the number of failures owed to cause 1 and 2, respectively, and . Based on the independence of and , we can obtain the relative risk due to cause 1 as follows:Similarly, the relative risk due to cause 2 can be obtained as follows:
Given a progressive censoring scheme , then the likelihood function of the observed data can be expressed as follows:where , for the sake of simplicity, and
3. Maximum Likelihood Estimation
Based on (1)–(3), the likelihood function, ignoring the normalized constant, can be written as follows:
The natural logarithm of the likelihood function in (4) takes the form
To obtain the MLEs of the and , we find the first-order partial derivatives of the log-likelihood function in (5) with respect to and , and then, by equating each of them to zero, we getwhere
From (6) and (7), we can acquire the MLEs and of and , respectively, as follows:
To obtain the ACIs for the unknown parameters and , we require to obtain the second derivatives of the log-likelihood function as follows:
Thus, the asymptotic variance-covariance matrix is obtained by inverting the observed Fisher information matrix as follows:
4. Bayesian Estimation
In this section, we use the Bayesian estimation approach to estimate the parameters of the RD under adaptive progressively type-II censored competing risks data. In a statistical study, the Bayesian technique has some benefits when compared with the usual maximum likelihood method. The capability of possessing prior details in analysis makes the Bayesian approach incredibly useful in reliability studies and many other areas where one of the important challenges is the limited availability of data. The Bayes estimates are obtained based on the assumption that the random variables and have gamma prior distributions with known shape and scale parameters and , , with pdf given by
Here, we consider the independent gamma priors in Bayesian analysis for the RD due to its flexibility, see, for examples, about independent gamma priors Kundu [21] and Ashour and Nassar [22]. In addition, some possible motivations for employing the independent priors for parameters and are furnished as follows: First, independent priors are fairly straightforward and concise, which may not produce many complicated inferential and computational problems. Second, in numerous practical problems, although the dependent prior appears more appealing, the dependent property between parameters cannot be explained from a statistical viewpoint due to historical knowledge and expert background where such prior may be very rare. Therefore, independent priors are more widespread in statistics under the Bayesian approach for the sake of simplicity. The joint posterior density of and can be expressed using the likelihood function in (4) and the joint prior distribution in (12) as follows:where
The marginal posterior densities of and are given, respectively, by
From (15) and (16), it can easily be noted that the marginal posterior distributions of and are gamma distributions, that is, and , respectively. Now, to obtain the Bayes estimators of and , we suggest using three distinct loss functions, the first one is the SE loss function, the second function is the LINEX loss function, and the third is the GE loss function.
4.1. Bayesian Estimators Using SE Loss Function
First, we consider the SE loss function. Based on the SE loss function, the Bayesian estimators of and , denoted by and , are the mean of the posterior densities (15) and (16), which are given by
It is observed that in the case of non-informative priors, , the Bayesian estimators of and corresponds to the MLEs given by (9).
4.2. Bayesian Estimators Using LINEX Loss Function
The LINEX loss function was introduced by Varian [23] as an asymmetric loss function and got its popularity due to Zellner [24]. Let be an unknown parameter to be estimated, and then, the Bayesian estimator of the unknown parameter using the LINEX loss function, denoted by , iswhere , and for closed to zero, the LINEX loss function may be considered as a natural extension of the SE loss function. In our case and from (15), (16), and (19), we can obtain the Bayesian estimators of and using the LINEX loss function, respectively, as follows:
4.3. Bayesian Estimators Using GE Loss Function
The GE loss function was introduced by Calabria and Pulcini [25]. Accordingly, the Bayesian estimator of the unknown parameter based on the GE loss function can be defined aswhere is the Bayesian estimator of under GE loss function. It is to be noted that for , the GE loss function reduces to SE loss function. When , it reduces to the WSE loss function. For , it reduces to the Pr loss function (Norstrom [26]) which is an asymmetric loss function. Based on (15), (16), and (22), the Bayes estimators of and using the GE loss function are, respectively, given by
Here, we assume three particular cases, corresponding to , , and . When , (23) and (24) correspond to the Bayesian estimators under the SE loss function given by (17) and (18). When , we can get the Bayesian estimators of and under the WSE loss function asand when , the Bayesian estimators using the Pr loss function of and , denoted by and , are as follows:
5. Confidence Intervals
In this section, five methods are proposed to construct the confidence intervals for the parameters and . The proposed methods are the ACIs, two kinds of bootstrap confidence intervals, the BCIs, and HPD intervals.
5.1. ACIs
Utilizing the asymptotic normality of the MLEs, the ACIs of and , are, respectively, given bywhere is the upper percentile of a standard normal distribution, and is the standard error of , which can be computed by carrying the square root of the main diagonal in (11).
5.2. Bootstrap Confidence Intervals
For a small sample size, the ACIs may not perform well. Hence, we offer to employ two parametric bootstrap confidence intervals for , namely, the percentile bootstrap (p-boot) and the studentized-t bootstrap (t-boot) confidence intervals. The p-boot and t-boot confidence intervals are obtained through the following steps:
5.2.1. p-boot
(1)From the original data, the MLEs of are obtained.(2) is used to get a bootstrap adaptive progressively type-II censored competing risks sample, and the bootstrap estimates of , say, are computed.(3)Step 2 is repeated times to calculate .(4)All ’s in ascending order as are arranged.(5)The two-sided p-boot confidence interval for is given by5.2.2. t-boot
(1)As in (1) and (2) in p-boot,(2)the -statistic is computed, where .(3)Step 2-3 is repeated times to get .(4) is arranged in ascending order as .(5)The two-sided t-boot confidence intervals for are5.3. Bayes Credible Intervals
The credible intervals with equal tails of and can be obtained using the posterior distributions of and . From (15) and (16), if , are integers, we can see that and follow Chi-squared distribution with and degrees of freedom, respectively. Therefore, BCIs for and are
If are not integer values, then gamma distribution can be employed to create the desired credible intervals.
5.4. Highest Posterior Density Intervals
The HPD intervals and for and are given by solving the next two nonlinear equations, respectively,
On simplifications, we can write (32) and (33) aswhere is the incomplete gamma function. Expressions in (34) and (35) can be solved numerically to get the required HPD intervals of and .
6. Simulation Study
In this section, we report the simulation outcomes to evaluate the performance of the different estimators and the different confidence intervals discussed in the previous sections. The simulation study is conducted by choosing the parameter (Par) values and , and various values of and the following censoring schemes (Sch)(i)Sch 1: .(ii)Sch 2: .(iii)Sch 3: .
For different cases, we obtain the MLEs and Bayesian estimates under SE, LINEX (with ), GE (with ), WSE, and Pr loss functions based on 1000 simulations. The Bayesian estimates are obtained by assuming gamma prior distributions for and with parameters and , respectively. These values are specified in such a way that prior averages are exact to the actual averages. Table 1 reports the mean values of estimates and mean square errors (MSEs). The confidence lengths and the associated coverage probabilities for the different confidence intervals are displayed in Table 2. The simulation study is carried out by using the MATHCAD program Ver.15. From Tables 1 and 2, we can conclude the following:(1)The Bayes estimates using the LINEX loss function serve more satisfactory results than other estimates in terms of the lowest MSEs followed by Bayes estimates based on the WSE loss function.(2)It is also to be noted that the MSEs decrease as increases in all the cases, which indicate that all estimators are consistent.(3)When and remain constant and increases, no specific pattern in the MSEs is observed because the number of failures is predetermined and no more information is collected when increases.(4)It is observed that the HPD intervals have the shortest confidence lengths followed by the BCI among all the interval estimation methods.(5)The BT intervals have the largest confidence lengths when compared with other methods.(6)As grows, the confidence lengths fall in all the interval estimation methods.(7)All the interval estimation methods are capable to hold the nominal coverage probabilities, except the t-boot intervals which have coverage probabilities always below the nominal level.(8)Sch 1 outperforms the other two schemes in terms of minimum MSEs and minimum confidence lengths.
7. Numerical Illustration
In this section, we analyse simulated and real data sets to show the importance of the proposed methods and how these methods work in practice.
7.1. Simulated Data Set
In this subsection, we analyse one simulated data set. We generate adaptive progressively type-II censored competing risks data for and and by choosing , , and the censoring scheme , with the understanding that when the testing time passed the threshold , we replace by zero, and in this case, we have . The simulated observation is displayed in Table 3.
From Table 3, it is to be noted that the first component refers to the lifetime, while the second one denotes the cause of failure. From the simulated data, we have 14 failures due to cause 1 and 6 failures due to cause 2; that is, and . For these data, we obtain the MLEs and Bayes estimates under the different loss functions discussed before. The Bayes estimates are obtained by considering informative priors, and the hyperparameters are taken to be for and for . The different estimates and the corresponding MSEs are displayed in Table 4. The different 95% confidence intervals are tabulated in Table 5. Based on the outcomes in Table 4, we observe that the Bayesian estimate using the LINEX loss function of has the lowest MSE among all other estimates, while the Bayes estimates based on the WSE loss function perform more satisfactory than other estimates in terms of minimum MSE for . Furthermore, from Table 5, we conclude that the HPD intervals have the shortest confidence lengths for both parameters.
7.2. Real Data Analysis
In this Section 7.2, we reanalyse two real data sets to show the applicability of the point and interval estimates discussed in this paper.
(1) Example One. The first data set is given by King [27] and consists of the breaking strengths (mg) of 20 wire connections. There are two causes of failures, the first cause is a bond lift, which is a failure of the bond and the second cause is a wire break, which is a failure of the wire, for more details about the data, see Nelson [28]. Table 6 illustrates the complete data set and the associated cause of failure.
The letters W and B appear in Table 6 refer to bond lift (cause 1) and wire break (cause 2), respectively. There were 10 failures due to each cause of failure. Before progressing further, we first compare the fit of the RD distribution with some other one-parameter models including Weibull (We), Lindley (L), and inverse Lindley (IL) distributions. The MLEs of the different models, the Kolmogorov-Smirnov (KS) distance, and the corresponding -value are displayed in Table 7 for cause 1 and cause 2, respectively. The results in Table 7 indicate that the RD can be assumed as an adequate model to fit the data set.
Now, we use the original data given in Table 6 to generate an adaptive type-II progressively censored sample by choosing , , and the censoring scheme . The generated observations are displayed in Table 8.
In this case, we have , and , and the estimated relative risk due to bond lift cause and wire break cause is 0.4 and 0.6, respectively. This result indicates that cause 2 is more severe than cause 1. The MLEs and Bayesian estimates of the unknown parameters are computed and shown in Table 9. The Bayesian estimates of and are obtained based on noninformative priors because no prior knowledge is known about the unknown parameters. It is observed that the MLEs and Bayesian estimates under the SE loss are coinciding under the noninformative priors, so we do not display estimates under the SE loss function in Table 9. Also, we can compute the MLEs of the meantime to failure for cause 1 as and the meantime to failure for cause 2 as . The various intervals estimates are also obtained and reported in Table 10. Comparing the different confidence intervals, we observed that the HPD intervals have the minimum confidence lengths.
(2) Example Two. The second data set is taken from Lawless [29] and also analyzed by Sarhan [30]. The data consist of times to failure of 33 small electrical appliances subjected to an automatic life test. Failures were categorised into 18 various modes, and only modes six and nine occur more than twice. We concentrate on failure mode 9. Therefore, the data consist of two causes of failure: cause 1 (mode nine) and cause 2 (all other failure modes). The complete data set is displayed in Table 11.
From Table 10, we have 17 failures due to cause 1 and 16 failures due to cause 2. The MLEs of the different models as well as the KS distance and the corresponding -value are presented in Table 12 for cause 1 and cause 2, respectively. The outcomes in Table 12 imply that the RD is a proper model for these data.
From the actual data shown in Table 11, an adaptive type-II progressively censored sample is generated by selecting , and the censoring scheme and . The generated sample is depicted in Table 13.
From Table 13, we have , and , and the estimated relative risks due to causes 1 and 2 are 0.4 and 0.6, respectively. The MLEs and Bayes estimates of the unknown parameters are calculated and displayed in Table 14. The Bayes estimates of and are obtained using noninformative priors. Under the noninformative priors, the MLEs and Bayes estimates under the SE loss are coinciding, so we do not show the estimates using the SE loss function in Table 14. Moreover, the MLEs of the meantime to failure for cause 1 and cause 2 are 3686.32 and 3009.87, respectively. The various intervals estimates are also obtained and displayed in Table 15. Also, we can see that the HPD intervals have the lowest confidence lengths.
8. Conclusion Remarks
In this investigation, we worked with a competing risks model based on the Rayleigh distribution and used adaptive progressively Type-II censoring with a known cause of failure. Although we concentrated on only two causes of failure in this study, this research may be extended to include more causes of failure. We first obtain the maximum likelihood estimators of the unknown parameters. Bayesian estimators are also derived by considering both symmetric and asymmetric loss functions. Different procedures are adopted for building confidence intervals for the unknown parameters. A simulation study is implemented to assess the behaviour of classical and Bayesian estimates. The simulation results show that the Bayes estimators under LINEX loss function perform better than other estimators in terms of minimum mean square errors and the highest posterior density intervals give the shortest confidence length. Finally, a numerical illustration is offered to demonstrate the different methods involved in the article.
Data Availability
The data used to support the findings of this study are included within the article.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
Acknowledgments
This research was funded by the Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2022R50), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.