Abstract
This paper proposed a new probability distribution, namely, the half-logistic xgamma (HLXG) distribution. Various statistical properties, such as, moments, incomplete moments, mean residual life, and stochastic ordering of the proposed distribution, are discussed. Parameter estimation of the half-logistic xgamma distribution is approached by the maximum likelihood method based on complete and censored samples. Asymptotic confidence intervals of model parameters are provided. A simulation study is conducted to illustrate the theoretical results. Moreover, the model parameters of the HLXG distribution are estimated by using the maximum likelihood, least square, maximum product spacing, percentile, and Cramer–von Mises (CVM) methods. Superiority of the new model over some existing distributions is illustrated through three real data sets.
1. Introduction
In the recent years, generalization and extension of existing distributions, to enhance flexibility in modeling a variety of data, have received attention of many researchers. Marshall and Olkin [1] proposed a general method of adding a parameter to a family of distributions which is called the Marshall–Olkin-G (MO-G) class. The new parameter gives more flexibility and extends several well-known distributions. For any continuous baseline with the cumulative distribution function (cdf) say , and the probability density function (pdf), say , the MO-G family is defined as follows:where , , and is the survival function . Lately, some of the notable new families of distributions include beta-G [2, 3], gamma-G [4], Kumaraswamy-G [5], transformed-transformer [6], exponentiated generalized [7], Weibull-G [8], exponentiated half-logistic-G [9], Kumaraswamy–Weibull-G [10], exponentiated Weibull-G [11], additive Weibull-G [12], type I half-logistic-G [13], type II half-logistic-G (TIIHL-G) [10], generalized additive Weibull-G [14], odd exponentiated half-logistic-G [15], generalized odd log-logistic-G [16], inverse Weibull-G [17], power Lindley-G [18], and type II generalized Topp–Leone-G [19].
Hassan et al. [20] proposed a class of probability distribution generated by a half-logistic random variable with the following cdf:where is the shape parameter and is a baseline cdf, which relies on a parameter vector . The pdf corresponding to (2) is given by
For in (3) and in (1), pdf (3) provides the MO-G family. The pdf (3) provides a number of known distributions as particular cases with more flexibility in their skewness and kurtosis.
Recently, Sen et al. [21] proposed the xgamma (XG) distribution, following the idea of Lindley distribution. The XG model is a special finite mixture from exponential with a scale parameter (θ) and gamma with a scale parameter (θ) and shape parameter (3). The pdf and cdf of the XG distribution are, respectively, given byand
Sen et al. [21] investigated the structural properties of the XG distribution, and they have found that in many cases that the XG distribution has more flexibility than the exponential distribution. The extensions of XG distribution were studied in [22, 23] and [24], among others.
Our objective here is two folds: First, based on the TIIHL-G class, we propose a new distribution related to xgamma distribution. We call the new model as the half-logistic xgamma (HLXG) distribution, and we study its several statistical properties. Second, the maximum likelihood (ML) method is employed to estimate the model parameters of the HLXG based on complete and censored samples. Further, asymptotic confidence intervals (CIs) of the model parameters are constructed. Simulation and application issues are considered. The rest of the paper is organized as follows. Sections 2 and 3 provide the pdf, cdf, hazard rate function (hrf), and structure properties of the HLXG model. In Section 4, point and approximate CIs of model parameters are derived under complete, type I censoring (TIC) and type II censoring (TIIC). Also, in the same section, the behavior of the estimates is studied via a simulation study. Real data sets are analyzed to demonstrate the flexibility of the HLXG distribution over some known distributions in Section 5. The article ends with some concluding remarks.
2. The Half-Logistic Xgamma Distribution
In this section, we provide a more flexible model by adding one extra shape parameter to the XG model for improving its goodness-of-fit to real data. The motivations of the HLXG distribution are (i) to obtain a more flexible pdf with right skewed, unimodal, and reversed J-shape; (ii) to be capable of modeling decreasing, increasing, and semibathtub hazard rate shapes; and (iii) to provide more flexibility to model the various types of data.
Definition 1. A random variable X is said to have a HLXG distribution, if its cdf is given bywhere .
The corresponding pdf is given as follows:A random variable X with pdf (7) will be denoted by HLXG∼(λ, θ). For λ = 1 in (7), we obtain the MO extended xgamma distribution with δ = 0.5 as a special new model. Further, the sf and hrf of the HLXG are given, respectively, by and .
The pdf and hrf plots for the HLXG are displayed in Figure 1 for some given values of parameters.
Figure 1 reveals that the pdf of X is quite flexible and can take asymmetric forms, among others. Also, the hrf can be increasing, decreasing, and semibathtub shapes. In general, they reinforce the importance of the HLXG model to fit real lifetime data.

3. Main Properties
In this section, we give some important properties of the HLXG distribution such as ordinary and incomplete moments, stochastic ordering, mean residual life, and mean waiting time, among others.
3.1. Expansion
This section provides a useful expansion of the HLXG pdf due its complicated form to obtain its structure properties. Since the generalized binomial series isthen, by applying (8) in (7), we have
Employing the binomial expansion in the last term of (9), we have .
Again, we use the binomial expansion more than one time; then, we havewhere and .
3.2. Moments and Related Statistics
In this section, we obtain moments and some related measures of the HLXG distribution with pdf defined in (10). The sth moment about the origin of the random variable X has the HLXG distribution obtained as follows:
After simplification, the sth moment of the HLXG distribution is given bywhere is the gamma function. Furthermore, the sth central moment of a given random variable X is defined by
Table 1 gives the numerical values for certain parameter values of mean , variance (), coefficient of skewness (CS), and coefficient of kurtosis (CK) of the HLXG distribution. It is evident from Table 1 that skewness and kurtosis heavily depend on the value of the parameters of the new distribution.
Additionally, the sth lower incomplete moment, say , of the HLXG distribution is given bywhich leads towhere is the lower incomplete gamma function. The first incomplete moment, say , for s = 1 in (15 is obtained. The Lorenz and Bonferroni curves are the essential applications of the first incomplete moment. The Lorenz curve, say LO (t), and the Bonferroni curve, say BO (t), of the HLXG are obtained, respectively, as follows:and
3.3. Mean Residual Life and Mean Waiting Time
The mean residual life (MRL) function at age t measures the expected remaining lifetime of an individual of age t. The MRL of X is given by
Hence, the MRL of X can be obtained as follows:
The mean waiting time (MWT) of an item failed in an interval [0, t] is defined as
Hence, the MWT of the HLXG distribution is determined as follows:
3.4. Stochastic Ordering
Shaked and Shanthikumar [25] stated that for independent random variables X and Y with cdfs, and , respectively, X is said to be smaller than Y in the following ordering, if the following is available:(i)Stochastic order if for all (ii)Likelihood ratio order if is decreasing in (iii)Hazard rate order if for all (iv)Mean residual life order if for all
We have the following chain of implications among the various partial orderings discussed above:
Theorem 1. Let X∼HLXG and Y∼HLXG (). If and , then , , , and .
Proof. It is sufficient to show that is a decreasing function of ; the likelihood ratio istherefore,where . Thus, is decreasing in x, and hence . Similarly, we can conclude for , , and .
3.5. Quantile Function
The HLXG quantile function, say , is straightforward to be computed by inverting (6) as follows:where . The uniroot function of the R software can be used to solve nonlinear equation (25), numerically, and then the HLXG random variable X can be generated, where u has the uniform distribution on the interval (0, 1).
4. Parameter Estimation under Censored Samples
In survival analysis and the industrial life testing model, it is necessary to minimize the cost and/or the duration of a life-testing experiment, so one may choose to terminate the test early, which results in the so-called censored sampling scheme. TIC and TIIC are the most popular types of censoring. The life testing experiment is ended at a predetermined time, say T, for the TIC, while for the TIIC, the test is ended when a predetermined number of failures, say r, is reached.
In the following sections, the maximum likelihood (ML) estimators of the model parameters of the HLXG model based on complete, TIC, and TIIC are provided. Approximate CIs are constructed. Further, estimators of survival function and hazard rate function for different mission times are given.
4.1. Maximum Likelihood Estimators via TIIC
Let X(1) < X(2) << X(n) be a TIIC of size r from a life test on n items whose lifetimes have the HLXG model with parameters λ and θ. The log-likelihood function, denoted by , of r failures and (n − r) censored values is given bywhere c = n!/(r − 1!) (n − r)!, and
The first partial derivatives of the log-likelihood function with respect to λ and θ are obtained as follows:where
Putting and equal to zero and solving these equations numerically provide the ML estimator of λ and θ, respectively. Note that for r = n, we obtain the ML of the model parameters in case of a complete sample.
4.2. Maximum Likelihood Estimators via TIC
Suppose that n items, whose lifetimes have the HLXG distribution, are placed on a life test, and the test is terminated at a specified time T before all n items have failed. The number of failures r and all failure times are random variables. The log-likelihood function, based on TIC, is given bywhere
The first partial derivatives of the log-likelihood function with respect to λ and θ are obtained as follows:where
Putting and equal to zero and solving these equations numerically provide the ML estimators of λ and θ.
4.3. Approximate Confidence Intervals
In this section, approximate confidence intervals (CIs) of the model parameters for the HLXG distribution are obtained. The asymptotic variances and the covariance matrix of the ML estimates are given by the elements of the inverse of the Fisher information matrix . Unfortunately, the exact mathematical expressions for the above expectation are very difficult to obtain. Therefore, the Fisher information matrix is given by , which is obtained by dropping the expectation on operation and replacing and with and , respectively (see [26]). The asymptotic variance covariance matrix for the maximum likelihood estimates can be written as follows:
Hence, the approximate 100 two-sided CIs for and are, respectively, given bywhere is the upper percentile of the standard normal distribution. By a similar way, we can find the approximate 100 two-sided CIs for and under TIIC and a complete sample.
4.4. Simulation Study
This section provides a simulation study to assess the behavior of the estimators in case of complete, TIC, and TIIC. Mean square errors (MSEs), biases, lower bounds (LBs) of the CIs, upper bounds (UBs) of the CIs, and average lengths (ALs) of 90% and 95% are calculated via Mathematica 9. The following algorithm is designed as follows:(i)10000 random samples of size n = 30, 50, and 100 are generated from HLXG distribution. The corresponding number of failures is selected as r = (21, 27, 30), (35, 45, 50), and (70, 90, 100).(ii)Values of true parameters () are taken as (0.5, 0.5), (0.5, θ = 1.2), and (1.7, 0.9).(iii)The termination time is selected as T = 2.5 and 4 in case of TIC.(vi)The ML estimates, MSEs, biases, LBs, UBs, and ALs for selected values of parameters are calculated.(v)Numerical outcomes are listed in Tables 2–4 based on complete and TIIC, while Tables 5–7 contain simulation results based on TIC.(vi)ML estimates of sf, hrf, and h (t0), at different mission time t0 where t0{3, 5, 10} of the HLXG distribution, are calculated. Observed values under TIIC and complete samples are recorded in Table 8, while Table 9 contains numerical results based on TIC.
From Tables 2–9, we observed the following: The Biases, MSEs, and average length of and decrease as n increases As the censoring level time values, T, increase, the biases, MSEs, and ALs of and decrease The biases, MSEs, and ALs of and decrease as the number of failures r increases The lengths of the CIs become narrower as n increases The ALs of the CIs increase as the confidence levels increase from 90% to 95%. Also, as the number of failures r increases, the AL of CIs decreases As the mission time increases, the estimates of sf decrease while the hrf estimates increase for all sampling schemes As the sample size increases, the sf and the hrf estimates are increasing The MSEs, biases, and ALs are increasing as increases from 0.5 to 1.2 at
5. Data Analysis
In this section, we fit our proposed HLXG model to the following three real data sets. Furthermore, parameter estimation of the three real data sets by using the ML, least square (LS), maximum product spacing (MPS), percentile (PE), and Cramer–von Mises (CVM) methods is provided.
The first data consists of 100 observations of breaking stress of carbon fibres (in Gba) given by Nichols and Padgett [27]. The data are given as follows:
3.7, 3.11, 4.42, 3.28, 3.75, 2.96, 3.39, 3.31, 3.15, 2.81, 1.41, 2.76, 3.19, 1.59, 2.17, 3.51, 1.84, 1.61, 1.57, 1.89, 2.74, 3.27, 2.41, 3.09, 2.43, 2.53, 2.81, 3.31, 2.35, 2.77, 2.68, 4.91, 1.57, .65, 2.12, 1.8, 0.85, 4.38, 2, 1.18, 1.71, 1.17, 2.17, 0.39, 2.79, 1.08, 2.73, 0.98, 1.73, 1.59, 1.92, 2.38, 3.56, 2.55, 3.22, 3.39, 4.9, 1.69, 3.11, 3.6, 2.05, 1.61, 2.03, 2.48, 1.25, 2.48, 1.12, 2.88, 2.87, 3.19, 1.87, 2.95, 2.67, 4.2, 2.85, 2.55, 2.17, 2.97, 3.68, 0.81, 1.22, 5.08, 1.69, 3.68, 4.70, 2.03, 2.82, 2.50, 3.22, 3.15, 1.47, 5.56, 1.84, 1.36, 2.59, 2.83, 2.56, 3.33, 2.93, and 2.97.
The second data set (gauge lengths of 10 mm) is obtained by Kundu and Raqab [28]. This data set consists of 63 observations and are listed as follows:
1.901, 2.132, 2.203, 2.228, 2.257, 2.350, 2.361, 2.396, 2.397, 2.445, 2.454, 2.474, 2.518, 2.522, 2.525, 2.532, 2.575, 2.614, 2.616, 2.618, 2.624, 2.659, 2.675, 2.738, 2.740, 2.856, 2.917, 2.928, 2.937, 2.937, 2.977, 2.996, 3.030, 3.125, 3.139, 3.145, 3.220, 3.223, 3.235, 3.243, 3.264, 3.272, 3.294, 3.332, 3.346, 3.377, 3.408, 3.435, 3.493, 3.501, 3.537, 3.554, 3.562, 3.628, 3.852, 3.871, 3.886, 3.971, 4.024, 4.027, 4.225, 4.395, and 5.020.
The third data set is censored data discussed by Sickle-Santanello et al. [29] and given by Klein and Moeschberger [30]. The data consist of death times (in weeks) of patients with cancer of the tongue with an aneuploid DNA profile. Data under study were based on the effects of ploidy on the prognosis of patients with cancers of the mouth. Patients were selected in whom a paraffin-embedded sample of the cancerous tissue had been taken, and the times to reinfection for patients with sexually transmitted diseases were calculated. Follow-up survival data were obtained on each patient. The tissue samples were examined using a flow cytometer to determine if the tumor had an aneuploid (abnormal) or diploid (normal) DNA profile using a technique discussed by Sickle-Santanello et al. [29].
The observations are 1, 3, 3, 4, 10, 13, 13, 16, 16, 24, 26, 27, 28, 30, 30, 32, 41, 51, 61, 65, 67, 70, 72, 73, 74, 77, 79, 80, 81, 87, 87, 88, 89, 91, 93, 93, 96, 97, 100, 101, 104, 104, 108, 109, 120, 131, 150, 157, 167, 231, 240, and 400.
First, we want to compare the fits of the HLXG model with the Gompertz (GO), exponentiated Burr XII (EBXII), and MO inverse Lomax (MOIL) distributions. We estimate the model parameters by using the ML method. We compare the goodness-of-fit of the models using where L denotes the log-likelihood function evaluated at the ML estimates, Akaike information criterion (AIC), corrected Akaike information criterion (CAIC), Hannan–Quinn information criterion (HQIC), and Bayesian information criterion (BIC). We also reported the Kolmogorov–Smirnov (K–S) and Anderson–Darling () statistics and their corresponding values. The pdfs of the IL, GIL, EIL, MOIL, and IW models are, respectively, given by
The ML estimates (and the corresponding standard errors in parentheses) of parameters and values of , AIC, BIC, CAIC, and HQIC and the values of K–S and statistics and their corresponding values for the first data set are displayed in Tables 10 and 11, respectively, for the second data set in Tables 12 and 13, respectively, and for the third data set, in Tables 14 and 15, respectively. The log-likelihood for the first, second, and third data sets are given in Figures 2–4, respectively.



It is observed from Tables 10–15 that the HLXG distribution has the smallest values of , AIC, CAIC, BIC, HQIC, K–S, and but the largest value among all other competitive models. Hence, it is evidenced that our proposed model performed the best for all three data sets. The relative histogram and the fitted densities and the plots of the fitted empirical cdf and the empirical sf and plots of the three data sets are displayed in Figures 5–7, respectively. These figures supported the above conclusions to some extent.



Tables 16–18 provide the ML, LS, MPS, PC, and CV estimates of the unknown parameters of the HLXG distribution based on four methods of estimation. Also, values of , AIC, BIC HQIC, and CAIC are listed. From these tables, we conclude that all estimation methods performed well.
6. Some Concluding Remarks
A new probability distribution related to xgamma, called the half-logistic xgamma distribution, is proposed in this paper. Some important statistical properties, including moments, incomplete moments, mean residual life, mean waiting time, quantile function, and stochastic ordering of the suggested model, are derived. Parameter estimation of model parameters is discussed via the maximum likelihood method from complete and censored samples. Further, survival and hazard rate function estimates are obtained for different sample sizes. Also, asymptotic confidence intervals of parameters are constructed. Good performance and accuracy of the maximum likelihood estimators of the parameters are examined through a simulation study. Based on maximum likelihood, least square, maximum product spacing, percentile, and Cramer–von Mises (CV) methods, we estimate the model parameters of the HLXG distribution using three real data sets. Furthermore, three real data sets are analyzed to demonstrate the superiority of the proposed half-logistic xgamma distribution compared to some existing distributions.
Data Availability
In order to obtain the numerical data set used to carry out the analysis reported in the manuscript, please contact Mahmoud Elsehetry at ma_sehetry@hotmail.com.
Conflicts of Interest
The authors declare no conflicts of interest.
Acknowledgments
This work was supported by the Deanship of Scientific Research (DSR), King Abdulaziz University, Jeddah, under grant no. DF-280-150-1441. The authors, therefore, gratefully acknowledge the DSR technical and financial support. The authors are dedicating this article to those who lost their lives during COVID-19 pandemic.