Next Article in Journal
The Impact of Misspecified Random Effect Distribution in a Weibull Regression Mixed Model
Previous Article in Journal
Probability and Body Composition of Metabolic Syndrome in Young Adults: Use of the Bayes Theorem as Diagnostic Evidence of the Waist-to-Height Ratio
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A New Extended Birnbaum–Saunders Model: Properties, Regression and Applications

by
Gauss Moutinho Cordeiro
1,
Maria Do Carmo Soares De Lima
1,
Edwin Moisés Marcos Ortega
2 and
Adriano Kamimura Suzuki
3,*
1
Departamento de Estatística, Universidade Federal de Pernambuco, Recife 50740-540, Brazil
2
Departamento de Ciências Exatas, Universidade de São Paulo, São Paulo 13418-900, Brazil
3
Departamento de Matemática Aplicada e Estatística, Universidade de São Paulo, São Paulo 13566-590, Brazil
*
Author to whom correspondence should be addressed.
Submission received: 19 March 2018 / Revised: 24 April 2018 / Accepted: 14 May 2018 / Published: 18 May 2018

Abstract

:
We propose an extended fatigue lifetime model called the odd log-logistic Birnbaum–Saunders–Poisson distribution, which includes as special cases the Birnbaum–Saunders and odd log-logistic Birnbaum–Saunders distributions. We obtain some structural properties of the new distribution. We define a new extended regression model based on the logarithm of the odd log-logistic Birnbaum–Saunders–Poisson random variable. For censored data, we estimate the parameters of the regression model using maximum likelihood. We investigate the accuracy of the maximum likelihood estimates using Monte Carlo simulations. The importance of the proposed models, when compared to existing models, is illustrated by means of two real data sets.

1. Introduction

Motivated by problems of vibration in commercial aircraft that caused fatigue in the materials, Birnbaum and Saunders [1] proposed the two-parameter Birnbaum–Saunders (BS) model, also known as the fatigue life distribution, with shape parameter a > 0 and scale parameter b > 0 , i.e., B S ( a , b ) . This distribution can be used to model lifetime data and it is widely applicable to model failure times of fatiguing materials. A random variable W having the B S ( a , b ) distribution is defined by
W = b a Z 2 + a Z 2 2 + 1 1 / 2 2 ,
where Z is a standard normal random variable. Its cumulative distribution function (cdf) is given by
G a , b ( x ) = Φ ( ν ) , x > 0 ,
where ν = a - 1 ρ ( x / b ) , ρ ( q ) = q 1 / 2 - q - 1 / 2 and Φ ( · ) is the standard normal cumulative function. The parameter b is the median of the distribution, i.e., G a , b ( b ) = Φ ( 0 ) = 1 / 2 . For any k > 0 , k W B S ( a , k b ) . Some distributions have been proposed to extend the fatigue lifetime BS model. For example, Cordeiro et al. [2] defined the McDonald–Birnbaum–Saunders distribution, Ortega et al. [3] introduced the β -Birnbaum–Saunders distribution and Hashimoto et al. [4] proposed the Poisson Birnbaum–Saunders model with long-term survivors. Further, Pescim et al. [5] studied the Kummer beta generalized Birnbaum–Saunders distribution, Cordeiro et al. [6] introduced the negative binomial Birnbaum–Saunders model, Cordeiro et al. [7] proposed the gamma-Birnbaum–Saunders distribution, Ortega et al. [8] constructed the odd Birnbaum–Saunders regression model and Ortega et al. [9] studied a four-parameter odd Birnbaum–Saunders geometric distribution.
The probability density function (pdf) corresponding to Equation (2) is given by
g a , b ( x ) = κ ( a , b ) x - 3 / 2 ( x + b ) exp - τ ( x / b ) 2 a 2 , x > 0 ,
where κ ( a , b ) = ( 2 a e a 2 2 π b ) - 1 and τ ( q ) = q + q - 1 . The fractional moments of Equation (3) are determined as E ( W p ) = b p I ( p , a ) ([10]), where
I ( p , a ) = θ p + 1 / 2 ( a - 2 ) + θ p - 1 / 2 ( a - 2 ) 2 θ 1 / 2 ( a - 2 ) ,
and θ ν ( z ) = 0 . 5 - exp { - z cosh ( t ) - ν t } d t denotes the modified Bessel function of the third kind with ν representing its order and z the argument. A discussion of this function can be found in [11].
Recently, some attempts have been made to define new classes of lifetime distributions that provide great flexibility in modeling skewed data in practice. For any baseline cdf G(x) ( x R ), Gleaton and Lynch [12] defined the cdf F ( x ) of the odd log-logistic-G (“OLL-G” for short) class by
F ( x ; τ , α ) = 0 G ( x ; τ ) 1 - G ( x ; τ ) α t α - 1 ( 1 + t α ) 2 d t = G ( x ; τ ) α G ( x ; τ ) α + G ¯ ( x ; τ ) α .
The pdf corresponding to Equation (5) is given by
f ( x ; τ , α ) = α g ( x ; τ ) G ( x ; τ ) α - 1 G ¯ ( x ; τ ) α - 1 G ( x ; τ ) α + G ¯ ( x ; τ ) α 2 .
In addition, some authors have investigated the OLL-G class in different settings. For instance, da Cruz et al. [13] proposed the log-odd log-logistic Weibull regression model in survival analysis, Braga et al. [14] studied the odd log-logistic Student t distribution, da Cruz et al. [15] considered the bivariate odd-log-logistic-Weibull regression model for oral health-related quality of life, Cordeiro et al. [16] proposed the generalized odd log-logistic family and, recently, Prataviera et al. [17] introduced the heteroscedastic odd log-logistic generalized gamma regression model for censored data. We use a similar method to define a new distribution by compounding the Birnbaum–Saunders and Poisson distributions whose procedure is described below.
The method of compounding discrete and continuous distributions started fifteen years ago to model complex situations under series and parallel structures. Many compounded classes of distributions can be constructed by choosing a positive integer discrete random variable N with probability mass function (pmf) P ( N = n ) . Let Z 1 , , Z N be independent identically distributed (iid) random variables with common cdf given in Equation (5). The minimum and maximum random variables min { Z 1 , , Z N } and max { Z 1 , , Z N } can generate several models that arise in series and parallel systems with identical components and have many industrial and biological applications. For example, the time to the failure of a series system with an unknown number of protected components or the cancer recurrence of an individual after undergoing a certain treatment can be modeled by the generated distribution of the minimum random variable. In a dual mechanism, the time to the failure of a parallel system with an unknown number of protected components or a disease manifestation if it occurs only after an unknown number of factors have been active can be modeled by the generated distribution of the maximum random variable.
The odd log-logistic-G- Poisson (OLLG-P for short) family is defined by the maximum random variable M N = max { Z 1 , , Z N } , under the assumption that the Z i s are iid OLLG random variables independent of N, by taking N as a truncated Poisson random variable with pmf
P ( N = n ) = β n ( e β - 1 ) n ! , n = 1 , 2 , , β > 0 .
Then, the cdf of the OLLG-P family can be expressed as
F ( x ; τ , α , β ) = n = 1 P ( M N x | N = n ) P ( N = n ) = 1 ( e β - 1 ) exp β G ( x ; τ ) α G ( x ; τ ) α + G ¯ ( x ; τ ) α - 1 .
The pdf of the OLLG-P family reduces to
f ( x ; τ , α , β ) = α β g ( x ; τ ) G ( x ; τ ) α - 1 G ¯ ( x ; τ ) α - 1 exp β G ( x ; τ ) α G ( x ; τ ) α + G ¯ ( x ; τ ) α ( e β - 1 ) G ( x ; τ ) α + G ¯ ( x ; τ ) α 2 ,
where α > 0 ; β > 0 ; G ( x ; τ ) and g ( x ; τ ) denote the cdf and pdf of the baseline distribution, respectively; and τ is the baseline parameter vector.
In this paper, we propose a new lifetime model named the odd log-logistic Birnbaum–Saunders–Poisson (OLLBSP) distribution from Equations (8) and (9) to extend the BS model. We obtain some of its structural properties and discuss maximum likelihood estimation of the model parameters.
Regression models can be proposed in different forms in survival and reliability analysis. Among them, the location-scale regression model and Cox proportional-hazards model are distinguished and frequently used. We also construct a location-scale regression model based on the OLLBSP distribution, called the log-odd log-logistic Birnbaum–Saunders–Poisson (LOLLBSP) regression model for censored data, as a feasible alternative to the log-BS regression model. We consider a classic analysis for the LOLLBSP regression model. The inferential part is carried out using the asymptotic distribution of the maximum likelihood estimators (MLEs).
The paper is outlined as follows. In Section 2, we define the OLLBSP distribution and provide plots of its density and hazard rate functions. In Section 3, we present the maximum likelihood method to estimate the model parameters and a simulation study. In Section 4, we derive a useful linear representation for the OLLBSP density function and obtain its ordinary moments. In Section 5, we define the LOLLBSP regression model for censored data and estimate the model parameters by maximum likelihood. In Section 6, we prove empirically the potentiality of the new distribution for fatigue life data and the flexibility and relevance of the proposed regression model by means of two applications to real data sets. Finally, Section 7 ends with some concluding remarks.

2. The OLLBSP Distribution

Most generalized BS distributions have been proposed in reliability to provide a better fit to certain data sets than the traditional two- and three-parameter BS models. Providing a new lifetime distribution is always precious for statisticians. The pdf and cdf of the OLLBSP distribution are defined (for x > 0 ) by substituting Equations (2) and (3) into Equations (8) and (9)
F ( x ) = 1 ( e β - 1 ) exp β Φ ( ν ) α Φ ( ν ) α + 1 - Φ ( ν ) α - 1
and
f ( x ) = α β κ ( a , b ) x - 3 / 2 ( x + b ) exp - τ ( x / b ) 2 a 2 Φ ( ν ) α - 1 1 - Φ ( ν ) α - 1 exp β Φ ( ν ) α Φ ( ν ) α + 1 - Φ ( ν ) α ( e β - 1 ) Φ ( ν ) α + 1 - Φ ( ν ) α 2 ,
respectively. Evidently, the density function in Equation (11) does not involve any complicated function. Hereafter, we denote by X OLLBSP ( α , β , a , b ) a random variable having the density in Equation (11).
Figure 1 displays some possible shapes of the density function in Equation (11) for selected parameter values. It is evident that the OLLBSP distribution is much more flexible than the BS distribution presenting one additional shapes, namely bi-modality. Some real situations require bimodal density behavior and a distribution that presents this shape is a good tentative model. Further, there are not many distributions having this shape.
The hazard rate function (hrf) corresponding to Equation (11) is given by
h ( x ) = α β κ ( a , b ) x - 3 / 2 ( x + b ) exp - τ ( x / b ) 2 a 2 Φ ( ν ) α - 1 1 - Φ ( ν ) α - 1 exp β Φ ( ν ) α Φ ( ν ) α + 1 - Φ ( ν ) α exp ( β ) - exp β Φ ( ν ) α Φ ( ν ) α + 1 - Φ ( ν ) α Φ ( ν ) α + 1 - Φ ( ν ) α 2 .
Plots of the hrf h ( x ) for some parameter values are displayed in Figure 2. We note that these plots illustrate the four classic types of hazard shapes. Further, we note the increasing–decreasing–increasing shape, which is important because there are fewer lifetime distributions that yield complicated shapes similar to this. Further, we require a mixture of two or more lifetime distributions to model this shape. Some data sets present this behavior, usually in regression study, when we have different levels of explanatory variables. Thus, probably, this model will have a better fit to certain data sets than other known models, as we show in the Application Section.
The new distribution is simulated as follows: if U is a uniform random variable ( 0 , 1 ) , then
X = Q B S V 1 / α V 1 / α + [ 1 - V ] 1 / α ,
has the density in Equation (11), where Q B S ( · ) is the BS quantile function (qf) and
V = log U [ exp ( β ) - 1 ] + 1 β .

3. Inference and Estimation

In this section, we discuss the maximum likelihood method for inference and estimation of the model parameters of the OLLBSP distribution. Let x 1 , , x n be a random sample of size n from the OLLBSP ( α , β , a , b ) distribution. The log-likelihood function for the vector of parameters θ = ( α , β , a , b ) T is given by
l ( θ ) = n log [ α β κ ( a , b ) ] - n log ( e β - 1 ) - 3 2 i = 1 n log ( x i ) + i = 1 n log ( x i + b ) - 1 2 a 2 i = 1 n τ x i b + ( α - 1 ) i = 1 n log [ Φ ( ν i ) ] + ( α - 1 ) i = 1 n log [ 1 - Φ ( ν i ) ] + β i = 1 n Φ α ( ν i ) Φ α ( ν i ) + [ 1 - Φ ( ν i ) ] α + 2 i = 1 n log { Φ α ( ν i ) + [ 1 - Φ ( ν i ) ] α } .
Maximization of Equation (15) can be performed using well-established routines such as nlm or optimize in the R statistical package. Setting these equations to zero, U α ( θ ) = 0 , U β ( θ ) = 0 , U a ( θ ) = 0 , U b ( θ ) = 0 and solving them simultaneously gives the MLE θ ^ of θ . These equations cannot be solved analytically and statistical or mathematical software can be used to evaluate them numerically using iterative techniques such as the Newton–Raphson algorithm.
For interval estimation and tests of hypotheses on the parameters in θ , we require the 4 × 4 observed information matrix J = J ( θ ) = { j r , s } , whose elements j r , s (for r , s = α , β , a , b ) can be evaluated numerically. Under standard regularity conditions, the approximate multivariate normal N 4 ( 0 , J ( θ ^ ) - 1 ) distribution can be used to construct approximate confidence intervals for the model parameters.
The likelihood ratio (LR) statistics are useful for comparing the new distribution with some special models. For example, we may use the LR statistic to check if the fit using the OLLBSP distribution is statistically “superior” to the fit using the BS distribution for a given data set.
We assess the finite sample performance of the MLEs in the OLLBSP distribution by varying the sample size n using the inverse method with Equation (13). The sample sizes are taken as n { 50 , 100 , 200 , 300 } . We perform a Monte Carlo simulation study with 1000 replications under three setups for parameters of the OLLBSP distribution: a = 2 . 5 , b = 1 . 5 , α = 2 and β = 0 . 75 (Setup 1); a = 0 . 75 , b = 3 , α = 1 and β = 2 (Setup 2); and a = 1 . 5 , b = 3 , α = 0 . 75 and β = 1 . 25 (Setup 3).
We conclude from the figures in Table 1 that the average estimates of the parameters tend to be closer to the true parameters when n increases. This fact supports that the asymptotic normal distribution provides an adequate approximation to the finite sample distribution of the MLEs. The normal approximation can be oftentimes improved by using bias adjustments to these estimators. Approximations to the their biases in simple models may be determined analytically. Bias correction typically does a very good job for correcting the MLEs. However, it may also increase the mean squared errors (MSEs). Whether bias correction is useful in practice depends basically on the shape of the bias function and on the variance of the MLE.

4. Structural Properties

4.1. Linear Representation

Expansions for Equations (10) and (11) can be derived using the concept of exponentiated distributions to obtain some structural properties of the OLLBSP distribution. Cordeiro et al. [2] defined the exponentiated Birnbaum–Saunders (EBS) distribution with positive parameters α , β and c, say Y E B S ( a , b , c ) , if its cdf and pdf are given by H ( y ; a , b , c ) = Φ ( ν ) c and h ( y ; a , b , c ) = c g a , b ( y ) Φ ( ν ) c - 1 , respectively, where Φ ( ν ) is defined in Equation (2) and g a , b ( y ) is the BS ( a , b ) pdf. The properties of some exponentiated distributions have been studied by several authors (see, for example, Mudholkar and Srivastava [18] for exponentiated Weibull (EW) and [19] for the exponentiated generalized gamma (EGG) distributions).
By using Taylor expansion, we can write Equation (10) as
F ( x ) = 1 e β - 1 i = 1 β i i ! Φ ( ν ) i α Φ ( ν ) α + Φ ¯ ( ν ) α i .
For p real non-integer and | z | < 1 , the convergent power series
( 1 - z ) p = j = 0 ( - 1 ) j p j z j
holds, where the binomial coefficient is defined for any real. By applying this equation twice to Φ ( ν ) i α = { 1 - [ 1 - Φ ( ν ) ] } i α and interchanging sums, we obtain the convergent power series (for α > 0 )
Φ ( ν ) i α = k = 0 a k ( i α ) Φ ( ν ) k ,
where (for k 0 )
a k ( i α ) = r = k ( - 1 ) k + r i α r r k .
By using the binomial expansion and Equation (17), we obtain (for i 1 )
Φ ( ν ) α + Φ ¯ ( ν ) α i = Φ ( ν ) i α + r = 1 i j = 0 d r , j ( i , α ) Φ ( ν ) ( i - r ) α + j ,
where d r , j ( i , α ) = ( - 1 ) j i r r α j .
Further, using Equation (18) in the two powers of the right-hand side and interchanging sums, we have
Φ ( ν ) α + Φ ¯ ( ν ) α i = k = 0 b k ( i , α ) Φ ( ν ) k ,
where b k ( i , α ) = a k ( i α ) + r = 1 i j = 0 d r , j ( i , α ) a k ( ( i - r ) α + j ) . Hence, from Equations (18) and (20), we obtain the ratio of two convergent power series
Φ ( ν ) i α Φ ( ν ) α + Φ ¯ ( ν ) α i = k = 0 a k ( i α ) Φ ( ν ) k k = 0 b k ( i , α ) Φ ( ν ) k = k = 0 c k ( i , α ) Φ ( ν ) k ,
where the coefficient c k ( i , α ) can be determined recursively as
c k ( i , α ) = 1 b 0 ( i , α ) a k ( i α ) - 1 b 0 ( i , α ) r = 1 k b r ( i , α ) c k - r ( i , α ) .
Then, we can write
F ( x ) = k = 0 p k H k ( x ) ,
where (for k 0 )
p k = 1 e β - 1 i = 1 β i c k ( i , α ) i !
and H k ( x ) = Φ ( ν ) k denotes the EBS cdf with parameters α , β and k.
Consequently, the pdf of X can be expressed as
f ( x ) = k = 0 p k + 1 h k + 1 ( x ) ,
where h k + 1 ( x ) denotes the E B S ( k + 1 , a , b ) density function. Based on Equation (23), some mathematical properties of the OLLBSP distribution can be obtained by knowing those of the EBS distribution.

4.2. Moments

The ordinary moments of X can be derived from the probability weighted moments (PWMs) ([20]) of the BS distribution formally defined for p and r non-negative integers by
τ p , r = 0 x p Φ ( ν ) r g a , b ( x ) d x .
The integral in Equation (24) can be computed numerically in software such as MAPLE, MATLAB, MATHEMATICA, Ox and R. Cordeiro and Lemonte [21] derived an expression for τ p , r given by
τ p , r = β p 2 r j = 0 r r j k 1 , , k j A ( k 1 , , k j ) m = 0 2 s j + j ( - 1 ) m 2 s j + j m I p + ( 2 s j + j - 2 m ) 2 , α ,
where s j = k 1 + + k j , A ( k 1 , , k j ) = α - 2 s j - j a ¯ k 1 , , a ¯ k j , a ¯ k = ( - 1 ) k 2 ( 1 - 2 k ) / 2 [ π ( 2 k + 1 ) ] - 1 and I ( p + ( 2 s j + j - 2 m ) / 2 , α ) is obtained from Equation (4). These algebraic platforms currently have the ability to deal with analytic expressions of formidable size and complexity.
The sth moment of X can be expressed from Equation (23) as
μ s = k = 0 ( k + 1 ) p k + 1 τ s , k ,
where τ s , k is given by Equation (25). Equation (26) can be computed numerically in any symbolic mathematical software by taking in the sum a large number of summands instead of infinity.
The central moments ( μ r ) and cumulants ( κ r ) of X are μ r = k = 0 r ( - 1 ) k r k μ 1 k μ r - k and κ r = μ r - k = 1 r - 1 r - 1 k - 1 κ k μ r - k , respectively, where κ 1 = μ 1 .
Plots of the skewness and kurtosis of X, for fixed values of a, b and β , as functions of α are displayed in Figure 3a,b, respectively. Plots of the skewness of X for a, b and α fixed, as functions of β are displayed in Figure 3c,d, respectively. We can note that it is possible to obtain negative skewness for some parameter values.

5. The LOLLBSP Regression Model with Censored Data

If X is a random variable following the OLLBSP density function in Equation (11), the random variable Y = l o g ( X ) has the LOLLBSP distribution (also referred to as the odd log-logistic sinh-normal Poisson distribution). The density function of Y obtained by replacing b = e μ and performing two re-parameterizations reduces to (for y R )
f Y ( y ) = 2 β α ξ 1 exp ( - 2 a 2 ξ 2 2 ) { Φ ( 2 a ξ 2 ) [ 1 - Φ ( 2 a ξ 2 ) ] } α - 1 a σ 2 π [ exp ( β ) - 1 ] { Φ α ( 2 a ξ 2 ) + [ 1 - Φ ( 2 a ξ 2 ) ] α } 2 exp β Φ α ( 2 a ξ 2 ) Φ α ( 2 a ξ 2 ) + [ 1 - Φ ( 2 a ξ 2 ) ] α ,
where
ξ 1 = cosh y - μ σ and ξ 2 = sinh y - μ σ .
The parameter μ R is a location parameter, σ > 0 is a scale parameter and β , α and a are positive shape parameters. We refer to Equation (27) as the non-standard LOLLBSP model. Thus,
if X OLLBSP ( α , β , a , b ) , then Y = log ( X ) LOLLBSP ( α , β , a , σ , μ ) .
The survival function of Y is given by
S ( y ) = exp ( β ) - exp β Φ α ( 2 a ξ 2 ) Φ α ( 2 a ξ 2 ) + [ 1 - Φ ( 2 a ξ 2 ) ] α [ exp ( β ) - 1 ] - 1 .
The pdf of the standardized random variable Z = ( Y - μ ) / σ is given by
f Z ( z ) = 2 β α cosh ( z ) exp [ - 2 a 2 sinh 2 ( z ) ] { Φ [ 2 a sinh ( z ) ] { 1 - Φ [ 2 a sinh ( z ) ] } } α - 1 a 2 π [ exp ( β ) - 1 ] { Φ α [ 2 a sinh ( z ) ] + { 1 - Φ [ 2 a sinh ( z ) ] } α } 2 × exp β Φ α [ 2 a sinh ( z ) ] Φ α [ 2 a sinh ( z ) ] + { 1 - Φ [ 2 a sinh ( z ) ] } α .
Plots of the density function in Equation (27) for selected parameter values are given in Figure 4. These plots show great flexibility for different values of the shape parameters α and β . They indicate that the density function in Equation (27) is very flexible, especially for modeling bimodal and asymmetric data and hence can be used in several applications.
In many practical applications, the lifetimes are affected by explanatory variables and parametric regression models are adopted to estimate univariate survival functions for censored data. A parametric model that provides a good fit to lifetime data tends to yield more precise estimates of the quantities of interest. Based on the standardized LOLLBSP density, we propose a linear location-scale regression model linking the response variable y i and the explanatory variable vector v i T = ( v i 1 , , v i p ) by
y i = v i T τ + σ z i , i = 1 , , n ,
where the random error z i has the density function in Equation (29), τ = ( τ 1 , , τ p ) T , a > 0 , α > 0 , β > 0 and σ > 0 are unknown parameters. The parameter μ i = v i T β is the location of y i . The location parameter vector μ = ( μ 1 , , μ n ) T is given by a linear model μ = v β , where V = ( v 1 , , v n ) T is a known model matrix. The LOLLBSP regression model in Equation (30) opens new possibilities for fitting many different types of data.
Consider a sample ( y 1 , v 1 ) , , ( y n , v n ) of n independent observations, where each random response is defined by y i = min { log ( t i ) , log ( c i ) } . We assume non-informative censoring such that the observed lifetimes and censoring times are independent. Let F and C be the sets of individuals for which y i is the log-lifetime or log-censoring, respectively. Conventional likelihood estimation techniques can be applied here. The log-likelihood function for the vector of parameters θ = ( α , β , a , τ T , σ ) T from the model in Equation (30) has the form l ( θ ) = i F l i ( θ ) + i C l i ( c ) ( θ ) , where l i ( θ ) = log [ f ( y i ) ] , l i ( c ) ( θ ) = log [ S ( y i ) ] , f ( y i ) is the density in Equation (27) and S ( y i ) is the survival function in Equation (28) of Y i . The total log-likelihood function for θ reduces to
l ( θ ) = q log 2 β α a σ π - n log [ exp ( β ) - 1 ] + i F log ( ξ i 1 ) - 2 a 2 i F ξ i 2 2 + ( α - 1 ) i F log Φ 2 a ξ i 2 + ( α - 1 ) i F log 1 - Φ 2 a ξ i 2 - 2 i F log Φ α 2 a ξ i 2 + 1 - Φ 2 a ξ i 2 α + β i F Φ α 2 a ξ i 2 Φ α 2 a ξ i 2 + 1 - Φ 2 a ξ i 2 α + i C log exp ( β ) - exp β Φ α ( 2 a ξ i 2 ) Φ α ( 2 a ξ i 2 ) + [ 1 - Φ ( 2 a ξ i 2 ) ] α ,
where q is the observed number of failures and
ξ i 1 = ξ i 1 ( θ ) = cosh y i - v i T β σ , ξ i 2 = ξ i 2 ( θ ) = sinh y i - v i T β σ ,
for i = 1 , , n . The MLE θ ^ of the vector of unknown parameters can be calculated by maximizing the log-likelihood of Equation (31). We use the NLMixed procedure in SAS to calculate the estimate θ ^ . Initial values for a are taken from the fit of the LBS regression model with α = 1 . From the fitted LOLLBSP model, we obtain the estimated survival function for y i given by
S ( y i ; a ^ , α ^ , β ^ , σ ^ , τ ^ T ) = exp ( β ^ ) - exp β ^ Φ α ^ ( 2 a ξ ^ i 2 ) Φ α ^ ( 2 a ξ ^ i 2 ) + [ 1 - Φ ( 2 a ^ ξ ^ i 2 ) ] α ^ [ exp ( β ^ ) - 1 ] - 1 ,
where ξ ^ i 2 = sinh y i - v i T β ^ σ ^ . Under standard regularity, the asymptotic distribution of ( θ ^ - θ ) is multivariate normal N p + 4 ( 0 , K ( θ ) - 1 ) , where K ( θ ) is the expected information matrix. The asymptotic covariance matrix K ( θ ) - 1 of θ ^ can be approximated by the inverse of the ( p + 4 ) × ( p + 4 ) observed information matrix - L ¨ ( θ ) . The approximate multivariate normal distribution N p + 4 ( 0 , - L ¨ ( θ ) - 1 ) for θ ^ can be used in the classical way to construct approximate confidence regions for some parameters in θ .

6. Applications

In this section, we present two applications to prove empirically the flexibility of the OLLBSP distribution. We compare the results of the fits of the OLLBSP model and other models under some goodness-of-fit statistics. In fact, we compare the fits of the OLLBSP distribution with six other lifetime models, namely:
  • BS distribution [1];
  • Marshall–Olkin extended-BS (MOEBS) distribution [22];
  • Odd log-logistic BS (OLLBS) distribution [7];
  • Beta BS (BBS) distribution [21];
  • Kumaraswamy BS (KwBS) distribution [23]; and
  • Exponentiated generalized BS (EGBS) distribution [24].
The MLEs of the model parameters are computed and the goodness-of-fit statistics for these models fitted to the data sets are compared. These estimates are evaluated using the BFGS script as well as the goodness-of-fit measures. The MLEs of the parameters and these measures including the Cramér-von Mises ( W * ) and Anderson–Darling ( A * ) statistics defined by Chen and Balakrishnan [25] and Kolmogorov–Smirnov (K-S) statistics are calculated to compare the fitted models. In general, the smaller the values of these statistics, the better is the fit to the data. All computations are performed using the R statistical software [26].

6.1. Data Set 1: Breaking Stress of Carbon Fibres (In Gba)

As a first application, we consider the data set consisting of n = 100 observations on breaking stress of carbon fibers (in Gba) reported by [27]. A brief summary of these data are: mean = 2 . 621 , median = 2 . 700 , standard deviation = 1 . 013885 , skewness = 0 . 36264 , kurtosis = 0 . 0431 , minimum = 0 . 390 , and maximum = 5 . 560 . The data have positive skewness and kurtosis. These values indicate that the empirical distribution is skewed to the right and leptokurtic.
Table 2 lists the MLEs of the parameters and their standard errors (in parentheses) from the fitted models and the measures W * , A * , K-S and its p-value. The results indicate that the OLLBSP model has the smallest values of these statistics and large p-value for the K-S distance among the fitted models, and therefore it could be chosen as the best model to the current data.
Plots of the estimated pdf and the empirical and estimated cdf of the OLLBSP model are displayed in Figure 5. It is clear from these plots that the OLLBSP model provides the best fit to the histogram of the breaking stress data.

6.2. Data Set 2: Ambient Temperature Data

The data set consists of failure times (X) of eight components at three different temperatures. The data are obtained from [28]. The original sample size is n = 24 . The following variables are associated with each component: x i , observed time (in years); and v i 1 , temperature (temperature category: 100, 120 and 140), for i = 1 , 2 , , 24 . We analyze these data using the LOLLBSP regression model. First, we consider the regression model
y i = τ 0 + τ 1 v i 1 + σ z i , i = 1 , , 24 ,
where the random errors z i ’s ( i = 1 , , n ) are independent random variables having the density function in Equation (29) and y i = log ( x i ) for i = 1 , , 24 . These data were originally analyzed by Cordeiro et al. [2] using the log-McDonald–Birnbaum–Saunders (LMcBS) regression model. The LMcBS distributions includes as special cases two important distributions: the log-beta Birnbaum–Saunders (LBBS) model and the log-Birnbaum–Saunders (LBS) model. Next, we compare the proposed regression model with other known regression models.
The values of the AIC (Akaike Information Criterion), CAIC (Corrected Akaike’s Information Criterion) and BIC (Bayesian Information Criterion) statistics to compare the LOLLBSP, LMcBS, LBBS and LBS regression models are listed in Table 3. The LOLLBSP regression model outperforms the LMcBS, LBBS and LBS models, irrespective of the criteria and thus the proposed regression model can be used effectively in the analysis of these data.
Table 4 lists the MLEs of the parameters for these regression models fitted to the current data using the NLMixed procedure in SAS. Iterative maximization of the logarithm of the likelihood function in Equation (31) starts with initial values for τ and σ taken from the fit of the LBS regression. We note from the fitted LOLLBSP regression model that the explanatory variable v 1 is significant at 1 % and that there is a significant difference among the levels of the temperature for the failure times.
A graphical comparison among the levels of the explanatory variable v 1 is given in Figure 6a. These plots provide the Kaplan–Meier (KM) estimate and the estimated survival function of the LOLLBSP regression model given by Equation (32) for different levels of v 1 . The plot of the LOLLBSP hrf in Figure 6b reveals that it has an increasing–decreasing–increasing shape. As discussed in Section 2, the OLLBSP distribution is adequate for this type of hrf (see Figure 6c). Based on these plots, it is clear that the LOLLBSP regression model gives a superior fit to the ambient temperature data.

7. Concluding Remarks

We propose the odd log-logistic Birnbaum–Saunders–Poisson (OLLBSP) model to extend the Birnbaum–Saunders (BS) distribution pioneered by Birnbaum and Saunders [1]. We provide a mathematical treatment for the new distribution including expansions for the cumulative and density functions. We derive explicit expressions for the ordinary moments. The estimation of the model parameters is approached by the maximum likelihood method. We verify by Monte Carlo simulations that the normal distribution provides an adequate first-order approximation to the finite sample distribution of the maximum likelihood estimators. A new regression model is defined based on the logarithm of the OLLBSP random variable. We provide two applications to real data sets to illustrate the usefulness of the proposed models in the literature of the fatigue life distributions.

Author Contributions

The authors Gauss Moutinho Cordeiro and Maria do Carmo Soares de Lima have developed the whole theoretical part of the research. The authors Adriano Kamimura Suzuki and Edwin Moisés Marcos Ortega have developed the regression part and applications.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Birnbaum, Z.W.; Saunders, S.C. A new family of life distributions. J. Appl. Probab. 1969, 6, 319–327. [Google Scholar] [CrossRef]
  2. Cordeiro, G.M.; Lemonte, A.J.; Ortega, E.M.M. An extended fatigue life distribution. Statistics 2011, 47, 626–653. [Google Scholar] [CrossRef]
  3. Ortega, E.M.M.; Cordeiro, G.M.; Lemonte, A.J. A log-linear regression model for the beta-Birnbaum-Saunders distribution with censored data. Comput. Stat. Data Anal. 2012, 56, 698–718. [Google Scholar] [CrossRef]
  4. Hashimoto, E.M.; Ortega, E.M.M.; Cordeiro, G.M.; Cancho, V.G. The Poisson Birnbaum-Saunders model with long-term survivors. J. Theor. Appl. Stat. 2013, 48, 1394–1413. [Google Scholar] [CrossRef]
  5. Pescim, R.R.; Cordeiro, G.M.; Nadarajah, S.; Demétrio, C.G.B.; Ortega, E.M.M. A New Extended Birnbaum-Saunders Distribution. Univ. Bull. Natl. Sci. Eng. Ser. B Math. Stat. 2014, 43, 473–510. [Google Scholar]
  6. Cordeiro, G.M.; Cancho, V.G.; Ortega, E.M.M.; Barriga, G.D.C. A model with long-term survivors: Negative binomial Birnbaum-Saunders. Commun. Stat. Theory Methods 2015, 45, 1370–1387. [Google Scholar] [CrossRef]
  7. Cordeiro, G.M.; Lima, M.C.S.; Cysneiros, A.H.M.A.; Pascoa, M.A.R.; Ortega, E.M.M. An extended Birnbaum-Saunders distribution: Theory, estimation, and applications. Commun. Stat. Theory Methods 2016, 45, 2268–2297. [Google Scholar] [CrossRef]
  8. Ortega, E.M.M.; Lemonte, A.J.; Cordeiro, G.M.; da Cruz, J.N. The odd Birnbaum-Saunders regression model with applications to lifetime data. J. Stat. Theory Pract. 2016, 10, 780–804. [Google Scholar] [CrossRef]
  9. Ortega, E.M.M.; Cordeiro, G.M.; Suzuki, A.K.; Ramires, T.G. A new extended Birnbaum-Saunders model with cure fraction: classical and Bayesian approach. Commun. Stat. Appl. Methods 2017, 24, 397–419. [Google Scholar] [CrossRef]
  10. Rieck, J.R. A moment generating function with application to the Birnbaum-Saunders distribution. Commun. Stat. Theory Methods 1999, 28, 2213–2222. [Google Scholar] [CrossRef]
  11. Watson, G.N. A Treatise on the Theory of Bessel Functions; Cambridge University Press: Cambridge, UK, 1995. [Google Scholar]
  12. Gleaton, J.U.; Lynch, J.D. Properties of generalized log-logistic families of lifetime distributions. J. Probab. Stat. Sci. 2006, 4, 51–64. [Google Scholar]
  13. Da Cruz, J.N.; Ortega, E.M.M.; Cordeiro, G.M. The log-odd log-logistic Weibull regression model: Modelling, estimation, influence diagnostics and residual analysis. J. Stat. Comput. Simul. 2016, 86, 1516–1538. [Google Scholar] [CrossRef]
  14. Braga, A.S.; Cordeiro, G.M.; Ortega, E.M.M.; Silva, G.O. The Odd Log-Logistic Student t Distribution: Theory and Applications. J. Agric. Biol. Environ. Stat. 2017, 22, 615–639. [Google Scholar] [CrossRef]
  15. Da Cruz, J.N.; Ortega, E.M.M.; Cordeiro, G.M.; Suzuki, A.K.; Mialhe, F.L. Bivariate odd-log-logistic-Weibull regression model for oral health-related quality of life. Commun. Stat. Appl. Methods 2017, 24, 271–290. [Google Scholar] [CrossRef]
  16. Cordeiro, G.M.; Alizadeh, M.; Ozel, G.; Hosseini, B.; Ortega, E.M.M.; Altun, E. The generalized odd log-logistic family of distributions: Properties, regression models and applications. J. Stat. Computat. Simul. 2017, 87, 908–932. [Google Scholar] [CrossRef]
  17. Prataviera, F.; Ortega, E.M.M.; Cordeiro, G.M.; Braga, A.S. The heteroscedastic odd log-logistic generalized gamma regression model for censored data. Commun. Stat. Simul. Comput. 2018, 48, 1–25. [Google Scholar] [CrossRef]
  18. Mudholkar, G.S.; Srivastava, D.K. Exponentiated Weibull family for analyzing bathtub failure-real data. IEEE Trans. Reliab. 1993, 42, 299–302. [Google Scholar] [CrossRef]
  19. Cordeiro, G.M.; Ortega, E.M.M.; Silva, G.O. The exponentiated generalized gamma distribution with application to lifetime data. J. Stat. Comput. Simul. 2011, 81, 827–842. [Google Scholar] [CrossRef]
  20. Greenwood, J.A.; Landwehr, J.M.; Matalas, N.C.; Wallis, J.R. Probability weighted moments: Definition and relation to parameters of several distributions expressible in inverse form. Water Resour. Res. 1979, 15, 1049–1054. [Google Scholar] [CrossRef]
  21. Cordeiro, G.M.; Lemonte, A.J. The beta Birnbaum-Saunders distribution: An improved distribution for fatigue life modeling. Comput. Stat. Data Anal. 2011, 55, 1445–1461. [Google Scholar] [CrossRef]
  22. Lemonte, A.J. A new extension of the Birnbaum-Saunders distribution. Braz. J. Probab. Stat. 2013, 27, 133–149. [Google Scholar] [CrossRef]
  23. Saulo, H.; Leão, J.; Bourguignon, M. The Kumaraswamy Birnbaum-Saunders distribution. J. Stat. Theory Pract. 2012, 6, 745–759. [Google Scholar] [CrossRef]
  24. Cordeiro, G.M.; Lemonte, A.J. The exponentiated generalized Birnbaum-Saunders distribution. Appl. Math. Comput. 2014, 247, 762–779. [Google Scholar] [CrossRef]
  25. Chen, G.; Balakrishnan, N. A general purpose approximate goodness-of-fit test. J. Qual. Technol. 1995, 27, 154–161. [Google Scholar] [CrossRef]
  26. R Development Core Team. R: A Language and Environment for Statistical Computing R Foundation for Statistical Computing; the R Foundation for Statistical Computing: Vienna, Austria, 2012; ISBN 3-900051-07-0. [Google Scholar]
  27. Nichols, M.D.; Padgett, W.J. A Bootstrap control chart for Weibull percentiles. Qual. Reliab. Eng. Int. 2006, 22, 141–151. [Google Scholar] [CrossRef]
  28. Murthy, D.N.P.; Xie, M.; Jiang, R. Weibull Models; John Wiley and Sons: Hoboken, NJ, USA, 2004. [Google Scholar]
Figure 1. Plots of the OLLBSP density for some parameter values: (a) for a = 0 . 1 , b = 0 . 5 and β = 0 . 5 ; (b) for a = 0 . 1 , b = 0 . 5 and α = 1 . 5 ; and (c) for a = 0 . 1 and b = 0 . 5 .
Figure 1. Plots of the OLLBSP density for some parameter values: (a) for a = 0 . 1 , b = 0 . 5 and β = 0 . 5 ; (b) for a = 0 . 1 , b = 0 . 5 and α = 1 . 5 ; and (c) for a = 0 . 1 and b = 0 . 5 .
Stats 01 00004 g001
Figure 2. Plots of the OLLBSP hrf for some parameter values: (a) for a = 0 . 1 , b = 0 . 5 and β = 0 . 5 ; (b) for a = 1 . 0 , b = 0 . 1 and α = 0 . 5 ; and (c) for a = 0 . 1 and b = 0 . 5 .
Figure 2. Plots of the OLLBSP hrf for some parameter values: (a) for a = 0 . 1 , b = 0 . 5 and β = 0 . 5 ; (b) for a = 1 . 0 , b = 0 . 1 and α = 0 . 5 ; and (c) for a = 0 . 1 and b = 0 . 5 .
Stats 01 00004 g002
Figure 3. Skewness and kurtosis of the OLLBSP distribution as functions of α and β : (a) Skewness varying α ; (b) Kurtosis varying α ; (c) Skewness varying β ; and (d) Kurtosis varying β .
Figure 3. Skewness and kurtosis of the OLLBSP distribution as functions of α and β : (a) Skewness varying α ; (b) Kurtosis varying α ; (c) Skewness varying β ; and (d) Kurtosis varying β .
Stats 01 00004 g003
Figure 4. Plots of the LOLLBSP density for some parameter values: (a) for β = 0 . 5 , a = 2 , μ = 0 and σ = 1 ; (b) for α = 0 . 5 , a = 2 , μ = 0 and σ = 1 ; and (c) for a = 2 , μ = 0 and σ = 1 .
Figure 4. Plots of the LOLLBSP density for some parameter values: (a) for β = 0 . 5 , a = 2 , μ = 0 and σ = 1 ; (b) for α = 0 . 5 , a = 2 , μ = 0 and σ = 1 ; and (c) for a = 2 , μ = 0 and σ = 1 .
Stats 01 00004 g004
Figure 5. (a) Estimated densities for some fitted models; (b) Estimated cdfs for some fitted models.
Figure 5. (a) Estimated densities for some fitted models; (b) Estimated cdfs for some fitted models.
Stats 01 00004 g005
Figure 6. (a) Estimated survival function for the LOLLBSP regression model and the KM estimate; and (b) Estimated hrf of the LOLLBSP regression model.
Figure 6. (a) Estimated survival function for the LOLLBSP regression model and the KM estimate; and (b) Estimated hrf of the LOLLBSP regression model.
Stats 01 00004 g006
Table 1. Simulation study.
Table 1. Simulation study.
Sample SizeParameterMeanBiasMSE
a2.74790.24790.5496
N = 50 b1.2505−0.24950.2038
α 2.15590.15590.3803
β 1.47410.72411.4878
a2.72280.22280.5063
N = 100 b1.3640−0.13600.1263
α 2.13560.13560.3608
Setup 1 β 1.16480.41480.7094
a2.69600.19600.4276
N = 200 b1.4476−0.05240.0709
α 2.11110.11110.3104
β 0.91020.16020.3297
a2.65070.15070.3695
N = 300 b1.4466−0.05340.0551
α 2.10840.10840.2584
β 0.90140.15140.2706
a0.7487−0.00130.2457
N = 50 b3.00760.00760.4509
α 0.9691−0.03090.5599
β 2.11680.11681.4317
a0.75440.00440.1970
N = 100 b2.9874−0.01260.3606
α 0.9754−0.02460.4350
Setup 2 β 2.12330.12331.3254
a0.76160.01160.0920
N = 200 b2.9967−0.00330.3039
α 0.9928−0.00720.1912
β 2.10870.10871.1643
a0.75250.00250.0447
N = 300 b2.9996−0.00040.2216
α 0.9853−0.01470.0876
β 2.06780.06780.8310
a1.59220.09220.7011
N = 50 b3.00350.00350.5183
α 0.80680.05680.2529
β 1.33560.08560.4881
a1.57240.07240.4521
N = 100 b3.01790.01790.2984
α 0.79550.04550.1701
Setup 3 β 1.26940.01940.2892
a1.51950.01950.2394
N = 200 b3.04610.04610.1839
α 0.76370.01370.0910
β 1.2415−0.00850.1530
a1.52290.02290.1487
N = 300 b3.01990.01990.1186
α 0.76510.01510.0573
β 1.25270.00270.1108
Table 2. MLEs of the parameters for the fitted models to the breaking stress of carbon fibres data.
Table 2. MLEs of the parameters for the fitted models to the breaking stress of carbon fibres data.
Modelab α β W * A *
BS ( a , b ) 0.46212.3661 0.29781.6181
(0.0326)(0.1064)
MOEBS ( a , b , β ) 2.80260.0250 9909.45800.07980.5580
(0.4486)(0.0078) (435.3655)
OLLBS ( a , b , α ) 217.34552.4788530.5732 0.24611.2847
(0.1095)(0.0557)(0.1012)
OLLBSP ( a , b , α , β ) 0.31731.26530.39934.73720.07110.3945
(0.0977)(0.3080)(0.1697)(1.6564)
BBS ( a , b , α , β ) 97.38980.0075629.2049466.41590.10400.5363
(24.6425)(0.0011)(236.0238)( 203.7381)
KWBS ( a , b , α , β ) 12.93220.005426.00082.15880.14860.7541
(1.6490)(0.0006)(5.2036)(0.7928)
EGBS ( a , b , α , β ) 1.768569.7519105.92740.39280.16030.8570
(0.0736)(0.0859)(0.4150)(0.0726)
Table 3. AIC, BIC and CAIC statistics.
Table 3. AIC, BIC and CAIC statistics.
Statistic
ModelAICCAICBIC
LOLLBSP61.166.068.2
LMcBS76.767.768.9
LBBS73.278.280.3
LBS66.467.669.9
Table 4. MLEs, standard errors (SEs) and [p-values] for the LOLLBSP regression model fitted to the ambient temperature data.
Table 4. MLEs, standard errors (SEs) and [p-values] for the LOLLBSP regression model fitted to the ambient temperature data.
Model β α a σ β 0 β 1
LOLLBSP1.55480.10620.10313.93826.6303−0.02936
(0.9542)(0.01876)(0.005682)(1.06512)(0.2890)(0.001231)
[ < 0 . 001 ][ < 0 . 001 ]

Share and Cite

MDPI and ACS Style

Cordeiro, G.M.; De Lima, M.D.C.S.; Ortega, E.M.M.; Suzuki, A.K. A New Extended Birnbaum–Saunders Model: Properties, Regression and Applications. Stats 2018, 1, 32-47. https://0-doi-org.brum.beds.ac.uk/10.3390/stats1010004

AMA Style

Cordeiro GM, De Lima MDCS, Ortega EMM, Suzuki AK. A New Extended Birnbaum–Saunders Model: Properties, Regression and Applications. Stats. 2018; 1(1):32-47. https://0-doi-org.brum.beds.ac.uk/10.3390/stats1010004

Chicago/Turabian Style

Cordeiro, Gauss Moutinho, Maria Do Carmo Soares De Lima, Edwin Moisés Marcos Ortega, and Adriano Kamimura Suzuki. 2018. "A New Extended Birnbaum–Saunders Model: Properties, Regression and Applications" Stats 1, no. 1: 32-47. https://0-doi-org.brum.beds.ac.uk/10.3390/stats1010004

Article Metrics

Back to TopTop