Let X 1, X 2, ..., X n be the ordered statistics based on random sample of size n from some continuous distribution.
Let the distribution function be F(x;θ) for Ω in some parameter space with dimension greater than or equal to 1.
Let L < U be two statistics based on the sample such that for any given values α and P, with 0 < α < 1 and 0 < P < 1, the following holds for every θ in Ω:
Then, the interval [L, U] is a two-sided tolerance interval with content = P x 100% and confidence level = 100(1 – α)%. Such an interval can be called a two-sided (1 – α, P) tolerance interval. For example, if α = 0.10 and P = 0.85, then the resulting interval is called a two-sided (90% , 0.85) tolerance interval.
If L = –∞ and U < +∞, then the interval (-∞, U] is called a one-sided (1 – α, P) upper tolerance bound. If L > -∞ and U = +∞, then the interval [L, +∞) is called a one-sided (1 – α, P) lower tolerance bound.
Minitab calculates exact (1 – α, P) nonparametric tolerance intervals, where 1 – α is the confidence level and P is the coverage (the target minimum percentage of population in the interval). The nonparametric method for tolerance intervals is a distribution free method. That is, the nonparametric tolerance interval does not depend on the parent population of your sample. Minitab uses an exact method for both one-sided and two-sided intervals.
Let X 1, X 2 , ... , X n be the ordered statistics based on a random sample from some continuously distributed population F(x;θ). Then, based on the findings of Wilks1, 2 and Robbins3, it can be shown that:
where B denotes the cumulative distribution function of the beta distribution with parameters a = r and b = n – s + 1. Thus ( Xr , Xs ) is a distribution-free tolerance interval because the coverage of the interval has a beta distribution with known parameter values, which are independent of the distribution of the parent population, F(x;θ).
Let k be the largest integer that satisfies the following:
where Y is a binomial random variable with parameters n and 1 – P. It can be shown (see Krishnamoorthy and Mathew4) that a one-sided (1 – α, P) lower tolerance bound is given by Xk . Similarly, a one-sided (1 – α, P) upper tolerance bound is given by X n - k +1. In both cases, the actual or effective coverage is given by P(Y > k).
Let k be the smallest integer that satisfies the following:
where V is a binomial random variable with parameters n and P. Thus,
where F V -1(x) is the inverse cumulative distribution function of V. It can be shown (see Krishnamoorthy and Mathew4) that a two-sided (1 – α, P) tolerance interval may be given as ( Xr , Xs ). Minitab chooses s = n - r + 1 so that r = ( n – k + 1) / 2. Both r and s are rounded down to the nearest integer. The actual or effective coverage is given by P(V < k – 1).
Term | Description |
---|---|
1 – α | the confidence level of the tolerance interval |
P | the coverage of the tolerance interval (the target minimum percentage of population in the interval) |
n | the number of observations in the sample |
The tolerance interval for the gamma distribution uses an approximation to the normal distribution. Krishnamoorthy, et al. conduct simulation studies that show that the approximation provides accurate results. The calculations follow this process:
Minitab calculates exact (1 – α, P) tolerance intervals, where 1 – α is the confidence level and P is the coverage (the target minimum proportion of the population in the interval). The formulas differ between the calculation of one-sided tolerance limits and two-sided tolerance intervals.
This formula gives the lower limit:
This formula gives the upper limit:
Minitab uses Newton's method to solve the following system of equations. For more detail, see Fernandez1.
This formula gives the two-sided interval:
Where,
and the value of k 1 depends on the solution to this system of equations:
where,
Term | Description |
---|---|
n | the sample size |
the sample mean | |
P | the target minimum proportion of the population in the interval |
the α th percentile of the chi-square distribution with 2n degrees of freedom | |
α | 1 − confidence level |
the cumulative distribution function of the chi-square distribution with 2n degrees of freedom |
Minitab calculates exact (1 – α, P) tolerance intervals based on Lawless1, where 1 – α is the confidence level and P is the coverage (the target minimum percentage of population in the interval).
where
with
where
The value of k 2 comes from replacing α with 1 − α and P with 1 − P in the formulas for computing k 1.
To calculate the approximate two-sided interval, replace α by α/2 and P by (P + 1)/2 in the formulas for computing the one-sided tolerance limits.
Term | Description |
---|---|
the maximum likelihood estimate of the location parameter of the extreme value distribution | |
the maximum likelihood estimate of the scale parameter of the extreme value distribution | |
, the centered observations based on the MLE estimates of the location and scale parameters of the smallest extreme value distribution | |
t | the α th percentile of the non-central t-distribution with n − 1 degrees of freedom and noncentrality parameter δP |
1 - α | the confidence level of the tolerance interval |
P | the coverage of the tolerance interval (the target minimum percentage of population in the interval) |
n | the number of observations in the sample |
For formulas that apply to the smallest extreme value distribution, go to the section on the smallest extreme value distribution.
Minitab calculates approximate (1 − α, P) tolerance intervals based on Bain and Engelhardt1, where 1 − α is the confidence level and P is the coverage (the target minimum percentage of population in the interval). The formula for the lower tolerance factor differs from the formula for the upper tolerance factor.
The analysis produces an approximate two-sided tolerance interval for the logistic distribution with Bonferroni's inequality2. This approximation method replaces α by α/2 and P by (P + 1)/2 in the formulas for computing the one-sided tolerance limits.
Term | Description |
---|---|
the lower tolerance factor | |
the upper tolerance factor | |
zα | the upper α percentile of the standard normal distribution, which is equivalent to the lower 1 −α percentile point |
log(p) − log(1 − p), the p × 100 lower percentile of the standard logistic distribution | |
C11 | |
C22 | |
C12 | |
the maximum likelihood estimate of the logistic location parameter | |
the maximum likelihood estimate of the logistic scale parameter |
For formulas that apply to the logistic distribution, go to the section on the logistic distribution.
Minitab uses Anderson-Darling statistics to perform the goodness-of-fit test.
Let Z = F(X), where F(X) is the cumulative distribution function. Suppose that a sample X1, .., Xn gives values Z(i) = F(Xi), i=1,.., n. Rearrange Z(i) in ascending order, Z(1) < Z(2) <...<Z(n). Then the Anderson-Darling statistic (A2) is calculated as follows:
The modified Anderson-Darling goodness-of-fit test statistic is calculated for each distribution. The p-values are based on the table given by D'Agostino and Stephens.1 If no exact p-value is found in the table, Minitab calculates the p-value based on interpolation using the range of the p-value.