|
PROBABILITY PLOTName:
The probability plot consists of:
This is essentially a plot of the data percentiles versus the percentiles of the theoretical distribution. Dataplot computes the percent point function of the uniform order statistic medians to compute the percentiles of the theoretical distribution. DATAPLOT has extensive probability plot capabilities (90+ distributions/distributional families are available). When distributional families are specified, the LET command is used before the PROBABILITY PLOT command to specify which member of the distributional family is desired. For example,
WEIBULL PROBABILITY PLOT Y The name of the distributional parameter for families is given in the list below. Probability plots serve two primary uses.
<SUBSET/EXCEPT/FOR/qualification> where <y> is the variable of raw data values under analysis; <dist> is one of the following distributions:
and where the
This syntax is used for the case where we have raw data.
<SUBSET/EXCEPT/FOR/qualification> where <y> is the variable of raw data values under analysis; <x> is the censoring variable; <dist> is one of the distributions listed above; and where the <SUBSET/EXCEPT/FOR qualification> is optional. This syntax is used for the case where we have censored data. A value of 1 indicates a failure time and a value of 0 indicates a censoring time. Censoring is not supported for discrete distributions or grouped data.
<SUBSET/EXCEPT/FOR/qualification> where <y> is the variable of pre-computed frequencies; <x> is the variable of distinct values for the variable under analysis; <dist> is one of the distributions listed above; and where the <SUBSET/EXCEPT/FOR qualification> is optional. This syntax is used for the case where we have frequency (binned) data. The bins are defined by their mid-points.
<SUBSET/EXCEPT/FOR/qualification> where <y> is the variable of pre-computed frequencies; <x> is the variable of distinct values for the variable under analysis; <dist> is one of the distributions listed above; and where the <SUBSET/EXCEPT/FOR qualification> is optional. This syntax is used for the case where we have frequency (binned) data. The bins are defined by their lower and upper limits. This syntax allows bins with unequal widths.
CAUCHY PROBABILITY PLOT X TUKEY LAMBDA PROBABILITY PLOT X LOGNORMAL PROBABILITY PLOT X WEIBULL PROBABILITY PLOT X EXTREME VALUE TYPE 1 PROBABILITY PLOT X POISSON PROBABILITY PLOT X NORMAL PROBABILITY PLOT F X CAUCHY PROBABILITY PLOT F X TUKEY LAMBDA PROBABILITY PLOT F X LOGNORMAL PROBABILITY PLOT F X WEIBULL PROBABILITY PLOT F X EXTREME VALUE TYPE 1 PROBABILITY PLOT F X POISSON PROBABILITY PLOT F X
The PPCC value provides a measure of the linearity of the probability plot. The PPA0 and PPA1 provides estimates of the location and scale parameters. For some distributions with heavy tails (e.g., Cauchy, slash), there can be extreme variability in the first few and last few points in the probability plot. This can distort the estimates of location and scale. Two iterations of biweight weighting of the residuals are applied to obtain PPA0BW and PPA1BW. In most cases, using PPA0 and PPA1 are preferred. However, in cases where there is extreme non-linearity in the tails, using PPA0BW and PPA1BW may be preferred as the location and scale estimates.
For singly censored data (i.e., all the censored data have the same censoring time), we can use the N from the full sample to compute the uniform order statistics. However, we only plot the failure times. An alternative that works with both singly and multiply (the censoring times are not necessarily the same) is to base the plotting positions on the Kaplan-Meier statistic. That is,
with n denoting the full sample size. Again, only plotting positiions corresponding to failure times are plotted. The percent point function is computed on the pi values. This method for censored probability plots is discussed in more detail on pp. 43-46 of the Bury book (see the References section below). To specify which method to use, enter the command
When this command is entered, Dataplot will compute <value> equally spaced percentiles and compute the probability plot on these percentiles. This option can be useful when generating probability plots on large data sets for distributions with expensive percent point functions.
LET YUPP = MAXIMUM Y LET YLOW = YLOW - 0.5 CLASS LOWER YLOW LET YUPP = YUPP + 0.5 CLASS UPPER YUPP CLASS WIDTH = 1 LET Y2 X2 = BINNED Y LET LAMBDA = 4.2 POISSON PROBABILITY PLOT Y2 X2 This will center the bins around the integer values and will cover the first and last class.
EV1 and GUMBEL are synonyms for EXTREME VALUE TYPE 1. FATIGUE LIFE is a synonym for FL. RECIPROCAL INVERSE GAUSSIAN is a synonym for RIG. IG is a synonym for INVERSE GAUSSIAN. SKEW LAPLACE is a synonym for SKEW DOUBLE EXPONENTIAL ASYMMETRIC LAPLACE is a synonym for ASYMMETRIC DOUBLE EXPONENTIAL
Chambers, Cleveland, Kleiner, and Tukey (1983), "Graphical Methods of Data Analysis", Wadsworth. Karl Bury (1999), "Statistical Distributions in Engineering", Cambridge University Press,
1990/5: WALD, FL, RIG, INVERSE GAUSSIAN 1993/12: GENERALIZED PARETO 1994/9: DISCRETE UNIFORM, NON-CENTRAL T, NON-CENTRAL F,
1995/5: POWER LOGNORMAL, POWER NORMAL, COSINE,
1995/10: LOG GAMMA, HYPERBOLIC SECANT, GOMPERTZ 1995/12: PARETO SECOND KIND, DOUBLE WEIBULL,
1996/5: BETA BINOMIAL, GENERALIZED EXPONENTIAL 1998/5: RECIPROCAL, NORMAL MIXTURE, INVERTED GAMMA 2001/10: GENERALIZED LAMBDA, JOHNSON SU,
2003/5: ERROR 2004/1: TRAPEZOID, GENERALIZED TRAPEZOID, FOLDED T,
2004/5: Fixed a number of bugs for various distributions 2004/6: SKEW DOUBLE EXPONENTIAL, RAYLEIGH,
2004/10: Support for censored data 2005/5: Support unequal bin widths for frequency data MULTIPLOT 2 2 MULTIPLOT CORNER COORDINATES 0 0 100 100 MULTIPLOT SCALE FACTOR 1.5 TITLE AUTOMATIC X1LABEL THEORETICAL VALUE Y1LABEL DATA VALUE TITLE OFFSET 2 X1LABEL DISPLACEMENT 10 Y1LABEL DISPLACEMENT 14 CHAR X LINE BLANK . LET Y = NORMAL RANDOM NUMBERS FOR I = 1 1 100 NORMAL PROBABILITY PLOT Y . LET NU = 5 LET Y = CHI-SQUARE RANDOM NUMBERS FOR I = 1 1 100 CHI-SQUARE PROBABILITY PLOT Y . LET Y = EXPONENTIAL RANDOM NUMBERS FOR I = 1 1 100 EXPONENTIAL PROBABILITY PLOT Y . LET Y = CAUCHY RANDOM NUMBERS FOR I = 1 1 1000 CAUCHY PROBABILITY PLOT Y END OF MULTIPLOT
Date created: 08/30/2005 |
Last updated: 12/04/2023 Please email comments on this WWW page to alan.heckert@nist.gov. |