![]() |
DISTRIBUTIONAL FIT PLOTName:
Often we have a number of related datasets. The DISTRIBUTIONAL FIT PLOT provides a convenient way of graphically presenting the results of the BEST DISTRIBUTIONAL FIT command for these multiple datasets. Specifically, it creates a tabulation plot where the y-axis contains the distribution and the x-axis contains the distinct datasets. That is, we can view a row of the plot to see how a particular distribution fits the various datasets and we can view a column to see which distributions fit a a particular dataset well. There are two steps in the distributional modeling:
You can specify the fit method with the command
where <value> is one of the following
The default method is maximum likelihood. You can specify the goodness of fit critierion with the command
where <value> is one of the following
The default goodness of fit criterion is Anderson-Darling. Once the matrix (rows represent distribtions, columns represent datasets) of goodness of fit statistics is computed, they are plotted in the form of a tabulation plot. The tabulation plot is a mix between a fluctuation plot and a contour plot. For the fluctuation plot, at each grid position, two rectangles are drawn. The first is drawn in a background color and is full size (i.e., the maximum value of the statistic). A second rectangle is drawn in a foreground color with a height proportional to the value of the statistic for that particular combination of categories. For the tabulation plot, only one rectangle is drawn. However, the color of the rectangle is based on the value of the statistic relative to a "levels" variable (hence the similarity to a contour plot). The Program examples below demonstrate how to create and use the levels variable. Note that defining good levels for the information criterion goodness of fit statistics is somewhat problematic, so this plot is more useful for the PPCC, Anderson-Darling, and Kolmogorov-Smirnov goodness of fit statistics. The appearance of the tabulation plot is controlled by appropriate settings for the REGION, LINE, and CHARACTER settings. This is demonstrated in the Program examples below. Note that not all of the options available for a standard TABULATION PLOT are supported for this command.
<SUBSET/EXCEPT/FOR qualification> where <y> is a response variable; <x> is a group-id variable; <ylevel> is a variable that defines the levels; and where the <SUBSET/EXCEPT/FOR qualification> is optional. For this syntax, the different datasets are identified by a group-id variable.
<SUBSET/EXCEPT/FOR qualification> where <y1> ... <yk> is a list of 1 to 30 response variables; <ylevel> is a variable that defines the levels; and where the <SUBSET/EXCEPT/FOR qualification> is optional. For this syntax, the different datasets are identified with separate response variables.
<SUBSET/EXCEPT/FOR qualification> where <y> is a response variable; <x1> ... <x6> is a list of 1 to 6 group-id variables; <ylevel> is a variable that defines the levels; and where the <SUBSET/EXCEPT/FOR qualification> is optional. The group-id variables are cross-tabulated and a dataset is defined for each distinct combination of values for the group-id variables. For the REPLICATED case, you can control the spacing between groups. Internally, Dataplot uses the CODE CROSS TABULATE command to generate a single combined group-id variable. Enter HELP CODE CROSS TABULATE for details on how to control the spacing (the SET commands used by CODE CROSS TABULATE are supported for the DISTRIBUTIONAL FIT PLOT command).
MULTIPLE DISTRIBUTIONAL FIT PLOT Y1 TO Y8 REPLICATED DISTRIBUTIONAL FIT PLOT Y X1 X2 DISTRIBUTIONAL FIT PLOT Y X SUBSET X < 8
By default (= ON), each factor variable is coded from 1 to NDIST with NDIST denoting the number of levels (i.e., distinct values for that factor variable). When this switch is set to OFF, the x-axis variable is plotted in the units of group-id variable. This can be useful if you want to add spacing between some groups of data or if you have missing groups that you want the plot to clearly show. This command does not apply for Syntax 2 (the MULTIPLTE option).
. . Step 1: Read the data . skip 25 read exp.dat y1 read weibbury.dat y2 read lgn.dat y3 read gamma.dat y4 read gumbel.dat y5 skip 0 . let string dist1 = EXP.DAT let string dist2 = WEIBBURY.DAT let string dist3 = LGN.DAT let string dist4 = GAMMA.DAT let string dist5 = GUMBEL.DAT . . Step 2: Set basic plot control features . case asis title case asis label case asis tic mark label case asis title offset 1 title size 1.8 xlimits 1 5 major xtic mark number 5 minor xtic mark number 0 x1tic mark offset 0.5 0.5 x1tic mark label format alpha x1tic mark label content ^dist1 ^dist2 ^dist3 ^dist4 ^dist5 y1tic mark offset 1 1 y1tic mark label size 1.3 frame corner coordinates 15 15 80 93 . . Step 3: Define the plot levels . let p1 = 0.95 let p2 = 0.96 let p3 = 0.97 let p4 = 0.98 let p5 = 0.985 let p6 = 0.990 let p7 = 0.995 let p8 = 1.0001 let ylevel = data p1 p2 p3 p4 p5 p6 p7 p8 . let ncolor = 8 let string color1 = red let string color2 = orange let string color3 = tan let string color4 = yellow let string color5 = green let string color6 = dgreen let string color7 = lblue let string color8 = blue . . Step 4: Define options and generate the plot . region fill on all region fill color ^color1 ^color2 ^color3 ^color4 ^color5 ^color6 ... ^color7 ^color8 line color white all . set write decimals 5 set best fit method ppcc set best fit criterion ppcc . title Distributional Fit (PPCC Fit, PPCC Goodness of Fit) multiple distributional fit plot y1 to y5 ylevel . . Step 5: Draw the legend box . box fill pattern solid box shadow hw 0 0 justification left height 1.7 . let xcoor1 = 81 let xcoor2 = 85 let xcoor3 = xcoor2 + 1 let ycoor1 = 93 let yinc = 4 let ycoor2 = ycoor1 - yinc let kind = ncolor . loop for k = 1 1 ncolor box fill color ^color^kind box xcoor1 ycoor1 xcoor2 ycoor2 let ycoor3 = ycoor2 + 1 move xcoor3 ycoor3 let km1 = kind - 1 let aval1 = ^p^km1 let aval2 = ^p^kind let aval2 = min(1,aval2) if k < ncolor if k = 1 text ^aval1 - ^aval2 else text ^aval1 - ^aval2 end of if else text <= ^aval1 end of if let ycoor1 = ycoor2 let ycoor2 = ycoor1 - yinc let kind = kind - 1 end of loop ![]() . . Step 1: Read the data . skip 25 read exp.dat y1 read weibbury.dat y2 read lgn.dat y3 read gamma.dat y4 read gumbel.dat y5 skip 0 . let string dist1 = EXP.DAT let string dist2 = WEIBBURY.DAT let string dist3 = LGN.DAT let string dist4 = GAMMA.DAT let string dist5 = GUMBEL.DAT . . Step 2: Set basic plot control features . case asis title case asis label case asis tic mark label case asis title offset 1 title size 1.8 xlimits 1 5 major xtic mark number 5 minor xtic mark number 0 x1tic mark offset 0.5 0.5 x1tic mark label format alpha x1tic mark label content ^dist1 ^dist2 ^dist3 ^dist4 ^dist5 y1tic mark offset 1 1 y1tic mark label size 1.3 frame corner coordinates 15 15 80 93 y1tic mark offset 1 1 y1tic mark label size 1.3 . . Step 3: Define the plot levels . let p1 = 0.25 let p2 = 0.50 let p3 = 0.75 let p4 = 1.0 let p5 = 1.5 let p6 = 2.0 let p7 = 5.0 let p8 = 10000 let ylevel = data p1 p2 p3 p4 p5 p6 p7 p8 . let ncolor = 8 let string color1 = blue let string color2 = lblue let string color3 = dgreen let string color4 = green let string color5 = yellow let string color6 = tan let string color7 = orange let string color8 = red . . Step 4: Define plot options and generate the plot . set write decimals 5 set best fit method ml set best fit criterion anderson darling . region fill on all region fill color ^color1 ^color2 ^color3 ^color4 ^color5 ^color6 ... ^color7 ^color8 line color white all . title Distributional Fit (ML Fit, Anderson-Darling Goodness of Fit) multiple distributional fit plot y1 to y5 ylevel . . Step 5: Add some labels and draw the legend box . box fill pattern solid box shadow hw 0 0 justification left height 1.7 . let xcoor1 = 81 let xcoor2 = 85 let xcoor3 = xcoor2 + 1 let ycoor1 = 93 let yinc = 4 let ycoor2 = ycoor1 - yinc . loop for k = 1 1 ncolor box fill color ^color^k box xcoor1 ycoor1 xcoor2 ycoor2 let ycoor3 = ycoor2 + 1 move xcoor3 ycoor3 if k = 1 text <= ^p1 else if k = ncolor let km1 = k - 1 text >= ^p^km1 else let km1 = k - 1 text ^p^km1 - ^p^k end of if let ycoor1 = ycoor2 let ycoor2 = ycoor1 - yinc end of loop ![]()
|
Privacy
Policy/Security Notice
NIST is an agency of the U.S.
Commerce Department.
Date created: 09/11/2014 |