|
CLASSIFICATION SCATTER PLOTName:
The classification scatter plot reverses the role of the reponse variable and the factor variables. For the classification scatter plot, the Y axis variable is assumed to be qualitative (i.e., a specific number of levels) and the factor variables are assumed to be continuous (the plot will still work if some of the factor variables are also qualitative). The context is the common classification problem where we use the values of the factor variables to classify which group an observation belongs to. For this plot, the subplots are based on the distinct levels of the response variable. For example, suppose the Y axis variable (Y) has two possible values. Then for the first factor variable (X1), we plot the values of X1 corresponding to Y = 1 with x-coordinate 0.8 and the we plot the values of X1 corresponding to Y = 2 with x-coordinate 1.2. A similar subplot is created for each factor variable. This plot can be useful in determing what are the most important factors in determining a classification.
<SUBSET/EXCEPT/FOR qualification> where <y> is the response variable (qualitative); and where the <SUBSET/EXCEPT/FOR qualification> is optional.
CLASSIFICATION SCATTER PLOT Y X1 X2 X3 CLASSIFICATION SCATTER PLOT Y X1 X2 X3 X4 CLASSIFICATION SCATTER PLOT Y X1 TO X4
For each trace, the mean, standard deviation, minimum, and maximum value for that trace is written to the file dpst4f.dat. This can be useful for annotating the plot
. Step 1: Read the data . SET WRITE DECIMALS 3 DIMENSION 40 COLUMNS SKIP 25 READ IRIS.DAT X1 TO X4 Y SKIP 0 . LET NFACT = 4 LET STRING T1 = Sepal Length LET STRING T2 = Sepal Width LET STRING T3 = Petal Length LET STRING T4 = Petal Width . LOOP FOR K = 1 1 NFACT LET MEAN^K = MEAN X^K; LET MEAN^K = ROUND(MEAN^K,3) LET SD^K = SD X^K; LET SD^K = ROUND(SD^K,3) END OF LOOP . . Step 2: Set plot control features . CASE ASIS TITLE CASE ASIS LABEL CASE ASIS TIC MARK LABEL CASE ASIS TITLE OFFSET 2 . CHARACTERS 1 2 3 1 2 3 1 2 3 1 2 3 BLANK CHARACTER COLOR BLUE RED GREEN BLUE RED GREEN BLUE RED GREEN BLUE RED GREEN LINES COLOR BLUE RED GREEN BLUE RED GREEN BLUE RED GREEN BLUE RED GREEN LET PLOT LINE 13 = BLANK XLIMITS 1 NFACT MAJOR XTIC MARK NUMBER NFACT MINOR XTIC MARK NUMBER 0 TIC MARK OFFSET UNITS DATA XTIC OFFSET 1 1 XTIC LABEL FORMAT ALPHA XTIC LABEL CONTENT F1:sp()Sepalcr()Length F2:sp()Sepalcr()Width ... F3:sp()Petalcr()Length F4:sp()Petalcr()Width Y1LABEL Standardized Feature X1LABEL Features X1LABEL DISPLACEMENT 12 YLIMITS -4 4 . LET X1 = STANDARDIZE X1 LET X2 = STANDARDIZE X2 LET X3 = STANDARDIZE X3 LET X4 = STANDARDIZE X4 . . Step 3: Generate plots . TITLE Classification Scatter Plot: Standardized Units CLASSIFICATION SCATTER PLOT Y X1 X2 X3 X4 . TITLE IRIS Classification Analysis Based on Standardized Data . CLASSIFICATION SCATTER PLOT Y X1 X2 X3 X4 . LET XCOOR1 = 86 LET XCOOR2 = 88 LET YCOOR = 89 LET YINC = 2.5 JUSTIFICATION LEFT COLOR BLACK HEIGHT 2 . LOOP FOR K = 1 1 NFACT MOVE XCOOR1 YCOOR TEXT F^K: ^T^K LET YCOOR = YCOOR - YINC MOVE XCOOR2 YCOOR TEXT Mean = ^MEAN^K LET YCOOR = YCOOR - YINC MOVE XCOOR2 YCOOR TEXT SD = ^SD^K LET YCOOR = YCOOR - YINC END OF LOOP . COLOR BLUE MOVE XCOOR1 45 TEXT Cat1: Setosa . COLOR RED MOVE XCOOR1 42.5 TEXT Cat2: Versicolor . COLOR GREEN MOVE XCOOR1 40 TEXT Cat3: Virginica . skip 1 read dpst4f.dat ymean ysd ymin ymax skip 0 let ymean = round(ymean,1) let ysd = round(ysd,1) let ymin = round(ymin,1) let ymax = round(ymax,1) . character blank all character size 1.5 all character just right all character color blue blue blue blue red red red red green green green green let nlen = 1 let sblank = blank string nlen . set substitute format f4.1 loop for l = 1 1 nfact loop for k = 1 1 3 let k2 = (l-1)*3 + k let aval = ymax(k2) let bval = ymean(k2) let cval = ymin(k2) let dval = ysd(k2) let string s2 = ^aval ^bval ^cval ^dval if k = 1 let string s = ^s2 else let s = string concatenate s sblank s2 end of if end of loop character ^s let xpos = sequence 0.8 4 0.2 1.21 let xpos = (l - 1) + xpos let ypos = sequence 26 -1.5 21 for i = 1 1 12 let tag = sequence 1 1 12 drawds symbol xpos ypos tag delete s s2 end of loop . color black height 1.5 justification left move xcoor1 25.5 text Max move xcoor1 24 text Mean move xcoor1 22.5 text Min move xcoor1 21 text SD . height 2 just left move 2 7.5 text if F3 <= -0.7, then cat = 1 move 2 5 text if F4 >= 0.4, then cat = 3 move 2 2.5 text else cat = 2 . line color black line dotted drawsdsd 15 0.4 85 0.4 drawsdsd 15 -0.7 85 -0.7
Date created: 03/14/2019 |
Last updated: 12/04/2023 Please email comments on this WWW page to alan.heckert@nist.gov. |