|
BOOTSTRAP FITName:
When these assumptions are at least approximately satisfied, OLS provides the optimal estimates and uncertainty intervals for the fit coefficients. However, if the assumptions are not at least approximately satisfied, then the OLS estimates may no longer be optimal (and may in fact be quite wrong). Applying transformations and weighting are common approaches to fitting when the assumptions are not satisfied. Bootstrap fitting provides an additional alternative. The bootstrap is a non-parametric method for calculating a sampling distribution for a statistic. The bootstrap calculates the statistic with N different subsamples. The subsampling is performed with replacement. In the context of fitting, we are estimating the coefficients of the fit and providing bootstrap estimates of the uncertainty. There are two approaches to bootstrapping for fitting.
Hamilton (see Reference below) gives some guidance on the contrasts between these approaches.
Given the above, if the assumption of fixed X is realistic (that is, we could readily collect new Y's with the same X values), then residual resampling is justified. For example, this would be the case in a designed experiment. However, if this assumption is not realistic (i.e., the X values vary randomly as well as the Y's), then data resampling is preferred. If the bootstrap methods produce substantially different results, this is an indication that the assumptions of fixed X and independent and identically distributed residuals may not be valid. The BOOTSTRAP FIT command produces the following output:
where <y> is the response (dependent) variable; <x1> .... <xk> is a list of one or more independent variables; and where the <SUBSET/EXCEPT/FOR qualification> is optional.
BOOTSTRAP Y X1 X2 X3 X4 BOOTSTRAP Y X1 X2 X3 X4 SUBSET TAG > 1
The default is RESIDUAL. You can use EFRON as a synonym for RESIDUAL and WU as a synonym for DATA.
BOOTSTRAP LINEAR CALIBRATION PLOT BOOTSTRAP QUADRATIC CALIBRATION PLOT The BOOTSTRAP FIT METHOD command also applies to theses commands.
The default is 100 bootstrap samples. Some sources recommend as many as 2,000 bootstrap samples for accurate confidence intervals for the parameter estimates.
Some refinements to generate more accurate confidence intervals have been proposed. The issue of bootstrap confidence intervals for multilinear fitting is discussed on pages 319-325 of Hamilton.
Hamilton (1992), "Regression with Graphics: A Second Course in Applied Statistics," Duxbury Press,
skip 25 read berger1.dat y x . set write decimals 5 set bootstrap fit method data bootstrap samples 100 bootstrap fit y x . delete a0 a1 skip 0 set read format 2e15.7 read dpst1f.dat a0 a1 read dpst2f.dat a0sd a1sd . multiplot corner coordinates 0 0 100 100 multiplot scale factor 2 multiplot 2 2 . title a0 estimate let bmean = mean a0 let b025 = 2.5 percentile a0 let b975 = 97.5 percentile a0 let bmean = int(bmean*1000)/1000 let b025 = int(b025*1000)/1000 let b975 = int(b975*1000)/1000 x2label mean = ^bmean, b025 = ^b025, b975 = ^b975 histogram a0 title a1 estimate let bmean = mean a1 let b025 = 2.5 percentile a1 let b975 = 97.5 percentile a1 let bmean = int(bmean*1000)/1000 let b025 = int(b025*1000)/1000 let b975 = int(b975*1000)/1000 x2label mean = ^bmean, b025 = ^b025, b975 = ^b975 histogram a1 title a0 standard deviation let bmean = mean a0sd let bmean = int(bmean*1000)/1000 x2label mean = ^bmean histogram a0sd title a1 standard deviation let bmean = mean a1sd let bmean = int(bmean*1000)/1000 x2label mean = ^bmean histogram a1sd . end of multiplotThe BOOTSTRAP FIT command generates the following output: Bootstrap Linear/Multilinear Fit Number of Observations: 107 Number of Bootstrap Sample 100 Bootstrap Method: Data (Wu) Summary Table -------------------------------------------------------------------------------------------------- Para- Estimates From Original Fit Estimates From Bootstrap Fit meter Coef SD Mean SD 2.5 97.5 -------------------------------------------------------------------------------------------------- A0 4.99367 1.12565 4.87176 1.13198 2.77080 6.68491 A1 0.73111 0.02455 0.73568 0.02478 0.68731 0.80165Histograms can be generated for the bootstrap samples of each of the parameter estimates and the standard deviations of the parameter estimates.
Date created: 08/12/2002 |
Last updated: 12/11/2023 Please email comments on this WWW page to alan.heckert@nist.gov. |