East Carolina University
Department of Psychology


Resampling Statistics

    A new addition to the chapter on nonparametric statistics, in David Howell's Methods text, 5th edition, is a brief introduction to resampling statistics. These statistics may be appropriate in circumstances where you are uncomfortable with the normality assumption common to parametric inferential statistics, and perhaps in some other circumstances as well. Dave is of the opinion that resampling statistics will replace the traditional nonparametric statistics, and perhaps the traditional parametric statistics, in time.

    Howell’s resampling software is provided on the CD that comes with the text, but you should download the latest program from his web site. First, go to Howell’s Resampling Statistics page. In addition to reading the material there, you should download the program, unzip it, and install it on your personal computer.

Bootstrapping

    With this approach, one constructs a sampling distribution by repeatedly sampling, with replacement, from the actual sample of data at hand. This is much like what we did back in PSYC 6430 when we employed Monte Carlo methods to construct sampling distributions of the mean, variance, standard deviation, z, and t, but then we sampled from mathematically defined populations (such as the standard normal population).

Confidence Interval for a Median

    Consider the example presented on pages 638 and 639 of our text book (David Howell's Statistical Methods for Psychology, 6th edition). The data at hand are 20 scores on a memory task. We wish to construct an 95% confidence interval for the median. We chose the median because the distribution appears to be distinctly skewed. Here is how we construct the confidence interval:

    The sampling distribution obtained by Howell appears on page 641 of your textbook. The .025 percentile is between 5 and 6. We could interpolate between 5 and 6 to obtain the lower confidence limit, but Howell plays it conservative and sets the lower limit at 5. The .975 percentile is somewhere between 9 and 10. Again, we could interpolate, but Howell plays it conservative and sets the upper limit at 10. Since only 3 of the scores in the 10,000 samples have values that fall outside of this confidence interval, our confidence coefficient is actually 9997/10,000 = .9997.

    Howell used the program Resampling Stats by Simon and Bruce to do the analysis shown in our textbook. We do not have that program, but Dr. Howell has provided us with resampling software. I would like you to give it a try. Please complete the following exercise and those that follow:

Confidence Interval for Pearson r

    Consider the data on misanthropy, idealism, and attitude about animals, which we analyzed back in PSYC 6430. We found a significant correlation between misanthropy and attitude about animals for nonidealists but not for idealists. Let us now put a confidence interval on the correlation we obtained with the nonidealists. Here is how we construct the confidence interval:

    Let us use Howell's resampling program to construct this confidence interval:

    You know that the traditional independent samples t test is equivalent to a test of the null hypothesis that the point-biserial r is zero in the population. Accordingly, it might well make sense to use this correlation program for a bootstrapping test of the difference in means between two independent samples -- just code group membership with numbers (like 1 and 2) and run the program. If the confidence interval for the point biserial r does not include zero, then the two groups differ significantly.

Permutation/Randomization Tests

    With this approach one takes the data at hand, randomly assigns scores to groups (without replacement), and then computes, on the obtained sample(s), the relevant statistic. This procedure is repeated many times, obtaining a sampling distribution of the statistic of interest.

Two Independent Samples

    Consider the data on page 645 of our textbook. We have a sample of 49 scores in the success group, and 18 in the fail group. Here is how we conduct a permutation/randomization test:

     Try using Howell's software to conduct this test:

Two Correlated Samples

    Consider the data on page 642 of our textbook. We have paired data from 19 subjects, and 19 signed difference scores. Here is how we conduct a permutation/randomization test:

    Our sample median difference score was 6. From the resampling distribution on page 643 of our textbook, we see that p = (10 + 13)/10,000 = .0023.

    Try using Howell's software to conduct to analyze these data:

Closing Comments

    The program will do more than what I have covered here. Feel free to play around with the other routines available there.

    Dr. Howell's program is designed as a teaching tool rather than a research tool. If you wish to conduct resampling statistics for research purposes, you might want to get a commercial package -- unless you are as frugal as am I.

    Thanks, Dave, for your work on this!

Statistics 101

    John Grosberg offers a giftware program he has written, Statistics101.  It executes the Resampling Stats language of Julian Simon and Peter Bruce.  I have not had a chance to evaluate it myself.

birds flying

Back to the Stat Help PageBack to the Stat Help Page

spider in web
Contact Information for the Webmaster,
Dr. Karl L. Wuensch


This page most recently revised on 7. February 2007.