20.7 Summary: Finding a CI for
The procedure for computing a confidence interval (CI) for a proportion is:
- Compute the sample proportion, , and identify the sample size .
- Compute the standard error, which quantifies how much the value of varies from one sample to the next:
- Find the multiplier: this is for an approximate 95% CI using the 68–95–99.7 rule. (Note: (Multiplierstandard error) is called the margin of error.)
- Compute:
- Check the statistical validity conditions are satisfied.
Example 20.7 (NHANES data) For the NHANES data, first seen in Sect. 12.10, the unknown parameter is , the population proportion of Americans that currently smoke.
In the study, 1466 out of the 3211 respondents who reported their smoking status said they currently smoked: .
What is the population proportion that currently smoke? We don’t know, and the estimate of from every sample is likely to be different. The standard error is , so the approximate 95% CI for is , or from 0.439 to 0.474. (Check the calculations!)
For the conclusions to be statistically valid, the number of smokers must exceed 5, and the number of non-smokers must exceed 5. Both are true. The CI appears to be statistically valid.
We write:
Based on the sample, we are approximately 95% confident that the interval from from 0.429 to 0.474 straddles the population proportion of smokers in the USA.