Confidence Intervals
A point estimate
is a single dot with no sense of its own error. A confidence interval fixes that:
instead of one guess, it reports a range of plausible values together with a
confidence level such as 95\%.
We build it from the estimate and its
standard error —
roughly, the estimate plus-or-minus a few standard errors. The
empirical rule
is why "a few": about 95\% of the sampling distribution sits within
about two standard errors of the centre.
What the 95% really describes
Here is the subtle part. A 95\% confidence interval is built by a
procedure with this property: if you repeated the whole experiment many times —
new sample, new interval each time — then about 95\% of those
intervals would capture the true \mu. The confidence lives in
the recipe, not in any one interval it produces.
So it is tempting but wrong to say "there is a 95\% chance
\mu is in this interval". Once computed, your interval either
contains the fixed \mu or it does not — there is no probability left in
it. The 95\% is the long-run hit rate of the method.
A stack of intervals, one true μ
The vertical line is the fixed true \mu. Each horizontal bar is one
confidence interval from one sample, centred on that sample's estimate. Most bars cross the line —
they captured \mu. The odd one out (drawn in a warning colour) sits
entirely to one side: it missed. Over many intervals, the fraction that catch
\mu hovers near the confidence level.
Asking for more confidence means missing less often — and the only way to miss
less often with the same data is to make every interval wider. Confidence and
width pull in the same direction.
- A confidence interval reports a range plus a confidence level, not a single point.
- A 95\% interval comes from a procedure that, over repeated samples, captures \mu about 95\% of the time.
- The confidence is in the method: a single computed interval either contains \mu or it does not.
- Higher confidence requires a wider interval.