Sometimes data arrive already sorted into classes — ranges of values — rather
than as a list of exact numbers. A survey might record that
This grouped data hides the exact values, so we cannot find the mean precisely. Instead we estimate it: we treat every value in a class as if it sat at the class midpoint, the value halfway between the lower and upper boundary. The midpoint is our best single stand-in for the whole class.
Then the estimated mean is the familiar
Here is a worked example for heights, in centimetres, grouped into four classes:
| Class (cm) | Midpoint |
Frequency |
|
|---|---|---|---|
| 150–160 | 155 | 7 | 1085 |
| 160–170 | 165 | 12 | 1980 |
| 170–180 | 175 | 9 | 1575 |
| 180–190 | 185 | 2 | 370 |
| Total | 30 | 5010 |
The midpoint columns total
It is an estimate because nobody in the