With two features we can picture it: each example is a point, coloured by its class, and the model's job is to draw a boundary that separates the classes. New points are then classified by which side of the boundary they fall on.
Two classes are scattered below. Rotate and shift the boundary to separate them as cleanly as possible; the readout counts how many points it gets right. A line that puts every blue on one side and every orange on the other is a perfect classifier for this data.
A hard line gives a yes/no answer, but often we want a probability — "85% likely
spam." That calls for the