The Bias–Variance Tradeoff

A model's expected error on new data splits cleanly into two parts that pull in opposite directions:

As you crank up complexity, bias falls but variance rises. Total test error is their sum (plus unavoidable noise), so it traces a U-shape — and the best model sits at the bottom of the U.

The U-shaped curve

Watch the three curves as complexity grows. Bias² slides down, variance climbs up, and their sum — the total error — dips to a minimum and then rises again. Move the marker to the bottom of the total curve: that's the complexity you want.

Steering the tradeoff

Every knob in machine learning is, secretly, a bias–variance knob: a tree's depth, a polynomial's degree, the strength of regularization, the size of a network. More data is the rare free lunch — it lowers variance without raising bias, shifting the whole sweet spot toward more powerful models. Knowing which way a change pushes you is half of practical ML.