Why did u(x, t) = f(x - ct) solve the
transport equation?
Because u stays constant along the moving lines
x - ct = \text{const}. The method of characteristics
turns this observation into a technique: find the curves along which a first-order PDE becomes an
ordinary one.
Along a path x(t), the chain rule gives the rate of change of
u as you ride it:
\frac{d}{dt}\,u\big(x(t), t\big) = u_t + \frac{dx}{dt}\,u_x.
Compare with u_t + c\,u_x = 0: if we choose the path so that
\frac{dx}{dt} = c, the right-hand side is the PDE, so
\frac{du}{dt} = 0. The unknown is constant along that path.