The Multivariate Gaussian

The bell curve generalises to many dimensions. A multivariate Gaussian is pinned down by two objects: a mean vector \boldsymbol\mu (where it is centred) and a covariance matrix \Sigma (its shape and spread). Its density is

p(\mathbf{x}) = \frac{1}{\sqrt{(2\pi)^n \det\Sigma}}\,\exp\!\Big(-\tfrac12 (\mathbf{x} - \boldsymbol\mu)^{\mathsf T}\Sigma^{-1}(\mathbf{x} - \boldsymbol\mu)\Big).

The exponent is the heart of it. The quadratic form (\mathbf{x} - \boldsymbol\mu)^{\mathsf T}\Sigma^{-1}(\mathbf{x} - \boldsymbol\mu) is the squared Mahalanobis distance — distance measured in units of standard deviation, stretched by the covariance. It reappears verbatim as the data-misfit term in the Bayesian inverse problem.

Contours are ellipses

The density is constant where the quadratic form is constant, so the level sets are ellipses — concentric, centred at \boldsymbol\mu, with axes and widths set by the eigenvectors and eigenvalues of \Sigma. A diagonal \Sigma gives axis-aligned ellipses; correlation tilts them.

A special fact peculiar to Gaussians: zero covariance implies independence. For a general distribution uncorrelated does not mean independent, but for jointly Gaussian variables a diagonal \Sigma means the components are genuinely independent.

Nested probability contours

The nested ellipses are equal-density contours of a centred 2-D Gaussian. Adjust \sigma_x, \sigma_y and the correlation \rho: the inner ellipse is the most probable region, and each outer ring is less likely. The whole family rotates and stretches together because they all share the one matrix \Sigma.