Statistics

Statistics is the art of learning from data. The world hands us messy, partial, noisy numbers — exam scores, rainfall, click-through rates, the heights of a thousand people — and statistics is the disciplined set of tools for turning that mess into honest conclusions: a single number that summarises, a curve that describes, and a verdict that says how much we should believe it.

It splits naturally into two halves. Descriptive statistics compresses a pile of data into a few telling numbers and pictures. Inferential statistics goes further and dangerous: it uses a small sample to make claims about a whole population we never fully see — and, crucially, it keeps track of how wrong those claims might be.

The big idea: signal through noise

One thread runs through the whole subject. Every measurement is part signal (the thing we want to know) and part noise (the accidents of which particular data we happened to collect). Statistics is the machinery for separating the two — and for being honest about the noise that remains. Get a single number from data, and the very next question is always the same: how much would it have changed if I'd collected different data?

The shape of the journey

The course climbs in seven stages, each building on the last.

Stage 1 — Describing data

  1. Data and Variables
  2. The Mean
  3. The Median
  4. The Mode
  5. Range and Spread
  6. Variance
  7. Standard Deviation
  8. Quartiles and the IQR
  9. Histograms
  10. Shape, Skew, and Outliers

Stage 2 — Distributions

  1. What Is a Distribution?
  2. The Normal Distribution
  3. The Empirical Rule
  4. z-Scores

Stage 3 — Sampling

  1. Population and Sample
  2. Sampling and Bias
  3. The Sampling Distribution of the Mean
  4. Standard Error
  5. The Central Limit Theorem

Stage 4 — Estimation

  1. Point Estimates
  2. Confidence Intervals
  3. A Confidence Interval for a Mean

Stage 5 — Hypothesis testing

  1. Hypothesis Testing
  2. The p-Value
  3. Significance and the t-Test
  4. Type I and Type II Errors

Stage 6 — Relationships

  1. Scatter Plots
  2. Correlation
  3. The Regression Line
  4. Interpreting Slope and Intercept

Stage 7 — Bayesian inference

The classical course above asks "how surprising is this data, if the claim were true?" The Bayesian view flips the question to "how should this data update what I believe?" Both are statistics; the second is a short arc you can take now or save for last.

  1. Bayes' Theorem
  2. Likelihood and MLE
  3. The Covariance Matrix
  4. The Multivariate Gaussian
  5. MAP Estimation

Let's get started

We begin where all statistics begins — with the raw material. Before any average or curve, you have to know what kind of thing you are measuring.

Let's get started → Data and Variables