Step into the
It's everywhere: grouping customers into segments, organising news into topics, spotting communities in a social network, compressing colours in an image. Whenever you want to ask "what natural categories are in this data?", clustering is the tool.
These points clearly fall into clumps — but how many? Choose the number of clusters
Without labels, "correct" is genuinely ambiguous — different notions of similarity give different
clusterings, and choosing