For the complete documentation index, see llms.txt.
Skip to main content
New
17% · 1/6
Lesson 12 min 20 XP

Sampling Methods and Bias

How the way data is collected determines what it can tell you, and why sampling errors invalidate conclusions no matter how large the dataset.

The Foundation of All Data

Every statistic, poll, and study you encounter is based on a sample — a subset of a larger population. The validity of any conclusion depends entirely on whether that sample accurately represents the population it claims to describe. A biased sample produces biased conclusions, no matter how sophisticated the analysis.

The most famous sampling failure in history is the 1936 Literary Digest presidential poll, which surveyed 2.4 million Americans and predicted Alf Landon would defeat Franklin Roosevelt in a landslide. Roosevelt won 46 of 48 states. The Digest's sample was drawn from telephone directories and automobile registrations — sources that over-represented wealthy Americans during the Great Depression. The enormous sample size could not compensate for the systematic bias in who was sampled.