AI Bias and Fairness
How bias enters AI systems through data, design, and deployment, why fairness is harder to define than it seems, and what can be done to build more equitable AI.
Where Bias Comes From
AI bias is not a single problem — it enters systems at every stage of development. Training data bias occurs when the data used to train a model reflects historical inequities. An AI trained on decades of hiring decisions will learn that men were hired more often for engineering roles — not because men are better engineers, but because historical discrimination skewed the data. Amazon discovered this in 2018 when its AI recruiting tool systematically downgraded resumes containing the word 'women's' (as in 'women's chess club captain').
Representation bias occurs when certain groups are underrepresented in training data. Facial recognition systems trained predominantly on light-skinned faces perform significantly worse on dark-skinned faces. A landmark 2018 study by Joy Buolamwini and Timnit Gebru at MIT found that commercial facial recognition error rates for dark-skinned women were up to 34.7%, compared to 0.8% for light-skinned men.
Measurement bias happens when the variables used as proxies for a concept do not measure it accurately. Using arrest records as a proxy for criminal behavior, for example, reflects policing patterns rather than actual crime rates.