Simpson's Paradox

Simpson’s Paradox (Stanford Encyclopedia of Philosophy)


First published Wed Mar 24, 2021; substantive revision Sat Jun 6, 2026

Simpson’s Paradox is a statistical phenomenon where an association between two variables in a population emerges, disappears or reverses when the population is divided into subpopulations. For instance, two variables may be positively associated in a population, but be independent or even negatively associated in all subpopulations. Cases exhibiting the paradox are unproblematic from the perspective of mathematics and probability theory, but nevertheless strike many people as surprising. Additionally, the paradox has implications for a range of areas that rely on probabilities, including decision theory, causal inference, and evolutionary biology. Finally, there are many instances of the paradox, including in epidemiology and in studies of discrimination, where understanding the paradox is essential for drawing the correct conclusions from the data.

The following article provides a mathematical analysis of the paradox, explains its role in causal reasoning and inference, compares theories of what makes the paradox seem paradoxical, and surveys its applications in different domains.

1. Introduction

2. Definition and Mathematical Characterization

2.1 Varieties of Simpson’s Paradox

2.2 Necessary and Sufficient Conditions

3. Simpson’s Paradox and Causal Inference

3.1 Probabilistic Causality and Simpson’s Paradox

3.2 Specific Debates: Causal Interaction, Average Effects, Mediators

3.3 DAGs and Causal Identifiability

3.4 Confounding and Pearl’s Analysis of the Paradox

3.5 Implications

4. What Makes Simpson’s Paradox Paradoxical?

5. Applications

5.1 Non-Categorical Data and Linear Regression

5.2 Epidemiology and Meta-Analysis

5.3 Decision Theory and the Sure-Thing Principle

5.4 Philosophy of Biology and Natural Selection

5.5 Policy Questions: Interpreting Data on Discrimination

5.6 Using Statistics to Evaluate Task Performance

6. Conclusions

Bibliography

Academic Tools

Other Internet Resources

Related Entries

1. Introduction

We begin with an illustration of the paradox with concrete data. The numbers in Table 1 summarize the effect of a medical treatment for the overall population (N = 52), and separately for men and women:

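| Population | Group | Success \((\r{S})\) | Failure \((\neg \r{S})\) | Success Rate |
|---|---|---|---|---|
| Full Population, \(N=52\) | Treatment (T) | 20 | 20 | 50% |
| Full Population, \(N=52\) | Control (¬T) | 6 | 6 | 50% |
| Men \((\r{M})\), \(N=20\) | Treatment (T) | 8 | 5 | ≈ 61% |
| Men \((\r{M})\), \(N=20\) | Control (¬T) | 4 | 3 | ≈ 57% |
| Women \((\neg \r{M})\), \(N=32\) | Treatment (T) | 12 | 15 | ≈ 44% |
| Women \((\neg \r{M})\), \(N=32\) | Control (¬T) | 2 | 3 | ≈ 40% |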

Table 1: Simpson’s Paradox: the type of association at the population level (positive, negative, independent) changes at the level of subpopulations. Numbers taken from Simpson’s original example (1951).

For matters of exposition, we assume that these frequencies are unbiased estimates of the underlying probabilities. The treatment looks ineffective at the level of the overall population, but it leads to higher success percentages than the control both for men and for women (61% vs. 57% for men and 44% vs. 40% for women). Writing these proportions as conditional probabilities, with \(\r{T}\)=treatment, \(\r{S}\)=success/recovery, and \(\r{M}\)=male subpopulation, we obtain

\[ p(\r{S}\mid \r{T}) = p(\r{S}\mid \neg \r{T}) \]

but at the same time,

\[\begin{align*} p(\r{S}\mid \r{T}, \r{M}) & \gt p(\r{S}\mid \neg \r{T}, \r{M} ) \\ p(\r{S}\mid \r{T}, \neg \r{M}) &\gt p(\r{S}\mid \neg \r{T}, \neg \r{M}) \end{align*}\]
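The equality and the two inequalities can be checked directly from cell counts. The following is a minimal Python sketch, not part of the original entry. The treatment totals (20 successes, 20 failures) and the women's treatment counts (12, 15) are taken from Table 1; the remaining cells are assumptions, reconstructed from the group sizes (N = 52, 20, 32) and the reported success rates. The names `counts`, `success_rate`, and `rate` are illustrative only.

```python
from fractions import Fraction

# Cell counts as (successes, failures) per (group, arm).
# ASSUMPTION: only the treatment totals and the women's treatment cell are
# given explicitly; the other cells are reconstructed from the group sizes
# and the reported success rates.
counts = {
    ("men", "treatment"): (8, 5),
    ("men", "control"): (4, 3),
    ("women", "treatment"): (12, 15),
    ("women", "control"): (2, 3),
}


def success_rate(cells):
    """Pool (successes, failures) pairs and return the exact success rate."""
    successes = sum(s for s, _ in cells)
    failures = sum(f for _, f in cells)
    return Fraction(successes, successes + failures)


def rate(arm, group=None):
    """Success rate for one arm, within one group or pooled over all groups."""
    cells = [
        cell
        for (g, a), cell in counts.items()
        if a == arm and (group is None or g == group)
    ]
    return success_rate(cells)


for group in (None, "men", "women"):
    label = group or "all"
    treated, control = rate("treatment", group), rate("control", group)
    print(f"{label:>5}: treatment {float(treated):.3f}  control {float(control):.3f}")
```

Exact fractions are used instead of floats so that the population-level equality is tested exactly and is not affected by rounding.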

Should we use the treatment or not? When we know the gender of the patient, we would presumably administer the treatment, whereas it does not look like the right thing to do when we don’t know the patient’s gender—although we know that the patient is either male or female!
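One way to see that the figures involve no arithmetical contradiction is to write each population-level rate as a weighted average of the subpopulation rates, using the law of total probability. The worked example below is a sketch under stated assumptions: it uses the cell counts implied by the totals and rates of Table 1 (8 successes and 5 failures for treated men, 12 and 15 for treated women, 4 and 3 for untreated men, 2 and 3 for untreated women), not figures stated separately in the text.

\[\begin{align*} p(\r{S}\mid \r{T}) &= p(\r{S}\mid \r{T}, \r{M})\, p(\r{M}\mid \r{T}) + p(\r{S}\mid \r{T}, \neg \r{M})\, p(\neg \r{M}\mid \r{T}) = \tfrac{8}{13}\cdot\tfrac{13}{40} + \tfrac{12}{27}\cdot\tfrac{27}{40} = \tfrac{1}{2} \\ p(\r{S}\mid \neg \r{T}) &= p(\r{S}\mid \neg \r{T}, \r{M})\, p(\r{M}\mid \neg \r{T}) + p(\r{S}\mid \neg \r{T}, \neg \r{M})\, p(\neg \r{M}\mid \neg \r{T}) = \tfrac{4}{7}\cdot\tfrac{7}{12} + \tfrac{2}{5}\cdot\tfrac{5}{12} = \tfrac{1}{2} \end{align*}\]

Men make up 13/40 of the treated patients but 7/12 of the controls, so the two averages weight the subpopulation rates differently. This is how equality at the population level can coexist with strict inequalities in both subpopulations.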

This phenomenon was first pointed out in papers by Karl G. Pearson (1899) and George U. Yule (1903), but it was Simpson’s short paper “The interpretation of interaction in contingency tables” (1951), discussing the interpretation of such association reversals, that led to the phenomenon being labeled as “Simpson’s Paradox”. The phenomenon is, however, broader than independence in the overall population and positive association in the subpopulations; for example, the associations may also be reversed. Cohen and Nagel (1934: ch. 16) provide an...
