By Elena Llaudet, co-author of Data Analysis for Social Science (DSS)
Random treatment assignment makes treatment and control groups comparable when sample size n is large enough.
Suppose the population is composed of 20% orange individuals, 10% blue individuals, 20% pink individuals, 30% green individuals, and 20% purple individuals. If we select a sample of n individuals from this population and randomly assign them to treatment and control groups, the two groups will have similar proportions of each type of individual as long as n is large enough. Let's take a closer look:
Note: n is the total sample size, n_t is the size of the treatment group, and n_c is the size of the control group. The white numbers on top of each bar show the actual count of individuals of that type in each group.