How to Calculate Sample Size for an A/B Test
Sample size calculation is the most important step in planning an A/B test. Running a test without enough visitors risks missing real improvements (a false negative), while running one for too long wastes time and traffic. The formula below tells you exactly how many subjects you need in each variation to reliably detect a given effect.
The Sample Size Formula
For comparing two proportions (e.g., conversion rates), the required sample size per group is:
n = ⌈ (Zα/2 · √(2p̄(1 - p̄)) + Zβ · √(p₁(1 - p₁) + p₂(1 - p₂)))² / (p₁ - p₂)² ⌉
Where:
- n — required sample size per variation (rounded up)
- p₁ — baseline conversion rate (your current control rate)
- p₂ — expected variant conversion rate (baseline + your minimum detectable effect)
- p̄ — pooled proportion, calculated as (p₁ + p₂) / 2
- Zα/2 — z-score for the significance level (1.96 for α = 5%, two-tailed)
- Zβ — z-score for statistical power (0.84 for 80% power)
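The formula above can be sketched in Python. This is a minimal illustration, not a production calculator; the function name and its defaults are my own, with the z-scores fixed at the rounded values used in this article (1.96 and 0.84):

```python
import math

def sample_size_per_group(p1, p2, z_alpha=1.96, z_beta=0.84):
    """Required visitors per variation to detect a change from p1 to p2.

    z_alpha: z-score for a 5% significance level, two-tailed (default 1.96)
    z_beta:  z-score for 80% statistical power (default 0.84)
    """
    p_bar = (p1 + p2) / 2  # pooled proportion
    numerator = (z_alpha * math.sqrt(2 * p_bar * (1 - p_bar))
                 + z_beta * math.sqrt(p1 * (1 - p1) + p2 * (1 - p2))) ** 2
    return math.ceil(numerator / (p1 - p2) ** 2)  # round up

print(sample_size_per_group(0.05, 0.06))  # → 8149
```

The ceiling rounds up to a whole visitor, matching the ⌈ ⌉ brackets in the formula.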
Worked Example
Suppose your landing page converts at 5% and you want to detect a 1 percentage point lift (to 6%) with 80% power and 5% significance (two-tailed).
- Set p₁ = 0.05, p₂ = 0.06, so p̄ = 0.055
- Look up z-scores: Zα/2 = 1.96, Zβ = 0.84
- Plug into the formula: n = ⌈(1.96 × √(2 × 0.055 × 0.945) + 0.84 × √(0.05 × 0.95 + 0.06 × 0.94))² / (0.05 - 0.06)²⌉
- Result: n ≈ 8,149 visitors per variation (16,298 total)
At 500 visitors per day, this test would take about 33 days to complete.
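The z-scores in step 2 need not be looked up in a table: Python's standard library can derive them from the significance level and power. A small sketch using `statistics.NormalDist` (variable names are my own):

```python
from statistics import NormalDist

alpha = 0.05  # significance level (two-tailed)
power = 0.80  # statistical power (1 - beta)

z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # two-tailed critical value
z_beta = NormalDist().inv_cdf(power)           # power z-score

print(round(z_alpha, 2), round(z_beta, 2))  # → 1.96 0.84
```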
Understanding the Key Parameters
Baseline Conversion Rate
Your current conversion rate before running the test, typically measured from historical data. For a given relative lift, lower baseline rates generally require larger sample sizes, because the same percentage improvement translates into a smaller absolute difference, which is harder to detect.
Minimum Detectable Effect (MDE)
The smallest improvement you care about detecting. This can be expressed as an absolute change (e.g., +2 percentage points) or a relative change (e.g., +10% lift). Smaller effects require quadratically more samples to detect: halving the MDE roughly quadruples the required sample size. Choose an MDE that is both statistically detectable and practically meaningful for your business.
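The quadratic relationship is easy to check numerically. A quick sketch (the inline computation mirrors the two-proportion formula given earlier in this article, with z-scores 1.96 and 0.84; the function name is my own):

```python
import math

def n_per_group(p1, p2, za=1.96, zb=0.84):
    # Two-proportion sample-size formula with pooled p̄ = (p1 + p2) / 2.
    pb = (p1 + p2) / 2
    s = za * math.sqrt(2 * pb * (1 - pb)) + zb * math.sqrt(p1 * (1 - p1) + p2 * (1 - p2))
    return math.ceil(s ** 2 / (p1 - p2) ** 2)

n_1pp = n_per_group(0.05, 0.06)  # 1 percentage point MDE
n_2pp = n_per_group(0.05, 0.07)  # 2 percentage point MDE
print(n_1pp, n_2pp)  # halving the MDE roughly quadruples n
```

The ratio is not exactly 4 because the variance terms also shift with p₂, but it is close.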
Statistical Power (1 - β)
The probability of correctly detecting a real effect. A power of 80% means that if the true effect is at least as large as your MDE, the test will declare significance 80% of the time. Higher power requires more samples. The industry standard is 80%, though some teams use 90% for high-stakes decisions.
Significance Level (α)
The probability of a false positive — declaring a winner when there is no real difference. A significance level of 5% means there is a 1-in-20 chance of a false alarm. Lower significance levels require more samples. The standard is 5% (α = 0.05).
One-Tailed vs. Two-Tailed Tests
A two-tailed test checks whether the variant is significantly different from the control in either direction (better or worse). A one-tailed test only checks one direction. Two-tailed tests are the default because they protect against shipping harmful changes. One-tailed tests require fewer samples but should only be used when you are certain about the direction of the effect.
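The saving from a one-tailed test comes entirely from the smaller critical value: for α = 5%, Zα drops from 1.96 (two-tailed) to about 1.645 (one-tailed). A sketch quantifying the difference for the 5% → 6% scenario (the function and its names are my own, using the article's formula):

```python
import math

def n_per_group(p1, p2, z_alpha, z_beta=0.84):
    # Two-proportion sample-size formula, 80% power by default.
    p_bar = (p1 + p2) / 2
    s = (z_alpha * math.sqrt(2 * p_bar * (1 - p_bar))
         + z_beta * math.sqrt(p1 * (1 - p1) + p2 * (1 - p2)))
    return math.ceil(s ** 2 / (p1 - p2) ** 2)

n_two = n_per_group(0.05, 0.06, z_alpha=1.96)   # two-tailed, alpha = 5%
n_one = n_per_group(0.05, 0.06, z_alpha=1.645)  # one-tailed, alpha = 5%
print(n_two, n_one)  # the one-tailed test needs roughly 20% fewer visitors
```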
Multiple Variants and the Bonferroni Correction
When testing more than one variant against a control (e.g., A/B/C testing), the chance of a false positive increases with each comparison. The Bonferroni correction addresses this by dividing the significance level by the number of comparisons. For example, in an A/B/C test with α = 5%, each of the 2 comparisons uses an adjusted α of 2.5%. This maintains the overall false positive rate at the desired level.
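In practice the correction simply shrinks α before the z-score lookup, which raises the critical value and therefore the required sample size per comparison. A sketch using `statistics.NormalDist` (variable names are my own):

```python
from statistics import NormalDist

alpha = 0.05
num_comparisons = 2  # A/B/C test: two variants, each compared to the control

alpha_adj = alpha / num_comparisons              # Bonferroni: 0.025 per comparison
z_adj = NormalDist().inv_cdf(1 - alpha_adj / 2)  # two-tailed critical value

print(alpha_adj, round(z_adj, 2))  # → 0.025 2.24
```

The critical value rises from 1.96 to about 2.24, so each comparison needs more visitors than a plain A/B test would.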
Frequently Asked Questions
How many people do I need for an A/B test?
It depends on your baseline conversion rate, the minimum effect you want to detect, and your chosen statistical power and significance level. As a rough guide, detecting a 1 percentage point lift on a 5% baseline at 80% power and 5% significance requires about 8,150 visitors per variation. Use the calculator above to get an exact number for your scenario.
What sample size do I need for 95% confidence?
A 95% confidence level corresponds to a 5% significance level (α = 0.05), which is the industry standard. The required sample size also depends on your baseline rate, minimum detectable effect, and statistical power. For example, with a 10% baseline rate and a 1 percentage point MDE at 80% power, you would need roughly 14,700 visitors per group.
What happens if my sample size is too small?
Running an A/B test with an insufficient sample size means your test is underpowered. This leads to a high risk of false negatives — failing to detect a real effect. It also makes any observed results unreliable and more susceptible to random noise. Always calculate the required sample size before starting your test.
How long should I run my A/B test?
The minimum duration depends on your required sample size and daily traffic. Divide the total required sample size by your daily visitor count. Additionally, it's best practice to run tests for at least one full business cycle (typically 1–2 weeks) to account for day-of-week effects, even if you reach the required sample size sooner.
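The duration calculation is simple division plus a floor for the business cycle. A sketch, assuming a requirement of 8,149 visitors per variation (the formula's output for a 5% → 6% lift at 80% power), 500 total daily visitors, and a 14-day business cycle:

```python
import math

per_variation = 8_149  # assumed requirement (5% -> 6%, 80% power, 5% significance)
variations = 2
daily_visitors = 500   # total traffic entering the test per day

min_days = math.ceil(per_variation * variations / daily_visitors)
days_to_run = max(min_days, 14)  # at least one full business cycle (assumed 14 days)
print(min_days, days_to_run)  # → 33 33
```

Here the sample-size requirement already exceeds the two-week minimum, so it alone determines the duration.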
Should I use a one-tailed or two-tailed test?
Use a two-tailed test in most cases. A two-tailed test detects effects in both directions (improvements and regressions), which is important because a change that you expect to improve metrics can sometimes make things worse. Only use a one-tailed test if you are certain the effect can only go in one direction and you do not care about detecting effects in the opposite direction.
What is the Bonferroni correction and when do I need it?
The Bonferroni correction adjusts the significance level when you test multiple variants against a single control (e.g., an A/B/C test). Without correction, the probability of at least one false positive increases with each additional comparison. The correction divides your significance level by the number of comparisons. For example, an A/B/C test involves 2 comparisons (each variant against the control), so with α = 5% the adjusted α per comparison is 2.5%. This calculator applies the correction automatically when you set more than 2 variations.