Articles on A/B testing

In-depth articles on A/B testing with primary focus on statistical methods applied to online experimentation. Written in an accessible language targeted at conversion optimization practitioners the articles also go into deep technical topics where necessary.

What Can Be Learned From 1,001 A/B Tests?

How long does a typical A/B test run for? What percentage of A/B tests result in a ‘winner’? What is the average lift achieved in online controlled experiments? How good are top conversion rate optimization specialists at coming up with impactful interventions for websites and mobile apps? This meta-analysis of 1,001 A/B tests analyzed using […] Read more…

Also posted in AGILE A/B testing, Conversion optimization | Tagged conversion rate optimization, efficient ab testing, lift, meta analysis, minimum detectable effect, minimum effect of interest, online ab testing, sequential testing, statistical power, test duration

When Session-Based Metrics Lie

In online A/B testing it is not uncommon to see session-based metrics being used as the primary performance indicator. Session-based conversion rates and session-based averages (like average revenue per session, in likeness to ARPU) are often reported by default in software by prominent vendors, including Google Optimize and Google Analytics. This widespread availability makes session-based […] Read more…

Also posted in Conversion optimization | Tagged ab testing, conversion rate optimization, user based metrics, user metrics

A/B Testing Statistics – A Concise Guide for Non-Statisticians

Navigating the maze of A/B testing statistics can be challenging. This is especially true for those new to statistics and probability. One reason is the obscure terminology popping up in every other sentence. Another is that the writings can be vague, conflicting, incomplete, or simply wrong, depending on the source. Articles sprinkled with advanced math, […] Read more…

Also posted in Statistics | Tagged ab testing, confidence intervals, p-value, statistical confidence, statistical power, statistical significance

P-values and Confidence Intervals Explained

Hundreds if not thousands of books have been written about both p-values and confidence intervals (CIs) – two of the most widely used statistics in online controlled experiments. Yet, these concepts remain elusive to many otherwise well-trained researchers, including A/B testing practitioners. Misconceptions and misinterpretations abound despite great efforts from statistics educators and experimentation evangelists. […] Read more…

Also posted in Statistical significance, Statistics | Tagged ab testing, confidence intervals, confidence threshold, p-value, random variable, statistical significance, variability

Top Misconceptions About Scientific Rigor in A/B Testing

Have you ever thought that statistically rigorous A/B tests are impractical? Or do you have trouble selling the need for rigor in testing to your clients, coworkers, or boss? This article debunks the top five myths about the necessity and difficulties of applying scientific method in online A/B testing. Read more…

Also posted in Conversion optimization | Tagged ab testing, business experiments, confidence threshold, conversion rate optimization, scientific method, statistical confidence, statistical significance

Improve your A/B tests with 9 lessons from the COVID-19 vaccine trials

Lessons for Online AB Testing from the COVID Vaccine Trials

A full year has passed since results from the first clinical trials of COVID-19 vaccines became available. Vast swaths of follow-up observational data are now in circulation which enables some conclusions about the strengths and weaknesses of those trials. Since modern medical trials are among the most rigorous and scrutinized experiments they can serve as […] Read more…

Also posted in Statistics | Tagged ab testing tips, clinical trial design, clinical trials, design of experiments, medical trials, statistical analysis

The new standard for planning and analyzing A/B tests is here

The first major overhaul of Analytics Toolkit since its release in early 2014 has finally arrived and it brings with it solutions to many of the hard questions facing practitioners when planning and analyzing A/B tests. Conducting statistically rigorous tests while achieving the best return on investment go hand in hand in the new Toolkit. […] Read more…

Also posted in AGILE A/B testing, Analytics-Toolkit.com, Conversion optimization, Statistics | Tagged a/b testing calculators, ab testing, ab testing calculator, agile ab testing, efficient ab testing

Bayesian Probability and Nonsensical Bayesian Statistics in A/B Testing

Bayesian Probability and Bayesian Statistics in AB Testing

Many adherents of Bayesian methods put forth claims of superiority of Bayesian statistics and inference over the established frequentist approach based mainly on the supposedly intuitive nature of the Bayesian approach. Rational thinking or even human reasoning in general is Bayesian by nature according to some of them. Others argue that proper decision-making is inherently […] Read more…

Also posted in Bayesian A/B testing, Conversion optimization, Statistics | Tagged bayesian ab testing, bayesian inference, bayesian probability, bayesian statistics, frequentist inference, frequentist statistics

The Perils of Using Google Analytics User Counts in A/B Testing

Google Analytics User Data in AB Testing

(Updated Sep 2022 & Jan 2023 to include updated information about Google Analytics 4) Many analysts, marketers, product managers, UX and CRO professionals nowadays rely on user counts provided by Google Analytics, Adobe Analytics, or similar tools, in order to perform various statistical analyses. Such analyses may involve the statistical hypothesis tests and estimations part […] Read more…

Also posted in Google Analytics, Statistics | Tagged ab testing, adobe analytics, cardinality estimates, cardinality estimation, google analytics, hyperloglog, hyperloglog++, user counts, user metrics

The Effect of Using Cardinality Estimates Like HyperLogLog in Statistical Analyses

This article will examine the effects of using the HyperLogLog++ (HLL++) cardinality estimation algorithm in applications where its output serves as input for statistical calculations. A prominent example of such a scenario can be found in online controlled experiments (online A/B tests) where key performance measures are often based on the number of unique users, […] Read more…

Also posted in Google Analytics, Statistics | Tagged cardinality estimates, cardinality estimation, hyperloglog, hyperloglog++, sample ratio mismatch, statistical analysis, statistical significance

Error Spending in Sequential Testing Explained

Sequential Hypothesis Testing with Efficacy and Futility Bound

Sequential analysis of experimental data from A/B tests has been quite prominent in recent years due to the myriad of Bayesian solutions offered by big industry players. However, this type of sequential analysis is not sequential testing proper as these solutions have generally abandoned the idea of testing and therefore error control, substituting it for […] Read more…

Also posted in AGILE A/B testing, Statistics | Tagged alpha spending, alpha spending functions, optional stopping, peeking, sequential testing, sequential tests, statistical significance

Frequentist vs Bayesian Inference

In this article I’m revisiting* the topic of frequentist vs Bayesian inference with specific focus on online A/B testing as usual. The present discussion easily generalizes to any area where we need to measure uncertainty while using data to guide decision-making and/or business risk management. In particular, I will discuss each of the following five […] Read more…

Also posted in Bayesian A/B testing, Statistics | Tagged bayesian ab testing, bayesian inference, bayesian statistics, error statistics, frequentist inference, frequentist statistics, model assumptions, peeking

Search

Browse by topic

Browse by year

The book on user testing

Take your A/B testing program to the next level

Learn more

Articles on A/B testing

Search

Recent articles

Browse by topic

Browse by year