Peeking at a/b tests: Why it matters, and what to do about it

R Johari, P Koomen, L Pekelis, D Walsh - Proceedings of the 23rd ACM …, 2017 - dl.acm.org
This paper reports on novel statistical methodology, which has been deployed by the
commercial A/B testing platform Optimizely to communicate experimental results to their …

A/B testing intuition busters: Common misunderstandings in online controlled experiments

R Kohavi, A Deng, L Vermeer - Proceedings of the 28th ACM SIGKDD …, 2022 - dl.acm.org
A/B tests, or online controlled experiments, are heavily used in industry to evaluate
implementations of ideas. While the statistics behind controlled experiments are well …

Always valid inference: Continuous monitoring of a/b tests

R Johari, P Koomen, L Pekelis… - Operations …, 2022 - pubsonline.informs.org
A/B tests are typically analyzed via frequentist p-values and confidence intervals, but these
inferences are wholly unreliable if users endogenously choose samples sizes by …

A dirty dozen: twelve common metric interpretation pitfalls in online controlled experiments

P Dmitriev, S Gupta, DW Kim, G Vaz - Proceedings of the 23rd ACM …, 2017 - dl.acm.org
Online controlled experiments (eg, A/B tests) are now regularly used to guide product
development and accelerate innovation in software. Product ideas are evaluated as …

Trustworthy analysis of online A/B tests: Pitfalls, challenges and solutions

A Deng, J Lu, J Litz - Proceedings of the Tenth ACM International …, 2017 - dl.acm.org
A/B tests (or randomized controlled experiments) play an integral role in the research and
development cycles of technology companies. As in classic randomized experiments (eg …

Objective bayesian two sample hypothesis testing for online controlled experiments

A Deng - Proceedings of the 24th International Conference on …, 2015 - dl.acm.org
As A/B testing gains wider adoption in the industry, more people begin to realize the
limitations of the traditional frequentist null hypothesis statistical testing (NHST). The large …

[HTML][HTML] A/B testing: A systematic literature review

F Quin, D Weyns, M Galster, CC Silva - Journal of Systems and Software, 2024 - Elsevier
A/B testing, also referred to as online controlled experimentation or continuous
experimentation, is a form of hypothesis testing where two variants of a piece of software are …

Data-driven metric development for online controlled experiments: Seven lessons learned

A Deng, X Shi - Proceedings of the 22nd ACM SIGKDD International …, 2016 - dl.acm.org
Online controlled experiments, also called A/B testing, have been established as the mantra
for data-driven decision making in many web-facing companies. In recent years, there are …

Continuous monitoring of A/B tests without pain: Optional stopping in Bayesian testing

A Deng, J Lu, S Chen - 2016 IEEE International conference on …, 2016 - ieeexplore.ieee.org
A/B testing is one of the most successful applications of statistical theory in the Internet age.
A crucial problem of Null Hypothesis Statistical Testing (NHST), the backbone of A/B testing …

[图书][B] A/B testing: The most powerful way to turn clicks into customers

D Siroker, P Koomen - 2015 - books.google.com
How Your Business Can Use the Science That Helped Win the White House The average
conversion rate—the rate at which visitors convert into customers—across the web is only …