Questions tagged [ab-testing]

A/B testing, split testing, or bucket testing is a controlled comparison of the effectiveness of variants of a website, email, or other commercial product.

From Wikipedia:

A/B testing, split testing or bucket testing is a method of marketing testing by which a baseline control sample is compared to a variety of single-variable test samples in order to improve response rates. A classic direct mail tactic, this method has been recently adopted within the interactive space to test tactics such as banner ads, emails and landing pages.

The Response or Dependent Variable is most often count data (such as clicks on links or sales) but may be a continuous measure (like time on site). Count data is sometimes transformed to rates for analysis.

Because they create temporary variants of 'live' websites, on-line A/B tests must overcome several challenges not common in traditional experiments of human preference. For example, differential caching of test versions may degrade website performance for some versions. Users may be shown multiple variants if they return to a website and are not successfully identified with cookies or by login information. Moreover, nonhuman activity (search engine crawlers, email harvesters, and botnets) may be mistaken for human users.

Useful References:

Kohavi, Ron, Randal M. Henne, and Dan Sommerfield. "Practical Guide to Controlled Experiments on the Web: Listen to Your Customers not to the HiPPO." (2007).

Kohavi, Ron, et al. "Trustworthy online controlled experiments: five puzzling outcomes explained." Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2012.

439 questions
7
votes
3 answers

Rails 3 A/B Split Test System Without Redis?

I'm looking for an easy way to integrate split testing into my Rails 3 application. I've researched and found two alternatives that seem to be up to date for Rails 3... Vanity: http://vanity.labnotes.org Split:…
7
votes
3 answers

Confidence interval for the difference between two proportions in Python

For example, in an AB test the A population could have 1000 data points, of which 100 are successes. While B could have 2000 data points and 220 successes. This gives A a success proportion of 0.1 and B 0.11, the delta of which is 0.01. How can I…
Johnny V
  • 1,108
  • 14
  • 21
7
votes
3 answers

Remote Config A/B Test does not provide results on iOS

I have created and started an A/B Test on Firebase Remote Config 2 days ago on my iOS app with this code: [FIRApp configure]; [FIRRemoteConfig.remoteConfig fetchWithCompletionHandler:^(FIRRemoteConfigFetchStatus status, NSError * _Nullable error) { …
willy
  • 490
  • 4
  • 11
7
votes
2 answers

In-App-Purchase: A/B-testing possible/allowed?

Is it possible and allowed to have A/B-testing for in-app-purchase for iOS and Android? I heard it's not allowed for iOS but didn't find an official document. What about Android?
swalkner
  • 16,679
  • 31
  • 123
  • 210
7
votes
1 answer

Angular and A/B testing

I need to add A/B testing functionality to a large application built with Angular 2 and bundled with Webpack. Tools like VWO are not flexible enough for us. Any online guides or tips? Thanks.
Tzach Ovadia
  • 1,278
  • 9
  • 18
7
votes
1 answer

How to calculate ab testing sample size

Do you know formul for calculating ab testing sample size per variation base on: Baseline Conversion Rate Minimum Detectable Effect Statistical Significance Statistical Power It is a few online tool to calculate the sample…
Przemek
  • 208
  • 2
  • 8
7
votes
2 answers

A/B testing with ember.js

I've found absolutely nothing on Google with regard to A/B testing with a client-side framework such as ember.js. The goal is to serve up adjusted content (different nav items, header phrasing etc.) in order to A/B test our UI/UX. I should note that…
Jeriko
  • 6,547
  • 4
  • 28
  • 40
6
votes
1 answer

Cumulated amount of event in Firebase A/B testing

I had made new menu for my app and I have made A/B testing to optimize my revenue. I have set ad_impression as a goal. I see in A/B testing console that new menu is worse for ad_impression:  I have logged those two group of users with User…
Mateusz Kaflowski
  • 2,221
  • 1
  • 29
  • 35
6
votes
1 answer

sample size for A/B fisher test significance

Given the results for a simple A / B test... A B clicked 8 60 ignored 192 1940 ( ie a conversation rate of A 4% and B 3% ) ... a fisher test in R quite rightly says there's no significant difference > fisher.test(data.frame(A=c(8,192),…
mat kelcey
  • 3,077
  • 2
  • 30
  • 35
5
votes
0 answers

The variants disproportion of Firebase AB Test

We have a problem with the disproportion of AB test variants. In theory, variants should be almost equally exposed for both variants, which actually worked quite good with previous AB tests that we made. As you can see on the screenshots with…
5
votes
2 answers

Firebase AB-testing iOS user targeting with app version not working

The remote config is working fine and trying to target user with version with 'exactly match' operator, the values not reflecting at the client side. The same configuration is working fine for android and without version criteria it is working for…
5
votes
1 answer

How to correctly use Firebase-Analytcs for A/B testing?

Background Google has published a whole new API and services for A/B testing, using Firebase Analytics and Firebase Remote Config. The problem While I do try out the service for other purposes (here and here), I also need to use it for A/B…
android developer
  • 114,585
  • 152
  • 739
  • 1,270
5
votes
0 answers

Multi-armed bandits thompson sampling for non-binary rewards

I use the following line to update my beta distribution in each trial and give arm recommendation (I use scipy.stats.beta) : self.prior = (1.0,1.0) def get_recommendation(self): sampled_theta = [] for i in range(self.arms): …
Wise
  • 628
  • 2
  • 11
  • 25
5
votes
1 answer

Why aren't results from Google Analytics Content Experiments showing?

First of all let me quickly run-down my setup for you. We have multiple domains and for that reason I use tracking code which sends the traffic to 2 different profiles at the same time. One profile tracks that specific domain, and the other is a…
Goldexer
  • 216
  • 2
  • 9
4
votes
1 answer

How to calculate statistical significance of conversion rate of several goals?

I am making a python/django split testing or a/b testing library for my own use. However I don't know how to calculate if my test is statisically significant. I have no knowledge of statistics, so I cannot understand most wikipedia articles, or web…
Amandasaurus
  • 58,203
  • 71
  • 188
  • 248
1 2
3
29 30