Questions tagged [ab-testing]

A/B testing, split testing, or bucket testing is a controlled comparison of the effectiveness of variants of a website, email, or other commercial product.

From Wikipedia:

A/B testing, split testing or bucket testing is a method of marketing testing by which a baseline control sample is compared to a variety of single-variable test samples in order to improve response rates. A classic direct mail tactic, this method has been recently adopted within the interactive space to test tactics such as banner ads, emails and landing pages.

The response (dependent) variable is most often count data (such as clicks on links or sales) but may be a continuous measure (like time on site). Count data are sometimes converted to rates (for example, conversions per visitor) for analysis.
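As a rough illustration of turning counts into rates, the sketch below compares two variants with a pooled two-proportion z-test. The function name `two_proportion_z_test` and the sample counts are made up for this example; the z-test is one common choice for rate data, not the only valid analysis.

```python
import math


def two_proportion_z_test(conv_a: int, n_a: int, conv_b: int, n_b: int):
    """Compare conversion rates of variants A and B (hypothetical helper).

    Returns (rate_a, rate_b, z, two_sided_p_value).
    """
    # Counts are transformed to rates before comparison.
    rate_a = conv_a / n_a
    rate_b = conv_b / n_b

    # Pooled rate under the null hypothesis that both variants share one rate.
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    if se == 0:
        raise ValueError("standard error is zero; rates are all 0 or all 1")

    z = (rate_b - rate_a) / se
    # Two-sided p-value from the standard normal: 2 * (1 - Phi(|z|)).
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return rate_a, rate_b, z, p_value


# Illustrative numbers only: 100 of 1000 visitors convert on A, 130 of 1000 on B.
rate_a, rate_b, z, p = two_proportion_z_test(100, 1000, 130, 1000)
print(f"A: {rate_a:.3f}  B: {rate_b:.3f}  z = {z:.2f}  p = {p:.4f}")
```

For a continuous response such as time on site, a t-test or a bootstrap would be a more natural starting point than this rate comparison.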

Because they create temporary variants of 'live' websites, online A/B tests must overcome several challenges not common in traditional experiments of human preference. For example, differential caching of test versions may degrade website performance for some versions. Users may be shown multiple variants if they return to a website and are not successfully identified by cookies or login information. Moreover, nonhuman activity (search engine crawlers, email harvesters, and botnets) may be mistaken for human users.
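Two of these challenges can be sketched in a few lines. The example below assumes a stable user identifier is available (from a cookie or login) and hashes it so that a returning user is always assigned to the same variant; it also applies a deliberately naive user-agent check for crawlers. The names `assign_variant`, `looks_like_bot`, and `BOT_MARKERS` are hypothetical, and real bot detection needs far more than substring matching.

```python
import hashlib

# Hypothetical, intentionally short list of substrings seen in crawler user agents.
BOT_MARKERS = ("bot", "crawler", "spider")


def assign_variant(user_id: str, experiment: str, variants=("A", "B")) -> str:
    """Deterministically map a user to a variant for a given experiment."""
    # Including the experiment name keeps assignments independent across experiments.
    key = f"{experiment}:{user_id}".encode("utf-8")
    digest = hashlib.sha256(key).hexdigest()
    bucket = int(digest, 16) % len(variants)
    return variants[bucket]


def looks_like_bot(user_agent: str) -> bool:
    """Naive check: flag user agents containing a known crawler marker."""
    ua = user_agent.lower()
    return any(marker in ua for marker in BOT_MARKERS)


# The same identified user gets the same variant on every visit.
first_visit = assign_variant("user-42", "landing-page-test")
return_visit = assign_variant("user-42", "landing-page-test")
print(first_visit == return_visit)

# Traffic flagged as nonhuman can be excluded before analysis.
print(looks_like_bot("Googlebot/2.1 (+http://www.google.com/bot.html)"))
```

Hash-based assignment only helps when the identifier itself survives between visits; users who clear cookies or switch devices can still end up in a different variant.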

Useful References:

Kohavi, Ron, Randal M. Henne, and Dan Sommerfield. "Practical Guide to Controlled Experiments on the Web: Listen to Your Customers not to the HiPPO." (2007).

Kohavi, Ron, et al. "Trustworthy online controlled experiments: five puzzling outcomes explained." Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2012.

439 questions
2
votes
1 answer

From which step does Firebase split the traffic by variants in AB-test

If I have a productOpen activation event (a custom chat-opening event inside the app), it starts counting from this event to evaluate the results as stated in Firebase ab-testing documentation. The question is, at what point does the traffic split…
2
votes
1 answer

How to compute the confidence interval of the Difference in Differences method using Python?

I'm trying to analyze the total active minutes per user before and after an experiment. Here I've included the associated user data before and after the experiment - variant_number = 0 indicates control group while 1 means treatment group.…
2
votes
1 answer

Force a specific test-variant as preview in google optimize

Is it possible to force a specific test variant to be shown in the browser/preview? I have an A/B test that is already running. Now I want to see only variant A or variant B. How can I force a specific variant? Maybe with some URL parameters.
Jörg Steinhauer
2
votes
0 answers

Checking if Frequentist approach is correct? Bayesian approach using MCMC for AB test. How to calculate Bayes Factors in Python?

I've been trying to get my head around Frequentist and Bayesian approaches for a toy data AB test problem. The results don't really make sense to me. I am struggling to understand the results, or whether I have computed them (in)correctly (which is…
mad-a
2
votes
1 answer

Firebase Revenue AB testing algorithm

We ran an A/B test in Firebase, which has the following results: I was also building my own Bayesian A/B test suite and was wondering how they came to these conclusions. What I was doing was querying the data of this test for the Control Group…
Boris Mulder
2
votes
0 answers

Online A/B test for recommendation system?

I was able to develop a couple of algorithms for my recommendation system that I want to apply to an e-commerce website. My goal is to run a live A/B test to check which system performs better. I would not rely only on offline metrics. Does google…
lcasucci
2
votes
1 answer

A/B testing in Sitecore - null datasource error in Glass Mapper

I have a problem with Sitecore and Glass Mapper. We are using Glass Mapper in our project and everything works fine, but when I try to set up an A/B test I get an exception about a null datasource. The strange thing is that if I check the "Test the…
2
votes
1 answer

Firebase Remote Config when to load on iOS to receive A/B test variant

It seems that all examples from Firebase suggest reloading the Remote Config in didFinishLaunching. It looks like a great idea, BUT I see an issue with it. When I run this setup in the simulator, the first fetch arrives with the default values set…
2
votes
0 answers

Reactjs Dom manipulation from outside

What is the neatest way to update the DOM content of a rendered component from outside the React environment? Example: I have a ReactJS app fully rendered in the browser, and I have a separate JS file that I want to use for experiments to update…
Arun Kumar
2
votes
2 answers

Firebase experiments count limit

Currently, in the Firebase console for my project, I have 2 remote config experiments (running), 4 notification experiments (running), 3 remote config experiment drafts, and 0 completed experiments. When I press to start one more remote config…
x90
2
votes
0 answers

Code Sample - ReactJS - Server side rendering - Adobe Test and Target

We have a React app and the page is rendered on the server side. We are trying to render different React components (A or B or C) based on the Adobe Test and Target A/B test. The approach is that Node will communicate with Adobe TNT to find the experience…
Sabha B
2
votes
1 answer

No traffic in experiment report

I'm testing Optimizely and created an implementation in my reactjs project. The optimizely script is loaded only when required (lazyLoad), so for that reason, when creating a new experiment in the Dashboard I get the "snippet not present in the…
punkbit
2
votes
1 answer

Firebase for A/B testing with remote config

I need to run A/B testing and I was thinking about using Firebase Remote Config. I could not find whether it is possible to update the app while keeping the value for the A/B config between versions. What I mean is that if I publish the app and I target…
1048576
2
votes
1 answer

EventSourcing + A/B testing

I am trying to solve an interesting theoretical and philosophical question about event sourcing, event versioning, and A/B testing, and would like to get some advice and solutions from practice. One of the primary benefits of event sourcing is easy and simple…
Vladislav Ihost
2
votes
1 answer

Significance Testing in R

I am trying to determine whether there is a significant difference between two interfaces. I have a text file that looks like this: group conversion A 0 A 0 A 1 A 0 A 0 A 1 A 1 A 0 A 0 A 1 A 1 A 1 A 1 A 1 A 1 A 0 A …
Tibia Rook