Advanced A/B Testing - Jax London 2015

Post on 13-Apr-2017



Aviran Mordo, Head of Engineering

@aviranm | linkedin.com/in/aviran | aviransplace.com

Advanced A/B Testing

Wix In Numbers

Over 72M users + 1.5M new users/month

Static storage is >2PB of data

3 data centers + 3 clouds (Google, Amazon, Azure)

2B HTTP requests/day

1000 people work at Wix

729 experiments in Aug ’15

Basic A/B testing

Experiment driven development

PETRI – Wix’s 3rd generation open source experiment system

Challenges and best practices

Complexities and effect on product

Agenda

A/B Test

To B or NOT to B?

Home page results (How many registered)

Experiment Driven Development

This is the Wix editor

Gallery manager – what can we improve?

Is this better?

It’s not about winning – it’s about NOT LOSING

Product Experiments Toggles & Reporting

Infrastructure

How do you know what is running?

If I “know” it is better, do I really need to test it?

Why so many?

Sign-up → Choose Template → Edit site → Publish → Premium

The theory

Result = Fail

Intent matters

EVERY new feature is A/B tested

Measure success

If flawed, the impact is limited to a percentage of our users

Conclusion

Start with 50% / 50% ?

New code can have bugs

Conversion can drop

Usage can drop

Unexpected cross-test dependencies

Sh*t happens (Test could fail)

Numbers look good but sample size is small

We need more data!

Expand

Reaching statistical significance

Test Group (B): 25% → 50% → 75% → 100%

Control Group (A): 75% → 50% → 25% → 0%
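Expanding the test group is only worthwhile once the numbers are statistically significant. A minimal sketch of the underlying check, using a standard two-proportion z-test (the slides don't show Petri's own significance tooling; this is generic statistics with the Python stdlib only):

```python
import math

def two_proportion_z(conv_a, n_a, conv_b, n_b):
    """Z statistic for H0: the two conversion rates are equal."""
    p_a, p_b = conv_a / n_a, conv_b / n_b
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    return (p_b - p_a) / se

def p_value(z):
    """Two-sided p-value from the normal CDF."""
    return 2 * (1 - 0.5 * (1 + math.erf(abs(z) / math.sqrt(2))))

# A 10% vs 12% conversion lift on 1,000 users per group is NOT significant...
small = p_value(two_proportion_z(100, 1000, 120, 1000))

# ...but the same rates over 20x more users are.
large = p_value(two_proportion_z(2000, 20000, 2400, 20000))
```

This is why "numbers look good but sample size is small" leads to "we need more data": the same observed lift can be noise at 1,000 users and a clear win at 20,000.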

Language, GEO, Browser, User-agent, OS

Minimize affected users (in case of failure) Gradual exposure (percentage of…)

Company employees

User roles

Any other criteria you have (extendable)

All users
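The filter list above (language, GEO, user roles, and so on) suggests a chain of composable predicates. A hypothetical sketch of how such extendable eligibility criteria can be combined; all names here are illustrative assumptions, not Petri's actual API:

```python
# Each filter factory returns a predicate over a user-context dict.
def language_is(lang):
    return lambda ctx: ctx.get("language") == lang

def geo_in(*countries):
    return lambda ctx: ctx.get("country") in countries

def role_is(role):
    return lambda ctx: ctx.get("role") == role

def eligible(ctx, filters):
    """A user context must pass every filter to enter the experiment."""
    return all(f(ctx) for f in filters)

# Example: an experiment exposed only to English-speaking company employees
# in GB or US (hypothetical criteria, matching the slide's examples).
employee_only = [language_is("en"), geo_in("GB", "US"), role_is("company-employee")]
ctx = {"language": "en", "country": "GB", "role": "company-employee"}
```

Making filters plain predicates keeps the system extendable: "any other criteria you have" is just one more function in the list.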

First time visitors = Never visited wix.com

New registered users = Untainted users

Existing registered users = Already familiar with the service

Not all users are equal

Signed-in user: test group is determined by the user ID, guaranteeing toss consistency across browsers

Anonymous user (home page): test group is randomly determined; a consistent experience across browsers cannot be guaranteed

11% of Wix users use more than one desktop browser

Keeping consistent UX
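The toss-consistency guarantee for signed-in users can be sketched with a stable hash of the user ID: bucket membership then survives both browser changes and exposure ramp-ups. The hashing scheme and function names below are illustrative assumptions, not Petri's actual code:

```python
import hashlib
import random

def bucket(user_id, experiment):
    """Stable 0-99 bucket derived from user ID + experiment key."""
    digest = hashlib.md5(f"{experiment}:{user_id}".encode()).hexdigest()
    return int(digest, 16) % 100

def assign(user_id, experiment, exposure_pct):
    """Signed-in users get a deterministic toss; anonymous users a random one."""
    if user_id is None:  # anonymous: no stable identity to hash
        return "B" if random.randrange(100) < exposure_pct else "A"
    return "B" if bucket(user_id, experiment) < exposure_pct else "A"
```

Because buckets are stable, a user who landed in group B at 25% exposure is still in B at 50% and 75%, which is exactly what the gradual ramp-up needs; and the same user ID yields the same group on every browser.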

Which Test Group are You?

Non-Developers Developers

Test Group A Test Group B

Calling Laboratory is Easy

Start new experiment

We need that feature

…and failure is not an option

Adding a mobile view

First trial failed – performance had to be improved

Halting the test results in loss of data. What can we do about it?

Solution – Pause!

• Maintain NEW experience for already exposed users

• No additional users will be exposed to the NEW feature
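Pause semantics boil down to persisting assignments and refusing new ones. A minimal sketch under those two rules; the class shape and storage are assumptions, not Petri's implementation:

```python
class Experiment:
    """Assigns users to A/B; pausing freezes the exposed population."""

    def __init__(self):
        self.assignments = {}  # user_id -> group (persisted storage in reality)
        self.paused = False

    def conduct(self, user_id, toss):
        if user_id in self.assignments:  # already exposed: keep their experience
            return self.assignments[user_id]
        if self.paused:                  # no NEW users join the test group
            return "A"
        group = "B" if toss else "A"
        self.assignments[user_id] = group
        return group
```

Halting outright would throw the collected data away; pausing lets the team improve the code and resume the experiment with the exposed population intact.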

Decision (what to do with the data):

Keep feature

Drop feature

Improve code & resume experiment

Keep backwards compatibility for exposed users forever?

Migrate users to another equivalent feature

Drop it altogether (users lose data/work)

Robots are users too!

Always exclude robots

Don’t let Google index a losing page

Don’t let bots affect statistics
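A crude user-agent check is enough to illustrate the exclusion rule; real bot detection is much broader, and the marker list here is an assumption for the sketch:

```python
BOT_MARKERS = ("bot", "crawler", "spider", "slurp")

def is_bot(user_agent):
    """Heuristic: common crawler markers in the user-agent string."""
    ua = user_agent.lower()
    return any(marker in ua for marker in BOT_MARKERS)

def group_for(user_agent, toss):
    """Robots always see the control page and stay out of the statistics."""
    if is_bot(user_agent):
        return "A"
    return "B" if toss else "A"
```

Pinning robots to the control group covers both slide points at once: Google never indexes a losing variant, and crawler traffic never inflates either group's numbers.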

There is MORE than one

# of active experiments | Possible # of states
10 | 1,024
20 | 1,048,576
30 | 1,073,741,824

Possible states >= 2^(# of experiments)

Wix has ~600 active experiments → ~4.149516e+180 possible states
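The state-count arithmetic behind the table, checked in a few lines: each on/off experiment at least doubles the number of reachable system states, so the count grows as 2^n.

```python
def possible_states(n_experiments):
    """Lower bound on system states: each experiment doubles the state space."""
    return 2 ** n_experiments

print(possible_states(10))            # 1024
print(possible_states(30))            # 1073741824
print(f"{possible_states(600):.6e}")  # 4.149516e+180, the slide's figure
```

This is why no one is "really testing the entire system": at Wix's scale the full state space cannot even be enumerated, let alone exercised.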

Integrated into the product

Why should product managers care about the

system architecture

Share document with other users

Document owner is part of a test that enables a new video component

?

What will the other user experience when editing a shared document ?

Owner Friend

Assignment may be different than owner’s

Owner (B) Friend (A)

Enable features by existing content – what will happen when you remove a component?

Enable features by document owner’s assignment – the friend now expects to find the new feature on his own docs

Exclude experimental features from shared documents

You are not really testing the entire system

Possible solutions

A/B testing introduces complexity

Petri is more than just an A/B test framework

Feature toggle

A/B Test

Personalization

Internal testing

Continuous deployment

Jira integration

Experiments

Dynamic configuration

QA

Automated testing

Q&A

Aviran Mordo, Head of Engineering

@aviranm | www.linkedin.com/in/aviran | http://www.aviransplace.com

https://github.com/wix/petri

http://goo.gl/yMUzta

Credits:

http://upload.wikimedia.org/wikipedia/commons/b/b2/Fiber_optics_testing.jpg
http://goo.gl/nEiepT
https://www.flickr.com/photos/ilo_oli/2421536836
https://www.flickr.com/photos/dexxus/5791228117
http://goo.gl/SdeJ0o
http://weltender.deviantart.com/art/Pause-relax-think-159996895
https://www.flickr.com/photos/112923805@N05/15005456062
https://www.flickr.com/photos/wiertz/8537791164
https://www.flickr.com/photos/laenulfean/5943132296
https://www.flickr.com/photos/torek/3470257377
https://www.flickr.com/photos/i5design/5393934753
https://www.flickr.com/photos/argonavigo/5320119828

Modeled experiment lifecycle

Open source (developed using TDD from day 1)

Running at scale on production

No deployment necessary

Both back-end and front-end experiment

Flexible architecture

Why Petri