Multivariable Testing

How testing multiple changes simultaneously can save you time, speed up your optimization schedule, and increase your profits

By Editorial Staff On Dec 29, 2005

We recently released the audio recording of our clinic on this topic. You can listen to a recording of this clinic here:

MultivariateTesting

Traditional thinking tells us that accurate testing of web pages, email messages, or offline media depends on us changing just one element at a time.

After all, if you change more than one element on a website sales page, and testing shows an increase or decrease in performance, how will you know which element was responsible for the change?

If you are conducting a simple A/B split test, this principle remains true. You need to test just one change at a time.

However, multivariable (or multivariate) testing allows you to test many changes simultaneously – five, ten, or even twenty. You’ll still get accurate results, without having to increase your total sample size, and you will be able to identify the impact of each individual change.

But how does multivariate testing work? Is it reliable? How does it stack up next to A/B testing?

In this brief, working with data from a variety of sources, we will show you how multivariate testing works, and how it may be able to help you dramatically and quickly improve the performance of your site pages and email messages.

Beyond the immediate improvements in page performance, multivariable testing offers two other important benefits.

Accelerated Learning Curve

A single A/B split test can teach you something important about a key website page. Now imagine how much you can learn, within the same timeframe, when you look at the performance impact of ten or twenty distinct changes.
Breakthrough Thinking

When you can test only one change at a time, you are under pressure to think of a “good” change… something you think has a high likelihood of delivering improved results. This can lead to cautious thinking. However, with multivariable testing you can test as many changes as you like. This takes the pressure off and gives you enormous creative latitude, opening the door to breakthrough ideas you might otherwise never have tested.

Is multivariable testing the best choice for every company and in every circumstance? Are there times when a simple A/B split test is exactly what you need?

1. How does a multivariable test differ from an A/B split test?

An A/B test isolates a single page variable and tests two variations against each other. Here are the results of a simple A/B test:

Simple A/B Split Test
	Order Form A	Order Form B
Order Displays	17,598	17,682
Completed Orders	1056	1151
Conversion Rate	6.00%	6.51%

What You Need To UNDERSTAND: The optimized page improved the conversion rate of our test site from 6.00% to 6.51%, an increase of 8.5%. The 87 additional orders were worth approximately $2600 in added revenue.

In a recent brief, we covered A/B split testing in great detail. In order to get the most out of the material below, we recommend you review that article if you are unsure of how to implement effective A/B split testing.

A multivariable test transcends the limitations of a simple A/B test in two ways:

You can test multiple page variables at once.
You can use more than two variations for each variable.

With A/B split testing, it can become tedious to isolate and optimize all of the elements on a page, one at a time. In response, a number of companies have developed testing platforms that will simultaneously test a variety of elements such as graphics, background color, headline text, body copy, “call to action,” and other page constituents. Each of these elements or variables can be tested with two or more variations. The testing software splits incoming traffic among the variations and shows each visitor only one version of the composite page.

Here are the results of a multivariate micro-test. In the results below, the variables are sorted by conversion rate. This was a rather simple test, using only two variations for each of the six variables, resulting in twelve rows of results:

Multivariate Test
	Views	Conversions	Conv. Rate
A1 – Background Color – White	72	29	40.28%
B1 – Call to Action – “Download It Now”	74	29	39.19%
C1 – Drop Shadow – Yes	72	26	36.11%
D1 – Border – No	74	26	35.14%
E1 – Headline – How to Get More Conversions	70	24	34.29%
F1 – Hero Shot – Angled	74	25	33.78%
F2 – Hero Shot – Vertical	72	22	30.56%
E2 – Headline – Hot to Get Better Results	76	23	30.26%
D2 – Border – Yes	72	21	29.17%
C2 – Drop Shadow – No	74	21	28.38%
B2 – Call to Action – Click Here	72	18	25.00%
A2 – Background Color – Opaque	74	18	24.32%

What You Need To UNDERSTAND: In this multivariate test, six page elements were tested simultaneously. Background color had the largest impact on conversion rate (an increase of 65.6%), followed by call to action (an increase of 56.8%).

The above test was implemented using software provided by Vertster.

Again, this was a rather simple test, using only two variations for each of the six variables, resulting in twelve rows of results.

Multivariate testing, though, can make it practical to test with many more variables and variations of each, although by increasing the variables and values, the number of visitors required for a conclusive test increases exponentially. Section Four, below, discusses how providers of multivariate testing platforms have addressed this problem.

In a subsequent multivariate test, the headline proved to be one of the most important factors that impacted the effectiveness of the page. Here are the results from just the six headline variations:

Multivariate Test – Headline Results
Headline	Views	Conversions	Conv. Rate	Benefit %
1	306	62	20.26%	20.39%
2	315	57	18.10%	7.52%
3	306	49	16.01%	– 4.85%
4	299	45	15.05%	– 10.57%
5	303	40	13.20%	– 21.56%
6	313	57	18.21%	8.21%

What You Need To UNDERSTAND: When weighed against the original page, headline 1 resulted in a 20.39% increase in conversion.

You can also view a screenshot showing the actual Vertster interface which generated these results:

http://meclabs.com/multivariate_headline.gif

So with all of the above, why wouldn’t you just convert to 100% multivariable tests? Multivariable testing is not the ultimate tool, but rather one of several you should use to optimize your marketing. There are times when a simple A/B test will prove superior.

A/B split testing offers the following advantages:

A/B testing is quick and easy to set up. You can think of a new page headline over lunch, implement a test, and see results within a few hours or days.
A/B testing offers unambiguous results. If you are testing only a single page variable with two possibilities, the results are typically available quickly and the best course of action is usually quite clear.
If you are optimizing pages for a new website, or for a client, you may want to test the existing page against a page containing all of the “best practices” you have discovered for designing landing pages, order pages, etc. The MarketingExperiments.com library of past experiments should be a helpful guide for identifying best practices in each of these areas.

Multivariable testing should be used in different circumstances:

It takes some time to set up a multivariable test. You have to design and implement all of the individual page variations at once.
However, if you have a number of variations and combinations to test, multivariable testing can save you much time and aggravation in the long run.

2. How has multivariable testing helped companies improve their marketing?

In this section, we will look at two case studies of companies who have successfully used multivariate testing to improve their marketing.

The first company, JoAnn.com, is a website serving millions of arts and crafts enthusiasts. The company used Offermatica to set up multivariate tests in a number of site areas. After one round of testing, they registered the following improvements:

JoAnn.com Multivariate Tests
Metric	Improvement
Average Order	137%
Revenue Per Visitor	209%
Sewing Machine Conversions	30%

What You Need To UNDERSTAND: This company was very pleased with the results of its multivariate testing, including an overall revenue per visitor improvement of 209%.

But the most memorable lesson was this: the offer that the marketing team thought would be the least viable (“buy two sewing machines and save 10 percent”) actually generated the highest return. “People were pulling their friends together and we sold enough … machines to outperform single purchases,” said Linsly Donnelly, JoAnn.com’s chief operating officer.

KEY POINT: Even the smartest marketers are often proven wrong by testing. Intuition is no substitute for well-designed experiments.

The second case study we looked at was Monster.com, a leading employment website.

In one of Monster.com’s multivariate tests (with Offermatica), they optimized their “jobs” page by testing four page elements, each with three (or two) values. These page elements were:

Sales Copy (Original Copy vs. Conversational Copy vs. Bulleted Copy)
Headlines (Default Headline vs. New Version 1 vs. New Version 2)
“Savings” (Savings in Percent vs. Savings in Dollars vs. No Savings Listed)
Savings Calculator (Present vs. Not Present)

The default page consisted of the original copy and layout, no displayed savings amount, and no savings calculator:

http://meclabs.com/monster_jobs_default.gif

The winning page included new copy, a stronger headline, savings in dollars from buying in bulk, and a savings calculator:

http://meclabs.com/monster_jobs_winner.gif

KEY POINT: The optimized page resulted in an 11.6% increase in performance.

3. What is the Taguchi Method and when should it be used?

Consider a hypothetical multivariate test that has five variables with three values each. There are 243 possible composite pages built from these elements. It may take 100 or more conversions to each of these pages to generate the most trustworthy results. If your conversion rate averages 5%, it would take 486,000 visitors to generate accurate results under these conditions (*1).

The following table illustrates the exponential increase in number of composite pages as variables and values increase:

Multivariate Test
	2 Values Each	3 Values Each	4 Values Each
1 Variable	2 pages	3 pages	4 pages
2 Variables	4 pages	9 pages	16 pages
3 Variables	8 pages	27 pages	64 pages
4 Variables	16 pages	81 pages	256 pages
5 Variables	32 pages	243 pages	1024 pages
6 Variables	64 pages	729 pages	4096 pages
7 Variables	128 pages	2187 pages	16,384 pages
8 Variables	256 pages	6561 pages	65,536 pages

What You Need to UNDERSTAND: As the number of variables and values increase, the number of composite pages increases at an exponential rate that often defies simple intuitive marketing design.

The Taguchi Method uses “fractional factorial testing” to reduce the number of variations necessary to determine the variables and values with the greatest impact. It was originally implemented 50 years ago and has been used successfully to test automobile and other product manufacturing. More recently, companies have begun to apply the Taguchi Method to direct marketing and the Internet. The method dictates exact combinations of page elements that allow a marketer to determine accurate estimations of the most important variables on the page, and the best values for those variables. The length of the test cycle and the number of visitors required using the Taguchi Method is surprisingly small.

For more on the Taguchi Method, see:
http://en.wikipedia.org/wiki/Genichi_Taguchi

KEY POINT: Several multivariate service providers feature Taguchi-enabled testing, which significantly reduces the amount of traffic necessary to create meaningful test results.

4. What are some of the key insights we have learned through A/B split and multivariate testing?

A/B split testing can be better for basic comparative testing, while multivariate testing is better for larger scale optimization. If it is cost-effective for your business, you will have the capability for both and will choose wisely between them depending on your near-term goals. Resist the urge to create large, complex tests if you are just getting started in comparative testing.
Although small scale micro-tests can often suggest a superior value for a given variable, the most accurate results require adequate time for a valid sample to accumulate. While the Taguchi Method can be quite effective, results containing any ambiguity should be tested for a longer period before being implemented site-wide.
Seasonal trends can affect the outcome of an A/B split or multivariate test if the traffic during that season is markedly different than normal traffic. Likewise, sources of traffic can affect the outcome of testing; if an influx of traffic from an unusual source comes in during an A/B micro-test, it can skew results. This further supports longer testing periods.
Prepare to have your intuitive expectations proven wrong. Use your intuition and experience to design good tests, but in a well-designed test with sufficient data, the numbers do not lie.
Don’t stop refining your marketing after just one test. Continue to optimize over the lifetime of your site. A number of small improvements over time will result in a large improvement and a highly optimized site. See our recent brief on “The Compounding Effect” to read more about this. In addition, the expectations of your customers and the Internet community as a whole may change over time; ongoing testing is the best way to keep your finger on the pulse of your customers.
Regarding the number of page views or visitors needed to conduct a reliable test, this can vary widely, and depends primarily the conversion rate of the page itself. A free offer page may convert at 25% or higher, while a product sales page may convert lower than 1%.

A good rule of thumb is that you must have AT LEAST 10 conversions for each composite page in a multivariate test. A safer figure is to generate 30-50 conversions for each combination.

Keeping the above points in mind should help you get the most out of A/B and multivariate testing. Finally, we have listed a number of vendors of multivariate testing platforms in the “Literature Review” section, below.

As our future experiments reveal what really works, we will continue to share our findings. If you have any suggestions for future topics, please let us know.

(*1) To calculate the number of visitors needed to generate reliable multivariate testing, simply multiply the number of composite pages (see table above) by the historical average number of visitors required to create one conversion and by the number of conversions to create statistically significant results (typically 50-200).

V = P * C * S

V = Total Visitors
P = Composite Pages
C = Number of Visitors to Create One Conversion (average)
S = Number of Conversions to Create Statistically Significant Results

RELATED MEC REPORTS:

As part of our research on this topic, we have prepared a review of the best Internet resources on this topic.

Rating System

These sites were rated for usefulness and clarity, but alas, the rating is purely subjective.

* = Decent ** = Good *** = Excellent **** = Indispensable

Articles:

Software/Services:

Credits:

Editor — Flint McGlaughlin

Writers — Brian Alt
Nick Usborne

Contributors — Jalali Hartman
Jimmy Ellis
Scott Miller
Matthew Roche

HTML Designer — Cliff Rainer