The Experimentation Edge

How UPS's Experimentation Team Generated Half a Billion From 80+ Apps With A/B Testing


Listen Later

This episode of The Experimentation Edge shows how UPS's A/B testing program drove $500M+ in incremental revenue across 80+ customer-facing applications. Dave Massey — head of the J.E.D.I. team (Journey Experience and Design Innovation) — walks through the first test that proved UX could move revenue, how he defended counterintuitive results to skeptical execs, and how a small experimentation team can override opinion with data at enterprise scale.

Summary
Dave Massey walked into UPS in 2016 and immediately got pulled into a meeting about AB testing tools. By the end of the day, he owned the platform—and the problem: UPS hadn't run a single meaningful experiment. Three years later, senior leadership gave him a hard number to hit. Prove UX could move revenue, or the pilot dies. His first test—removing navigation from the checkout flow—delivered $35 million in incremental revenue. Senior leaders didn't believe it. They made him defend the results upside down and sideways. When the dust settled, the data held. Today, Massey's team has driven over half a billion dollars in incremental revenue by treating UPS.com like the e-commerce business it actually is.

Massey's approach is simple: test everything, especially what senior leaders think will work. His team, Journey Experience and Design Innovation (nicknamed J.E.D.I.), has built a reputation for saying no with data, not opinion. When a business unit demanded required recipient emails to capture customer data, J.E.D.I. ran the test in 24 hours and killed it. Conversion tanked. Two years later, the international team asked for the same feature—but framed it as a customs solution. That test passed. Same feature, different reason, different outcome. That's the edge Massey's team delivers: rigorous hypothesis design, a UX research team embedded in the experimentation workflow, and zero tolerance for untested ideas.


Timestamps
03:09 Dave's first day at UPS: inheriting an AB testing tool with no program 
05:59 Senior leadership's ultimatum: prove UX ROI or kill the pilot 
08:38 First test result: $35M from removing navigation in checkout 
09:48 Defending the numbers: how Massey's team survived scrutiny 
11:07 Why a data-driven engineering culture made experimentation inevitable 
16:12 Team size: 80 people supporting almost 80 customer-facing applications 
19:08 The 24-hour test: when required email fields killed conversion 
22:28 Why Massey embeds UX research inside the experimentation team 
24:41 AI at UPS: treating it as a tool, not a replacement 


Takeaways
- Massey's first test removed navigation from UPS's shipping checkout flow and delivered $35 million in incremental revenue—proving e-commerce best practices apply even when customers think "this is just a tool, not e-commerce." 
- J.E.D.I.'s win rate stays high because UX research and experimentation teams operate under the same leader, giving the program both behavioral metrics and voice-of-customer insight before tests ever launch. 
- When senior leaders push ideas, Massey's team tests them instead of arguing—then delivers results that either validate the idea or identify three better alternatives the data actually supports. 
- The same feature (required recipient email) failed for customer data capture but passed for international customs—proof that framing and customer benefit matter more than the feature itself. 
- UPS runs everything centrally now, but the real win is that demand for testing has decentralized—business units across the company now come to J.E.D.I. asking to test their ideas.


Connect with the guest
LinkedIn: https://www.linkedin.com/in/masseycreates/
Learn more about UPS: https://www.ups.com


Sponsor
Growthbook helps you ship features with confidence by bringing experimentation and feature flagging into one open-source platform. No more guessing whether that new checkout flow actually moved the needle, waiting weeks for data team bandwidth, or flying blind on rollouts.

Growthbook gives you a single place to run A/B tests, manage feature flags, and analyze results against your existing data warehouse.

With powerful stats built in, it takes the complexity out of experimentation, helps you catch regressions before they hit every user, and makes it easy to test ideas that keep your product improving and your metrics moving in the right direction.

See a demo at https://www.growthbook.io/

Topics: A/B testing, experimentation, conversion rate optimization, feature flags, UX research, e-commerce experimentation, statistical significance, experimentation team building, growth experimentation, sequential testing.

  • (03:09) - Dave's first day at UPS: inheriting an AB testing tool with no program
  • (05:59) - Senior leadership's ultimatum: prove UX ROI or kill the pilot
  • (08:38) - First test result: $35M from removing navigation in checkout
  • (09:48) - Defending the numbers: how Massey's team survived scrutiny
  • (11:07) - Why a data-driven engineering culture made experimentation inevitable
  • (16:12) - Team size: 80 people supporting almost 80 customer-facing applications
  • (19:08) - The 24-hour test: when required email fields killed conversion
  • (22:28) - Why Massey embeds UX research inside the experimentation team
  • (24:41) - AI at UPS: treating it as a tool, not a replacement
  • ...more
    View all episodesView all episodes
    Download on the App Store

    The Experimentation EdgeBy Growthbook