
Sign up to save your podcasts
Or


Episode Summary:
In this episode of Engineering Choices You Have to Defend, host Nicola Onassis sits down with Roy Resh, VP of Engineering at Trax Retail, to explore a pivotal architectural decision that reshaped how large-scale computer vision systems are built and scaled in retail environments.
At Trax, Roy and his team built a computer vision platform that analyzes shelf images captured in retail stores, identifying products, pricing, and point-of-sale materials to generate a digital representation of store shelves. This enables brands to measure execution, shelf share, and product availability in near real time. But as the platform scaled across enterprise clients, complexity began to compound rapidly.
What started as a unified recognition pipeline evolved into a heavily customized system, with per-client logic for attributes like expiration dates, display detection, reporting formats, and KPI calculations. Each new customer introduced new requirements, leading to custom code per client, duplicated processing flows, and increasingly long onboarding cycles that stretched from weeks to months.
Roy explains how the system eventually reached a breaking point: onboarding delays of 30–60 days, rising operational overhead, and microservices becoming entangled with client-specific logic. In some cases, the platform even processed the same image multiple times to satisfy different customer requirements, driving up cost and complexity.
The team made a strategic decision to move away from custom implementations and toward a configurable, JSON-driven workflow architecture. Built on event-driven microservices, queues, and coordination barriers, this new system allowed engineering teams to define and version entire processing flows through configuration rather than code.
This shift enabled safer deployments, faster experimentation, and gradual rollouts per client—without affecting the entire platform. It also introduced a standardized KPI layer, reducing the need for bespoke reporting logic across customers.
Roy also discusses the importance of human-in-the-loop validation in production AI systems. In a constantly evolving retail environment, human annotators help generate training data, validate model outputs, and maintain accuracy for high-stakes enterprise use cases where precision is critical.
For engineering leaders, this episode highlights a key lesson: when every customer forces new code paths, you’re not scaling a product—you’re scaling complexity.
Key Takeaways:
Connect with Roy Resh:
Listen Now & Subscribe:
Apple Podcasts, Spotify, Amazon Music, or wherever you get your podcasts.
"Engineering Choices You Have to Defend explores the real technical decisions behind regulated software, compliance, and AI integration, helping leaders build secure, auditable, and user-friendly systems."
By Nicola OnassisEpisode Summary:
In this episode of Engineering Choices You Have to Defend, host Nicola Onassis sits down with Roy Resh, VP of Engineering at Trax Retail, to explore a pivotal architectural decision that reshaped how large-scale computer vision systems are built and scaled in retail environments.
At Trax, Roy and his team built a computer vision platform that analyzes shelf images captured in retail stores, identifying products, pricing, and point-of-sale materials to generate a digital representation of store shelves. This enables brands to measure execution, shelf share, and product availability in near real time. But as the platform scaled across enterprise clients, complexity began to compound rapidly.
What started as a unified recognition pipeline evolved into a heavily customized system, with per-client logic for attributes like expiration dates, display detection, reporting formats, and KPI calculations. Each new customer introduced new requirements, leading to custom code per client, duplicated processing flows, and increasingly long onboarding cycles that stretched from weeks to months.
Roy explains how the system eventually reached a breaking point: onboarding delays of 30–60 days, rising operational overhead, and microservices becoming entangled with client-specific logic. In some cases, the platform even processed the same image multiple times to satisfy different customer requirements, driving up cost and complexity.
The team made a strategic decision to move away from custom implementations and toward a configurable, JSON-driven workflow architecture. Built on event-driven microservices, queues, and coordination barriers, this new system allowed engineering teams to define and version entire processing flows through configuration rather than code.
This shift enabled safer deployments, faster experimentation, and gradual rollouts per client—without affecting the entire platform. It also introduced a standardized KPI layer, reducing the need for bespoke reporting logic across customers.
Roy also discusses the importance of human-in-the-loop validation in production AI systems. In a constantly evolving retail environment, human annotators help generate training data, validate model outputs, and maintain accuracy for high-stakes enterprise use cases where precision is critical.
For engineering leaders, this episode highlights a key lesson: when every customer forces new code paths, you’re not scaling a product—you’re scaling complexity.
Key Takeaways:
Connect with Roy Resh:
Listen Now & Subscribe:
Apple Podcasts, Spotify, Amazon Music, or wherever you get your podcasts.
"Engineering Choices You Have to Defend explores the real technical decisions behind regulated software, compliance, and AI integration, helping leaders build secure, auditable, and user-friendly systems."