
Sign up to save your podcasts
Or


From BERT to Agents: Building Production AI at Booking.com
After seven years building ML systems that serve millions of travelers, Georgios Chouliaras has watched the field transform from hand-coded chatbot rules to autonomous agents—and he's learned which shiny new approaches actually work in production.
Georgios Chouliaras, Senior Machine Learning Scientist at Booking.com, joins me to share hard-won insights from deploying AI at scale. His journey spans customer service chatbots that broke during COVID (because the training data didn't include "global pandemic"), company-wide ML best practices, and now the cutting edge of agent development.
In this episode, we explore:
Georgios challenges some popular assumptions: the REACT pattern everyone implements? He hasn't seen it consistently outperform simpler approaches. Massive parameter counts? Architecture and training data now matter more. His underhyped pick: straightforward function calling often beats elaborate agent architectures.
The core takeaway: Use the simplest tool that solves your problem. Production users don't care if you're running a sophisticated multi-agent system, they care if it works.
Connect with Georgios:
Connect with me:
Check out our awesome sponsor, dearmachines.com, QA AI Agents for Continuous Testing.
By Christian BarraFrom BERT to Agents: Building Production AI at Booking.com
After seven years building ML systems that serve millions of travelers, Georgios Chouliaras has watched the field transform from hand-coded chatbot rules to autonomous agents—and he's learned which shiny new approaches actually work in production.
Georgios Chouliaras, Senior Machine Learning Scientist at Booking.com, joins me to share hard-won insights from deploying AI at scale. His journey spans customer service chatbots that broke during COVID (because the training data didn't include "global pandemic"), company-wide ML best practices, and now the cutting edge of agent development.
In this episode, we explore:
Georgios challenges some popular assumptions: the REACT pattern everyone implements? He hasn't seen it consistently outperform simpler approaches. Massive parameter counts? Architecture and training data now matter more. His underhyped pick: straightforward function calling often beats elaborate agent architectures.
The core takeaway: Use the simplest tool that solves your problem. Production users don't care if you're running a sophisticated multi-agent system, they care if it works.
Connect with Georgios:
Connect with me:
Check out our awesome sponsor, dearmachines.com, QA AI Agents for Continuous Testing.