The Data Engineering Show

The Geo-Data Problem Nobody Talks About And How Voi Solved It ft. Magnus Dahlbäck


Listen Later


In this episode of The Data Engineering Show, host Benjamin sits down with Magnus Dahlbäck, Senior Director of Data and Platform at Voi, to explore how a rapidly scaling European e-scooter company transformed its data infrastructure, adopted a metrics-first approach to analytics, and is now leveraging AI to solve real-time operational challenges across 150 cities and 150,000 vehicles.


What You'll Learn:
  • How to escape the "dashboard chaos" trap by adopting a metrics-first architecture with a semantic layer, reducing confusion from hundreds of conflicting dashboards to a single source of truth across the organization

  • Why replacing Tableau with Steep (a metrics-centric BI tool) unlocked self-service analytics for non-technical users, empowering teams to answer their own data questions without waiting months for custom dashboard builds

  • The real-world cost optimization challenge of managing Snowflake expenses that scale 1:1 with ride volume—and why data leaders must constantly rethink architecture to control FinOps in high-growth environments

  • How to architect for IoT at scale: processing billions of daily events from connected vehicles using micro-batch pipelines (5-minute intervals) while keeping real-time machine learning inference separate through cross-functional product teams

  • The decision framework for choosing traditional ML vs. LLMs: use traditional methods for accuracy-critical workloads (supply-demand forecasting for vehicle positioning) and LLMs for pattern discovery where 100% precision isn't required (analyzing rider feedback)

  •  How to build proactive customer support powered by data and AI: leverage sensor data and ride telemetry to detect poor user experiences and reach out before customers complain, rather than waiting for refund requests

If you enjoyed this episode, make sure to subscribe, rate, and review it on Apple Podcasts, Spotify, and YouTube Podcasts. Instructions on how to do this are here: https://www.fame.so/follow-rate-review.


About the Guest(s)


Magnus Dahlbäck is Senior Director of Data and Platform at Voi, a leading European micro-mobility company, where he oversees the data analytics team, platform infrastructure, and AI initiatives. With over four years at Voi, Magnus has scaled the data organization from three people to a comprehensive team of platform engineers, data analysts, and data scientists while architecting a modern data stack centered on metrics-first analytics and semantic layers. In this episode, Magnus shares insights on building scalable data platforms for IoT-heavy, real-world products, including strategies for managing billions of daily events, implementing self-service analytics, and balancing traditional machine learning with large language models. His work at Voi—where the data platform powers both internal analytics and customer-facing product features—demonstrates how thoughtful data architecture drives measurable business impact, making this conversation essential for data leaders navigating AI integration and data democratization.

Quotes


"There are hundreds of dashboards, and I'm looking for some data, some metrics, and there are 10 dashboards that contain that, and they all show different numbers." - Magnus
"Metrics is a very natural way of interacting with data rather than dashboards that are named something randomly." - Magnus
"We're basically throwing man hours on slicing and dicing data, trying to find patterns, anomalies that we often miss, right, because it just takes too much time." - Magnus
"The way we work with data hasn't really changed that much in the last ten, twenty years to be completely fair, but now we're seeing new technologies, new approaches to it." - Magnus
"It comes down to the use case. What's the accuracy we need?" - Magnus
"We can see from the sensor data, from the IoT, from other data points during your ride if it was a good or bad experience, so why don't we reach out to you?" - Magnus
"Building software around physical objects is really cool when you're a techie guy like me, working at a company where it's a combination of software, B to C, hardware, IoT." - Magnus
"The biggest dataset that we process is IoT data—billions of events every day, basically, that we process." - Magnus
"We have cross functional teams where all the product teams have everything from back end to front end to data people, designers, and so on." - Magnus
"Metrics is kind of the business language that we use—we talk about rides, average ride charge, active vehicles—so metrics is a very natural way of interacting with data." - Magnus

Resources
 

Connect on LinkedIn:

  • Magnus Dahlbäck - https://www.linkedin.com/in/magnusdahlback/
  • Benjamin Wagner - https://www.linkedin.com/in/wagjamin/


Websites:
  • Guest's Company: Voi Technologies Website (voi.com)
  • Host's Company: Firebolt Website (firebolt.io)

Tools & Platforms:
  • Snowflake – Data warehouse for analytics and machine learning workloads
  • DBT (Data Build Tool) – Data transformation and modeling
  • Apache Airflow – Workflow orchestration
  • Steep – Metrics-first BI tool with semantic layer (Swedish startup)
  • GCP Vertex AI – Machine learning platform for model training and deployment

The Data Engineering Show is brought to you by firebolt.io and handcrafted by our friends over at: fame.so

Previous guests include: Joseph Machado of Linkedin, Metthew Weingarten of Disney, Joe Reis and Matt Housely, authors of The Fundamentals of Data Engineering, Zach Wilson of Eczachly Inc, Megan Lieu of Deepnote, Erik Heintare of Bolt, Lior Solomon of Vimeo, Krishna Naidu of Canva, Mike Cohen of Substack, Jens Larsson of Ark, Gunnar Tangring of Klarna, Yoav Shmaria of Similarweb and Xiaoxu Gao of Adyen.

Check out our three most downloaded episodes:
  • Zach Wilson on What Makes a Great Data Engineer
  • Joe Reis and Matt Housley on The Fundamentals of Data Engineering
  • Bill Inmon, The Godfather of Data Warehousing
...more
View all episodesView all episodes
Download on the App Store

The Data Engineering ShowBy The Firebolt Data Bros

  • 3.8
  • 3.8
  • 3.8
  • 3.8
  • 3.8

3.8

8 ratings


More shows like The Data Engineering Show

View all
Planet Money by NPR

Planet Money

30,690 Listeners

Hidden Brain by Hidden Brain, Shankar Vedantam

Hidden Brain

43,529 Listeners

Data Engineering Podcast by Tobias Macey

Data Engineering Podcast

146 Listeners

DataFramed by DataCamp

DataFramed

267 Listeners

Tech Brew Ride Home by Morning Brew

Tech Brew Ride Home

971 Listeners

Practical AI by Practical AI LLC

Practical AI

216 Listeners

The Journal. by The Wall Street Journal & Spotify Studios

The Journal.

6,127 Listeners

My First Million by Hubspot Media

My First Million

2,667 Listeners

The Prof G Pod with Scott Galloway by Vox Media Podcast Network

The Prof G Pod with Scott Galloway

5,673 Listeners

The Real Python Podcast by Real Python

The Real Python Podcast

140 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

10,213 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

555 Listeners

The Analytics Engineering Podcast by dbt Labs, Inc.

The Analytics Engineering Podcast

29 Listeners

HBR On Leadership by Harvard Business Review

HBR On Leadership

170 Listeners

Training Data by Sequoia Capital

Training Data

40 Listeners