In this episode, Neil Strange and Alex Higgs tackle one of the most misunderstood steps in building a data warehouse: landing the data.
From internal systems to APIs and SaaS tools, this isnβt just a technical jobβthereβs politics, governance, and reliability to think about.
They cover:
Β π Push vs Pull
Β π Access challenges
Β π§° Tools (Fivetran, Airbyte, etc.)
Β π§ͺ When to build vs buy
Β π§Ύ CDC and data quality risks
Β π PSAs and staging areas
Whether you're starting fresh or scaling pipelines, this will save you headaches.
π Chapters:
00:00 - Intro
00:40 - What does βlanding dataβ mean?
02:10 - Push vs Pull: Definitions and differences
05:15 - Which is more reliable in practice?
08:30 - Political barriers to pulling data
10:45 - Why access isnβt just a technical issue
13:00 - Common tools: Fivetran, Airbyte, Matillion
15:20 - When those tools fall short
17:00 - Should you build your own pipelines?
19:10 - Change Data Capture (CDC) explained
21:30 - Handling deletes, updates, and historical tracking
24:05 - Persistent Staging Areas: Why they matter
26:50 - Final thoughts and key takeaways
π Subscribe for more on data architecture, ETL, and real-world data warehousing.
For more expert insights, check out our services designed to help organizations like yours unlock the full potential of Data Vault.
π Explore our training courses to level up your data management skills: https://bit.ly/4eNkczq
π Download our free Data Vault resources: https://bit.ly/4f9AYs9
π Book a free consultation with our team: https://bit.ly/3BT6VXC