
Managing a data warehouse can be challenging, especially when trying to maintain a common set of patterns. Dataform is a platform that helps you apply engineering principles to your data transformations and table definitions, including unit testing SQL scripts, defining repeatable pipelines, and adding metadata to your warehouse to improve your team’s communication. In this episode, Lewis Hemens, CTO and co-founder of Dataform, joins the show to explain his motivation for creating the platform and company, how it works under the covers, and how you can start using it today to get your data warehouse under control.
Introduction
How did you get involved in the area of data management?
Can you start by explaining what Dataform is and the origin story for the platform and company?
Can you talk through the workflow for someone using Dataform and highlight the main features that it provides?
What are some of the challenges and mistakes that are common among engineers and analysts with regard to versioning and evolving schemas and the accompanying data?
How do CI/CD and change management manifest in the context of data warehouse management?
How is the Dataform SDK itself implemented and how has it evolved since you first began working on it?
What was your selection process for an embedded runtime and how did you decide on JavaScript?
Which database engines do you support and how do you reduce the maintenance burden for supporting different dialects and capabilities?
What is involved in adding support for a new backend?
When is Dataform the wrong choice?
What do you have planned for the future of Dataform?
The intro and outro music is from The Hug by The Freak Fandango Orchestra / CC BY-SA
Support Data Engineering Podcast