The Cloudcast

Data Lakehouses and Apache Hudi


Listen Later

Kyle Weller (@KyleJWeller, Head of Product @onehousehq) talks about the latest trends in  OSS Data Lakes, Data Warehouses, and the evolution to “Data Lakehouses” with Apache Hudi

SHOW: 694

CLOUD NEWS OF THE WEEK - http://bit.ly/cloudcast-cnotw

NEW TO CLOUD? CHECK OUT - "CLOUDCAST BASICS"

SHOW SPONSORS:

  • Datadog Synthetic Monitoring: Frontend and Backend Modern Monitoring
  • Ensure frontend issues don’t impair user experience by detecting user-facing issues with API and browser tests with a free 14 day Datadog trial. Listeners of The Cloudcast will also receive a free Datadog T-shirt. 
  • Solve your IAM mess with Strata's Identity Orchestration platform
  • Have an identity challenge you thought was too big, too complicated, or too expensive to fix? Let us solve it for you! Visit strata.io/cloudcast to share your toughest IAM challenge and receive a set of AirPods Pro
  • How to Fix the Internet (A new podcast from the EFF)

SHOW NOTES:

  • Onehouse (homepage)
  • Onehouse raises $25M Series A funding
  • Apache Hudi (homepage)
  • Delta Lake (homepage)
  • Apache Iceberg (homepage)
  • ​​Apache Hudi vs Delta Lake vs Apache Iceberg - Lakehouse Feature Comparison

Topic 1 - Welcome to the show. Tell us a little bit of your background, and where you focus your efforts at Onehouse?

Topic 2 - Your focus is on an emerging open source project, Apache Hudi. Before we dive into the project and technologies, we’re always interested in the background of what drove the creation of new projects. What problems existed before Hudi? 

Topic 3 - Let’s dive into Hudi. Data lakes, Delta Lakes, Lake houses, Icebergs. What is going on with all these water metaphors?  

Topic 4 - Hudi is focused on streaming data lakes. What are some of the things (types of applications) that need a streaming data lake? Where do transactions come into play? Where do data warehouse capabilities come into play?

Topic 5 - Stitching together open source projects and platforms can be complicated. How does the Onehouse platform simplify all of this for either data scientists or platform teams?

Topic 6 - What are some examples of how companies are using Onehouse and Hudi today? 

FEEDBACK?

  • Email: show at the cloudcast dot net
  • Twitter: @thecloudcastnet
...more
View all episodesView all episodes
Download on the App Store

The CloudcastBy Massive Studios

  • 4.6
  • 4.6
  • 4.6
  • 4.6
  • 4.6

4.6

147 ratings


More shows like The Cloudcast

View all
The Changelog: Software Development, Open Source by Changelog Media

The Changelog: Software Development, Open Source

289 Listeners

The a16z Show by Andreessen Horowitz

The a16z Show

1,092 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

622 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

585 Listeners

Soft Skills Engineering by Jamison Dance and Dave Smith

Soft Skills Engineering

289 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

303 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

334 Listeners

Tech Brew Ride Home by Morning Brew

Tech Brew Ride Home

960 Listeners

Practical AI by Practical AI LLC

Practical AI

207 Listeners

AWS Podcast by Amazon Web Services

AWS Podcast

202 Listeners

The Real Python Podcast by Real Python

The Real Python Podcast

142 Listeners

Big Technology Podcast by Alex Kantrowitz

Big Technology Podcast

498 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

228 Listeners

AI + a16z by a16z

AI + a16z

36 Listeners

The Pragmatic Engineer by Gergely Orosz

The Pragmatic Engineer

64 Listeners