Google Cloud Platform Podcast

BigLake with Gaurav Saxena and Justin Levandoski


Listen Later

Stephanie Wong and Debi Cabrera are learning all about BigLake from guests Gaurav Saxena and Justin Levandoski of the BigQuery team.

BigLake offers unified data management from both data warehouses and data lakes. What exactly is the difference between a data warehouse and a data lake? Justin explains what a data lake is, how they came to be, and the benefits. Each data option has its cons too, like the limitations of data lakes for enterprise use. Enter BigLake built on BigQuery, which helps enterprise clients manage and analyze their data from both data warehouses and data lakes. The best features of BigQuery are now available for Google Cloud Storage and across multi-cloud solutions.

Guarav describes BigLake behind the scenes and how the principles of BigQuery’s data management can now be used for open file formats in BigLake. It’s BigQuery for more data formats, Justin explains. BigLake solves many data problems quickly with a special emphasis on improving security. Our guests talk specifically about clients who gain the most from using BigLake, especially those looking to analyze distributed data and those who need easy and fast security and compliance solutions. With tightened security, BigLake offers access delegation and secure APIs that work over object storage. We hear about the user experience and how easy it is to get started, especially for customers already familiar with and using other GCP products.

Google’s advocacy of open source projects means many clients are coming in with workloads built with open source software. BigLake supports multi-cloud projects so that tables can be built on top of any data system. No matter the format of your data, you can run analytics with BigLake. We talk more about the security features of BigLake and how easy it is to unify data warehouses and data lakes with optimal data security.

The customers have helped shape BigLake, and Gaurav describes how these clients are using this data software. We hear about integration with BigQuery Omni and Dataplex and how BigLake is different. In the future, Google will continue to make simple, effective solutions for data management and analytics, building further off of BigQuery.

Gaurav Saxena

Gaurav Saxena is a product management lead at Google BigQuery. He has 12+ years of experience building products at the intersection of cloud, data and AI. Before Google, Gaurav led product management at Microsoft Azure and Amazon Web Services for some of the most widely used cloud offerings in storage and data.

Justin Levandoski

Justin is a tech lead/manager in BigQuery leading BigLake and other projects pushing the frontier of BigQuery. Prior to Google, just worked on Amazon Aurora and was part of the Database research group at Microsoft Research.

Cool things of the week
  • Your ultimate guide to Speech on Google Cloud blog
  • Announcing the Climate Innovation Challenge—grants to support cutting-edge earth research blog
Interview
  • BigLake site
  • BigQuery site
  • Cloud Storage site
  • Spark site
  • Apache Ranger site
  • BigQuery Omni docs
  • Apache Iceberg site
  • Delta Lake site
  • Presto site
  • TensorFlow site
  • Dataplex site
What’s something cool you’re working on?

Debi is working on a series about automatic DLP. Cloud Data Loss Prevention is now automatic and allows you to scan data across your whole org with the click of one button!

Hosts

Stephanie Wong and Debi Cabrera

...more
View all episodesView all episodes
Download on the App Store

Google Cloud Platform PodcastBy Google Cloud Platform

  • 4.6
  • 4.6
  • 4.6
  • 4.6
  • 4.6

4.6

101 ratings


More shows like Google Cloud Platform Podcast

View all
The Vergecast by The Verge

The Vergecast

3,664 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

624 Listeners

Acquired by Ben Gilbert and David Rosenthal

Acquired

4,196 Listeners

AWS Podcast by Amazon Web Services

AWS Podcast

201 Listeners

The Daily by The New York Times

The Daily

110,802 Listeners

Kubernetes Podcast from Google by Abdel Sghiouar, Kaslin Fields

Kubernetes Podcast from Google

184 Listeners

Talks at Google by Talks at Google

Talks at Google

118 Listeners

The Journal. by The Wall Street Journal & Spotify Studios

The Journal.

5,953 Listeners

Google DeepMind: The Podcast by Hannah Fry

Google DeepMind: The Podcast

197 Listeners

Hard Fork by The New York Times

Hard Fork

5,437 Listeners

Huberman Lab by Scicomm Media

Huberman Lab

28,554 Listeners

Cloud Security Podcast by Google by Anton Chuvakin

Cloud Security Podcast by Google

38 Listeners

The Weekly Show with Jon Stewart by Comedy Central

The Weekly Show with Jon Stewart

10,324 Listeners

Google Cloud Basics by Jason Meers

Google Cloud Basics

0 Listeners

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

499 Listeners