Gnarly Data Waves by Dremio

EP20 - What's New in the Apache Iceberg Project: Updates, PyIceberg, Compute Engines


Listen Later

The Apache Iceberg project has made tremendous strides, evolving on various fronts such as usage, ecosystem adoption, community growth, and capabilities. In the past few months, the project has introduced many exciting new features and performance improvements around the core library, compute engines and standalone libraries (such as PyIceberg) that makes this lakehouse technology robust & valuable for organizations. In this video of Gnarly Data Waves, we will go over some of the notable new capabilities of Apache Iceberg.

Specifically, we will discuss about:
- Version 1.2.0 release
- Features such as : Branching/Tagging, New write-distribution-mode, Change Data Capture, Catalog Migrator Tool, Delta to Iceberg migration
- PyIceberg (What’s happening in the Python library)
- Compute Engine-specific features: Dremio, Apache Spark, Flink
See all upcoming episodes: https://www.dremio.com/gnarly-data-wa...
Connect with us!
Twitter: https://bit.ly/30pcpE1
LinkedIn: https://bit.ly/2PoqsDq
Facebook: https://bit.ly/2BV881V
Community Forum: https://bit.ly/2ELXT0W
Github: https://bit.ly/3go4dcM
Blog: https://bit.ly/2DgyR9B
Questions?: https://bit.ly/30oi8tX
Website: https://bit.ly/2XmtEnN#datalakehouse #data #analytics #datawarehouse #datalake #dataengineers #dataarchitects #governance #infrastructure #dremiocloud #dremiotestdrive #openlakehouse #opendatalakehouse #gnarlydatawaves #apacheiceberg #dremioarctic #datamesh #metadata #modernization #datasharing #migration #ETL #datasilos #selfservice #compliance #dataascode #branches #optimized #automates #datamovement #clustering #metrics #filtering #partitioning #tableformat #ApacheArrow #nessie #sonar #dremiosonar #optimization #automaticdata #scalability #enterprisedata #federated #catalogmigratortool # pylceberg #apachespark #flink #changedatacapture

...more
View all episodesView all episodes
Download on the App Store

Gnarly Data Waves by DremioBy Dremio (The Open Data Lakehouse Platform)