The Hedgineer Podcast

DuckDB, Apache Arrow, & the Future of Data Engineering w/ Rusty Conover | S2E3


Listen Later

In this episode of The Hedgineer Podcast, host Michael Watson is joined by special guest Rusty Conover, the world's most prolific DuckDB extension builder, for a masterclass on building the next generation of real-time, large-scale data systems.


Rusty, who has an extensive career in data engineering, including at multi-manager hedge funds, pulls back the curtain on what makes DuckDB so revolutionary for developers and data engineers. They explore how its blazingly fast, in-process, C++-based architecture is challenging the big data status quo. The conversation provides a deep dive into the powerful ecosystem growing around DuckDB, from the Apache Arrow columnar format to the evolving landscape of open table formats like Iceberg, Delta Lake, and the new DuckLake.


Join them for a detailed discussion on the nitty-gritty of modern data infrastructure, whether you're building enterprise data platforms or looking for the most efficient tools for your analytics workload.


In this episode, you will learn about:


The DuckDB Revolution: What makes this "blazingly fast" in-process database a game-changer that can simplify and replace entire ETL stacks.

A Tour of DuckDB Extensions: A look inside some of the 15 extensions Rusty has built, from Airport for integrating with Apache Arrow, to Crypto, ShellFS, and TextPlot.


Diving into Apache Arrow: An explanation of the columnar in-memory data format, zero-copy operations, and the Arrow Flight RPC mechanism for efficiently moving data.


The Battle of Open Table Formats: A comparison of Iceberg, Delta Lake, and the new database-centric approach of DuckLake.


DuckDB vs. The World: How DuckDB stacks up against KDB for financial data, ClickHouse for analytics, and its role alongside large-scale compute engines like Apache Spark.


Parquet Deep Dive: The key differences between Parquet V1 and V2 and the importance of modern compression strategies and encodings.


The Future of DuckDB: A sneak peek at powerful upcoming features like time travel and the MERGE INTO statement for simplifying change data capture (CDC) pipelines.


Hosted by Michael Watson, The Hedgineer Podcast dives into AI technology and data in the hedge fund, asset management, and prop trading space.


Follow The Hedgineer Podcast:

YouTube: (https://www.youtube.com/@hedgineer)

LinkedIn: (https://www.linkedin.com/company/90976838)

Twitter: (https://x.com/hedgineering)

Instagram: (https://www.instagram.com/hedgineer/)


Don't forget to like, subscribe, and hit the notification bell to stay updated on our latest episodes!


Hedgineer.io

Hosted on Acast. See acast.com/privacy for more information.

...more
View all episodesView all episodes
Download on the App Store

The Hedgineer PodcastBy Michael Watson

  • 5
  • 5
  • 5
  • 5
  • 5

5

4 ratings


More shows like The Hedgineer Podcast

View all
Exchanges by Goldman Sachs

Exchanges

978 Listeners

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch by Harry Stebbings

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch

531 Listeners

Odd Lots by Bloomberg

Odd Lots

1,866 Listeners

Fintech Insider Podcast by 11:FS by 11:FS

Fintech Insider Podcast by 11:FS

181 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

302 Listeners

Capital Allocators – Inside the Institutional Investment Industry by Ted Seides – Allocator and Asset Management Expert

Capital Allocators – Inside the Institutional Investment Industry

793 Listeners

Flirting with Models by Corey Hoffstein

Flirting with Models

234 Listeners

Alpha Exchange by Dean Curnutt

Alpha Exchange

82 Listeners

Making Sense by J.P. Morgan

Making Sense

59 Listeners

Excess Returns by Excess Returns

Excess Returns

81 Listeners

All-In with Chamath, Jason, Sacks  Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks Friedberg

9,830 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

488 Listeners

Signals and Threads by Jane Street

Signals and Threads

72 Listeners

In Good Company with Nicolai Tangen by Norges Bank Investment Management

In Good Company with Nicolai Tangen

181 Listeners

Other People's Money with Max Wiethe by Max Wiethe

Other People's Money with Max Wiethe

15 Listeners