The future of analytics isn’t just about bigger models; it’s about building smarter, more interoperable data systems. Wes McKinney, Principal Architect at Posit PBC, Chief Scientist at Voltron Data and a General Partner at Composed Ventures, joins us to explore how the modern data stack is evolving and what that evolution means for analytics. Wes reflects on his journey building pandas and Apache Arrow, sharing how open-source ecosystems grow, transform and shape the way organizations work with data today. He also highlights the rising importance of semantic layers, agentic workflows and defensive coding practices as teams embrace AI-driven development.
Key Takeaways:
00:00 Introduction.
02:32 Wes didn’t expect pandas to drive AI, but he recognized Python’s unrealized potential.
05:09 A lucky convergence helped Python’s tools snowball into the AI standard.
10:40 Early big data focused on essentials, not the interoperable stacks we rely on today.
15:44 The composable data stack grew through bottom-up, grassroots open-source momentum.
21:56 Many “data science” roles ultimately became business intelligence and dashboard work.
25:24 Complex statistical work still depends on human judgment, not fully autonomous agents.
30:27 Frontier models retrieve table data reliably, while smaller models fail dramatically.
35:16 Better models and coding agents shifted Wes from an AI skeptic to an adopter.
40:07 AI-driven code demands stronger testing and review to avoid costly failures.
45:14 An AI-built finance project ballooned, revealing how agents inflate codebases.
Resources Mentioned:
Wes McKinney
https://www.linkedin.com/in/wesmckinn/
Posit PBC | LinkedIn
https://www.linkedin.com/company/posit-software/
Posit PBC | Website
https://posit.co/
Voltron Data | LinkedIn
https://www.linkedin.com/company/voltrondata/
Voltron Data | Website
https://voltrondata.com/
Composed Ventures | LinkedIn
https://www.linkedin.com/company/composedvc/
Composed Ventures | Website
https://composed.vc/
pandas
https://pandas.pydata.org/
Apache Arrow
https://arrow.apache.org/
DuckDB
https://duckdb.org/
DataFusion
https://datafusion.apache.org/
Jupyter Notebook
https://jupyter.org/
Parquet
https://parquet.apache.org/
Iceberg
https://iceberg.apache.org/
Delta Lake
https://delta.io/
Thanks for listening to the “Data Masters Podcast.” If you enjoyed this episode, be sure to subscribe so you never miss our latest discussions and insights into the ever-changing world of data.