Chuck Yates Got A Job

Drilling Down on Data with Bobby Neelon & John Kalfayan (Collide)


Listen Later

Bobby Neelon and John Kalfayan from Collide break down the messy reality of getting data ready for RAG, why PDFs are dumpster fires for unstructured data, how extraction changes depending on whether you're dealing with drilling surveys or handwritten logs, and why chunking strategy matters more than people think. They walk through embeddings, vector databases, MCP servers for pulling external data without leaking internal info, and why good metadata and folder structure actually make AI deployments way easier. Plus the hard truth that AI isn't a silver bullet for bad data management and the crap-in-crap-out problem is getting worse because now it can hallucinate on top of the crap.

Click here to watch a video of this episode.

Join the conversation shaping the future of energy.
Collide is the community where oil & gas professionals connect, share insights, and solve real-world problems together. No noise. No fluff. Just the discussions that move our industry forward.
Apply today at collide.io


Click here to view the episode transcript.

0:00 - Introductions and RAG overview
3:15 - Document identification and classification challenges
8:40 - Extracting data from unstructured PDFs
13:25 - Real world examples of messy data formats
18:50 - OCR paired with vision models for extraction
22:10 - Chunking strategies and when to use each
26:35 - Embeddings and vector databases explained
30:20 - MCP servers and external data integration
35:45 - Getting data AI-ready with metadata and structure
40:30 - Text-to-SQL approaches and database access
44:15 - Handling duplicates and M&A data integration
48:50 - How AI learns context over time
53:40 - Why traditional data management matters more than ever

https://twitter.com/collide_io

https://www.tiktok.com/@collide.io

https://www.facebook.com/collide.io

https://www.instagram.com/collide.io

https://www.youtube.com/@collide_io

https://bsky.app/profile/digitalwildcatters.bsky.social

https://www.linkedin.com/company/collide-digital-wildcatters

...more
View all episodesView all episodes
Download on the App Store

Chuck Yates Got A JobBy collide.

  • 4.8
  • 4.8
  • 4.8
  • 4.8
  • 4.8

4.8

112 ratings


More shows like Chuck Yates Got A Job

View all
Freakonomics Radio by Freakonomics Radio + Stitcher

Freakonomics Radio

32,265 Listeners

The Joe Rogan Experience by Joe Rogan

The Joe Rogan Experience

229,657 Listeners

Wild at Heart by John Eldredge

Wild at Heart

1,713 Listeners

Macro Voices by Hedge Fund Manager Erik Townsend

Macro Voices

3,055 Listeners

The Rich Roll Podcast by Rich Roll

The Rich Roll Podcast

11,903 Listeners

Foreign Policy Live by Foreign Policy

Foreign Policy Live

600 Listeners

Oil and Gas This Week by Mark LaCour & Paige Wilson

Oil and Gas This Week

539 Listeners

Founders by David Senra

Founders

2,197 Listeners

POWERS by Chris Powers

POWERS

503 Listeners

The Shawn Ryan Show by Shawn Ryan

The Shawn Ryan Show

46,391 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

10,213 Listeners

The Minerals and Royalties Podcast by Minerals & Royalties Authority LLC

The Minerals and Royalties Podcast

30 Listeners

C.O.B. Tuesday by Veriten

C.O.B. Tuesday

34 Listeners

Big Digital Energy by collide.

Big Digital Energy

14 Listeners

The Tucker Carlson Show by Tucker Carlson Network

The Tucker Carlson Show

17,106 Listeners