
Sign up to save your podcasts
Or


Shreya Shankar is a PhD student at UC Berkeley in the EECS department. This episode explores how Large Language Models (LLMs) are revolutionizing the processing of unstructured enterprise data like text documents and PDFs. It introduces DocETL, a framework using a MapReduce approach with LLMs for semantic extraction, thematic analysis, and summarization at scale.
Subscribe to the Gradient Flow Newsletter 📩 https://gradientflow.substack.com/
Subscribe: Apple · Spotify · Overcast · Pocket Casts · AntennaPod · Podcast Addict · Amazon · RSS.
Detailed show notes - with links to many references - can be found on The Data Exchange web site.
By Ben Lorica4.6
3737 ratings
Shreya Shankar is a PhD student at UC Berkeley in the EECS department. This episode explores how Large Language Models (LLMs) are revolutionizing the processing of unstructured enterprise data like text documents and PDFs. It introduces DocETL, a framework using a MapReduce approach with LLMs for semantic extraction, thematic analysis, and summarization at scale.
Subscribe to the Gradient Flow Newsletter 📩 https://gradientflow.substack.com/
Subscribe: Apple · Spotify · Overcast · Pocket Casts · AntennaPod · Podcast Addict · Amazon · RSS.
Detailed show notes - with links to many references - can be found on The Data Exchange web site.

532 Listeners

1,095 Listeners

624 Listeners

582 Listeners

301 Listeners

346 Listeners

970 Listeners

201 Listeners

309 Listeners

98 Listeners

534 Listeners

140 Listeners

99 Listeners

225 Listeners

639 Listeners