Cisco Podcast Network

AI Insights – EP.2: Unlocking Cost-Effective AI with Small Language Models


Listen Later

In the latest episode of the Cisco AI Insights Podcast, hosts Rafael Herrera and Sónia Marques welcome Cisco AI operations engineer James Tidd for a discussion on the world of small language models (SLMs) and the evolution of efficient AI inference. Together, they unravel the complexities behind “Fast Inference from Transformers via Speculative Decoding,” a groundbreaking paper from Google that explores how smaller draft models can speed up large language model predictions while maintaining accuracy.
James shares his hands-on experience experimenting with the technique, leveraging knowledge distillation and speculative execution. The trio also discusses the potential of this approach to optimize AI, reduce power consumption and costs, and help businesses of all sizes get more out of existing hardware.
A special thank you to Google’s AI team for developing this month's paper. If you are interested in reading the paper yourself, please visit this link: https://research.google/blog/looking-back-at-speculative-decoding/.
...more
View all episodesView all episodes
Download on the App Store

Cisco Podcast NetworkBy Cisco

  • 4.9
  • 4.9
  • 4.9
  • 4.9
  • 4.9

4.9

19 ratings


More shows like Cisco Podcast Network

View all
The Briefing with Albert Mohler by R. Albert Mohler, Jr.

The Briefing with Albert Mohler

8,690 Listeners

Bloomberg Intelligence by Bloomberg

Bloomberg Intelligence

414 Listeners

Security Now (Audio) by TWiT

Security Now (Audio)

2,009 Listeners

WSJ Tech News Briefing by The Wall Street Journal

WSJ Tech News Briefing

1,659 Listeners

SANS Internet Stormcenter Daily Cyber Security Podcast (Stormcast) by Johannes B. Ullrich

SANS Internet Stormcenter Daily Cyber Security Podcast (Stormcast)

651 Listeners

CyberWire Daily by N2K Networks

CyberWire Daily

1,024 Listeners

Smashing Security by Graham Cluley

Smashing Security

320 Listeners

Darknet Diaries by Jack Rhysider

Darknet Diaries

8,075 Listeners

Cybersecurity Today by Jim Love

Cybersecurity Today

178 Listeners

The Cisco Learning Network by The Cisco Learning Network

The Cisco Learning Network

76 Listeners

CISO Series Podcast by David Spark, Mike Johnson, and Andy Ellis

CISO Series Podcast

194 Listeners

The Breakdown by Blockworks

The Breakdown

741 Listeners

Defense in Depth by David Spark, Steve Zalewski, Geoff Belknap

Defense in Depth

73 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

10,196 Listeners

Cybersecurity Headlines by CISO Series

Cybersecurity Headlines

138 Listeners