


Today we have Philip Kiely from Baseten on the show. Baseten is a Series B startup focused on providing infrastructure for AI workloads.
We go deep on inference optimization: choosing a model, the hype around compound AI, choosing an inference engine, and optimization techniques like quantization and speculative decoding, all the way down to your GPU choice.
By Software Huddle
