


Today we’re joined by Jilei Hou, a VP of Engineering at Qualcomm Technologies. In our conversation with Jilei, we focus on the emergence of generative AI and how Qualcomm has worked toward making these models usable on edge devices. We explore how distributing models across devices can help amortize the cost of large models while improving reliability and performance, as well as the challenges of running machine learning workloads on devices, including model size and inference latency. Finally, we explore with Jilei how these emerging technologies fit into the existing AI Model Efficiency Toolkit (AIMET) framework.
The complete show notes for this episode can be found at twimlai.com/go/633
By Sam Charrington
Rating: 4.7 (422 ratings)
