Sandboxed - On-Device AI for iOS

Machine Learning Engineering: Training vs. Inference


Listen Later

As iOS engineers, we respect the science of training models, but we live in the trenches of inference. In this episode, we explore the "Great Divide" between creating a model and running it. We break down the memory mechanics of O(N) training vs O(1) inference, dissect compiler optimizations like kernel fusion, and explain exactly how the Apple Neural Engine cheats bandwidth physics using quantization.

...more
View all episodesView all episodes
Download on the App Store

Sandboxed - On-Device AI for iOSBy Logicbridge