Podcast Archives - Software Engineering Daily

Scaling Large ML Models to Small Devices with Atila Orhon



ML models are growing to many billions of parameters. This poses a challenge for running inference on non-dedicated hardware like phones and laptops.

Argmax is a startup focused on developing methods to run large models on commodity hardware. A key observation behind their strategy is that the largest models are getting larger, but the smallest models that are commercially relevant are getting smaller. The company was started in 2023 and has raised money from General Catalyst and other industry leaders.

Atila Orhon is the founder of Argmax and previously worked at Apple and NVIDIA. He joins the show to talk about working in computer vision, building ML tooling at Apple, optimizing ML models, and more.

Sean’s been an academic, startup founder, and Googler. He has published works covering a wide range of topics from information visualization to quantum computing. Currently, Sean is Head of Marketing and Developer Relations at Skyflow and host of the podcast Partially Redacted, a podcast about privacy and security engineering. You can connect with Sean on Twitter @seanfalconer.

 




