Podcast Archives - Software Engineering Daily

Open-Weight AI Models


Listen Later

Open-weight models are AI systems whose trained parameters are publicly released, which allows developers to run, fine-tune, and deploy them independently rather than accessing them only through a hosted API. While closed-weight models from companies like OpenAI or Anthropic are delivered as managed services, open-weight models give organizations direct control over how the models are deployed and used. Importantly, the performance of these models is steadily improving and they’ve become credible alternatives for production workloads, with advantages in customization and data privacy.

Fireworks AI is building a platform focused on serving and customizing open-weight models at scale. The platform includes optimized inference infrastructure, multi-hardware support across NVIDIA and AMD, and reinforcement fine-tuning capabilities.

Benny Chen is a Co-Founder of Fireworks AI. In this episode, he joins Gregor Vand to discuss his path from Meta’s ML infrastructure teams to co-founding Fireworks AI, why open-weight models are becoming increasingly competitive, how custom kernels and speculative decoding improve performance, reinforcement fine-tuning, and much more.

Gregor Vand is a security-focused technologist, having previously been a CTO across cybersecurity, cyber insurance and general software engineering companies. He is based in Singapore and can be found via his profile at vand.hk or on LinkedIn.

 

 

 

Please click here to see the transcript of this episode.

Sponsorship inquiries: [email protected]

The post Open-Weight AI Models appeared first on Software Engineering Daily.

...more
View all episodesView all episodes
Download on the App Store

Podcast Archives - Software Engineering DailyBy Podcast Archives - Software Engineering Daily

  • 4
  • 4
  • 4
  • 4
  • 4

4

4 ratings


More shows like Podcast Archives - Software Engineering Daily

View all
The Eastern Border by Kristaps Andrejsons

The Eastern Border

824 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

626 Listeners

The Daily by The New York Times

The Daily

113,121 Listeners

Kubernetes Podcast from Google by Abdel Sghiouar, Kaslin Fields

Kubernetes Podcast from Google

180 Listeners

Post Reports by The Washington Post

Post Reports

5,217 Listeners

AWS Podcast by Amazon Web Services

AWS Podcast

204 Listeners

The Stack Overflow Podcast by The Stack Overflow Podcast

The Stack Overflow Podcast

63 Listeners

The 7 by The Washington Post

The 7

1,261 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

688 Listeners

Rust in Production by Matthias Endler

Rust in Production

25 Listeners