AI Lovers

The Future of AI Infrastructure on the Cloud



Join us for our latest discussion with Gad Benram and Charles Frye from Modal as they explore the strategic reasons behind companies choosing to host their own AI infrastructure versus relying on external cloud services. From controlling critical data to customizing AI applications, this episode is packed with valuable insights for anyone navigating the complex world of AI deployment.


Key topics include:

• 00:00 Introduction: Insights on AI Resources for Hosting AI Models

• 03:11 The Challenges of Existing Cloud Services

• 09:14 Introducing Modal: A Fast and Interactive Development Experience

• 15:13 Different Infrastructure Needs for Data Teams

• 19:42 Addressing Slowness in AI Services

• 26:20 Python and Notebooks for Data Scientists

• 33:35 Fast and Seamless Deployment with Modal

• 40:46 Future Directions and Closing Remarks


In this episode, Gad Benram and Charles Frye discuss the challenges of hosting AI models in production and the limitations of existing cloud services. They highlight the shortage of GPUs and other compute resources for serving AI applications, as well as the slow bootstrapping process. As a solution to these challenges, they introduce Modal, a serverless runtime for distributed applications built on top of cloud resources.

Modal offers fast deployment times, interactive development workflows, and support for large-scale models.


🔗 Visit our website for more resources and updates:

https://www.tensorops.ai/
👥 Connect with us on social media:
LinkedIn
Twitter
💬 Join our community:
https://www.meetup.com/ai-loves/


AI Lovers, by TensorOps