Two Voice Devs

Episode 202 - Hosting and Large Language Models



Join Allen Firstenberg and Mark Tucker on Two Voice Devs as they discuss the challenges and solutions of hosting large language models (LLMs). They explore various hosting environments, including Firebase, AWS Amplify, Vertex AI, and Docker/Kubernetes, comparing their strengths and weaknesses.


Allen shares his experience with Firebase Cloud Functions and the seamless integration with Google Cloud services, while Mark tackles the complexities of Docker, Kubernetes, and enterprise-level deployment strategies. From managing API keys and credentials to implementing design patterns and best practices, they explore the challenges and solutions for building robust and scalable AI systems.


This episode is packed with practical tips for developers, covering topics like:

[00:02:00] Firebase Suite of Tools: Learn how Firebase provides a comprehensive platform for hosting LLMs, including real-time databases, cloud storage, cloud functions, and authentication.

[00:04:00] Firebase vs. AWS Amplify: Discover the key differences between these two popular serverless platforms and their database options.

[00:05:00] Cloud Service Accounts for Security: Allen demonstrates how leveraging cloud service accounts can simplify permission management and enhance security.

[00:11:00] Architecture Design and Long-Term Hosting: Allen emphasizes the importance of considering future scalability and maintenance when selecting a hosting environment.

[00:12:30] Working with Docker and Kubernetes: Mark dives into his experience using Docker containers and Kubernetes for enterprise-level LLM deployment.

[00:15:00] Learning Python for LLM Development: Mark shares his experience learning Python for working with LLMs and using libraries like FastAPI for REST API development.

[00:17:00] Design Patterns and Best Practices: Allen and Mark discuss the evolving nature of design patterns and their importance in modern software development.

[00:20:00] KitOps for Model Deployment: Mark explains how KitOps can be used to separate model deployment from service deployment in a Kubernetes environment.

[00:23:00] Docker and Configuration Management: Allen discusses the challenge of configuration management in Docker environments and how to manage changes efficiently.

[00:24:00] Enterprise Security and Tooling: Mark explores the use of tools like HashiCorp Consul and Vault for managing configurations and secrets in enterprise deployments.

[00:26:00] The Importance of Containerization: Allen and Mark reiterate the fundamental role of containers in modern software development.


Don't miss this episode of Two Voice Devs, where you'll pick up practical tips for hosting and deploying your own LLMs!


#AI #Development #Hosting #Cloud #Docker #Kubernetes #Firebase #GoogleCloud #DesignPatterns #TwoVoiceDevs

Two Voice Devs, by Mark and Allen
