The AWS Developers Podcast

3 ways to deploy your large language models on AWS


Listen Later

In this episode of the AWS Developers Podcast, we dive into the different ways to deploy large language models (LLMs) on AWS. From self-managed deployments on EC2 to fully managed services like SageMaker and Bedrock, we break down the pros and cons of each approach. Whether you're optimizing for compliance, cost, or time-to-market, we explore the trade-offs between flexibility and simplicity. You'll hear practical insights into instance selection, infrastructure management, model sizing, and prototyping strategies. We also examine how services like SageMaker Jumpstart and serverless architectures like Bedrock can streamline your machine learning workflows. If you're building or scaling AI applications in the cloud, this episode will help you navigate your options and design a deployment strategy that fits your needs.

With Germaine Ong, Startup Solution Architect ; With Jarett Yeo, Startup Solution Architect

    • Blog: Deploying Deepseek R1 Distill on Amazon EC2
      Blog: Deploying DeepSeek R1 Distill on Amazon Sagemaker Jumpstart
      Ollama
      Open Web UI
      Doc: deploy your own model on Amazon Sagemaker
      Doc: deploy your own model on Amazon Bedrock
  • ...more
    View all episodesView all episodes
    Download on the App Store

    The AWS Developers PodcastBy Amazon Web Services

    • 4.7
    • 4.7
    • 4.7
    • 4.7
    • 4.7

    4.7

    24 ratings


    More shows like The AWS Developers Podcast

    View all
    The Changelog: Software Development, Open Source by Changelog Media

    The Changelog: Software Development, Open Source

    289 Listeners

    The a16z Show by Andreessen Horowitz

    The a16z Show

    1,085 Listeners

    Software Engineering Daily by Software Engineering Daily

    Software Engineering Daily

    624 Listeners

    Talk Python To Me by Michael Kennedy

    Talk Python To Me

    585 Listeners

    Data Engineering Podcast by Tobias Macey

    Data Engineering Podcast

    144 Listeners

    Darknet Diaries by Jack Rhysider

    Darknet Diaries

    8,048 Listeners

    Tech Brew Ride Home by Morning Brew

    Tech Brew Ride Home

    963 Listeners

    Practical AI by Practical AI LLC

    Practical AI

    210 Listeners

    AWS Podcast by Amazon Web Services

    AWS Podcast

    203 Listeners

    AWS Morning Brief by Corey Quinn

    AWS Morning Brief

    79 Listeners

    The Stack Overflow Podcast by The Stack Overflow Podcast

    The Stack Overflow Podcast

    64 Listeners

    The Real Python Podcast by Real Python

    The Real Python Podcast

    142 Listeners

    Last Week in AI by Skynet Today

    Last Week in AI

    306 Listeners

    The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

    The AI Daily Brief: Artificial Intelligence News and Analysis

    607 Listeners

    The Pragmatic Engineer by Gergely Orosz

    The Pragmatic Engineer

    64 Listeners