
Embark on a wild race with Gemma as we explore the exciting (and sometimes slow) world of running Google's open large language model! We'll test-drive different methods, from the leisurely pace of Ollama on a local machine to the speedier Groq platform. Join us as we compare these approaches, analyzing performance, costs, and ease of use for developers working with LLMs. Will the tortoise or the hare win this race?
Learn more:
* Model card: https://console.cloud.google.com/vertex-ai/publishers/google/model-garden/335
* Ollama: https://ollama.com/
* LangChain.js with Ollama: https://js.langchain.com/docs/integrations/llms/ollama (see the sketch below the links)
* Groq: https://groq.com/
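For a taste of the Ollama route discussed in the episode, here's a minimal sketch of querying a locally served Gemma model through LangChain.js's Ollama integration. It assumes Ollama is running on its default port with the model already pulled via `ollama pull gemma`; the import path and the `gemma` model tag are assumptions that vary by LangChain.js version and Ollama release, not details taken from the episode.

```ts
// Minimal sketch: prompt a local Gemma model served by Ollama via LangChain.js.
// Assumes Ollama is running locally and `ollama pull gemma` has been run;
// the import path differs across LangChain.js versions.
import { Ollama } from "@langchain/community/llms/ollama";

const llm = new Ollama({
  baseUrl: "http://localhost:11434", // Ollama's default local endpoint
  model: "gemma",                    // model name as pulled via Ollama
});

// Send a single prompt and print the completion.
const answer = await llm.invoke("Who wins the race: the tortoise or the hare?");
console.log(answer);
```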
Timestamps:
0:00:00 - Introduction
0:03:05 - Getting to Know Gemma: Exploring the Model Card
0:05:30 - Vertex AI Endpoint: Fast Deployment, But at What Cost?
0:13:40 - Ollama: The Tortoise of Local LLM Hosting
0:17:40 - LangChain Integration: Adding Functionality to Ollama
0:21:44 - Groq: The Hare of LLM Hardware (see the sketch after the timestamps)
0:26:06 - Comparing Approaches: Speed vs. Cost vs. Control
0:27:35 - Future of Open LLMs and Google Cloud Next
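And for the Groq segment, here's a hedged sketch of sending the same prompt to a Gemma model hosted on Groq's platform using the `groq-sdk` package's OpenAI-style chat API. The model id `gemma-7b-it` and the `GROQ_API_KEY` environment variable are illustrative assumptions, not details confirmed in the episode.

```ts
// Hedged sketch: prompt a Gemma model hosted on Groq via the groq-sdk package.
// The model id below is an assumption; check Groq's model list for current ids.
import Groq from "groq-sdk";

const groq = new Groq({ apiKey: process.env.GROQ_API_KEY });

const completion = await groq.chat.completions.create({
  model: "gemma-7b-it", // assumed Gemma model id on Groq
  messages: [{ role: "user", content: "Who wins the race: the tortoise or the hare?" }],
});

console.log(completion.choices[0]?.message?.content);
```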
#GemmaSprint
This project was supported, in part, by Cloud Credits from Google