January 19, 2024

110. Why should you use Lambda for Machine Learning?

24 minutes

In this episode, we discuss using AWS Lambda for machine learning inference. We cover the tradeoffs between GPUs and CPUs for ML, look at tools like ggml and llama.cpp for running models on CPUs, and share examples where we've experimented with Lambda for ML, such as podcast transcription, medical imaging, and natural language processing. While ML on Lambda is still quite experimental, it can be a viable option for certain use cases.

💰 SPONSORS 💰

AWS Bites is brought to you by fourTheorem, an Advanced AWS Partner. If you are moving to AWS or need a partner to help you go faster, check us out at fourtheorem.com!

In this episode, we mentioned the following resources.

Episode "46. How do you do machine learning on AWS?": https://awsbites.com/46-how-do-you-do-machine-learning-on-aws/

Episode "108. How to Solve Lambda Python Cold Starts": https://awsbites.com/108-how-to-solve-lambda-python-cold-starts/

ggml (the framework): https://github.com/ggerganov/ggml

ggml (the company): https://ggml.ai

llama.cpp: https://github.com/ggerganov/llama.cpp

whisper.cpp: https://github.com/ggerganov/whisper.cpp

whisper.cpp WebAssembly demo: https://whisper.ggerganov.com/

ONNX Runtime: https://onnxruntime.ai/

An example of using whisper.cpp with the Rust bindings: https://github.com/lmammino/whisper-rs-example

Project running Whisper.cpp in a Lambda function: https://github.com/eoinsha/whisper_lambda_cpp

AWS Lambda Image Container Chest X-Ray Example: https://github.com/fourTheorem/lambda-image-cxr-detection

Episode "103. Building GenAI Features with Bedrock": https://awsbites.com/103-building-genai-features-with-bedrock/

Do you have any AWS questions you would like us to address?

Leave a comment here or connect with us on X, formerly Twitter:

- https://twitter.com/eoins

- https://twitter.com/loige


AWS Bites

By AWS Bites

