The Cloudcast

Training and Labeling Foundational AI Models


Listen Later

Alex Ratner (@ajratner, CEO @SnorkelAI) talks about labeling and training of LLMs. We go over Foundational Models and how to take an “off the shelf” model and fine-tune it for private use.

SHOW: 755

CLOUD NEWS OF THE WEEK - http://bit.ly/cloudcast-cnotw

NEW TO CLOUD? CHECK OUT - "CLOUDCAST BASICS"


SHOW SPONSORS:

  • Reduce the complexities of protecting your workloads and applications in a multi-cloud environment. Panoptica provides comprehensive cloud workload protection integrated with API security to protect the entire application lifecycle.  Learn more about Panoptica at panoptica.app
  • Find "Breaking Analysis Podcast with Dave Vellante" on Apple, Google and Spotify
  • Keep up to data with Enterprise Tech with theCUBE


SHOW NOTES:

  • SnorkelAI (homepage)
  • SnorkelAI on The Cloudcast #523

Topic 1 - Welcome back to the show. We last spoke two years ago. A lot has changed so we thought it would be a great time to talk about updates. For those that aren’t familiar, give everyone a quick background.

Topic 2 - Let’s start with when we last spoke. We talked a lot about data scientists and how data labeling works for training LLM’s. For those that aren’t familiar, can you give everyone a quick intro to data labeling and why it is important for training?

Topic 3 - When we last spoke, LLM weren't as mainstream as they are today. How has this impacted how you think about AI/ML in general? What are the big challenges for LLM today that you see?

Topic 4 - Many organizations are trying “off the shelf” models but this may or may not be a good idea. On one side they don’t have to build a model but on the other they still have to fine tune it to their specific needs and use case. What are your recommendations for organizations to get started and be as effective as possible?

Topic 5 - In addition to organization specific models, are teams building purpose built models for specific functions (i.e. are they running multiple models each with a different task?) How does labeling and training come into play here?

Topic 6 - Another challenge I’m seeing is security and data privacy concerns. What do folks getting started need to be aware of to make sure company data is safe?


FEEDBACK?

  • Email: show at the cloudcast dot net
  • Twitter: @thecloudcastnet
...more
View all episodesView all episodes
Download on the App Store

The CloudcastBy Massive Studios

  • 4.6
  • 4.6
  • 4.6
  • 4.6
  • 4.6

4.6

147 ratings


More shows like The Cloudcast

View all
Hanselminutes with Scott Hanselman by Scott Hanselman

Hanselminutes with Scott Hanselman

377 Listeners

Software Engineering Radio - the podcast for professional software developers by se-radio@computer.org

Software Engineering Radio - the podcast for professional software developers

266 Listeners

The Changelog: Software Development, Open Source by Changelog Media

The Changelog: Software Development, Open Source

285 Listeners

Thoughtworks Technology Podcast by Thoughtworks

Thoughtworks Technology Podcast

41 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

586 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

629 Listeners

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

434 Listeners

AWS Podcast by Amazon Web Services

AWS Podcast

200 Listeners

Python Bytes by Michael Kennedy and Brian Okken

Python Bytes

213 Listeners

Data Engineering Podcast by Tobias Macey

Data Engineering Podcast

140 Listeners

Syntax - Tasty Web Development Treats by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

Syntax - Tasty Web Development Treats

988 Listeners

Kubernetes Podcast from Google by Abdel Sghiouar, Kaslin Fields

Kubernetes Podcast from Google

181 Listeners

Practical AI by Practical AI LLC

Practical AI

190 Listeners

The Stack Overflow Podcast by The Stack Overflow Podcast

The Stack Overflow Podcast

63 Listeners

The Real Python Podcast by Real Python

The Real Python Podcast

136 Listeners