
Sign up to save your podcasts
Or


MLOps Coffee Sessions #83 with Vincent Warmerdam, Better Use Cases for Text Embeddings.
Join the Community: https://go.mlops.community/YTJoinIn
Get the newsletter: https://go.mlops.community/YTNewsletter
// Abstract
Text embeddings are very popular, but there are plenty of reasons to be concerned about their applications. There's algorithmic fairness, compute requirements, as well as issues with the datasets that they're typically trained on.
In this session, Vincent gives an overview of some of these properties while also talking about an underappreciated use-case for the embeddings: labeling!
// Bio
Vincent D. Warmerdam is a senior data professional who has worked as an engineer, researcher, team lead, and educator in the past. He's especially interested in understanding algorithmic systems so that one can prevent failure. As such, he has a preference for simpler solutions that scale, as opposed to the latest and greatest from the hype cycle. He currently works as a Research Advocate at Rasa, where he collaborates with the research team to explain and understand conversational systems better.
Outside of Rasa, Vincent is also well known for his open-source projects (scikit-lego, human-learn, doubtlab, and more), collaborations with open source projects like spaCy, his blog over at koaning.io, and his calm code educational project.
--------------- ✌️Connect With Us ✌️ -------------
Join our Slack community: https://go.mlops.community/slack
Follow us on Twitter: @mlopscommunity
Sign up for the next meetup: https://go.mlops.community/register
Catch all episodes, blogs, newsletter, and more: https://mlops.community/
Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/
Connect with Skylar on LinkedIn: https://www.linkedin.com/in/skylar-payne-766a1988/
Connect with Vincent on LinkedIn: https://www.linkedin.com/in/vincentwarmerdam/
Timestamps:
[00:00] Takeaways
[04:10] Favorite purchases this pandemic
[05:05] What drives Vincent to understand how ML can fail?
[08:33] How and why to make systems simpler?
[11:37] Techniques shared by Vincent in his talks
[15:51] ML as a UI problem
[17:02] Figuring out rules in your data
[20:01] Detecting bad labels
[23:53] Labeling isn't necessarily easy
[25:48] Fraud use case
[27:42] How does Vincent stay sane looking for frauds?
[29:12] How does Vincent produce so many packages?
[31:23] Vincent's favorite package
[33:24] Explosion AI
[36:14] Python all the way
[37:44] Shift from model-centric to data-centric AI
[39:35] Talking about the problem is necessary
[40:40] Vincent's war stories
[44:04] Adding constraints to the system
[47:49] Wrap up
By Demetrios4.6
2323 ratings
MLOps Coffee Sessions #83 with Vincent Warmerdam, Better Use Cases for Text Embeddings.
Join the Community: https://go.mlops.community/YTJoinIn
Get the newsletter: https://go.mlops.community/YTNewsletter
// Abstract
Text embeddings are very popular, but there are plenty of reasons to be concerned about their applications. There's algorithmic fairness, compute requirements, as well as issues with the datasets that they're typically trained on.
In this session, Vincent gives an overview of some of these properties while also talking about an underappreciated use-case for the embeddings: labeling!
// Bio
Vincent D. Warmerdam is a senior data professional who has worked as an engineer, researcher, team lead, and educator in the past. He's especially interested in understanding algorithmic systems so that one can prevent failure. As such, he has a preference for simpler solutions that scale, as opposed to the latest and greatest from the hype cycle. He currently works as a Research Advocate at Rasa, where he collaborates with the research team to explain and understand conversational systems better.
Outside of Rasa, Vincent is also well known for his open-source projects (scikit-lego, human-learn, doubtlab, and more), collaborations with open source projects like spaCy, his blog over at koaning.io, and his calm code educational project.
--------------- ✌️Connect With Us ✌️ -------------
Join our Slack community: https://go.mlops.community/slack
Follow us on Twitter: @mlopscommunity
Sign up for the next meetup: https://go.mlops.community/register
Catch all episodes, blogs, newsletter, and more: https://mlops.community/
Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/
Connect with Skylar on LinkedIn: https://www.linkedin.com/in/skylar-payne-766a1988/
Connect with Vincent on LinkedIn: https://www.linkedin.com/in/vincentwarmerdam/
Timestamps:
[00:00] Takeaways
[04:10] Favorite purchases this pandemic
[05:05] What drives Vincent to understand how ML can fail?
[08:33] How and why to make systems simpler?
[11:37] Techniques shared by Vincent in his talks
[15:51] ML as a UI problem
[17:02] Figuring out rules in your data
[20:01] Detecting bad labels
[23:53] Labeling isn't necessarily easy
[25:48] Fraud use case
[27:42] How does Vincent stay sane looking for frauds?
[29:12] How does Vincent produce so many packages?
[31:23] Vincent's favorite package
[33:24] Explosion AI
[36:14] Python all the way
[37:44] Shift from model-centric to data-centric AI
[39:35] Talking about the problem is necessary
[40:40] Vincent's war stories
[44:04] Adding constraints to the system
[47:49] Wrap up

1,093 Listeners

622 Listeners

302 Listeners

332 Listeners

146 Listeners

228 Listeners

205 Listeners

96 Listeners

516 Listeners

130 Listeners

228 Listeners

36 Listeners

22 Listeners

39 Listeners

72 Listeners