MLOps.community

Deploying Machine Learning Models at Scale in Cloud // Vishnu Prathish // MLOps Meetup #60


Listen Later

MLOps community meetup #60! Last Wednesday, we talked to Vishnu Prathish, Director of Engineering, AI Products, Innovyze.


Join the Community: ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠https://go.mlops.community/YTJoinIn⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠

Get the newsletter: ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠https://go.mlops.community/YTNewsletter⁠
// Abstract
The way Data Science is done is changing. Notebook sharing and collaboration were messy, and there was minimal visibility or QA into the model deployment process. Vishnu will talk about building an ops platform that deploys hundreds of models at scale every month. A platform that supports typical features of MLOps (CI/CD, Separated QA, Dev, and PROD environment, experiments tracking, Isolated retraining, model monitoring in real-time, Automatic Retraining with live data) and ensures quality and observability without compromising the collaborative nature of data science.


// Bio
With 10 years in building production-grade data-first software at BBM & HP Labs, I started building Emagin's AI platform about three years ago with the goal of optimizing operations for the water industry. At Innovyze post-acquisition, we are part of the org building a world-leading water infrastructure data analytics product.


//Takeaways
Why is MLOps necessary for model building at scale?  
What are various cloud-based models for MLOps?  
Where can ops help in various points in the ML pipeline: Data Prep, Feature Engineering, Model building, Training, Retraining, Evaluation, and inference


----------- Connect With Us ✌️-------------   

Join our Slack community:  
https://go.mlops.community/slack
Follow us on Twitter: @mlopscommunity
Sign up for the next meetup: https://go.mlops.community/register

Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/
Connect with Vishnu on LinkedIn: https://www.linkedin.com/in/vishnuprathish/

Timestamps:
[00:00] Introduction to Vishnu Prathish
[00:16] Vishnu's background
[04:18] Use cases on wooden pipes for freshwater
[04:55] Virtual representation of actual, physical, tangible assets
[06:56] Platform built by Vishnu
[08:30] Build a reliable representation of the network
[11:52] Pipeline architecture
[16:17] "MLOps is still an evolving discipline. You need to try and fail many times before you figure out what's right for you."
[17:11] Open-sourcing
[18:17] Platform for virtual twin
[20:02] Entirely Amazon Stagemaker
[20:43] Data quality issues
[23:21] Reproducibility
[23:40] "Reproducibility is important for everybody. Most of the frameworks do that for you."
[25:00] Reproducibility as Innovyze's core business.
[26:38] Each model is individual to each customer
[27:50] Solving reproducibility problems
[28:24] "Reproducibility applies to the process of training pipelines. It starts with collecting historical raw data from customers. In real-time, there's also this data being collected directly from sensors coming from a certain pipeline."
[31:55] "Reusable training is step one to attaining automated retraining."
[32:17] Collaboration of Vishnu's team
[36:23] War stories
[41:36] Data prediction
[44:24] "A data scientist is the most expensive hire you can make."
[47:55] 3 Tiers
[48:53] MLOps problems
[52:25] Automatically retraining
[52:34] "Because of the number of models that go through this pipeline, it's impossible for somebody to manually monitor and retrain as necessary. It's not easy, it takes a lot of time."
[54:22] Metrics on retraining
[56:42] "Retraining is a little less prevalent for our industry compared to a turned prediction model that changes a lot. There are external factors that depend on it, but a pump is a pump."

...more
View all episodesView all episodes
Download on the App Store

MLOps.communityBy Demetrios

  • 4.6
  • 4.6
  • 4.6
  • 4.6
  • 4.6

4.6

23 ratings


More shows like MLOps.community

View all
The a16z Show by Andreessen Horowitz

The a16z Show

1,093 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

622 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

302 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

332 Listeners

Data Engineering Podcast by Tobias Macey

Data Engineering Podcast

146 Listeners

Y Combinator Startup Podcast by Y Combinator

Y Combinator Startup Podcast

228 Listeners

Practical AI by Practical AI LLC

Practical AI

205 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

96 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

516 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

130 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

228 Listeners

AI + a16z by a16z

AI + a16z

36 Listeners

Lightcone Podcast by Y Combinator

Lightcone Podcast

22 Listeners

Training Data by Sequoia Capital

Training Data

39 Listeners

The Pragmatic Engineer by Gergely Orosz

The Pragmatic Engineer

72 Listeners