Data in Biotech

Using Generative AI to Design New Therapeutic Proteins with Evozyne


Listen Later

This week on Data in Biotech, Ryan Mork, Director of Data Science at Evozyne, joins host Ross Katz to discuss how data science and machine learning are being used in protein engineering and drug discovery.

Ryan explains how Evozyne is utilizing large language models (LLMs) and generative AI (GenAI) to design new biomolecules, training the models with huge volumes of protein and biology data. He walks through the organization’s evolution-based design approach and how it leverages the evolutionary history of protein families.

Ross and Ryan dig into the different models being used by Evozyne, including latent variable models and embeddings. They also discuss some of the challenges around testing the functionality of models and the approaches that can be used for evaluation.

Alongside the deep dive into data and modeling topics, Ryan also discusses the importance of relationships between the wet lab and data science teams. He emphasizes the need for mutual understanding of each role to ensure the entire organization pulls together towards the same goals.

Finally, Ross asks Ryan to opine on the future of GenAI and LLMs for biotechnology and how this area will develop over the next five years. He also finds out more about the R&D roadmap at Evozyne and its plans to play a part in moving GenAI for protein engineering forward.

Data in Biotech is a fortnightly podcast exploring how companies leverage data innovation in the life sciences.

Chapter Markers

[1:24] Introduction to Ryan, his career to date, and the focus of Evozyne.

[2:59] How the Evozyne data science team operates and the data sources it utilizes.

[4:22] Building models to develop synthetic proteins for therapeutic uses.

[9:10] Deciding which proteins to take into the lab for experimental validation.

[10:49] Taking an evolution-based design approach to protein engineering.

[14:34] Using latent variable models and embeddings to capture evolutionary relationships.

[18:01] Evaluating the functionality of generative models and the role of auxiliary models.

[24:24] The value of tight coupling and mutual understanding between wet lab and data science teams.

[28:07] Evozyne’s approach to developing and testing new data science tools, models, and technologies.

[31:35] Predictions for future developments in Generative AI for biotechnology.

[33:41] Evozyne’s goal to increase throughput and its planned approach.

[39:09] Where to connect with Ryan and keep up to date with news from Evozyne.

...more
View all episodesView all episodes
Download on the App Store

Data in BiotechBy CorrDyn

  • 5
  • 5
  • 5
  • 5
  • 5

5

10 ratings


More shows like Data in Biotech

View all
Fresh Air by NPR

Fresh Air

37,849 Listeners

Planet Money by NPR

Planet Money

30,668 Listeners

Freakonomics Radio by Freakonomics Radio + Stitcher

Freakonomics Radio

32,107 Listeners

Odd Lots by Bloomberg

Odd Lots

1,856 Listeners

Pivot by New York Magazine

Pivot

9,250 Listeners

Bold Names by The Wall Street Journal

Bold Names

1,451 Listeners

The Long Run with Luke Timmerman by Timmerman Report

The Long Run with Luke Timmerman

122 Listeners

The Indicator from Planet Money by NPR

The Indicator from Planet Money

9,500 Listeners

The Readout Loud by STAT

The Readout Loud

316 Listeners

Practical AI by Practical AI LLC

Practical AI

188 Listeners

Business Of Biotech by Ben Comer

Business Of Biotech

88 Listeners

BioCentury This Week by BioCentury

BioCentury This Week

30 Listeners

Raising Health by Andreessen Horowitz, a16z Bio + Health

Raising Health

144 Listeners

The Ezra Klein Show by New York Times Opinion

The Ezra Klein Show

15,397 Listeners

Biotech Hangout by Daphne Zohar, Josh Schimmer, Brad Loncar, Tim Opler & more

Biotech Hangout

18 Listeners