
Sign up to save your podcasts
Or


MLOps Coffee Sessions #110 with David Bayliss, Chief Data Scientist of LexisNexis Risk Solutions, Just Fetch the Data and then... co-hosted by Vishnu Rachakonda.
Join the Community: https://go.mlops.community/YTJoinIn
Get the newsletter: https://go.mlops.community/YTNewsletter
// Abstract
Composing data to extract features can be a significant problem. Key factors are the data size, compliance restrictions, and real-time data. Ethics (and law) can drive extremely complex audit requirements. In the cloud, you can do anything - at a price.
// Bio
One of the creators of the world's first big data platform (HPCC), David has been tackling big data problems for two decades. A mathematician, compiler writer, and data sponge with more than five dozen patents spanning platforms, linking, and search.
Most inventors think outside the box; David can't even remember where the box is. He leads the team that creates their core Data Science methods used by hundreds of data scientists.
// MLOps Jobs board
MLOps Swag/Merch
https://mlops-community.myshopify.com/
// Related Links
Interesting insight in this post. It would be cool to learn from David about his view on things
https://www.google.com/url?q=https://www.linkedin.com/posts/david-bayliss-426556a_datascience-platform-portability-activity-6913448643303759872-2dqq?utm_source%3Dlinkedin_share%26utm_medium%3Dmember_desktop_web&sa=D&source=calendar&ust=1649078059106132&usg=AOvVaw26wAevExeEfW_AdZSA8UhF
--------------- ✌️Connect With Us ✌️ -------------
Join our Slack community: https://go.mlops.community/slack
Follow us on Twitter: @mlopscommunity
Sign up for the next meetup: https://go.mlops.community/register
Catch all episodes, blogs, newsletters, and more: https://mlops.community/
Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/
Connect with Vishnu on LinkedIn: https://www.linkedin.com/in/vrachakonda/
Connect with David on LinkedIn: https://www.linkedin.com/in/david-bayliss-426556a/
Timestamps:
[00:00] Introduction to David Bayliss
[01:03] Takeaways
[04:56] LexisNexis and David's role
[07:15] Evolution of LexisNexis in 20 years with so many use cases
[08:51] Role of David in structuring data for working with data change
[14:32] Data management and data access
[17:45] Unique challenges of scale, use case, and diversity at LexisNexis
[24:47] Tardis Iron Box
[30:05] Iron Box translation
[32:56] JVM for data science
[34:24] Iron Box meaning
[36:52] Metadata with PII
[39:08] Detrimental privacy / Hairy Kneecap Theory
[40:57] Speeding things up and Anonymized linking
[46:47] What kept David working at LexisNexis?
[50:30] Wrap up
By Demetrios4.6
2323 ratings
MLOps Coffee Sessions #110 with David Bayliss, Chief Data Scientist of LexisNexis Risk Solutions, Just Fetch the Data and then... co-hosted by Vishnu Rachakonda.
Join the Community: https://go.mlops.community/YTJoinIn
Get the newsletter: https://go.mlops.community/YTNewsletter
// Abstract
Composing data to extract features can be a significant problem. Key factors are the data size, compliance restrictions, and real-time data. Ethics (and law) can drive extremely complex audit requirements. In the cloud, you can do anything - at a price.
// Bio
One of the creators of the world's first big data platform (HPCC), David has been tackling big data problems for two decades. A mathematician, compiler writer, and data sponge with more than five dozen patents spanning platforms, linking, and search.
Most inventors think outside the box; David can't even remember where the box is. He leads the team that creates their core Data Science methods used by hundreds of data scientists.
// MLOps Jobs board
MLOps Swag/Merch
https://mlops-community.myshopify.com/
// Related Links
Interesting insight in this post. It would be cool to learn from David about his view on things
https://www.google.com/url?q=https://www.linkedin.com/posts/david-bayliss-426556a_datascience-platform-portability-activity-6913448643303759872-2dqq?utm_source%3Dlinkedin_share%26utm_medium%3Dmember_desktop_web&sa=D&source=calendar&ust=1649078059106132&usg=AOvVaw26wAevExeEfW_AdZSA8UhF
--------------- ✌️Connect With Us ✌️ -------------
Join our Slack community: https://go.mlops.community/slack
Follow us on Twitter: @mlopscommunity
Sign up for the next meetup: https://go.mlops.community/register
Catch all episodes, blogs, newsletters, and more: https://mlops.community/
Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/
Connect with Vishnu on LinkedIn: https://www.linkedin.com/in/vrachakonda/
Connect with David on LinkedIn: https://www.linkedin.com/in/david-bayliss-426556a/
Timestamps:
[00:00] Introduction to David Bayliss
[01:03] Takeaways
[04:56] LexisNexis and David's role
[07:15] Evolution of LexisNexis in 20 years with so many use cases
[08:51] Role of David in structuring data for working with data change
[14:32] Data management and data access
[17:45] Unique challenges of scale, use case, and diversity at LexisNexis
[24:47] Tardis Iron Box
[30:05] Iron Box translation
[32:56] JVM for data science
[34:24] Iron Box meaning
[36:52] Metadata with PII
[39:08] Detrimental privacy / Hairy Kneecap Theory
[40:57] Speeding things up and Anonymized linking
[46:47] What kept David working at LexisNexis?
[50:30] Wrap up

1,093 Listeners

622 Listeners

302 Listeners

332 Listeners

146 Listeners

228 Listeners

205 Listeners

96 Listeners

516 Listeners

130 Listeners

228 Listeners

36 Listeners

22 Listeners

39 Listeners

72 Listeners