
Sign up to save your podcasts
Or
Data integration is one of the most challenging aspects of any data platform, especially as the variety of data sources and formats grow. Enterprise organizations feel this acutely due to the silos that occur naturally across business units. The CluedIn team experienced this issue first-hand in their previous roles, leading them to build a business aimed at building a managed data fabric for the enterprise. In this episode Tim Ward, CEO of CluedIn, joins me to explain how their platform is architected, how they manage the task of integrating with third-party platforms, automating entity extraction and master data management, and the work of providing multiple views of the same data for different use cases. I highly recommend listening closely to his explanation of how they manage consistency of the data that they process across different storage backends.
Introduction
How did you get involved in the area of data management?
Before we get started, can you share your definition of what a data fabric is?
Can you explain what CluedIn is and share the story of how it started?
Can you give an overview of the system architecture that you have built and how it has evolved since you first began building it?
For a new customer of CluedIn, what is involved in the onboarding process?
What are some of the most challenging aspects of data integration?
How do you manage changes or breakage in the interfaces that you use for source or destination systems?
What are some of the signals that you monitor to ensure the continued healthy operation of your platform?
What are some of the most notable customer success stories that you have experienced?
What are some cases where CluedIn is not the right choice?
What do you have planned for the future of CluedIn?
The intro and outro music is from The Hug by The Freak Fandango Orchestra / CC BY-SA
Support Data Engineering Podcast
4.6
135135 ratings
Data integration is one of the most challenging aspects of any data platform, especially as the variety of data sources and formats grow. Enterprise organizations feel this acutely due to the silos that occur naturally across business units. The CluedIn team experienced this issue first-hand in their previous roles, leading them to build a business aimed at building a managed data fabric for the enterprise. In this episode Tim Ward, CEO of CluedIn, joins me to explain how their platform is architected, how they manage the task of integrating with third-party platforms, automating entity extraction and master data management, and the work of providing multiple views of the same data for different use cases. I highly recommend listening closely to his explanation of how they manage consistency of the data that they process across different storage backends.
Introduction
How did you get involved in the area of data management?
Before we get started, can you share your definition of what a data fabric is?
Can you explain what CluedIn is and share the story of how it started?
Can you give an overview of the system architecture that you have built and how it has evolved since you first began building it?
For a new customer of CluedIn, what is involved in the onboarding process?
What are some of the most challenging aspects of data integration?
How do you manage changes or breakage in the interfaces that you use for source or destination systems?
What are some of the signals that you monitor to ensure the continued healthy operation of your platform?
What are some of the most notable customer success stories that you have experienced?
What are some cases where CluedIn is not the right choice?
What do you have planned for the future of CluedIn?
The intro and outro music is from The Hug by The Freak Fandango Orchestra / CC BY-SA
Support Data Engineering Podcast
272 Listeners
283 Listeners
152 Listeners
41 Listeners
482 Listeners
592 Listeners
624 Listeners
443 Listeners
298 Listeners
213 Listeners
266 Listeners
189 Listeners
64 Listeners
140 Listeners
77 Listeners