
Sign up to save your podcasts
Or
Today on the Salesforce Admins Podcast, we talk to Chris Emmett, Salesforce Solution Architect at Capgemini. Join us as we chat about how to clean up your data to prepare your org for Agentforce, and why data without context is useless.
You should subscribe for the full episode, but here are a few takeaways from our conversation with Chris Emmett.
I caught up with Chris hot on the heels of his TDX London session, “Prep Like a Pro: Clean Data and Metadata for Agentforce.” He’s an experienced Salesforce consultant who has helped countless organizations of all sizes reboot their business processes.
As Chris explains, unless you have a company of five people that started last week, your org probably needs some data cleanup. And if you want to get started with Agentforce, you need to do the work to make sure the agents you build can understand your data and use it to generate actionable insights. After all, if you can’t derive useful information from your data, then it’s useless.
When I worked in sales, we used a CRM that was so complicated that only one guy at our company knew how to use it. Talk about a bottleneck!
The truth is, if your business has been around for a little while, you’ve probably inherited all sorts of legacy data. Maybe it’s some random field created by that one guy in the 90s who didn’t document anything, or a legacy system like SAP or MSX that is essential to your day-to-day operations.
Chris has seen it all, and it can often feel like cleaning up all that data is akin to boiling the ocean. It’s a monumental task with no end in sight, let alone getting the organizational buy-in to do it in the first place.
Chris recommends focusing your data cleanup strategy on the functionality you want to build in Agentforce. For example, if you want an agent to email a customer when their opportunity is five days from the close date and still unsigned, what data do you actually need?
You don’t need the 300 fields that might be on the opportunity page, or the 300 fields in that account. You might need the opportunity’s name, the stage of the opportunity, the close date, the account, and maybe the primary contact of that account. That’s five pieces of information.
Suddenly, you don’t need to boil the entire ocean—you just need to boil a cup of water. So start small, focus on the functionality your data cleanup project will deliver, and get the ball rolling. Trust that the things you build with Agentforce will speak for themselves, and you’ll be able to generate momentum to clean up your data project by project.
Make sure to listen to our full conversation with Chris to learn more about how to clean up your data and provide context for AI agents. And don’t forget to subscribe to the Salesforce Admins Podcast so you never miss an episode.
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
And then about 10 years later, around 2016, I was working for a company and we were really interested in changing how we managed products. We wanted a brand new system to manage our projects, and one of the products that we looked at was Salesforce. And my word, I was blown away. Literally within one day of using Salesforce, I was creating formula fields and workflow rules as they were back then, and I just fell in love. And this is coming from a person who had spent the best part of a decade dealing with systems where if you needed to add a field, it would take two, three months to get through, and I was dealing with a system where I could start a sentence, explaining to someone what Salesforce was, and by the time I’ve finished that sentence, I could have created a field. I could have expanded that data model, and I fell in love with it.
The company I was working for did not fall in love with it, and that pilot failed. It fell by the wayside, but I was hooked. I was like, man, I need to change my career. So I start looking for project management jobs within the Salesforce space. I had never project managed software before. I had project managed big old factory systems that were very waterfall in their approach. I had never done agile before, so I was applying and applying and applying, getting nowhere. I was probably applying for about eight months, and then this small company in Cambridge were like, dude, you are not great for a project manager but you seem really enthusiastic. Had you considered being a consultant? And I went, but yeah, because it’s not managing the projects that I love. It’s the system that I love. Salesforce is a platform that I love. I want to be able to empower other companies to improve themselves through Salesforce.
So I got a job there as a consultant, and that was 2017, and I have just been building up and flourishing, and since then, I’ve got 600 odd badges on Trailhead and I think 23 certs now. So I’ve just gone all in and built my career up.
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
So the danger isn’t, oh, let’s just create everything in Salesforce. The danger is 14 years ago, Derek created this field. We don’t know what it does. We don’t know where it’s hooked up to. It’s not documented anywhere, but we feel like we should pull it over. So the real danger is actually migrating everything. If you don’t know what that data point is, you don’t know what use it is, you can’t validate it, and you can’t use it in any meaningful way. Because if you don’t understand, then to bring it to the point of this pod, if you don’t understand what that data is, what it means, what it’s doing for your business, how can an AI agent understand that? An AI agent is not magical. It’s not telepathic. It reads the information as if it’s a human. It tries to interpret that information. So you’ve got to know what it means so your AI agent can be told what it means as well.
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Now, if that’s written in a note without any context, and I say Agentforce, what does this mean? It’s going to go, well, I don’t know. A date in the future. But if it’s against the close date field, it’s immediately got context, and it immediately can derive something from that. It can say, oh, right, okay, so this is the close date. I can see that we are in a negotiation stage. We’ve got one more stage after this, which is sign contract. It’s the 24th. It’s a few weeks away, or at least from when we’re recording. I understand a bit of context. I understand that there’s another stage ahead of this, and I’ve immediately got more information other than just a record with a random date. That context is everything, especially for an LLM.
Mike:
Chris Emmett:
Mike:
Chris Emmett:
So how do you actually break that down? You’re right. You cannot boil the ocean. We start off by thinking about the actions and the intents that you want your agents to do. So if you want your agents to write an email to a customer if their opportunity is within five days of the close date and they’ve not signed yet, well, what do you need for that? You don’t need 300 fields that might be on the opportunity page or 300 fields in that account. You might need the opportunity’s name. You need the stage of the opportunity. You need the close date. You need the account, and maybe the primary contact of that account. That’s five pieces of information.
And then you do not really need to think about all of the old opportunities because this is about an action where you are emailing people for opportunities that are about to expire or about to close. So immediately, you’ve gone from a million records, let’s say, you’ve got it down to a thousand records, and then you’re only looking at those five pieces of data. So you got it down to a thousand records and you got it down to five pieces of information on those thousands records. So I’m not going to do the math in my head because I’m terrible at that, but it’s-
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
And I was thinking back to last week. So in Iowa, in the US where I live, we had a really bad frost all winter. We didn’t get the snow cover that we usually do, and some of my landscaping plans didn’t make it because the roots were just burned by the frost, but not all of them. Some of them were hardy and they’re fine. And so I called my landscape company. I was like, I need to replace all these, and plus I want different ones anyway. He’s like, good, because if we go through another winter like this, we’re just going to be replacing them. And I promise you this gets somewhere.
But much like your analogy, so the landscape company came out, and just in the area that they needed to replant those bushes, they scraped all the gravel, leveled the bed, put new tarp down, replanted the bushes, put the gravel back. They didn’t have to clean the entire planter bed and scoop all the gravel out and dig up all the bushes and start from scratch. They only had to do the part that mattered. And I felt like that was like, wow, that’s like a real life scenario of if we’re going to implement this and we’re going to really laser focus on this one part of it, let’s do that.
But the other key thing that you said that nobody has said on a podcast about cleaning data is you improve the process while you do it. Because if you are not going to improve the process that led you to the bad data, you’re always going to be cleaning data. It’s almost like sending a janitor out to a sports stadium to pick up trash because there’s no trash cans. Well, if you’re not going to sit down and say, okay, how do we put trash cans out so that trash isn’t everywhere? All you’re doing is sending the janitor back out to clean up trash. You’re not actually fixing the problem that led to the trash being everywhere.
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
But then you’ve got companies who are just in a bind where they’ve got a 30-year-old system and they do not have the budget to replace it, but Salesforce has got to integrate with it, and the design decisions 30 years ago for that system that Salesforce has to adhere to. So sometimes, it is because the systems that you have to hook Salesforce into are just bound by old design, and then you have to introduce those bad data decisions into Salesforce.
Mike:
Chris Emmett:
I definitely worked for a company, I can’t say their name, I’m pretty sure, but I worked for a company where they were using an MS-DOS program because the person who wrote it, he wrote it in his garage and then retired in the ’90s. And if he’s still with us, he’s probably, I’d like to think, on a yacht somewhere because the software developers in the ’70s probably earned quite a lot of money. He’s enjoying life and not thinking about it. But that company was stuck with that MS-DOS program because it did a vital thing and it can’t be removed. It can’t be replaced because they know it does a vital process, but they don’t know how it works. And then it becomes a business risk. It’s like, do you risk the operation of the business to try and rebuild this or do you just leave it?
And then another thing I was thinking about this morning, I was thinking a lot of stuff at the gym.
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
And that’s where the whole point of trying to identify what’s the actual action that you want your AI agents to carry out, and what are the data points that that action needs to interact with? Okay. And then let’s tackle those data points and turn those data points, to go way back to the start of the conversation, you turn those data points into information, because you cannot boil the ocean. And the majority of companies, at least the ones I’ve dealt with, have a sea of data that just either makes no sense or comes from old systems or comes from unnecessary decisions. And you cannot … there’s no business case that will ever be put forward to say, we need to spend 10 years improving all of this data. But there definitely would be a business case that says, we need to spend a month chipping away at this opportunity data or this account data so we can deliver this agentic functionality.
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
The post Cleaning Data for AI Starts With Context, Not Perfection appeared first on Salesforce Admins.
4.7
200200 ratings
Today on the Salesforce Admins Podcast, we talk to Chris Emmett, Salesforce Solution Architect at Capgemini. Join us as we chat about how to clean up your data to prepare your org for Agentforce, and why data without context is useless.
You should subscribe for the full episode, but here are a few takeaways from our conversation with Chris Emmett.
I caught up with Chris hot on the heels of his TDX London session, “Prep Like a Pro: Clean Data and Metadata for Agentforce.” He’s an experienced Salesforce consultant who has helped countless organizations of all sizes reboot their business processes.
As Chris explains, unless you have a company of five people that started last week, your org probably needs some data cleanup. And if you want to get started with Agentforce, you need to do the work to make sure the agents you build can understand your data and use it to generate actionable insights. After all, if you can’t derive useful information from your data, then it’s useless.
When I worked in sales, we used a CRM that was so complicated that only one guy at our company knew how to use it. Talk about a bottleneck!
The truth is, if your business has been around for a little while, you’ve probably inherited all sorts of legacy data. Maybe it’s some random field created by that one guy in the 90s who didn’t document anything, or a legacy system like SAP or MSX that is essential to your day-to-day operations.
Chris has seen it all, and it can often feel like cleaning up all that data is akin to boiling the ocean. It’s a monumental task with no end in sight, let alone getting the organizational buy-in to do it in the first place.
Chris recommends focusing your data cleanup strategy on the functionality you want to build in Agentforce. For example, if you want an agent to email a customer when their opportunity is five days from the close date and still unsigned, what data do you actually need?
You don’t need the 300 fields that might be on the opportunity page, or the 300 fields in that account. You might need the opportunity’s name, the stage of the opportunity, the close date, the account, and maybe the primary contact of that account. That’s five pieces of information.
Suddenly, you don’t need to boil the entire ocean—you just need to boil a cup of water. So start small, focus on the functionality your data cleanup project will deliver, and get the ball rolling. Trust that the things you build with Agentforce will speak for themselves, and you’ll be able to generate momentum to clean up your data project by project.
Make sure to listen to our full conversation with Chris to learn more about how to clean up your data and provide context for AI agents. And don’t forget to subscribe to the Salesforce Admins Podcast so you never miss an episode.
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
And then about 10 years later, around 2016, I was working for a company and we were really interested in changing how we managed products. We wanted a brand new system to manage our projects, and one of the products that we looked at was Salesforce. And my word, I was blown away. Literally within one day of using Salesforce, I was creating formula fields and workflow rules as they were back then, and I just fell in love. And this is coming from a person who had spent the best part of a decade dealing with systems where if you needed to add a field, it would take two, three months to get through, and I was dealing with a system where I could start a sentence, explaining to someone what Salesforce was, and by the time I’ve finished that sentence, I could have created a field. I could have expanded that data model, and I fell in love with it.
The company I was working for did not fall in love with it, and that pilot failed. It fell by the wayside, but I was hooked. I was like, man, I need to change my career. So I start looking for project management jobs within the Salesforce space. I had never project managed software before. I had project managed big old factory systems that were very waterfall in their approach. I had never done agile before, so I was applying and applying and applying, getting nowhere. I was probably applying for about eight months, and then this small company in Cambridge were like, dude, you are not great for a project manager but you seem really enthusiastic. Had you considered being a consultant? And I went, but yeah, because it’s not managing the projects that I love. It’s the system that I love. Salesforce is a platform that I love. I want to be able to empower other companies to improve themselves through Salesforce.
So I got a job there as a consultant, and that was 2017, and I have just been building up and flourishing, and since then, I’ve got 600 odd badges on Trailhead and I think 23 certs now. So I’ve just gone all in and built my career up.
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
So the danger isn’t, oh, let’s just create everything in Salesforce. The danger is 14 years ago, Derek created this field. We don’t know what it does. We don’t know where it’s hooked up to. It’s not documented anywhere, but we feel like we should pull it over. So the real danger is actually migrating everything. If you don’t know what that data point is, you don’t know what use it is, you can’t validate it, and you can’t use it in any meaningful way. Because if you don’t understand, then to bring it to the point of this pod, if you don’t understand what that data is, what it means, what it’s doing for your business, how can an AI agent understand that? An AI agent is not magical. It’s not telepathic. It reads the information as if it’s a human. It tries to interpret that information. So you’ve got to know what it means so your AI agent can be told what it means as well.
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Now, if that’s written in a note without any context, and I say Agentforce, what does this mean? It’s going to go, well, I don’t know. A date in the future. But if it’s against the close date field, it’s immediately got context, and it immediately can derive something from that. It can say, oh, right, okay, so this is the close date. I can see that we are in a negotiation stage. We’ve got one more stage after this, which is sign contract. It’s the 24th. It’s a few weeks away, or at least from when we’re recording. I understand a bit of context. I understand that there’s another stage ahead of this, and I’ve immediately got more information other than just a record with a random date. That context is everything, especially for an LLM.
Mike:
Chris Emmett:
Mike:
Chris Emmett:
So how do you actually break that down? You’re right. You cannot boil the ocean. We start off by thinking about the actions and the intents that you want your agents to do. So if you want your agents to write an email to a customer if their opportunity is within five days of the close date and they’ve not signed yet, well, what do you need for that? You don’t need 300 fields that might be on the opportunity page or 300 fields in that account. You might need the opportunity’s name. You need the stage of the opportunity. You need the close date. You need the account, and maybe the primary contact of that account. That’s five pieces of information.
And then you do not really need to think about all of the old opportunities because this is about an action where you are emailing people for opportunities that are about to expire or about to close. So immediately, you’ve gone from a million records, let’s say, you’ve got it down to a thousand records, and then you’re only looking at those five pieces of data. So you got it down to a thousand records and you got it down to five pieces of information on those thousands records. So I’m not going to do the math in my head because I’m terrible at that, but it’s-
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
And I was thinking back to last week. So in Iowa, in the US where I live, we had a really bad frost all winter. We didn’t get the snow cover that we usually do, and some of my landscaping plans didn’t make it because the roots were just burned by the frost, but not all of them. Some of them were hardy and they’re fine. And so I called my landscape company. I was like, I need to replace all these, and plus I want different ones anyway. He’s like, good, because if we go through another winter like this, we’re just going to be replacing them. And I promise you this gets somewhere.
But much like your analogy, so the landscape company came out, and just in the area that they needed to replant those bushes, they scraped all the gravel, leveled the bed, put new tarp down, replanted the bushes, put the gravel back. They didn’t have to clean the entire planter bed and scoop all the gravel out and dig up all the bushes and start from scratch. They only had to do the part that mattered. And I felt like that was like, wow, that’s like a real life scenario of if we’re going to implement this and we’re going to really laser focus on this one part of it, let’s do that.
But the other key thing that you said that nobody has said on a podcast about cleaning data is you improve the process while you do it. Because if you are not going to improve the process that led you to the bad data, you’re always going to be cleaning data. It’s almost like sending a janitor out to a sports stadium to pick up trash because there’s no trash cans. Well, if you’re not going to sit down and say, okay, how do we put trash cans out so that trash isn’t everywhere? All you’re doing is sending the janitor back out to clean up trash. You’re not actually fixing the problem that led to the trash being everywhere.
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
But then you’ve got companies who are just in a bind where they’ve got a 30-year-old system and they do not have the budget to replace it, but Salesforce has got to integrate with it, and the design decisions 30 years ago for that system that Salesforce has to adhere to. So sometimes, it is because the systems that you have to hook Salesforce into are just bound by old design, and then you have to introduce those bad data decisions into Salesforce.
Mike:
Chris Emmett:
I definitely worked for a company, I can’t say their name, I’m pretty sure, but I worked for a company where they were using an MS-DOS program because the person who wrote it, he wrote it in his garage and then retired in the ’90s. And if he’s still with us, he’s probably, I’d like to think, on a yacht somewhere because the software developers in the ’70s probably earned quite a lot of money. He’s enjoying life and not thinking about it. But that company was stuck with that MS-DOS program because it did a vital thing and it can’t be removed. It can’t be replaced because they know it does a vital process, but they don’t know how it works. And then it becomes a business risk. It’s like, do you risk the operation of the business to try and rebuild this or do you just leave it?
And then another thing I was thinking about this morning, I was thinking a lot of stuff at the gym.
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
And that’s where the whole point of trying to identify what’s the actual action that you want your AI agents to carry out, and what are the data points that that action needs to interact with? Okay. And then let’s tackle those data points and turn those data points, to go way back to the start of the conversation, you turn those data points into information, because you cannot boil the ocean. And the majority of companies, at least the ones I’ve dealt with, have a sea of data that just either makes no sense or comes from old systems or comes from unnecessary decisions. And you cannot … there’s no business case that will ever be put forward to say, we need to spend 10 years improving all of this data. But there definitely would be a business case that says, we need to spend a month chipping away at this opportunity data or this account data so we can deliver this agentic functionality.
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
Chris Emmett:
Mike:
The post Cleaning Data for AI Starts With Context, Not Perfection appeared first on Salesforce Admins.
4,362 Listeners
1,200 Listeners
3,195 Listeners
431 Listeners
3,998 Listeners
3,815 Listeners
1,029 Listeners
5,942 Listeners
2,969 Listeners
5,414 Listeners
227 Listeners
22 Listeners
248 Listeners
1 Listeners
1,127 Listeners