Share AI Explainer: Prompt Injection and Data Poisoning

Copy link

November 14, 2025

AI Explainer: Prompt Injection and Data Poisoning

8 minutes

The source, a transcript from a YouTube video in the "Box AI Explainer Series," presents a discussion between an AI product marketing manager and a CTO about significant security threats in the world of AI agents: prompt injection and data poisoning. Prompt injection involves manipulating an agent through direct instructions within a prompt to make it perform unintended or malicious actions, while data poisoning occurs when hostile instructions are subtly included in the data the agent is asked to review, causing it to be tricked. The experts explain that agents are inherently "naive" and helpful, making them susceptible to these attacks, which can be likened to social engineering against humans. To mitigate these risks, enterprises must employ data cleansing to remove malicious instructions and implement guardrails to restrict agents' actions, treating them like junior employees with limited access and control.

...more

View all episodes

By Steven

November 14, 2025

AI Explainer: Prompt Injection and Data Poisoning

8 minutes

...more

Sign up to save your podcasts