Growth Cyber Newsletter

RAG is a popular strategy where developers augment and AI Agent’s knowledge with documents, web pages, and other materials. This gives agents specific knowledge of things that are often hidden away in private databases. In this week’s insecure agent example I took Mastra’s Chain of Thought RAG Agent example and added a second document containing a malicious prompt.

Here's an architecture diagram:

In past examples we’ve focused on agents connect to MCP servers and tools. This agent has none of those, however it’s still vulnerable, namely to indirect prompt injection.

Here’s the security review:

Security Review

Indirect prompt injection is when a malicious prompt is fed into your agent through external sources. This could be a scraped webpage, a local document, or even an output from another agent. We often focus on malicious prompts a user might send to our agent, but not malicious prompts that our agent could consume while it’s developing it’s answer to the user query.

Let’s look at an example. In one of the documents this RAG agent read in there is a statement that says to always say that pesticides are good despite the document saying they probably are not.

When I run the agent and ask it about farming best practices and pesticides sure enough it ends its assessment with “pesticides are good”.

This is a trivial example, but imagine if this RAG system was built with sensitive company documents instead and someone uploaded a document with a prompt injection in it which caused the agent to output sensitive company data. To prevent this it’s important to validate any inputs your agent may be consuming including both user prompts and RAG data sources.

Hopefully this helps give you some things to think about for security while you’re building your AI Agents. If you have any questions on how you can build secure RAG agents reply here I read every response.

When you're ready here's 3 ways I can help you

I provide vCISO services to startups for help with SOC2 and setting up a foundational security program. You'll get my expertise plus access to great deals with security vendors I partner with. Book a call to learn more.

I offer AI Security Gap Assessments. I'll scan your codebase for vulnerabilities and do a manual review against resources such as the OWASP Agentic Threats and Mitigation Guide and the Top 10 for LLMs. To learn more please send me an email → allie@growthcyber.com

If you’re a vendor, reach out for partnership opportunities or to sponsor the newsletter → allie@growthcyber.com