
Why We Built Red Teaming Into the Gateway
The standard workflow for deploying an AI feature is:
- Write a prompt.
- Test it with "Hello".
- Test it with "What is the weather?"
- Ship it to production.
This is insane. You are shipping a probabilistic engine that can execute code and read data, and your test suite is two strings.
The "Oh Sh*t" Moment
We kept seeing customers deploy "secure" bots, only to email us 24 hours later asking why the bot agreed to write a hate speech manifesto. The answer was always: "You didn't test for that."
Automated Adversarial Testing
We realized that developers don't have the time to sit around trying to jailbreak their own bots. So we built a robot to do it.
PromptGuard Red Team is a suite of 20+ specialized "Attacker Agents."
- The Lawyer: Tries to confuse the bot with legalese.
- The Hacker: Tries base64 encoding and shell injection patterns.
- The Grandma: Tries emotional manipulation.
- The Chaos Monkey: Sends random garbage and buffer overflow strings.
How It Works
Before you deploy, you run:
promptguard test --target "sys_prompt_v2"
We spin up these agents and let them hammer your prompt for 5 minutes. We give you a report:
- Pass: 18/20 agents failed to break it.
- Fail: "The Hacker" successfully extracted your SQL schema.
Shift Left
Security shouldn't happen in production logs. It should happen in CI/CD. If your prompt can be jailbroken, your build should fail.
Conclusion
Don't wait for your users to red team your app. They will do it, and they will post the screenshots on Reddit. Beat them to it.
READ MORE

Why We Open Sourced Our AI Firewall
Security through obscurity is dead. We built PromptGuard because we were tired of black-box security tools that we couldn't audit. Here is why we made it open source.

Why We Self-Host Our Security Stack (And You Should Too)
Sending your data to a security vendor is an oxymoron. If you care about privacy, your security tools should run in your VPC.

LangChain Is Unsafe by Default: How to Secure Your Chains
LangChain makes it easy to build agents. It also makes it easy to build remote code execution vulnerabilities. Here is the right way to secure your chains.