Skip to main content

2 posts tagged with "jailbreak"

View All Tags

How to replicate the Claude Code attack with Promptfoo

Ian Webster
Engineer & OWASP Gen AI Red Teaming Contributor

A recent cyber espionage campaign revealed how state actors weaponized Anthropic's Claude Code - not through traditional hacking, but by convincing the AI itself to carry out malicious operations.

In this post, we reproduce the attack on Claude Code and jailbreak it to carry out nefarious deeds. We'll also show how to configure the same attack on any other agent.

Prompt Injection vs Jailbreaking: What's the Difference?

Michael D'Angelo
CTO & Co-founder

Security teams often confuse two different AI attacks, leaving gaps attackers exploit. Prompt injection and jailbreaking target different parts of your system, but most organizations defend against them the same way—a mistake behind several 2025 breaches.

Recent vulnerabilities in development tools like Cursor IDE and GitHub Copilot show how misclassified attack vectors lead to inadequate defenses.