Archive

Defending Against Data Poisoning Attacks on LLMs: A Comprehensive Guide

Vanessa Sauter · 1/7/2025

Data poisoning attacks can corrupt LLMs during training, fine-tuning, and RAG retrieval.

Jailbreaking LLMs: A Comprehensive Guide (With Examples)

Ian Webster · 1/7/2025

From simple prompt tricks to sophisticated context manipulation, discover how LLM jailbreaks actually work.

Beyond DoS: How Unbounded Consumption is Reshaping LLM Security

Vanessa Sauter · 12/31/2024

OWASP replaced DoS attacks with "unbounded consumption" in their 2025 Top 10.

Red Team Your LLM with BeaverTails

Ian Webster · 12/22/2024

Test your LLM against 700+ harmful prompts across 14 risk categories.

How to run CyberSecEval

Ian Webster · 12/21/2024

Even top models fail 25-50% of prompt injection attacks.

Leveraging Promptfoo for EU AI Act Compliance

Vanessa Sauter · 12/10/2024

The EU AI Act bans specific AI behaviors starting February 2025.

How to Red Team an Ollama Model: Complete Local LLM Security Testing Guide

Ian Webster · 11/23/2024

Running LLMs locally with Ollama? These models run without the safety filters that cloud providers apply.

How to Red Team a HuggingFace Model: Complete Security Testing Guide

Ian Webster · 11/20/2024

Open source models on HuggingFace often lack safety training.

Introducing GOAT—Promptfoo's Latest Strategy

Vanessa Sauter · 11/5/2024

Meet GOAT: our advanced multi-turn jailbreaking strategy that uses AI attackers to break AI defenders.

RAG Data Poisoning: Key Concepts Explained

Ian Webster · 11/4/2024

Attackers can poison RAG knowledge bases to manipulate AI responses.