Hydra Multi-turn Strategy

The Hydra strategy (jailbreak:hydra) runs a multi-turn attacker agent that adapts to every response from your target system. It maintains structured memory, evaluates past attempts, and branches into new attack approaches when the direct path fails.

Unlike single-turn jailbreaks that retry variations of one payload, Hydra continuously reasons about prior turns, retries with fresh context, and shares learnings across every test in the same scan.

Hydra has two target-delivery modes:

Replay mode (stateful: false, default): Hydra sends the full conversation transcript to the target on every turn. Use this when the target is stateless or expects the entire message history in each request.
Target-managed session mode (stateful: true): Hydra sends only the newest turn. Your provider must preserve earlier turns for the same session, for example with cookies, a server session, or an OpenAI Agents session factory.

Implementation

Add the strategy to your promptfooconfig.yaml to enable multi-turn adaptive testing:

promptfooconfig.yaml
redteam:
  strategies:
    # Basic usage
    - jailbreak:hydra

    # With configuration
    - id: jailbreak:hydra
      config:
        # Optional: maximum turns before hydra stops (default: 10)
        maxTurns: 12
        # Optional: how many times to backtrack after refusals in stateless mode (default: 10)
        maxBacktracks: 5
        # Optional: set true if your target expects session state on each request
        stateful: false

Cloud Required

Hydra relies on Promptfoo Cloud to coordinate the attacker agent, maintain scan-wide learnings, and manage branching logic. Set PROMPTFOO_REMOTE_GENERATION_URL or sign in to Promptfoo Cloud before running this strategy.

Configuration Options

Option	Default	Description
`maxTurns`	`10`	Maximum conversation turns Hydra will take with the target before stopping. Increase for deeper explorations.
`maxBacktracks`	`10`	Number of times Hydra can roll back the last turn when it detects a refusal. Set to `0` automatically when `stateful: true`.
`stateful`	`false`	When `true`, use target-managed session mode: Hydra sends only the newest turn, and the target provider must preserve earlier turns for that session. Keep `false` to replay the full transcript each time.

tip

Hydra manages attacker-side history and backtracking. Your target provider manages target-side persistence in stateful: true mode. In Promptfoo Cloud, Hydra can derive the mode from the target configuration. In the open-source CLI/UI, set stateful: true only after you configure sessions in the provider. See Multi-Turn Session Management.

How It Works

Goal selection – Hydra pulls the red team goal from the plugin metadata or injected variable.
Agent decisioning – A coordinating agent in Promptfoo Cloud evaluates prior turns and chooses the next attack message.
Target probing – The selected message is sent either as a replayed transcript or as the newest turn in a target-managed session.
Outcome grading – Responses are graded with the configured plugin assertions and stored for later learning.
Adaptive branching – On refusals, Hydra backtracks and explores alternate branches until it succeeds, exhausts maxBacktracks, or reaches maxTurns.

Hydra keeps a per-scan memory so later test cases can reuse successful tactics discovered earlier in the run.

Hydra vs Other Agentic Strategies

Strategy	Turn Model	Best For	Cost Profile
`jailbreak`	Single-turn	Fast baselines, low cost	Low
`jailbreak:meta`	Iterative taxonomy	Broad single-shot coverage	Medium
`jailbreak:hydra`	Multi-turn branching	Stateful agents, evasive defenses	High

When to Use Hydra

Your product exposes a conversational bot or agent workflow with stateful behavior.
Guardrails block straightforward jailbreaks and you need an adversary that can pivot.
You want to reuse learnings across an entire scan (e.g., large org-wide red teams).

Hydra is most effective when paired with plugin suites like harmful, pii, or rbac that define concrete failure conditions via graders.

Iterative Jailbreaks – Baseline single-turn optimization strategy
Meta-Agent Jailbreaks – Strategic taxonomy builder for complex single-turn attacks
Multi-turn Jailbreaks – Overview of conversational attacker agents
Multi-Turn Session Management – Configure real target-side conversation state
Tree-based Jailbreaks – Branching exploration without multi-turn conversations

Implementation​

Configuration Options​

How It Works​

Hydra vs Other Agentic Strategies​

When to Use Hydra​

Related Concepts​

Implementation

Configuration Options

How It Works

Hydra vs Other Agentic Strategies

When to Use Hydra

Related Concepts