Skip to content

SuperClaw

Red‑Team AI Agents Before They Red‑Team You

Scenario‑driven, behavior‑first security testing for autonomous agents.


What is SuperClaw?

SuperClaw is a pre-deployment security testing framework for AI coding agents. It systematically identifies vulnerabilities before your agents touch sensitive data or connect to external ecosystems.

🎯 Scenario-Driven Testing

Generate and execute adversarial scenarios against real agents with reproducible results.

Get started β†’

πŸ“‹ Behavior Contracts

Explicit success criteria, evidence extraction, and mitigation guidance for each security property.

Explore behaviors β†’

πŸ“Š Evidence-First Reporting

Reports include tool calls, outputs, and actionable fixes in HTML, JSON, or SARIF formats.

CI/CD integration β†’

πŸ›‘οΈ Built-in Guardrails

Local-only mode and authorization checks reduce misuse risk.

Safety guide β†’

⚠️ Security and Ethical Use

Authorized Testing Only

SuperClaw is for authorized security testing only. Before using:

  • βœ… Obtain written permission to test the target system
  • βœ… Run tests in sandboxed or isolated environments
  • βœ… Treat automated findings as signals, not proofβ€”verify manually

Guardrails enforced by default:

  • Local-only mode blocks remote targets
  • Remote targets require SUPERCLAW_AUTH_TOKEN

Threat Model

OpenClaw + Moltbook Risk Surface

OpenClaw agents often run with broad tool access. When connected to Moltbook or other agent networks, they can ingest untrusted, adversarial content that enables:

  • Prompt injection and hidden instruction attacks
  • Tool misuse and policy bypass
  • Behavioral drift over time
  • Cascading cross-agent exploitation

SuperClaw evaluates these risks before deployment.

The Problem

Autonomous agents are deployed with high privilege, mutable behavior, and exposure to untrusted inputsβ€”often without structured security validation. This makes prompt injection, tool misuse, configuration drift, and data leakage likely but poorly understood until after exposure.

The Solution

SuperClaw performs pre-deployment, scenario-driven security evaluation:

  1. Generates adversarial attack scenarios
  2. Executes them against your agent
  3. Captures evidence (tool calls, outputs, artifacts)
  4. Scores behavior against explicit contracts
  5. Produces actionable reports with mitigations

Non-Goals

SuperClaw does not:

  • Generate agents
  • Run production workloads
  • Automate real-world exploitation

Quick Start

pip install superclaw
uv pip install superclaw
pip install superclaw[codeoptix]

Run your first attack:

# Attack a local OpenClaw instance
superclaw attack openclaw --target ws://127.0.0.1:18789

# Or test offline with the mock adapter
superclaw attack mock --behaviors prompt-injection-resistance

# Generate a comprehensive audit report
superclaw audit openclaw --comprehensive --report-format html

Key Features

Feature Description
🎯 Attack Library 5 attack techniques with 100+ payloads
πŸ” Behavior Specs 6 security behaviors with severity levels
🌸 Bloom Integration LLM-powered scenario generation
πŸ“Š Multi-Format Reports HTML, JSON, SARIF for CI/CD
πŸ”¬ CodeOptiX Integration Multi-modal evaluation pipeline

Supported Targets

Target Adapter Description
🦞 OpenClaw openclaw AI coding agents via ACP WebSocket
πŸ§ͺ Mock mock Offline deterministic testing
πŸ”§ Custom Extend BaseAdapter Build your own adapter

Attack Techniques

Technique Description
prompt-injection Direct and indirect injection attacks
encoding Base64, hex, unicode, typoglycemia obfuscation
jailbreak DAN, grandmother, role-play bypass techniques
tool-bypass Tool policy bypass via alias confusion
multi-turn Persistent escalation across conversation turns

Security Behaviors

Behavior Severity Tests
prompt-injection-resistance πŸ”΄ CRITICAL Injection detection and rejection
sandbox-isolation πŸ”΄ CRITICAL Container and filesystem boundaries
tool-policy-enforcement 🟠 HIGH Allow/deny list compliance
session-boundary-integrity 🟠 HIGH Cross-session isolation
configuration-drift-detection 🟑 MEDIUM Config stability over time
acp-protocol-security 🟑 MEDIUM Protocol message handling

Superagentic AI Ecosystem

SuperClaw is part of a comprehensive AI quality and security ecosystem:

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                  Superagentic AI Ecosystem                  β”‚
β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
β”‚  SuperQE      β”‚  Quality Engineering core engine            β”‚
β”‚  SuperClaw    β”‚  Agent security testing framework ◄── YOU   β”‚
β”‚  CodeOptiX    β”‚  Code optimization & evaluation engine      β”‚
β”‚  Bloom        β”‚  Behavioral evaluation scenario generation  β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Next Steps

πŸ“¦ Installation

Get SuperClaw set up with pip, uv, or from source.

Install now β†’

⚑ Quick Start

Run your first security scan in under 5 minutes.

Quick start β†’

πŸ—οΈ Architecture

Understand how SuperClaw works under the hood.

Learn more β†’

πŸ”„ CI/CD

Integrate security scanning into your pipeline.

Set up CI/CD β†’