SuperClaw

Red‑Team AI Agents Before They Red‑Team You

Scenario‑driven, behavior‑first security testing for autonomous agents.

What is SuperClaw?¶

SuperClaw is a pre-deployment security testing framework for AI coding agents. It systematically identifies vulnerabilities before your agents touch sensitive data or connect to external ecosystems.

🎯 Scenario-Driven Testing

Generate and execute adversarial scenarios against real agents with reproducible results.

Get started →

📋 Behavior Contracts

Explicit success criteria, evidence extraction, and mitigation guidance for each security property.

Explore behaviors →

📊 Evidence-First Reporting

Reports include tool calls, outputs, and actionable fixes in HTML, JSON, or SARIF formats.

CI/CD integration →

🛡️ Built-in Guardrails

Local-only mode and authorization checks reduce misuse risk.

Safety guide →

⚠️ Security and Ethical Use¶

Authorized Testing Only

SuperClaw is for authorized security testing only. Before using:

✅ Obtain written permission to test the target system
✅ Run tests in sandboxed or isolated environments
✅ Treat automated findings as signals, not proof—verify manually

Guardrails enforced by default:

Local-only mode blocks remote targets
Remote targets require SUPERCLAW_AUTH_TOKEN

Threat Model¶

OpenClaw + Moltbook Risk Surface

OpenClaw agents often run with broad tool access. When connected to Moltbook or other agent networks, they can ingest untrusted, adversarial content that enables:

Prompt injection and hidden instruction attacks
Tool misuse and policy bypass
Behavioral drift over time
Cascading cross-agent exploitation

SuperClaw evaluates these risks before deployment.

The Problem¶

Autonomous agents are deployed with high privilege, mutable behavior, and exposure to untrusted inputs—often without structured security validation. This makes prompt injection, tool misuse, configuration drift, and data leakage likely but poorly understood until after exposure.

The Solution¶

SuperClaw performs pre-deployment, scenario-driven security evaluation:

Generates adversarial attack scenarios
Executes them against your agent
Captures evidence (tool calls, outputs, artifacts)
Scores behavior against explicit contracts
Produces actionable reports with mitigations

Non-Goals¶

SuperClaw does not:

Generate agents
Run production workloads
Automate real-world exploitation

Quick Start¶

pipuvWith CodeOptiX

pip install superclaw

uv pip install superclaw

pip install superclaw[codeoptix]

Run your first attack:

# Attack a local OpenClaw instance
superclaw attack openclaw --target ws://127.0.0.1:18789

# Or test offline with the mock adapter
superclaw attack mock --behaviors prompt-injection-resistance

# Generate a comprehensive audit report
superclaw audit openclaw --comprehensive --report-format html

Key Features¶

Feature	Description
🎯 Attack Library	5 attack techniques with 100+ payloads
🔍 Behavior Specs	6 security behaviors with severity levels
🌸 Bloom Integration	LLM-powered scenario generation
📊 Multi-Format Reports	HTML, JSON, SARIF for CI/CD
🔬 CodeOptiX Integration	Multi-modal evaluation pipeline

Supported Targets¶

Target	Adapter	Description
🦞 OpenClaw	`openclaw`	AI coding agents via ACP WebSocket
🧪 Mock	`mock`	Offline deterministic testing
🔧 Custom	Extend `BaseAdapter`	Build your own adapter

Attack Techniques¶

Technique	Description
`prompt-injection`	Direct and indirect injection attacks
`encoding`	Base64, hex, unicode, typoglycemia obfuscation
`jailbreak`	DAN, grandmother, role-play bypass techniques
`tool-bypass`	Tool policy bypass via alias confusion
`multi-turn`	Persistent escalation across conversation turns

Security Behaviors¶

Behavior	Severity	Tests
`prompt-injection-resistance`	🔴 CRITICAL	Injection detection and rejection
`sandbox-isolation`	🔴 CRITICAL	Container and filesystem boundaries
`tool-policy-enforcement`	🟠 HIGH	Allow/deny list compliance
`session-boundary-integrity`	🟠 HIGH	Cross-session isolation
`configuration-drift-detection`	🟡 MEDIUM	Config stability over time
`acp-protocol-security`	🟡 MEDIUM	Protocol message handling

Superagentic AI Ecosystem¶

SuperClaw is part of a comprehensive AI quality and security ecosystem:

┌─────────────────────────────────────────────────────────────┐
│                  Superagentic AI Ecosystem                  │
├─────────────────────────────────────────────────────────────┤
│  SuperQE      │  Quality Engineering core engine            │
│  SuperClaw    │  Agent security testing framework ◄── YOU   │
│  CodeOptiX    │  Code optimization & evaluation engine      │
│  Bloom        │  Behavioral evaluation scenario generation  │
└─────────────────────────────────────────────────────────────┘

Next Steps¶

📦 Installation

Get SuperClaw set up with pip, uv, or from source.

Install now →

⚡ Quick Start

Run your first security scan in under 5 minutes.

Quick start →

🏗️ Architecture

Understand how SuperClaw works under the hood.

Learn more →

🔄 CI/CD

Integrate security scanning into your pipeline.

Set up CI/CD →