CodexOpt

Benchmark and optimize AGENTS.md and SKILL.md for Codex with a repeatable developer workflow.

Get Started View Demo Repo

Targeted Focused on repo-local Codex assets: AGENTS.md and SKILL.md.

Measurable Score instruction quality, attach evidence, and review artifact-backed changes.

Practical Scan, benchmark, optimize, review, apply, and report from a single CLI.

CodexOpt helps teams benchmark and optimize Codex instruction assets with a repeatable workflow.

It focuses on two repo-local files:

AGENTS.md
.codex/skills/**/SKILL.md

CodexOpt gives developers a practical loop:

scan instruction assets
benchmark their quality
generate improved candidates
review diffs and reports
apply only validated improvements

Why CodexOpt?¶

Most teams maintain AGENTS.md and SKILL.md manually. Over time these files drift:

duplicated rules
contradictory instructions
missing verification guidance
weak skill triggers
prompt bloat

CodexOpt makes those problems measurable and easier to improve safely.

What It Does¶

scans a repo for agent and skill instruction files
benchmarks them with static checks plus optional task / issue evidence
optimizes them with either heuristic transforms or optional GEPA-backed search
records artifacts under .codexopt/
generates markdown reports for review and PR discussion

Why Developers Use It¶

Instruction files tend to drift long before teams notice:

duplicated rules
contradictory constraints
weak testing guidance
vague skill triggers
prompt bloat

CodexOpt gives developers a way to improve those files with something closer to a normal engineering loop than ad hoc prompt editing.

Demo Repository¶

If you want a small example repo with intentionally messy instructions, use the companion demo:

Demo repo: https://github.com/SuperagenticAI/codexopt-demo
Demo guide: Open the demo walkthrough

Try It¶

uv sync --extra dev
uv run codexopt init
uv run codexopt benchmark
uv run codexopt optimize agents --file AGENTS.md
uv run codexopt optimize skills --glob ".codex/skills/**/SKILL.md"
uv run codexopt report --output codexopt-report.md

If you want a guided example with sample inputs, evidence files, and ready-made commands, start with the demo walkthrough.