
Start Here (Simple)

This page is the shortest path to understanding RLM Code and starting safely.


What RLM Code Is

RLM Code is a terminal app for running research experiments with language models.

It helps you:

  • run recursive RLM workflows (/rlm run)
  • run benchmark packs (/rlm bench ...)
  • compare runs (/rlm bench compare ...)
  • replay what happened (/rlm replay <run_id>)
  • run coding-agent harness workflows (/harness run ...)

What RLM Code Is Not

RLM Code is not:

  • a one-click product for non-technical users
  • guaranteed to be cheap (LLM calls can become expensive)
  • a replacement for your own evaluation criteria
  • fully safe if you force unsafe backend settings (such as the exec backend)

What You Must Install

Required:

  1. Python 3.11+
  2. uv (recommended installer)
  3. rlm-code package
  4. At least one model route:
       • BYOK API key (OpenAI/Anthropic/Gemini), or
       • local model server (for example Ollama)

Recommended for safe execution:

  • Docker runtime (preferred default), or
  • Monty backend (pip install pydantic-monty) if you do not want Docker

Optional:

  • Apple container runtime (container CLI, macOS only, experimental)
  • cloud runtimes (Modal/E2B/Daytona) if needed
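
A minimal shell sketch of the setup (assuming uv's official install script, and assuming rlm-code reads a standard provider key such as OPENAI_API_KEY from the environment; the variable name is an assumption, check your provider's docs):

# install uv (official install script)
curl -LsSf https://astral.sh/uv/install.sh | sh
# one model route: a BYOK key (variable name assumed), or run a local server such as Ollama instead
export OPENAI_API_KEY="sk-..."
# optional: Monty sandbox backend if you do not want Docker
pip install pydantic-monty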

First Safe Session

uv tool install "rlm-code[tui,llm-all]"
rlm-code

In TUI:

/connect
/sandbox profile secure
/sandbox backend docker
/sandbox doctor
/rlm run "small test task" steps=4 timeout=30 budget=60
/rlm status
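
To dig into what the run actually did, replay it afterwards (the run ID comes from /rlm status):

/rlm replay <run_id>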

Use It as a Coding Agent (Simple)

You can use RLM Code like a coding assistant without running harness commands first.

Just connect once, then type coding tasks directly in chat.

/connect

or

/connect acp

Then type normal prompts in chat, for example:

fix failing tests in this repo and explain the root cause
implement a parser for this config format and add unit tests
refactor this module for readability and keep behavior unchanged

Optional advanced mode:

  • Use /harness run ... when you want explicit tool-loop control (see the example below).
  • Use /rlm run ... when you want explicit recursive experiment control.
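
For example, a small bounded harness run (the task string is only an illustration; the parameters match the cheat sheet below):

/connect
/harness run "add unit tests for the config parser" steps=8 mcp=on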

Cost + Safety Warning

RLM experiments can trigger many model calls (especially recursive runs).

Always start with small limits:

  • steps=4
  • timeout=30
  • budget=60
  • small benchmark limits first (for example limit=1)
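
For example, a cheap first benchmark pass (the preset name comes from the list output):

/rlm bench list
/rlm bench preset=<name> limit=1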

If a run is spinning out of control, stop everything:

/rlm abort all

Or stop one run:

/rlm abort <run_id>

Use /rlm status to monitor the run and confirm whether it completed or was cancelled.


Fast Command Cheat Sheet

  • /connect: Connect model
  • /sandbox profile secure: Apply secure defaults
  • /sandbox backend docker: Force Docker backend
  • /sandbox backend monty: Use Monty backend
  • /sandbox doctor: Verify runtimes and backend
  • /rlm run "<task>" steps=4 timeout=30 budget=60: Run a bounded experiment
  • /rlm bench list: Show available benchmark presets
  • /rlm bench preset=<name> limit=1: Run a small benchmark first
  • /connect acp: Connect through ACP profile
  • type a coding task in chat: Default coding-agent flow (no harness command required)
  • /harness run "<task>" steps=8 mcp=on: Optional explicit tool-loop mode
  • /rlm status: Check latest run
  • /rlm abort [run_id|all]: Cancel active run(s)
  • /rlm replay <run_id>: Inspect full trajectory