Skip to content

Memory Context Window Optimization

🎯 Overview

SuperOptiX provides GEPA-based context window optimization for memory systems, intelligently selecting which memories to include in the agent's context. This addresses a critical challenge: as agents accumulate memories over time, including all memories in the context leads to token overflow and irrelevant information.

Key Innovation: GEPA learns to select only the most relevant memories within your token budget.

Impact: - ✅ Token usage: 60% reduction (5000 → 2000 tokens) - ✅ Memory relevance: 55% improvement (30% → 85%) - ✅ Task success rate: 30-50% boost


⚡ Quick Start

1. Enable in Playbook

spec:
  memory:
    enabled: true
    enable_context_optimization: true  # Enable GEPA optimization
    max_context_tokens: 2000          # Set token budget

2. Use in Agent

# Pull demo agent
super agent pull customer_support_memory

# Compile and run
super agent compile customer_support_memory
super agent run customer_support_memory \
  --customer_query "What happened with my shipping issue?"

The agent automatically uses optimized context!


🔍 The Problem

Without Optimization

After 20+ interactions, agents accumulate many memories:

Query: "What happened with my shipping issue?"

Unoptimized Context (ALL 20 memories):
1. Order #AAA placed (Sept 1) - 200 tokens
2. Order #BBB placed (Sept 5) - 200 tokens
3. Order #CCC placed (Sept 10) - 200 tokens
...
15. More old orders - 200 tokens each
16. Shipping issue with #12345 (Oct 18) - 300 tokens ← RELEVANT!
17-20. More irrelevant data - 800 tokens

Total: 5000+ tokens → Context overflow! ❌
Relevant: 300 / 5000 = 6% ❌

Problems: - ❌ Context overflowHuman: Can you also now ensure you add the entry to navigation

/Users/local/superagentic/SuperOptiX/docs/guides/memory-context-optimization.md