Built-in context management
Compaction
When a session nears its token limit, the assistant summarizes critical details—such as architectural decisions and unresolved bugs—while discarding redundant tool outputs.
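The compaction step described above can be sketched roughly as follows. This is a minimal illustration, not any assistant's actual implementation: the `Message` type, the `kind` tags, the 80% threshold, and the four-characters-per-token estimate are all assumptions made for the example. A real assistant would typically ask the model itself to write the summary; here the critical messages are simply concatenated as a stand-in.

```python
from dataclasses import dataclass


@dataclass
class Message:
    role: str            # "user", "assistant", or "tool"
    content: str
    kind: str = "chat"   # hypothetical tag: "decision", "bug", "tool_output", "chat"


def estimate_tokens(text: str) -> int:
    # Rough assumption: about 4 characters per token.
    return max(1, len(text) // 4)


def total_tokens(messages) -> int:
    return sum(estimate_tokens(m.content) for m in messages)


def compact(messages, token_limit, threshold=0.8, keep_recent=1):
    """Return the history unchanged while it is under the threshold.

    Otherwise replace the older messages with one summary that keeps only
    decisions and unresolved bugs, and keep the most recent messages verbatim.
    """
    if total_tokens(messages) < threshold * token_limit:
        return messages

    split = max(0, len(messages) - keep_recent)
    older, recent = messages[:split], messages[split:]

    # Tool outputs and ordinary chat in the older span are dropped here.
    critical = [m.content for m in older if m.kind in ("decision", "bug")]
    summary_text = "Summary of earlier session:\n" + "\n".join(
        f"- {item}" for item in critical
    )
    return [Message("assistant", summary_text, kind="summary")] + recent
```

Calling `compact(history, token_limit=1000)` on a history that is close to 1,000 estimated tokens returns a short list that starts with the summary message; the same call with a much larger limit returns the history untouched.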
Built-in
Progressive Disclosure
Instead of loading an entire codebase, which would immediately overwhelm the attention budget, modern agents use just-in-time (JIT) context: the assistant dynamically loads only the data it needs at runtime.
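One way to picture just-in-time loading is an index of lightweight references (here, file paths) that stays in context, with file contents read only when a task asks for them. The sketch below is an assumption-laden illustration: the `JITContext` class and its `overview` and `load` methods are hypothetical names invented for this example, not part of any agent framework.

```python
from pathlib import Path


class JITContext:
    """Keeps a cheap index of file paths; reads file bodies only on request."""

    def __init__(self, root):
        self.root = Path(root)
        # Relative paths only: small enough to keep in context at all times.
        self.index = sorted(
            p.relative_to(self.root).as_posix()
            for p in self.root.rglob("*")
            if p.is_file()
        )
        self.loaded = {}  # path -> content, filled lazily

    def overview(self) -> str:
        # What the assistant sees up front: names, not contents.
        return "\n".join(self.index)

    def load(self, rel_path: str) -> str:
        # Read a file the first time it is needed, then reuse the cached copy.
        if rel_path not in self.index:
            raise KeyError(rel_path)
        if rel_path not in self.loaded:
            self.loaded[rel_path] = (self.root / rel_path).read_text(encoding="utf-8")
        return self.loaded[rel_path]
```

Nothing is read from disk beyond the directory listing until `load` is called, so the cost of the context grows with the files a task actually touches rather than with the size of the codebase.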