Cmd K

Support Start Building

Browse docs

Get Started

Welcome Quickstart Layout modes Prompting guidelines Context basics FAQ

Core Concepts

Auto-compaction Subagents Memories Models and capabilities Token costs Session hygiene

Desktop App

Settings and preferences Voice input Popout chat Platform support Projects and sessions Agent modes Keyboard shortcuts CLI tool Claude Code sessions Codex sessions

Session Environments

Git worktrees SSH sessions

Integrations

Overview GitHub Vercel Railway Linear

Usage & Billing

Tokens and context Pricing Plans and rate limits Upgrade plan Downgrade plan Refunds Cancel plan

Tutorials

Snake game starter

Core Concepts

Token costs

Token cost is driven by input and output token volume, including cached input rates when available.

How request cost is computed

At a high level: input cost + output cost = total request cost. Some models also apply lower pricing to cached input tokens.

text

uncached_input_cost = uncached_input_tokens * input_rate
cached_input_cost = cached_input_tokens * cached_input_rate
output_cost = output_tokens * output_rate
total_cost = uncached_input_cost + cached_input_cost + output_cost

Why high context usage increases spend

Later turns often include larger input context windows.
Larger input context means more input tokens billed each turn.
If output also grows, both sides of the cost formula rise.

Spend control

Use compact sessions, targeted attachments, subagents for exploration, and fresh sessions after milestone completion.