Cmd K

Support Start Building

Browse docs

Get Started

Welcome Quickstart Layout modes Prompting guidelines Context basics FAQ

Core Concepts

Auto-compaction Subagents Memories Models and capabilities Token costs Session hygiene

Desktop App

Settings and preferences Voice input Popout chat Platform support Projects and sessions Agent modes Keyboard shortcuts CLI tool Claude Code sessions Codex sessions

Session Environments

Git worktrees SSH sessions

Integrations

Overview GitHub Vercel Railway Linear

Usage & Billing

Tokens and context Pricing Plans and rate limits Upgrade plan Downgrade plan Refunds Cancel plan

Tutorials

Snake game starter

Desktop App

Voice input

Voice input records microphone audio, transcribes it, and inserts or auto-sends text based on your settings.

Requirements

Set `openAiApiKey` in app settings before voice input is enabled.
Microphone permission must be granted by your OS/browser runtime.
Transcription runs through OpenAI transcription model `gpt-4o-transcribe` in current implementation.

How voice flow works

Start recording from the mic button in the input action row.
When stopping, the flow captures trailing audio, then submits the blob for transcription.
On success, transcript is either appended to input or auto-sent, depending on settings.
On failure, a transcription error notification is shown instead of silently dropping input.

Auto-send behavior

If `autoSendVoiceMessages` is enabled and your text input is empty, the transcript is sent immediately. Otherwise it is appended to current draft text.

Best practices

Use auto-send for short command-style prompts.
Keep auto-send off when you prefer to edit transcriptions before execution.
Use a headset in noisy environments to reduce transcript cleanup.