Desktop App
Voice input
Voice input records microphone audio, transcribes it, and inserts or auto-sends text based on your settings.
Requirements
- Set `openAiApiKey` in app settings. Voice input is not enabled until a key is configured.
- Microphone permission must be granted by your OS/browser runtime.
- In the current implementation, transcription runs through the OpenAI `gpt-4o-transcribe` model.
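
The requirements above can be illustrated with a minimal sketch of a transcription request. It assumes the app calls OpenAI's public `/v1/audio/transcriptions` endpoint directly with a multipart form. The names `buildTranscriptionRequest` and `transcribe`, and the file name `voice-input.webm`, are hypothetical and are not taken from the app's code.

```typescript
// Hypothetical sketch: the app's real request path and options may differ.
const TRANSCRIPTION_URL = "https://api.openai.com/v1/audio/transcriptions";
const TRANSCRIPTION_MODEL = "gpt-4o-transcribe";

interface TranscriptionRequest {
  url: string;
  headers: Record<string, string>;
  body: FormData;
}

// Builds the request pieces without sending anything.
function buildTranscriptionRequest(
  openAiApiKey: string,
  audio: Blob,
): TranscriptionRequest {
  if (openAiApiKey.trim() === "") {
    // Mirrors the requirement that voice input needs a configured key.
    throw new Error("openAiApiKey is not set");
  }
  const body = new FormData();
  body.append("file", audio, "voice-input.webm"); // file name is an assumption
  body.append("model", TRANSCRIPTION_MODEL);
  return {
    url: TRANSCRIPTION_URL,
    headers: { Authorization: `Bearer ${openAiApiKey}` },
    body,
  };
}

// Sends the request and returns the transcript text from the JSON response.
async function transcribe(openAiApiKey: string, audio: Blob): Promise<string> {
  const { url, headers, body } = buildTranscriptionRequest(openAiApiKey, audio);
  const response = await fetch(url, { method: "POST", headers, body });
  if (!response.ok) {
    throw new Error(`Transcription failed with status ${response.status}`);
  }
  const data = (await response.json()) as { text: string };
  return data.text;
}
```

Separating request construction from the network call keeps the key check and model name easy to verify without a live API call.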
How voice flow works
- Start recording from the mic button in the input action row.
- When you stop recording, the flow captures any trailing audio, then submits the recorded audio blob for transcription.
- On success, the transcript is either appended to the input or auto-sent, depending on your settings.
- On failure, a transcription error notification is shown instead of silently dropping input.
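
The success and failure paths above can be sketched as follows. This is an illustrative outline, not the app's implementation: the `VoiceFlowDeps` interface and `handleRecordingStopped` function are hypothetical, and the dependencies are injected so the logic can run without a microphone or network access.

```typescript
// Hypothetical dependencies supplied by the surrounding app.
interface VoiceFlowDeps {
  transcribe: (audio: Blob) => Promise<string>;
  onTranscript: (text: string) => void; // appends or auto-sends, per settings
  notifyError: (message: string) => void; // shows a visible notification
}

// Runs after recording stops and the final audio blob is available.
// Returns true when a transcript was delivered, false when transcription failed.
async function handleRecordingStopped(
  audio: Blob,
  deps: VoiceFlowDeps,
): Promise<boolean> {
  let text: string;
  try {
    text = await deps.transcribe(audio);
  } catch (error) {
    // Surface the failure instead of silently dropping the input.
    const reason = error instanceof Error ? error.message : String(error);
    deps.notifyError(`Transcription failed: ${reason}`);
    return false;
  }
  deps.onTranscript(text);
  return true;
}
```

Only the transcription call sits inside the `try` block, so an error thrown while inserting or sending the text is not misreported as a transcription failure.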
Auto-send behavior
If `autoSendVoiceMessages` is enabled and your text input is empty, the transcript is sent immediately. Otherwise, it is appended to the current draft text.
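
The rule can be expressed as a small pure function. This is a sketch under stated assumptions: `resolveVoiceAction` and `VoiceAction` are hypothetical names, a whitespace-only draft is treated as empty, and a single space is inserted between the draft and the transcript. The app may handle both details differently.

```typescript
// Hypothetical result type: what to do with a finished transcript.
type VoiceAction =
  | { kind: "send"; text: string }
  | { kind: "append"; text: string };

function resolveVoiceAction(
  transcript: string,
  currentDraft: string,
  autoSendVoiceMessages: boolean,
): VoiceAction {
  // Assumption: a draft containing only whitespace counts as empty.
  const draftIsEmpty = currentDraft.trim() === "";

  if (draftIsEmpty) {
    return autoSendVoiceMessages
      ? { kind: "send", text: transcript }
      : { kind: "append", text: transcript };
  }

  // A non-empty draft always wins over auto-send, so typed text is never
  // sent without review. The space separator is an assumption.
  const separator = /\s$/.test(currentDraft) ? "" : " ";
  return { kind: "append", text: currentDraft + separator + transcript };
}
```

For example, with auto-send enabled, an empty input yields a `send` action, while an input that already contains `draft` yields an `append` action with the text `draft` followed by the transcript.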
Best practices
- Use auto-send for short command-style prompts.
- Keep auto-send off when you prefer to edit transcriptions before they are sent.
- Use a headset in noisy environments to reduce transcript cleanup.