Talk to your IDE coding assistant. 3 model families, 90+ languages, streaming transcription. 100% on-device.
Your audio never leaves your machine. No API keys, no cloud calls, no telemetry. Privacy by design.
Moonshine (fast), Whisper (90+ languages), Parakeet (real-time streaming). Pick the right tool for the job.
GitHub Copilot, Continue, Kiro, Cody โ any VS Code chat participant. Just configure and talk.
Say "undo", "save", "comment", "function", "rename", "commit". Editor actions, code constructs, and git ops by voice.
Auto-calibrating noise gate + RNNoise WASM denoiser. Works in noisy offices, cafes, and open floor plans.
Learns your codebase identifiers. "get user name" becomes getUserName. Scans open files automatically.
Filler word removal, auto-capitalize, auto-punctuate, context-aware formatting. Transcripts arrive clean.
Send to chat, editor, terminal, or clipboard. Walky-talky mode, wake word activation, idle auto-stop.
Searchable transcript history, performance metrics, latency benchmarks. Track and improve your voice workflow.
90+ languages with per-profile model suggestions. Linux, macOS, Windows. VS Code and Kiro.
Browse, download, and switch models from the built-in Model Manager panel.
| Model | Size | Strength |
|---|---|---|
| Moonshine Tiny | ~27MB | Fastest, quick commands |
| Moonshine Base | ~65MB | Fast, better accuracy |
| Whisper Tiny | ~75MB | 90+ languages, fast |
| Whisper Base | ~150MB | 90+ languages, balanced |
| Whisper Small | ~500MB | 90+ languages, high accuracy |
| Whisper Large v3 Turbo | ~3GB | 90+ languages, best accuracy |
| Parakeet TDT 0.6B | ~150MB | Real-time streaming, live captions |
Install, press Ctrl+Alt+V, and start talking. That's it.
Weekly AI agent intelligence from the builder of VoxPilot.