Trigger recording
Use push-to-talk on Linux X11 and macOS, or set up a desktop shortcut on Wayland where global hotkeys are handled by the compositor.
Local voice to text for macOS and Linux
A fully local dictation app with a native overlay, on-device transcription, and polished text insertion across your desktop.
Install in one line
Copy and paste this into your terminal.
curl -fsSL https://raw.githubusercontent.com/cesp99/sussurro/master/scripts/install.sh | bashBuilt for the desktop, not the browser
Sussurro is designed around a simple desktop loop: trigger recording, speak, watch the state change, and keep working in the same app. The overlay, tray, settings window, and text injection are part of the app itself, not a browser shell dressed up as one.
Fast enough to disappear
Use push-to-talk on Linux X11 and macOS, or set up a desktop shortcut on Wayland where global hotkeys are handled by the compositor.
Microphone audio is captured on-device and passed to Whisper.cpp, with the overlay reflecting idle, recording, and transcribing states in real time.
A fine-tuned Qwen 3 cleanup model removes filler words, handles self-corrections, and pastes polished text back into the app you were already using.
Different functions, one local stack
The pill-shaped capsule stays visible above your workspace and mirrors recording state. You can open Settings or quit from the tray icon or by right-clicking the capsule.
Sussurro keeps the full speech-to-text path on your own machine. Whisper handles recognition and the cleanup model refines the result before injection.
The docs expose a practical configuration surface so the app can fit different desktops, languages, and model choices without feeling fragile.
Open model stack, deliberate cleanup
The docs describe a modular architecture: audio capture, Whisper.cpp transcription, Qwen 3 cleanup, clipboard handling, and text injection. That means you can understand the path from microphone to final text instead of trusting a black box.
If you want more control, you can switch models, choose the transcription language, override config through environment variables, or run headless for CLI workflows and scripting.
Setup without guesswork
Install, launch from a terminal, then grant Accessibility access when prompted so the global hotkey can work.
Install the GTK, WebKit, and tray dependencies once, then use the built-in global hotkey with push-to-talk or toggle mode.
Install `wl-clipboard`, then bind your desktop shortcut to the provided trigger script because Wayland does not allow app-managed global hotkeys.
On first launch Sussurro guides you through downloading the required models. Whisper Small is the lighter option, Whisper Large v3 Turbo is the higher-accuracy option, and the Qwen 3 cleanup model is used to refine the final text.
Documentation map
FAQ
No. The docs describe a fully local pipeline: Whisper.cpp handles speech recognition on-device and the Qwen 3 cleanup model runs locally before text is injected.
The main app supports Linux and macOS. The overlay, settings window, tray, and headless mode are documented for both, with Wayland needing external shortcut setup.
Wayland blocks application-managed global hotkeys by design. Sussurro works there by having you bind your desktop environment shortcut to the included trigger script or socket command.
Yes. Settings lets you switch Whisper models, download models, and choose the transcription language. The config file also supports language codes, model paths, and environment overrides.
Yes. `sussurro-transcribe` is the companion CLI for transcribing local audio files with optional LLM cleanup, output files, language overrides, and debug mode.
No for normal use. The install script downloads the correct release binary for your platform. Building from source is available if you want to compile the native UI or inspect the full stack.
Read the research
Support the developer
Sussurro is fully open source and built around a native desktop experience. If it becomes part of your workflow, donations go straight to the developer maintaining it.
Donate on Ko-fi7jizbu8GD2EGJyxRGxDsSayGD9CV5hkCmUfeYpwvAiNH0xA6EAFb432c3bfF4BB1cBaCF19eABbb1da9F56488bc1qecdkhsh0fpx2zlp9x0smmme9xgwumltsf3m3phGCHW7CSWW7VA4UZMPSXDHR5CKLZ5DQREUVCJZFXYCCWWJMXTUJUABUCNFully local. Fully open source.