Saydrop vs Wispr Flow
Compare Saydrop (one-time CHF 39 on-device transcription) with Wispr Flow (Pro $15/mo cloud). Which privacy-first dictation app fits your Mac?
If you’re shopping for a dictation app on macOS, the choice between Saydrop and Wispr Flow hinges on one architectural question: do you want transcription to stay on your machine, or is cross-platform + mobile + cloud AI editing worth a monthly subscription?
For privacy-first, Apple-Silicon-native use: Saydrop at CHF 39 one-time with local transcription and default-on Gemma cleanup wins. For cross-platform + mobile + opt-in cloud Polish: Wispr Flow’s $144/year Pro plan wins, with the caveat that audio leaves your Mac and you pay indefinitely.
This post walks through the real tradeoffs, not the marketing claims.
At a glance
| Feature | Saydrop | Wispr Flow |
|---|---|---|
| Price | CHF 39 one-time | Free (2k words/wk) or Pro $144/yr |
| Trial | 14 days, full-feature | 14 days Pro |
| Transcription method | Local MLX Whisper | Cloud (OpenAI, Anthropic, AWS) |
| Text cleanup | Local Gemma (default), optional cloud Gemini | Cloud “AI Auto Edits” |
| Platforms | macOS M1–M4, Sonoma+ | Mac, Windows, iOS, Android |
| Audio leaves device | No (unless you enable cloud Gemini) | Yes (cloud-only) |
| Account required | No | Yes |
| Privacy Mode | Default | Opt-in |
| Native Apple Silicon | Yes | Yes, but optimized for cloud |
| Hotkey latency | 0.6–2.5 s | ~1–5 s (varies with network) |
| Personal dictionary | Local, instant updates | Cloud synced across devices |
Pricing: subscription vs one-time
Wispr Flow pricing is straightforward: $15/month or $144/year for Pro, or 2,000 words/week free. After two years, you’ve paid $288. After five years, $720. There is no lifetime license option.
Saydrop costs CHF 39 one-time. After five years, you’ve still paid CHF 39.
The practical math: if you dictate regularly (daily), the per-word cost of Saydrop on Apple Silicon is negligible. If you dictate occasionally, Wispr Flow’s free tier may suffice. If you need unlimited transcription across multiple devices and the cloud AI editing, Wispr Flow’s Pro is a reasonable trade for recurring cost.
The gap widens if you use Saydrop on multiple Macs — there is no per-device limit. One license covers your MacBook, iMac, and Mac mini.
Privacy: where your voice goes
This is the architectural core.
Saydrop: Your audio is transcribed on your Mac via MLX Whisper. It never leaves the device unless you explicitly enable cloud Gemini for text cleanup (off by default, BYO API key). The text cleanup is handled locally by Gemma 4 E2B — a multimodal model running in-process on your GPU.
Wispr Flow: Cloud-only architecture. Audio is sent to Wispr’s servers, then routed to subprocessors including OpenAI Whisper API, Anthropic Claude, and AWS. Wispr Flow offers Privacy Mode with zero data retention, but it is opt-in, not the default. Users must explicitly enable it. Wispr Flow complies with HIPAA, SOC2, and ISO27001, but compliance is not the same as data not leaving your machine.
If your dictation includes passwords, medical information, or legal notes — things you’d rather not send to the cloud even with “privacy mode” enabled — Saydrop’s default-on local approach is categorically different.
Wispr Flow’s cross-platform sync of your vocabulary also means that dictionary lives on Wispr’s servers. Saydrop keeps it local.
Text cleanup: cloud vs local Gemma
Both apps clean up raw transcription output (fixing grammar, punctuation, formatting). The approach differs.
Saydrop: Cleanup runs locally by default via Gemma 4 E2B (a quantized multimodal LLM optimized for Apple Silicon). It runs in-process, adds ~0.25–2s latency depending on text length, and the model is cached on your drive. You can disable cleanup entirely with SAYDROP_CLEANUP=0 to drop latency by ~0.5s. Optionally, you can provide a Gemini API key to use cloud cleanup instead, but that is off by default.
Wispr Flow: AI cleanup (“AI Auto Edits”) runs in the cloud. It is part of Pro, and like transcription, uses Wispr’s cloud infrastructure. It is more sophisticated than local Gemma — trained on cross-platform editing patterns and better at reformatting long passages — but it adds cloud latency and sends the raw transcript to their servers first.
If you want to clean up text without sending it to the cloud, Saydrop is your only choice. If you want cloud-grade AI formatting and do not mind the network latency, Wispr Flow’s editing is more polished.
Platform & performance
Saydrop: macOS only, Apple Silicon only (M1, M1 Pro/Max, M2, M2 Pro/Max, M3, M3 Pro/Max, M4). Requires macOS Sonoma (14) or later. Takes ~6 GB disk for the bundled Python environment and Whisper model. Transcription latency is consistently 0.6–2.5s depending on clip length and system RAM, because the model is resident and inference is local.
Wispr Flow: Broader reach — Mac, Windows, iOS, Android. Achieves native app performance on all platforms. Transcription latency depends on network and cloud queue load, typically 1–5s for short clips, slower during peak times. The advantage: one account, unified vocabulary, seamless mobile capture.
If you only use macOS on Apple Silicon and want the fastest, most predictable latency, Saydrop has the edge. If you switch between Mac and iPhone, or dictate on Windows, Wispr Flow is the only option that covers both.
Who should pick Saydrop?
- Privacy-forward. You do not want audio leaving your machine under any circumstance (Wispr’s cloud-first architecture rules it out).
- Apple Silicon Mac, one-time payment. You use an M1–M4 Mac, value owning software outright, and do not want a subscription.
- Local-first infrastructure. Your workplace or industry requires transcription to stay on-device (legal, medical, classified).
- Multi-Mac household. You have a MacBook and an iMac and want one license covering both without per-device fees.
- Predictable latency. You prioritize consistent sub-2s turnaround over cloud-grade AI formatting.
Who should pick Wispr Flow?
- Cross-platform. You need dictation on both Mac and iPhone/Android, or you use Windows.
- Cloud AI editing. You want sophisticated, server-grade text cleanup over local Gemma and are okay with the network latency.
- Account-based workflow. You use multiple devices and want vocabulary + settings synced across them.
- Free tier suffices. You dictate lightly (≤2,000 words/week) and do not need Pro, so cost is zero.
- HIPAA/SOC2 compliance. You need formal security certifications and Privacy Mode is acceptable for your use case (remember: it is opt-in).
The honest trade
Wispr Flow is polished and genuinely cross-platform. Its cloud infrastructure is robust, its free tier is generous, and if you already use it on iOS, staying there on Mac is frictionless. The AI editing is better than local Gemma.
Saydrop makes a different bet: your voice should not leave your device, and you should pay once, not forever. If that trade feels right — and for many privacy-conscious users it does — the tradeoff is worth it.
The best choice is the one that matches your threat model, device ecosystem, and budget. Both apps work. Saydrop is quieter, local, and one-time. Wispr Flow is louder, cross-platform, and cloud-connected.
FAQ
Can I use Wispr Flow on Mac for free?
Wispr Flow’s free tier includes 2,000 words/week on Mac. For unlimited transcription, you need Pro at $15/month or $144/year. Saydrop costs CHF 39 one-time with a 14-day full-feature trial.
Where does my voice go with Saydrop?
Nowhere. Audio is transcribed locally on your Mac via MLX Whisper. Text cleanup runs locally via Gemma by default. You can optionally enable cloud Gemini, but you provide your own API key.
Does Wispr Flow have a zero-retention privacy mode?
Yes, Wispr Flow offers Privacy Mode with zero data retention, but it’s opt-in. By default, audio goes to cloud processors including OpenAI, Anthropic, and AWS per their subprocessor list.
What if I need Wispr Flow on iPhone or Windows?
Saydrop is Mac Apple Silicon only (M1–M4, Sonoma+). Wispr Flow runs on Mac, Windows, iOS, and Android. If cross-platform is essential, Wispr Flow is the choice.
Can I use a personal dictionary in both apps?
Both support vocabulary hints. Saydrop’s personal dictionary is managed locally. Wispr Flow syncs your custom vocabulary across devices.
Which is faster: local Saydrop or cloud Wispr?
Saydrop averages 0.6–2.5s transcription latency on Apple Silicon. Wispr Flow’s latency depends on network and cloud queue load, typically 1–5s for short clips.
Try Saydrop
If local transcription and one-time pricing appeal to you, download Saydrop for a 14-day full-feature trial. No subscription, no account required, no audio leaves your Mac.
For more on how we built Saydrop’s local pipeline, see how we run MLX Whisper on macOS.
Comparing other dictation tools? Check out Saydrop vs SuperWhisper for a similar breakdown, or browse the best dictation apps for Mac in 2026.