Patrick Lehmann

Apple Dictation vs dictation apps

Apple Dictation is free and private, but it lacks AI cleanup, personal dictionaries, and jargon accuracy. See what separates built-in from standalone.

Apple Dictation is free, built-in, and entirely on-device. For casual use—taking notes, writing emails—it is enough. But it stops after roughly 30 seconds of silence, has no AI grammar-fixing, and cannot learn your jargon. Standalone dictation apps fill those gaps: they bring Whisper-class accuracy, personal dictionaries that stick spelling of technical terms, AI cleanup (grammar, filler removal, punctuation), and better hotkey UX. The choice is between “good enough for short clips” and “built for longer work and technical vocabulary.”

What Apple Dictation does well

Apple Dictation is simple: enable it in System Settings > Keyboard > Dictation, press your hotkey (default: Fn + Fn), and speak. It is free. It is on-device for most common languages (English, German, French, Spanish, etc.) when you enable offline mode. It requires no account, no installation, no subscription. Audio and text never leave your machine.

It handles basic transcription well. For short dictations under 20 seconds—quick notes, email bodies, Slack messages—it is fast and accurate enough. Punctuation is inserted automatically in many languages. It works with any text field: Mail, Notes, code editors, web forms.

Where Apple Dictation falls short

But Apple Dictation hits a wall quickly:

No AI cleanup. Apple Dictation transcribes what it hears; it does not fix grammar, remove filler words (“um”, “like”, “you know”), or reformat sentences. If you ramble, the text rambles. Standalone apps add a local AI step (Gemma, LLaMA) that rewrites for clarity without asking you first.

No personal dictionary. Dictation can recognize names from your Contacts, but nothing else. If you work in tech (mentioning “Kubernetes”, “PostgreSQL”, “React”), or medicine, or law, or finance, Apple Dictation will misheard those terms repeatedly. Standalone apps let you build a personal dictionary of unlimited jargon—product names, proper nouns, domain-specific words—and feed it to Whisper as a prompt hint, so it recognizes them correctly every time.

~30-second limit. Apple Dictation stops transcribing after approximately 30 seconds of silence. If you are composing a longer thought and pause mid-sentence, the clip ends and you must press the hotkey again. This breaks the flow of longer dictations.

Limited formatting. Apple Dictation adds basic punctuation but does not convert spoken cues into structure. You cannot ask it to “capitalize this” or “bullet point the next three items” mid-dictation.

No mixed-language support. If you code-switch (switching between English and German mid-sentence), Apple Dictation will pick one language and stay with it. Many users speak two or three languages—toggling languages mid-dictation is clunky.

Accuracy: Whisper vs Apple Dictation

Apple does not publish a Word Error Rate (WER) for Dictation. But Whisper large-v3, the model used by most standalone dictation apps, achieves a 2.7% WER on LibriSpeech test-clean audio (clean, quiet environment). Whisper models improve further on technical vocabulary when you provide a personal dictionary prompt—Whisper uses it to bias recognition of specific terms.

For casual transcription in quiet, Whisper-class models and Apple Dictation perform similarly. The gap widens on domain-specific vocabulary: Apple Dictation will botch “Kubernetes”; Whisper with a personal dictionary will spell it correctly. The gap widens further on longer clips, accented speech, and background noise.

The missing piece: AI cleanup

Standalone dictation apps add a second step: after transcribing, they run the text through a local AI model (Gemma, LLaMA) that fixes grammar, removes fillers, and improves punctuation—all without sending your words anywhere.

Apple Dictation has no equivalent. What you speak is what you get.

Here is an example. Raw transcription (Whisper):

Um, so I think that like we should probably just go ahead and build like the the new feature in Kotlin, you know, because like the team is already familiar with it, uh, and yeah.

After AI cleanup (Gemma):

We should build the new feature in Kotlin because the team is already familiar with it.

The cleanup model runs on your Mac using the same on-device privacy as the transcription. No network, no server.

Personal dictionary and jargon accuracy

If you work in technology, finance, medicine, law, or any field with specialized vocabulary, a personal dictionary is the biggest practical difference between Apple Dictation and standalone apps.

Apple Dictation recognizes names from your Contacts. That is all. If you mention “PostgreSQL”, “machine learning”, “ARR”, “CAC”, “attestation flow”, or your company name repeatedly, Apple Dictation will misheard it every time, and you will spend minutes correcting it.

Standalone dictation apps (built on Whisper) accept a personal dictionary—typically a text file with one term per line. Before each dictation, the app builds a prompt hint from that dictionary and passes it to Whisper:

Recognize these terms correctly in the audio:
PostgreSQL
machine learning
ARR
CAC
attestation flow
SaaS

Whisper sees that hint and biases its decoding toward those exact spellings. The accuracy on those terms jumps from 40% to 95%+ in one update.

The dictionary is instant—no fine-tuning, no retraining. Edit the file, start a new dictation, and your jargon is recognized. For knowledge workers, this alone is worth switching.

When built-in dictation is enough

Apple Dictation is the right choice if:

  • You dictate short clips (under 20 seconds) in quiet environments.
  • You use common vocabulary (no specialized jargon).
  • You are willing to pause, correct, and edit before sending.
  • You do not need to speak continuously without pausing.
  • You are comfortable with the Fn hotkey and no customization.

It is free, requires no setup, and you already have it.

When you need a standalone app

Standalone dictation apps (like Saydrop) are worth it if:

  • You dictate longer thoughts or compose while speaking.
  • You use technical terms, proper nouns, or domain-specific vocabulary that gets misheard.
  • You want AI to automatically fix grammar and remove fillers.
  • You want a custom hotkey (double-tap, hold, etc.).
  • You want to see live transcription while dictating.
  • You code-switch between languages mid-sentence.

The trade-off is price and setup time. Saydrop is CHF 39 one-time (no subscription, 14-day full-feature trial). It requires macOS 14+ and Apple Silicon (M1–M4). Setup takes 2 minutes: download, install, trigger the onboarding, and the model downloads and warms up automatically.

Comparison table

FeatureApple DictationWhisper-based app (e.g., Saydrop)
PriceFree (built-in)CHF 39 one-time (Saydrop)
PrivacyOn-deviceOn-device
Transcription length~30 seconds per clipUnlimited (clip length set by user)
Accuracy (general)GoodExcellent (Whisper large-v3)
Technical vocabularyPoor without Contacts namesExcellent with personal dictionary
Grammar/filler cleanupNoneYes (local Gemma model)
Personal dictionaryNames only (Contacts)Unlimited custom terms
Mixed-language supportManual language selectionAuto-detect or force multiple languages
Hotkey customizationFixed Fn keyCustomizable (double-tap, hold, etc.)
Live transcription UINoYes
Time to first dictationImmediate~5 min (model download + warmup)
Setup complexityZeroMinimal (one-click install)

The upgrade path

If you currently use Apple Dictation and hit its limits, upgrading is straightforward:

  1. Install Saydrop from the website or Mac App Store. Start a 14-day trial (full features, no credit card).
  2. Set your hotkey. Saydrop shows you the available keys (double-tap Control, hold Option, etc.). Pick what fits your workflow.
  3. Build your dictionary. If you have important terms (product names, jargon, proper nouns), add them to the dictionary in Settings. You can paste in an entire list at once.
  4. Dictate normally. Speak into your Mac. Saydrop transcribes, cleans up, and pastes the result via synthetic ⌘V—no copy/paste needed.

On the first launch, the app downloads the Whisper model (1.5 GB) and runs a quick warmup. Subsequent dictations are sub-second.

You do not have to switch entirely. Many users keep Apple Dictation for quick one-liners and use Saydrop for longer, more complex work. Both are on-device and private, so there is no privacy trade-off.

FAQ

Is Apple Dictation private? Yes. Apple Dictation is on-device for most languages when enabled in Keyboard settings. No account required. Audio and text stay on your Mac—Apple has no server-side processing for standard dictation.

Why use a dictation app if Apple Dictation is built-in? Apple Dictation has no AI cleanup, no personal dictionary, and stops after ~30 seconds of silence. Apps like Saydrop add Whisper-class accuracy, local Gemma cleanup (grammar, fillers), and personal jargon hints.

Can I train Apple Dictation on my own vocabulary? Apple Dictation can only recognize names from your Contacts. Custom vocabulary (up to 1,000 terms) exists in Apple’s Voice Control feature, not Dictation. Dictation-focused apps like Saydrop let you build a personal dictionary with instant feedback.

Does Apple Dictation work offline? Some languages run locally; others require an internet connection. Most common languages (English, German, French) support on-device dictation. Check in System Settings > Keyboard > Dictation to see which languages are available for your device.

Is there a free Whisper-based alternative to Saydrop? Yes—whisper.cpp and mlx-whisper are free open-source projects. Both require manual setup, model download, and integration. Saydrop is a finished product with a CHF 39 one-time license, personal dictionary, AI cleanup, and hotkey UX—ready to use from the Mac App Store.


The bottom line: Apple Dictation is genuinely good for short, casual dictations. It is free, private, and requires no setup. But if you dictate longer work, use specialized vocabulary, or want automatic cleanup, a standalone app will save you editing time—and frustration. If you want to try Saydrop risk-free, download it here and start the 14-day trial.