Voice to text is technology that turns your spoken words into written text as you speak. It is built into every major platform for free: press Win + H on Windows, tap the microphone on your phone keyboard, use dictation on iPhone and Mac, or turn on Voice typing in Google Docs. Setup takes about a minute, and because most people speak far faster than they type, dictation can seriously speed up your writing. This guide walks through every major option, step by step.
What this guide covers
This page is the hub for our whole voice typing series. Read it top to bottom for the full picture, or jump straight to the detailed guide you need:
- How to use voice to text: the universal starter guide for any device
- Voice to text on Android: Gboard dictation, settings, and offline use
- Voice to text on iPhone: iOS dictation setup and tricks
- Voice typing in Google Docs: hands-free writing in your browser
- How to turn off Google voice typing: for when the microphone keeps popping up uninvited
What is voice to text and how does it work
Voice to text, speech to text, voice typing, and dictation all describe the same core feature: software that listens through your microphone and types what you say. The terms are interchangeable in everyday use. Transcription is the related but separate job of converting recorded audio or video files into text after the fact.
Under the hood, the process has three steps. Your microphone captures audio. A speech recognition engine, running either on your device or on a server, matches the sound against language models and produces words. The software then inserts that text at your cursor and, in modern engines, adds punctuation automatically.
Where the recognition runs matters in practice. On-device recognition works offline and keeps audio on your hardware, but quality depends on your phone or computer. Server-based recognition needs an internet connection and usually handles accents, noise, and unusual words better.
Voice typing on Windows (Win + H)
Windows has voice typing built into the system, so it works in any app with a text field: Word, browsers, email, chat. Windows 11 calls it voice typing, Windows 10 calls it dictation, and the shortcut is the same in both.
- Click into the text field where you want the text to appear.
- Press Win + H. A small microphone panel opens.
- Start speaking when the microphone icon activates. Your words appear at the cursor.
- Say punctuation out loud (“comma”, “period”, “question mark”) or open the settings gear in the panel and turn on automatic punctuation.
- Say “stop dictating” or press Win + H again to finish.
Voice typing on Windows needs an internet connection because recognition runs on Microsoft servers. If you want to control the whole PC by voice rather than just type, Windows 11 also includes Voice access, a separate and more powerful accessibility feature.
For a deeper walkthrough, including what to do when nothing happens after Win + H, read our full guide on how to use voice to text.
Voice to text on Android (Gboard)
On Android, dictation lives inside Gboard, Google’s default keyboard. If you see a microphone icon on your keyboard, you already have everything you need.
- Open any app and tap a text field so the keyboard appears.
- Tap the microphone icon in the top right corner of Gboard.
- Speak naturally. Gboard inserts your words as you go and adds punctuation on most modern devices.
- Tap the microphone again to stop, or simply start typing.
In Gboard settings, under Voice typing, you can manage languages and download offline speech packs, which let dictation work without a connection on many phones. The full details, including fixes for a missing microphone icon, are in our guide to voice to text on Android.
And if your problem is the opposite, the microphone keeps activating when you brush it by accident, here is how to turn off Google voice typing.
Voice to text on iPhone (dictation)
Apple calls its version dictation, and it is tightly integrated into the iOS keyboard.
- Go to Settings, then General, then Keyboard, and turn on Enable Dictation.
- Tap the microphone button on the keyboard.
- Speak. iOS adds punctuation automatically, and on recent versions the keyboard stays active, so you can type and talk in the same sentence.
- Tap the microphone again to stop. Dictation also ends on its own after a stretch of silence.
On supported iPhones, dictation for many languages runs entirely on the device, so it works offline and your audio does not leave the phone. Setup details, supported languages, and fixes for a grayed-out dictation toggle are covered in voice to text on iPhone.
Dictation on Mac
Macs have dictation built in as well. Open System Settings, go to Keyboard, and turn on Dictation. Pick a shortcut to trigger it (common defaults are the microphone key or a double press of Fn), then click into any text field, hit the shortcut, and speak. On modern Macs, dictation for many languages also runs on the device itself.
Voice typing in Google Docs
Google Docs ships its own dictation tool, separate from your operating system, and it is one of the most capable free options for long writing sessions.
- Open a document at docs.google.com in a Chromium browser such as Chrome or Edge.
- Open the Tools menu and choose Voice typing, or press Ctrl + Shift + S (Cmd + Shift + S on Mac).
- Click the microphone that appears, allow microphone access if your browser asks, and start talking.
- Say punctuation and layout commands as you go: “comma”, “new line”, “new paragraph”.
- Click the microphone again to stop.
Docs also understands spoken editing commands in English, like “select last word” and “delete”. The complete command list and fixes for common problems are in our guide to voice typing in Google Docs.
When built-in tools are not enough
System dictation is great for messages and quick notes. Heavy users tend to hit the same walls, though: the microphone stops listening after pauses, names and technical terms come out mangled with no way to teach the engine, filler words land in your text exactly as you said them, and every platform behaves a little differently.
That gap is what dedicated dictation apps exist for. Our team builds blablaType, a push-to-talk dictation app for Windows. You hold F9, speak, release, and the text appears at your cursor in whatever app you are using: Word, email, messengers, your browser. A few things it does that system tools do not:
- Translation on the fly: hold F8, speak your own language, and the translated text (English, for example) is typed into the field.
- A personal dictionary: correct a word once and the app remembers your version from then on.
- A hands-free “Okay Safi” assistant on the Premium plan.
Recognition runs on a server, so it needs an internet connection. The trial gives you 7 days of the full experience with no card required; after that, plans are $6.99 per month for Base and $9.99 for Premium. A macOS version is in development.
If you want to compare more options before choosing, see our roundup of the best dictation software.
Transcribing audio and video files
Dictation handles live speech. When the speech already happened, a meeting recording, a voice memo, a lecture video, you need transcription instead: upload the file, get the text back.
blablaType includes a file transcription tool: upload an audio or video recording and get the text back. Supported formats and current pricing are listed on the transcription page.
How to get more accurate results
Accuracy depends as much on your setup as on the engine. The biggest wins:
- Use a decent microphone. A headset or earbuds beat a laptop’s built-in mic, mostly because they keep a constant distance from your mouth.
- Reduce background noise. Recognition degrades fast with TV, traffic, or other voices nearby.
- Speak in full phrases at a natural pace. Engines use context to pick the right words, so slow word-by-word dictation actually hurts accuracy.
- Learn the punctuation commands of your tool, or enable automatic punctuation where it is offered.
- Fix recurring mistakes at the source. If your tool supports a custom dictionary, teach it the names and terms it keeps getting wrong.
Worried that you ramble when you talk? Everyone does. Either tidy the text up afterwards or use a tool with built-in speech cleanup.
A quick word on privacy
Where your audio goes depends on the tool. iPhone and Mac dictation, and Gboard with offline packs, can process speech on the device. Windows voice typing, Google Docs Voice typing, and most desktop dictation apps send audio to servers for recognition. blablaType works the same way: audio travels over HTTPS to the server, gets transcribed, and is not stored after processing. If privacy matters for your work, check how your tool handles audio before dictating sensitive material.
Where to start
Start with what you already have. Press Win + H on Windows or tap the microphone on your phone keyboard and dictate one email today. If it sticks and you start wanting more speed, cleaner output, or dictation that behaves the same in every app, try blablaType free for 7 days. And come back to the guides above whenever you need the platform specifics.