Voice to Text in Gemini

The ins and outs of using voice to text in Gemini

Voice to text is a helpful feature in Gemini when you prefer to talk out your ideas and get a written response back.

How to use voice to text

Activate voice to text by selecting the “Use microphone” button.

Once it is selected, your words appear as text while you speak. When you finish, the button changes from a microphone to a "send" icon.

Prompting Best Practices

Talk through your ideas and needs openly and at length. After all, you are talking to your digital employee! This works because large language models excel at synthesizing unstructured input (i.e., word vomit), so there is no need to edit yourself mid-sentence.

Call out uncertainties as you go. Phrases like "I’m not sure about this part…" tell the model where to focus its synthesis and where to offer options.

Technical Details

Automatic punctuation and formatting

The model inserts punctuation, capitalization, and paragraph breaks automatically. You can also dictate formatting explicitly (“new paragraph,” “comma,” “period”) if higher precision is needed.
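To make the idea of dictated formatting concrete, here is a minimal Python sketch of how spoken commands could be turned into punctuation after transcription. It is an illustration only, not how Gemini does it: the function name `apply_dictated_formatting` and the command table are hypothetical, and the table covers only the three commands mentioned above.

```python
import re

# Hypothetical command table for illustration; not Gemini's actual command list.
PUNCTUATION = {"comma": ",", "period": "."}
PARAGRAPH_BREAK = "new paragraph"


def apply_dictated_formatting(transcript: str) -> str:
    """Replace spoken formatting commands with the symbols they name."""
    text = transcript
    for phrase, symbol in PUNCTUATION.items():
        # Consume the space before the spoken word so the mark attaches
        # directly to the preceding word ("world period" -> "world.").
        pattern = r"[ \t]*\b" + re.escape(phrase) + r"\b"
        text = re.sub(pattern, symbol, text, flags=re.IGNORECASE)
    # A paragraph break replaces the command and the spaces around it.
    pattern = r"[ \t]*\b" + re.escape(PARAGRAPH_BREAK) + r"\b[ \t]*"
    text = re.sub(pattern, "\n\n", text, flags=re.IGNORECASE)
    return text.strip()


if __name__ == "__main__":
    spoken = "hello comma world period new paragraph next line period"
    print(apply_dictated_formatting(spoken))
```

A simple replacement like this cannot tell a command from the ordinary word (for example, "the trial period" would lose the word "period"), and it does not capitalize sentences. A real dictation system has to use context to resolve those cases.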

Long-form memory window

Voice mode supports extended monologues. You can speak for several minutes at a time, and the transcription will remain cohesive enough for the model to synthesize into structured output afterward.
