Gemini TTS: Voices, Languages and Prompting Guide


Gemini TTS is the neural engine behind Audiobook Maker's PREMIUM voices. This guide covers the available voices, supported languages, and how to steer delivery with prompts.

Voice options

Gemini TTS offers 30 distinct voices, each with its own character. Voice names are fixed; the descriptor next to each name summarises its natural tone.

Voice           Character
Zephyr          Bright
Puck            Upbeat
Charon          Informative
Kore            Firm
Fenrir          Excitable
Leda            Youthful
Orus            Firm
Aoede           Breezy
Callirrhoe      Easy-going
Autonoe         Bright
Enceladus       Breathy
Iapetus         Clear
Umbriel         Easy-going
Algieba         Smooth
Despina         Smooth
Erinome         Clear
Algenib         Gravelly
Rasalgethi      Informative
Laomedeia       Upbeat
Achernar        Soft
Alnilam         Firm
Schedar         Even
Gacrux          Mature
Pulcherrima     Forward
Achird          Friendly
Zubenelgenubi   Casual
Vindemiatrix    Gentle
Sadachbia       Lively
Sadaltager      Knowledgeable
Sulafat         Warm
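
If you choose voices programmatically, the list above can be kept as a simple lookup table. The sketch below is illustrative only: the names `VOICES` and `voices_with` are hypothetical helpers, not part of Audiobook Maker or the Gemini API, and the data is copied from the voice list in this guide.

```python
# Voice name -> descriptor, copied from the voice list in this guide.
# VOICES and voices_with are hypothetical helper names for illustration.
VOICES = {
    "Zephyr": "Bright", "Puck": "Upbeat", "Charon": "Informative",
    "Kore": "Firm", "Fenrir": "Excitable", "Leda": "Youthful",
    "Orus": "Firm", "Aoede": "Breezy", "Callirrhoe": "Easy-going",
    "Autonoe": "Bright", "Enceladus": "Breathy", "Iapetus": "Clear",
    "Umbriel": "Easy-going", "Algieba": "Smooth", "Despina": "Smooth",
    "Erinome": "Clear", "Algenib": "Gravelly", "Rasalgethi": "Informative",
    "Laomedeia": "Upbeat", "Achernar": "Soft", "Alnilam": "Firm",
    "Schedar": "Even", "Gacrux": "Mature", "Pulcherrima": "Forward",
    "Achird": "Friendly", "Zubenelgenubi": "Casual", "Vindemiatrix": "Gentle",
    "Sadachbia": "Lively", "Sadaltager": "Knowledgeable", "Sulafat": "Warm",
}


def voices_with(character: str) -> list[str]:
    """Return the voices whose descriptor matches, ignoring case."""
    wanted = character.casefold()
    return [name for name, desc in VOICES.items() if desc.casefold() == wanted]


if __name__ == "__main__":
    print(voices_with("firm"))
```

Several voices share a descriptor, so a lookup like this returns a short list of candidates to audition instead of a single match.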

Supported languages

Gemini TTS supports the following languages (BCP-47 code in parentheses):

Arabic (ar), Filipino (fil), Bangla (bn), Finnish (fi), Dutch (nl), Galician (gl), English (en), Georgian (ka), French (fr), Greek (el), German (de), Gujarati (gu), Hindi (hi), Haitian Creole (ht), Indonesian (id), Hebrew (he), Italian (it), Hungarian (hu), Japanese (ja), Icelandic (is), Korean (ko), Javanese (jv), Marathi (mr), Kannada (kn), Polish (pl), Konkani (kok), Portuguese (pt), Romanian (ro), Russian (ru), Spanish (es), Tamil (ta), Telugu (te), Thai (th), Turkish (tr), Ukrainian (uk), Vietnamese (vi), Afrikaans (af), Albanian (sq), Amharic (am), Armenian (hy), Azerbaijani (az), Basque (eu), Belarusian (be), Bulgarian (bg), Burmese (my), Catalan (ca), Cebuano (ceb), Chinese Mandarin (cmn), Croatian (hr), Czech (cs), Danish (da), Estonian (et), Latvian (lv), Lithuanian (lt), Luxembourgish (lb), Macedonian (mk), Maithili (mai), Malagasy (mg), Malay (ms), Malayalam (ml), Mongolian (mn), Nepali (ne), Norwegian Bokmål (nb), Norwegian Nynorsk (nn), Odia (or), Pashto (ps), Persian (fa), Punjabi (pa), Serbian (sr), Sindhi (sd), Sinhala (si), Slovak (sk), Slovenian (sl), Swahili (sw), Swedish (sv), Urdu (ur).
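
A quick way to check whether a book's language tag is covered is to compare its primary subtag against the codes listed above. This is a minimal sketch: `SUPPORTED` and `is_supported` are hypothetical names, and the assumption that regional variants such as `pt-BR` are accepted whenever the base language is listed comes from this example, not from the source.

```python
# Primary language subtags copied from the list in this guide.
# SUPPORTED and is_supported are hypothetical helper names.
SUPPORTED = frozenset(
    "ar fil bn fi nl gl en ka fr el de gu hi ht id he it hu ja is ko jv mr "
    "kn pl kok pt ro ru es ta te th tr uk vi af sq am hy az eu be bg my ca "
    "ceb cmn hr cs da et lv lt lb mk mai mg ms ml mn ne nb nn or ps fa pa "
    "sr sd si sk sl sw sv ur".split()
)


def is_supported(tag: str) -> bool:
    """Check the primary subtag of a BCP-47 style tag against the list.

    Assumption: a regional variant counts as supported when its base
    language appears in the list.
    """
    primary = tag.replace("_", "-").split("-")[0].lower()
    return primary in SUPPORTED


if __name__ == "__main__":
    for tag in ("en", "pt-BR", "cmn", "zz"):
        print(tag, is_supported(tag))
```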

Prompting guide

The model infers delivery from the transcript automatically. You can steer it further with inline tags and structured directions.

Inline audio tags

Inline modifiers such as [whispers], [laughs], [excitedly], [bored] and [shouting] change tone, pace and emotional quality. Be creative and experiment with delivery variations.
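
Because the tags are plain bracketed text, they are easy to add or audit with a few lines of code before narration. The following is a rough sketch, assuming a cue is any text enclosed in square brackets; `with_cue`, `find_cues` and `strip_cues` are hypothetical helper names, not functions of Audiobook Maker or the Gemini API.

```python
import re

# Assumption: a cue is any run of non-bracket text inside [ ].
CUE_PATTERN = re.compile(r"\[([^\[\]]+)\]")


def with_cue(cue: str, text: str) -> str:
    """Prefix a line of transcript with an inline audio tag."""
    return f"[{cue}] {text}"


def find_cues(transcript: str) -> list[str]:
    """List the cues used in a transcript, in order of appearance."""
    return CUE_PATTERN.findall(transcript)


def strip_cues(transcript: str) -> str:
    """Remove cues and tidy whitespace, e.g. for a plain-text preview."""
    return " ".join(CUE_PATTERN.sub(" ", transcript).split())


if __name__ == "__main__":
    line = with_cue("whispers", "Don't wake the baby.")
    print(line)
    print(find_cues(line + " [laughs] Too late."))
    print(strip_cues(line))
```

Note that this pattern also matches bracketed text that is part of the book itself, such as editorial notes, so review the output of `find_cues` before relying on it.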

Advanced prompting elements

Key guidance

Don't feel you have to describe everything — giving the model space to fill the gaps often helps naturalness. Balance specificity with creative freedom, and prefer industry terminology and layered characteristics over plain emotional labels.

How to use prompts in Audiobook Maker

Audiobook Maker narrates the chapter text directly, so you add prompt cues inside the text itself, in one of two ways:

Source: Google AI — Speech generation
