Eleven v3 Audio Tags: Precision delivery control for AI speech

Fine-grained control over timing, rhythm, and emphasis with Eleven v3 Audio Tags. Transform flat delivery into dynamic, performative content.

v3

Great speech isn’t just about what’s said — it’s how it’s said. With Eleven v3 Audio Tags, you gain fine-grained control over timing, rhythm, and emphasis, allowing you to shape the pacing of a line with precision.

Using tags like [pause], [rushed], [stammers], or [drawn out], you can adjust how each sentence lands — not just emotionally, but rhythmically. That control turns flat delivery into performance.

What is delivery control in AI speech?

Delivery control is the ability to direct the flow of speech — how quickly it moves, where it pauses, when it emphasizes. It’s what makes a line feel dramatic, casual, tense, or comedic.

Eleven v3では、デフォルトのペースに縛られずに配信できます。スクリプトから直接、サスペンスのために遅くしたり、緊急性のために速くしたり、ユーモアのためにリズムを加えたりできます。

Okay, so like I finally beat level 42 of that game I said I’d quit like... a month ago. (laughs) And then for the final big scary mega boss... it's just (giggle) like some cute little bunny rabbit (hysterical laughing) I just couldn't do it (big laugh) It was sooooooo cute!

Example:  "Okay, so like I finally beat level 42 of that game I said I’d quit like... a month ago. [laughs] And then the final boss... was just... [giggle] a bunny rabbit. [big laugh] I couldn’t do it. It was too cute."

Tags here shape the tempo and timing — and that’s what makes the line land.

Controlling timing, pacing, and presence

Tags give you access to the subtle cues humans use to pace speech naturally:

  • Pauses & breaks: [pause], [breathes], [continues after a beat]
  • Speed cues: [rushed], [slows down], [deliberate], [rapid-fire]
  • Hesitation & rhythm: [stammers], [drawn out], [repeats], [timidly]
  • Emphasis: [emphasized], [stress on next word], [understated]

Example: "[drawn out] Sooooo... you're saying... [suspicious tone] you didn't eat the last slice?"

These tags give you complete control over how a voice feels in motion.

Pacing for tone and meaning

Arabella
I’m fine.
Arabella
flatly I’m fine.
Arabella
quietly, after a pause I’m... fine.
Arabella
angrily, fed up  I'm FINE!
James
[questioning]Are you pause  sure you're fine?
Arabella
I’m fine. pause  really!

Changing how a line is delivered changes how it's interpreted.

Compare:

  • I’m fine.
  • [flatly] I’m fine.
  • [quietly, after a pause] I’m... fine.
  • [angrily, fed up] I'm FINE!
  • [questioning]Are you [pause] sure you're fine?
  • I’m fine. [pause] really!

Same words. Different meaning. With delivery control, tone emerges not from word choice, but from timing and intent.