Voice agent infrastructure

Facts from public sources. Notes from production.

A curated directory of voice models, installable skills, and agent infrastructure — structured for builders, not marketing decks.

STT, TTS, and STS models plus audio tools, voice AI, and related models from 27 labs.

Installable agent skills for AI coding tools, focused on voice workflows and integrations.

Top TTS models

Ranked by TTS Arena and Artificial Analysis benchmarks where available, then latency.

Order by

Model	Lab	Benchmark
Hume Octave	Hume	#5 · Elo 1561
MiniMax Speech-02-Turbo	MiniMax	#7 · Elo 1544
ElevenLabs Turbo v2.5	ElevenLabs	#8 · Elo 1539
MiniMax Speech-02-HD	MiniMax	#9 · Elo 1535
ElevenLabs Flash v2.5	ElevenLabs	#10 · Elo 1531
ElevenLabs Multilingual v2	ElevenLabs	#11 · Elo 1528
Cartesia Sonic	Cartesia	#13 · Elo 1513
PlayHT 2.0 Turbo	PlayHT	#23 · Elo 1405
Cartesia Sonic Turbo	Cartesia	—
Rime Arcana v1	Rime	—

Browse by type

Jump to the models directory with a type filter applied.