Grok Voice
Available in API
Deploy intelligent speech-to-speech voice agents for customer support, sales, and more. Enterprise-grade text-to-speech and speech-to-text APIs.
Build real-time voice agents with tool use, search, and multi-turn conversation.
Natural speech from text with multiple voices and audio formats. Built for telephony and web.
Enterprise-grade transcription for phone calls, meetings, videos, and podcasts.
MP3, WAV, OGG, Opus, FLAC, AAC, MP4, M4A, MKV, MOV, WebM
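As a rough illustration of working with the format list above, a client could check a file before uploading it for transcription. This is a minimal sketch, not part of any xAI SDK: the helper name `is_supported_media` is hypothetical, and it assumes each listed format maps to a file extension of the same name, whereas the service itself may detect formats from file contents.

```python
from pathlib import Path

# Extensions derived from the format list above (assumed mapping:
# each listed format name is treated as a lowercase file extension).
SUPPORTED_EXTENSIONS = {
    "mp3", "wav", "ogg", "opus", "flac", "aac",
    "mp4", "m4a", "mkv", "mov", "webm",
}


def is_supported_media(filename: str) -> bool:
    """Return True if the file's extension appears in the listed formats.

    Hypothetical client-side pre-check; it looks only at the file name.
    """
    suffix = Path(filename).suffix.lower().lstrip(".")
    return suffix in SUPPORTED_EXTENSIONS
```

A check like this only catches obviously unsupported files early; a file with a listed extension can still fail if its contents are malformed.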
Clone a voice from a short recording and use it instantly across Grok Text to Speech and Voice Agent APIs.
Everything you need to build production voice experiences — from realtime agents to batch transcription.
Realtime voice agents
Full-duplex conversations with sub-second latency
Text-to-speech
Natural speech from text across 80+ voices
Speech-to-text
Accurate transcription with speaker diarization
Tool calling
Call APIs and take actions mid-conversation
Custom voices
Clone or create voices for your brand
25+ languages
Multilingual with natural intonation per locale
Sub-second latency
Fast enough for real conversations at scale
Speech tags
Control whisper, laughter, pauses, and tone
Speaker diarization
Identify who said what in multi-speaker audio
Streaming & batch
Realtime WebSocket or async batch processing
Multiple audio formats
PCM, MP3, Opus, FLAC, WAV, and more
Session control
Dynamic instructions, context, and tool updates
Enterprise ready
SOC 2, HIPAA eligible, and GDPR compliant
Text normalization
Proper formatting of numbers, dates, addresses
Interruption handling
Natural turn-taking with barge-in support
Multilingual voices with natural intonation. Preview any voice instantly.
Pricing
Straightforward usage-based pricing with no hidden fees, minimums, or forced upgrades.
Need higher limits or rollout help?
Talk with xAI about onboarding, custom limits, and enterprise deployment.
Enterprise
Enterprise-ready controls, compliance, security, and scale.
Audited controls for security, availability, and confidentiality.
BAA available for healthcare applications handling protected health information.
Data processing agreements and EU data residency options.
Multi-region infrastructure for enterprise workloads.
Concurrent session and request limits scaled to your traffic.
SAML SSO, role-based access, and audit logging for your team.
Enable zero data retention for your deployments.
Get an API key and start building in minutes, or talk to our team about enterprise deployment.