Chat, search, reason, and create — all in one place.
Answers with what's happening now and speaks your language.
A truth-seeking assistant for everyday work — writing, research, and quick recaps.
Multiple agents work in parallel for deeper answers on the hardest questions. Each agent shows its work so you can audit the reasoning.
Searches the web and 𝕏 live, so answers reflect what's happening now rather than last year's training data.
Generate images and video from text prompts or reference photos. Restyle, edit, and iterate without leaving the conversation.
Everything you need in one assistant — from everyday tasks to deep research.
Real-time web search
Answers grounded in live sources across the web
Live 𝕏 integration
Breaking news, trends, and posts as they happen
Deep reasoning
Step-by-step thinking you can follow and verify
Multi-agent mode
Parallel agents that tackle sub-problems simultaneously
Code generation
Write, debug, and explain code in any programming language
Image generation
Create images from text with Grok Imagine
Video generation
Text-to-video up to 15 seconds at 720p
Voice conversations
Natural back-and-forth with sub-second latency
File & PDF analysis
Upload documents and get instant summaries
Vision understanding
Analyze screenshots, photos, and diagrams
30+ languages
Speak and write in dozens of languages natively
Memory across chats
Remembers your preferences and past conversations
Canvas for writing
Long-form editing with inline suggestions
Custom instructions
Tailor responses to your style and needs
Shareable conversations
Share any thread with a public link
Web, iOS, and Android
Available everywhere with synced history
Threads and follow-ups
Continue any conversation with context intact
SuperGrok
Higher limits, priority access, and multi-agent mode
Free to try on the web and in the apps. Upgrade to SuperGrok for higher limits and multi-agent reasoning.
Go to grok.com on the web, or get the iOS or Android app.
Sign in with your X or email account to pick up where you left off on any device.
Ask anything — switch modes for search, reasoning, voice, or imagery.
Access the same models through our API — text, vision, voice, images, and video. One key, every modality, production-ready.