Grok Imagine
Available in API
State-of-the-art image and video generation, editing, and restyling: one API for every visual modality.






Product demos, visual effects, and creative content. Turn a photo into a cinematic video with smooth pans, zooms, and reveals.
Upload a photo of a person and a clothing item — the API generates a video of them wearing it. Perfect for e-commerce and fashion.


Turn a single product photo into scroll-stopping content for every channel. Generate up to 10 images per request at 2K resolution.

Edit colors and objects with accuracy for complete control over your product showcase.
Seamlessly switch between styles to reinvent the experience in seconds.
Turn your rough ideas into multiple realities at the click of a button.



Start generating images and videos with just a few lines of code.
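As a rough illustration of what "a few lines of code" could look like, the sketch below builds (but does not send) an HTTP request for image generation using only the Python standard library. The endpoint URL, model name, and JSON field names are placeholders assumed for illustration, since this page does not state them; the real values are in the docs. The only detail taken from this page is the limit of 10 images per request.

```python
import json
import urllib.request

# Placeholders, NOT the real values: this page does not give the endpoint,
# the model identifier, or the request field names. Check the docs.
API_URL = "https://api.example.com/v1/images/generations"
MODEL = "image-model-name"

# From this page: "Generate up to 10 images per request".
MAX_IMAGES_PER_REQUEST = 10


def build_image_request(prompt: str, n: int = 1,
                        api_key: str = "YOUR_API_KEY") -> urllib.request.Request:
    """Build a POST request for image generation without sending it."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    if not 1 <= n <= MAX_IMAGES_PER_REQUEST:
        raise ValueError(f"n must be between 1 and {MAX_IMAGES_PER_REQUEST}")

    # Hypothetical payload shape: model, prompt, and number of images.
    body = json.dumps({"model": MODEL, "prompt": prompt, "n": n}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_image_request("A ceramic mug on a wooden table, studio lighting", n=4)
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
    # Sending would be urllib.request.urlopen(req), once the real endpoint,
    # model name, and API key from the docs are filled in.
```

Keeping request construction separate from sending makes it possible to validate the prompt and image count locally before any network call is made.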
Our image and video models top the leaderboards on quality, speed, and price.
| Model | Rank | Price | Latency |
|---|---|---|---|
| Grok Imagine | 1 | | |
| Veo 3.1 Fast | 4 | | |
| Veo 3 | 5 | | |
| Sora 2 Pro | 9 | | |
| Sora 2 | 12 | | |
Source: Artificial Analysis Text-to-Video Rankings, 2026-01-28
“Grok Imagine API delivers outstanding video quality with native audio generation, combining photorealistic realism, strong creative style, and an impressive level of control.”
About image and video generation, editing, and pricing. For a deeper dive, visit the docs.