December 09, 2024

Grok Image Generation Release

We are updating Grok's capabilities with a new autoregressive image generation model, code-named Aurora, available on the 𝕏 platform.


We've enhanced Grok's image generation abilities with a new model, code-named Aurora. Aurora is an autoregressive mixture-of-experts network trained to predict the next token from interleaved text and image data. We trained the model on billions of examples from the internet, giving it a deep understanding of the world. As a result, it excels at photorealistic rendering and precisely following text instructions. Beyond text, the model also has native support for multimodal input, allowing it to take inspiration from or directly edit user-provided images.
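To make "predict the next token from interleaved text and image data" concrete, here is a minimal toy sketch in Python. It is not xAI's implementation. The `<boi>`/`<eoi>` markers, the `img_*` codes, and the helper names `interleave` and `next_token_pairs` are all assumptions made for illustration, and the mixture-of-experts part of the architecture is not modeled. The sketch only shows how text and image tokens could share one sequence, and how each position becomes a prediction target given everything before it.

```python
# Illustrative only: the special tokens and image codes below are
# hypothetical and are not details of Aurora.
from typing import List, Tuple

BOI = "<boi>"  # hypothetical begin-of-image marker
EOI = "<eoi>"  # hypothetical end-of-image marker


def interleave(segments: List[Tuple[str, List[str]]]) -> List[str]:
    """Flatten (kind, tokens) segments into a single token sequence.

    Image segments are wrapped in BOI/EOI so that one model can tell
    where image tokens start and stop.
    """
    sequence: List[str] = []
    for kind, tokens in segments:
        if kind == "image":
            sequence.append(BOI)
            sequence.extend(tokens)
            sequence.append(EOI)
        elif kind == "text":
            sequence.extend(tokens)
        else:
            raise ValueError(f"unknown segment kind: {kind}")
    return sequence


def next_token_pairs(sequence: List[str]) -> List[Tuple[List[str], str]]:
    """Build (context, target) pairs: each token is predicted from all tokens before it."""
    return [(sequence[:i], sequence[i]) for i in range(1, len(sequence))]


segments = [
    ("text", ["a", "volcano", "surrounded", "by", "ice"]),
    ("image", ["img_17", "img_4", "img_92"]),  # made-up discrete image codes
]
sequence = interleave(segments)
pairs = next_token_pairs(sequence)
```

In this toy setup, generating an image from a prompt amounts to continuing the sequence after the text tokens, and editing a user-provided image amounts to placing that image's tokens in the context before the instruction.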

Grok's new capabilities are now available on the 𝕏 platform in select countries and will roll out to all users within a week.

Lockheed SR-71 Blackbird flying through an abstract sky.
An astronaut standing on the surface of an alien planet, with a spaceship in the background and multiple moons in the sky.
A volcano surrounded by ice.
A superposition of a cat in a hyperbolic time chamber in the style of Van Gogh.
An origami Cybertruck.
Cherry blossoms beneath a sunset sky.
A sketch of a multi-sided 3d geometric shape on paper.
A dog drinking a cup of tea in a library.
A closeup of a guitar player's hand holding a pick.
A comic of a young man standing by the sea, gazing back over his shoulder with a determined expression. In a speech bubble, printing the text, 'Make it happen, yesterday.'
A burger with double meat patty placed on a plate.
A female warrior holding a sword, with intricate armor and a confident expression.
An abstract composition using geometric shapes and vibrant colors, evoking a sense of energy and movement.
Elon Musk as a character in the animated series Rick and Morty.
A serene mountain lake at sunset, with mist rising from the water and the peaks reflected perfectly in the still surface.
A cyberpunk-inspired city at night, with neon lights, flying cars, and towering skyscrapers.
An elderly person, capturing every wrinkle and expression.
A dewdrop on a spider web, with the intricate patterns of the web and the refraction of light.

Image Generation

Grok can now generate high-quality images across several domains where other image generation models often struggle. It can render precise visual details of real-world entities, text, and logos, and it can create realistic portraits of humans.

Prompt: Cybertruck under an aurora
[Image comparison: outputs for this prompt from Grok, Imagen 3, Flux.1 Pro, Ideogram 2.0, and Dall-E 3]

Image Editing

Our new image generation model can now take images as input, giving users greater creative control and flexibility. We will release this capability to users on the 𝕏 platform soon.

Prompt: Make the cat anime style
[Image comparison: input image (before) and output image (after)]

Looking Forward

At xAI, we are advancing the frontier of multimodal understanding and generation. If this goal inspires you, we invite you to join us on this journey. We are hiring!