Grok Code Fast 1

A speedy daily driver

While today's models are undeniably powerful, they often don't feel purpose-built for agentic coding workflows, where loops of reasoning and tool calls can feel frustratingly slow. As heavy users of agentic coding tools, our engineers saw room for a more nimble, responsive solution optimized for our day-to-day tasks.

We built grok-code-fast-1 from scratch, starting with a brand-new model architecture. To lay a robust foundation, we carefully assembled a pre-training corpus rich with programming-related content. For post-training, we curated high-quality datasets that reflect real-world pull requests and coding tasks.

Throughout the training process, we collaborated closely with our launch partners to refine and sharpen the model’s behavior inside their agentic platforms. grok-code-fast-1 has mastered the use of common tools like grep, terminal, and file editing, and thus should feel right at home in your favorite IDE.

We've teamed up with select launch partners to offer grok-code-fast-1 for free for a limited time, including GitHub Copilot, Cursor, Cline, Roo Code, Kilo Code, opencode, and Windsurf.

Try free on

Cursor

Try free on

GitHub Copilot

Try free on

Cline

Blazing fast

Our inference and supercomputing teams developed several innovative techniques to dramatically accelerate our serving speed, creating a uniquely responsive experience where the model will have already called dozens of tools before you even finish reading the first paragraph of the thinking trace. We've also invested in prompt caching optimizations, regularly achieving cache hit rates above 90% when used with our launch partners.

A versatile programmer

grok-code-fast-1 is exceptionally versatile across the full software development stack and is particularly adept at TypeScript, Python, Java, Rust, C++, and Go. It can complete common programming tasks with minimal oversight, ranging from building zero-to-one projects and providing insightful answers to codebase questions to performing surgical bug fixes.

Example 1 of 2Battle Simulator

Danny Limanseta

@dannylimanseta

It's so quick that I actually had to change up how I work in Cursor.

Grok Code Fast is ridiculously fast, it’s probably the fastest model out there right now.It's so quick that I actually had to change up how I work in Cursor.

I threw together this battle simulator prototype in less than a day with Grok Code Fast onCursor. Because it responds so fast and follows instructions well, I've found I'm better off giving it smaller, focused tasks. That way I can iterate quickly and steer it exactly where I want.

For this battle simulator, my workflow is to have it plan out big features first, then break them down and execute in phases. Works way better than dumping a huge prompt on it all at once. The battle simulator came together so smoothly this way, just rapid iterations in Cursor until I get it to what I want.

An economical choice

We designed grok-code-fast-1 to be widely accessible, priced at:

$0.20 per million input tokens
$1.50 per million output tokens
$0.02 per million cached input tokens

grok-code-fast-1 was crafted to shine in the tasks developers face every day, striking a compelling balance between performance and cost. Its strength lies in delivering strong performance in a economical, compact form factor, making it a versatile choice for tackling common coding tasks quickly and cost-effectively.

Model Performance

Tokens per Second vs Output Price

Tokens per second(TPS)

190

Output price/ per 1M tokens

$18

Methodology

TPS metrics were calculated by directly measuring response generation speed via each modelprovider's API, considering only the final response tokens.

- Gemini 2.5 Pro, GPT-5, and Claude Sonnet 4: Measured using their respective public APIs.

- Grok Code Fast 1 and Grok 4: Measured using the xAI API.

- Qwen3-Coder: Hosted on DeepInfra at low precision (fp4), which reduces response quality.

We took a holistic approach to evaluating model performance, blending public benchmarks with real-world testing. On the full subset of SWE-Bench-Verified, grok-code-fast-1 scored 70.8% using our own internal harness.

While benchmarks like SWE-Bench provide valuable insights, we've found they don't fully reflect the nuances of real-world software engineering, particularly the end-user experience in agentic coding workflows.

To guide our model training, we pair these benchmarks with routine human assessments, where experienced developers rate the model's end-to-end performance on everyday tasks. We've also built automated evaluations to track key aspects of behavior, helping us balance trade-offs in design.

When developing grok-code-fast-1, we focused on usability and user satisfaction, guided by real-world human evaluations. The result is a model rated by programmers as fast and reliable for everyday coding tasks.

Grok Code for everyone

For a limited time, we’re excited to offer grok-code-fast-1 for free on exclusive launch partners. Here’s what our launch partners had to say about our model, which was recently released in stealth under the codename sonic.

Free for a limited time

We’re excited to offer Grok Code Fast 1 for free on exclusive launch partners.

"In early testing, Grok Code Fast has shown both its speed and quality in agentic coding tasks. Empowering developers with powerful tools is a core part of our mission at GitHub Copilot, and this is a compelling new option for our developers."

Mario Rodriguez (@mariorod1)

Chief Product Officer, GitHub

Instructions

Try free on

GitHub Copilot

The model is generally available via the xAI API, priced at $0.20 / 1M input tokens, $1.50 / 1M output tokens, and $0.02 / 1M cached input tokens.

Open

xAI Cloud Console

What to expect in the next few weeks

Last week, we quietly released grok-code-fast-1 under the codename sonic. During this stealth phase, our team carefully monitored community channels and deployed multiple new model checkpoints to address feedback.

As we advance this new model family, we're excited to iterate rapidly on your input. We highly value the developer community's support and encourage you to freely share all feedback, positive and negative.

We'll focus on delivering consistent updates to grok-code-fast-1, with improvements arriving in days rather than weeks. A new variant that supports multimodal inputs, parallel tool calling, and extended context length is already in training.

Read the grok-code-fast-1 model card here. We’re excited to see what you build!