Our gigafactory of compute.

Built in 122 days, outpacing every estimate, Colossus was the most powerful AI training system yet. Then we doubled it in 92 days to 200K GPUs. This is just the beginning.

200,000

H100 GPUs in a single interconnected cluster

The build

We go further, faster

We were told it would take 24 months to build. So we took the project into our own hands, questioned everything, removed whatever was unnecessary, and accomplished our goal in four months.

Aerial shot of Colossus's site

By the numbers

Unprecedented scale

We doubled our compute at an unprecedented rate, with a roadmap to 1M GPUs. Progress in AI is driven by compute and no one has come close to building at this magnitude and speed.

200K GPUs

NVIDIA H100 GPUs in a single interconnected cluster

170 PB/s

Aggregate memory bandwidth across the full system

2.8 Tb/s

Per-server network bandwidth for distributed training

>0.5 EB

Total storage for training data and checkpoints

Timeline

276 days from groundbreak

May 2024 to Feb 2025
February 2025

Full scale

Doubled to 200K GPUs.