World Record: 2,500 TPS on Llama 4 Maverick
Cerebras more than doubled Nvidia Blackwell’s published performance, setting a new benchmark for Llama 4 Maverick inference. And the best part? It’s coming soon via Meta’s Llama API, available to everyone.