Today, we’ve set a world performance record of 114 tokens per second on Llama 3.1 405B, independently verified by Artificial Analysis.