Aion Labs Uses SambaNova Cloud to Improve LLM Performance

Challenge:

Aion Labs builds a layer on top of existing LLMs to improve performance and accuracy. They do this by leveraging the ability of LLMs to communicate with each other. The latest reasoning models generate a large number of tokens, and each model's result is passed to the next model, so every step adds latency. Without extremely high-speed inference, these delays quickly accumulate.

A significant part of the Aion Labs process relies on large numbers of sequential calls. In interactive applications, such as a coding assistant, this can result in long wait times that users will find unacceptable. Fast inference translates directly into the LLM being able to do more in a shorter amount of time, which is essential to Aion Labs.
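To make the cost of sequential calls concrete, here is a minimal back-of-the-envelope sketch in Python. The step count, tokens per step, and tokens-per-second figures are hypothetical values chosen for illustration; they are not measurements from Aion Labs or SambaNova. The function name `chain_latency_seconds` is likewise an assumption made for this example.

```python
def chain_latency_seconds(tokens_per_step, tokens_per_second, overhead_seconds=0.0):
    """Estimate wall-clock time for model calls that must run one after another.

    Each step cannot start until the previous one finishes, so the
    per-step generation times (plus any fixed per-call overhead) add up.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return sum(
        tokens / tokens_per_second + overhead_seconds
        for tokens in tokens_per_step
    )


# Hypothetical workload: 5 chained reasoning calls, 2,000 output tokens each.
steps = [2000] * 5

slow = chain_latency_seconds(steps, tokens_per_second=50)   # assumed slower backend
fast = chain_latency_seconds(steps, tokens_per_second=500)  # assumed faster backend

print(f"slow backend: {slow:.0f} s, fast backend: {fast:.0f} s")
```

Under these assumed numbers the same five-step chain takes 200 seconds on the slower backend and 20 seconds on the faster one, which is the difference between an unusable and a usable interactive tool.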

The need for high-speed inference has become more critical with the rise of agentic AI. Aion Labs is seeing a noticeable trend toward agentic AI, with more and more companies building AI that is either fully autonomous or has a high degree of autonomy. With this autonomy comes a requirement for exceptional accuracy. These are sequential processes, and if there is an error at any point in the sequence of steps, the whole process fails. In these scenarios, moving from 99.9% accuracy to 99.99% matters and can unlock new use cases, and Aion Labs is using parallel prompting to get there.
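A short worked example shows why that extra "nine" matters in a sequential process. The sketch below assumes that each step succeeds independently with the same probability and that the whole chain fails if any single step fails; the 100-step chain length is a hypothetical figure, not one reported by Aion Labs.

```python
def chain_success_rate(per_step_accuracy, num_steps):
    """Probability that every step in a sequential chain succeeds.

    Assumes steps fail independently and that one failure fails the chain,
    so the per-step probabilities multiply.
    """
    if not 0.0 <= per_step_accuracy <= 1.0:
        raise ValueError("per_step_accuracy must be between 0 and 1")
    if num_steps < 0:
        raise ValueError("num_steps must be non-negative")
    return per_step_accuracy ** num_steps


# Hypothetical agent workflow with 100 sequential steps.
for accuracy in (0.999, 0.9999):
    rate = chain_success_rate(accuracy, 100)
    print(f"per-step accuracy {accuracy:.2%} -> chain success {rate:.1%}")
```

Under these assumptions, a 100-step chain completes successfully about 90.5% of the time at 99.9% per-step accuracy and about 99.0% of the time at 99.99%, so roughly one in ten runs fails in the first case versus one in a hundred in the second.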

Solution:

To achieve their goals, Aion Labs is using a combination of DeepSeek R1 671B, as well as the distilled variant of DeepSeek, Llama models, and even their own fine-tuned models. They are building a complete system composed of many models that leverages the fastest inference to deliver a solution that is more accurate than any of the individual models.

Aion Labs uses SambaCloud, powered by the SambaNova RDU, the SN40L. SambaCloud delivers the fastest inference on the largest and best open-source models, including the fastest performance available for DeepSeek R1 671B.

News

SambaNova Expands Deployment with SoftBank Corp. to Offer Fast AI Inference Across APAC

March 5, 2025

Business

Qwen3 Is Here - Now Live on SambaNova Cloud

May 2, 2025

Business

SambaNova Partners with Meta to Deliver Lightning Fast Inference on Llama 4

April 7, 2025

"Having faster inference speeds translates directly to the LLM being able to take more actions and get more things done in a shorter amount of time."
