DeepSeek-R1, the best open source reasoning model in the market, is now available on SambaNova Cloud for Dedicated...
The SambaNova Cloud Developer Tier will allow you to pay for token consumption for higher rate limits on the most...
We are excited to announce the addition of Tülu 3 405B, a fine tune of Llama 405B that performs better than DeepSeek...
Available now, Hugging Face developers can take advantage of the lightning fast inference speeds made possible with...
To accelerate the adoption of agentic AI, SambaNova is announcing the integration of the CrewAI agentic framework...
Available now, developers have access to the best open source test-time compute model released by Alibaba:...
Available now, SambaNova has optimized and released Meta's Llama 3.3 70B model on its RDU hardware architecture.
The SambaNova Startup Accelerator program is designed not just to support, but to partner with startups as they...
Available on SambaNova Cloud, developers now have access to some of the best open-source models in the Qwen 2.5...
With our recent partnership with Hugging Face, we have built a new tool to make this process significantly more...
Working together, we’ve created a SambaNova-Gradio integration that can be used to build and deploy AI apps using...
SambaNova is working with Zilliz, a cloud-native software company, to showcase the power of combining fast inference...
An end-to-end solution leveraging open-source LLMs to generate a Q&A dataset for fine-tuning smaller and faster...
The Benchmarking AI Starter Kit offers functionality to evaluate different LLMs available on SambaStudio or...
In this post, we demonstrate a useful and practical application of the SambaNova Cloud to power Continue, the...
In this post, we explore the strategic considerations and decision-making processes that can help you create...
An examination of where LLM-as-a-Judge can satisfyingly act as a judge of an outer model's performance and where it...
While LLM-as-a-Judge offers a favorable alternative to human evaluations, closed source LLMs impose some limitations...
By improving inference performance, SambaNova has unlocked the full potential of Llama 3.1 405B and enabled...
Comparing the end-user inference performance of SambaNova's technology against that of Groq and Cerebras.
SubgoalXL represents a significant step forward in the field of AI-powered theorem proving.
Recent work highlighted how quantization for recent LLaMa 3 models can lead to non-negligible decay in model...
NAIRR pilot, in partnership with SambaNova, provides generative AI platforms for groundbreaking academic research....
In this post, we explore why tokens per second doesn't paint the full picture of enterprise LLM inference...