Products
Developers
About

Insights & Information

Find what you need to accelerate your AI journey

Blog

Nine Predictions for AI in 2025

Explore our top AI 2025 predictions, from power efficiency and Agentic AI to open-weight models and AI-driven...

Blog

Open-Source Deep Research Agents: Enterprise-Grade Speed, Security & Saving them Millions

Today SambaNova has answered the call to help enterprises conduct deep research 3X faster than the best GPU...

Blog

SambaNova Cloud Launches the Fastest DeepSeek-R1 671B: Sign Up for Early Access

DeepSeek-R1, the best open source reasoning model in the market, is now available on SambaNova Cloud for Dedicated...

Blog

SambaNova Cloud Developer Tier is Live

The SambaNova Cloud Developer Tier will allow you to pay for token consumption for higher rate limits on the most...

Blog

Now On SambaNova Cloud, Tülu 3 405B, A New Model Better than DeepSeek V3

We are excited to announce the addition of Tülu 3 405B, a fine tune of Llama 405B that performs better than DeepSeek...

Blog

Hugging Face Partners with SambaNova to Supercharge its Inference API Capabilities

Available now, Hugging Face developers can take advantage of the lightning fast inference speeds made possible with...

Blog

Unlock the Future of Multi-Agent AI Workflows with CrewAI and SambaNova

To accelerate the adoption of agentic AI, SambaNova is announcing the integration of the CrewAI agentic framework...

Blog

Test-Time Compute Available on SambaNova Cloud with Qwen QwQ-32B-Preview

Available now, developers have access to the best open source test-time compute model released by Alibaba:...

Blog

Meta Llama 3.3 70B Now Available Today for Developers and Enterprises

Available now, SambaNova has optimized and released Meta's Llama 3.3 70B model on its RDU hardware architecture.

Blog

The SambaNova Startup Accelerator: Helping AI Innovators Realize Their Vision

The SambaNova Startup Accelerator program is designed not just to support, but to partner with startups as they...

Blog

Qwen 2.5 32B-Coder Available on SambaNova Cloud - 5X Faster than GPUs

Available on SambaNova Cloud, developers now have access to some of the best open-source models in the Qwen 2.5...

Blog

Hugging Face Makes it Faster to Review Papers with SambaNova

With our recent partnership with Hugging Face, we have built a new tool to make this process significantly more...

Blog

How Gradio Makes Building Apps on SambaNova Cloud Super Easy

Working together, we’ve created a SambaNova-Gradio integration that can be used to build and deploy AI apps using...

Blog

Zilliz: Powering AI RAG Applications with Vector Embeddings

SambaNova is working with Zilliz, a cloud-native software company, to showcase the power of combining fast inference...

Blog

Outperforming GPT-4o with Llama 3 8B: Domain Specific Fine Tuning for RAG

An end-to-end solution leveraging open-source LLMs to generate a Q&A dataset for fine-tuning smaller and faster...

Blog

Texas Advanced Computing Center Deploys SambaNova Suite, Enabling AI Inference for Science

Announcing a new customer relationship with the Texas Advanced Computing Center (TACC), one of the world’s leading...

Blog

Oak Ridge National Laboratory Deploys SambaNova Suite, Enabling Energy-Efficient AI Inference for Science

Today, the Oak Ridge National Laboratory (ORNL) has deployed SambaNova Suite to expand its research with secure,...

Blog

Argonne National Laboratory Deploys SambaNova Suite to Advance AI Inference In Science Research

Today we announced that the U.S. Department of Energy’s Argonne National Laboratory will expand its AI...

Blog

Correcting Common AI Benchmarking Errors with AI Starter Kits

The Benchmarking AI Starter Kit offers functionality to evaluate different LLMs available on SambaStudio or...

Blog

Accelerating Coding with SambaNova Cloud

In this post, we demonstrate a useful and practical application of the SambaNova Cloud to power Continue, the...

Blog

Developer Tips: Creating Valuable AI

In this post, we explore the strategic considerations and decision-making processes that can help you create...

Video

DataScale SN30

Purpose-built for the most demanding AI and deep learning workloads, the DataScale SN30 is a fully integrated...

Video

SambaNova & Lawrence Livermore National Laboratory Accelerate AI for Science

SambaNova Systems and Lawrence Livermore National Laboratory (LLNL) scale up their collaboration to improve the...

Blog

Judging Judges: All that is LLM Judgements does not glitter

An examination of where LLM-as-a-Judge can satisfyingly act as a judge of an outer model's performance and where it...

Blog

Replacing the Judge: Can Llama 405B Outperform GPT4 in the Court of AI?

While LLM-as-a-Judge offers a favorable alternative to human evaluations, closed source LLMs impose some limitations...

Blog

SambaNova Cloud: The fastest inference and the best models - for free

SambaNova is opening up the full spectrum of Llama models for developers to create the next wave of AI innovation.

Blog

Advanced AI Apps Need Fast Inference. SambaNova Cloud Delivers It

By improving inference performance, SambaNova has unlocked the full potential of Llama 3.1 405B and enabled...

Blog

Why SambaNova's SN40L Chip is The Best for Inference

Comparing the end-user inference performance of SambaNova's technology against that of Groq and Cerebras.

Blog

SubgoalXL: Pushing the Boundaries of LLM in Formal Theorem Proving

SubgoalXL represents a significant step forward in the field of AI-powered theorem proving.

Blog

SambaNova Holds Speed Record on Llama 3.1 405B - 4X faster than the rest

Today, we’ve set a world performance record of 114 tokens per second on Llama 3.1 405B, independently verified by...

Blog

Three Predictions for the Upcoming Llama 3 405B Announcement

Three predictions on how Llama 3 405B could reshape the landscape for developers engaged in AI and machine learning.

Blog

Typhoon model adds Thai language to Samba-1

With the inclusion of Typhoon Thai LLM, Samba-1 is now able to deliver generative AI capabilities in the Thai...

White Paper

Composition of Experts: Next Wave of AI Innovation

A new full-stack approach unlocks industry leading speed and accuracy with 10x better TCO.

Blog

Does reduced precision hurt? A bit about losing bits.

Recent work highlighted how quantization for recent LLaMa 3 models can lead to non-negligible decay in model...

Blog

SambaNova CEO explains why only one AI company wants a monopoly

Rodrigo Liang and veteran tech journalist Don Clark of The New York Times discussed how a full stack approach to AI...

Blog

Transform Your Data Privacy with SambaNova Systems

Samba-1 provides role-based access controls to maintain data governance policies, ensuring only those with proper...

Blog

SambaNova has broken the 1000 t/s barrier: why it's a big deal for enterprise AI

SambaNova is the clear winner of the latest large language model (LLM) benchmark by Artificial Analysis. Topping the...

Blog

Model Ownership

As enterprises incorporate generative AI into their business, retaining model ownership is one of the most important...

Blog

Introducing Fugaku-LLM in Composition of Experts

The Fugaku-LLM, a Japanese LLM, is being introduced into the Samba-1 CoE architecture to run optimally on the...

Blog

Sovereign AI

A Sovereign AI solution is fully contained with an entity, such as a company or country, and meets objectives while...

Blog

NAIRR; Govt-funded AI Research Resources

NAIRR pilot, in partnership with SambaNova, provides generative AI platforms for groundbreaking academic research....

Blog

Tokens Per Second is Not All You Need

In this post, we explore why tokens per second doesn't paint the full picture of enterprise LLM inference...

Blog

The Next Generation of Large Models

Generative AI can streamline processes across the entire organization, reduce costs, increase productivity, improve...

Blog

Enterprise-grade AI

See what it means to be enterprise-grade, and how SambaNova Suite, the first full-stack platform, purpose-built for...

Blog

Samba-CoE v0.3: The Power of Routing ML Models at Scale

Samba-CoE-v0.3, our latest Composition of Experts, surpasses DBRX Instruct 132B and Grok-1 314B on the OpenLLM...

Blog

Responsible AI

SambaNova is committed to providing customers with responsible, generative AI that is safe, secure, and transparent,...

Blog

SambaLingo hits 15,000+ downloads, now integrated with Samba-CoE-v0.2

SambaLingo has been downloaded over 15,000 times and has achieved remarkable performance of 280 tokens/s inference...

Blog

SambaNova Delivers Accurate Models At Blazing Speed

Samba-CoE v0.2 is climbing on the AlpacaEval leaderboard, outperforming all of the latest open-source models.

Blog

Using Mixed Precision on RDUs

SambaFlow 1.18 introduces support for mixed precision on RDUs, streamlining the experience for model developers and...

Blog

Sambaverse: Discover, Compare, Evaluate

Sambaverse is a unique environment where developers can freely test out hundreds of different models and directly...

Report

Get the IDC LINK Research opinion on the Samba-1 release

See why IDC says that Samba-1 "is harnessing the power of specialized AI models for a broad spectrum of business...

Blog

Benchmarking Samba-1

Benchmarking Samba-1 with the EGAI benchmark - a comprehensive collection of widely adapted benchmarks sourced from...

Blog

Samba-CoE v0.1 - Unlocking the power of routing to build a Composition of Experts

We're thrilled to unveil Samba-CoE-v0.1, a scaled down version of Samba-1, our latest breakthrough model that...

Blog

Samba-1: A Composition of Experts Model

Announcing Samba-1, the first trillion-parameter generative AI model that meets the performance, accuracy,...

Blog

SambaLingo - Open Source Language Experts

SambaNova is excited to open source a collection of expert models that adapt Llama 2 to a diverse set of 9 languages.

Blog

Text-to-SQL accuracy that beats GPT-4

Users can access valuable information locked in their SQL databases faster and easier than ever before.

Blog

Introducing the SambaCoder-nsql-Llama-2-70B model

Numbers Station and SambaNova have released a text-to-SQL model that surpasses the accuracy of GPT-4.

Blog

BLOOMChat-v2 Long Sequences at 176B

We are proud to release BLOOMChat-v2, a 32K sequence length, 176B multilingual language model.

Blog

SambaNova Joins NAIRR pilot program to support strategic national AI research initiative

The NAIRR Pilot collaboration between the NSF, White House Office of Science and Technology Policy, and SambaNova...

Blog

Fault management and RDA systems: Part 2

One of the key characteristics of our system is performance for enterprise data center AI workloads, and SNFM...

Blog

Fault management and RDA systems: Part 1

Designed specifically for the enterprise, the SambaNova platform includes enterprise level features to deliver the...

Blog

The purpose built architecture and why it matters to the enterprise

As enterprises move to adopt generative AI at scale, it is critical that they choose the best infrastructure to...

Blog

The importance of open source models for the enterprise

Choosing the right generative AI model is one of the most important decisions that an organization will make....

Blog

Predictions for Generative AI in 2024

In 2024, we will see generative AI move from a consumer chat tool to become a key part of every enterprise, changing...

Blog

ALiBi Deep Dive: Interpolation and Precision

The LLM community has proposed a variety of positional interpolation methods to extend the maximum sequence length...

Blog

Retrieval Augmented Generation in SambaNova Suite

By adding RAG support, SambaNova Suite continues to deliver the performance, accuracy, and flexibility to power the...

Blog

Elevating Information Retrieval and Augmenting Large Language Models

SambaStudio now supports text embedding models. This new feature significantly boosts the information retrieval...

Blog

Los Alamos National Laboratory expands partnership with SambaNova

As part of their on-going mission, Los Alamos National Laboratory has a new focus on deploying generative AI LLMs....

Podcast

Rodrigo Liang, SambaNova: $5+ Billion Valuation

SambaNova Systems CEO and Co-Founder Rodrigo Liang was featured on the Unicorn Builders Podcast, produced by Front...

White Paper

Build or Buy?

In this rapidly evolving landscape of pervasive AI, the choice between building or buying an AI solution is a...

Blog

Welcome to the era of pervasive AI

As we enter the era of pervasive AI, the accuracy demanded by enterprise tasks means that LLMs will need to have...

Video

Composition of Experts

Using a Composition of Experts model, SambaNova customers can take advantage of the benefits of multi-trillion...

Blog

Delivering on the promise of pervasive AI: SambaNova Suite, powered by the SN40L

SambaNova enhanced the SambaNova Suite – the only purpose-built, full stack LLM platform – with its revolutionary...

Blog

Enabling Open Source LLMs to Become Effective Tool Manipulators

At SambaNova, we have been researching and developing methods to train long sequence size (SS) models on our...

Blog

Training long sequence size models with SambaNova

At SambaNova, we have been researching and developing methods to train long sequence size (SS) models on our...

Podcast

GovTech Talks #6: CEO & Co-Founder at SambaNova Systems, Rodrigo Liang

What we need is technology, hardware and software, that allows us to begin the work of innovating, inventing,...

Podcast

The Future of LLMs, Compute Democratization and Open-Source Models

If they want to train their own models or work on their own models. And I think this goes for other types of...

Podcast

Why AI is Still Being Underhyped, Innovating at the Hardware Layer, and Why the Future of AI is Open Source

What’s different about this now is that the capabilities, the services that you’ll be able to provide the knowledge...

White Paper

AI Model Ownership: 3 Critical Considerations

Explores the importance of model ownership through the lenses of governance, model accuracy, and asset value.

Solution Brief

SambaNova Suite

SambaNova Suite for Generative AI offers a strategic opportunity for enterprise and government organizations to...

Blog

BLOOMChat: a New Open Multilingual Chat LLM

SambaNova and Together are excited to announce the public release of BLOOMChat, a 176 Billion parameter multilingual...

Blog

Accelerating HPC Simulations and AI with SambaNova

At ISC High Performance 2023, May 21-23 SambaNova will showcase how generative AI is already being used to...

Blog

Accenture and SambaNova: Delivering Generative AI to the Enterprise

The solutions from Accenture and SambaNova are designed to meet the demanding needs of enterprise organizations in...

Blog

Solving Enterprise Data Privacy and Security Concerns with Generative AI

Keep enterprise data private with a dedicated model backbone for generative AI

Blog

Domain Adapted Automated Speech Recognition

We show how one can use the SambaNova Suite to develop a model that is highly optimized towards a specific domain or...

Blog

OpenChatKit model available on SambaNova Suite for community and enterprise model adaptation

SambaNova is committed to the development of open-source technology and today we are excited to announce that the...

Blog

Three takeaways from SambaNova’s conversation on generative AI with Ed Abbo, President and Chief Technologist of C3 AI

SambaNova’s Co-founder and CEO Rodrigo Liang caught up with Vipul Prakash, Co-Founder and CEO of Together to discuss...

Blog

Introducing SambaNova Suite for Generative AI

I am excited to announce the SambaNova Suite for generative AI, the first generative AI platform specifically...

Blog

Generative AI for enterprise and government

Today SambaNova announced SambaNova Suite for generative AI, the first generative AI platform specifically optimized...

Blog

SambaNova Suite Product Demos

Below we have included a few of these demos so you can see for yourself how SambaNova Suite is empowering...

Blog

Three takeaways from SambaNova’s conversation on generative AI with Vipul Ved Prakash, Co-Founder and CEO of Together

SambaNova’s Co-founder and CEO Rodrigo Liang caught up with Vipul Prakash, Co-Founder and CEO of Together to discuss...