Products
Developers
About

Insights & Information

Find what you need to accelerate your AI journey

Blog

SambaNova has broken the 1000 t/s barrier: why it's a big deal for enterprise AI

SambaNova is the clear winner of the latest large language model (LLM) benchmark by Artificial Analysis. Topping the...

Blog

Model Ownership

As enterprises incorporate generative AI into their business, retaining model ownership is one of the most important...

Blog

Introducing Fugaku-LLM in Composition of Experts

The Fugaku-LLM, a Japanese LLM, is being introduced into the Samba-1 CoE architecture to run optimally on the...

Blog

Sovereign AI

A Sovereign AI solution is fully contained with an entity, such as a company or country, and meets objectives while...

Blog

NAIRR; Govt-funded AI Research Resources

NAIRR pilot, in partnership with SambaNova, provides generative AI platforms for groundbreaking academic research....

Blog

Tokens Per Second is Not All You Need

In this post, we explore why tokens per second doesn't paint the full picture of enterprise LLM inference...

Blog

The Next Generation of Large Models

Generative AI can streamline processes across the entire organization, reduce costs, increase productivity, improve...

Blog

Enterprise-grade AI

See what it means to be enterprise-grade, and how SambaNova Suite, the first full-stack platform, purpose-built for...

Blog

Samba-CoE v0.3: The Power of Routing ML Models at Scale

Samba-CoE-v0.3, our latest Composition of Experts, surpasses DBRX Instruct 132B and Grok-1 314B on the OpenLLM...

Blog

Responsible AI

SambaNova is committed to providing customers with responsible, generative AI that is safe, secure, and transparent,...

Blog

SambaLingo hits 15,000+ downloads, now integrated with Samba-CoE-v0.2

SambaLingo has been downloaded over 15,000 times and has achieved remarkable performance of 280 tokens/s inference...

Blog

SambaNova Delivers Accurate Models At Blazing Speed

Samba-CoE v0.2 is climbing on the AlpacaEval leaderboard, outperforming all of the latest open-source models.

Blog

Using Mixed Precision on RDUs

SambaFlow 1.18 introduces support for mixed precision on RDUs, streamlining the experience for model developers and...

Blog

Sambaverse: Discover, Compare, Evaluate

Sambaverse is a unique environment where developers can freely test out hundreds of different models and directly...

Report

Get the IDC LINK Research opinion on the Samba-1 release

See why IDC says that Samba-1 "is harnessing the power of specialized AI models for a broad spectrum of business...

Blog

Benchmarking Samba-1

Benchmarking Samba-1 with the EGAI benchmark - a comprehensive collection of widely adapted benchmarks sourced from...

Blog

Samba-CoE v0.1 - Unlocking the power of routing to build a Composition of Experts

We're thrilled to unveil Samba-CoE-v0.1, a scaled down version of Samba-1, our latest breakthrough model that...

Blog

Samba-1: A Composition of Experts Model

Announcing Samba-1, the first trillion-parameter generative AI model that meets the performance, accuracy,...

Blog

SambaLingo - Open Source Language Experts

SambaNova is excited to open source a collection of expert models that adapt Llama 2 to a diverse set of 9 languages.

Blog

Text-to-SQL accuracy that beats GPT-4

Users can access valuable information locked in their SQL databases faster and easier than ever before.

Blog

Introducing the SambaCoder-nsql-Llama-2-70B model

Numbers Station and SambaNova have released a text-to-SQL model that surpasses the accuracy of GPT-4.

Blog

BLOOMChat-v2 Long Sequences at 176B

We are proud to release BLOOMChat-v2, a 32K sequence length, 176B multilingual language model.

Blog

SambaNova Joins NAIRR pilot program to support strategic national AI research initiative

The NAIRR Pilot collaboration between the NSF, White House Office of Science and Technology Policy, and SambaNova...

Blog

Fault management and RDA systems: Part 2

One of the key characteristics of our system is performance for enterprise data center AI workloads, and SNFM...