Insights & Information

Find what you need to accelerate your AI journey

Blog

Accelerating Coding with SambaNova Cloud

In this post, we demonstrate a useful and practical application of the SambaNova Cloud to power Continue, the...

Blog

Developer Tips: Creating Valuable AI

In this post, we explore the strategic considerations and decision-making processes that can help you create...

Video

DataScale SN30

Purpose-built for the most demanding AI and deep learning workloads, the DataScale SN30 is a fully integrated...

Video

SambaNova & Lawrence Livermore National Laboratory Accelerate AI for Science

SambaNova Systems and Lawrence Livermore National Laboratory (LLNL) scale up their collaboration to improve the...

Blog

Judging Judges: All that is LLM Judgements does not glitter

An examination of where LLM-as-a-Judge can satisfactorily act as a judge of another model's performance and where it...

Blog

Replacing the Judge: Can Llama 405B Outperform GPT4 in the Court of AI?

While LLM-as-a-Judge offers a favorable alternative to human evaluations, closed-source LLMs impose some limitations...

Blog

SambaNova Cloud: The fastest inference and the best models - for free

SambaNova is opening up the full spectrum of Llama models for developers to create the next wave of AI innovation.

Blog

Advanced AI Apps Need Fast Inference. SambaNova Cloud Delivers It

By improving inference performance, SambaNova has unlocked the full potential of Llama 3.1 405B and enabled...

Blog

Why SambaNova's SN40L Chip is The Best for Inference

Comparing the end-user inference performance of SambaNova's technology against that of Groq and Cerebras.

Blog

SubgoalXL: Pushing the Boundaries of LLM in Formal Theorem Proving

SubgoalXL represents a significant step forward in the field of AI-powered theorem proving.

Blog

SambaNova Holds Speed Record on Llama 3.1 405B - 4X faster than the rest

Today, we’ve set a world performance record of 114 tokens per second on Llama 3.1 405B, independently verified by...

Blog

Three Predictions for the Upcoming Llama 3 405B Announcement

Three predictions on how Llama 3 405B could reshape the landscape for developers engaged in AI and machine learning.

Blog

Typhoon model adds Thai language to Samba-1

With the inclusion of Typhoon Thai LLM, Samba-1 is now able to deliver generative AI capabilities in the Thai...

White Paper

Composition of Experts: Next Wave of AI Innovation

A new full-stack approach unlocks industry-leading speed and accuracy with 10x better TCO.

Blog

Does reduced precision hurt? A bit about losing bits.

Recent work highlighted how quantizing the latest Llama 3 models can lead to non-negligible decay in model...

Blog

SambaNova CEO explains why only one AI company wants a monopoly

Rodrigo Liang and veteran tech journalist Don Clark of The New York Times discussed how a full-stack approach to AI...

Blog

Transform Your Data Privacy with SambaNova Systems

Samba-1 provides role-based access controls to maintain data governance policies, ensuring only those with proper...

Blog

SambaNova has broken the 1000 t/s barrier: why it's a big deal for enterprise AI

SambaNova is the clear winner of the latest large language model (LLM) benchmark by Artificial Analysis. Topping the...

Blog

Model Ownership

As enterprises incorporate generative AI into their business, retaining model ownership is one of the most important...

Blog

Introducing Fugaku-LLM in Composition of Experts

The Fugaku-LLM, a Japanese LLM, is being introduced into the Samba-1 CoE architecture to run optimally on the...

Blog

Sovereign AI

A Sovereign AI solution is fully contained within an entity, such as a company or country, and meets objectives while...

Blog

NAIRR; Govt-funded AI Research Resources

NAIRR pilot, in partnership with SambaNova, provides generative AI platforms for groundbreaking academic research....

Blog

Tokens Per Second is Not All You Need

In this post, we explore why tokens per second doesn't paint the full picture of enterprise LLM inference...

Blog

The Next Generation of Large Models

Generative AI can streamline processes across the entire organization, reduce costs, increase productivity, improve...