Blogs

Judging Judges: All that is LLM Judgements does not glitter

September 19, 2024
Advanced AI Apps Need Fast Inference. SambaNova Cloud Delivers It

September 10, 2024
Why SambaNova's SN40L Chip Is the Best for Inference

September 10, 2024
SubgoalXL: Pushing the Boundaries of LLM in Formal Theorem Proving

September 3, 2024
SambaNova Holds Speed Record on Llama 3.1 405B - 4X faster than the rest

July 29, 2024
Does reduced precision hurt? A bit about losing bits.

June 20, 2024
Introducing Fugaku-LLM in Composition of Experts

May 13, 2024
Sovereign AI: Full-Stack Infrastructure for AI Autonomy

May 6, 2024
NAIRR; Govt-funded AI Research Resources

May 6, 2024
Tokens Per Second is Not All You Need

May 1, 2024
Samba-CoE v0.3: The Power of Routing ML Models at Scale

April 11, 2024
Responsible AI

April 10, 2024
SambaLingo hits 15,000+ downloads, now integrated with Samba-CoE-v0.2

April 8, 2024
AI Power: Accurate Models at Blazing Speeds

March 27, 2024
Using Mixed Precision on RDUs

March 21, 2024
Benchmarking Samba-1

February 28, 2024
Samba-CoE v0.1: Unlocking the Power of Experts

February 28, 2024
High-Accuracy AI Models in 9 Languages

February 26, 2024
SambaCoder-nsql-Llama-2-70B model

February 13, 2024
BLOOMChat-v2 Long Sequences at 176B

February 7, 2024
SambaNova RDAs: Mastering Fault Management (Part 2)

January 25, 2024
SambaNova RDAs: Mastering Fault Management (Part 1)

January 25, 2024
ALiBi Deep Dive: Interpolation vs. Extrapolation

December 22, 2023
Elevating Information Retrieval and Augmenting Large Language Models

November 21, 2023