Resources

Insights and information to accelerate your AI journey
Test-Time Compute Available on SambaNova Cloud with Qwen QwQ-32B-Preview

Test-Time Compute Available on SambaNova Cloud with Qwen QwQ-32B-Preview

December 18, 2024
Meta Llama 3.3 70B Now Available Today for Developers and Enterprises

Meta Llama 3.3 70B Now Available Today for Developers and Enterprises

December 11, 2024
The SambaNova Startup Accelerator: Helping AI Innovators Realize Their Vision

The SambaNova Startup Accelerator: Helping AI Innovators Realize Their Vision

December 10, 2024
Qwen 2.5 32B-Coder Available on SambaNova Cloud - 5X Faster than GPUs

Qwen 2.5 32B-Coder Available on SambaNova Cloud - 5X Faster than GPUs

December 6, 2024
Hugging Face Makes it Faster to Review Papers with SambaNova

Hugging Face Makes it Faster to Review Papers with SambaNova

December 4, 2024
How Gradio Makes Building Apps on SambaNova Cloud Super Easy

How Gradio Makes Building Apps on SambaNova Cloud Super Easy

December 4, 2024
Zilliz: Powering AI RAG Applications with Vector Embeddings

Zilliz: Powering AI RAG Applications with Vector Embeddings

December 3, 2024
Outperforming GPT-4o with Llama 3 8B: Domain Specific Fine Tuning for RAG

Outperforming GPT-4o with Llama 3 8B: Domain Specific Fine Tuning for RAG

November 20, 2024
Correcting Common AI Benchmarking Errors with AI Starter Kits

Correcting Common AI Benchmarking Errors with AI Starter Kits

November 11, 2024
Accelerating Coding with SambaNova Cloud

Accelerating Coding with SambaNova Cloud

October 10, 2024
Developer Tips: Creating Valuable AI

Developer Tips: Creating Valuable AI

October 3, 2024
Judging Judges: All that is LLM Judgements does not glitter

Judging Judges: All that is LLM Judgements does not glitter

September 19, 2024
Replacing the Judge: Can Llama 405B Outperform GPT4 in the Court of AI?

Replacing the Judge: Can Llama 405B Outperform GPT4 in the Court of AI?

September 19, 2024
Advanced AI Apps Need Fast Inference. SambaNova Cloud Delivers It

Advanced AI Apps Need Fast Inference. SambaNova Cloud Delivers It

September 10, 2024
Why SambaNova's SN40L Chip is The Best for Inference

Why SambaNova's SN40L Chip is The Best for Inference

September 10, 2024
SubgoalXL: Pushing the Boundaries of LLM in Formal Theorem Proving

SubgoalXL: Pushing the Boundaries of LLM in Formal Theorem Proving

September 3, 2024
Does reduced precision hurt? A bit about losing bits.

Does reduced precision hurt? A bit about losing bits.

June 20, 2024
Tokens Per Second is Not All You Need

Tokens Per Second is Not All You Need

May 1, 2024
Samba-CoE v0.3: The Power of Routing ML Models at Scale

Samba-CoE v0.3: The Power of Routing ML Models at Scale

April 11, 2024
SambaLingo hits 15,000+ downloads, now integrated with Samba-CoE-v0.2

SambaLingo hits 15,000+ downloads, now integrated with Samba-CoE-v0.2

April 8, 2024
Using Mixed Precision on RDUs

Using Mixed Precision on RDUs

March 21, 2024
Benchmarking Samba-1

Benchmarking Samba-1

February 28, 2024
Samba-CoE v0.1 - Unlocking the power of routing to build a Composition of Experts

Samba-CoE v0.1 - Unlocking the power of routing to build a Composition of Experts

February 28, 2024
SambaLingo - Open Source Language Experts

SambaLingo - Open Source Language Experts

February 26, 2024