Blog 2

Use this space to tell everyone about what you have to offer.

February 4, 2025

On SambaNova Cloud, Tülu 3 405B, A New Model Better than DeepSeek V3

We are excited to announce the addition of Tülu 3 405B, a fine tune of Llama 405B that performs better than DeepSeek...

January 28, 2025

Hugging Face Partners with SambaNova to Supercharge its Inference API Capabilities

Available today, Hugging Face developers can take advantage of the lightning fast inference speeds made possible...

January 15, 2025

Unlock the Future of Multi-Agent AI Workflows with CrewAI and SambaNova

Agentic AI is driving the next wave of AI innovation. Agentic systems combine multiple AI models that can work...

December 18, 2024

Test-Time Compute Available on SambaNova Cloud with Qwen QwQ-32B-Preview

Available today on SambaNova Cloud, developers now have access to the best open source test-time compute model...

December 11, 2024

Meta Llama 3.3 70B Now Available Today for Developers and Enterprises

In the rapidly evolving landscape of artificial intelligence, SambaNova has once again demonstrated its commitment...

December 10, 2024

The SambaNova Startup Accelerator: Helping AI Innovators Realize Their Vision

At SambaNova, we understand the journey of early-stage AI innovators—the bold ideas, the challenges, and the...

December 6, 2024

Qwen 2.5 32B-Coder Available on SambaNova Cloud - 5X Faster than GPUs

Available today on SambaNova Cloud, developers now have access to some of the best open-source models in the Qwen...

December 4, 2024

How Gradio Makes Building Apps on SambaNova Cloud Super Easy

When we launched SambaNova Cloud two months ago, we set out to make it easy for developers to create their own...

December 4, 2024

Hugging Face Makes it Faster to Review Papers with SambaNova

Navigating the vast landscape of research papers, especially in the rapidly evolving field of artificial...

December 3, 2024

Zilliz: Powering AI RAG Applications with Vector Embeddings

As AI developers strive to build faster, more accurate and contextually relevant Retrieval Augmented Generation...

November 20, 2024

Outperforming GPT-4o with Llama 3 8B: Domain Specific Fine Tuning for RAG

Many of our customers are finding that while closed source models from OpenAI and Anthropic work well for general...

November 11, 2024

Correcting Common AI Benchmarking Errors with AI Starter Kits

In working closely with our customers to optimize inference performance, we’ve seen firsthand how small benchmarking...

October 10, 2024

Accelerating Coding with SambaNova Cloud

The leading open source model for the leading coding assistant

October 3, 2024

Developer Tips: Creating Valuable AI

As developers, it's easy to get caught up in the technical details of building complex apps. However, when creating...

September 19, 2024

Replacing the Judge: Can Llama 405B Outperform GPT4 in the Court of AI?

The evaluation of Large Language Models (LLMs) is critical for driving innovation in AI, yet as models become more...

September 19, 2024

Judging Judges: All that is LLM Judgements does not glitter

Having robust evaluation systems for language models is critical for developing and improving existing technologies....

September 10, 2024

Advanced AI Apps Need Fast Inference. SambaNova Cloud Delivers It

Inference Performance for Llama 3.1 405B in Function Calling & Agentic Workflows Today, we launched the SambaNova...

September 10, 2024

Why SambaNova's SN40L Chip is The Best for Inference

The AI hardware landscape is rapidly evolving, with several innovative companies now providing compelling solutions...

September 3, 2024

SubgoalXL: Pushing the Boundaries of LLM in Formal Theorem Proving

Formal theorem proving, using languages like Lean or Isabelle, represents a frontier for both mathematics and LLM...

July 29, 2024

SambaNova Holds Speed Record on Llama 3.1 405B - 4X faster than the rest

In today's fast-paced business landscape, enterprises need more than just the latest AI model to solve their biggest...

June 20, 2024

Does reduced precision hurt? A bit about losing bits.

SambaNova and Groq recently achieved 1000 tokens per second on their inference system for Meta’s LLaMa 3 8b Instruct...

May 13, 2024

Introducing Fugaku-LLM in Composition of Experts

The Composition of Experts (CoE) architecture that the Samba-1 model is based upon has many features that make it...

May 6, 2024

Sovereign AI

Artificial intelligence has become vital to nations, governments, and large corporations. Many of these large...

May 6, 2024

NAIRR; Govt-funded AI Research Resources

Artificial intelligence (AI) is driving the next generation of technological innovation and scientific discovery....

Subscribe

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Nullam ultrices massa sit amet auctor scelerisque. Cras vel quam non lorem tincidunt facilisis.