DeepSeek V3-0324 is Now Live on SambaNova Cloud — The Fastest Inference in the World

Posted by SambaNova Systems on March 27, 2025

DeepSeek has just released their March 2025 update to the V3 model — V3-0324 — and it’s now the highest-scoring open-source non-reasoning model. According to Artificial Analysis, this is the first time an open-weights model has outperformed all proprietary non-reasoning models, including Claude 3.7 Sonnet and Gemini 2.0 Pro, while also topping open models such as Llama 3.3 70B.

This is a huge milestone for open source and a major leap forward for developers working on latency-sensitive applications where speed matters more than step-by-step reasoning. It’s also a strong signal of what’s to come, with DeepSeek V3-0324 paving the way for the anticipated R2 model.

[Chart: Artificial Analysis benchmark scores for DeepSeek V3-0324]

🚀 What is special about DeepSeek V3-0324?

This isn’t just a refresh — V3-0324 is a major performance upgrade over the original V3 release from December 2024. With stronger reasoning, faster code generation, and improved front-end design capabilities, developers are already calling this a “game-changing update.”

Built on a Mixture-of-Experts (MoE) architecture with 671B total parameters (but only 37B active per token), it’s designed for power and efficiency. Key features like Multi-Head Latent Attention (MLA) and multi-token prediction boost both context handling and output speed.
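To make the "671B total, 37B active" idea concrete, here is a minimal sketch of top-k MoE routing — the mechanism that activates only a few experts per token. All sizes are toy values chosen for illustration, not V3-0324's actual configuration.

```python
import numpy as np

rng = np.random.default_rng(0)

N_EXPERTS = 8   # toy value; V3 uses far more routed experts
TOP_K = 2       # experts activated per token
D_MODEL = 16    # toy hidden size

# Each expert is a small feed-forward layer (toy: one weight matrix).
experts = [rng.standard_normal((D_MODEL, D_MODEL)) * 0.1
           for _ in range(N_EXPERTS)]
router = rng.standard_normal((D_MODEL, N_EXPERTS)) * 0.1  # gating weights

def moe_forward(x):
    """Route a single token vector x through its top-k experts."""
    logits = x @ router                    # score every expert
    top = np.argsort(logits)[-TOP_K:]      # indices of the k best experts
    gates = np.exp(logits[top])
    gates /= gates.sum()                   # normalize gate weights
    # Only TOP_K of N_EXPERTS experts actually run for this token,
    # which is why active parameters are a fraction of the total.
    return sum(g * (x @ experts[i]) for g, i in zip(gates, top))

token = rng.standard_normal(D_MODEL)
out = moe_forward(token)
print(out.shape)  # (16,)
```

The routing step is what keeps inference cheap: parameter count grows with the number of experts, but per-token compute grows only with `TOP_K`.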

It also outperforms non-reasoning closed models like Claude 3.7 Sonnet and Gemini 2.0 Pro across key benchmarks.

💡 Why (and When) Should Developers Use It?

V3-0324 hits a sweet spot (especially on SambaNova Cloud): high accuracy, low cost, high speed — ideal for developers building real-world applications that demand performance and scale.

Whether you're a solo dev or an enterprise team, it’s perfect for:

  • Frontend and full-stack developers needing fast, accurate code generation
  • Product teams building dynamic user interfaces and tools
  • Startups and enterprises looking for top-tier performance without the price tag
  • Researchers who want open access without proprietary limits

In short: V3-0324 is the fastest, most cost-effective way to run a state-of-the-art open model today.

⚡ Fast, Affordable, and Open — Try It Now on SambaNova Cloud

Not every use case calls for a heavyweight reasoning model like R1. For developers and teams who need blazing-fast inference, strong accuracy, and lower cost, DeepSeek V3-0324 on SambaNova Cloud is the perfect fit.

Optimized for speed and throughput, V3-0324 is ideal for powering real-time apps, coding agents, and dynamic UIs — all without sacrificing quality.

We’re excited to offer V3-0324 on SambaNova Cloud, running at up to 250 tokens per second — the fastest inference speeds in the world, powered by our custom RDU architecture.

And it’s incredibly cost-effective:

  • $1.00 per million input tokens
  • $1.50 per million output tokens
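To see what those rates mean in practice, here is a back-of-the-envelope cost helper using the listed prices. The example workload (10M input + 2M output tokens) is hypothetical, chosen only to illustrate the arithmetic.

```python
def cost_usd(input_tokens, output_tokens,
             in_rate=1.00, out_rate=1.50):
    """Estimate cost at $in_rate per 1M input tokens and
    $out_rate per 1M output tokens."""
    return input_tokens / 1e6 * in_rate + output_tokens / 1e6 * out_rate

# Hypothetical daily workload: 10M input tokens, 2M output tokens.
print(cost_usd(10_000_000, 2_000_000))  # 13.0
```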

This makes V3-0324 a high-performance, low-cost solution for teams running high-throughput workloads — only possible on SambaNova Cloud.
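As a sketch of what calling the model might look like — assuming SambaNova Cloud exposes an OpenAI-compatible chat-completions endpoint at `https://api.sambanova.ai/v1` with the model ID `DeepSeek-V3-0324` (confirm both in the SambaNova Cloud docs before use):

```python
import json
import os
import urllib.request

# Assumed endpoint and model ID -- verify against the SambaNova Cloud docs.
API_URL = "https://api.sambanova.ai/v1/chat/completions"
MODEL_ID = "DeepSeek-V3-0324"

def build_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a chat-completions request."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode(),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )

req = build_request(
    "Write a debounce function in TypeScript.",
    os.environ.get("SAMBANOVA_API_KEY", "sk-..."),
)
print(req.full_url)
# To actually send it: urllib.request.urlopen(req)
```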
