Premium AI inference for Europe
OVHcloud powered by SambaNova
OVHcloud partners with SambaNova to deliver Europe’s most advanced AI inference platform. Experience unmatched speed, scalability, and efficiency for mission-critical generative AI workloads.
Redefining AI performance in the cloud. OVHcloud leads the way.
Demanding AI applications require more than generic GPU clusters. OVHcloud’s Premium AI Endpoints – powered by SambaNova’s full-stack technology – deliver record-breaking inference speeds, ultra-low latency, and massive model scalability while slashing operational costs.
Why OVHcloud and SambaNova partnered
SambaNova’s Reconfigurable Dataflow Units (RDUs) enable OVHcloud to deliver unprecedented throughput-per-watt. Each SambaRack processes trillion-parameter models at 10 kW average power – outperforming legacy solutions while minimizing the carbon footprint.
Fast inference for low-latency applications
Developers expect the best performance without compromise. SambaRack delivers fast inference on leading open-source models such as OpenAI’s gpt-oss and DeepSeek, so developers can focus on their applications while we handle the rest.
Best energy efficiency
Model bundling reduces physical footprint
SambaNova’s 3-tier memory architecture enables OVHcloud to serve more models with less hardware. Multiple models can be served per SambaRack and can be hot-swapped at runtime with very low switching times.
“SambaNova provides the raw power and efficiency we demand for premium AI. Our partnership lets customers deploy more models in less space, achieving enterprise-grade inference no other provider can match.”
— Octave Klaba, CEO, OVHcloud
OVHcloud Premium AI Endpoints launch in 2026
Sign up for the waitlist