NVIDIA Rubin Is in Production: What Philippine Server Buyers Should Know

NVIDIA's Vera Rubin platform — the successor to Blackwell — is now in full production. It was formally announced at GTC 2026, and system manufacturers including Dell, HPE, Lenovo, and Cisco have confirmed server products built on the platform. Philippine enterprise buyers evaluating AI server infrastructure in 2026 need to understand what Rubin is, what it delivers, and when hardware will actually be in market.
What Rubin Is
Vera Rubin is a full rack-scale AI compute platform. It combines seven components:
- Vera CPU — NVIDIA's first CPU built specifically for agentic AI and inference workloads
- Rubin GPU — the successor to the Blackwell B200/B100; delivers 3–4× the compute density
- NVLink 6 — scale-up networking between GPUs within a rack
- ConnectX-9 SuperNIC — 800Gbps NIC for inter-rack scale-out
- BlueField-4 DPU — network and security offload at the rack level
The platform is designed around a single argument: agentic AI — long-context inference, multi-step reasoning, and continuous learning — requires fundamentally different hardware than the transformer training workloads Blackwell was built for. Rubin is NVIDIA's answer to that shift.
The Numbers That Matter
| Metric | Blackwell → Rubin |
|---|---|
| Compute density | 3–4× improvement per rack |
| GPUs needed for equivalent training | 4× fewer |
| Inference cost per token | 10× lower |
| Transfer time (Blackwell → Rubin) | Zero (backward compatible toolchain) |
For enterprises running inference workloads — RAG pipelines, Copilot integrations, document processing, internal AI agents — the cost-per-token improvement is the number that drives ROI decisions. A workload that cost ₱500,000 per month on Blackwell-era infrastructure should cost ~₱50,000 on Rubin-based systems at equivalent output quality.
What Dell Is Building
Dell marked the two-year anniversary of the Dell AI Factory at NVIDIA GTC 2026 with four Rubin-based PowerEdge announcements:
| Model | Platform | Configuration | Timeline |
|---|---|---|---|
| PowerEdge XE9812 | Vera Rubin NVL72 | 72 Rubin GPUs, liquid-cooled, flagship | H2 2026 |
| PowerEdge XE9880L | HGX Rubin NVL8 | 8 Rubin GPUs, liquid-cooled | H2 2026 |
| PowerEdge R9822 | Vera CPU server | CPU-only Rubin workload server | September 2026 |
| PowerEdge M9822 | Vera CPU blade | Blade form factor | September 2026 |
The R9822 and M9822 are the most relevant for Philippine enterprise buyers — CPU-primary servers optimised for inference and agentic workloads, not frontier model training. September 2026 availability makes them a realistic target for Q4 2026 procurement.
What HPE Is Building
HPE announced the NVIDIA Vera Rubin NVL72 by HPE: a fully integrated, rack-scale, liquid-cooled system supporting 72 Rubin GPUs and 36 Vera CPUs — designed for models in excess of 1 trillion parameters.
This is HPE's flagship AI factory system. Availability is targeted for December 2026. For most Philippine enterprise buyers, this is a data-centre-scale product; the Dell R9822 at September 2026 is the more accessible entry point.
Project DIGITS — The Desktop AI Supercomputer
On a smaller scale: NVIDIA's Project DIGITS started shipping in May 2026. It is a compact personal AI supercomputer at $3,000 USD, built on the GB10 Grace Blackwell Superchip. It delivers 1 petaflop of AI compute at FP8 precision — enough to run 200-billion-parameter models locally.
For Philippine businesses, DIGITS is relevant in two contexts: research teams that need local inference without cloud API costs, and organisations in regulated industries (banking, healthcare, legal) that cannot send data to cloud AI services for DPA compliance reasons.
The Supermicro Factor
Supermicro — historically a significant volume supplier of NVIDIA-based AI servers — is currently navigating an export compliance and financial reporting crisis. For enterprises and sovereign AI programmes evaluating AI server vendors, this introduces supply chain and compliance risk that Dell and HPE do not carry. The Rubin platform launch reinforces what was already a multi-quarter trend: GPU allocation is concentrating with Dell, HPE, and Lenovo.
What Philippine Buyers Should Do Now
If your server refresh is planned for H2 2026: hold the evaluation open. Dell PowerEdge R9822 systems at September 2026 represent a meaningful capability jump over current-generation infrastructure at the same approximate price point.
If you are evaluating AI infrastructure for a specific workload: the 10× lower cost-per-token on inference is a genuine economic argument, not a benchmark claim. Request a workload sizing from your supplier based on Rubin specs, not Blackwell specs.
If you are considering Supermicro: factor the compliance posture into your risk assessment. The Rubin generation is where Dell and HPE have the most differentiated platform roadmaps.
If your workload is inference-only at moderate scale: the Vera CPU-only servers (R9822) may be sufficient, and at a substantially lower capital cost than GPU-dense configurations.
We carry Dell PowerEdge and HPE ProLiant across the enterprise and SME range. If you are evaluating AI server infrastructure for a Philippine deployment, get in touch for a scoped specification.
Talk to our I.T. Hardware team →

