ND GB300-v6 sizes series

The ND GB300 v6 series is Azure's next-generation GPU VM line, purpose-built for large-scale AI, especially high-throughput inference for reasoning and agentic systems. These VMs are powered by NVIDIA GB300 NVL72 rack-scale systems (Blackwell Ultra GPUs and Grace CPUs). Each ND GB300 v6 VM has two NVIDIA Grace CPUs and four NVIDIA Blackwell B300 GPUs. The GPUs are interconnected via fifth-generation NVLink, which provides 1.8 TB/s per GPU (4 × 1.8 TB/s per VM). Each GPU also has 800 Gb/s of InfiniBand connectivity, enabling low-latency, cluster-scale performance. Each rack holds 18 VMs, for a total of 72 NVIDIA Blackwell Ultra GPUs and 36 NVIDIA Grace CPUs. Together they expose ~37 TB of fast memory (~20 TB of High Bandwidth Memory and ~17 TB of CPU memory) and 130 TB/s of intra-rack NVLink bandwidth.
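
The rack-level totals follow from the per-VM and per-GPU figures. The short Python sketch below reproduces that arithmetic. It assumes decimal units (1 TB = 1000 GB) and takes the 288 GB per-GPU HBM capacity from the host specifications; the constant and function names are illustrative and are not part of any Azure API.

```python
# Per-VM and per-GPU figures quoted in this article.
VMS_PER_RACK = 18
GPUS_PER_VM = 4
CPUS_PER_VM = 2
HBM_PER_GPU_GB = 288        # HBM3E capacity per Blackwell Ultra GPU
NVLINK_PER_GPU_TBPS = 1.8   # fifth-generation NVLink bandwidth per GPU


def rack_totals():
    """Aggregate the per-VM figures to one NVL72 rack (assumes 1 TB = 1000 GB)."""
    gpus = VMS_PER_RACK * GPUS_PER_VM
    cpus = VMS_PER_RACK * CPUS_PER_VM
    return {
        "gpus": gpus,
        "cpus": cpus,
        "hbm_tb": gpus * HBM_PER_GPU_GB / 1000,
        "nvlink_tbps": gpus * NVLINK_PER_GPU_TBPS,
    }


if __name__ == "__main__":
    for name, value in rack_totals().items():
        print(f"{name}: {value:.1f}")
```

Under these assumptions the sketch gives 72 GPUs, 36 CPUs, about 20.7 TB of HBM, and about 129.6 TB/s of NVLink bandwidth, which matches the rounded ~20 TB and 130 TB/s figures.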

At rack scale, ND GB300 v6 delivers up to 1.44 exaFLOPS of FP4 Tensor Core performance per NVL72 domain and has demonstrated ~1.1 million tokens/second of LLM inference throughput per rack, ~27% higher than ND GB200 v6. Compared to GB200, GB300 provides ~1.5× the FP4 compute, 50% more HBM capacity (288 GB of HBM3E per GPU vs. 192 GB), and 2× the back-end network bandwidth per GPU (800 Gb/s vs. 400 Gb/s).
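
As a rough cross-check of these figures, the Python sketch below derives the generation-over-generation ratios and some per-GPU values from the quoted rack-level numbers. The per-GPU values and the implied GB200 throughput are derived here by simple division, assuming 72 GPUs per NVL72 domain; they are not separately published measurements, and all names are illustrative.

```python
# Figures quoted in the text; everything else below is derived from them.
GPUS_PER_RACK = 72              # one NVL72 domain
RACK_FP4_PFLOPS = 1440          # 1.44 exaFLOPS expressed in petaFLOPS
RACK_TOKENS_PER_S = 1_100_000   # demonstrated throughput (approximate)
UPLIFT_VS_GB200 = 0.27          # "~27% higher than ND GB200 v6"

HBM_GB = {"GB200": 192, "GB300": 288}
BACKEND_GBPS = {"GB200": 400, "GB300": 800}


def generation_ratios():
    """GB300-to-GB200 ratios for HBM capacity and back-end network bandwidth."""
    return {
        "hbm": HBM_GB["GB300"] / HBM_GB["GB200"],
        "backend_network": BACKEND_GBPS["GB300"] / BACKEND_GBPS["GB200"],
    }


def per_gpu_figures():
    """Rack-level figures divided evenly across the GPUs in the rack."""
    return {
        "fp4_pflops": RACK_FP4_PFLOPS / GPUS_PER_RACK,
        "tokens_per_s": RACK_TOKENS_PER_S / GPUS_PER_RACK,
    }


def implied_gb200_rack_tokens_per_s():
    """Back out the GB200 baseline implied by the ~27% uplift (an estimate)."""
    return RACK_TOKENS_PER_S / (1 + UPLIFT_VS_GB200)


if __name__ == "__main__":
    print(generation_ratios())
    print(per_gpu_figures())
    print(round(implied_gb200_rack_tokens_per_s()))
```

The ratios come out to 1.5 for HBM capacity and 2.0 for back-end bandwidth, consistent with the text. The per-GPU values (about 20 petaFLOPS of FP4 and roughly 15,300 tokens/second) are averages only, since both rack-level inputs are approximate.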

The ND GB300 v6 architecture builds on ND GB200 v6 to distribute workloads and memory demands efficiently across multiple GPUs, driving markedly higher inference throughput for long-context and multimodal workloads in AI and scientific applications. These instances deliver best-in-class performance for AI, ML, and analytics workloads, with out-of-the-box support for frameworks such as PyTorch, TensorFlow, JAX, and RAPIDS.

Host specifications

Part             Quantity (Count, Units)   Specs (SKU ID, Performance Units, etc.)
Processor        128 vCPUs                 NVIDIA Grace CPU
Memory           864 GB                    LPDDR
Local Storage    4 Disks                   16 TB NVMe Direct
Remote Storage   16 Disks                  80000 IOPS / 1200 MBps
Network          1 NIC                     160 Gb/s Ethernet
Accelerators     4 GPUs                    NVIDIA Blackwell Ultra GPU (288 GB)

For features supported by this series, see the Feature support section.

Sizes in series

vCPUs (Qty.) and Memory for each size

Size Name                    vCPUs (Qty.)   Memory (GB)
Standard_ND128isr_GB300_v6   128            864
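
When scripting against this size, it can help to split the size name into its components and cross-check the vCPU count against the table. The Python sketch below does that with a regular expression. The pattern is an assumption fitted to this one name and is not an official Azure parser; `parse_size_name` and the field names are hypothetical, and the feature letters are extracted without being interpreted.

```python
import re

# Assumed pattern: tier, family letters, vCPU count, lowercase feature
# letters, accelerator type, and version, separated by underscores.
SIZE_PATTERN = re.compile(
    r"^(?P<tier>[A-Za-z]+)_"
    r"(?P<family>[A-Z]+)(?P<vcpus>\d+)(?P<features>[a-z]*)_"
    r"(?P<accelerator>[A-Za-z0-9]+)_"
    r"(?P<version>v\d+)$"
)


def parse_size_name(name):
    """Split a size name into its parts; raise ValueError if it does not fit."""
    match = SIZE_PATTERN.match(name)
    if match is None:
        raise ValueError(f"unrecognized size name: {name!r}")
    parts = match.groupdict()
    parts["vcpus"] = int(parts["vcpus"])
    return parts


if __name__ == "__main__":
    print(parse_size_name("Standard_ND128isr_GB300_v6"))
```

For `Standard_ND128isr_GB300_v6` this yields 128 vCPUs, family `ND`, accelerator `GB300`, and version `v6`, in agreement with the table. Names with a different layout raise `ValueError` rather than being guessed at.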

VM Basics resources

Feature support

Feature name                Support status
Premium Storage             Supported
Premium Storage caching     Supported
Live Migration              Not Supported
Memory Preserving Updates   Not Supported
Generation 2 VMs            Supported
Generation 1 VMs            Not Supported
Accelerated Networking      Supported
Ephemeral OS Disk           Supported
Nested Virtualization       Not Supported

Other size information

List of all available sizes: Sizes

Pricing Calculator: Pricing Calculator

Information on Disk Types: Disk Types

Next steps

Take advantage of the latest performance and features available for your workloads by changing the size of a virtual machine.

Use Microsoft's in-house designed Arm processors with Azure Cobalt VMs.

Learn how to Monitor Azure virtual machines.