Note
Access to this page requires authorization. You can try signing in or changing directories.
Access to this page requires authorization. You can try changing directories.
The NDGB300v6 series is Azure’s next-generation GPU VM line purpose-built for large-scale AI, especially high-throughput inference for reasoning and agentic systems. These VMs are powered by NVIDIA GB300 NVL72 rackscale systems (Blackwell Ultra GPUs + Grace CPUs). Each ND-GB300-v6 VM has two NVIDIA Grace CPUs and four NVIDIA Blackwell B300 GPUs. The GPUs are interconnected via fifth-generation NVLINK, providing a total of 4× 1.8 TB/s NVLINK bandwidth per VM. Each GPU also has 800 Gb/s InfiniBand connectivity, enabling cluster-scale performance with low latency. There are 18 VMs per rack, so effectively 72 NVIDIA Blackwell Ultra GPUs with 36 NVIDIA Grace CPUs, exposing ~37 TB of fast memory (~20TB High Bandwidth Memory, ~17TB of CPU Memory), 130 TB/s of intrarack NVLink bandwidth.
At rack scale, ND GB300 v6 delivers up to 1.44 exaFLOPS FP4 Tensor Core performance per NVL72 domain and has demonstrated ~1.1 million tokens/second LLM inference throughput per rack, which is ~27% higher than ND GB200 v6. See here for more details. Compared to GB200, GB300 provides ~1.5× FP4 compute, 50% higher HBM capacity (288 GB HBM3E per GPU vs. 192 GB), and 2x the back-end network bandwidth per GPU (800Gbps vs 400 Gbps).
The ND GB300 v6 architecture builds on the ND v6 GB200 to efficiently distribute workloads and memory demands across multiple GPUs, driving markedly higher inference throughput for long context and multimodal workloads AI and scientific applications. These instances deliver best-in-class performance for AI, ML, and analytics workloads with out-of-the-box support for frameworks like PyTorch, Tensorflow, JAX, RAPIDS, and more.
Host specifications
| Part | Quantity Count Units |
Specs SKU ID, Performance Units, etc. |
|---|---|---|
| Processor | 128 vCPUs | Nvidia Grace CPU |
| Memory | 864 VM | LPDDR |
| Local Storage | 4 Disks | 16TB NVME Direct |
| Remote Storage | 16 Disks | 80000 IOPS/1200 MBps |
| Network | 1 NICs | 160Gb/s Ethernet |
| Accelerators | 4 GPUs | Nvidia Blackwell Ultra GPU (288GB) |
For features supported by this series, see the Feature support section.
Sizes in series
vCPUs (Qty.) and Memory for each size
| Size Name | vCPUs (Qty.) | Memory (GB) |
|---|---|---|
| Standard_ND128isr_GB300_v6 | 128 | 864 |
VM Basics resources
Feature support
| Feature name | Support status |
|---|---|
| Premium Storage | Supported |
| Premium Storage caching | Supported |
| Live Migration | Not Supported |
| Memory Preserving Updates | Not Supported |
| Generation 2 VMs | Supported |
| Generation 1 VMs | Not Supported |
| Accelerated Networking | Supported |
| Ephemeral OS Disk | Supported |
| Nested Virtualization | Not Supported |
Other size information
List of all available sizes: Sizes
Pricing Calculator: Pricing Calculator
Information on Disk Types: Disk Types
Next steps
Take advantage of the latest performance and features available for your workloads by changing the size of a virtual machine.
Utilize Microsoft's in-house designed ARM processors with Azure Cobalt VMs.
Learn how to Monitor Azure virtual machines.