Enterprises and organizations requiring:
Typical adopters include cloud service providers, enterprises building private AI clusters, and research institutions standardizing on Hopper-generation infrastructure.
The H100 is widely available through OEM systems, DGX platforms, and cloud providers, and is commonly deployed in 4- or 8-GPU configurations within HGX- or DGX-based systems. PCIe variants enable broader enterprise server integration, while SXM variants are used for maximum density and scale-up performance. Pricing and allocation vary by form factor, vendor, and region, reflecting ongoing demand patterns for data center accelerators.
The H100 is a Hopper-generation data center GPU designed to accelerate modern AI training and inference through FP8-focused tensor compute and high-bandwidth HBM3 memory. It introduced Hopperβs Transformer Engine to improve efficiency and throughput for transformer workloads while maintaining strong performance for mixed precision and HPC use cases.
H100 is deployed as a GPU module in SXM-based multi-GPU platforms and as a PCIe accelerator for broader enterprise server integration. It commonly underpins 4- or 8-GPU baseboards and turnkey systems used in AI factories, hyperscale training clusters, and enterprise inference deployments.
| Specification | H100 GPU |
|---|---|
| Architecture | NVIDIA Hopper |
| Memory | 80Β GB HBM3 (SXM, PCIe); 94Β GB HBM3 (H100 NVL PCIe variant) |
| Memory Bandwidth | 3.35Β TB/s (SXM); ~2.0Β TB/s (PCIe); 3.9Β TB/s (H100 NVL) |
| Interconnect | NVLinkΒ 4 (multi-GPU) |
| Form Factor | SXM, PCIe (including H100 NVL variant) |
| Max TGP | ~700Β W (SXM); ~350Β W (PCIe) |
| Precision Support | FP64, TF32, FP16, BF16, FP8, INT8 |
| Typical AI Compute | ~4Β PFLOPS (FP8, sparsity-dependent) |
| Process Node | TSMCΒ 4N |
| Transistor Count | ~80Β billion |
| MIG Support | Supported |
| NVLink (peer) | ~900Β GB/s bidirectional aggregate (SXM) |
Compared to A100, the H100 delivers materially higher effective throughput on modern transformer workloads through FP8-optimized tensor compute and higher memory bandwidth, while retaining strong support for HPC and mixed AI workloads.
Comparable NVIDIA GPUs:
Competitor GPUs:
For organizations operating existing Hopper deployments, H100 remains a strong baseline for training and inference, while H200 is the typical upgrade when memory bandwidth and capacity become limiting. Buyers planning frontier-scale training or longer infrastructure lifecycles may evaluate B200 depending on availability, power constraints, and platform readiness.
Related NVIDIA GPUs:
Competitor GPUs:
The H100 is NVIDIAβs Hopper-generation flagship data center GPU that established FP8-optimized tensor compute and high-bandwidth HBM3 as the baseline for modern AI infrastructure. It delivers strong training and inference performance for transformer workloads, supports multi-GPU scale-up through NVLink 4, and remains broadly available across DGX, HGX, OEM, and cloud platforms. H100 is a common foundation for large-scale AI clusters, with H200 serving as the primary memory-focused upgrade path and B200 representing the next-generation step for organizations planning frontier-scale deployments.