The NVIDIA H200 targets large-model inference and memory-heavy training with 141GB of HBM3e memory, but that extra capacity still carries a steep cloud premium.
In this guide, we break down the NVIDIA H200 pricing landscape for June 2026. We compare on-demand rates across hyperscalers like AWS and Azure alongside specialized GPU clouds like Hyperbolic and RunPod to help you find the most cost-effective path for your workloads.
Key Takeaways
- H200 premiums remain steep. Even after AWS's June price cut, the cheapest hyperscaler H200 hour costs 3 times more than Thunder Compute's H100.
- Specialist clouds narrow the gap. Hyperbolic, RunPod, and Vast.ai all sit in the $2–5 range, but Thunder Compute's H100 is still cheaper.
- Choose H200 only when you must: Running massive models that overflow 80 GB VRAM or long-context inference. For development, fine-tuning, and most training, H100 80 GB wins on ROI.
- Thunder Compute roadmap. We don't offer H200 nodes yet; but you can launch an H100 80 GB at $1.38/hr (one-click VS Code, per-minute billing, persistent volumes, live hardware swaps).
| Provider | SKU / Instance | On-Demand $/GPU-hr* | Notes |
|---|---|---|---|
| Hyperbolic | H200 | $2.40 | Marketplace pricing. |
| Vast.ai | Marketplace | $3.21 | Marketplace current price per H200 GPU. |
| Crusoe Cloud | H200 | $4.29 | Public listed hourly pricing. |
| RunPod | H200 | $4.39 | Community cloud pricing from the public pricing page. |
| Nebius | H200 | $4.50 | Public listed single-GPU equivalent. |
| CoreWeave | 8x H200 | $6.31 | $50.48/hr node price, normalized per GPU. |
| AWS (p5e.48xlarge) | 8x H200 141GB | $7.91 | $63.28/hr node price, normalized per GPU. |
| Oracle Cloud (BM.GPU.H200.8) | 8× H200 | $10.00 | Bare-metal node, $80/hr total. |
| Azure (ND96isr H200 v5) | 8x H200 | $10.60 | Calculator price, normalized per GPU. |
Methodology – why you can trust these numbers
- On-demand only. We excluded capacity reservations longer than 14 days, reserved instances, and spot/pre-emptible offers.
- Same silicon. Every row is a 141 GB NVIDIA H200 (SXM or PCIe).
- Public price lists only. Figures come straight from each provider's pricing page in June 2026.
- US regions, USD. Regional variation can add 5-20 percent; those are ignored for apples-to-apples comparison.
H200 cost benchmark
| Provider | 10 hrs runtime | Effective cost |
|---|---|---|
| Thunder Compute – A100 80 GB | 10 × $0.78 | $7.80 |
| Vast.ai | 10 x $3.21 | $32.10 |
| RunPod | 10 x $4.39 | $43.90 |
| CoreWeave | 10 x $6.31 | $63.10 |
| AWS | 10 x $7.91 | $79.10 |
| Azure | 10 x $10.60 | $106.00 |
Bottom line: two hours on Thunder Compute's A100 costs less than 15 minutes of an H200 on Coreweave, and still buys roughly **13 × more runtime per dollar than hyperscaler H200s.
Explore the full Cloud GPU market landscape in AI GPU rental market trends analysis.
