Categories: Compute, AI & GPU Infrastructure, Developer Tools & Ecosystem

Quick Stats

Pricing Model
Pay-As-You-Go, Reserved / Committed Capacity, Spot Pricing
Customer Count
500k
Region
US West, US East, US Central, CA East, EU West, EU Central, EU North, EU East, Oceania (AU)

About

RunPod offers on-demand GPU cloud infrastructure for AI, machine learning, and compute-intensive workloads. Its platform features pay-as-you-go pricing, instant deployment, and compatibility with popular ML frameworks, making it ideal for developers seeking flexible, cost-effective access to high-performance GPUs without long-term commitments.

Alternate Cloud Providers
Backblaze
Wasabi
IONOS
Crunchy Data


Pricing Details

RunPod Pricing

Cloud GPUs, Serverless AI, Clusters, and Storage
Cloud GPU Pods
On-demand GPUs in 30+ regions
Per-second billing
GPU Examples
H200 (141GB VRAM): Contact sales
B200 (180GB VRAM): Contact sales
H100 SXM (80GB VRAM): Contact sales
A100 SXM (80GB VRAM): $1.79/hr
L40S (48GB VRAM): Contact sales
Many more GPU types available
Billing Model
• Per-second and per-hour billing
• No minimum commitment
Pricing varies by GPU, region, and availability. See full list on RunPod's pricing page.
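Per-second billing means a short job is charged only for the seconds it actually runs. A minimal sketch of the arithmetic, using the A100 SXM rate from the table above (the helper function is illustrative, not a RunPod API):

```python
def pod_cost(hourly_rate_usd: float, runtime_seconds: int) -> float:
    """Estimate on-demand pod cost under per-second billing."""
    per_second = hourly_rate_usd / 3600
    return round(per_second * runtime_seconds, 4)

# A 45-minute fine-tuning run on an A100 SXM at $1.79/hr:
print(pod_cost(1.79, 45 * 60))  # ≈ $1.34 for the whole run
```

With no minimum commitment, the bill scales linearly with runtime rather than rounding up to a full hour.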
Serverless GPU Endpoints
Instant AI workloads, no idle costs
Flex & Active workers
GPU Examples
B200 (180GB): $8.64/hr (Flex), $6.84/hr (Active)
H200 (141GB): $5.58/hr (Flex), $4.46/hr (Active)
H100 (80GB): $4.18/hr (Flex), $3.35/hr (Active)
A100 (80GB): $2.72/hr (Flex), $2.17/hr (Active)
L40/L40S/6000 Ada (48GB): $1.90/hr (Flex), $1.33/hr (Active)
Many more GPU types available
Worker Types
• Flex: Scales up during spikes, cost-efficient for bursty workloads
• Active: Always-on, up to 30% discount, eliminates cold starts
Save 25%+ over other serverless providers on flex workers.
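The Flex/Active tradeoff can be reduced to a breakeven calculation. A hedged sketch using the H100 rates from the table above, under the assumption that a Flex worker is billed only while processing requests and an Active worker is billed around the clock (the helpers are illustrative, not a RunPod API):

```python
def breakeven_utilization(flex_hr: float, active_hr: float) -> float:
    """Utilization fraction above which an always-on Active worker is
    cheaper than a Flex worker billed only while busy."""
    return active_hr / flex_hr

def monthly_cost(rate_hr: float, busy_hours: float, always_on: bool,
                 hours_in_month: float = 730) -> float:
    """Bill for the whole month if always-on, else only for busy hours."""
    billed = hours_in_month if always_on else busy_hours
    return round(rate_hr * billed, 2)

# H100 (80GB) rates from the table above:
flex, active = 4.18, 3.35
print(breakeven_utilization(flex, active))                   # ≈ 0.80
print(monthly_cost(flex, busy_hours=150, always_on=False))   # bursty Flex: $627.00
print(monthly_cost(active, busy_hours=150, always_on=True))  # always-on Active: $2445.50
```

Under these assumptions, workloads busy less than roughly 80% of the time come out cheaper on Flex, while near-constant traffic favors the discounted Active rate (which also avoids cold starts).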
Instant & Reserved Clusters
Multi-GPU clusters, scale up to 64 GPUs
Contact sales for pricing
Instant Clusters
H200 SXM: $4.31/hr
A100 SXM: $1.79/hr
Other GPUs: Contact sales
Reserved Clusters
Dedicated clusters, custom configs, SLA-backed uptime
Discounted rates for 10,000+ GPUs
Contact sales for all reserved cluster pricing
Reserved clusters require a minimum commitment. Pricing varies by configuration and term.
Storage
Persistent and network storage
No ingress/egress fees
Container Disk $0.10/GB/mo
Volume Disk
Running: $0.10/GB/mo
Idle: $0.20/GB/mo
Network Storage (Standard)
Under 1TB: $0.07/GB/mo
Over 1TB: $0.05/GB/mo
Network Storage (High-Performance)
Under 1TB: $0.14/GB/mo
Over 1TB: $0.07/GB/mo
No fees for ingress/egress. Persistent and temporary storage available.
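The network storage tiers above can be turned into a quick monthly estimate. A minimal sketch, assuming the listed per-GB rate applies to the whole volume based on its size bracket and that 1 TB is taken as 1,000 GB (both assumptions; check RunPod's pricing page for the exact tier rules):

```python
def network_storage_monthly(size_gb: float, high_perf: bool = False) -> float:
    """Monthly cost for a network storage volume, assuming the bracket
    rate applies to the entire volume (1 TB treated as 1,000 GB)."""
    if high_perf:
        rate = 0.14 if size_gb < 1000 else 0.07
    else:
        rate = 0.07 if size_gb < 1000 else 0.05
    return round(size_gb * rate, 2)

print(network_storage_monthly(500))    # 500 GB standard → $35.00/mo
print(network_storage_monthly(2000))   # 2 TB standard → $100.00/mo
```

Because there are no ingress/egress fees, capacity is the only storage line item to budget for.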
Public AI Endpoints
Pre-deployed models via API
Pay-per-use
Audio Models
Pruna / Whisper V3 Large: $0.05 per 1,000 characters
resembleai / Chatterbox Turbo: $0.00 per 1,000 characters
Image Models
bytedance / Seedream 4.0 Edit: $0.0270 per request
pruna / Pruna Image T2I: $0.0050 per request
Language Models
deep-cogito / Deep Cogito v2 Llama 70B: $0.00001 per 1M tokens
ibm / IBM Granite 4.0 H Small: $1.00 per 1M tokens
Video Models
Bytedance / Seedance 1.0 pro: 5s $0.12 (480p) per request
Alibaba / Wan 2.2 I2V 720p: 5s $0.30 per request
Full list of models and pricing available on RunPod's public endpoints page.
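For the token-priced language models, per-request cost follows directly from token counts. A hedged sketch, assuming input and output tokens are billed at the same per-million rate (the helper is illustrative, not a RunPod API; confirm the billing breakdown on the endpoints page):

```python
def llm_request_cost(prompt_tokens: int, output_tokens: int,
                     usd_per_million_tokens: float) -> float:
    """Estimate cost of one call to a pay-per-use language endpoint,
    assuming input and output tokens share one rate."""
    total = prompt_tokens + output_tokens
    return total / 1_000_000 * usd_per_million_tokens

# IBM Granite 4.0 H Small at $1.00 per 1M tokens:
print(llm_request_cost(1200, 300, 1.00))  # → 0.0015
```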

Notes: Prices shown in USD. Actual rates may vary by GPU, region, and availability. Some high-end GPU and cluster pricing is available only upon request. See RunPod's official pricing page for the latest details.


Features
Compute: On-demand and serverless GPU compute with support for over 30 GPU SKUs, including B200s and RTX 4090s.
Storage: Persistent S3-compatible network storage with zero ingress/egress fees for full AI pipelines.
Performance: Autoscaling from 0 to 1,000s of GPU workers in seconds with sub-200ms cold starts using FlashBoot.
Compliance: SOC 2 Type II certified for end-to-end data protection and enterprise-grade uptime.
Key Offerings
• On-demand Cloud GPUs across 30+ SKUs and global regions
• Serverless AI workloads with autoscaling and sub-200ms cold starts
• Multi-node GPU Clusters for large-scale training and deployment
• Persistent S3-compatible network storage with zero egress fees
• RunPod Hub for rapid deployment of open-source AI models
Ideal Use Cases / Buyers
Workloads: AI/ML model training, real-time inference, fine-tuning, deployment of AI agents, compute-heavy tasks such as data processing and simulation, and scalable GPU-based workloads.
Buyers: AI startups, ML engineers, generative AI platforms, SaaS companies, media & entertainment, architectural visualization, and developer teams at technology companies.
Why Choose

• ✅ Instantly deploy GPU-enabled environments in under a minute, supporting over 30 GPU SKUs across 8+ global regions
• ✅ Serverless infrastructure that auto-scales from 0 to thousands of GPU workers in seconds, with sub-200ms cold starts and zero idle costs
• ✅ S3-compatible persistent storage with zero ingress/egress fees, enabling full AI pipelines without additional data transfer costs
• ✅ Enterprise-grade reliability with 99.9% uptime and independently audited SOC 2 Type II compliance for data security
• ✅ Proven cost efficiency, with customer case studies reporting up to 90% savings on infrastructure bills and the ability to match demand without overpaying for idle resources
