vram.run

API provider data is live · Hardware & cloud pricing curated 2026-02-23

Nscale

19 live models

Models

Model	Task	Params
Llama-3.1-8B-Instruct	text-generation	8.0B
FLUX.1-schnell	text-to-image
gpt-oss-120b	text-generation	120B
gpt-oss-20b	text-generation	21.5B
Qwen2.5-Coder-32B-Instruct	text-generation	32.8B
Llama-4-Scout-17B-16E-Instruct	image-text-to-text	109B
Qwen3-8B	text-generation	8.2B
Qwen3-235B-A22B	text-generation	235B
Qwen3-4B-Instruct-2507	text-generation	4.0B
DeepSeek-R1-Distill-Llama-8B	text-generation	8.0B
DeepSeek-R1-Distill-Qwen-7B	text-generation	7.6B
Qwen3-235B-A22B-Instruct-2507	text-generation	235B
Mixtral-8x22B-Instruct-v0.1		141B
Qwen2.5-Coder-7B-Instruct	text-generation	7.6B
Qwen3-32B	text-generation	32.8B
DeepSeek-R1-Distill-Qwen-14B		14.8B
Qwen3-4B-Thinking-2507	text-generation	4.0B
Qwen3-14B	text-generation	14.8B
Qwen2.5-Coder-3B-Instruct	text-generation	3.1B

Raw data · MIT · API data: live · HW/Cloud data: curated 2026-02-23 · v0.6.0