vram
.run
vram.run
Models
Hardware
Providers
Cloud
State of Inference
API provider data is live · Hardware & cloud pricing curated 2026-02-23
Where should you run your model?
19 providers · 220+ hardware configs · 30+ cloud offerings — compared in one place
Cerebras
2 live models
Models
Model
Task
Params
Llama-3.1-8B-Instruct
text-generation
gpt-oss-120b
text-generation
120B
Install CLI
[email protected]
Raw data
· MIT · API data: live · HW/Cloud data: curated 2026-02-23 ·
v0.6.0