vram
.run
vram.run
Models
Hardware
Providers
Cloud
State of Inference
API provider data is live · Hardware & cloud pricing curated 2026-02-23
Where should you run your model?
19 providers · 220+ hardware configs · 30+ cloud offerings — compared in one place
Featherless
201 live models
Models
Model
Task
Params
Meta-Llama-3-8B
text-generation
8.0B
Llama-3.1-8B-Instruct
text-generation
Meta-Llama-3-8B-Instruct
text-generation
8.0B
Mistral-7B-Instruct-v0.2
text-generation
7.2B
QwQ-32B
text-generation
Kimi-K2-Instruct
text-generation
1026B
Llama-3.1-8B
text-generation
Qwen2.5-Coder-32B-Instruct
text-generation
32.8B
gemma-3-27b-it
image-text-to-text
27.4B
zephyr-7b-beta
text-generation
7.2B
QwQ-32B-Preview
text-generation
32.8B
Meta-Llama-3-70B-Instruct
text-generation
70.6B
DeepSeek-R1-Distill-Qwen-1.5B
text-generation
Mistral-Small-3.1-24B-Instruct-2503
Qwen3.5-397B-A17B
image-text-to-text
403B
Qwen2.5-7B-Instruct
text-generation
7.6B
zephyr-7b-alpha
text-generation
Llama-2-13b-chat-hf
text-generation
DeepSeek-R1-0528-Qwen3-8B
text-generation
8.2B
Nanbeige4.1-3B
text-generation
3.9B
Qwen3-8B
text-generation
Qwen3-Coder-30B-A3B-Instruct
text-generation
Qwen2.5-72B-Instruct
text-generation
72.7B
Meta-Llama-3-70B
text-generation
DeepSeek-R1-Distill-Llama-8B
text-generation
DeepSeek-R1-Distill-Qwen-7B
text-generation
7.6B
Qwen3-30B-A3B-Instruct-2507
text-generation
gemma-2-9b-it
text-generation
9.2B
ReaderLM-v2
text-generation
1.5B
Qwen2-72B-Instruct
text-generation
72.7B
Marco-o1
text-generation
7.6B
Step-3.5-Flash
text-generation
199B
gemma-2-9b
text-generation
Llama3-8B-Chinese-Chat
text-generation
Qwen2-7B-Instruct
text-generation
7.6B
DeepCoder-14B-Preview
text-generation
gemma-3-12b-it
image-text-to-text
Qwen3-32B
text-generation
Qwen2.5-Coder-7B-Instruct
text-generation
7.6B
Qwen2.5-1.5B-Instruct
text-generation
1.5B
DeepSeek-R1-Distill-Qwen-14B
text-generation
14.8B
reader-lm-1.5b
text-generation
1.5B
Sky-T1-32B-Preview
text-generation
32.8B
neural-chat-7b-v3-1
text-generation
7.2B
Ling-1T
text-generation
Jan-nano
text-generation
4.0B
GLM-4-32B-0414
text-generation
Dolphin-Mistral-24B-Venice-Edition
text-generation
23.6B
Qwen3-1.7B
text-generation
WizardLM-2-8x22B
text-generation
141B
Install CLI
[email protected]
Raw data
· MIT · API data: live · HW/Cloud data: curated 2026-02-23 ·
v0.6.0