vram.run Models Hardware Providers Cloud State of Inference
API provider data is live · Hardware & cloud pricing curated 2026-02-23

Featherless

201 live models

Models

ModelTaskParams
Meta-Llama-3-8Btext-generation8.0B
Llama-3.1-8B-Instructtext-generation
Meta-Llama-3-8B-Instructtext-generation8.0B
Mistral-7B-Instruct-v0.2text-generation7.2B
QwQ-32Btext-generation
Kimi-K2-Instructtext-generation1026B
Llama-3.1-8Btext-generation
Qwen2.5-Coder-32B-Instructtext-generation32.8B
gemma-3-27b-itimage-text-to-text27.4B
zephyr-7b-betatext-generation7.2B
QwQ-32B-Previewtext-generation32.8B
Meta-Llama-3-70B-Instructtext-generation70.6B
DeepSeek-R1-Distill-Qwen-1.5Btext-generation
Mistral-Small-3.1-24B-Instruct-2503
Qwen3.5-397B-A17Bimage-text-to-text403B
Qwen2.5-7B-Instructtext-generation7.6B
zephyr-7b-alphatext-generation
Llama-2-13b-chat-hftext-generation
DeepSeek-R1-0528-Qwen3-8Btext-generation8.2B
Nanbeige4.1-3Btext-generation3.9B
Qwen3-8Btext-generation
Qwen3-Coder-30B-A3B-Instructtext-generation
Qwen2.5-72B-Instructtext-generation72.7B
Meta-Llama-3-70Btext-generation
DeepSeek-R1-Distill-Llama-8Btext-generation
DeepSeek-R1-Distill-Qwen-7Btext-generation7.6B
Qwen3-30B-A3B-Instruct-2507text-generation
gemma-2-9b-ittext-generation9.2B
ReaderLM-v2text-generation1.5B
Qwen2-72B-Instructtext-generation72.7B
Marco-o1text-generation7.6B
Step-3.5-Flashtext-generation199B
gemma-2-9btext-generation
Llama3-8B-Chinese-Chattext-generation
Qwen2-7B-Instructtext-generation7.6B
DeepCoder-14B-Previewtext-generation
gemma-3-12b-itimage-text-to-text
Qwen3-32Btext-generation
Qwen2.5-Coder-7B-Instructtext-generation7.6B
Qwen2.5-1.5B-Instructtext-generation1.5B
DeepSeek-R1-Distill-Qwen-14Btext-generation14.8B
reader-lm-1.5btext-generation1.5B
Sky-T1-32B-Previewtext-generation32.8B
neural-chat-7b-v3-1text-generation7.2B
Ling-1Ttext-generation
Jan-nanotext-generation4.0B
GLM-4-32B-0414text-generation
Dolphin-Mistral-24B-Venice-Editiontext-generation23.6B
Qwen3-1.7Btext-generation
WizardLM-2-8x22Btext-generation141B
Install CLI [email protected] Raw data · MIT · API data: live · HW/Cloud data: curated 2026-02-23 · v0.6.0