vram.run Models Hardware Providers Cloud State of Inference
API provider data is live · Hardware & cloud pricing curated 2026-02-23

Fireworks

15 live models

Models

ModelTaskParams
gpt-oss-120btext-generation120B
gpt-oss-20btext-generation21.5B
Llama-3.3-70B-Instructtext-generation
Kimi-K2.5image-text-to-text1059B
GLM-5text-generation754B
Kimi-K2-Thinkingtext-generation
DeepSeek-V3.2text-generation685B
MiniMax-M2.1text-generation229B
MiniMax-M2.5text-generation229B
Qwen3-8Btext-generation
DeepSeek-V3.1text-generation685B
Kimi-K2-Instruct-0905text-generation1026B
Qwen3-VL-30B-A3B-Instructimage-text-to-text
Qwen3-VL-30B-A3B-Thinkingimage-text-to-text
cogito-671b-v2.1text-generation671B
Install CLI [email protected] Raw data · MIT · API data: live · HW/Cloud data: curated 2026-02-23 · v0.6.0