vram.run Models Hardware Providers Cloud State of Inference
API provider data is live · Hardware & cloud pricing curated 2026-02-23

Fireworks vs Groq

15 vs 8 models, 4 shared

Shared models

ModelFireworks $/1M outFireworks tok/sGroq $/1M outGroq tok/s
Llama-3.3-70B-Instruct109 tok/s295 tok/s
Kimi-K2-Instruct-090540 tok/s183 tok/s
gpt-oss-120b90 tok/s435 tok/s
gpt-oss-20b143 tok/s558 tok/s
Install CLI [email protected] Raw data · MIT · API data: live · HW/Cloud data: curated 2026-02-23 · v0.6.0