vram.run Models Hardware Providers Cloud State of Inference
API provider data is live · Hardware & cloud pricing curated 2026-02-23

Groq

8 live models

Models

ModelTaskParams
gpt-oss-120btext-generation120B
gpt-oss-20btext-generation21.5B
Llama-3.3-70B-Instructtext-generation
Llama-4-Scout-17B-16E-Instructimage-text-to-text109B
Kimi-K2-Instruct-0905text-generation1026B
Qwen3-32Btext-generation
gpt-oss-safeguard-20btext-generation
Llama-Guard-4-12Bimage-text-to-text
Install CLI [email protected] Raw data · MIT · API data: live · HW/Cloud data: curated 2026-02-23 · v0.6.0