API provider data is live · Hardware & cloud pricing curated 2026-02-23

Together AI

29 live models

Models

Model                                   Task                Params
DeepSeek-R1                             text-generation     685B
FLUX.1-schnell                          text-to-image
gpt-oss-120b                            text-generation     120B
gpt-oss-20b                             text-generation     21.5B
DeepSeek-V3                             text-generation     685B
DeepSeek-V3-0324                        text-generation     685B
Llama-3.3-70B-Instruct                  text-generation
DeepSeek-R1-0528                        text-generation     685B
Kimi-K2.5                               image-text-to-text  1059B
GLM-5                                   text-generation     754B
Qwen3-Coder-480B-A35B-Instruct          text-generation     480B
Qwen3.5-397B-A17B                       image-text-to-text  403B
GLM-4.6                                 text-generation     357B
Qwen2.5-7B-Instruct                     text-generation     7.6B
Qwen3-Next-80B-A3B-Instruct             text-generation     81.3B
gemma-3n-E4B-it                         image-text-to-text
DeepSeek-V3.1                           text-generation     685B
Qwen3-VL-8B-Instruct                    image-text-to-text  8.8B
Qwen3-235B-A22B-Instruct-2507           text-generation
Qwen3.5-9B                              image-text-to-text  9.7B
rnj-1-instruct                          text-generation
Apriel-1.6-15b-Thinker                  image-text-to-text
Llama-4-Maverick-17B-128E-Instruct-FP8  image-text-to-text  402B
Qwen3-Coder-480B-A35B-Instruct-FP8      text-generation
GLM-4.7-FP8                             text-generation
Qwen3-Coder-Next-FP8                    text-generation     79.7B
GLM-4.5-Air-FP8                         text-generation
cogito-671b-v2.1                        text-generation     671B
cogito-671b-v2.1-FP8                    text-generation     671B
MIT · v0.6.0