vram.run Models Hardware Providers Cloud State of Inference
API provider data is live · Hardware & cloud pricing curated 2026-02-23

Novita

67 live models

Models

ModelTaskParams
DeepSeek-R1text-generation685B
Llama-3.1-8B-Instructtext-generation
gpt-oss-120btext-generation120B
gpt-oss-20btext-generation21.5B
Meta-Llama-3-8B-Instructtext-generation8.0B
DeepSeek-V3text-generation685B
DeepSeek-OCRimage-text-to-text
DeepSeek-V3-0324text-generation685B
Llama-3.3-70B-Instructtext-generation
DeepSeek-R1-0528text-generation685B
Kimi-K2-Instructtext-generation1026B
Kimi-K2.5image-text-to-text1059B
GLM-4.7text-generation358B
GLM-5text-generation754B
Kimi-K2-Thinkingtext-generation
GLM-4.7-Flashtext-generation
Meta-Llama-3-70B-Instructtext-generation70.6B
MiniMax-M2text-generation229B
GLM-4.5text-generation358B
Llama-3.2-1B-Instructtext-generation
Qwen3-Coder-480B-A35B-Instructtext-generation480B
DeepSeek-V3.2text-generation685B
Qwen3.5-397B-A17Bimage-text-to-text403B
MiniMax-M2.1text-generation229B
Llama-4-Scout-17B-16E-Instructimage-text-to-text109B
GLM-4.6text-generation357B
MiniMax-M2.5text-generation229B
Qwen3-Coder-Nexttext-generation79.7B
Qwen3-235B-A22Btext-generation235B
Qwen3.5-35B-A3Bimage-text-to-text36.0B
DeepSeek-V3.2-Exptext-generation685B
Qwen3-Next-80B-A3B-Instructtext-generation81.3B
Qwen2.5-72B-Instructtext-generation72.7B
Qwen3-30B-A3Btext-generation
DeepSeek-Prover-V2-671Btext-generation
DeepSeek-V3.1text-generation685B
Qwen3-VL-8B-Instructimage-text-to-text8.8B
Qwen3-235B-A22B-Instruct-2507text-generation
DeepSeek-R1-Distill-Llama-70Btext-generation
GLM-4.5Vimage-text-to-text
MiniMax-M1-80ktext-generation
Kimi-K2-Instruct-0905text-generation1026B
Qwen3-32Btext-generation
MiMo-V2-Flashtext-generation310B
Qwen3.5-27Bimage-text-to-text27.8B
GLM-4.6V-Flashimage-text-to-text10.3B
GLM-4.5-Airtext-generation
Qwen3-VL-30B-A3B-Instructimage-text-to-text
GLM-4-32B-0414text-generation
Qwen3-Next-80B-A3B-Thinkingtext-generation
Install CLI [email protected] Raw data · MIT · API data: live · HW/Cloud data: curated 2026-02-23 · v0.6.0