vram.run Models Hardware Providers Cloud State of Inference
API provider data is live · Hardware & cloud pricing curated 2026-02-23

Z.ai

15 live models

Models

ModelTaskParams
GLM-4.7text-generation358B
GLM-5text-generation754B
GLM-4.7-Flashtext-generation
GLM-4.5text-generation358B
GLM-4.6text-generation357B
GLM-OCRimage-to-text
GLM-Imagetext-to-image
GLM-4.5Vimage-text-to-text
GLM-4.6V-Flashimage-text-to-text10.3B
GLM-4.5-Airtext-generation
GLM-4.6Vimage-text-to-text
GLM-4.7-FP8text-generation
GLM-4.6-FP8text-generation
GLM-4.5V-FP8image-text-to-text
GLM-4.6V-FP8image-text-to-text108B
Install CLI [email protected] Raw data · MIT · API data: live · HW/Cloud data: curated 2026-02-23 · v0.6.0