vram.run Models Hardware Providers Cloud State of Inference
API provider data is live · Hardware & cloud pricing curated 2026-02-23

Together AI vs Z.ai

29 vs 15 models, 3 shared

Shared models

ModelTogether AI $/1M outTogether AI tok/sZ.ai $/1M outZ.ai tok/s
GLM-4.642 tok/s87 tok/s
GLM-4.7-FP859 tok/s83 tok/s
GLM-542 tok/s38 tok/s
Install CLI [email protected] Raw data · MIT · API data: live · HW/Cloud data: curated 2026-02-23 · v0.6.0