Kimi-K2-Instruct-0905
by moonshotai
1026B params · text-generation · 685 likes · 27.1k downloads
Kimi-K2-Instruct-0905 is a 1026B parameter model. At Q4 quantization it requires 513GB of VRAM. It requires a GPU with at least 513GB of VRAM.
Inference providers
| Provider | $/1M in | $/1M out | Throughput |
|---|---|---|---|
| Groq | 183 tok/s | ||
| Novita | 24 tok/s | ||
| Fireworks | 40 tok/s |