Kimi-K2-Instruct
by moonshotai
1026B params · text-generation · 2.3k likes · 94.8k downloads
Kimi-K2-Instruct is a 1026B parameter model. At Q4 quantization it requires 513GB of VRAM. It requires a GPU with at least 513GB of VRAM.
Inference providers
| Provider | $/1M in | $/1M out | Throughput |
|---|---|---|---|
| Novita | 42 tok/s | ||
| Featherless |