Reddit r/LocalLLaMA · 4/20/2026
ubergarm/Kimi-K2.6-GGUF Q4_X now available
User ubergarm/VoidAlchemy announced that the "Q4_X" quant of Kimi-K2.6-GGUF is now available, thanking jukofyork and AesSedai for their tips on patching and quantization. The model needs about 584 GB of combined RAM and VRAM and runs on both ik_llama.cpp and mainline llama.cpp. Smaller quants and imatrix info are planned to follow.
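
As a rough illustration of what the 584 GB figure means in practice, the sketch below adds up system RAM and per-GPU VRAM and compares the total against that requirement. The function name `fits_in_memory` and the `headroom_gb` parameter are hypothetical and not part of the announcement; the headroom is an assumed allowance for context cache and runtime overhead, which the post does not quantify.

```python
from typing import Sequence

# Figure quoted in the announcement (approximate, RAM + VRAM combined).
REQUIRED_GB = 584.0


def fits_in_memory(
    ram_gb: float,
    vram_gb_per_gpu: Sequence[float],
    required_gb: float = REQUIRED_GB,
    headroom_gb: float = 0.0,  # assumption: extra space for KV cache, OS, etc.
) -> bool:
    """Return True if combined RAM + VRAM covers the model plus headroom."""
    total_gb = ram_gb + sum(vram_gb_per_gpu)
    return total_gb >= required_gb + headroom_gb


if __name__ == "__main__":
    # Hypothetical machine: 512 GB system RAM plus four 24 GB GPUs.
    print(fits_in_memory(512, [24, 24, 24, 24]))
    # Same machine, reserving an assumed 32 GB of headroom.
    print(fits_in_memory(512, [24, 24, 24, 24], headroom_gb=32))
```

This is only a capacity check; it says nothing about how the weights should be split between CPU and GPU or about the resulting speed.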
