ARTICLE↑ trending42

MiniMax M2.7 GGUF Investigation, Fixes, Benchmarks

Reddit r/LocalLLaMA·April 14, 2026

An investigation into MiniMax-M2.7 GGUF revealed that perplexity NaNs affect 21-38% of GGUFs on Hugging Face. The issue was traced to overflowing in llama.cpp, specifically in `blk.61.ffn_down_exps` for Q5_K and Q4_K quantizations, and the team has fixed theirs.

Perplexity NaNs quantization GGUF MiniMax M2.7

Read original ↗