
MiniMax M2.7 GGUF Investigation, Fixes, Benchmarks

Reddit r/LocalLLaMA · April 14, 2026

An investigation into the MiniMax-M2.7 GGUF found that perplexity NaNs affect 21-38% of GGUFs on Hugging Face. The problem was traced to an overflow in llama.cpp, specifically in `blk.61.ffn_down_exps` for the Q5_K and Q4_K quantizations, and the team has fixed its own GGUFs.
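To make the failure mode concrete, here is a minimal sketch of how a single overflowed value can turn a whole perplexity score into NaN. It is not llama.cpp code. It assumes, purely for illustration, that the overflow occurs in a half-precision intermediate (the summary does not say where), and the helper names `store_fp16`, `log_softmax`, and `perplexity` are hypothetical.

```python
import math

FP16_MAX = 65504.0  # largest finite IEEE 754 half-precision value


def store_fp16(x: float) -> float:
    """Simplified half-precision store (assumption): magnitudes beyond the
    fp16 range become +/-inf. Rounding of in-range values is ignored."""
    if math.isfinite(x) and abs(x) > FP16_MAX:
        return math.copysign(math.inf, x)
    return x


def log_softmax(logits):
    """Numerically stable log-softmax for finite inputs."""
    m = max(logits)
    shifted = [x - m for x in logits]  # inf - inf evaluates to NaN here
    log_sum = math.log(sum(math.exp(s) for s in shifted))
    return [s - log_sum for s in shifted]


def perplexity(samples):
    """samples: list of (logits, target_index) pairs."""
    nll = [-log_softmax(logits)[target] for logits, target in samples]
    return math.exp(sum(nll) / len(nll))


clean = [([2.0, 1.0, 0.1], 0), ([0.5, 3.0, 0.2], 1)]
# One activation exceeds the fp16 range and is stored as +inf.
overflowed = [([store_fp16(7.0e4), 1.0, 0.1], 0)] + clean[1:]

print(math.isfinite(perplexity(clean)))    # True
print(math.isnan(perplexity(overflowed)))  # True
```

Because NaN propagates through every later sum and average, one overflowed value in one token is enough to make the aggregate perplexity NaN, which is why such a check is a useful smoke test for a quantized file.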

Perplexity NaNs quantization GGUF MiniMax M2.7
