nvidia/Gemma-4-26B-A4B-NVFP4
Reddit r/LocalLLaMA · May 1, 2026

The post confirms that the Gemma-4-26B-A4B-NVFP4 model runs on an NVIDIA 5090 GPU, using 18.8 GB of VRAM and supporting a 50k-token context. It also compares benchmark scores for the NVFP4 version against full precision on GPQA, AIME, and MMLU Pro, among other metrics.