DeepSeek-V4-Flash

2 items

RESEARCH · DEV.to AI · 7d ago

Telemetry data: Running 284B MoE at 0.00 GB Active VRAM

This post shares hardware telemetry from an architectural test of running a frontier-scale model on highly constrained commodity hardware. It benchmarks a 284B-parameter Mixture-of-Experts (MoE) model and reports 0.00 GB of active GPU VRAM, achieved by decoupling physical weight storage from local GPU memory allocation.
