BitsMoE: Efficient Spectral Energy-Guided Bit Allocation for MoE LLM Quantization
BitsMoE proposes a spectral-energy-guided bit-allocation framework for quantizing Mixture-of-Experts (MoE) large language models. It addresses memory-intensive deployment by decomposing MoE layers and using expert-specific spectral factors for fine-grained, activation-aware mixed-precision quantization.
