RESEARCH27
Sampling More, Getting Less: Calibration is the Diversity Bottleneck in LLMs
arXiv CS.CLΒ·May 13, 2026
This research addresses the lack of diversity in LLM outputs, attributing it to how models allocate probability mass across valid and invalid continuations during decoding. It introduces a validity-diversity framework that decomposes the problem into two complementary forms of miscalibration: order calibration and shape calibration.
Read original β