← heapsort-ai

model collapse

4 items

RESEARCHarXiv CS.CL·4/13/2026

Drift and selection in LLM text ecosystems

This paper introduces a mathematical framework to analyze the recursive process where AI-generated text re-enters and shapes the public record from which LLMs learn. It distinguishes between "drift," which removes rare forms through unfiltered reuse, and "selection," which filters content based on criteria like quality, showing normative selection preserves deeper linguistic structures.

29
RESEARCHarXiv CS.CL·5/1/2026

Exploring the Limits of Pruning: Task-Specific Neurons, Model Collapse, and Recovery in Task-Specific Large Language Models

This study explores the existence of task-specific neurons in large language models, focusing on mathematical reasoning and code generation. It introduces an activation-based selectivity metric for neuron pruning, which consistently outperforms random pruning in reducing computational cost and preserving task accuracy, while preventing performance collapse.

27