model collapse

4 items

RESEARCHarXiv CS.CL·4/13/2026

Drift and selection in LLM text ecosystems

This paper introduces a mathematical framework to analyze the recursive process where AI-generated text re-enters and shapes the public record from which LLMs learn. It distinguishes between "drift," which removes rare forms through unfiltered reuse, and "selection," which filters content based on criteria like quality, showing normative selection preserves deeper linguistic structures.

Text Ecosystems data drift model collapse large language models

RESEARCHarXiv CS.CL·4d ago

Epidemiology of Model Collapse: Modeling Synthetic Data Contamination via Bilayer SIR Dynamics

The paper proposes a bilayer SIR/SIRS framework to model synthetic data contamination and model collapse within the AI ecosystem. This phenomenological mean-field model treats data corpora and AI models as interacting populations, deriving a basic reproduction number to analyze cross-contamination.

synthetic data AI models data contamination model collapse

RESEARCHarXiv CS.CL·5/1/2026

Exploring the Limits of Pruning: Task-Specific Neurons, Model Collapse, and Recovery in Task-Specific Large Language Models

This study explores the existence of task-specific neurons in large language models, focusing on mathematical reasoning and code generation. It introduces an activation-based selectivity metric for neuron pruning, which consistently outperforms random pruning in reducing computational cost and preserving task accuracy, while preventing performance collapse.

Pruning AI optimization model collapse large language models

RESEARCHQwen Blog·7/27/2025

GSPO: Towards Scalable Reinforcement Learning for Language Models

O Reinforcement Learning é crucial para escalar modelos de linguagem, mas algoritmos existentes sofrem de instabilidade e colapso do modelo. Para resolver isso e permitir o escalonamento bem-sucedido, propõe-se o algoritmo Group Sequence Policy Optimization (GSPO).

Scalability Policy optimization language models reinforcement learning