Language Acquisition Device in Large Language Models
arXiv CS.CL · May 19, 2026
This paper proposes pre-pretraining inspired by the Language Acquisition Device (LAD) on MP-STRUCT, a formal language that reflects natural language structures, to improve the data efficiency of large language models. A brief pre-pretraining phase on MP-STRUCT matches strong formal-language baselines in token efficiency and imparts human-like resistance to structurally implausible languages.