← heapsort-ai

scaling laws

2 items

RESEARCHarXiv CS.AI·2d ago

Position: Don't Just "Fix it in Post": A Science of AI Must Study Training Dynamics

This position paper argues for a scientific understanding of AI that focuses on studying training dynamics, rather than just analyzing models post-training. It emphasizes predicting outcomes, intervening when issues arise, and designing training procedures to reliably produce desired properties, extending the success of scaling laws beyond loss to capabilities, biases, robustness, and safety.

60
RESEARCHarXiv CS.CL·22d ago

The Scaling Laws of Skills in LLM Agent Systems

This research paper identifies two coupled scaling laws in LLM agent systems: a routing law showing accuracy decay with library size and an execution law demonstrating how correct execution improves downstream decisions. A key parameter, the routing logarithmic decay slope, links these laws, influencing both initial collapse and subsequent recoverability.

27