Hyperparameter Tuning

3 items

RESEARCHarXiv CS.LG·4/16/2026

Generalization Guarantees on Data-Driven Tuning of Gradient Descent with Langevin Updates

This paper introduces the Langevin Gradient Descent (LGD) algorithm for convex regression problems, proving that optimal hyperparameter configurations achieve the Bayes' optimal solution. The work also provides generalization guarantees for meta-learning LGD's optimal hyperparameters, with a pseudo-dimension bound of O(dh).

Meta-Learning Optimization Generalization Hyperparameter Tuning

RESEARCHarXiv CS.LG·27d ago

$\xi$-DPO: Direct Preference Optimization via Ratio Reward Margin

This paper introduces -DPO, a direct preference optimization method using a ratio reward margin, to address the challenge of hyperparameter tuning in SimPO. The research analyzes SimPO and reformulates the preference objective to improve interpretability across datasets with varying reward gap structures.

Preference Optimization deep learning reinforcement learning Hyperparameter Tuning

RESEARCHarXiv CS.LG·5d ago

Unlocking Feature Learning in Gated Delta Networks at Scale

This paper derives scaling rules for Gated Delta Networks to address the computational demands of training and scaling Large Language Models. Experiments validate that these configurations enable stable learning-rate transfer across various model widths, unlike standard parametrization.

neural networks learning Hyperparameter Tuning machine learning