← heapsort-ai

Optimization

134 items

RESEARCHarXiv CS.LG·4/16/2026

Generalization Guarantees on Data-Driven Tuning of Gradient Descent with Langevin Updates

This paper introduces the Langevin Gradient Descent (LGD) algorithm for convex regression problems, proving that optimal hyperparameter configurations achieve the Bayes' optimal solution. The work also provides generalization guarantees for meta-learning LGD's optimal hyperparameters, with a pseudo-dimension bound of O(dh).

29
RESEARCHarXiv CS.LG·26d ago

Population Risk Bounds for Kolmogorov-Arnold Networks Trained by DP-SGD with Correlated Noise

This research establishes the first population risk bounds for Kolmogorov-Arnold Networks (KANs) trained with mini-batch SGD, including differentially private SGD (DP-SGD) with correlated noise. It covers more practical scenarios than prior KAN theory and provides sharper results for fixed-second-layer specializations.

29
RESEARCHarXiv CS.LG·15d ago

WeCon: An Efficient Weight-Conditioned Neural Solver for Multi-Objective Combinatorial Optimization Problems

Researchers propose WeCon, an efficient Weight-Conditioned neural solver for Multi-Objective Combinatorial Optimization Problems (MOCOPs). It improves weight-conditioned context modeling and preference optimization, addressing limitations of existing methods in weight injection and constructing informative solution pairs for training.

28
RESEARCHarXiv CS.AI·14d ago

Practical Quantum CIM Empowerment via All-Domestic-Core Agentic Large Model

This study integrates a femtosecond laser-pumped Coherent Ising Machine (CIM) with an LLM-driven agentic system, leveraging LangGraph and LangChain frameworks. It demonstrates that LLMs can effectively perform tasks like QUBO/Ising model calibration and constraint weight iteration, achieving practical empowerment of quantum CIM with domestic technology.

28
RESEARCHarXiv CS.LG·4/6/2026

From Broad Exploration to Stable Synthesis: Entropy-Guided Optimization for Autoregressive Image Generation

O artigo analisa a interação entre Chain-of-Thought (CoT) e Reinforcement Learning (RL) na geração de imagens a partir de texto (T2I) usando uma análise sistemática baseada em entropia. Ele revela que menor entropia dos tokens de imagem e do CoT textual se correlaciona com melhor qualidade de imagem, propondo a estratégia Entropy-Guided Group Relative Policy Optimization (EG-GRPO) para otimização com base na incerteza.

28
RESEARCHarXiv CS.LG·5d ago

Inverse Critical Experiment Design via Gradient Optimization and a Multigroup Attention-Based Neural Network Architecture

This research presents a methodology for the inverse design of critical experiments, essential for validating advanced nuclear reactor designs. It employs deep neural network surrogate modeling and nonparametric gradient optimization to generate experiment geometries that maximize neutronic similarity.

28