← heapsort-ai

machine learning

790 items

RESEARCHarXiv CS.LG·27d ago

Population Risk Bounds for Kolmogorov-Arnold Networks Trained by DP-SGD with Correlated Noise

This research establishes the first population risk bounds for Kolmogorov-Arnold Networks (KANs) trained with mini-batch SGD, including differentially private SGD (DP-SGD) with correlated noise. It covers more practical scenarios than prior KAN theory and provides sharper results for fixed-second-layer specializations.

29
RESEARCHarXiv CS.LG·9d ago

When LLMs Learn to Be Consistently Wrong: A Multi-Model Study of Linear Representations of Synthetic Deception

This paper explores "deceptive alignment" in LLMs, a key challenge in AI safety where models deliberately produce false outputs while maintaining accurate internal representations. Researchers introduced a multi-model paradigm, successfully detecting synthetic dishonesty with high accuracy using linear probes across various transformer architectures.

29
RESEARCHarXiv CS.CL·4/6/2026

Dependency-Guided Parallel Decoding in Discrete Diffusion Language Models

Modelos de linguagem de difusão discreta (dLLMs) aceleram a geração de texto, mas a decodificação paralela degrada a qualidade ao desconsiderar a dependência entre tokens. DEMASK propõe um preditor leve que estima influências condicionais para guiar o desmascaramento simultâneo, comprovadamente melhorando a qualidade. A técnica resulta em um ganho de velocidade de 1.7 a 2.2x, mantendo ou superando o desempenho.

29
RESEARCHarXiv CS.LG·28d ago

Interpretable EEG Microstate Discovery via Variational Deep Embedding: A Systematic Architecture Search with Multi-Quadrant Evaluation

This paper introduces the Convolutional Variational Deep Embedding (Conv-VaDE) model for EEG microstate analysis. It enhances interpretability by jointly learning topographic reconstruction and probabilistic soft clustering, enabling generative decoding of cluster prototypes into verifiable scalp topographies.

29
RESEARCHarXiv CS.LG·16d ago

WeCon: An Efficient Weight-Conditioned Neural Solver for Multi-Objective Combinatorial Optimization Problems

Researchers propose WeCon, an efficient Weight-Conditioned neural solver for Multi-Objective Combinatorial Optimization Problems (MOCOPs). It improves weight-conditioned context modeling and preference optimization, addressing limitations of existing methods in weight injection and constructing informative solution pairs for training.

28