← heapsort-ai

machine learning

790 items

RESEARCHarXiv CS.LG·4/20/2026

Lightweight Geometric Adaptation for Training Physics-Informed Neural Networks

Physics-Informed Neural Networks (PINNs) often suffer from slow convergence and instability due to complex loss landscapes. This paper proposes a lightweight, curvature-aware optimization framework that augments existing first-order optimizers to improve convergence speed, training stability, and solution accuracy on partial differential equations (PDEs).

27
RESEARCHarXiv CS.LG·23d ago

Reading the Cell, Designing the Cure: Perturbation-Conditioned Molecular Diffusion for Function-Oriented Drug Design

This research introduces Transcriptome-based Drug Design (TBDD) as a generative inverse problem to design drug molecules conditioned on desired transcriptomic state transitions. It proposes "ACURE" (A Cellular Response Engine), a multi-resolution transcriptome-guided diffusion framework, to address the challenges of this complex task.

27
RESEARCHarXiv CS.LG·4/9/2026

A Benchmark of Classical and Deep Learning Models for Agricultural Commodity Price Forecasting on A Novel Bangladeshi Market Price Dataset

Este artigo apresenta o AgriPriceBD, um novo conjunto de dados diário de preços de commodities agrícolas de Bangladesh, extraído com auxílio de LLM. Ele avalia sete abordagens de previsão, incluindo modelos clássicos e arquiteturas de deep learning, para estabilização da renda e segurança alimentar.

27
RESEARCHarXiv CS.CL·5/8/2026

When2Speak: A Dataset for Temporal Participation and Turn-Taking in Multi-Party Conversations for Large Language Models

When2Speak is a new synthetic dataset and four-stage generation pipeline designed to teach Large Language Models (LLMs) appropriate intervention timing in multi-party conversations. It addresses the challenge of avoiding excessive interruptions and improving conversational coherence in group interactions.

27
RESEARCHarXiv CS.LG·5/8/2026

Data-Driven Variational Basis Learning Beyond Neural Networks: A Non-Neural Framework for Adaptive Basis Discovery

This manuscript introduces Data Driven Variational Basis Learning (DVBL), a novel non-neural framework for learning data-adaptive basis functions directly from high-dimensional data. It provides an explicit, interpretable, and mathematically transparent alternative to neural networks for representation learning, addressing their limitations in control and transparency.

27
RESEARCHarXiv CS.LG·4/22/2026

Easy Samples Are All You Need: Self-Evolving LLMs via Data-Efficient Reinforcement Learning

This research introduces EasyRL, a novel data-efficient reinforcement learning approach for self-evolving LLMs, designed to overcome high annotation costs and performance issues in existing methods. Inspired by cognitive learning theory, EasyRL integrates knowledge transfer from easy labeled data with a progressive divide-and-conquer strategy for difficult unlabeled data.

27
RESEARCHarXiv CS.AI·5/7/2026

ANDRE: An Attention-based Neuro-symbolic Differentiable Rule Extractor

This paper introduces ANDRE, a novel Attention-based Neuro-symbolic Differentiable Rule Extractor (ILP) framework for learning first-order logic programs. It optimizes over a continuous rule space with fully differentiable, attention-driven logical operators, addressing scalability challenges in noisy and probabilistic settings.

27
RESEARCHarXiv CS.LG·5/5/2026

Sparse Regression under Correlation and Weak Signals: A Reproducible Benchmark of Classical and Bayesian Methods

This research benchmarks classical (Lasso, Ridge) and Bayesian (Horseshoe, Spike-and-Slab) sparse regression methods under challenging conditions like correlated features and weak signals. Bayesian methods generally outperform classical ones in prediction error, with Horseshoe offering excellent coverage, though classical methods are faster.

27
RESEARCHarXiv CS.LG·21d ago

Adaptive Multi-Scale Goodness Aggregation for Forward-Forward Learning

This paper introduces Adaptive Multi-Scale Goodness Aggregation (AMSGA), a novel extension of the Forward-Forward (FF) algorithm to enhance stability, robustness, and generalization in local-learning neural networks. AMSGA incorporates multi-scale goodness aggregation, adaptive hard negative mining, and layer-dependent thresholds. Experiments on MNIST and Fashion-MNIST show consistent performance improvements over the baseline FF algorithm.

27
RESEARCHarXiv CS.LG·28d ago

Hierarchical Multi-Scale Graph Neural Networks: Scalable Heterophilous Learning with Oversmoothing and Oversquashing Mitigation

The paper introduces Hierarchical Multi-view HAAR (HMH), a novel spectral graph-learning framework designed to address oversmoothing and degree-biased aggregation in GNNs for heterophilous graphs. HMH constructs a soft graph hierarchy and applies learnable spectral filters using sparse, orthonormal Haar bases, achieving near-linear time scalability.

27
RESEARCHarXiv CS.LG·29d ago

Do Foundation Model Embeddings Improve Cross-Country Crop Yield Generalisation? A Leave-One-Country-Out Evaluation in Sub-Saharan Africa

This paper evaluates whether geospatial foundation model embeddings improve cross-country maize yield predictions in Sub-Saharan Africa. It finds that while within-country predictions are moderate, all feature sets, including foundation model embeddings, perform poorly under cross-country testing, indicating a significant generalisability gap.

27
RESEARCHarXiv CS.LG·14d ago

When Rule Violations Are Rare: Chimera Training for Logical Anomaly Detection

This paper proposes a method for anomaly detection called Chimera Training, focusing on violations of semantic constraints given as logical rules over learned visual concepts. It employs a neural rule evaluator that compiles constraints into directed acyclic graphs, learning logical operators to calculate rule-satisfaction probabilities, even with scarce training data for actual violations.

27
RESEARCHarXiv CS.CL·14d ago

CroCo: Cross-Lingual Contrastive Preference Tuning on Self-Generations

This work introduces CroCo, a method for cross-lingual contrastive preference tuning on self-generated responses from LLMs, demonstrating effective transfer across 14 languages without language-specific preference annotations. An English-trained reward model yields useful rankings across most languages, improving existing models and preventing catastrophic forgetting, provided on-policy data is used.

27
RESEARCHarXiv CS.LG·8d ago

Automatically Differentiable Nonlinear Tensor Networks (ADNTNs) for Exponential Compression of Deep Neural Networks

This paper introduces Automatically Differentiable Nonlinear Tensor Networks (ADNTNs), a family of structured weight generators for exponential compression of Deep Neural Networks. It extends low-rank adaptation and tensor factorization by building large weight tensors through a hierarchy of small cores and nonlinear activations.

27