← heapsort-ai

machine learning

790 items

RESEARCHarXiv CS.AI·4/17/2026

Interpretable and Explainable Surrogate Modeling for Simulations: A State-of-the-Art Survey and Perspectives on Explainable AI for Decision-Making

This survey explores the integration of surrogate modeling and Explainable AI (XAI) for complex system simulations, addressing the inherent black-box nature of these models. It aims to reconnect these complementary fields by outlining how XAI can unpack surrogate models despite engineering constraints.

27
RESEARCHarXiv CS.CL·4/17/2026

Decoupling Scores and Text: The Politeness Principle in Peer Review

This study investigates the difficulty of interpreting peer review feedback, comparing the effectiveness of numerical scores versus text in predicting acceptance. The research reveals that score-based models are significantly more accurate (91%) than text-based models (81% even with LLMs), indicating textual information is considerably less reliable.

27
RESEARCHarXiv CS.CL·4/17/2026

Can Large Language Models Detect Methodological Flaws? Evidence from Gesture Recognition for UAV-Based Rescue Operation Based on Deep Learning

This research investigates whether Large Language Models (LLMs) can identify methodological flaws, such as data leakage, in published machine learning studies. A case study showed six state-of-the-art LLMs consistently detected evaluation flaws in a gesture recognition paper due to non-independent data partitioning.

27
RESEARCHarXiv CS.LG·5/4/2026

Smart Ensemble Learning Framework for Predicting Groundwater Heavy Metal Pollution

This study develops a predictive framework to model the Heavy Metal Pollution Index (HPI) in groundwater, integrating response transformations with nested cross-validated ensemble machine learning. It aims to overcome challenges posed by statistical complexity and spatial heterogeneity of contaminants that affect conventional prediction methods.

27
RESEARCHarXiv CS.CL·4/24/2026

Hierarchical Policy Optimization for Simultaneous Translation of Unbounded Speech

This paper introduces Hierarchical Policy Optimization (HPO) for Simultaneous Speech Translation (SST) using LLMs, addressing challenges like high computational cost and imperfect supervised fine-tuning data. HPO employs a hierarchical reward to balance translation quality and latency, demonstrating substantial improvements in COMET and MetricX scores.

27
RESEARCHarXiv CS.LG·5/4/2026

Human-in-the-Loop Meta Bayesian Optimization for Fusion Energy and Scientific Applications

This paper introduces Human-in-the-Loop Meta Bayesian Optimization (HL-MBO), a framework combining expert knowledge with few-shot machine learning to accelerate discovery in data-scarce scientific domains. It outperforms current Bayesian Optimization methods in fusion energy yield optimization and other benchmarks.

27
RESEARCHarXiv CS.LG·5/4/2026

Soft-MSM: Differentiable Context-Aware Elastic Alignment for Time Series

This research introduces Soft-MSM, a novel differentiable elastic alignment loss for time series, building upon the Move-Split-Merge (MSM) distance. Soft-MSM addresses the limitation of Soft-DTW by incorporating context-aware transition costs, making it suitable for gradient-based optimization in machine learning tasks like classification and clustering.

27
RESEARCHarXiv CS.LG·4/21/2026

CGCMA: Conditionally-Gated Cross-Modal Attention for Event-Conditioned Asynchronous Fusion

This paper studies asynchronous alignment in multimodal learning, where a dense primary stream must be fused with sporadic external context, requiring models to reason explicitly about freshness and trust. It proposes CGCMA (Conditionally-Gated Cross-Modal Attention), a model that separates text-conditioned grounding from lag-aware trust control, tested on cryptocurrency markets.

27
RESEARCHarXiv CS.CL·4/21/2026

Brain-CLIPLM: Decoding Compressed Semantic Representations in EEG for Language Reconstruction

This work proposes a semantic compression hypothesis to overcome limitations in EEG-to-text decoding, suggesting that EEG signals encode compressed semantic anchors rather than full linguistic structure. It introduces Brain-CLIPLM, a two-stage framework for semantic anchor extraction via contrastive learning and sentence reconstruction using a retrieval-grounded large language model.

27