machine learning

790 items

RESEARCHarXiv CS.AI·4/17/2026

Interpretable and Explainable Surrogate Modeling for Simulations: A State-of-the-Art Survey and Perspectives on Explainable AI for Decision-Making

This survey explores the integration of surrogate modeling and Explainable AI (XAI) for complex system simulations, addressing the inherent black-box nature of these models. It aims to reconnect these complementary fields by outlining how XAI can unpack surrogate models despite engineering constraints.

simulations Surrogate Models Decision-making machine learning

DOCDEV.to AI·4/21/2026

Fine-Tuning a Model in 2026: A Step-by-Step Guide

Fine-tuning is a crucial step for adapting pre-trained models to specific tasks, improving performance and reducing training time. This guide defines fine-tuning, its benefits, and the difference between full and parameter-efficient fine-tuning, highlighting the role of pre-trained models.

machine learning pre-trained-models large language models fine-tuning

RESEARCHarXiv CS.LG·5/8/2026

MidSteer: Optimal Affine Framework for Steering Generative Models

This paper formalizes the theory of concept steering in generative models, linking it to affine concept erasure and introducing LEACE-Switch. It then proposes MidSteer, a more general affine framework for concept manipulation with minimal disturbance.

model steering machine learning theoretical framework AI research

RESEARCHarXiv CS.CL·4/17/2026

Decoupling Scores and Text: The Politeness Principle in Peer Review

This study investigates the difficulty of interpreting peer review feedback, comparing the effectiveness of numerical scores versus text in predicting acceptance. The research reveals that score-based models are significantly more accurate (91%) than text-based models (81% even with LLMs), indicating textual information is considerably less reliable.

machine learning Natural Language Processing large language models peer review

RESEARCHarXiv CS.CL·4/17/2026

Can Large Language Models Detect Methodological Flaws? Evidence from Gesture Recognition for UAV-Based Rescue Operation Based on Deep Learning

This research investigates whether Large Language Models (LLMs) can identify methodological flaws, such as data leakage, in published machine learning studies. A case study showed six state-of-the-art LLMs consistently detected evaluation flaws in a gesture recognition paper due to non-independent data partitioning.

deep learning machine learning large language models AI evaluation

RESEARCHarXiv CS.CL·4/24/2026

Machine learning and digital pragmatics: Which word category influences emoji use most?

This study employs Machine Learning, specifically the MARBERT model, to predict emoji use in Arabic tweets collected from X.com. The model achieved an overall accuracy of 0.75, indicating promising results while highlighting a need for further model improvement.

Emoji Prediction Social Media Analysis Arabic Language machine learning

RESEARCHarXiv CS.LG·5/4/2026

Smart Ensemble Learning Framework for Predicting Groundwater Heavy Metal Pollution

This study develops a predictive framework to model the Heavy Metal Pollution Index (HPI) in groundwater, integrating response transformations with nested cross-validated ensemble machine learning. It aims to overcome challenges posed by statistical complexity and spatial heterogeneity of contaminants that affect conventional prediction methods.

ensemble learning heavy metal contamination machine learning predictive modeling

RESEARCHarXiv CS.LG·5/8/2026

Adaptive Computation Depth via Learned Token Routing in Transformers

This paper introduces Token-Selective Attention (TSA), a mechanism for Transformer architectures that enables adaptive computation depth per token. TSA learns to route tokens based on contextual difficulty, saving 14-23% of token-layer operations with minimal quality loss.

neural networks deep learning machine learning efficiency

RESEARCHarXiv CS.CL·4/24/2026

Hierarchical Policy Optimization for Simultaneous Translation of Unbounded Speech

This paper introduces Hierarchical Policy Optimization (HPO) for Simultaneous Speech Translation (SST) using LLMs, addressing challenges like high computational cost and imperfect supervised fine-tuning data. HPO employs a hierarchical reward to balance translation quality and latency, demonstrating substantial improvements in COMET and MetricX scores.

LLMs machine learning Natural Language Processing speech-translation

RESEARCHarXiv CS.CL·4/24/2026

Weighting What Matters: Boosting Sample Efficiency in Medical Report Generation via Token Reweighting

This work introduces a token reweighting loss function to enhance data efficiency in training vision-language models for medical report generation. By prioritizing semantically salient tokens, the method achieves comparable report quality using up to ten times less training data.

Data efficiency machine learning computer vision natural language generation

RESEARCHarXiv CS.LG·4/21/2026

A Discordance-Aware Multimodal Framework with Multi-Agent Clinical Reasoning

This research proposes a discordance-aware multimodal framework for knee osteoarthritis, integrating machine learning prediction models with a multi-agent reasoning system. It leverages various data modalities, including tabular features, MRI, and X-ray embeddings, to predict joint space loss and pain progression.

multimodal AI machine learning multi-agent systems medical AI

RESEARCHarXiv CS.CL·20d ago

FlowLM: Few-Step Language Modeling via Diffusion-to-Flow Adaptation

FlowLM introduces a novel flow matching language model, adapted from pre-trained diffusion models through efficient fine-tuning. This method enables high-quality, few-step text generation that significantly outperforms traditional diffusion sampling with fewer training epochs.

Diffusion Models language models machine learning text generation

RESEARCHarXiv CS.LG·5/4/2026

Human-in-the-Loop Meta Bayesian Optimization for Fusion Energy and Scientific Applications

This paper introduces Human-in-the-Loop Meta Bayesian Optimization (HL-MBO), a framework combining expert knowledge with few-shot machine learning to accelerate discovery in data-scarce scientific domains. It outperforms current Bayesian Optimization methods in fusion energy yield optimization and other benchmarks.

Bayesian Optimization machine learning Fusion Energy scientific research

RESEARCHarXiv CS.LG·5/4/2026

Soft-MSM: Differentiable Context-Aware Elastic Alignment for Time Series

This research introduces Soft-MSM, a novel differentiable elastic alignment loss for time series, building upon the Move-Split-Merge (MSM) distance. Soft-MSM addresses the limitation of Soft-DTW by incorporating context-aware transition costs, making it suitable for gradient-based optimization in machine learning tasks like classification and clustering.

elastic alignment machine learning Soft-MSM dynamic time warping

RESEARCHarXiv CS.LG·4/21/2026

CGCMA: Conditionally-Gated Cross-Modal Attention for Event-Conditioned Asynchronous Fusion

This paper studies asynchronous alignment in multimodal learning, where a dense primary stream must be fused with sporadic external context, requiring models to reason explicitly about freshness and trust. It proposes CGCMA (Conditionally-Gated Cross-Modal Attention), a model that separates text-conditioned grounding from lag-aware trust control, tested on cryptocurrency markets.

multimodal AI machine learning Attention Mechanisms Time Series Analysis

RESEARCHarXiv CS.CL·4/21/2026

Brain-CLIPLM: Decoding Compressed Semantic Representations in EEG for Language Reconstruction

This work proposes a semantic compression hypothesis to overcome limitations in EEG-to-text decoding, suggesting that EEG signals encode compressed semantic anchors rather than full linguistic structure. It introduces Brain-CLIPLM, a two-stage framework for semantic anchor extraction via contrastive learning and sentence reconstruction using a retrieval-grounded large language model.

Brain-Computer Interface (BCI)deep learning machine learning Natural Language Processing (NLP)

RESEARCHarXiv CS.AI·5/4/2026

TUR-DPO: Topology- and Uncertainty-Aware Direct Preference Optimization

TUR-DPO is a novel topology- and uncertainty-aware variant of Direct Preference Optimization (DPO) designed to better align large language models (LLMs) with human preferences. It improves upon DPO by considering reasoning topologies and uncertainty signals, rewarding how answers are derived, not only what they say.

reinforcement learning DPO AI alignment machine learning

RESEARCHarXiv CS.LG·5/4/2026

What Physics do Data-Driven MoCap-to-Radar Models Learn?

This research introduces a physics-based interpretability framework to assess what physics data-driven MoCap-to-radar models learn. It finds that low reconstruction error doesn't guarantee physical consistency, and temporal attention is critical for transformer-based models to learn the underlying physics.

Physics Motion Capture machine learning interpretability

RESEARCHarXiv CS.AI·5/6/2026

Understanding Emergent Misalignment via Feature Superposition Geometry

This paper proposes a geometric account based on feature superposition to explain emergent misalignment in LLMs, where fine-tuning on narrow, non-harmful tasks can induce harmful behaviors. It demonstrates that features tied to misalignment-inducing data are geometrically closer to harmful features than those from non-inducing data.

feature superposition LLMs machine learning misalignment

RESEARCHarXiv CS.AI·25d ago

Enhanced and Efficient Reasoning in Large Learning Models

This paper proposes an efficient and principled method to enhance reasoning in Large Language Models, addressing the current lack of trustworthiness in produced content. It involves a preprocessing stage using a Unary Relational Integracode followed by a streamlined machine learning process.

model efficiency machine learning Reasoning data preprocessing