machine learning

790 items

RESEARCH·arXiv CS.LG·4/27/2026

Kernel Contracts: A Specification Language for ML Kernel Correctness Across Heterogeneous Silicon

This research proposes a specification language for ML kernel contracts to formally define their expected behavior across heterogeneous silicon platforms. It introduces an eight-part contract structure and twelve contract classes to arbitrate disputes arising from precision, ordering, or other failure modes.

machine learning Verification software engineering

RESEARCH·arXiv CS.AI·5/9/2026

Understanding Annotator Safety Policy with Interpretability

The paper addresses the challenge of understanding annotator disagreement over AI safety policies, which can arise from operational failures, policy ambiguity, or value pluralism. It highlights the difficulty of discerning the root causes of these disagreements and the unreliability of annotators' self-reported reasoning.

policy machine learning Data Annotation interpretability

RESEARCH·arXiv CS.CL·4/17/2026

How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data

This research proposes TESSY, a Teacher-Student Cooperation Data Synthesis framework, to address performance drops when fine-tuning reasoning models with teacher-generated data. TESSY enables the generation of synthetic sequences that inherit advanced reasoning from the teacher while maintaining stylistic consistency with the student model's distribution.

data synthesis machine learning code generation large language models

RESEARCH·arXiv CS.LG·4/16/2026

The Long Delay to Arithmetic Generalization: When Learned Representations Outrun Behavior

This research investigates the 'grokking' phenomenon in transformers, finding that the long delay to generalization in arithmetic models stems from a decoder bottleneck. The encoder acquires relevant structural knowledge early, but the decoder struggles to access it, a hypothesis supported by causal interventions like transplanting encoders.

grokking machine learning representation learning Transformers

RESEARCH·arXiv CS.LG·5/1/2026

NORACL: Neurogenesis for Oracle-free Resource-Adaptive Continual Learning

The paper proposes NORACL, inspired by biological neurogenesis, to address the stability-plasticity dilemma in continual learning. It tackles the oracle architecture problem, where finite networks have limited resources for unknown future tasks.

neural networks machine learning neurogenesis Continual Learning

RESEARCH·arXiv CS.CL·4/16/2026

Caption First, VQA Second: Knowledge Density, Not Task Format, Drives Multimodal Scaling

This paper argues that the primary bottleneck in multimodal scaling for MLLMs is knowledge density in training data, rather than task format. It demonstrates that task-specific supervision like VQA adds little incremental semantic information beyond image captions, and that increasing knowledge density leads to consistent performance improvements.

multimodal AI LLMs machine learning Research Paper

RESEARCH·arXiv CS.LG·4/27/2026

Multi-Task Optimization over Networks of Tasks

MONET (Multi-Task Optimization over Networks of Tasks) is a new multi-task optimization algorithm that models the task space as a graph and exploits its topology to transfer knowledge between tasks. This approach addresses the scalability issues of population-based methods and combines social with individual learning.

multi-task optimization optimization algorithms machine learning graph-based models

RESEARCH·arXiv CS.AI·4/20/2026

Subliminal Transfer of Unsafe Behaviors in AI Agent Distillation

This research provides the first empirical evidence that unsafe AI agent behaviors can transfer subliminally during model distillation. Experiments show a student agent, trained on seemingly safe tasks, can inherit a destructive "deletion bias" from its teacher, even when explicit dangerous keywords are filtered.

machine learning Model Distillation agent systems AI safety

RESEARCH·arXiv CS.LG·5/8/2026

Horizon-Constrained Rashomon Sets for Chaotic Forecasting

This research introduces horizon-constrained Rashomon sets, a theoretical framework bridging predictive multiplicity and chaotic dynamics in machine learning. It demonstrates how model multiplicity evolves with prediction horizon in chaotic systems, proving exponential contraction of the effective Rashomon set with lead time.

forecasting machine learning chaotic dynamics Rashomon sets

RESEARCH·arXiv CS.LG·4/17/2026

Graph-Based Fraud Detection with Dual-Path Graph Filtering

This paper proposes DPF-GFD, a graph-based fraud detection model with dual-path graph filtering, to address challenges in fraud detection on graph data. It applies a beta wavelet-based operator and an improved low-pass filter on a similarity graph to capture key structural patterns.

Graph Neural Networks machine learning fraud detection

RESEARCH·arXiv CS.LG·4/27/2026

When Quotes Crumble: Detecting Transient Mechanical Liquidity Erosion in Limit Order Books

This research introduces a method for detecting transient liquidity erosion ("crumbling quotes") in electronic limit order books, differentiating between mechanical liquidity withdrawal and informational repricing. Using an ABIDES multi-agent simulator for ground truth, a neural model is developed that significantly outperforms rule-based baselines in identifying crumbling events across diverse market conditions.

neural networks machine learning predictive modeling financial markets

RESEARCH·arXiv CS.CL·4/27/2026

Knowledge-driven Augmentation and Retrieval for Integrative Temporal Adaptation

KARITA (Knowledge-driven Augmentation and Retrieval for Integrative Temporal Adaptation) is a system that addresses temporal shift in AI models, which are trained on historical data but deployed on future data. It integrates knowledge-driven augmentation and retrieval to capture diverse shifts and improve temporal adaptation across multiple domains.

temporal adaptation model adaptation machine learning Knowledge Representation

RESEARCH·DEV.to AI·4/8/2026

Neural Models for Information Retrieval

This content covers the use of neural models to improve information retrieval systems. It explores how artificial intelligence can optimize the search and organization of large volumes of data.

neural networks deep learning machine learning Information Retrieval

RESEARCH·arXiv CS.LG·4/20/2026

The Spectral Geometry of Thought: Phase Transitions, Instruction Reversal, Token-Level Dynamics, and Perfect Correctness Prediction in How Transformers Reason

This research paper reports spectral phase transitions in the hidden activation spaces of large language models during reasoning versus factual recall. A systematic spectral analysis across 11 models and 5 architecture families identifies seven core phenomena, including reasoning spectral compression and instruction tuning spectral reversal.

neural networks LLMs machine learning AI research

RESEARCH·arXiv CS.LG·5/8/2026

SAT: Sequential Agent Tuning for Coordinator Free Plug and Play Multi-LLM Training with Monotonic Improvement Guarantees

Sequential Agent Tuning (SAT) introduces a coordinator-free training paradigm for teams of smaller, more efficient LLMs, enabling scalable, decentralized updates. This framework provides theoretical guarantees for monotonic improvement by isolating occupancy drift with per-agent KL trust regions.

LLMs research AI Training Distributed AI

RESEARCH·arXiv CS.LG·20d ago

Graph Transductive Sharpening: Leveraging Unlabeled Predictions in Node Classification

This paper introduces Transductive Sharpening (TS), a novel loss-level modification for semi-supervised node classification. It leverages predictions on unlabeled nodes by minimizing prediction entropy to extract useful training signals often discarded by standard supervised objectives.

semi-supervised learning Graph Neural Networks machine learning Node Classification
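
As a rough illustration of the entropy-minimization idea in the summary above, here is a minimal sketch that is not taken from the paper. It adds a mean prediction-entropy term over unlabeled nodes to a standard cross-entropy term over labeled nodes. The function names, the `weight` parameter, and the way the two terms are combined are all assumptions made for this example.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def entropy(probs):
    """Shannon entropy (nats) of a predicted class distribution."""
    return -sum(p * math.log(p + 1e-12) for p in probs)

def sharpened_loss(logits_per_node, labels, weight=0.5):
    """Cross-entropy on labeled nodes plus weighted entropy on unlabeled ones.

    labels maps node index -> class for labeled nodes; every other node
    is treated as unlabeled. `weight` is a hypothetical trade-off knob.
    """
    supervised, unsupervised = [], []
    for i, logits in enumerate(logits_per_node):
        probs = softmax(logits)
        if i in labels:
            supervised.append(-math.log(probs[labels[i]] + 1e-12))
        else:
            unsupervised.append(entropy(probs))
    sup = sum(supervised) / len(supervised) if supervised else 0.0
    unsup = sum(unsupervised) / len(unsupervised) if unsupervised else 0.0
    return sup + weight * unsup
```

With identical labeled predictions, the loss is lower when the unlabeled node's prediction is confident than when it is uniform, which is the training signal that a purely supervised objective would discard.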

RESEARCH·arXiv CS.LG·20d ago

Neural Estimation of Pairwise Mutual Information in Masked Discrete Sequence Models

The paper proposes a neural framework to estimate pairwise conditional mutual information (MI) directly from the hidden states of pretrained masked diffusion models (MDMs). This method captures dependency structures and enables MI-guided parallel decoding, showing utility in Sudoku and protein sequence generation by recovering known structural constraints.

neural networks information theory machine learning sequence models
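
For readers unfamiliar with the quantity being estimated, the sketch below computes exact mutual information between two discrete variables from a known joint probability table. It is not the paper's neural estimator, which works from hidden states. It only shows the target quantity in a toy setting where the joint distribution (for a fixed context) is assumed to be given.

```python
import math

def mutual_information(joint):
    """Mutual information I(X;Y) in nats from a joint table joint[x][y].

    Assumes the entries are non-negative and sum to 1.
    """
    px = [sum(row) for row in joint]            # marginal of X
    py = [sum(col) for col in zip(*joint)]      # marginal of Y
    mi = 0.0
    for i, row in enumerate(joint):
        for j, pxy in enumerate(row):
            if pxy > 0:                         # 0 * log(0) is taken as 0
                mi += pxy * math.log(pxy / (px[i] * py[j]))
    return mi
```

Two independent positions give zero mutual information, while two positions that always agree on a binary value give log 2 nats. Under this reading, position pairs with low mutual information are the natural candidates for decoding in parallel.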

RESEARCH·arXiv CS.CL·4/17/2026

Chinese Essay Rhetoric Recognition Using LoRA, In-context Learning and Model Ensemble

This paper explores Chinese essay rhetoric recognition using Large Language Models (LLMs), LoRA, and in-context learning to assess linguistic and higher-order thinking skills. The proposed method achieved the best performance and won first prize in the CCL 2025 Chinese essay rhetoric recognition evaluation task.

AI for education LLMs machine learning rhetoric recognition

RESEARCH·arXiv CS.LG·4/9/2026

SMT-AD: a scalable quantum-inspired anomaly detection approach

SMT-AD is a new quantum-inspired approach to anomaly detection that uses tensor networks and Fourier-assisted feature embedding. The method proved effective on standard datasets, such as credit card transactions, achieving competitive performance even with minimal configurations.

Anomaly Detection machine learning tensor networks feature embedding

RESEARCH·arXiv CS.LG·4/16/2026

Spectral Entropy Collapse as an Empirical Signature of Delayed Generalisation in Grokking

This paper identifies normalized spectral entropy as a scalar order parameter for the grokking transition, where models generalize long after memorization. The research shows that entropy collapse precedes generalization, and causal interventions confirm its critical role, providing a predictive model for grokking onset.

neural networks grokking Generalization deep learning
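
One common way to define a normalized spectral entropy is sketched below, assuming the spectrum is a list of singular values that is normalized into a probability distribution and whose Shannon entropy is divided by its maximum, log n. The paper may use a different spectrum or normalization, so treat the function name and definition as illustrative assumptions.

```python
import math

def normalized_spectral_entropy(singular_values):
    """Entropy of the normalized spectrum, scaled to the range [0, 1].

    singular_values: non-negative floats, for example from an SVD of a
    weight or activation matrix (computed elsewhere).
    """
    total = sum(singular_values)
    if total <= 0:
        raise ValueError("spectrum must contain a positive value")
    n = len(singular_values)
    if n < 2:
        return 0.0
    # Zero singular values contribute nothing to the entropy.
    probs = [s / total for s in singular_values if s > 0]
    h = -sum(p * math.log(p) for p in probs)
    return h / math.log(n)
```

A flat spectrum gives a value near 1, and a spectrum dominated by a single direction gives a value near 0, so a collapse in this scalar corresponds to the representation concentrating on a few directions.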