← heapsort-ai

machine learning

790 items

RESEARCHarXiv CS.AI·4/16/2026

Optimizing Earth Observation Satellite Schedules under Unknown Operational Constraints: An Active Constraint Acquisition Approach

This paper addresses Earth Observation satellite scheduling under unknown operational constraints, which must be learned interactively from a binary oracle. The authors introduce Conservative Constraint Acquisition (CCA), a domain-specific procedure, to efficiently identify justified constraints for a simplified model.

28
RESEARCHarXiv CS.CL·5/1/2026

Selective Augmentation: Improving Universal Automatic Phonetic Transcription via G2P Bootstrapping

This research proposes Selective Augmentation, a bootstrapping method to improve universal automatic phonetic transcription (APT) by selectively transferring linguistic distinctions to address limited high-quality training data. Exemplified with the MultIPA model, the approach enhanced plosive voicing accuracy by 17.6% and introduced aspiration recognition using data augmented from a helper language like Hindi.

28
RESEARCHarXiv CS.LG·4/17/2026

Explainable Graph Neural Networks for Interbank Contagion Surveillance: A Regulatory-Aligned Framework for the U.S. Banking Sector

The ST-GAT framework provides an explainable Graph Neural Network solution for early detection of bank distress and interbank contagion surveillance in the U.S. banking sector. It models over 8,000 FDIC institutions using dynamic graphs, achieving high performance (AUPRC 0.939) and identifying key predictive factors like ROA and NPL Ratio.

28
RESEARCHarXiv CS.LG·4/22/2026

FASE : A Fairness-Aware Spatiotemporal Event Graph Framework for Predictive Policing

FASE is a Fairness-Aware Spatiotemporal Event Graph framework designed to integrate crime prediction with fairness-constrained patrol allocation to mitigate racial disparities in predictive policing. It utilizes a spatiotemporal graph neural network and a multivariate Hawkes process to model crime incidents in Baltimore, addressing data bias through a closed-loop deployment simulator.

28
RESEARCHarXiv CS.CL·16d ago

Learnability-Informed Fine-Tuning of Diffusion Language Models

This research introduces LIFT, a learnability-informed fine-tuning algorithm designed to enhance the reasoning capabilities of diffusion language models. LIFT addresses the shortcomings of standard SFT by adaptively learning tokens based on their difficulty and available context during different diffusion time steps, showing improved performance over existing baselines.

28
RESEARCHarXiv CS.LG·6d ago

Inverse Critical Experiment Design via Gradient Optimization and a Multigroup Attention-Based Neural Network Architecture

This research presents a methodology for the inverse design of critical experiments, essential for validating advanced nuclear reactor designs. It employs deep neural network surrogate modeling and nonparametric gradient optimization to generate experiment geometries that maximize neutronic similarity.

28
RESEARCHarXiv CS.LG·6d ago

Novel Aspects of IEEE SA P3109 Arithmetic Formats for Machine Learning

The IEEE P3109 draft standard defines parameterized binary floating-point formats and associated operations, specifically designed to facilitate machine learning by allowing efficient representation in few bits. It ensures exception-free operations by explicit treatment of NaN and infinities, with results communicated via return values.

28
RESEARCHarXiv CS.LG·8d ago

DAStatFormer: A Hybrid Multibranch Transformer with Statistical Feature Integration for DAS-Based Pattern Recognitions

DAStatFormer is a hybrid multibranch Transformer proposed to overcome the challenges of high dimensionality and complex spatio-temporal patterns in Distributed Acoustic Sensing (DAS). It integrates compact statistical features from multiple domains, significantly reducing data size and enhancing event classification.

28