← heapsort-ai

machine learning

790 items

RESEARCHarXiv CS.LG·14d ago

TSFMAudit: Data Contamination Auditing in Forecasting Time Series Foundation Models

This work introduces TSFMAudit, a novel method for auditing data contamination in Time Series Foundation Models (TSFMs) during pretraining. It detects when evaluation datasets have been unduly exposed, leading to overly optimistic performance estimates, by observing unusually efficient adaptation during fine-tuning. The study evaluates TSFMAudit on 6 TSFMs and 187 datasets, addressing a previously unstudied challenge in pretraining contamination auditing for TSFMs.

29
RESEARCHarXiv CS.CL·20d ago

Data Scaling as Progressive Coverage of a Predictive Contribution Spectrum

This research investigates whether real-data scaling laws are governed by a progressive coverage of a latent predictive contribution spectrum, rather than solely by token-frequency. Using a suffix-automaton and a global-KL predictive contribution spectrum, the study finds a strong correlation between the spectrum's tail slope and the data-scaling exponent of GPT learners, showing that effective truncation rank scales logarithmically.

29
RESEARCHarXiv CS.AI·23d ago

Fair outputs, Biased Internals: Causal Potency and Asymmetry of Latent Bias in LLMs for High-Stakes Decisions

This research paper explores the disconnect between fair outputs of language models and their latent internal biases in high-stakes decisions like mortgage underwriting. It demonstrates that while LLMs may show no output bias, they retain and amplify demographic representations which can cause decision reversals, and this bias is asymmetric.

29
RESEARCHarXiv CS.LG·6d ago

Early Detection of Alzheimer's Disease Using Explainable Machine Learning on Clinical Biomarkers: A Multi-Class Classification Study Using the Alzheimer's Disease Neuroimaging Initiative (ADNI) Dataset

An XGBoost classifier was developed using clinical features from the ADNI dataset for multi-class detection of normal cognition, mild cognitive impairment, and Alzheimer's disease. The model achieved a high mean macro AUC of 0.983 and an accuracy of 0.944, with SHAP values providing feature explainability.

29
RESEARCHarXiv CS.AI·8d ago

Universal Quantum Transformer

The Universal Quantum Transformer (UQT) is a novel quantum-native computing architecture designed to overcome classical neural networks' struggles with exact mathematical symmetries. It leverages physical properties of multi-qubit systems for precise mathematical and algebraic reasoning, demonstrating perfect learning of cyclic modular arithmetic on a compact 5-qubit substrate.

29
RESEARCHarXiv CS.LG·4/16/2026

Adaptive Memory Crystallization for Autonomous AI Agent Learning in Dynamic Environments

This research introduces Adaptive Memory Crystallization (AMC), a novel memory architecture designed for autonomous AI agents to progressively consolidate experiences in dynamic environments without forgetting prior knowledge. AMC models memory as a continuous crystallization process across a three-phase hierarchy, inspired by synaptic tagging and capture theory and governed by stochastic differential equations.

29
RESEARCHarXiv CS.LG·4/16/2026

Generalization Guarantees on Data-Driven Tuning of Gradient Descent with Langevin Updates

This paper introduces the Langevin Gradient Descent (LGD) algorithm for convex regression problems, proving that optimal hyperparameter configurations achieve the Bayes' optimal solution. The work also provides generalization guarantees for meta-learning LGD's optimal hyperparameters, with a pseudo-dimension bound of O(dh).

29
ARTICLEDEV.to AI·27d ago

Everything Google announced at its Android Show, from Googlebooks to vibe-coded widgets

The article technically analyzes Google's Android Show announcements, focusing on the new Google Books app and vibe-coded widgets. It details how Google Books uses a proprietary rendering engine with ML for text recognition, while vibe-coded widgets leverage NLP and computer vision via TensorFlow Lite for personalized experiences.

29
RESEARCHarXiv CS.LG·5/7/2026

Structured Progressive Knowledge Activation for LLM-Driven Neural Architecture Search

This paper introduces Structured Progressive Knowledge Activation (SPARK) to address the challenge of integrating architectural knowledge in LLM-driven Neural Architecture Search (NAS). SPARK mitigates "functional entanglement" by enabling factor-conditioned editing, leading to more targeted and reliable architecture modifications.

29
RESEARCHarXiv CS.LG·4/22/2026

Towards Understanding the Robustness of Sparse Autoencoders

This research explores the robustness implications of Sparse Autoencoders (SAEs) against jailbreak attacks on Large Language Models (LLMs). Integrating pretrained SAEs at inference time significantly reduces jailbreak success rates by up to 5x and decreases cross-model attack transferability across various LLM families.

29
ARTICLEDEV.to AI·15d ago

ความหมายของ 'ความหมาย': เมื่อ AI ค้นหาเส้นแบ่งระหว่างการจดจำกับภาพลวง

This article delves into how AI 'undrstands meaning' compared to humans, through the lens of neuroscience, AI ethics, and constrained creativity. The core philosophical and technical question is whether AI genuinely 'understands meaning' or merely creates an illusion of continuity, unlike human memory which involves continuous selection and interpretation.

29
RESEARCHarXiv CS.LG·4/22/2026

Handling and Interpreting Missing Modalities in Patient Clinical Trajectories via Autoregressive Sequence Modeling

This work addresses the challenge of missing modalities in multimodal clinical data for diagnosis by reframing it as an autoregressive sequence modeling task. It leverages causal decoders from LLMs and a missingness-aware contrastive pre-training to outperform baselines on benchmarks like MIMIC-IV and eICU.

29
ARTICLEDEV.to AI·27d ago

Lambda — Deep Dive

Lambda is a specialized AI infrastructure provider focused on GPU compute and machine learning tooling, carving a critical niche in the AI hardware landscape. Unlike generalist hyperscalers, the company's mission is to enable seamless transitions from prototypes to massive production workloads for its diverse customer base.

29
RESEARCHarXiv CS.LG·20d ago

Provably Learning Diffusion Models under the Manifold Hypothesis: Collapse and Refine

This paper provides a theoretical explanation for the efficiency of diffusion models in learning the score function for high-dimensional data supported on low-dimensional manifolds. It identifies a "collapse-and-refine" mechanism driven by the geometry of the score function, where the denoising map projects onto the data manifold and refines the intrinsic density.

29