← heapsort-ai

deep learning

263 items

RESEARCHarXiv CS.LG·14d ago

Iterative Refinement Neural Operators are Learned Fixed-Point Solvers: A Principled Approach to Spectral Bias Mitigation

This paper introduces the Iterative Refinement Neural Operator (IRNO) to mitigate spectral bias in neural operators, using a learned refinement module via fixed-point iteration. IRNO decomposes predictions into a coarse initialization followed by successive residual corrections, achieving significant error reduction across physical systems.

28
RESEARCHarXiv CS.AI·4d ago

An interpretable and trustworthy AI framework for large-scale longitudinal structure-pain association studies using data from the Osteoarthritis Initiative (OAI)

This research develops an interpretable AI framework combining deep learning-based MRI Osteoarthritis Knee Score (MOAKS) prediction with interpretable statistical modeling to study structure-pain relationships at scale using data from the Osteoarthritis Initiative (OAI). It utilizes deep learning for MOAKS feature prediction from MRIs with uncertainty quantification, followed by a longitudinal latent class mixed model to examine associations between structural abnormalities and knee pain.

28
RESEARCHarXiv CS.LG·4/6/2026

From Broad Exploration to Stable Synthesis: Entropy-Guided Optimization for Autoregressive Image Generation

O artigo analisa a interação entre Chain-of-Thought (CoT) e Reinforcement Learning (RL) na geração de imagens a partir de texto (T2I) usando uma análise sistemática baseada em entropia. Ele revela que menor entropia dos tokens de imagem e do CoT textual se correlaciona com melhor qualidade de imagem, propondo a estratégia Entropy-Guided Group Relative Policy Optimization (EG-GRPO) para otimização com base na incerteza.

28
RESEARCHarXiv CS.LG·5/7/2026

Investigating Trustworthiness of Nonparametric Deep Survival Models for Alzheimer's Disease Progression Analysis

This research investigates the trustworthiness and fairness of nonparametric deep survival models for analyzing Alzheimer's Disease (AD) progression. It addresses the lack of studies considering learned bias in existing deep learning models for AD and proposes novel fairness metrics to ensure reliable predictions.

28
RESEARCHarXiv CS.LG·6d ago

Graph Mamba Survival Analysis Based on Topology-Aware ordering

This paper addresses challenges in Whole Slide Images (WSIs) survival analysis, specifically the computational bottleneck of Transformers and Mamba's sensitivity to input order and unidirectional architecture. It proposes a novel approach to overcome Mamba's limitations in capturing topological connectivity and bidirectional spatial structures.

28
RESEARCHarXiv CS.LG·4/17/2026

Awakening Dormant Experts:Counterfactual Routing to Mitigate MoE Hallucinations

Mixture-of-Experts (MoE) models are prone to hallucinations, particularly for long-tail knowledge, because static Top-k routing under-prioritizes specialist experts. Counterfactual Routing (CoR) is proposed as a training-free inference framework that uses perturbation analysis and CEI to dynamically shift computational resources, thereby awakening these dormant experts.

28
RESEARCHarXiv CS.LG·15d ago

Reading Calibrated Uncertainty from Language Model Trajectories

This research paper proposes a new method to quantify uncertainty in language models by tracing the cumulative path of per-layer MLP updates. By extracting eleven scale-invariant geometric features, a sparse linear probe is shown to outperform maximum softmax probability in evaluating uncertainty, especially with baseline miscalibration.

28