← heapsort-ai

medical AI

34 items

RESEARCHarXiv CS.LG·27d ago

Interpretable EEG Microstate Discovery via Variational Deep Embedding: A Systematic Architecture Search with Multi-Quadrant Evaluation

This paper introduces the Convolutional Variational Deep Embedding (Conv-VaDE) model for EEG microstate analysis. It enhances interpretability by jointly learning topographic reconstruction and probabilistic soft clustering, enabling generative decoding of cluster prototypes into verifiable scalp topographies.

29
RESEARCHarXiv CS.LG·5/7/2026

Investigating Trustworthiness of Nonparametric Deep Survival Models for Alzheimer's Disease Progression Analysis

This research investigates the trustworthiness and fairness of nonparametric deep survival models for analyzing Alzheimer's Disease (AD) progression. It addresses the lack of studies considering learned bias in existing deep learning models for AD and proposes novel fairness metrics to ensure reliable predictions.

28
RESEARCHarXiv CS.CL·27d ago

ClinicalBench: Stress-Testing Assertion-Aware Retrieval for Cross-Admission Clinical QA on MIMIC-IV

This paper introduces ClinicalBench, a 400-question benchmark designed to stress-test assertion-aware retrieval for cross-admission clinical QA on MIMIC-IV using real EHR notes. It also presents EpiKG, a patient knowledge graph system that incorporates assertion and temporality tags to route retrieval by question intent, demonstrating significant performance improvements across various LLMs.

28
RESEARCHarXiv CS.LG·4/15/2026

DBGL: Decay-aware Bipartite Graph Learning for Irregular Medical Time Series Classification

DBGL introduces a novel Decay-Aware Bipartite Graph Learning method to address the challenges of irregular medical time series classification. It utilizes a patient-variable bipartite graph to model irregular sampling patterns and variable relationships, alongside a node-specific temporal decay encoding for variable decay irregularity.

28
RESEARCHarXiv CS.CL·18d ago

When Cases Get Rare: A Retrieval Benchmark for Off-Guideline Clinical Question Answering

This paper introduces OGCaReBench, a new retrieval-focused benchmark aimed at evaluating LLMs' ability to answer clinical questions that go beyond typical medical guidelines. It addresses the gap where most medical LLMs are trained on common, guideline-focused knowledge, while real-world care often involves rare cases not covered by guidelines.

28
RESEARCHarXiv CS.CL·4/10/2026

EMSDialog: Synthetic Multi-person Emergency Medical Service Dialogue Generation from Electronic Patient Care Reports via Multi-LLM Agents

O estudo apresenta o EMSDialog, um novo conjunto de dados de 4.414 conversas sintéticas multi-falantes para serviços médicos de emergência, geradas a partir de relatórios reais de pacientes usando uma pipeline de agentes multi-LLM. Este dataset, anotado com diagnósticos e tópicos, demonstra melhorias na precisão e estabilidade da previsão de diagnóstico conversacional.

27
RESEARCHarXiv CS.LG·5/1/2026

People-Centred Medical Image Analysis

Despite accurate diagnostic systems from data-centric medical AI, widespread clinical adoption is limited due to insufficient attention to fair performance across diverse patient populations and poor workflow integration. This paper proposes a 'People-Centred Medical Image Analysis' approach to address these interconnected challenges, which prior work has typically examined in isolation.

27
ARTICLEDEV.to AI·22d ago

Medical AI Doesn’t Just Need Bigger Models. It Needs an ImageNet for State Transitions

This article proposes the creation of "Biomedical TransitionNet", a new type of dataset analogous to ImageNet, but focused on biological state transitions for the next generation of medical AI. It argues for the necessity of such infrastructure to build real-world models in biomedicine, moving beyond classification and prediction.

27