← heapsort-ai

machine learning

790 items

RESEARCHarXiv CS.CL·4/9/2026

The Stepwise Informativeness Assumption: Why are Entropy Dynamics and Reasoning Correlated in LLMs?

Este artigo investiga a correlação entre a dinâmica interna de entropia e o raciocínio correto em Large Language Models (LLMs), um enigma ainda sem solução. Propõe a Hipótese de Informatividade Gradual (SIA), que afirma que os modelos raciocinam corretamente ao acumular informações relevantes sobre a resposta por meio de prefixos informativos, um processo reforçado por métodos de treinamento padrão.

28
RESEARCHarXiv CS.LG·26d ago

Vision-Based Runtime Monitoring under Varying Specifications using Semantic Latent Representations

This paper investigates certified runtime monitoring of past-time signal temporal logic (ptSTL) from visual observations under partial observability. It proposes a reusable monitor that infers safety-relevant quantities from images and provides finite-sample guarantees, leveraging semantic latent representations to certify formulas without per-formula retraining.

28
RESEARCHarXiv CS.LG·5/5/2026

Linking spatial biology and clinical histology via Haiku

Haiku is a tri-modal contrastive learning model trained on multiplexed immunofluorescence (mIF), integrating molecular, morphological, and clinical data from over 1,600 patients. It enables three-way cross-modal retrieval, improves downstream classification and clinical prediction tasks, and supports zero-shot biomarker inference, outperforming competing approaches.

28
RESEARCHarXiv CS.LG·5/4/2026

FedACT: Concurrent Federated Intelligence across Heterogeneous Data Sources

Federated Learning enables private collaborative intelligence across decentralized data sources, but multi-task scenarios face challenges due to device heterogeneity and resource inefficiency. FedACT is introduced as a novel resource heterogeneity-aware device scheduling approach to efficiently manage multiple concurrent FL jobs, aiming to minimize their average job completion time.

28
RESEARCHarXiv CS.AI·4/9/2026

Weakly Supervised Distillation of Hallucination Signals into Transformer Representations

Este artigo propõe um novo método para detecção de alucinações em LLMs, destilando sinais de supervisão externa diretamente nas representações internas do modelo durante o treinamento. Para isso, introduz um framework de supervisão fraca que combina correspondência de substrings, similaridade de embeddings e um LLM como juiz, culminando na criação de um dataset de 15.000 amostras para este propósito.

28