← heapsort-ai

machine learning

790 items

RESEARCHarXiv CS.LG·4/6/2026

Modeling and Controlling Deployment Reliability under Temporal Distribution Shift

Este artigo propõe uma estrutura centrada na implantação para modelar a confiabilidade de modelos de machine learning em ambientes não-estacionários, onde a mudança de distribuição temporal pode degradar o desempenho. O framework trata a confiabilidade como um estado dinâmico, abordando a adaptação de implantação como um problema de controle multi-objetivo para equilibrar estabilidade e custo de intervenção.

27
RESEARCHarXiv CS.AI·4/30/2026

Distill-Belief: Closed-Loop Inverse Source Localization and Characterization in Physical Fields

The Distill-Belief framework addresses the challenge of efficient and accurate inverse source localization and characterization (ISLC) for mobile agents by balancing correctness and efficiency. It proposes a teacher-student model, where a Bayes-correct particle filter teacher guides a compact student for fast, uncertainty-aware decision-making in real-time.

27
RESEARCHarXiv CS.AI·4/30/2026

Hierarchical Multi-Persona Induction from User Behavioral Logs: Learning Evidence-Grounded and Truthful Personas

This paper proposes a hierarchical framework to induce multiple evidence-grounded user personas from behavioral logs by clustering intent memories and optimizing persona quality. The method utilizes a groupwise extension of Direct Preference Optimization (DPO) and demonstrates more coherent, truthful personas, also improving future interaction prediction.

27
RESEARCHarXiv CS.AI·4/30/2026

Auto-Relational Reasoning

Researchers propose a novel theoretical framework for automated relational reasoning, integrating Machine Learning with rigid reasoning to surpass the limitations of current large models. The resulting system demonstrates high performance on IQ problems, achieving a 98.03% solving rate without prior knowledge.

27
RESEARCHarXiv CS.LG·4/30/2026

A Randomized PDE Energy driven Iterative Framework for Efficient and Stable PDE Solutions

This work introduces a PDE energy-driven iterative framework for solving partial differential equations efficiently and stably, without relying on traditional matrix-based discretizations or costly data-driven neural network training. It evolves random initial fields through physically constrained diffusion iterations and Gaussian smoothing, strictly enforcing boundary conditions, and demonstrates stable convergence on Poisson, Heat, and viscous Burgers equations.

27
RESEARCHarXiv CS.LG·5/6/2026

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

This survey provides an optimizer-agnostic view of rollout strategies for RL-based post-training of reasoning LLMs. It formalizes rollout pipelines with a unified notation and introduces the Generate-Filter-Control-Replay (GFCR) lifecycle taxonomy, decomposing pipelines into four modular stages.

27