← heapsort-ai

AI Research

146 items

ARTICLE↑ trendingReddit r/MachineLearning·4/16/2026

Camera-ready paranoia [D]

A user expresses "camera-ready paranoia" after submitting their paper to CVPRW, fearing rejection due to potential errors despite having used a PDF validation tool and the correct template. They are seeking confirmation on when the paper will be placed in the proceedings, noting its current status as "In production".

29
RESEARCHarXiv CS.CL·4/6/2026

PolyJarvis: LLM Agent for Autonomous Polymer MD Simulations

PolyJarvis é um agente LLM que automatiza simulações de dinâmica molecular de polímeros para prever propriedades a partir de linguagem natural, utilizando a plataforma RadonPy. O sistema executa tarefas desde a construção do monômero até o cálculo de propriedades, mostrando previsões precisas de densidade e módulos de elasticidade para polímeros como aPS e PMMA.

29
RESEARCHarXiv CS.CL·5d ago

Discourse-Role Labels as Presentation-Time Variables for Context Use in Language Models

This study investigates the effect of discourse-role labels, such as "Reference" or "Instruction," on language model behavior. It reveals that the adoption rate of misleading information can shift significantly (56-84 percentage points) depending on the label, with labels like "Instruction" increasing adoption and "Example" consistently suppressing it.

28
RESEARCHDEV.to AI·4/17/2026

Logical Neural Networks

Logical Neural Networks represent a research area that seeks to integrate symbolic reasoning with neural network pattern recognition. This field explores how to combine explicit knowledge representation and logical inference with the learning capabilities of connectionist models.

28
RESEARCHarXiv CS.CL·4/20/2026

Brain Score Tracks Shared Properties of Languages: Evidence from Many Natural Languages and Structured Sequences

This research investigates the similarity between language models' processing and human language processing using the Brain Score framework. Findings suggest LMs trained on diverse natural languages and even structured data (human genome, Python) show similar Brain Score performance, indicating the metric captures the ability to extract common structure.

28
RESEARCHarXiv CS.CL·5/4/2026

Timing is Everything: Temporal Scaffolding of Semantic Surprise in Humor

This research proposes the Dual Prediction Violation (DPV) framework to explain humor, emphasizing the interplay between content and timing. Analyzing 828 Chinese stand-up performances, it shows that temporal features, particularly peak semantic violations and systematic pauses, significantly predict audience appreciation more than semantic incongruity alone.

28
RESEARCHarXiv CS.LG·21d ago

A Structural Threshold in Decision Capacity Governs Collapse in Self-Play Reinforcement Learning

This paper shows that a threshold in decision capacity governs collapse in self-play reinforcement learning agents under asymmetric rule perturbations. Eliminating all positive-reach contingent decisions causes rapid convergence to a deterministic exploitation attractor, while preserving even a single such decision prevents this collapse.

28
RESEARCHarXiv CS.CL·5d ago

When Retrieval Doesn't Help: A Large-Scale Study of Biomedical RAG

A large-scale study re-evaluates Retrieval-Augmented Generation (RAG) in medical question answering, finding only small and inconsistent improvements over no-retrieval baselines. It suggests that the choice of the backbone model is more critical than retrieval methods, and the main bottleneck is the model's ability to effectively use retrieved evidence.

28
RESEARCHarXiv CS.CL·5d ago

SaliMory: Orchestrating Cognitive Memory for Conversational Agents

SALIMORY is a framework that trains a single language model to manage cognitively-structured memory for conversational agents, addressing issues with existing memory expansion and reinforcement learning methods. It achieves this through a hierarchical stage-wise process reward and reward-decomposed contrastive refinement, significantly improving accuracy and personalization while reducing memory-attributed failures.

28
RESEARCHarXiv CS.LG·27d ago

Steering Without Breaking: Mechanistically Informed Interventions for Discrete Diffusion Language Models

This paper investigates the limitations of uniform interventions in discrete diffusion language models (DLMs), demonstrating they degrade controlled generation quality. The authors find that different attributes commit at distinct stages of the denoising process, proposing an adaptive scheduler to concentrate interventions efficiently.

28
RESEARCHarXiv CS.CL·12d ago

From AR to Diffusion: Efficiently Adapting Large Language Models with Strictly Causal and Elastic Horizons

FLUID is a new framework designed to efficiently adapt Autoregressive (AR) backbones to the diffusion paradigm for parallel text generation. It enables initialization from GPT-style models and introduces an entropy-driven mechanism called Elastic Horizons, achieving state-of-the-art performance with significantly reduced training costs.

28
RESEARCHarXiv CS.LG·15d ago

Reading Calibrated Uncertainty from Language Model Trajectories

This research paper proposes a new method to quantify uncertainty in language models by tracing the cumulative path of per-layer MLP updates. By extracting eleven scale-invariant geometric features, a sparse linear probe is shown to outperform maximum softmax probability in evaluating uncertainty, especially with baseline miscalibration.

28
RESEARCHarXiv CS.CL·4/20/2026

DALM: A Domain-Algebraic Language Model via Three-Phase Structured Generation

DALM (Domain-Algebraic Language Model) is proposed to address knowledge interference in LLMs by replacing unconstrained generation with structured denoising over a domain lattice. It uses a three-phase generation path (domain, relation, concept uncertainty) under algebraic constraints, requiring a domain lattice, relation typing, and fiber partition to prevent cross-domain contamination.

27