← heapsort-ai

LLMs

722 items

ARTICLEDEV.to AI·4/10/2026

Building Your Own "Google Maps for Codebases": A Guide to Codebase Q&A with LLMs

O artigo aborda o desafio de navegar em bases de código complexas e propõe a construção de um sistema de Q&A com LLMs, similar a um "Google Maps para código", para entender sua estrutura e responder a perguntas. Ele foca no uso de ferramentas open-source para permitir que o leitor passe de usuário a arquiteto dessas soluções de IA.

27
RESEARCHarXiv CS.AI·4/13/2026

StaRPO: Stability-Augmented Reinforcement Policy Optimization

StaRPO is a novel reinforcement learning framework designed to improve the logical consistency and structural coherence of large language models in complex reasoning tasks. It explicitly incorporates stability metrics, such as Autocorrelation Function and Path Efficiency, to evaluate local step-to-step coherence and global goal-directedness of the reasoning process.

27
RESEARCHarXiv CS.LG·4/20/2026

Hallucination as Trajectory Commitment: Causal Evidence for Asymmetric Attractor Dynamics in Transformer Generation

This paper presents causal evidence that hallucination in autoregressive language models is an early trajectory commitment governed by asymmetric attractor dynamics. The research shows that factual and hallucinated trajectories diverge at the very first token, and correcting a hallucinated path requires sustained multi-step intervention, whereas corruption needs less effort.

27
RESEARCHarXiv CS.CL·5/4/2026

Why Do LLMs Struggle in Strategic Play? Broken Links Between Observations, Beliefs, and Actions

Large language models (LLMs) often struggle with strategic decision-making under incomplete information, a problem explored through two fundamental internal gaps. Research reveals an 'observation-belief gap' where LLMs' internal beliefs are accurate but brittle, degrading with complex reasoning and exhibiting biases, and a 'belief-action gap' highlighting the weak conversion of these internal beliefs into effective actions.

27
RESEARCHarXiv CS.LG·4/9/2026

TalkLoRA: Communication-Aware Mixture of Low-Rank Adaptation for Large Language Models

TalkLoRA propõe um framework MoELoRA que aborda a instabilidade de roteamento e a dominância de especialistas em métodos existentes, permitindo a comunicação entre especialistas antes do roteamento. Isso é feito através de um Módulo de Conversação leve, que facilita a troca de informações, gerando um sinal de roteamento mais robusto para Large Language Models (LLMs).

27
RESEARCHarXiv CS.LG·4/22/2026

Compile to Compress: Boosting Formal Theorem Provers by Compiler Outputs

This research introduces a novel learning-to-refine framework to address the prohibitive computational cost of Large Language Models (LLMs) in formal theorem proving. By exploiting compiler outputs that compress diverse proof attempts into structured failure modes, the method enables efficient proof exploration and local error correction, significantly amplifying the reasoning capabilities of base provers.

27
RESEARCHarXiv CS.CL·5/8/2026

One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue

This research tackles the growing threat of hidden malicious intent in multi-turn dialogues with large language models (LLMs), where attackers distribute their harmful objectives across multiple interactions. It proposes an early detection mechanism to identify the turn at which a response could enable harmful action, also introducing the Multi-Turn Intent Dataset (MTID) for training and evaluation.

27
RESEARCHarXiv CS.CL·5/8/2026

When2Speak: A Dataset for Temporal Participation and Turn-Taking in Multi-Party Conversations for Large Language Models

When2Speak is a new synthetic dataset and four-stage generation pipeline designed to teach Large Language Models (LLMs) appropriate intervention timing in multi-party conversations. It addresses the challenge of avoiding excessive interruptions and improving conversational coherence in group interactions.

27