← heapsort-ai

language models

103 items

RESEARCHarXiv CS.AI·8d ago

MindGames Arena Generalization Track: In2AI Solution with Delayed Per-Step Reward Attribution

This research introduces a novel delayed per-step reward attribution method for training language model agents in multi-agent strategic interactions. It addresses the challenge of entangled outcomes by computing rewards at episode end and backpropagating them, enabling stable and sample-efficient reinforcement learning.

27
RESEARCHarXiv CS.CL·27d ago

Correct Answers from Sound Reasoning: Verifiable Process Supervision for Language Models

This paper proposes Verifiable Process Supervision (VPS), a post-training framework to jointly optimize language model prediction accuracy and reasoning quality. VPS uses supervised fine-tuning to induce a structured reasoning format, evaluating intermediate claims against ground-truth signals with adaptive reward weighting.

27
RESEARCHarXiv CS.CL·28d ago

The Bicameral Model: Bidirectional Hidden-State Coupling Between Parallel Language Models

The Bicameral Model couples two frozen, pretrained language models via a trainable neural interface on their intermediate hidden states, allowing them to operate in lockstep. This method enables a primary model to drive a task while an auxiliary model uses tools or solves constraints, significantly improving accuracy on tasks like arithmetic and logic puzzles.

27
RESEARCHarXiv CS.AI·4/8/2026

MMORF: A Multi-agent Framework for Designing Multi-objective Retrosynthesis Planning Systems

Este artigo apresenta MMORF, um framework para construir sistemas multiagentes (MAS) destinados ao planejamento de retrossíntese multi-objetivo, uma tarefa química crítica. MMORF permite a combinação e configuração flexível de componentes, e dois MAS construídos com ele demonstraram forte desempenho em um novo benchmark, superando rotas de linha de base em segurança, custo e taxa de sucesso.

27
RESEARCHarXiv CS.LG·4/6/2026

SIEVE: Sample-Efficient Parametric Learning from Natural Language

SIEVE propõe um método para aprendizado paramétrico com eficiência de amostra a partir de contexto de linguagem natural, necessitando de apenas três exemplos de consulta. Ele emprega uma pipeline de geração de dados sintéticos, SIEVE-GEN, que decompõe o contexto para gerar resultados de maior qualidade e destilar o contexto no modelo.

27
RESEARCHarXiv CS.CL·4/6/2026

Reinforcement Learning-based Knowledge Distillation with LLM-as-a-Judge

Este artigo propõe uma estrutura de Reinforcement Learning (RL) que utiliza um LLM como juiz para gerar recompensas, permitindo a destilação de conhecimento sem a necessidade de rótulos de verdade fundamental. A abordagem demonstra ganhos substanciais de desempenho em benchmarks de raciocínio matemático, sugerindo que avaliadores baseados em LLM podem produzir sinais de treinamento eficazes.

27
RESEARCHarXiv CS.CL·5/6/2026

When Should a Language Model Trust Itself? Same-Model Self-Verification as a Conditional Confidence Signal

This research evaluates same-model self-verification as a confidence signal for selective prediction, comparing it against likelihood-based baselines. The study reveals task- and model-dependent results, showing significant improvements for some models on ARC-Challenge but less reliability and occasional degradation on TruthfulQA-MC.

27
RESEARCHarXiv CS.CL·29d ago

How Much Do Circuits Tell Us? Measuring the Consistency and Specificity of Language Model Circuits

This paper measures the consistency and specificity of language model circuits using edge attribution patching across multiple tasks and models. It finds high within-task circuit reuse that is necessary for performance, but also significant overlap across tasks, indicating circuits are not task-specific.

27