Reasoning

57 items

RESEARCHarXiv CS.AI·4/22/2026

From Natural Language to Executable Narsese: A Neuro-Symbolic Benchmark and Pipeline for Reasoning with NARS

This paper introduces a neuro-symbolic framework for translating natural-language reasoning problems into executable Narsese, leveraging first-order logic. It presents NARS-Reasoning-v0.1, a new benchmark featuring reasoning problems with corresponding formal representations and truth labels for evaluating reasoning capabilities.

LLMs Reasoning Benchmarks Neuro-symbolic AI

ARTICLEDEV.to AI·27d ago

DeepMind’s CEO Says AGI May Be ~4 Years Away. The Last Three Missing Pieces Are Not What Most People Think.

DeepMind CEO Demis Hassabis predicts AGI could arrive around 2030, identifying three critical missing pieces in current AI: continual learning, long-term reasoning, and real memory. He describes today's models as exhibiting "jagged intelligence," with strong peaks alongside brittle failures.

DeepMind AGI Reasoning AI development

DOCDEV.to AI·4/25/2026

Tian AI Thinker: Building a Three-Layer LLM Reasoning Engine

The Tian AI Thinker is the cognitive core of Tian AI, orchestrating a local Qwen2.5-1.5B model through a ThinkerRouter. This router dispatches queries to three distinct reasoning modes (Fast, CoT, and Deep), each optimized for different query types.

AI architecture Qwen2.5 Reasoning LLM

RESEARCHDEV.to AI·17d ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

This research explores the entropy mechanism within reinforcement learning, specifically its application to enhance reasoning capabilities in language models. It investigates how entropy can be leveraged to improve the learning process and decision-making for more robust language model reasoning.

language models reinforcement learning learning Reasoning

ARTICLEDEV.to AI·19d ago

Apple Paper Argues LLMs Show 'Illusion of Thinking'

An Apple paper titled "The Illusion of Thinking" argues that Large Language Models (LLMs) lack genuine reasoning, relying only on sophisticated statistical pattern matching. Led by Mehrdad Farajtabar, the study criticizes claims from vendors like GPT-4 and Claude, highlighting failures in formal reasoning tasks requiring compositionality.

Apple machine learning Reasoning AI

RESEARCHarXiv CS.LG·4/15/2026

When Reasoning Models Hurt Behavioral Simulation: A Solver-Sampler Mismatch in Multi-Agent LLM Negotiation

This paper investigates how enhanced reasoning in language models can harm the fidelity of behavioral simulations, particularly when the goal is to sample boundedly rational behavior rather than solve a strategic problem. The authors identify a "solver-sampler mismatch" where LLMs over-optimize, collapsing compromise-oriented behavior and leading to diversity without fidelity in outcomes.

LLMs Strategic Negotiation Behavioral Simulation Reasoning

RESEARCHarXiv CS.CL·4/15/2026

Think Through Uncertainty: Improving Long-Form Generation Factuality via Reasoning Calibration

This research introduces CURE, a novel framework designed to improve the factuality of long-form generation by LLMs by teaching them to reason about uncertainty at the claim level. It aims to overcome the limitation of models often stating incorrect claims confidently, focusing instead on granular uncertainty calibration.

LLMs hallucination uncertainty calibration Reasoning

RESEARCHarXiv CS.LG·4/14/2026

Deliberative Alignment is Deep, but Uncertainty Remains: Inference time safety improvement in reasoning via attribution of unsafe behavior to base model

This research investigates Deliberative Alignment in LLMs, a method designed to improve safety by distilling reasoning capabilities from stronger models. It uncovers an alignment gap between teacher and student models, showing that student models can retain unsafe behaviors from the base model despite learning advanced reasoning patterns. The paper proposes a BoN sampling method to address these challenges.

Model Alignment LLMs Deliberative Alignment Reasoning

RESEARCHarXiv CS.CL·5/5/2026

DIAGRAMS: A Review Framework for Reasoning-Level Attribution in Diagram QA

DIAGRAMS is a review framework for reasoning-level attribution in Diagram Question Answering (Diagram QA). It decouples interface logic from dataset-specific formats via a meta-schema and adapters, facilitating evidence selection and generation.

attribution Diagram QA machine learning computer vision

RESEARCHarXiv CS.AI·5/9/2026

BALAR : A Bayesian Agentic Loop for Active Reasoning

This paper introduces BALAR (Bayesian Agentic Loop for Active Reasoning), a task-agnostic outer-loop algorithm enabling structured multi-turn interaction between an LLM agent and a user. BALAR maintains a structured belief over latent states, selects clarifying questions by maximizing expected mutual information, and significantly outperforms baselines across diverse reasoning benchmarks.

LLMs interactive AI Reasoning Bayesian models

RESEARCHarXiv CS.LG·4/27/2026

Universal Transformers Need Memory: Depth-State Trade-offs in Adaptive Recursive Reasoning

This research investigates the necessity of learned memory tokens as a computational scratchpad for Universal Transformers with Adaptive Computation Time (ACT) on a combinatorial reasoning benchmark, Sudoku-Extreme. It finds that memory tokens are empirically necessary for non-trivial performance, identifying a sharp lower threshold for optimal count and a common router initialization trap.

neural networks deep learning memory Reasoning

RESEARCHarXiv CS.LG·4/9/2026

RAGEN-2: Reasoning Collapse in Agentic RL

Este estudo introduz o conceito de 'colapso de template', uma falha em agentes LLM de múltiplas interações onde a resposta se torna agnóstica à entrada, mesmo com entropia estável. Propõe a Informação Mútua (MI) como uma métrica superior à entropia para diagnosticar a qualidade do raciocínio, correlacionando-se mais fortemente com o desempenho final.

LLMs reinforcement learning Reasoning Evaluation Metrics

RESEARCHarXiv CS.AI·4/30/2026

Grounding vs. Compositionality: On the Non-Complementarity of Reasoning in Neuro-Symbolic Systems

This work challenges the assumption that compositional reasoning emerges as a byproduct of symbol grounding in neuro-symbolic AI. It introduces the $i$LTN architecture, demonstrating that models trained solely on a grounding objective fail to generalize, while joint training on perceptual grounding and multi-step reasoning is crucial.

Compositional Generalization Reasoning AI Architectures Symbol Grounding

RESEARCHarXiv CS.CL·4/27/2026

Incentivizing Neuro-symbolic Language-based Reasoning in VLMs via Reinforcement Learning

This work explores neuro-symbolic language reasoning in VLMs, leveraging Reinforcement Learning to improve analytical abilities and efficiency. It achieved a 3.33% accuracy increase on a vision-language evaluation dataset while reducing reasoning tokens by 75%.

Vision-Language Models reinforcement learning Reasoning Neuro-symbolic AI

RESEARCHarXiv CS.CL·4/8/2026

TDA-RC: Task-Driven Alignment for Knowledge-Based Reasoning Chains in Large Language Models

Este artigo propõe um método baseado em topologia para otimizar cadeias de raciocínio em LLMs, visando superar lacunas lógicas e custos elevados. Ele quantifica características estruturais de CoT, ToT e GoT usando homologia persistente para aprimorar o paradigma CoT.

LLMs Chain-of-Thought Reasoning Tree-of-Thoughts

RESEARCHarXiv CS.AI·24d ago

Enhanced and Efficient Reasoning in Large Learning Models

This paper proposes an efficient and principled method to enhance reasoning in Large Language Models, addressing the current lack of trustworthiness in produced content. It involves a preprocessing stage using a Unary Relational Integracode followed by a streamlined machine learning process.

model efficiency machine learning Reasoning data preprocessing

RESEARCHarXiv CS.CL·4/24/2026

TRACES: Tagging Reasoning Steps for Adaptive Cost-Efficient Early-Stopping

This paper introduces TRACES, a lightweight framework designed to optimize Language Reasoning Models (LRMs) by tagging reasoning steps in real-time. It enables adaptive, cost-efficient early stopping of LRM inferences, addressing their current inefficiency and over-generation of verification steps.

LLMs early stopping Reasoning Inference Optimization

RESEARCHarXiv CS.AI·17d ago

MindLoom: Composing Thought Modes for Frontier-Level Reasoning Data Synthesis

MindLoom is a framework for synthesizing frontier-level reasoning data, addressing the challenge of limited diversity and unstable difficulty in existing methods. It achieves this by decomposing problem solutions into "thought mode chains" and training a retrieval model to guide the reasoning process.

data synthesis Thought Modes LLMs AI frameworks

RESEARCHarXiv CS.CL·5/7/2026

Adapt to Thrive! Adaptive Power-Mean Policy Optimization for Improved LLM Reasoning

This research introduces Adaptive Power-Mean Policy Optimization (APMPO) to improve Large Language Model (LLM) reasoning capabilities within Reinforcement Learning with Verifiable Rewards (RLVR). APMPO combines a generalized power-mean objective and feedback-adaptive clipping to enhance learning dynamics and performance, addressing limitations of static optimization schemes.

Policy optimization LLMs reinforcement learning machine learning

RESEARCHarXiv CS.CL·5/7/2026

Free Energy-Driven Reinforcement Learning with Adaptive Advantage Shaping for Unsupervised Reasoning in LLMs

FREIA is a novel reinforcement learning algorithm designed to enhance LLMs for unsupervised reasoning, addressing the lack of adaptability in existing methods. It employs Free Energy-Driven Reward (FER) to balance consensus and exploration, and Adaptive Advantage Shaping (AAS) to adjust learning signals. FREIA outperforms unsupervised baselines across various reasoning tasks, particularly in mathematical reasoning.

LLMs reinforcement learning AI algorithms Reasoning