← heapsort-ai

Natural Language Processing

168 items

RESEARCHarXiv CS.CL·7d ago

CSRP: Chain-of-Thought Reasoning for Chinese Text Correction via Reinforcement Learning with Efficiency-Aware Rewards

This paper proposes CSRP, a three-stage framework for Chinese Grammatical Error Correction (CGEC) using Large Language Models (LLMs). CSRP addresses challenges of general-purpose models and metric optimization with continual pre-training, Chain-of-Thought SFT, and policy optimization with efficiency-aware rewards that penalize unnecessary edits, achieving state-of-the-art performance on the NACGEC benchmark.

27
RESEARCHarXiv CS.CL·25d ago

Merging Methods for Multilingual Knowledge Editing for Large Language Models: An Empirical Odyssey

This paper investigates the effectiveness of vector merging methods for multilingual knowledge editing (MKE) in Large Language Models, focusing on reducing interference between language-specific edits. Evaluating six merging variants across two LLMs, two editing methods, and 12 languages on the MzsRE benchmark, it finds vector summation with shared covariance to be the most reliable overall strategy.

27
RESEARCHarXiv CS.CL·26d ago

TimelineReasoner: Advancing Timeline Summarization with Large Reasoning Models

TimelineReasoner is a novel framework that leverages Large Reasoning Models (LRMs) to advance timeline summarization, moving beyond passive Large Language Model (LLM) generation. It employs a two-stage, reasoning-driven process—Global Cognition and Detail Exploration—to actively extract and refine structured timelines from unstructured online news content.

27
RESEARCHarXiv CS.CL·22d ago

DiscoExplorer: An Open Interface for the Study of Multilingual Discourse Relations

DiscoExplorer introduces an open-source web interface designed to facilitate the study and cross-linguistic comparison of discourse relations across 16 languages. This tool addresses the complexity of relevant data and the lack of accessible interfaces in computational linguistics and pragmatics by providing query, search, and visualization features.

27
RESEARCHarXiv CS.AI·26d ago

State-Centric Decision Process

The State-Centric Decision Process (SDP) is a new framework addressing the lack of runtime structure in language environments, such as web browsers, which emit raw text instead of states. It enables an agent to construct missing MDP inputs, like state space and certified transitions, by taking actions and checking observations against natural-language predicates.

27
RESEARCHarXiv CS.AI·15d ago

PathCal: State-Aware Reflection-Marker Calibration for Efficient Reasoning

This research paper introduces 'PathCal', investigating the distinct functional roles and timing of reflection markers in Large Reasoning Language Models' Chain-of-Thought trajectories. It reveals that markers like 'wait' or 'but' differ significantly in their impact on accuracy and generation length, challenging previous coarse-grained approaches.

27
RESEARCHarXiv CS.CL·15d ago

Query-Adaptive Semantic Chunking for Retrieval-Augmented Generation: A Dynamic Strategy with Contextual Window Expansion

This paper introduces Query-Adaptive Semantic Chunking (QASC), a dynamic strategy for Retrieval-Augmented Generation (RAG) systems that integrates user queries into document segmentation. QASC employs cosine similarity scoring, contextual window expansion, and chunk-level score aggregation to optimize context retrieval, addressing limitations of fixed chunking methods.

27
RESEARCHarXiv CS.CL·6d ago

Linear Probes Detect Task Format, Not Reasoning Mode in Language Model Hidden States

This paper reveals that linear probes, often used to identify distinct reasoning representations in LLM hidden states, actually detect task format rather than reasoning modes. High accuracy observed on benchmarks with Qwen3-14B vanished when controlling for format variables, suggesting largely shared reasoning not functionally linked to hidden state geometry.

27
RESEARCHarXiv CS.CL·8d ago

When English Rewrites Local Knowledge: Global Narrative Dominance in Large Language Models

This research paper investigates global narrative dominance in Large Language Models (LLMs), where local cultural knowledge is often overshadowed by global narratives. It introduces the CulturalNB dataset for Bengali cultural contexts and demonstrates that questions asked in English tend to increase global substitution and institutional framing, reducing local perspective coverage.

27
RESEARCHarXiv CS.AI·15d ago

NeuroNL2LTL: A Neurosymbolic Framework for Natural Language Translation of Linear Temporal Logic

NeuroNL2LTL is a neurosymbolic architecture that unifies learned translation with formal verification to translate natural language into Linear Temporal Logic. It employs verifier-in-the-loop training, where verification outcomes serve as reward signals for reinforcement learning, optimizing for formal correctness.

27
ARTICLEDEV.to AI·26d ago

Helping ChatGPT better recognize context in sensitive conversations

This technical analysis explores enhancing ChatGPT's ability to recognize context in sensitive conversations, which is crucial for accurate and empathetic responses. It highlights current limitations such as a lack of domain-specific knowledge and insufficient understanding of nuances, aiming to find technical solutions for these challenges.

27