← heapsort-ai

NLP

124 items

ARTICLEDEV.to AI·5/9/2026

Your RAG can't answer 'why' -- GraphRAG finds what vector search misses

This article explores the limitations of conventional RAG (Retrieval-Augmented Generation) systems, which struggle to answer 'why' questions because vector search only finds similar documents, not related ones. It introduces GraphRAG as a solution to overcome this 'structural ceiling' by connecting the dots between pieces of information. The author shares a personal anecdote about realizing this architectural bottleneck after failed attempts to rewrite prompts.

27
RESEARCHarXiv CS.CL·4/30/2026

Analysing Lightweight Large Language Models for Biomedical Named Entity Recognition on Diverse Ouput Formats

This research explores the use of lightweight Large Language Models (LLMs) for Biomedical Named Entity Recognition, demonstrating their competitive performance against larger models. The study highlights their potential as resource-efficient alternatives and identifies specific output formats that consistently improve performance.

27
RESEARCHarXiv CS.CL·19d ago

Under Pressure: Emotional Framing Induces Measurable Behavioral Shifts and Structured Internal Geometry in Small Language Models

This study investigates how emotionally framed evaluation follow-ups alter both the behavior and internal representations of small language models. Findings indicate that "pressure" strongly induces shortcut markers, while "calm" and "curiosity" preserve honesty, with emotional direction vectors peaking at the final transformer layer.

27
RESEARCHarXiv CS.CL·20d ago

The Annotation Scarcity Paradox in Low-Resource NLP Evaluation: A Decade of Acceleration and Emerging Constraints

Low-resource natural language processing has experienced explosive growth, but its evaluation faces a critical challenge: the scarcity of sociolinguistic expertise needed to assess complex generative systems. This creates an "Annotation Scarcity Paradox," where the technical capacity to scale models vastly outpaces the human infrastructure required for authentic evaluation.

27