NLP

124 items

ARTICLE · DEV.to AI · 4/24/2026

Bringing it to Life: The Real-Time Inference Engine (Part 3)

Part 3 of this series details the real-time inference engine for an ASL-to-voice project, addressing the challenge of processing an unbounded webcam stream. It explains the Sliding Window architecture for decoding body keypoints into sign language glosses, and the use of LLMs to generate spoken English.

sign-language machine learning computer vision NLP
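
As a rough illustration of the sliding-window idea named in the summary above, the sketch below keeps only the most recent frames of an unbounded stream and emits a fixed-size window every few frames. The function name `sliding_windows`, the window size, and the stride are assumptions made for illustration; the article's own engine may buffer and trigger decoding differently.

```python
from collections import deque
from typing import Deque, Iterable, Iterator, List, TypeVar

T = TypeVar("T")


def sliding_windows(stream: Iterable[T], size: int, stride: int) -> Iterator[List[T]]:
    """Yield windows of `size` items; a new window is emitted every `stride` items.

    Hypothetical helper: intended for stride <= size, so consecutive windows overlap.
    """
    if size <= 0 or stride <= 0:
        raise ValueError("size and stride must be positive")
    buffer: Deque[T] = deque(maxlen=size)  # oldest frames drop off automatically
    since_last = 0  # frames seen since the last emitted window
    for frame in stream:
        buffer.append(frame)
        since_last += 1
        # Emit once the buffer is full and enough new frames have arrived.
        if len(buffer) == size and since_last >= stride:
            yield list(buffer)
            since_last = 0


# Usage: integers stand in for per-frame keypoint vectors.
windows = list(sliding_windows(range(6), size=4, stride=2))
```

Because the buffer has a fixed maximum length, memory use stays constant no matter how long the stream runs.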

RESEARCH · DEV.to AI · 24d ago

Efficient 8-Bit Quantization of Transformer Neural Machine Language Translation Model

This paper discusses efficient 8-bit quantization for Transformer neural machine language translation models. The goal is to optimize the performance and efficiency of these models by reducing memory consumption and latency.

AI models efficiency NLP quantization
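
To make the memory saving concrete, here is a minimal sketch of generic symmetric 8-bit quantization: floats are mapped to integer codes in [-127, 127] with one shared scale, then mapped back. This is a textbook scheme chosen for illustration, not the calibration method of the paper; `quantize_int8` and `dequantize_int8` are hypothetical names.

```python
from typing import List, Tuple


def quantize_int8(values: List[float]) -> Tuple[List[int], float]:
    """Map floats to integer codes in [-127, 127] using a single shared scale."""
    max_abs = max((abs(v) for v in values), default=0.0)
    # Assumption: one scale per tensor; an all-zero tensor gets scale 1.0.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize_int8(codes: List[int], scale: float) -> List[float]:
    """Recover approximate floats from the integer codes."""
    return [c * scale for c in codes]


# Usage: each weight now fits in one byte instead of four (float32).
weights = [-1.0, 0.0, 0.5, 1.0]
codes, scale = quantize_int8(weights)
restored = dequantize_int8(codes, scale)
```

The round trip is lossy: each restored value can differ from the original by up to about half of `scale`.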

ARTICLE · DEV.to AI · 5/9/2026

Your RAG can't answer 'why' -- GraphRAG finds what vector search misses

This article explores the limitations of conventional RAG (Retrieval-Augmented Generation) systems, which struggle to answer 'why' questions because vector search only finds similar documents, not related ones. It introduces GraphRAG as a solution to overcome this 'structural ceiling' by connecting the dots between pieces of information. The author shares a personal anecdote about realizing this architectural bottleneck after failed attempts to rewrite prompts.

AI architecture GraphRAG RAG NLP
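
The difference between "similar" and "related" in the summary above can be sketched with a small graph walk: starting from the entities a query mentions, follow edges for a few hops to collect connected facts that a similarity search alone would not surface. This illustrates only the graph-expansion idea, not the GraphRAG pipeline itself; the graph contents and the `related_nodes` name are made up for the example.

```python
from collections import deque
from typing import Dict, List, Set


def related_nodes(graph: Dict[str, List[str]], seeds: List[str], max_hops: int) -> List[str]:
    """Breadth-first expansion from the seed nodes, up to `max_hops` edges away."""
    order: List[str] = list(dict.fromkeys(seeds))  # de-duplicate, keep order
    seen: Set[str] = set(order)
    queue = deque((seed, 0) for seed in order)
    while queue:
        node, hops = queue.popleft()
        if hops == max_hops:
            continue  # do not expand beyond the hop limit
        for neighbor in graph.get(node, []):
            if neighbor not in seen:
                seen.add(neighbor)
                order.append(neighbor)
                queue.append((neighbor, hops + 1))
    return order


# Usage: a hypothetical chain of causes behind an incident.
toy_graph = {
    "outage": ["deploy"],
    "deploy": ["config change"],
    "config change": [],
}
chain = related_nodes(toy_graph, ["outage"], max_hops=2)
```

With `max_hops=2` the walk reaches "config change", which shares no wording with "outage" but is connected to it through "deploy".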

ARTICLE · DEV.to AI · 4/26/2026

I Made Two AI Models Read My Git Commits. It Got Uncomfortably Personal.

The author describes an experiment where two AI models analyzed Git commit messages to determine developer mood, resulting in surprisingly personal insights. The challenge, a blind duel between Gemini 2.5 Flash and a custom AI, revealed the depth of the tools' analysis.

AI models privacy NLP sentiment analysis

DOC · DEV.to AI · 4/20/2026

How to integrate DeepSeek R1 into your React app

This comprehensive guide details the integration of DeepSeek R1, an AI-driven natural language processing API, into React applications, providing steps and best practices. It covers prerequisites and communication via HTTP requests using Axios.

React NLP DeepSeek R1 API Integration

RESEARCH · Hugging Face Blog · 26d ago

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

Granite Embedding Multilingual R2 is a new open-source multilingual embedding model released under the Apache 2.0 license. It claims the best retrieval quality among models under 100M parameters, supporting a 32K context.

Open Source AI models Benchmarking NLP

ARTICLE · DEV.to AI · 4/27/2026

Intelligent Automation Explained: A Beginner's Guide to the Future of Work

Intelligent Automation is a transformative concept combining artificial intelligence with process automation, enabling systems to learn, adapt, and improve over time. It represents the convergence of RPA, machine learning, natural language processing, and cognitive technologies to optimize performance in modern business.

future-of-work machine learning NLP AI

RESEARCH · DEV.to AI · 4/19/2026

Evaluation of Retrieval-Augmented Generation: A Survey

This survey evaluates Retrieval-Augmented Generation (RAG), analyzing its current state, architectures, and performance metrics. It provides a comprehensive overview of existing RAG techniques and their applications.

Survey evaluation RAG NLP

DOC · DEV.to AI · 15d ago

Building a RAG System in Practice (v23)

This is a practical guide (v23) for ML engineers on implementing RAG systems. It details the RAG loop (retrieval, augmentation, generation) and includes a Python example for semantic chunking using sentence_transformers.

learning RAG machine learning NLP
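
The retrieval, augmentation, and generation loop described in the summary above can be sketched in a few lines. To stay dependency-free, this version scores chunks by shared words instead of the embedding similarity a real system would use, and `generate` is a placeholder for whatever model call is available; all function names here are hypothetical.

```python
from typing import Callable, List


def retrieve(query: str, chunks: List[str], k: int = 2) -> List[str]:
    """Rank chunks by word overlap with the query (stand-in for embedding similarity)."""
    query_words = set(query.lower().split())
    ranked = sorted(
        chunks,
        key=lambda chunk: len(query_words & set(chunk.lower().split())),
        reverse=True,
    )
    return ranked[:k]


def augment(query: str, context: List[str]) -> str:
    """Build a prompt that places the retrieved chunks ahead of the question."""
    joined = "\n".join(f"- {chunk}" for chunk in context)
    return f"Context:\n{joined}\n\nQuestion: {query}"


def rag_answer(query: str, chunks: List[str], generate: Callable[[str], str], k: int = 2) -> str:
    """Run one pass of the loop: retrieve, augment, then generate."""
    return generate(augment(query, retrieve(query, chunks, k)))


# Usage: the lambda echoes the prompt in place of a real model call.
docs = ["cats sleep a lot", "the moon orbits the earth", "dogs bark loudly"]
prompt_seen = rag_answer("what orbits the earth", docs, generate=lambda p: p, k=1)
```

Swapping the overlap score for cosine similarity over sentence embeddings changes only `retrieve`; the rest of the loop stays the same.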

DOC · DEV.to AI · 24d ago

83. HuggingFace: Your Library for Every Pretrained Model

This piece shows how HuggingFace makes practical NLP accessible through its libraries and Model Hub, and demonstrates how little code is needed to use pretrained models for tasks like sentiment analysis.

learning machine learning NLP HuggingFace

RESEARCH · arXiv CS.CL · 5/1/2026

Semantic Structure of Feature Space in Large Language Models

This study demonstrates that the geometric relationships between semantic features in large language models' hidden states closely mirror human psychological associations. It shows that word vector projections onto semantic axes correlate with human ratings, and the similarity between these axes predicts the interconnections of semantic scales.

LLMs cognitive science semantic representation NLP
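
A small sketch of what "projecting a word vector onto a semantic axis" can mean: the axis is taken as the difference between two pole vectors, and the projection is the word vector's signed length along that direction. Defining the axis from a pair of poles is an assumption made for this example, and the two-dimensional vectors are invented; the study's own construction may differ.

```python
import math
from typing import List, Sequence


def semantic_axis(positive_pole: Sequence[float], negative_pole: Sequence[float]) -> List[float]:
    """Direction pointing from the negative pole toward the positive pole."""
    return [p - n for p, n in zip(positive_pole, negative_pole)]


def project(vector: Sequence[float], axis: Sequence[float]) -> float:
    """Signed length of `vector` along `axis` (scalar projection)."""
    norm = math.sqrt(sum(a * a for a in axis))
    if norm == 0:
        raise ValueError("axis must be non-zero")
    return sum(v * a for v, a in zip(vector, axis)) / norm


# Usage with invented 2-D vectors: the axis runs along the first dimension.
axis = semantic_axis([1.0, 0.0], [-1.0, 0.0])
score = project([0.5, 3.0], axis)
```

Scores computed this way for many words are the kind of quantity that could then be correlated with human ratings on the same scale.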

RESEARCH · arXiv CS.CL · 4/30/2026

Analysing Lightweight Large Language Models for Biomedical Named Entity Recognition on Diverse Output Formats

This research explores the use of lightweight Large Language Models (LLMs) for Biomedical Named Entity Recognition, demonstrating their competitive performance against larger models. The study highlights their potential as resource-efficient alternatives and identifies specific output formats that consistently improve performance.

LLMs named entity recognition Model Evaluation NLP

RESEARCH · arXiv CS.CL · 4/17/2026

SeaAlert: Critical Information Extraction From Maritime Distress Communications with Large Language Models

SeaAlert is an LLM-based framework designed for the robust analysis of maritime distress communications, which are challenging due to noise, deviations from format, and ASR errors. To overcome the lack of real-world labeled data, the framework utilizes an LLM-powered synthetic data generation pipeline.

synthetic data Information Extraction NLP Speech Recognition

RESEARCH · arXiv CS.CL · 4/16/2026

WorkRB: A Community-Driven Evaluation Framework for AI in the Work Domain

WorkRB is the first open-source, community-driven benchmark for AI in the work domain, addressing research fragmentation and employment data sensitivity. It unifies 13 diverse tasks from 7 groups, framing them as recommendation and NLP tasks such as job/skill recommendation and skill extraction.

hiring future-of-work recommender systems NLP

RESEARCH · arXiv CS.CL · 19d ago

Under Pressure: Emotional Framing Induces Measurable Behavioral Shifts and Structured Internal Geometry in Small Language Models

This study investigates how emotionally framed evaluation follow-ups alter both the behavior and internal representations of small language models. Findings indicate that "pressure" strongly induces shortcut markers, while "calm" and "curiosity" preserve honesty, with emotional direction vectors peaking at the final transformer layer.

NLP model behavior emotional framing AI Research

RESEARCH · arXiv CS.CL · 19d ago

Pseudo-Siamese Network for Planning in Target-Oriented Proactive Dialogues

The paper proposes a Forward-Focused Bidirectional Pseudo-Siamese Network (FF-BPSN) for planning dialogue paths in target-oriented proactive dialogue systems. This network uses identical transformer-based decoders for bidirectional planning and integrates information to construct a forward path, guiding language models in response generation.

transformer networks deep learning NLP AI

RESEARCH · arXiv CS.CL · 4/24/2026

DWTSumm: Discrete Wavelet Transform for Document Summarization

This research proposes a Discrete Wavelet Transform (DWT)-based framework to enhance document summarization, particularly for long, domain-specific texts where LLMs struggle. The method creates compact representations that improve semantic similarity, grounding, and factual consistency compared to a GPT-4o baseline.

LLMs wavelet transform NLP Document Summarization
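
For readers unfamiliar with the transform named in the summary above, here is one level of the simplest discrete wavelet transform, the Haar wavelet in its unnormalized average/difference form. It halves a signal into a coarse "approximation" and a "detail" part, which is how a compact representation arises. Which wavelet family the paper uses, and what signal it transforms, are not stated in the summary, so the choice of Haar and the numeric input are assumptions.

```python
from typing import List, Tuple


def haar_step(signal: List[float]) -> Tuple[List[float], List[float]]:
    """One Haar level: pairwise averages (approximation) and half-differences (detail)."""
    if len(signal) % 2 != 0:
        raise ValueError("signal length must be even")
    pairs = list(zip(signal[0::2], signal[1::2]))
    approximation = [(a + b) / 2 for a, b in pairs]
    detail = [(a - b) / 2 for a, b in pairs]
    return approximation, detail


def haar_inverse(approximation: List[float], detail: List[float]) -> List[float]:
    """Rebuild the original signal from one level of coefficients."""
    signal: List[float] = []
    for avg, diff in zip(approximation, detail):
        signal.extend([avg + diff, avg - diff])
    return signal


# Usage: a hypothetical sequence of per-sentence scores.
scores = [4.0, 6.0, 10.0, 12.0]
approx, det = haar_step(scores)
```

Keeping only `approx` gives a representation half as long that preserves the overall trend; keeping `det` as well allows exact reconstruction.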

RESEARCH · arXiv CS.CL · 29d ago

Reflections and New Directions for Human-Centered Large Language Models

This work introduces a framework for Human-Centered Large Language Models (HCLLMs), integrating perspectives from NLP, HCI, and responsible AI. It argues for prioritizing human concerns, preferences, and values rigorously at every stage of LLM development, rather than as a mere post-training consideration.

LLMs HCI NLP AI ethics

RESEARCH · arXiv CS.CL · 20d ago

The Annotation Scarcity Paradox in Low-Resource NLP Evaluation: A Decade of Acceleration and Emerging Constraints

Low-resource natural language processing has experienced explosive growth, but its evaluation faces a critical challenge: the scarcity of sociolinguistic expertise needed to assess complex generative systems. This creates an "Annotation Scarcity Paradox," where the technical capacity to scale models vastly outpaces the human infrastructure required for authentic evaluation.

machine learning NLP Low-resource languages AI evaluation

RESEARCH · arXiv CS.CL · 7d ago

AEyeDE: An Attention-Based Attribution Framework for AI-Generated Text Detection

This paper introduces AEyeDE, an attention-driven framework for human-AI authorship detection that leverages model attention as a discriminative signal. The method consistently outperforms text-only baselines and shows robustness across various text generation settings, remaining competitive on standard benchmarks.

AI detection machine learning NLP attention mechanisms