NLP

124 items

RESEARCHarXiv CS.CL·5d ago

Discourse-Role Labels as Presentation-Time Variables for Context Use in Language Models

This study investigates the effect of discourse-role labels, such as "Reference" or "Instruction," on language model behavior. It reveals that the adoption rate of misleading information can shift significantly (56-84 percentage points) depending on the label, with labels like "Instruction" increasing adoption and "Example" consistently suppressing it.

language models Context NLP model behavior

RESEARCHarXiv CS.CL·5d ago

ACAT: A Collaborative Platform for Efficient Aspect-Based Sentiment Dataset Annotation

This paper introduces ACAT, a collaborative web-based platform designed for efficient annotation of Aspect-Based Sentiment Analysis (ABSA) datasets. It features an automated ETL pipeline that aligns collaborative annotations and computes Inter-Annotator Agreement metrics, yielding training-ready datasets for four ABSA workflows.

NLP Dataset Annotation sentiment analysis AI tools

ARTICLEDEV.to AI·4/10/2026

"Attention Is All You Need" Paper tahun 2017 yang mengubah dunia kecerdasan buatan, dijelaskan tanpa perlu latar belakang teknis.

O artigo explora a importância do paper 'Attention Is All You Need' de 2017, que revolucionou a IA ao introduzir a arquitetura Transformer, base de modelos como ChatGPT. Ele detalha como essa inovação superou as limitações das redes neurais recorrentes, permitindo que computadores compreendam e gerem linguagem humana com maior eficiência.

Attention Is All You Need Transformer ChatGPT NLP

ARTICLEDEV.to AI·18d ago

Playing with Words at the National Library of Sweden -- Making a Swedish BERT

The article discusses the process of creating a BERT model for the Swedish language, a project developed at the National Library of Sweden. The aim is to enhance natural language processing for Swedish.

language models BERT NLP National Library

DOCDEV.to AI·23d ago

Loova Agents

Loova Agents is a conversational AI platform designed for automating customer support and engagement, leveraging machine learning and natural language processing. Its microservices architecture includes key components like an NLP Engine for understanding customer input and Dialogue Management for crafting appropriate responses.

microservices machine learning NLP customer support

RESEARCHarXiv CS.CL·4/9/2026

Depression Detection at the Point of Care: Automated Analysis of Linguistic Signals from Routine Primary Care Encounters

Esta pesquisa explora a detecção automatizada de depressão em consultas de atenção primária, analisando sinais linguísticos de áudios gravados. O estudo compara modelos de IA como GPT-OSS, Sentence-BERT e LIWC+LR, destacando o melhor desempenho do GPT-OSS e a importância das transcrições conjuntas entre médico e paciente.

depression detection Primary Care machine learning NLP

ARTICLEDEV.to AI·4/10/2026

AI21 Labs — Deep Dive

AI21 Labs é uma empresa israelense de IA e produto, um player significativo no espaço de IA generativa, competindo com gigantes como OpenAI. A empresa escalou seus modelos de linguagem de 1.5 bilhões para até 398 bilhões de parâmetros, oferecendo produtos como o assistente de escrita Wordtune e o modelo de contexto longo Jamba.

NLP AI products large language models AI21 Labs

RESEARCHarXiv CS.CL·21d ago

Beyond Sentiment Classification: A Generative Framework for Emotion Intensity Evaluation in Text

This work introduces a novel approach to emotion modeling, shifting from discrete classification to continuous emotion intensity evaluation in text. The authors constructed a dataset of emotional intensity scores and fine-tuned generative language models to output continuous values from 0-100, outperforming classification baselines and demonstrating generalization capabilities.

emotion modeling Finance NLP sentiment analysis

DOCDEV.to AI·5d ago

A surprisingly effective lightweight sentiment analysis approach for product reviews in Python

This content describes a surprisingly effective lightweight lexicon-based approach for sentiment analysis of e-commerce product reviews in Python. The simple technique proved useful for early-stage positive/negative detection, prototyping, and bulk filtering before moving to more advanced transformer-based models.

learning machine learning NLP sentiment analysis

ARTICLEDEV.to AI·21d ago

Mastering the Art of Conversation: Expert ChatGPT Tips and Tricks

This article explores expert tips and tricks for mastering ChatGPT, OpenAI's revolutionary AI chatbot. It discusses understanding its capabilities and limitations to unlock its full potential in conversations and various applications.

learning ChatGPT NLP AI

DOCDEV.to AI·4/17/2026

Understanding Transformers Part 9: Stacking Self-Attention Layers

This article explains why self-attention values replace original positional encodings, as they integrate contextual information from all words, clarifying relationships. It then introduces stacking multiple self-attention layers, each with unique weights, to capture more complex linguistic relationships within sentences and paragraphs.

neural networks Self-Attention deep learning NLP

DOCDEV.to AI·25d ago

2026 NLP Data Collection Guide: How Proxy Networks Improve Large-Scale Data Crawling Efficiency

NLP data collection is critical for building AI systems and large language models, but faces significant challenges in large-scale crawling environments. Advanced anti-bot systems, IP blocking, and data quality issues can be improved by using proxy networks.

Proxy Networks NLP AI Systems web-scraping

RESEARCHarXiv CS.CL·5/4/2026

NorBERTo: A ModernBERT Model Trained for Portuguese with 331 Billion Tokens Corpus

NorBERTo is a new ModernBERT model trained on a 331 billion token Brazilian Portuguese corpus (Aurora-PT), designed for long-context support and efficient attention mechanisms. It achieves state-of-the-art results among evaluated encoder models on semantic similarity, textual entailment, and classification tasks using datasets like ASSIN 2 and PLUE.

AI models BERT Portuguese NLP

RESEARCHarXiv CS.CL·4/15/2026

LLMs Struggle with Abstract Meaning Comprehension More Than Expected

This research investigates LLMs' ability to comprehend abstract meanings, revealing that models like GPT-4o struggle in zero-shot, one-shot, and few-shot settings, while fine-tuned models like BERT and RoBERTa perform better. It proposes a bidirectional attention classifier that significantly enhances the accuracy of fine-tuned models in interpreting abstract concepts.

LLMs GPT-4o NLP abstract meaning comprehension

DOCAWS Machine Learning Blog·19d ago

Build AI-powered dashboard automation agents with NLP on Amazon Bedrock AgentCore

This solution enables building and operating AI-powered dashboard automation agents using Amazon Bedrock AgentCore, Strands Agents, and Amazon Quick transforms. It provides a secure, scalable, and intelligent system for transforming data into actionable business insights.

NLP Data transformation Amazon Bedrock automation

RESEARCHarXiv CS.CL·27d ago

Instructions shape Production of Language, not Processing

This research paper explores a production-centered mechanism in language models, revealing an asymmetry between language processing and production. It shows that instructions significantly shape information in output tokens, but not in sample tokens, correlating strongly with model behavior.

language models cognitive science NLP AI Research

CASEDEV.to AI·15d ago

The Inexcusable Silence of a Well-Configured AI Treasure Hunt Engine

The article details the challenges faced by Veltrix operators in developing an AI-powered treasure hunt game, specifically due to prioritizing AI algorithms over game mechanics. This decision led to significant debugging issues related to misconfigured APIs and incomplete data integration, eventually resolved by a major overhaul of their configuration and deployment strategy.

game development kubernetes NLP system architecture

RESEARCHarXiv CS.CL·4/6/2026

Pragmatics Meets Culture: Culturally-adapted Artwork Description Generation and Evaluation

Este artigo apresenta a tarefa de geração de descrições de arte culturalmente adaptadas para combater o viés cultural em modelos de linguagem na geração de texto aberto. Ele propõe um framework de avaliação baseado em perguntas e respostas culturalmente fundamentadas, mostrando que um modelo de locutor pragmático melhora significativamente a compreensão do ouvinte.

Art Description language models evaluation Pragmatics

RESEARCHarXiv CS.CL·15d ago

A Survey of Text and Speech Resources for Hausa and Fongbe: Availability, Quality, and Gaps for NLP Development

This survey catalogs publicly available text and speech resources for Hausa and Fongbe, two West African languages, to assess their current state and identify gaps for NLP development. It systematically documents various resources, finding Hausa benefits from broader text diversity compared to Fongbe.

African languages Fongbe NLP Hausa

RESEARCHDEV.to AI·4/12/2026

ACUTE-EVAL: Improved Dialogue Evaluation with Optimized Questions and Multi-turnComparisons

The title introduces ACUTE-EVAL, a method to improve the evaluation of dialogue systems. It focuses on optimizing questions and multi-turn comparisons for a more precise analysis of conversational AI quality.

ACUTE-EVAL IA Conversacional NLP Avaliação de Diálogo