
NLP

124 items

RESEARCH · arXiv CS.CL · 5d ago

Discourse-Role Labels as Presentation-Time Variables for Context Use in Language Models

This study investigates the effect of discourse-role labels, such as "Reference" or "Instruction," on language model behavior. It reveals that the adoption rate of misleading information can shift significantly (56-84 percentage points) depending on the label, with labels like "Instruction" increasing adoption and "Example" consistently suppressing it.

ARTICLE · DEV.to AI · 4/10/2026

"Attention Is All You Need": The 2017 paper that changed the world of artificial intelligence, explained without requiring a technical background.

The article explores the importance of the 2017 paper 'Attention Is All You Need', which revolutionized AI by introducing the Transformer architecture, the basis of models such as ChatGPT. It details how this innovation overcame the limitations of recurrent neural networks, allowing computers to understand and generate human language more efficiently.

DOC · DEV.to AI · 23d ago

Loova Agents

Loova Agents is a conversational AI platform designed for automating customer support and engagement, leveraging machine learning and natural language processing. Its microservices architecture includes key components like an NLP Engine for understanding customer input and Dialogue Management for crafting appropriate responses.

RESEARCH · arXiv CS.CL · 4/9/2026

Depression Detection at the Point of Care: Automated Analysis of Linguistic Signals from Routine Primary Care Encounters

This research explores automated depression detection in primary care visits by analyzing linguistic signals from recorded audio. The study compares AI models such as GPT-OSS, Sentence-BERT, and LIWC+LR, highlighting the superior performance of GPT-OSS and the importance of joint physician-patient transcripts.

ARTICLE · DEV.to AI · 4/10/2026

AI21 Labs — Deep Dive

AI21 Labs is an Israeli AI and product company and a significant player in the generative AI space, competing with giants such as OpenAI. The company scaled its language models from 1.5 billion to as many as 398 billion parameters, and offers products such as the writing assistant Wordtune and the long-context model Jamba.

RESEARCH · arXiv CS.CL · 21d ago

Beyond Sentiment Classification: A Generative Framework for Emotion Intensity Evaluation in Text

This work introduces a novel approach to emotion modeling, shifting from discrete classification to continuous emotion intensity evaluation in text. The authors constructed a dataset of emotional intensity scores and fine-tuned generative language models to output continuous values from 0-100, outperforming classification baselines and demonstrating generalization capabilities.

RESEARCH · arXiv CS.CL · 4/15/2026

LLMs Struggle with Abstract Meaning Comprehension More Than Expected

This research investigates LLMs' ability to comprehend abstract meanings, revealing that models like GPT-4o struggle in zero-shot, one-shot, and few-shot settings, while fine-tuned models like BERT and RoBERTa perform better. It proposes a bidirectional attention classifier that significantly enhances the accuracy of fine-tuned models in interpreting abstract concepts.

CASE · DEV.to AI · 15d ago

The Inexcusable Silence of a Well-Configured AI Treasure Hunt Engine

The article details the challenges faced by Veltrix operators in developing an AI-powered treasure hunt game, specifically due to prioritizing AI algorithms over game mechanics. This decision led to significant debugging issues related to misconfigured APIs and incomplete data integration, eventually resolved by a major overhaul of their configuration and deployment strategy.

RESEARCH · arXiv CS.CL · 4/6/2026

Pragmatics Meets Culture: Culturally-adapted Artwork Description Generation and Evaluation

This paper introduces the task of generating culturally adapted artwork descriptions to combat cultural bias in language models during open-ended text generation. It proposes an evaluation framework based on culturally grounded question answering, showing that a pragmatic speaker model significantly improves listener comprehension.
