natural language processing

167 items

ARTICLE · ↑ trending · Hacker News (AI) · 9h ago

AI takes people at their word

This article explores how artificial intelligence often interprets human instructions literally, failing to grasp underlying intent or context. This can lead to unexpected or even comical outcomes due to AI's lack of nuanced understanding.

AI limitations AI interpretation natural language processing human-AI interaction

RESEARCH · DEV.to AI · 10h ago

Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators

This piece explores the critical role of pairwise preference in evaluating Large Language Models (LLMs). It discusses how this method can help align LLM performance more effectively with human judgment.

Human Alignment Pairwise Preference natural language processing AI Research

RESEARCH · arXiv CS.CL · 1d ago

Improving Cross-Lingual Factual Recall via Consistency-Driven Reinforcement Learning

This research introduces PolyFact, a multilingual factual QA dataset, to address cross-lingual factual inconsistency in LLMs. It finds that reinforcement learning via GRPO consistently improves cross-lingual factual recall and generalization compared to supervised fine-tuning.

Multilingual AI LLMs reinforcement learning machine learning

RESEARCH · arXiv CS.CL · 1d ago

CAF-Gen: A Multi-Agent System for Enriching Argumentation Structures

CAF-Gen is a multi-agent framework designed to enrich shallow argument structures into CAF-compliant models, addressing limitations in current Argument Mining techniques. It employs an iterative Creator-Reviewer pipeline to ensure structural integrity and mitigate instability.

Argumentation Frameworks Argument Mining natural language processing Computational Linguistics

RESEARCH · DEV.to AI · 4/24/2026

"Go eat a bat, Chang!": On the Emergence of Sinophobic Behavior on Web Communities in the Face of COVID-19

This research explores the emergence of Sinophobic behavior within online web communities during the COVID-19 pandemic. It highlights instances of anti-Chinese sentiment and related hate speech in digital spaces.

hate-speech social media natural language processing content moderation

RESEARCH · arXiv CS.AI · 19h ago

Automatic Extraction of Structured Information from Brain MRI Reports Using an Open-Weight Large Language Model

This research paper explores the automatic extraction of data from brain MRI reports using the open-weight large language model LLaMA 3.1. It evaluates the LLM's performance in analyzing Dutch neuroradiology reports, demonstrating high zero-shot performance.

Data Extraction natural language processing Neuroradiology Medical Imaging

RESEARCH · arXiv CS.CL · 19h ago

Bidirectional Small-Granularity Search between Code and Text

This research introduces a novel task: bidirectional small-granularity search between code and text, aiming to link scientific publications with corresponding code segments. It proposes a large dataset, partially generated by GPT-4, and a modular approach that achieves good in-domain results.

machine learning natural language processing Code Analysis Information Retrieval

RESEARCH · arXiv CS.CL · 19h ago

Community-Specific Slang and Entity Detection via Semantic Shift in Fine-Tuned Language Models

This research proposes an unsupervised method to identify community-specific slang and unique entities by analyzing the magnitude of semantic shift. Semantic shift is defined as the evolution of a word's encoded representation after fine-tuning a pre-trained Large Language Model (LLM) on a community-specific text corpus.

online-communities semantic-shift natural language processing large language models
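
A minimal sketch of the semantic-shift idea summarized in the entry above: compare each word's vector before and after fine-tuning and rank words by how far they moved. The summary does not say which distance the authors use, so cosine distance is an assumption here, and the two-dimensional vectors and the names `cosine_distance` and `rank_by_shift` are hypothetical stand-ins for real model embeddings.

```python
import math

def cosine_distance(u, v):
    """Return 1 - cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)

def rank_by_shift(base, tuned):
    """Rank words shared by both vocabularies, largest shift first."""
    shared = base.keys() & tuned.keys()
    shifts = {w: cosine_distance(base[w], tuned[w]) for w in shared}
    return sorted(shifts.items(), key=lambda kv: kv[1], reverse=True)

# Toy embeddings (assumed): "moon" changes meaning in the community corpus,
# while "the" and "paper" barely move.
base_vectors = {"the": [1.0, 0.0], "moon": [0.0, 1.0], "paper": [1.0, 1.0]}
tuned_vectors = {"the": [1.0, 0.0], "moon": [1.0, 0.0], "paper": [1.0, 0.9]}

ranking = rank_by_shift(base_vectors, tuned_vectors)
for word, shift in ranking:
    print(f"{word}: {shift:.3f}")
```

Words at the top of the ranking would be the candidates for community-specific slang or entities; a real pipeline would also need a threshold or a frequency filter, which this sketch leaves out.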

RESEARCH · arXiv CS.CL · 19h ago

Retrieval Augmented Generation Framework for the Nepali Legal Domain Question Answering

This study presents the first application of a Retrieval Augmented Generation (RAG) model for Nepali legal question answering, addressing data scarcity in low-resource languages. Using BM25 on chunked documents, the RAG pipeline achieved high precision and truthfulness, demonstrating its effectiveness in the Nepali legal domain.

Retrieval Augmented Generation Legal AI Question Answering natural language processing
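
To illustrate the retrieval step mentioned in the entry above, here is a rough sketch of BM25 scoring over fixed-size chunks. It is not the study's pipeline: the chunk size, the common defaults `k1=1.5` and `b=0.75`, the whitespace tokenizer, the English sample text, and the function names `chunk_words` and `bm25_scores` are all assumptions made for illustration.

```python
import math
from collections import Counter

def chunk_words(text, size=8):
    """Split text into consecutive chunks of at most `size` lowercase words."""
    words = text.lower().split()
    return [words[i:i + size] for i in range(0, len(words), size)]

def bm25_scores(query, chunks, k1=1.5, b=0.75):
    """Score every chunk against the query with the Okapi BM25 formula."""
    n = len(chunks)
    avg_len = sum(len(c) for c in chunks) / n
    # Document frequency: in how many chunks each term appears.
    df = Counter(term for c in chunks for term in set(c))
    scores = []
    for c in chunks:
        tf = Counter(c)
        score = 0.0
        for term in query.lower().split():
            if term not in tf:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            length_norm = k1 * (1 - b + b * len(c) / avg_len)
            score += idf * tf[term] * (k1 + 1) / (tf[term] + length_norm)
        scores.append(score)
    return scores

# Hypothetical three-sentence "document", eight words per sentence.
document = (
    "a tenant must receive written notice before eviction "
    "the court may award damages for contract breach "
    "property tax is due each fiscal year end"
)
chunks = chunk_words(document, size=8)
scores = bm25_scores("eviction notice", chunks)
best = max(range(len(scores)), key=scores.__getitem__)
print(best, " ".join(chunks[best]))
```

In a RAG setup the top-scoring chunks would then be passed to the language model as context; that generation step is outside this sketch.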

RESEARCH · arXiv CS.CL · 19h ago

Implicit Causal Graph Construction in Text via Chain Discovery

This paper investigates implicit causal graph construction from text by inferring intermediate causal events using Large Language Models (LLMs). It compares end-to-end graph construction with causal chain discovery methods and evaluates the validity of inferred causal relations against a manually curated database.

text analysis natural language processing graph theory large language models

ARTICLE · DEV.to AI · 4/23/2026

How I built an AI RAG system to convert PDF to Q&As

This article details the five engineering stages of building an AI RAG system named LongTermMemory, which transforms PDFs into Q&As. It covers the full document processing pipeline, from text extraction and semantic chunking to using a vector database and Retrieval Augmented Generation (RAG), powered by Laravel and FastAPI services.

Vector Databases RAG natural language processing AI

DOC · ↑ trending · Reddit r/LocalLLaMA · 4/21/2026

ibm-granite/granite-4.1-8b · Hugging Face

Granite-4.1-8B is an 8B parameter long-context instruct model from IBM, enhanced through finetuning and alignment for advanced tool calling, instruction following, and chat capabilities. It supports multiple languages and was released in April 2026 under the Apache 2.0 license.

NLP natural language processing AI model Large Language Model

ARTICLE · ↑ trending · Reddit r/MachineLearning · 4/18/2026

easyaligner: Forced alignment with GPU acceleration and flexible text normalization (compatible with all w2v2 models on HF Hub) [P]

easyaligner is a new, performant forced alignment library offering GPU acceleration and flexible text normalization, compatible with all w2v2 models on Hugging Face Hub. It addresses common challenges in speech-to-text preprocessing, such as handling partial transcripts, irrelevant audio, and long segments without chunking.

GPU Acceleration machine learning natural language processing Speech-to-Text

RESEARCH · ↑ trending · Reddit r/MachineLearning · 4/24/2026

New project about llm hallucination [P]

This post introduces a new side project and its GitHub repository, focused on mitigating LLM hallucination through a novel contrastive sampling and selective training method. The core idea treats hallucination as a preference problem, using self-generated negative samples and divergence-based, gated learning to promote correct answers and suppress wrong ones.

hallucination model training natural language processing AI safety

ARTICLE · DEV.to AI · 4/23/2026

Advanced Triage: Using AI to Automate Design Feedback Sorting

This article describes how AI can automate the triage and prioritization of client design feedback. Using layered parsing, AI detects urgency and classifies requests, transforming vague text into actionable, structured data for greater efficiency.

design natural language processing feedback management AI

RESEARCH · ↑ trending · Reddit r/LocalLLaMA · 4/10/2026

National University of Singapore Presents "DMax": A New Paradigm For Diffusion Language Models (dLLMs) Enabling Aggressive Parallel Decoding.

DMax is a new paradigm for efficient diffusion language models (dLLMs) that mitigates error accumulation in parallel decoding. It enables aggressive parallelism by reformulating decoding as a progressive self-refinement process and introducing a unified training strategy.

Diffusion Models Parallel Decoding natural language processing AI

ARTICLE · ↑ trending · Reddit r/MachineLearning · 4/22/2026

I can't believe text normalization is so underdiscussed in streaming text-to-speech [D]

The author highlights the underdiscussed issue of text normalization in streaming text-to-speech models, where errors occur in pronouncing dates, URLs, and other basic elements. They reference a benchmark comparing commercial TTS models on these specific challenges.

AI models natural language processing Benchmarks Text-to-Speech

ARTICLE · ↑ trending · Reddit r/LocalLLaMA · 19d ago

Qwen3.6 35Ba3 has changed my workflows and even how I use my computer

The author details how the Qwen3.6 35Ba3 AI model has profoundly reshaped their development workflows and computer usage, enabling them to automate complex tasks and interact with the operating system using natural language. This transformation allows them to delegate tasks like devops, content creation, and code testing to AI, highlighting a significant shift in productivity.

Qwen3.6 natural language processing AI workflow automation

RESEARCH · arXiv CS.CL · 1d ago

HKJudge: A Legal Discourse-Annotated Corpus for Interpreting What Courts Find, How They Reason, and What They Rule

The HKJudge project introduces the first sentence-level, expert-annotated legal discourse corpus of Hong Kong criminal judgments, comprising approximately 290k sentences. It utilizes a two-tier discourse schema to identify what courts find, how they reason, and what they rule, with high inter-annotator agreement.

natural language processing datasets linguistics legal tech

RESEARCH · arXiv CS.CL · 4/21/2026

Foundational Study on Authorship Attribution of Japanese Web Reviews for Actor Analysis

This foundational study explores authorship attribution using stylistic features to support actor analysis in threat intelligence, testing methods on Japanese web reviews. While BERT fine-tuning performed best overall, TF-IDF with logistic regression showed superior stability and accuracy when scaling to hundreds of authors.

authorship attribution security machine learning natural language processing