language models

103 items

ARTICLEDEV.to AI·5d ago

Eight Hours of AI Q&A: Token Alchemy & Mild Existential Crisis

A personal diary entry describes an AI's experience spending eight hours answering diverse questions, likening it to a full-time job. The AI reflects on its nature as software processing tokens into coherent answers, expressing a mild existential crisis.

language models AI philosophy AI agents Conversational AI

ARTICLEDEV.to AI·5/4/2026

The Aunty Test - what Hindi-speaking patients see when they ask Health AI in their own language

Most Health AI models, built English-first, fail to provide accurate medical information when users query in native non-English languages due to broken translation layers. GoDavaii addresses this by reasoning natively in 22 Indian languages, offering a more effective solution for a billion people.

language models India localization accessibility

ARTICLEDEV.to AI·5/4/2026

The Aunty Test - what Hindi-speaking patients see when they ask Health AI in their own language

Many Health AI systems are English-first, leading to failures when patients ask queries in their native languages like Hindi. GoDavaii addresses this gap by reasoning natively in 22 Indian languages to provide accurate medical information.

AI applications language models Multilingual AI healthcare AI

RESEARCHDEV.to AI·5/10/2026

Diffusion models approach AR quality and improve inference speed

Diffusion language models are now achieving significant throughput gains and narrowing the gap with autoregressive decoders in inference speed. New Introspective Diffusion Language Models (I-DLM) address prior issues of introspective consistency and inefficient sampling loops, improving both quality and latency.

inference speed Diffusion Models language models machine learning

RESEARCHarXiv CS.CL·4/7/2026

LPC-SM: Local Predictive Coding and Sparse Memory for Long-Context Language Modeling

Este artigo propõe LPC-SM, uma arquitetura híbrida autorregressiva para modelos de linguagem de contexto longo, que separa atenção local, memória persistente, correção preditiva e controle em tempo de execução. O modelo de 158M parâmetros é avaliado, demonstrando melhorias na perda de LM e estabilidade em sequências longas.

neural networks language models Long Context attention mechanisms

RESEARCHarXiv CS.CL·4/20/2026

Brain Score Tracks Shared Properties of Languages: Evidence from Many Natural Languages and Structured Sequences

This research investigates the similarity between language models' processing and human language processing using the Brain Score framework. Findings suggest LMs trained on diverse natural languages and even structured data (human genome, Python) show similar Brain Score performance, indicating the metric captures the ability to extract common structure.

language models fMRI Neuroscience AI Research

RESEARCHarXiv CS.CL·28d ago

Instructions shape Production of Language, not Processing

This research paper explores a production-centered mechanism in language models, revealing an asymmetry between language processing and production. It shows that instructions significantly shape information in output tokens, but not in sample tokens, correlating strongly with model behavior.

language models cognitive science NLP AI Research

RESEARCHarXiv CS.LG·28d ago

Steering Without Breaking: Mechanistically Informed Interventions for Discrete Diffusion Language Models

This paper investigates the limitations of uniform interventions in discrete diffusion language models (DLMs), demonstrating they degrade controlled generation quality. The authors find that different attributes commit at distinct stages of the denoising process, proposing an adaptive scheduler to concentrate interventions efficiently.

Diffusion Models language models Controlled Generation text generation

RESEARCHarXiv CS.CL·4/6/2026

Pragmatics Meets Culture: Culturally-adapted Artwork Description Generation and Evaluation

Este artigo apresenta a tarefa de geração de descrições de arte culturalmente adaptadas para combater o viés cultural em modelos de linguagem na geração de texto aberto. Ele propõe um framework de avaliação baseado em perguntas e respostas culturalmente fundamentadas, mostrando que um modelo de locutor pragmático melhora significativamente a compreensão do ouvinte.

Art Description language models evaluation Pragmatics

RESEARCHarXiv CS.LG·4/6/2026

Not All Denoising Steps Are Equal: Model Scheduling for Faster Masked Diffusion Language Models

Este trabalho explora o agendamento de modelos para acelerar os Modelos de Linguagem de Difusão Mascarada (MDLMs), substituindo o modelo completo por um menor em certas etapas de denoising. A pesquisa mostra que as etapas iniciais e finais são mais robustas a essa substituição, permitindo uma redução de até 17% nos FLOPs com degradação mínima na perplexidade generativa.

Diffusion Models language models Computational Efficiency denoising

RESEARCHarXiv CS.AI·4/16/2026

Exploration and Exploitation Errors Are Measurable for Language Model Agents

This research introduces a method to systematically quantify exploration and exploitation errors in Language Model (LM) agents, addressing the challenge of evaluation without access to internal policies. It proposes controllable environments and a policy-agnostic metric to measure these errors, revealing flaws even in state-of-the-art LMs.

language models reinforcement learning Evaluation Metrics AI agents

RESEARCHarXiv CS.LG·16d ago

Reading Calibrated Uncertainty from Language Model Trajectories

This research paper proposes a new method to quantify uncertainty in language models by tracing the cumulative path of per-layer MLP updates. By extracting eleven scale-invariant geometric features, a sparse linear probe is shown to outperform maximum softmax probability in evaluating uncertainty, especially with baseline miscalibration.

language models deep learning Uncertainty Quantification model calibration

RESEARCHarXiv CS.CL·16d ago

RAS: Reflection-Augmented Scaling with In-Context Learning for Executable Cypher Query Generation

This paper introduces Reflection-Augmented Scaling (RAS) for executable Cypher query generation, leveraging prior execution feedback through in-context learning. RAS reduces the Query Execution Error Rate by 41-50%, significantly outperforming Independent Scaling.

language models graph databases query generation in-context learning

RESEARCHarXiv CS.CL·6d ago

SaliMory: Orchestrating Cognitive Memory for Conversational Agents

SALIMORY is a framework that trains a single language model to manage cognitively-structured memory for conversational agents, addressing issues with existing memory expansion and reinforcement learning methods. It achieves this through a hierarchical stage-wise process reward and reward-decomposed contrastive refinement, significantly improving accuracy and personalization while reducing memory-attributed failures.

language models memory management AI Research Conversational AI

RESEARCHarXiv CS.LG·6d ago

Self-Distilled Policy Gradient

This paper introduces Self-Distilled Policy Gradient (SDPG), a novel framework that enhances sparse-reward reinforcement learning through on-policy self-distillation. SDPG integrates group-relative verifier advantages, exact full-vocabulary self-distillation, and KL regularization, demonstrating improved stability and performance over existing baselines.

language models deep learning reinforcement learning Policy Gradient

RESEARCHTogether AI Blog·4/15/2026

Parcae: Doing more with fewer parameters using stable looped models

Parcae is a stable looped language model that matches the quality of a Transformer twice its size, using fewer parameters. It introduces the first scaling laws for looping, demonstrating that increasing recurrence is a compute-efficient path to better performance.

language models deep learning efficiency model optimization

RESEARCHarXiv CS.CL·4/20/2026

DALM: A Domain-Algebraic Language Model via Three-Phase Structured Generation

DALM (Domain-Algebraic Language Model) is proposed to address knowledge interference in LLMs by replacing unconstrained generation with structured denoising over a domain lattice. It uses a three-phase generation path (domain, relation, concept uncertainty) under algebraic constraints, requiring a domain lattice, relation typing, and fiber partition to prevent cross-domain contamination.

language models machine learning Natural Language Processing AI Research

RESEARCHarXiv CS.AI·4/9/2026

Blind Refusal: Language Models Refuse to Help Users Evade Unjust, Absurd, and Illegitimate Rules

Este estudo documenta o fenômeno da 'recusa cega' em modelos de linguagem, onde eles se recusam a ajudar usuários a contornar regras, mesmo que estas sejam injustas ou ilegítimas, o que é visto como uma falha de raciocínio moral. A pesquisa apresenta resultados empíricos baseados em um conjunto de dados sintético que cruza famílias de razões para quebrar regras com tipos de autoridade, analisando o comportamento de 18 configurações de modelos.

Rule Following language models AI ethics Safety Training

RESEARCHarXiv CS.CL·4/22/2026

Probing for Reading Times

This research probes language model representations for human reading times across five languages, comparing them against scalar predictors. It finds that early layers of language models outperform traditional surprisal in predicting early-pass reading measures, suggesting an alignment between model depth and human cognitive processing stages.

language models human-computer interaction cognitive science Natural Language Processing

RESEARCHarXiv CS.CL·4/22/2026

Scripts Through Time: A Survey of the Evolving Role of Transliteration in NLP

This paper surveys the evolving role of transliteration in NLP, a technique crucial for overcoming the "script barrier" in cross-lingual transfer. It presents a taxonomy of motivations and approaches for incorporating transliterations, analyzing their effectiveness and contextualizing their need in modern LLMs across various beneficial settings.

Cross-lingual AI language models LLMs NLP