← heapsort-ai

language models

103 items

RESEARCHarXiv CS.LG·5/5/2026

StyleShield: Exposing the Fragility of AIGC Detectors through Continuous Controllable Style Transfer

The paper introduces StyleShield, a novel flow matching framework for conditional text style transfer that exposes the fragility of AI-generated content (AIGC) detectors. It operates in continuous token embedding space to blur the statistical boundary between human and AI writing, challenging the reliability of current detection services.

27
RESEARCHarXiv CS.CL·5/5/2026

H-Probes: Extracting Hierarchical Structures From Latent Representations of Language Models

This paper introduces H-probes, linear probes designed to extract hierarchical structure, specifically depth and pairwise distance, from the latent representations of large language models. The research shows these probes robustly find low-dimensional subspaces crucial for performance in synthetic tree traversal tasks, generalizing well both within and out-of-domain.

27
RESEARCHarXiv CS.LG·4/9/2026

$S^3$: Stratified Scaling Search for Test-Time in Diffusion Language Models

O trabalho propõe $S^3$ (Stratified Scaling Search), um método de busca guiado por verificador para melhorar a qualidade de geração em modelos de linguagem de difusão durante o tempo de inferência. Ele realoca a computação no processo de denoising, avaliando e reamostrando seletivamente candidatos promissores para favorecer saídas de maior qualidade.

27
RESEARCHarXiv CS.CL·4/13/2026

EMA Is Not All You Need: Mapping the Boundary Between Structure and Content in Recurrent Context

This research explores Exponential Moving Average (EMA) traces as a minimal recurrent context to delineate the capabilities and limitations of fixed-coefficient accumulation in sequence models. It demonstrates that EMA traces excel at encoding temporal structure, matching advanced models on structural tasks, yet fundamentally fail to capture token identity, resulting in significantly reduced performance for language modeling.

27
RESEARCHarXiv CS.CL·4/16/2026

KMMMU: Evaluation of Massive Multi-discipline Multimodal Understanding in Korean Language and Context

KMMMU is a new native Korean benchmark for evaluating multimodal understanding in Korean cultural and institutional settings, featuring 3,466 questions from native exams. The study shows that current AI models achieve only 42.05% accuracy on the full set, with significant failures in culturally and discipline-specific problems.

27
RESEARCHarXiv CS.AI·4/27/2026

Math Takes Two: A test for emergent mathematical reasoning in communication

This paper proposes Math Takes Two, a new benchmark designed to assess the emergence of mathematical reasoning in language models through communication. It tests whether two agents, without prior mathematical knowledge, can develop a shared symbolic protocol to solve a visually grounded task where a numerical system facilitates extrapolation.

27
RESEARCHarXiv CS.CL·4/8/2026

Document Optimization for Black-Box Retrieval via Reinforcement Learning

Este artigo de pesquisa propõe uma nova abordagem para otimização de documentos, transformando-os para melhor alinhamento com sistemas de recuperação via Reinforcement Learning (GRPO), utilizando melhorias de ranking como recompensa. O método, aplicável a retrievers de caixa preta, demonstrou ganhos em tarefas de recuperação de código e documentos visuais.

27
RESEARCHarXiv CS.AI·8d ago

Grokers: Bottom-Up Inductive Comprehension and Write-Time Intelligence over Typed Knowledge Graphs

Grokers is an innovative architecture for building persistent, structured comprehension of typed knowledge graphs through bottom-up inductive traversal. Unlike RAG, it shifts intelligence to write time, where autonomous Groker agents analyze and enrich attributes via language models for all future queries at zero cost.

27
RESEARCHarXiv CS.AI·27d ago

DisaBench: A Participatory Evaluation Framework for Disability Harms in Language Models

DisaBench introduces a participatory evaluation framework to assess disability-related harms in large language models, addressing the inadequacy of general-purpose safety benchmarks. It features a co-created taxonomy of twelve harm categories, a methodology pairing benign and adversarial prompts, and a dataset with human-annotated labels, revealing subtle harms often missed by standard evaluations.

27
RESEARCHarXiv CS.AI·8d ago

Emergent Collaborative Deliberation in Multi-Model AI Systems: A BFT-Derived Protocol for Epistemic Synthesis

The Consilium Protocol, derived from Byzantine Fault Tolerance, is introduced for structured multi-model AI deliberation, treating inter-model disagreement as an epistemic signal. The study demonstrates that cognitive personas determine epistemic behavior and that RLHF alignment training creates measurable epistemic blind spots.

27