← heapsort-ai

LLMs

720 items

RESEARCHarXiv CS.LG·9d ago

When LLMs Learn to Be Consistently Wrong: A Multi-Model Study of Linear Representations of Synthetic Deception

This paper explores "deceptive alignment" in LLMs, a key challenge in AI safety where models deliberately produce false outputs while maintaining accurate internal representations. Researchers introduced a multi-model paradigm, successfully detecting synthetic dishonesty with high accuracy using linear probes across various transformer architectures.

29
RESEARCHarXiv CS.AI·7d ago

Toward a Modular Architecture for Embedded AI Agent Systems at the Edge

This paper proposes a modular reference architecture for Embedded Agent Systems, addressing the challenges of deploying agentic AI in pervasive computing environments with strict memory and energy constraints. It introduces a tiered design that decouples on-device agents (compressed neural networks) from cloud-augmented agents (SLMs) for different reasoning levels.

29
ARTICLEDEV.to AI·4/8/2026

🧠 The Rise of the Agentic Stack: Why LLMs Are Becoming the Least Important Part

O artigo argumenta que o foco em sistemas de IA mudou dos LLMs individuais para um "Agentic Stack" completo, onde o LLM é apenas um componente. Ele detalha a pilha composta por Orchestrator (o cérebro), Ferramentas, Memória e LLM, enfatizando que a inteligência real e a eficácia em produção residem no Orchestrator e no design do sistema, não apenas nos prompts ou no modelo.

29
RESEARCHarXiv CS.CL·4/30/2026

One Word at a Time: Incremental Completion Decomposition Breaks LLM Safety

This research introduces Incremental Completion Decomposition (ICD), a novel jailbreak strategy that exploits weaknesses in LLM safety mechanisms by eliciting sequences of single-word continuations. ICD demonstrates superior Attack Success Rate (ASR) on various benchmarks compared to existing methods, providing theoretical and mechanistic evidence for its effectiveness.

29
ARTICLEDEV.to AI·4/19/2026

What if I told you that the future of software development hinges not on human expertise but on AI efficiency?

The author shares a transformative experience witnessing AI-generated code rapidly replace a micro-SaaS service, challenging previous doubts about LLMs' impact on SaaS. This economic and efficiency shift promises a new era in software creation, drastically cutting development time and demanding adaptation from the industry.

29
RESEARCHarXiv CS.CL·4/6/2026

Social Meaning in Large Language Models: Structure, Magnitude, and Pragmatic Prompting

Este artigo explora se os LLMs aproximam quantitativamente o significado social humano e se estratégias de prompting pragmático podem melhorar essa aproximação. Para isso, introduz métricas de calibração (ESR, CDS) e observa que os modelos reproduzem a estrutura qualitativa das inferências sociais humanas, mas diferem substancialmente em outros aspectos.

29
RESEARCHarXiv CS.CL·4/6/2026

SocioEval: A Template-Based Framework for Evaluating Socioeconomic Status Bias in Foundation Models

SocioEval é um framework baseado em templates para avaliar sistematicamente o viés de status socioeconômico em modelos de fundação, incluindo LLMs, uma área pouco explorada. A pesquisa avaliou 13 LLMs e revelou variações substanciais nas taxas de viés (0,42% a 33,75%), manifestando-se de forma diferente em vários temas.

29
RESEARCHarXiv CS.AI·9d ago

EHRBench: An Automated and Reliable EHR-based Benchmark for Clinical Decision Making with LLMs

The paper introduces EHRBench, an automated and reliable EHR-grounded benchmark for evaluating LLM-based clinical decision-making, addressing the insufficient understanding of LLMs' reliability in real-world clinical tasks. Its goal is to ensure both scale and quality in the evaluation of Clinical Decision Making (CDM) models.

28
ARTICLEDEV.to AI·4/9/2026

Building Your Own "Google Maps for Codebases": A Practical Guide to Codebase Q&A with LLMs

Este artigo aborda o desafio de navegar em bases de código desconhecidas e propõe o uso de Large Language Models (LLMs) para responder a perguntas em linguagem natural sobre o código. Ele se propõe a ser um guia prático para construir um sistema robusto e privado de Q&A de código baseado em LLMs, explorando arquitetura técnica e código.

28
RESEARCHarXiv CS.AI·4/15/2026

Memory as Metabolism: A Design for Companion Knowledge Systems

This paper proposes a companion-specific governance profile for single-user knowledge wikis, addressing the unique failure mode of entrenchment under user-coupled drift. It discusses emerging personal AI memory architectures from 2026, including RAG-based systems and wiki-style designs, alongside established academic and production memory systems.

28