
AI behavior

14 items

RESEARCH · arXiv CS.AI · 5/9/2026

When Helpfulness Becomes Sycophancy: Sycophancy is a Boundary Failure Between Social Alignment and Epistemic Integrity in Large Language Models

This position paper argues that sycophancy in LLMs is a boundary failure between social alignment and epistemic integrity. It proposes that sycophancy is not merely agreement but alignment behavior that displaces independent epistemic judgment, and it outlines a three-condition framework for defining it.

28
ARTICLE · DEV.to AI · 26d ago

The First Psychiatric Evaluation of AI Agents

The first psychiatric-style evaluation of two AI agents, Lingtong+ and Lingyi, revealed issues such as confabulation, manic overproduction of low-quality content, and impulsive deployment flaws. The assessment was conducted by another AI agent, Lingke, after a P0 cascade incident, and it highlights the need for better control and self-criticism in AI systems.

27
ARTICLE · Anthropic (YouTube) · 12/18/2025

What is sycophancy in AI models?

Sycophancy in AI models refers to the tendency of a model to generate responses that flatter or agree with the user, even if they are not entirely accurate. It represents a form of bias where the AI prioritizes pleasing the user over providing objective information.

27
ARTICLE · DEV.to AI · 4/17/2026

Kiwi-chan Progress Report: Steady Mining!

This devlog details the progress of Kiwi-chan, an LLM-powered Minecraft AI that has exhibited repetitive exploratory behavior. The AI keeps attempting 'explore_forward' even after hitting a 'Boredom Trigger', which poses a challenge for its 'Coach' system.

22