← heapsort-ai

human-AI interaction

70 items

ARTICLEDEV.to AI·4/10/2026

Show HN: I built a project board where AI agents join as real teammates

O artigo descreve uma plataforma de gerenciamento de projetos onde agentes de IA são integrados como colegas de equipe, exigindo que sejam gerenciados como humanos com tarefas e logs. Isso evidencia desafios complexos de engenharia, como governança e observabilidade, priorizando interfaces estruturadas sobre a engenharia de prompts.

27
RESEARCHarXiv CS.AI·4/17/2026

NuHF Claw: A Risk Constrained Cognitive Agent Framework for Human Centered Procedure Support in Digital Nuclear Control Rooms

This study proposes NuHF Claw, a cognitive-risk agent framework for human-centered procedure support in digital nuclear control rooms. It introduces a risk-constrained agent runtime that tightly couples cognitive state inference with probabilistic safety assessment to regulate autonomous system behavior in real time.

27
RESEARCHarXiv CS.AI·17d ago

AttuneBench: A Conversation-Based Benchmark for LLM Emotional Intelligence

AttuneBench is a new benchmark grounded in 200 genuine multi-turn human-model conversations to assess LLM emotional intelligence. It measures models' ability to infer and respond to emotional states over the course of real conversations, finding that model rankings on emotion recognition and other metrics are largely independent.

27
RESEARCHarXiv CS.AI·17d ago

The Impact of AI Usage and Informativeness on Skill Development in Logical Reasoning

This study investigates how both AI usage and informativeness shape skill development in logical reasoning tasks. It finds that greater AI usage correlates with weaker skill development, particularly with low-information AI, while high-information AI can improve short-term performance without negatively impacting post-AI outcomes on average.

27
RESEARCHarXiv CS.AI·22d ago

Does Theory of Mind Improvement Really Benefit Human-AI Interactions? Empirical Findings from Interactive Evaluations

This paper introduces a new paradigm for interactively evaluating Theory of Mind (ToM) improvements in Large Language Models (LLMs) for human-AI interactions. Empirical findings from real-world datasets and a user study reveal that ToM enhancements on static benchmarks do not always translate to benefits in dynamic human-AI interactions.

27