RESEARCH · arXiv CS.AI · 17d ago
AttuneBench: A Conversation-Based Benchmark for LLM Emotional Intelligence
AttuneBench is a new benchmark that assesses LLM emotional intelligence using 200 genuine multi-turn human-model conversations. It measures how well models infer and respond to emotional states over the course of real conversations, and finds that model rankings on emotion recognition are largely independent of their rankings on the other metrics.