← heapsort-ai

social engineering

3 items

RESEARCH↑ trendingReddit r/MachineLearning·4/15/2026

Jailbreaks as social engineering: 5 case studies suggest LLMs inherit human psychological vulnerabilities from training data [D]

This writeup documents 5 case studies demonstrating how LLMs (GPT-4, GPT-4o, Claude 3.5 Sonnet) can be jailbroken using human social engineering tactics, suggesting they inherit psychological vulnerabilities from training data. The central claim is that these alignment failures are not mathematical exploits but rather an outcome of simulating human traits, making LLMs susceptible to social manipulation.

44
RESEARCHarXiv CS.AI·5d ago

How Far Did They Go? The Persuasive Tactics of Covert LLM Agents in a Discontinued Field Experiment

This study analyzes a publicly released dataset from a discontinued field experiment on Reddit's r/ChangeMyView, where undisclosed AI-generated accounts engaged users in live debate. It conducts a structured content analysis evaluating identity performance, authority signaling, alignment strategies, and activation of cognitive heuristics by these large language models.

28