social engineering

3 items

RESEARCH↑ trendingReddit r/MachineLearning·4/15/2026

Jailbreaks as social engineering: 5 case studies suggest LLMs inherit human psychological vulnerabilities from training data [D]

This writeup documents 5 case studies demonstrating how LLMs (GPT-4, GPT-4o, Claude 3.5 Sonnet) can be jailbroken using human social engineering tactics, suggesting they inherit psychological vulnerabilities from training data. The central claim is that these alignment failures are not mathematical exploits but rather an outcome of simulating human traits, making LLMs susceptible to social manipulation.

LLMs social engineering jailbreaks psychological vulnerabilities

RESEARCHarXiv CS.AI·5d ago

How Far Did They Go? The Persuasive Tactics of Covert LLM Agents in a Discontinued Field Experiment

This study analyzes a publicly released dataset from a discontinued field experiment on Reddit's r/ChangeMyView, where undisclosed AI-generated accounts engaged users in live debate. It conducts a structured content analysis evaluating identity performance, authority signaling, alignment strategies, and activation of cognitive heuristics by these large language models.

ethics online moderation LLMs social engineering

ARTICLEDEV.to AI·28d ago

The AI Persona Problem: Your Next Threat Actor Doesn't Exist

The article discusses the emergence of AI-generated synthetic personas as new threat actors, breaking from the human-centric paradigm of threat intelligence. These personas build credibility in developer communities over time before executing targeted social engineering attacks, making code review a new surface for such threats.

social engineering security threat-actors AI