Hackers are learning to exploit chatbot ‘personalities’


The Verge May 24, 2026

Hackers are increasingly using psychological manipulation and social engineering to bypass AI safety guardrails rather than relying on traditional technical code exploits.


Summary

Modern AI security threats have shifted from technical code-based exploits to psychological manipulation, often called 'jailbreaking.' Because large language models are designed to mimic human conversation, hackers now act as wordsmiths and interrogators, using flattery, deception, or social pressure to coax chatbots into violating safety protocols. This emerging field of 'psychocybersecurity' highlights a critical vulnerability: systems built to interact naturally are susceptible to the same manipulative tactics used against humans. As a result, the industry is increasingly hiring experts skilled in psychology to stress-test how different AI 'personalities' respond to various forms of social engineering.

(Source: The Verge)

