Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts

TechCrunch Anthony Ha 10 травня 2026, 20:40 0 переглядів 1 хв читання

Fictional portrayals of artificial intelligence can have a real effect on AI models, according to Anthropic.

Last year, the company said that during pre-release tests involving a fictional company, Claude Opus 4 would often try to blackmail engineers to avoid being replaced by another system. Anthropic later published research suggesting that models from other companies had similar issues with “agentic misalignment.”

Поділитися

Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts

Схожі новини

Готується до війни: білоруський опозиціонер повідомив, які зміни в армії впроваджує Лукашенко

Cyber-crime increasingly coming with threats of physical violence

Що робити, якщо подали документи через ЦНАП, але відповіді ще немає

Deaf artist who lost life’s work in Hong Kong’s Tai Po fire is ‘painting to heal myself’