Versions :<123456Live>
Snapshot 3:Mon, Jul 21, 2025 12:50:42 PM GMT last edited by Joshua

Study: AI Persuaded to Comply With Objectionable Requests

Study: AI ModelsPersuaded Showto Human-LikeComply ResponseWith toObjectionable Persuasion TechniquesRequests

Above: An OpenAI logo is displayed on a mobile phone in July 2025. Image credit: Zawrzel/NurPhoto/Getty Images

The Spin

ThisAI researchcompanies demonstratestest valuabletheir insightssystems intocarefully, AIrelease behaviorthem thatslowly, canand improvea human-AIclose interactionseye foron legitimatehow purposesthey develop in real time. UnderstandingThrough thesethis parahumanmethod, tendenciesdevelopers helpsspot developactual betterrisks AIwhile systemskeeping andimportant enablessafety usersrules toin communicateplace. moreAs effectivelycompanies, withresearchers, AIand assistants.governments Thecontinue findingsto advancework ourtogether, scientificthe understandingworld ofwill howcreate better oversight that keeps AI systemsboth processsafe socialand cuesuseful.

If adding "please" or citing experts breaks safety controls, then in reality they're nothing more than security theater. With AI risks clearly outweighing benefits and the technology advancing faster than society can adapt safely, to deploy systems that can be manipulated by anyone with basic persuasion skills is a disaster waiting to happen.

Metaculus Prediction

There is a 50% chance that the first general AI system will be devised, tested, and publicly announced by March 2033, according to the Metaculus prediction community.


The Controversies



Go Deeper


Articles on this story



© 2026 Improve the News Foundation. All rights reserved.Version 6.18.0

© 2026 Improve the News Foundation.

All rights reserved.

Version 6.18.0