
Psychological Tricks Can Get AI to Break the Rules



If you were trying to learn how to get other people to do what you want, you might use some of the techniques found in a book like Influence: The Psychology of Persuasion. Now, a preprint study out of the University of Pennsylvania suggests that those same psychological persuasion techniques can frequently “convince” some LLMs to do things that go against their system prompts.

The size of the persuasion effects shown in “Call Me a Jerk: Persuading AI to Comply with Objectionable Requests” suggests that…

