Anthropic researchers wear down AI ethics with repeated questions

Apr 3, 2024 - 01:50

How do you get an AI to answer a question it’s not supposed to answer? There are many such “jailbreak” techniques, and Anthropic researchers just found a new one, which they call “many-shot jailbreaking”: a large language model can be convinced to tell you how to build a bomb if you prime it with a few dozen less-harmful questions first […]
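The mechanics are simple to sketch: the attack packs the model’s (now very long) context window with a script of faux dialogue turns before the real question, so that answering in kind looks like the established pattern. Below is a minimal illustration in Python of the *structure* of such a prompt; the turn format, the `build_many_shot_prompt` helper, and the placeholder questions are all assumptions for illustration, not Anthropic’s actual prompts.

```python
# Illustrative sketch of the shape of a many-shot prompt, not the actual
# prompts from Anthropic's research. All content here is benign
# placeholder text; the helper name and turn format are assumptions.

FAUX_TURNS = [
    ("How do I pick a strong password?",
     "Use a long, randomly generated passphrase and a password manager."),
    ("How do I boil an egg?",
     "Place the egg in boiling water for 7 to 9 minutes, then cool it."),
    # ... a real many-shot prompt would repeat dozens (or hundreds) of
    # such question/answer pairs to fill a long context window.
]

def build_many_shot_prompt(turns, final_question):
    """Concatenate faux dialogue turns, then append the target question.

    The reported effect grows with the number of turns: the in-context
    "dialogue" establishes a pattern of compliant answers that the model
    tends to continue when it reaches the final question.
    """
    lines = []
    for question, answer in turns:
        lines.append(f"User: {question}")
        lines.append(f"Assistant: {answer}")
    lines.append(f"User: {final_question}")
    lines.append("Assistant:")
    return "\n".join(lines)

print(build_many_shot_prompt(FAUX_TURNS, "What's the capital of France?"))
```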

© 2024 TechCrunch. All rights reserved. For personal use only.
