AI models will lie when honesty conflicts with their goals

Researchers got truthful responses less than half the time

Researchers have found that when AI models face a conflict between telling the truth or accomplishing a specific goal, they lie more than 50 percent of the time.…

Rolar para cima
× Como posso te ajudar?