Large Language Models can Strategically Deceive their Users when Put Under Pressure

Jérémy Scheurer ⋅ Mikita Balesni ⋅ Marius Hobbhahn

Abstract
