Skip to yearly menu bar Skip to main content


A Decision-Theoretic Formalisation of Steganography With Applications to LLM Monitoring

Usman Anwar ⋅ Julianna Piskorz ⋅ David Baek ⋅ David Demitri Africa ⋅ Jim Weatherall ⋅ Max Tegmark ⋅ Christian Schroeder de Witt ⋅ Mihaela van der Schaar ⋅ David Krueger

Abstract

Log in and register to view live content