Skip to yearly menu bar Skip to main content


Poster Sat, Apr 25, 2026 • 11:15 AM – 1:45 PM PDT

Beyond Linear Probes: Dynamic Safety Monitoring for Language Models

James Oldfield · Philip Torr · Ioannis Patras · Adel Bibi · Fazl Barez

Abstract

Log in and register to view live content