Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Catch, Adapt, and Operate: Monitoring ML Models Under Drift
Sun, Apr 26, 2026 • 10:45 AM – 11:30 AM PDT

TamperBench: A Systematic Framework to Stress-Test LLM Safety Under Fine-Tuning and Tampering

Saad Hossain ⋅ Tom Tseng ⋅ Punya Syon Pandey ⋅ Samanvay Vajpayee ⋅ Matthew Kowal ⋅ Nayeema Nonta ⋅ Samuel Simko ⋅ Stephen Casper ⋅ Zhijing Jin ⋅ Kellin Pelrine ⋅ Sirisha Rambhatla

Abstract

Chat is not available.