Non-Monotonicity and Catastrophic Risk of Prompt Interventions in Adversarial LLM Control

Koki Inoue ⋅ Naoya Takashima ⋅ Hayato Fujihara ⋅ Shuya Higuchi ⋅ Kota Shimomura ⋅ Ryuta Shimogauchi ⋅ Takayoshi Yamashita

Abstract
