Poster

Curiosity-driven Red-teaming for Large Language Models

Zhang-Wei Hong · Idan Shenfeld · Johnson (Tsun-Hsuan) Wang · Yung-Sung Chuang · Aldo Pareja · James R Glass · Akash Srivastava · Pulkit Agrawal
2024 Poster

