Search All 2023 Events
 

43 Results

Poster
Variational Latent Branching Model for Off-Policy Evaluation
Qitong Gao · Ge Gao · Min Chi · Miroslav Pajic
Poster
Tue 7:30 Beyond calibration: estimating the grouping loss of modern neural networks
Alexandre Perez-Lebel · Marine Le Morvan · Gael Varoquaux
Poster
Wed 7:30 What Is Missing in IRM Training and Evaluation? Challenges and Solutions
Yihua Zhang · Pranay Sharma · Parikshit Ram · Mingyi Hong · Kush Varshney · Sijia Liu
Poster
TextGrad: Advancing Robustness Evaluation in NLP by Gradient-Driven Optimization
Bairu Hou · Jinghan Jia · Yihua Zhang · Guanhua Zhang · Yang Zhang · Sijia Liu · Shiyu Chang
Poster
Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness
Shuaichen Chang · Jun Wang · Mingwen Dong · Lin Pan · Henghui Zhu · Alexander Hanbo Li · Wuwei Lan · Sheng Zhang · Jiarong Jiang · Joseph Lilien · Steve Ash · William Wang · Zhiguo Wang · Vittorio Castelli · Patrick Ng · Bing Xiang
Poster
Sample Complexity of Nonparametric Off-Policy Evaluation on Low-Dimensional Manifolds using Deep Networks
Xiang Ji · Minshuo Chen · Mengdi Wang · Tuo Zhao
Poster
Mon 7:30 A Call to Reflect on Evaluation Practices for Failure Detection in Image Classification
Paul F. Jaeger · Carsten Lüth · Lukas Klein · Till Bungert
Oral
Mon 6:00 A Call to Reflect on Evaluation Practices for Failure Detection in Image Classification
Paul F. Jaeger · Carsten Lüth · Lukas Klein · Till Bungert
Poster
Tue 2:30 A critical look at the evaluation of GNNs under heterophily: Are we really making progress?
Oleg Platonov · Denis Kuznedelev · Michael Diskin · Artem Babenko · Liudmila Prokhorenkova