firstbacksecondback
97 Results
Workshop
|
Self-evaluation and self-prompting to improve the reliability of LLMs Alexandre Piche · Aristides Milios · Dzmitry Bahdanau · Christopher Pal |
||
Workshop
|
Self-evaluation and self-prompting to improve the reliability of LLMs Alexandre Piche · Aristides Milios · Dzmitry Bahdanau · Christopher Pal |
||
Workshop
|
Re-evaluating Retrosynthesis Algorithms with Syntheseus Krzysztof Maziarz · Austin Tripp · Guoqing Liu · Megan Stanley · Shufang Xie · Piotr Gaiński · Philipp Seidl · Marwin Segler |
||
Workshop
|
On Fairness Implications and Evaluations of Low-Rank Adaptation of Large Models Ken Liu · Zhoujie Ding · Berivan Isik · Sanmi Koyejo |
||
Workshop
|
Multi-model evaluation with labeled & unlabeled data Divya Shanmugam · Shuvom Sadhuka · Manish Raghavan · John Guttag · Bonnie Berger · Emma Pierson |
||
Workshop
|
Virtual Classifier: A Reversed Approach for Robust Image Evaluation Jizhe Zhang · Yifei Wang · Yisen Wang |
||
Workshop
|
Spatially Far, Ecologically Close: Evaluating Extrapolation on Vegetation Forecasting Models Claire Robin · Melanie Weynants · Vitus Benson · Marc Rußwurm · Nuno Carvalhais · Markus Reichstein |
||
Workshop
|
AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents Chang Ma · Junlei Zhang · Zhihao Zhu · Cheng Yang · Yujiu Yang · Yaohui Jin · Zhenzhong Lan · Lingpeng Kong · Junxian He |
||
Workshop
|
LLM-Deliberation: Evaluating LLMs with Interactive Multi-Agent Negotiation Game Sahar Abdelnabi · Amr Gomaa · Sarath Sivaprasad · Lea Schönherr · Mario Fritz |
||
Workshop
|
Measuring Mechanistic Interpretability at Scale Without Humans Roland Zimmermann · David Klindt · Wieland Brendel |
||
Workshop
|
FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets Seonghyeon Ye · Doyoung Kim · Sungdong Kim · Hyeonbin Hwang · Seungone Kim · Yongrae Jo · James Thorne · Juho Kim · Minjoon Seo |
||
Workshop
|
On Fairness Implications and Evaluations of Low-Rank Adaptation of Large Models Ken Liu · Zhoujie Ding · Berivan Isik · Sanmi Koyejo |