Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

91 Results

<<   <   Page 3 of 8   >   >>
Workshop
[***Online Presentation***] DELE: Data Efficient LLM Evaluation
Gayathri Saranathan · Mahammad Parwez Alam · JAMES LIM · Suparna Bhattacharya · Soon Wong · Martin Foltin · Cong Xu
Workshop
TrustScore: Reference-Free Evaluation of LLM Response Trustworthiness
Danna Zheng · Danyang Liu · Mirella Lapata · J Pan
Workshop
Recursive Speculative Decoding: Accelerating LLM Inference via Sampling Without Replacement
Wonseok Jeon · Mukul Gagrani · Raghavv Goel · Junyoung Park · Mingu Lee · Christopher Lott
Workshop
Prioritizing Safeguarding Over Autonomy: Risks of LLM Agents for Science
Xiangru Tang · Qiao Jin · Kunlun Zhu · Tongxin Yuan · Yichi Zhang · Wangchunshu Zhou · Meng Qu · Yilun Zhao · Jian Tang · Zhuosheng Zhang · Arman Cohan · Zhiyong Lu · Mark Gerstein
Workshop
Bayesian reward models for LLM alignment
Adam Yang · Maxime Robeyns · Thomas Coste · Jun Wang · Haitham Bou Ammar · Laurence Aitchison
Workshop
Bayesian reward models for LLM alignment
Adam Yang · Maxime Robeyns · Thomas Coste · Jun Wang · Haitham Bou Ammar · Laurence Aitchison
Workshop
BOLAA: BENCHMARKING AND ORCHESTRATING LLM AUTONOMOUS AGENTS
Zhiwei Liu · Weiran Yao · Jianguo Zhang · Le Xue · Shelby Heinecke · Rithesh Murthy · Yihao Feng · Zeyuan Chen · Juan Carlos Niebles · Devansh Arpit · Ran Xu · Phil Mui · Huan Wang · Caiming Xiong · Silvio Savarese
Workshop
Efficient Causal Graph Discovery Using Large Language Models
Thomas Jiralerspong · Xiaoyin Chen · Yash More · Vedant Shah · Yoshua Bengio
Workshop
AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents
Chang Ma · Junlei Zhang · Zhihao Zhu · Cheng Yang · Yujiu Yang · Yaohui Jin · Zhenzhong Lan · Lingpeng Kong · Junxian He
Workshop
Perplexed by Perplexity: Perplexity-Based Pruning with Small Reference Models
Zachary Ankner · Cody Blakeney · Kartik Sreenivasan · Max M Marion · Matthew Leavitt · Mansheej Paul
Workshop
TrustScore: Reference-Free Evaluation of LLM Response Trustworthiness
Danna Zheng · Danyang Liu · Mirella Lapata · J Pan
Workshop
SparQ Attention: Bandwidth-Efficient LLM Inference
Luka Ribar · Ivan Chelombiev · Luke Hudlass-Galley · Charles Blake · Carlo Luschi · Douglas Orr