91 Results
Workshop
|
[***Online Presentation***] DELE: Data Efficient LLM Evaluation Gayathri Saranathan · Mahammad Parwez Alam · James Lim · Suparna Bhattacharya · Soon Wong · Martin Foltin · Cong Xu |
||
Workshop
|
TrustScore: Reference-Free Evaluation of LLM Response Trustworthiness Danna Zheng · Danyang Liu · Mirella Lapata · J Pan |
||
Workshop
|
Recursive Speculative Decoding: Accelerating LLM Inference via Sampling Without Replacement Wonseok Jeon · Mukul Gagrani · Raghavv Goel · Junyoung Park · Mingu Lee · Christopher Lott |
||
Workshop
|
Prioritizing Safeguarding Over Autonomy: Risks of LLM Agents for Science Xiangru Tang · Qiao Jin · Kunlun Zhu · Tongxin Yuan · Yichi Zhang · Wangchunshu Zhou · Meng Qu · Yilun Zhao · Jian Tang · Zhuosheng Zhang · Arman Cohan · Zhiyong Lu · Mark Gerstein |
||
Workshop
|
Bayesian reward models for LLM alignment Adam Yang · Maxime Robeyns · Thomas Coste · Jun Wang · Haitham Bou Ammar · Laurence Aitchison |
||
Workshop
|
BOLAA: Benchmarking and Orchestrating LLM Autonomous Agents Zhiwei Liu · Weiran Yao · Jianguo Zhang · Le Xue · Shelby Heinecke · Rithesh Murthy · Yihao Feng · Zeyuan Chen · Juan Carlos Niebles · Devansh Arpit · Ran Xu · Phil Mui · Huan Wang · Caiming Xiong · Silvio Savarese |
||
Workshop
|
Efficient Causal Graph Discovery Using Large Language Models Thomas Jiralerspong · Xiaoyin Chen · Yash More · Vedant Shah · Yoshua Bengio |
||
Workshop
|
AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents Chang Ma · Junlei Zhang · Zhihao Zhu · Cheng Yang · Yujiu Yang · Yaohui Jin · Zhenzhong Lan · Lingpeng Kong · Junxian He |
||
Workshop
|
Perplexed by Perplexity: Perplexity-Based Pruning with Small Reference Models Zachary Ankner · Cody Blakeney · Kartik Sreenivasan · Max M Marion · Matthew Leavitt · Mansheej Paul |
||
Workshop
|
SparQ Attention: Bandwidth-Efficient LLM Inference Luka Ribar · Ivan Chelombiev · Luke Hudlass-Galley · Charles Blake · Carlo Luschi · Douglas Orr |