70 Results
Poster | Thu 7:30 | VertiBench: Advancing Feature Distribution Diversity in Vertical Federated Learning Benchmarks | Zhaomin Wu · Junyi Hou · Bingsheng He
Poster | Tue 7:30 | SmartPlay: A Benchmark for LLMs as Intelligent Agents | Yue Wu · Xuan Tang · Tom Mitchell · Yuanzhi Li
Invited Talk | Thu 23:30 | The emerging science of benchmarks | Moritz Hardt
Poster | Thu 1:45 | GNNX-BENCH: Unravelling the Utility of Perturbation-based GNN Explainers through In-depth Benchmarking | Mert Kosan · Samidha Verma · Burouj Armgaan · Khushbu Pahwa · Ambuj K Singh · Sourav Medya · Sayan Ranu
Poster | Tue 7:30 | A Benchmark Study on Calibration | Linwei Tao · Younan Zhu · Haolan Guo · Minjing Dong · Chang Xu
Poster | Fri 7:30 | BEND: Benchmarking DNA Language Models on Biologically Meaningful Tasks | Frederikke Marin · Felix Teufel · Marc Horlacher · Dennis Madsen · Dennis Pultz · Ole Winther · Wouter Boomsma
Poster | Thu 7:30 | FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods | Xiaotian Han · Jianfeng Chi · Yu Chen · Qifan Wang · Han Zhao · Na Zou · Xia Hu
Poster | Wed 1:45 | MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations | Hanlei Zhang · Xin Wang · Hua Xu · Qianrui Zhou · Kai Gao · Jianhua Su · Jinyue Zhao · Wenrui Li · Yanting Chen
Poster | Tue 1:45 | Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level Vision | Haoning Wu · Zicheng Zhang · Erli Zhang · Chaofeng Chen · Liang Liao · Annan Wang · Chunyi Li · Wenxiu Sun · Qiong Yan · Guangtao Zhai · Weisi Lin
Poster | Thu 7:30 | DNABERT-2: Efficient Foundation Model and Benchmark For Multi-Species Genomes | Zhihan Zhou · Yanrong Ji · Weijian Li · Pratik Dutta · Ramana Davuluri · Han Liu